Docs: Agent streaming notebooks (#15858)

Update information about streaming in the agents section. Show how to
use astream_events to get token by token streaming.
This commit is contained in:
Eugene Yurtsev 2024-01-22 21:54:55 -05:00 committed by GitHub
parent e5672bc944
commit c88750d54b
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
2 changed files with 880 additions and 1186 deletions

File diff suppressed because one or more lines are too long

View File

@ -1,350 +0,0 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "b69e747b-4e79-4caf-8f8b-c6e70275a31d",
"metadata": {},
"source": [
"# Event Streaming\n",
"\n",
"**NEW** This is a new API only works with recent versions of langchain-core!\n",
"\n",
"In this notebook, we'll see how to use `astream_events` to stream **token by token** from LLM calls used within the tools invoked by the agent. \n",
"\n",
"We will **only** stream tokens from LLMs used within tools and from no other LLMs (just to show that we can)! \n",
"\n",
"Feel free to adapt this example to the needs of your application.\n",
"\n",
"Our agent will use the OpenAI tools API for tool invocation, and we'll provide the agent with two tools:\n",
"\n",
"1. `where_cat_is_hiding`: A tool that uses an LLM to tell us where the cat is hiding\n",
"2. `tell_me_a_joke_about`: A tool that can use an LLM to tell a joke about the given topic\n",
"\n",
"\n",
"## ⚠️ Beta API ⚠️ ##\n",
"\n",
"Event Streaming is a **beta** API, and may change a bit based on feedback.\n",
"\n",
"Keep in mind the following constraints (repeated in tools section):\n",
"\n",
"* streaming only works properly if using `async`\n",
"* propagate callbacks if definning custom functions / runnables\n",
"* If creating a tool that uses an LLM, make sure to use `.astream()` on the LLM rather than `.ainvoke` to ask the LLM to stream tokens.\n",
"\n",
"## Event Hooks Reference\n",
"\n",
"\n",
"Here is a reference table that shows some events that might be emitted by the various Runnable objects.\n",
"Definitions for some of the Runnable are included after the table.\n",
"\n",
"⚠️ When streaming the inputs for the runnable will not be available until the input stream has been entirely consumed This means that the inputs will be available at for the corresponding `end` hook rather than `start` event.\n",
"\n",
"\n",
"| event | name | chunk | input | output |\n",
"|----------------------|------------------|---------------------------------|-----------------------------------------------|-------------------------------------------------|\n",
"| on_chat_model_start | [model name] | | {\"messages\": [[SystemMessage, HumanMessage]]} | |\n",
"| on_chat_model_stream | [model name] | AIMessageChunk(content=\"hello\") | | |\n",
"| on_chat_model_end | [model name] | | {\"messages\": [[SystemMessage, HumanMessage]]} | {\"generations\": [...], \"llm_output\": None, ...} |\n",
"| on_llm_start | [model name] | | {'input': 'hello'} | |\n",
"| on_llm_stream | [model name] | 'Hello' | | |\n",
"| on_llm_end | [model name] | | 'Hello human!' |\n",
"| on_chain_start | format_docs | | | |\n",
"| on_chain_stream | format_docs | \"hello world!, goodbye world!\" | | |\n",
"| on_chain_end | format_docs | | [Document(...)] | \"hello world!, goodbye world!\" |\n",
"| on_tool_start | some_tool | | {\"x\": 1, \"y\": \"2\"} | |\n",
"| on_tool_stream | some_tool | {\"x\": 1, \"y\": \"2\"} | | |\n",
"| on_tool_end | some_tool | | | {\"x\": 1, \"y\": \"2\"} |\n",
"| on_retriever_start | [retriever name] | | {\"query\": \"hello\"} | |\n",
"| on_retriever_chunk | [retriever name] | {documents: [...]} | | |\n",
"| on_retriever_end | [retriever name] | | {\"query\": \"hello\"} | {documents: [...]} |\n",
"| on_prompt_start | [template_name] | | {\"question\": \"hello\"} | |\n",
"| on_prompt_end | [template_name] | | {\"question\": \"hello\"} | ChatPromptValue(messages: [SystemMessage, ...]) |\n",
"\n",
"\n",
"Here are declarations associated with the events shown above:\n",
"\n",
"`format_docs`:\n",
"\n",
"```python\n",
"def format_docs(docs: List[Document]) -> str:\n",
" '''Format the docs.'''\n",
" return \", \".join([doc.page_content for doc in docs])\n",
"\n",
"format_docs = RunnableLambda(format_docs)\n",
"```\n",
"\n",
"`some_tool`:\n",
"\n",
"```python\n",
"@tool\n",
"def some_tool(x: int, y: str) -> dict:\n",
" '''Some_tool.'''\n",
" return {\"x\": x, \"y\": y}\n",
"```\n",
"\n",
"`prompt`:\n",
"\n",
"```python\n",
"template = ChatPromptTemplate.from_messages(\n",
" [(\"system\", \"You are Cat Agent 007\"), (\"human\", \"{question}\")]\n",
").with_config({\"run_name\": \"my_template\", \"tags\": [\"my_template\"]})\n",
"```\n",
"\n"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "29205bef-2288-48e9-9067-f19072277a97",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain import hub\n",
"from langchain.agents import AgentExecutor, create_openai_tools_agent\n",
"from langchain.tools import tool\n",
"from langchain_core.callbacks import Callbacks\n",
"from langchain_core.prompts import ChatPromptTemplate\n",
"from langchain_openai import ChatOpenAI"
]
},
{
"cell_type": "markdown",
"id": "d6b0fafa-ce3b-489b-bf1d-d37b87f4819e",
"metadata": {},
"source": [
"## Create the model\n",
"\n",
"**Attention** For older versions of langchain, we must set `streaming=True`"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "fa3c3761-a1cd-4118-8559-ea4d8857d394",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"model = ChatOpenAI(temperature=0, streaming=True)"
]
},
{
"cell_type": "markdown",
"id": "b76e1a3b-2983-42d9-ac12-4a0f32cd4a24",
"metadata": {},
"source": [
"## Tools\n",
"\n",
"We define two tools that rely on a chat model to generate output!\n",
"\n",
"Please note a few different things:\n",
"\n",
"1. The tools are **async**\n",
"1. The model is invoked using **.astream()** to force the output to stream\n",
"1. For older langchain versions you should set `streaming=True` on the model!\n",
"1. We attach tags to the model so that we can filter on said tags in our callback handler\n",
"1. The tools accept callbacks and propagate them to the model as a runtime argument"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "c767f760-fe52-47e5-9c2a-622f03507aaf",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"@tool\n",
"async def where_cat_is_hiding(callbacks: Callbacks) -> str: # <--- Accept callbacks\n",
" \"\"\"Where is the cat hiding right now?\"\"\"\n",
" chunks = [\n",
" chunk\n",
" async for chunk in model.astream(\n",
" \"Give one up to three word answer about where the cat might be hiding in the house right now.\",\n",
" {\n",
" \"tags\": [\"tool_llm\"],\n",
" \"callbacks\": callbacks,\n",
" }, # <--- Propagate callbacks and assign a tag to this model\n",
" )\n",
" ]\n",
" return \"\".join(chunk.content for chunk in chunks)\n",
"\n",
"\n",
"@tool\n",
"async def tell_me_a_joke_about(\n",
" topic: str, callbacks: Callbacks\n",
") -> str: # <--- Accept callbacks\n",
" \"\"\"Tell a joke about a given topic.\"\"\"\n",
" template = ChatPromptTemplate.from_messages(\n",
" [\n",
" (\"system\", \"You are Cat Agent 007. You are funny and know many jokes.\"),\n",
" (\"human\", \"Tell me a long joke about {topic}\"),\n",
" ]\n",
" )\n",
" chain = template | model.with_config({\"tags\": [\"tool_llm\"]})\n",
" chunks = [\n",
" chunk\n",
" async for chunk in chain.astream({\"topic\": topic}, {\"callbacks\": callbacks})\n",
" ]\n",
" return \"\".join(chunk.content for chunk in chunks)"
]
},
{
"cell_type": "markdown",
"id": "cba476f8-29da-4c2c-9134-186871caf7ae",
"metadata": {},
"source": [
"## Initialize the Agent"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "0bab4488-bf4c-461f-b41e-5e60310fe0f2",
"metadata": {
"tags": []
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"input_variables=['agent_scratchpad', 'input'] input_types={'chat_history': typing.List[typing.Union[langchain_core.messages.ai.AIMessage, langchain_core.messages.human.HumanMessage, langchain_core.messages.chat.ChatMessage, langchain_core.messages.system.SystemMessage, langchain_core.messages.function.FunctionMessage, langchain_core.messages.tool.ToolMessage]], 'agent_scratchpad': typing.List[typing.Union[langchain_core.messages.ai.AIMessage, langchain_core.messages.human.HumanMessage, langchain_core.messages.chat.ChatMessage, langchain_core.messages.system.SystemMessage, langchain_core.messages.function.FunctionMessage, langchain_core.messages.tool.ToolMessage]]} messages=[SystemMessagePromptTemplate(prompt=PromptTemplate(input_variables=[], template='You are a helpful assistant')), MessagesPlaceholder(variable_name='chat_history', optional=True), HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['input'], template='{input}')), MessagesPlaceholder(variable_name='agent_scratchpad')]\n",
"[SystemMessagePromptTemplate(prompt=PromptTemplate(input_variables=[], template='You are a helpful assistant')), MessagesPlaceholder(variable_name='chat_history', optional=True), HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['input'], template='{input}')), MessagesPlaceholder(variable_name='agent_scratchpad')]\n"
]
}
],
"source": [
"# Get the prompt to use - you can modify this!\n",
"prompt = hub.pull(\"hwchase17/openai-tools-agent\")\n",
"print(prompt)\n",
"print(prompt.messages)"
]
},
{
"cell_type": "code",
"execution_count": 10,
"id": "1762f4e1-402a-4bfb-af26-eb5b7b8f56bd",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"tools = [tell_me_a_joke_about, where_cat_is_hiding]\n",
"agent = create_openai_tools_agent(model.with_config({\"tags\": [\"agent\"]}), tools, prompt)\n",
"executor = AgentExecutor(agent=agent, tools=tools)"
]
},
{
"cell_type": "markdown",
"id": "841271d7-1de1-41a9-9387-bb04368537f1",
"metadata": {},
"source": [
"## Stream the output\n",
"\n",
"The streamed output is shown with a `|` as the delimiter between tokens. "
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "a5d94bd8-4a55-4527-b21a-4245a38c7c26",
"metadata": {
"tags": []
},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/home/eugene/src/langchain/libs/core/langchain_core/_api/beta_decorator.py:86: LangChainBetaWarning: This API is in beta and may change in the future.\n",
" warn_beta(\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"--\n",
"Starting tool: where_cat_is_hiding with inputs: {}\n",
"\n",
"\n",
"|Under| the| bed|.||\n",
"\n",
"Ended tool: where_cat_is_hiding\n",
"--\n",
"Starting tool: tell_me_a_joke_about with inputs: {'topic': 'under the bed'}\n",
"\n",
"\n",
"|Sure|,| here|'s| a| long| joke| about| what|'s| hiding| under| the| bed|:\n",
"\n",
"|Once| upon| a| time|,| there| was| a| mis|chie|vous| little| boy| named| Tim|my|.| Tim|my| had| always| been| afraid| of| what| might| be| lurking| under| his| bed| at| night|.| Every| evening|,| he| would| ti|pt|oe| into| his| room|,| turn| off| the| lights|,| and| then| make| a| daring| leap| onto| his| bed|,| ensuring| that| nothing| could| grab| his| ankles|.\n",
"\n",
"|One| night|,| Tim|my|'s| parents| decided| to| play| a| prank| on| him|.| They| hid| a| remote|-controlled| toy| monster| under| his| bed|,| complete| with| glowing| eyes| and| a| grow|ling| sound| effect|.| As| Tim|my| settled| into| bed|,| his| parents| quietly| sn|uck| into| his| room|,| ready| to| give| him| the| scare| of| a| lifetime|.\n",
"\n",
"|Just| as| Tim|my| was| about| to| drift| off| to| sleep|,| he| heard| a| faint| grow|l| coming| from| under| his| bed|.| His| eyes| widened| with| fear|,| and| his| heart| started| racing|.| He| must|ered| up| the| courage| to| peek| under| the| bed|,| and| to| his| surprise|,| he| saw| a| pair| of| glowing| eyes| staring| back| at| him|.\n",
"\n",
"|Terr|ified|,| Tim|my| jumped| out| of| bed| and| ran| to| his| parents|,| screaming|,| \"|There|'s| a| monster| under| my| bed|!| Help|!\"\n",
"\n",
"|His| parents|,| trying| to| st|ifle| their| laughter|,| rushed| into| his| room|.| They| pretended| to| be| just| as| scared| as| Tim|my|,| and| together|,| they| brav|ely| approached| the| bed|.| Tim|my|'s| dad| grabbed| a| bro|om|stick|,| ready| to| defend| his| family| against| the| imaginary| monster|.\n",
"\n",
"|As| they| got| closer|,| the| \"|monster|\"| under| the| bed| started| to| move|.| Tim|my|'s| mom|,| unable| to| contain| her| laughter| any| longer|,| pressed| a| button| on| the| remote| control|,| causing| the| toy| monster| to| sc|urry| out| from| under| the| bed|.| Tim|my|'s| fear| quickly| turned| into| confusion|,| and| then| into| laughter| as| he| realized| it| was| all| just| a| prank|.\n",
"\n",
"|From| that| day| forward|,| Tim|my| learned| that| sometimes| the| things| we| fear| the| most| are| just| fig|ments| of| our| imagination|.| And| as| for| what|'s| hiding| under| his| bed|?| Well|,| it|'s| just| dust| b|unn|ies| and| the| occasional| missing| sock|.| Nothing| to| be| afraid| of|!\n",
"\n",
"|Remember|,| laughter| is| the| best| monster| repell|ent|!||\n",
"\n",
"Ended tool: tell_me_a_joke_about\n"
]
}
],
"source": [
"async for event in executor.astream_events(\n",
" {\"input\": \"where is the cat hiding? Tell me a joke about that location?\"},\n",
" include_tags=[\"tool_llm\"],\n",
" include_types=[\"tool\"],\n",
"):\n",
" hook = event[\"event\"]\n",
" if hook == \"on_chat_model_stream\":\n",
" print(event[\"data\"][\"chunk\"].content, end=\"|\")\n",
" elif hook in {\"on_chat_model_start\", \"on_chat_model_end\"}:\n",
" print()\n",
" print()\n",
" elif hook == \"on_tool_start\":\n",
" print(\"--\")\n",
" print(\n",
" f\"Starting tool: {event['name']} with inputs: {event['data'].get('input')}\"\n",
" )\n",
" elif hook == \"on_tool_end\":\n",
" print(f\"Ended tool: {event['name']}\")\n",
" else:\n",
" pass"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
}
},
"nbformat": 4,
"nbformat_minor": 5
}