langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	2f209d84fa	core[patch]: Add pydantic get_fields adapter (#25187 ) Add adapter to get fields	2024-08-08 17:47:42 +00:00
Eugene Yurtsev	7b1a132aff	core[patch]: Add unit tests for Serializable (#25152 ) Add a few test cases for serializable (many other test cases already covered throguh runnable tests).	2024-08-07 21:01:36 +00:00
ccurme	803eba3163	core[patch]: check for model_fields attribute (#25108 ) `__fields__` raises a warning in pydantic v2	2024-08-07 13:32:56 -07:00
Erick Friis	dff83cce66	core[patch]: base language model disable_streaming (#25070 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-07 09:26:21 -07:00
Eugene Yurtsev	28e0958ff4	core[patch]: Relax rate limit unit tests in terms of timing (#25140 ) Relax rate limit unit tests	2024-08-07 14:04:58 +00:00
Eugene Yurtsev	d283f452cc	core[minor]: Add support for DocumentIndex in the index api (#25100 ) Support document index in the index api.	2024-08-06 12:30:49 -07:00
William FH	267855b3c1	Set Context in RunnableSequence & RunnableParallel (#25073 )	2024-08-06 11:10:37 -07:00
Gram Liu	88a9a6a758	core[patch]: Add pydantic metadata to subset model (#25032 ) - Description: This includes Pydantic field metadata in `_create_subset_model_v2` so that it gets included in the final serialized form that get sent out. - Issue: #25031 - Dependencies: n/a - Twitter handle: @gramliu --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-05 17:57:39 -07:00
Bagatur	e572521f2a	core[patch]: exclude special pydantic init params (#25084 )	2024-08-05 23:32:51 +00:00
Eugene Yurtsev	4bcd2aad6c	core[patch]: Relax time constraints on rate limit test (#25071 ) Try to keep the unit test fast, but also have it repeat more robustly	2024-08-05 17:04:22 -04:00
Eugene Yurtsev	41dfad5104	core[minor]: Introduce DocumentIndex abstraction (#25062 ) This PR adds a minimal document indexer abstraction. The goal of this abstraction is to allow developers to create custom retrievers that also have a standard indexing API and allow updating the document content in them. The abstraction comes with a test suite that can verify that the indexer implements the correct semantics. This is an iteration over a previous PRs (https://github.com/langchain-ai/langchain/pull/24364). The main difference is that we're sub-classing from BaseRetriever in this iteration and as so have consolidated the sync and async interfaces. The main problem with the current design is that runt time search configuration has to be specified at init rather than provided at run time. We will likely resolve this issue in one of the two ways: (1) Define a method (`get_retriever`) that will allow creating a retriever at run time with a specific configuration.. If we do this, we will likely break the subclass on BaseRetriever (2) Generalize base retriever so it can support structured queries --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 18:06:33 +00:00
Bagatur	1dcee68cb8	docs: show beta directive (#25013 ) ![Screenshot 2024-08-02 at 7 15 34 PM](https://github.com/user-attachments/assets/086831c7-36f3-4962-98dc-d707b6289747)	2024-08-03 03:07:45 +00:00
Bagatur	57747892ce	docs: show deprecation warning first in api ref (#25001 ) OLD ![Screenshot 2024-08-02 at 3 29 39 PM](https://github.com/user-attachments/assets/7f169121-1202-4770-a006-d72ac7a1aa33) NEW ![Screenshot 2024-08-02 at 3 29 45 PM](https://github.com/user-attachments/assets/9cc07cbd-2ae9-4077-95c5-03cb051e6cd7)	2024-08-02 17:35:25 -07:00
Bagatur	199e9c5ae0	core[patch]: Fix tool args schema inherited field parsing (#24936 ) Fix #24925	2024-08-01 18:36:33 -07:00
WU LIFU	ad16eed119	core[patch]: runnable config ensure_config deep copy from var_child_runnable… (#24862 ) issue: #24660 RunnableWithMessageHistory.stream result in error because the [evaluation](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/runnables/branch.py#L220) of the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) unexpectedly trigger the "[on_end](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`)" (exit_history) callback of the default branch descriptions After a lot of investigation I'm convinced that the root cause is that 1. during the execution of the runnable, the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L122)`) is shared between the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) runnable and the [default branch runnable](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`) within the same context 2. when the default branch runnable runs, it gets the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L163)`) and may unintentionally [add more handlers ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L325)`)to the callback manager of this config 3. when it is again the turn for the [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) to run, it gets the `var_child_runnable_config` whose callback manager has the handlers added by the default branch. When it runs that handler (`exit_history`) it leads to the error with the assumption that, the `ensure_config` function actually does want to create a immutable copy from `var_child_runnable_config` because it starts with an [`empty` variable ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L156)`), i go ahead to do a deepcopy to ensure that future modification to the returned value won't affect the `var_child_runnable_config` variable Having said that I actually 1. don't know if this is a proper fix 2. don't know whether it will lead to other unintended consequence 3. don't know why only "stream" runs into this issue while "invoke" runs without problem so @nfcampos @hwchase17 please help review, thanks! --------- Co-authored-by: Lifu Wu <lifu@nextbillion.ai> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 17:30:32 -07:00
Eugene Yurtsev	75776e4a54	core[patch]: In unit tests, use `_schema()` instead of BaseModel.schema() (#24930 ) This PR introduces a module with some helper utilities for the pydantic 1 -> 2 migration. They're meant to be used in the following way: 1) Use the utility code to get unit tests pass without requiring modification to the unit tests 2) (If desired) upgrade the unit tests to match pydantic 2 output 3) (If desired) stop using the utility code Currently, this module contains a way to map `schema()` generated by pydantic 2 to (mostly) match the output from pydantic v1.	2024-08-01 11:59:04 -04:00
Bagatur	25b93cc4c0	core[patch]: stringify tool non-content blocks (#24626 ) Slightly breaking bugfix. Shouldn't cause too many issues since no models would be able to handle non-content block ToolMessage.content anyways.	2024-07-31 16:42:38 -07:00
Eugene Yurtsev	210623b409	core[minor]: Add support for pydantic 2 to utility to get fields (#24899 ) Add compatibility for pydantic 2 for a utility function. This will help push some small changes to master, so they don't have to be kept track of on a separate branch.	2024-07-31 19:11:07 +00:00
Eugene Yurtsev	5099a9c9b4	core[patch]: Update unit tests with a workaround for using AnyID in pydantic 2 (#24892 ) Pydantic 2 ignores __eq__ overload for subclasses of strings.	2024-07-31 14:42:12 -04:00
Bagatur	8461934c2b	core[patch], integrations[patch]: convert TypedDict to tool schema support (#24641 ) supports following UX ```python class SubTool(TypedDict): """Subtool docstring""" args: Annotated[Dict[str, Any], {}, "this does bar"] class Tool(TypedDict): """Docstring Args: arg1: foo """ arg1: str arg2: Union[int, str] arg3: Optional[List[SubTool]] arg4: Annotated[Literal["bar", "baz"], ..., "this does foo"] arg5: Annotated[Optional[float], None] ``` - can parse google style docstring - can use Annotated to specify default value (second arg) - can use Annotated to specify arg description (third arg) - can have nested complex types	2024-07-31 18:27:24 +00:00
Nuno Campos	68ecebf1ec	core: Fix implementation of trim_first_node/trim_last_node to use exact same definition of first/last node as in the getter methods (#24802 )	2024-07-30 08:44:27 -07:00
Bagatur	a6d1fb4275	core[patch]: introduce ToolMessage.status (#24628 ) Anthropic models (including via Bedrock and other cloud platforms) accept a status/is_error attribute on tool messages/results (specifically in `tool_result` content blocks for Anthropic API). Adding a ToolMessage.status attribute so that users can set this attribute when using those models	2024-07-29 14:01:53 -07:00
ccurme	9998e55936	core[patch]: support tool calls with non-pickleable args in tools (#24741 ) Deepcopy raises with non-pickleable args.	2024-07-29 13:18:39 -04:00
William FH	01ab2918a2	core[patch]: Respect injected in bound fns (#24733 ) Since right now you cant use the nice injected arg syntas directly with model.bind_tools()	2024-07-28 15:45:19 -07:00
Bagatur	ad7581751f	core[patch]: ChatPromptTemplate.init same as ChatPromptTemplate.from_… (#24486 )	2024-07-26 10:48:39 -07:00
Eugene Yurtsev	20690db482	core[minor]: Add BaseModel.rate_limiter, RateLimiter abstraction and in-memory implementation (#24669 ) This PR proposes to create a rate limiter in the chat model directly, and would replace: https://github.com/langchain-ai/langchain/pull/21992 It resolves most of the constraints that the Runnable rate limiter introduced: 1. It's not annoying to apply the rate limiter to existing code; i.e., possible to roll out the change at the location where the model is instantiated, rather than at every location where the model is used! (Which is necessary if the model is used in different ways in a given application.) 2. batch rate limiting is enforced properly 3. the rate limiter works correctly with streaming 4. the rate limiter is aware of the cache 5. The rate limiter can take into account information about the inputs into the model (we can add optional inputs to it down-the road together with outputs!) The only downside is that information will not be properly reflected in tracing as we don't have any metadata evens about a rate limiter. So the total time spent on a model invocation will be: * time spent waiting for the rate limiter * time spend on the actual model request ## Example ```python from langchain_core.rate_limiters import InMemoryRateLimiter from langchain_groq import ChatGroq groq = ChatGroq(rate_limiter=InMemoryRateLimiter(check_every_n_seconds=1)) groq.invoke('hello') ```	2024-07-26 03:03:34 +00:00
ccurme	58dd69f7f2	core[patch]: fix mutating tool calls (#24677 ) In some cases tool calls are mutated when passed through a tool.	2024-07-25 16:46:36 +00:00
남광우	256bad3251	core[minor]: Support asynchronous in InMemoryVectorStore (#24472 ) ### Description * support asynchronous in InMemoryVectorStore * since embeddings might be possible to call asynchronously, ensure that both asynchronous and synchronous functions operate correctly.	2024-07-25 11:36:55 -04:00
Eugene Yurtsev	7dd6b32991	core[minor]: Add InMemoryRateLimiter (#21992 ) This PR introduces the following Runnables: 1. BaseRateLimiter: an abstraction for specifying a time based rate limiter as a Runnable 2. InMemoryRateLimiter: Provides an in-memory implementation of a rate limiter ## Example ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda from datetime import datetime foo = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): print(datetime.now().strftime("%H:%M:%S.%f")) return x chain = foo \| meow for _ in range(10): print(chain.invoke('hello')) ``` Produces: ``` 17:12:07.530151 hello 17:12:09.537932 hello 17:12:11.548375 hello 17:12:13.558383 hello 17:12:15.568348 hello 17:12:17.578171 hello 17:12:19.587508 hello 17:12:21.597877 hello 17:12:23.607707 hello 17:12:25.617978 hello ``` ![image](https://github.com/user-attachments/assets/283af59f-e1e1-408b-8e75-d3910c3c44cc) ## Interface The rate limiter uses the following interface for acquiring a token: ```python class BaseRateLimiter(Runnable[Input, Output], abc.ABC): @abc.abstractmethod def acquire(self, *, blocking: bool = True) -> bool: """Attempt to acquire the necessary tokens for the rate limiter.``` ``` The flag `blocking` has been added to the abstraction to allow supporting streaming (which is easier if blocking=False). ## Limitations - The rate limiter is not designed to work across different processes. It is an in-memory rate limiter, but it is thread safe. - The rate limiter only supports time-based rate limiting. It does not take into account the size of the request or any other factors. - The current implementation does not handle streaming inputs well and will consume all inputs even if the rate limit has been reached. Better support for streaming inputs will be added in the future. - When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. One way to mitigate this is to use batch_as_completed() or abatch_as_completed(). ## Bursty behavior in `batch` and `abatch` When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. This becomes a problem if users are using `batch` and `abatch` with many inputs (e.g., 100). In this case, there will be a burst of 100 inputs into the batch of the rate limited runnable. 1. Using a RunnableBinding The API would look like: ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda rate_limiter = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): return x rate_limited_meow = RunnableLambda(meow).with_rate_limiter(rate_limiter) ``` 2. Another option is to add some init option to RunnableSequence that changes `.batch()` to be depth first (e.g., by delegating to `batch_as_completed`) ```python RunnableSequence(first=rate_limiter, last=model, how='batch-depth-first') ``` Pros: Does not require Runnable Binding Cons: Feels over-complicated	2024-07-25 01:34:03 +00:00
Bagatur	70c71efcab	core[patch]: merge_content fix (#24526 )	2024-07-22 22:20:22 -07:00
Bagatur	8a140ee77c	core[patch]: don't serialize BasePromptTemplate.input_types (#24516 ) Candidate fix for #24513	2024-07-22 13:30:16 -07:00
Bagatur	236e957abb	core,groq,openai,mistralai,robocorp,fireworks,anthropic[patch]: Update BaseModel subclass and instance checks to handle both v1 and proper namespaces (#24417 ) After this PR chat models will correctly handle pydantic 2 with bind_tools and with_structured_output. ```python import pydantic print(pydantic.__version__) ``` 2.8.2 ```python from langchain_openai import ChatOpenAI from pydantic import BaseModel, Field class Add(BaseModel): x: int y: int model = ChatOpenAI().bind_tools([Add]) print(model.invoke('2 + 5').tool_calls) model = ChatOpenAI().with_structured_output(Add) print(type(model.invoke('2 + 5'))) ``` ``` [{'name': 'Add', 'args': {'x': 2, 'y': 5}, 'id': 'call_PNUFa4pdfNOYXxIMHc6ps2Do', 'type': 'tool_call'}] <class '__main__.Add'> ``` ```python from langchain_openai import ChatOpenAI from pydantic.v1 import BaseModel, Field class Add(BaseModel): x: int y: int model = ChatOpenAI().bind_tools([Add]) print(model.invoke('2 + 5').tool_calls) model = ChatOpenAI().with_structured_output(Add) print(type(model.invoke('2 + 5'))) ``` ```python [{'name': 'Add', 'args': {'x': 2, 'y': 5}, 'id': 'call_hhiHYP441cp14TtrHKx3Upg0', 'type': 'tool_call'}] <class '__main__.Add'> ``` Addresses issues: https://github.com/langchain-ai/langchain/issues/22782 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-22 20:07:39 +00:00
ccurme	0f7569ddbc	core[patch]: enable RunnableWithMessageHistory without config (#23775 ) Feedback that `RunnableWithMessageHistory` is unwieldy compared to ConversationChain and similar legacy abstractions is common. Legacy chains using memory typically had no explicit notion of threads or separate sessions. To use `RunnableWithMessageHistory`, users are forced to introduce this concept into their code. This possibly felt like unnecessary boilerplate. Here we enable `RunnableWithMessageHistory` to run without a config if the `get_session_history` callable has no arguments. This enables minimal implementations like the following: ```python from langchain_core.chat_history import InMemoryChatMessageHistory from langchain_core.runnables.history import RunnableWithMessageHistory from langchain_openai import ChatOpenAI llm = ChatOpenAI(model="gpt-3.5-turbo-0125") memory = InMemoryChatMessageHistory() chain = RunnableWithMessageHistory(llm, lambda: memory) chain.invoke("Hi I'm Bob") # Hello Bob! chain.invoke("What is my name?") # Your name is Bob. ```	2024-07-22 10:36:53 -04:00
Nuno Campos	947628311b	core[patch]: Accept configurable keys top-level (#23806 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-20 03:49:00 +00:00
Eugene Yurtsev	5e48f35fba	core[minor]: Relax constraints on type checking for tools and parsers (#24459 ) This will allow tools and parsers to accept pydantic models from any of the following namespaces: * pydantic.BaseModel with pydantic 1 * pydantic.BaseModel with pydantic 2 * pydantic.v1.BaseModel with pydantic 2	2024-07-19 21:47:34 -04:00
Eun Hye Kim	9aae8ef416	core[patch]: Fix utils.json_schema.dereference_refs (#24335 KeyError: 400 in JSON schema processing) (#24337 ) Description: This PR fixes a KeyError: 400 that occurs in the JSON schema processing within the reduce_openapi_spec function. The _retrieve_ref function in json_schema.py was modified to handle missing components gracefully by continuing to the next component if the current one is not found. This ensures that the OpenAPI specification is fully interpreted and the agent executes without errors. Issue: Fixes issue #24335 Dependencies: No additional dependencies are required for this change. Twitter handle: @lunara_x	2024-07-19 13:31:00 -04:00
Bagatur	cd19ba9a07	core[patch]: core lint fix (#24447 )	2024-07-19 09:01:22 -07:00
Nuno Campos	62b6965d2a	core: In ensure_config don't copy dunder configurable keys to metadata (#24420 )	2024-07-18 22:28:52 +00:00
Eugene Yurtsev	ef22ebe431	standard-tests[patch]: Add pytest assert rewrites (#24408 ) This will surface nice error messages in subclasses that fail assertions.	2024-07-18 21:41:11 +00:00
Eugene Yurtsev	f62b323108	core[minor]: Support all versions of pydantic base model in argsschema (#24418 ) This adds support to any pydantic base model for tools. The only potential issue is that `get_input_schema()` will not always return a v1 base model.	2024-07-18 17:14:23 -04:00
William FH	c5a07e2dd8	core[patch]: add InjectedToolArg annotation (#24279 ) ```python from typing_extensions import Annotated from langchain_core.tools import tool, InjectedToolArg from langchain_anthropic import ChatAnthropic @tool def multiply(x: int, y: int, not_for_model: Annotated[dict, InjectedToolArg]) -> str: """multiply.""" return x * y ChatAnthropic(model='claude-3-sonnet-20240229',).bind_tools([multiply]).invoke('5 times 3').tool_calls ''' -> [{'name': 'multiply', 'args': {'x': 5, 'y': 3}, 'id': 'toolu_01Y1QazYWhu4R8vF4hF4z9no', 'type': 'tool_call'}] ''' ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-17 15:28:40 -07:00
Eugene Yurtsev	96bac8e20d	core[patch]: Fix regression requiring input_variables in few chat prompt templates (#24360 ) * Fix regression that requires users passing input_variables=[]. * Regression introduced by my own changes to this PR: https://github.com/langchain-ai/langchain/pull/22851	2024-07-17 18:14:57 -04:00
Eugene Yurtsev	9e4a0e76f6	core[patch]: Fix one unit test for chat prompt template (#24362 ) Minor change that fixes a unit test that had missing assertions.	2024-07-17 18:56:48 +00:00
Shenhai Ran	5f2dea2b20	core[patch]: Add encoding options when create prompt template from a file (#24054 ) - Uses default utf-8 encoding for loading prompt templates from file --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-16 09:35:09 -04:00
JP-Ellis	f77659463a	core[patch]: allow message utils to work with lcel (#23743 ) The functions `convert_to_messages` has had an expansion of the arguments it can take: 1. Previously, it only could take a `Sequence` in order to iterate over it. This has been broadened slightly to an `Iterable` (which should have no other impact). 2. Support for `PromptValue` and `BaseChatPromptTemplate` has been added. These are generated when combining messages using the overloaded `+` operator. Functions which rely on `convert_to_messages` (namely `filter_messages`, `merge_message_runs` and `trim_messages`) have had the type of their arguments similarly expanded. Resolves #23706. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --> --------- Signed-off-by: JP-Ellis <josh@jpellis.me> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-15 08:58:05 -07:00
Harold Martin	ccdaf14eff	docs: Spell check fixes (#24217 ) Description: Spell check fixes for docs, comments, and a couple of strings. No code change e.g. variable names. Issue: none Dependencies: none Twitter handle: hmartin	2024-07-15 15:51:43 +00:00
ccurme	888fbc07b5	core[patch]: support passing `args_schema` through `as_tool` (#24269 ) Note: this allows the schema to be passed in positionally. ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_core.runnables import RunnableLambda class Add(BaseModel): """Add two integers together.""" a: int = Field(..., description="First integer") b: int = Field(..., description="Second integer") def add(input: dict) -> int: return input["a"] + input["b"] runnable = RunnableLambda(add) as_tool = runnable.as_tool(Add) as_tool.args_schema.schema() ``` ``` {'title': 'Add', 'description': 'Add two integers together.', 'type': 'object', 'properties': {'a': {'title': 'A', 'description': 'First integer', 'type': 'integer'}, 'b': {'title': 'B', 'description': 'Second integer', 'type': 'integer'}}, 'required': ['a', 'b']} ```	2024-07-15 07:51:05 -07:00
Bagatur	d0728b0ba0	core[patch]: add tool name to tool message (#24243 ) Copying current ToolNode behavior	2024-07-15 00:42:40 +00:00
Bagatur	65321bf975	core[patch]: fix ToolCall "type" when streaming (#24218 )	2024-07-13 08:59:03 -07:00
Bagatur	6166ea67a8	core[minor]: rename ToolMessage.raw_output -> artifact (#24185 )	2024-07-12 09:52:44 -07:00

1 2 3 4 5 ...

329 Commits