langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-11 19:11:02 +00:00

Author	SHA1	Message	Date
Bagatur	933bc0d6ff	core[patch]: support additional kwargs on StructuredPrompt (#25645 )	2024-09-02 14:55:26 -07:00
Nuno Campos	464dae8ac2	core: Include global variables in variables found by get_function_nonlocals (#25936 )	2024-09-02 11:49:25 -07:00
Bagatur	d19e074374	core[patch]: handle serializable fields that cant be converted to bool (#25903 )	2024-09-01 16:44:33 -07:00
Bagatur	fabd3295fa	core[patch]: dont mutate merged lists/dicts (#25858 ) Update merging utils to - not mutate objects - have special handling for 'type' keys in dicts	2024-08-29 20:34:54 +00:00
Erick Friis	c8b8335b82	core: prompt variable error msg (#25787 )	2024-08-28 22:54:00 +00:00
Christophe Bornet	ff0df5ea15	core[patch]: Add B(bugbear) ruff rules (#25520 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-28 07:09:29 +00:00
Erick Friis	3dc7d447aa	infra: reenable min version testing 2, ci ignore ai21 (#25709 )	2024-08-23 23:28:42 +00:00
Erick Friis	6096c80b71	core: pydantic output parser streaming fix (#24415 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 18:00:09 -07:00
Eugene Yurtsev	c316361115	core[patch]: Add _api.rename_parameter to support renaming of parameters in functions (#25101 ) Add ability to rename parameters in function signatures ```python @rename_parameter(since="2.0.0", removal="3.0.0", old="old_name", new="new_name") def foo(new_name: str) -> str: """original doc""" return new_name ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-22 17:16:31 -07:00
Yusuke Fukasawa	0258cb96fa	core[patch]: add additionalProperties recursively to oai function if strict (#25169 ) Hello. First of all, thank you for maintaining such a great project. ## Description In https://github.com/langchain-ai/langchain/pull/25123, support for structured_output is added. However, `"additionalProperties": false` needs to be included at all levels when a nested object is generated. error from current code: https://gist.github.com/fufufukakaka/e9b475300e6934853d119428e390f204 ``` BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for response_format 'JokeWithEvaluation': In context=('properties', 'self_evaluation'), 'additionalProperties' is required to be supplied and to be false", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}} ``` Reference: [Introducing Structured Outputs in the API](https://openai.com/index/introducing-structured-outputs-in-the-api/) ```json { "model": "gpt-4o-2024-08-06", "messages": [ { "role": "system", "content": "You are a helpful math tutor." }, { "role": "user", "content": "solve 8x + 31 = 2" } ], "response_format": { "type": "json_schema", "json_schema": { "name": "math_response", "strict": true, "schema": { "type": "object", "properties": { "steps": { "type": "array", "items": { "type": "object", "properties": { "explanation": { "type": "string" }, "output": { "type": "string" } }, "required": ["explanation", "output"], "additionalProperties": false } }, "final_answer": { "type": "string" } }, "required": ["steps", "final_answer"], "additionalProperties": false } } } } ``` In the current code, `"additionalProperties": false` is only added at the last level. This PR introduces the `_add_additional_properties_key` function, which recursively adds `"additionalProperties": false` to the entire JSON schema for the request. Twitter handle: `@fukkaa1225` Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-23 00:08:58 +00:00
Bagatur	b35ee09b3f	infra: xfail pydantic v2 arg to py function (#25686 ) Issue to track: #25687	2024-08-22 23:52:57 +00:00
Christophe Bornet	ee98da4f4e	core[patch]: Add UP(upgrade) ruff rules (#25358 )	2024-08-22 16:29:22 -07:00
Vadym Barda	46d344c33d	core[patch]: support drawing nested subgraphs in draw_mermaid (#25581 ) Previously the code was able to only handle a single level of nesting for subgraphs in mermaid. This change adds support for arbitrary nesting of subgraphs.	2024-08-22 16:08:49 -07:00
CastaChick	7d13a2f958	core[patch]: add option to specify the chunk separator in `merge_message_runs` (#24783 ) Description: LLM will stop generating text even in the middle of a sentence if `finish_reason` is `length` (for OpenAI) or `stop_reason` is `max_tokens` (for Anthropic). To obtain longer outputs from LLM, we should call the message generation API multiple times and merge the results into the text to circumvent the API's output token limit. The extra line breaks forced by the `merge_message_runs` function when seamlessly merging messages can be annoying, so I added the option to specify the chunk separator. Issue: No corresponding issues. Dependencies: No dependencies required. Twitter handle: @hanama_chem https://x.com/hanama_chem --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 19:46:25 +00:00
Ivan	5b9290a449	Fix UnionType type var replacement (#25566 ) [langchain_core] Fix UnionType type var replacement - Added types.UnionType to typing.Union mapping. Type replacement causes `TypeError: 'type' object is not subscriptable` for union types when the function `_py_38_safe_origin` returns `types.UnionType` instead of `typing.Union` ```python >>> from types import UnionType >>> from typing import Union, get_origin >>> type_ = get_origin(str | None) >>> type_ <class 'types.UnionType'> >>> UnionType[(str, None)] Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'type' object is not subscriptable >>> Union[(str, None)] typing.Optional[str] ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:22:09 -04:00
William FH	8230ba47f3	core[patch]: Improve some error messages and add another test for checking RunnableWithMessageHistory (#25209 ) Also add more useful error messages. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-22 18:14:27 +00:00
Bagatur	39c44817ae	infra: test convert_message (#25632 )	2024-08-21 18:24:06 +00:00
Bagatur	8a71f1b41b	core[minor]: add langsmith document loader (#25493 ) needs tests	2024-08-20 10:22:14 -07:00
Bagatur	4bd005adb6	core[patch]: Allow bound models as token_counter in trim_messages (#25563 )	2024-08-20 00:21:22 -07:00
Bagatur	6b98207eda	infra: test chat prompt ser/des (#25557 )	2024-08-19 15:27:36 -07:00
ccurme	b83f1eb0d5	core, partners: implement standard tracing params for LLMs (#25410 )	2024-08-16 13:18:09 -04:00
William FH	75ae585deb	Merge support for group manager (#25360 )	2024-08-15 09:56:31 -07:00
Bagatur	2494cecabf	core[patch]: tool import fix (#25419 )	2024-08-14 22:54:13 +00:00
Chengyu Yan	d0ad713937	core: fix issue #24660, solve error messages about `ValueError` when using a model with history (#25183 ) - Description: This PR solves error messages about `ValueError` when using a model with history. Details in #24660. #22933 causes `langchain_core.runnables.history.RunnableWithMessageHistory._get_output_messages` to miss the type check of `output_val` if `output_val` is `False`. After running `RunnableWithMessageHistory._is_not_async`, `output` is `False`. `249945a572/libs/core/langchain_core/runnables/history.py (L323-L334)` `15a36dd0a2/libs/core/langchain_core/runnables/history.py (L461-L471)` ~~I suggest that `_get_output_messages` return empty list when `output_val == False`.~~ - Issue: #24660 - Dependencies: No change. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-14 14:26:22 +00:00
Bagatur	493e474063	docs: updated api reference (#25172 ) - Move the API reference into the vercel build - Update api reference organization and styling	2024-08-14 07:00:17 -07:00
Eugene Yurtsev	6dd9f053e3	core[patch]: Deprecating beta upsert APIs in vectorstore (#25069 ) This PR deprecates the beta upsert APIs in vectorstore. We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight. The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior w/ respect to IDs since ID can be present in both the function signature and as an optional attribute on the document object. But VectorStores that pass the standard tests should have implemented the semantics properly! --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-09 17:17:36 -04:00
Eugene Yurtsev	b6f0174bb9	community[patch],core[patch]: Update EdenaiTool root_validator and add unit test in core (#25233 ) This PR gets rid of the `root_validators(allow_reuse=True)` logic used in EdenAI Tool in preparation for pydantic 2 upgrade. - add another test to secret_from_env_factory	2024-08-09 15:59:27 +00:00
Eugene Yurtsev	429a0ee7fd	core[minor]: Add factory for looking up secrets from the env (#25198 ) Add factory method for looking up secrets from the env.	2024-08-08 16:41:58 -04:00
Erick Friis	c6ece6a96d	core: autodetect more ls params (#25044 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-08 12:44:21 -07:00
Eugene Yurtsev	30fb345342	core[minor]: Add from_env utility (#25189 ) Add a utility that can be used as a default factory. The goal will be to start migrating some of the pydantic models to use `from_env` as a default factory if possible. ```python from pydantic import Field, BaseModel from langchain_core.utils import from_env class Foo(BaseModel): name: str = Field(default_factory=from_env('HELLO')) ```	2024-08-08 14:52:35 -04:00
Eugene Yurtsev	2f209d84fa	core[patch]: Add pydantic get_fields adapter (#25187 ) Add adapter to get fields	2024-08-08 17:47:42 +00:00
Eugene Yurtsev	7b1a132aff	core[patch]: Add unit tests for Serializable (#25152 ) Add a few test cases for serializable (many other test cases already covered through runnable tests).	2024-08-07 21:01:36 +00:00
ccurme	803eba3163	core[patch]: check for model_fields attribute (#25108 ) `__fields__` raises a warning in pydantic v2	2024-08-07 13:32:56 -07:00
Erick Friis	dff83cce66	core[patch]: base language model disable_streaming (#25070 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-07 09:26:21 -07:00
Eugene Yurtsev	28e0958ff4	core[patch]: Relax rate limit unit tests in terms of timing (#25140 ) Relax rate limit unit tests	2024-08-07 14:04:58 +00:00
Eugene Yurtsev	d283f452cc	core[minor]: Add support for DocumentIndex in the index api (#25100 ) Support document index in the index api.	2024-08-06 12:30:49 -07:00
William FH	267855b3c1	Set Context in RunnableSequence & RunnableParallel (#25073 )	2024-08-06 11:10:37 -07:00
Gram Liu	88a9a6a758	core[patch]: Add pydantic metadata to subset model (#25032 ) - Description: This includes Pydantic field metadata in `_create_subset_model_v2` so that it gets included in the final serialized form that gets sent out. - Issue: #25031 - Dependencies: n/a - Twitter handle: @gramliu --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-05 17:57:39 -07:00
Bagatur	e572521f2a	core[patch]: exclude special pydantic init params (#25084 )	2024-08-05 23:32:51 +00:00
Eugene Yurtsev	4bcd2aad6c	core[patch]: Relax time constraints on rate limit test (#25071 ) Try to keep the unit test fast, but also have it repeat more robustly	2024-08-05 17:04:22 -04:00
Eugene Yurtsev	41dfad5104	core[minor]: Introduce DocumentIndex abstraction (#25062 ) This PR adds a minimal document indexer abstraction. The goal of this abstraction is to allow developers to create custom retrievers that also have a standard indexing API and allow updating the document content in them. The abstraction comes with a test suite that can verify that the indexer implements the correct semantics. This is an iteration over a previous PR (https://github.com/langchain-ai/langchain/pull/24364). The main difference is that we're sub-classing from BaseRetriever in this iteration and so have consolidated the sync and async interfaces. The main problem with the current design is that runtime search configuration has to be specified at init rather than provided at run time. We will likely resolve this issue in one of two ways: (1) Define a method (`get_retriever`) that will allow creating a retriever at run time with a specific configuration. If we do this, we will likely break the subclass on BaseRetriever (2) Generalize base retriever so it can support structured queries --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 18:06:33 +00:00
Bagatur	1dcee68cb8	docs: show beta directive (#25013 ) ![Screenshot 2024-08-02 at 7 15 34 PM](https://github.com/user-attachments/assets/086831c7-36f3-4962-98dc-d707b6289747)	2024-08-03 03:07:45 +00:00
Bagatur	57747892ce	docs: show deprecation warning first in api ref (#25001 ) OLD ![Screenshot 2024-08-02 at 3 29 39 PM](https://github.com/user-attachments/assets/7f169121-1202-4770-a006-d72ac7a1aa33) NEW ![Screenshot 2024-08-02 at 3 29 45 PM](https://github.com/user-attachments/assets/9cc07cbd-2ae9-4077-95c5-03cb051e6cd7)	2024-08-02 17:35:25 -07:00
Bagatur	199e9c5ae0	core[patch]: Fix tool args schema inherited field parsing (#24936 ) Fix #24925	2024-08-01 18:36:33 -07:00
WU LIFU	ad16eed119	core[patch]: runnable config ensure_config deep copy from var_child_runnable… (#24862 ) issue: #24660 RunnableWithMessageHistory.stream results in an error because the [evaluation](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/runnables/branch.py#L220) of the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) unexpectedly triggers the "[on_end](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`)" (exit_history) callback of the default branch descriptions After a lot of investigation I'm convinced that the root cause is that 1. during the execution of the runnable, the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L122)`) is shared between the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) runnable and the [default branch runnable](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`) within the same context 2. when the default branch runnable runs, it gets the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L163)`) and may unintentionally [add more handlers](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L325)`) to the callback manager of this config 3. when it is again the turn for the [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) to run, it gets the `var_child_runnable_config` whose callback manager has the handlers added by the default branch. When it runs that handler (`exit_history`), it leads to the error. Assuming that the `ensure_config` function actually does want to create an immutable copy from `var_child_runnable_config`, because it starts with an [`empty` variable](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L156)`), I went ahead and did a deepcopy to ensure that future modifications to the returned value won't affect the `var_child_runnable_config` variable. Having said that, I actually 1. don't know if this is a proper fix 2. don't know whether it will lead to other unintended consequences 3. don't know why only "stream" runs into this issue while "invoke" runs without problem, so @nfcampos @hwchase17 please help review, thanks! --------- Co-authored-by: Lifu Wu <lifu@nextbillion.ai> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 17:30:32 -07:00
Eugene Yurtsev	75776e4a54	core[patch]: In unit tests, use `_schema()` instead of BaseModel.schema() (#24930 ) This PR introduces a module with some helper utilities for the pydantic 1 -> 2 migration. They're meant to be used in the following way: 1) Use the utility code to get unit tests to pass without requiring modification to the unit tests 2) (If desired) upgrade the unit tests to match pydantic 2 output 3) (If desired) stop using the utility code Currently, this module contains a way to map `schema()` generated by pydantic 2 to (mostly) match the output from pydantic v1.	2024-08-01 11:59:04 -04:00
Bagatur	25b93cc4c0	core[patch]: stringify tool non-content blocks (#24626 ) Slightly breaking bugfix. Shouldn't cause too many issues since no models would be able to handle non-content block ToolMessage.content anyways.	2024-07-31 16:42:38 -07:00
Eugene Yurtsev	210623b409	core[minor]: Add support for pydantic 2 to utility to get fields (#24899 ) Add compatibility for pydantic 2 for a utility function. This will help push some small changes to master, so they don't have to be kept track of on a separate branch.	2024-07-31 19:11:07 +00:00
Eugene Yurtsev	5099a9c9b4	core[patch]: Update unit tests with a workaround for using AnyID in pydantic 2 (#24892 ) Pydantic 2 ignores __eq__ overload for subclasses of strings.	2024-07-31 14:42:12 -04:00
Bagatur	8461934c2b	core[patch], integrations[patch]: convert TypedDict to tool schema support (#24641 ) supports following UX ```python class SubTool(TypedDict): """Subtool docstring""" args: Annotated[Dict[str, Any], {}, "this does bar"] class Tool(TypedDict): """Docstring Args: arg1: foo """ arg1: str arg2: Union[int, str] arg3: Optional[List[SubTool]] arg4: Annotated[Literal["bar", "baz"], ..., "this does foo"] arg5: Annotated[Optional[float], None] ``` - can parse google style docstring - can use Annotated to specify default value (second arg) - can use Annotated to specify arg description (third arg) - can have nested complex types	2024-07-31 18:27:24 +00:00
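
Commit 0258cb96fa in the log above describes adding `"additionalProperties": false` at every level of a nested JSON schema. Below is a minimal standalone sketch of that idea, not the code from the PR: it does not import langchain, and the names `add_additional_properties_false` and `_mark_objects_strict` are hypothetical. It assumes object nodes are identified by `"type": "object"` and is not careful about schema keywords such as `default` or `enum` whose values might themselves look like schemas.

```python
from copy import deepcopy
from typing import Any


def _mark_objects_strict(node: Any) -> None:
    """Walk a JSON-schema fragment in place and close every object node."""
    if isinstance(node, dict):
        # The key is set before iterating, so the dict is not resized mid-loop.
        if node.get("type") == "object":
            node["additionalProperties"] = False
        for value in node.values():
            _mark_objects_strict(value)
    elif isinstance(node, list):
        for item in node:
            _mark_objects_strict(item)


def add_additional_properties_false(schema: dict) -> dict:
    """Return a deep copy of `schema` with every object node closed."""
    strict = deepcopy(schema)  # leave the caller's schema untouched
    _mark_objects_strict(strict)
    return strict


# Example input modelled on the nested schema quoted in the commit message.
schema = {
    "type": "object",
    "properties": {
        "steps": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "explanation": {"type": "string"},
                    "output": {"type": "string"},
                },
                "required": ["explanation", "output"],
            },
        },
        "final_answer": {"type": "string"},
    },
    "required": ["steps", "final_answer"],
}

strict_schema = add_additional_properties_false(schema)
```

Working on a deep copy keeps the caller's schema unchanged, which matters if the same schema dict is reused for non-strict requests.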

359 Commits