langchain

Commit Graph

Author	SHA1	Message	Date
Lincoln Stein	c314222796	Add a conversation memory that combines a (optionally persistent) vectorstore history with a token buffer (#22155 ) langchain: ConversationVectorStoreTokenBufferMemory -Description: This PR adds ConversationVectorStoreTokenBufferMemory. It is similar in concept to ConversationSummaryBufferMemory. It maintains an in-memory buffer of messages up to a preset token limit. After the limit is hit timestamped messages are written into a vectorstore retriever rather than into a summary. The user's prompt is then used to retrieve relevant fragments of the previous conversation. By persisting the vectorstore, one can maintain memory from session to session. -Issue: n/a -Dependencies: none -Twitter handle: Please no!!! - [X] Add tests and docs: I looked to see how the unit tests were written for the other ConversationMemory modules, but couldn't find anything other than a test for successful import. I need to know whether you are using pytest.mock or another fixture to simulate the LLM and vectorstore. In addition, I would like guidance on where to place the documentation. Should it be a notebook file in docs/docs? - [X] Lint and test: I am seeing some linting errors from a couple of modules unrelated to this PR. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	3 months ago
Bagatur	32f8f39974	core[patch]: use args_schema doc for tool description (#23503 )	3 months ago
ccurme	6f7fe82830	text-splitters: release 0.2.2 (#23508 )	3 months ago
ccurme	62b16fcc6b	experimental: release 0.0.62 (#23507 )	3 months ago
ccurme	99ce84ef23	community: release 0.2.6 (#23501 )	3 months ago
ccurme	03c41e725e	langchain: release 0.2.6 (#23426 )	3 months ago
ccurme	86ca44d451	core: release 0.2.10 (#23420 )	3 months ago
Isaac Francisco	85f5d14cef	[docs]: split up tool docs (#22919 )	3 months ago
ccurme	f788d0982d	docs: update trim messages guide (#23418 ) - rerun to remove warnings following https://github.com/langchain-ai/langchain/pull/23363 - `raise` -> `return`	3 months ago
ccurme	c9619349d6	docs: rerun chatbot tutorial to remove warnings (#23417 )	3 months ago
Nuradil	c93d9e66e4	Community: Update and fix ZenGuardTool docs and add ZenguardTool to init files (#23415 ) Thank you for contributing to LangChain! - [x] PR title: "community: update docs and add tool to init.py" - [x] PR message: - Description: Fixed some errors and comments in the docs and added our ZenGuardTool and additional classes to init.py for easy access when importing - Question: when will you update the langchain-community package in pypi to make our tool available? - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Thank you for review! --------- Co-authored-by: Baur <baur.krykpayev@gmail.com>	3 months ago
William FH	8955bc1866	[Core] Logging: Suppress missing parent warning (#23363 )	3 months ago
ccurme	730c551819	core[patch]: export tool output parsers from langchain_core.output_parsers (#23305 ) These currently read off AIMessage.tool_calls, and only fall back to OpenAI parsing if tool calls aren't populated. Importing these from `openai_tools` (e.g., in our [tool calling docs](https://python.langchain.com/v0.2/docs/how_to/tool_calling/#tool-calls)) can lead to confusion. After landing, would need to release core and update docs.	3 months ago
Eugene Yurtsev	7e9e69c758	core[patch]: Add unit test for str and repr for Document (#23414 )	3 months ago
Bagatur	f055f2a1e3	infra: install integration deps as needed (#23413 )	3 months ago
Bagatur	92ac0fc9bd	openai[patch]: Release 0.1.10 (#23410 )	3 months ago
Bagatur	fb3df898b5	docs: Update README.md (#23409 )	3 months ago
Bagatur	9d145b9630	openai[patch]: fix tool calling token counting (#23408 ) Resolves https://github.com/langchain-ai/langchain/issues/23388	3 months ago
Tomaz Bratanic	22fa32e164	LLM Graph transformer dealing with empty strings (#23368 ) Pydantic allows empty strings: ``` from langchain.pydantic_v1 import Field, BaseModel class Property(BaseModel): """A single property consisting of key and value""" key: str = Field(..., description="key") value: str = Field(..., description="value") x = Property(key="", value="") ``` Which can produce errors downstream. We simply ignore those records	3 months ago
Rajendra Kadam	d3520a784f	docs: Added providers page for Pebblo and docs for PebbloRetrievalQA (#20746 ) - Description: Added providers page for Pebblo and docs for PebbloRetrievalQA - Issue: NA - Dependencies: None - Unit tests: NA	3 months ago
clement.l	a75b32a54a	docs: Fix typo in LLMChain tutorial (#23380 ) Description: Fix a typo Issue: n/a Dependencies: None Twitter handle:	3 months ago
Riccardo Schirone	4530d851e4	Merge pull request #22662 * core: runnables: special handling GeneratorExit because no error	3 months ago
Qingchuan Hao	ad50702934	community: add default value to bing_search_url (#23306 ) bing_search_url is an endpoint to requests bing search resource and is normally invariant to users, we can give it the default value to simply the uesages of this utility/tool	3 months ago
ccurme	68e0ae3286	langchain[patch]: update removal target for LLMChain (#23373 ) to 1.0 Also improve replacement example in docstring.	3 months ago
wenngong	b33d2346db	community: FlashrankRerank support loading customer client (#23350 ) Description: FlashrankRerank Document compressor support loading customer client Issue: #23338 Co-authored-by: gongwn1 <gongwn1@lenovo.com>	3 months ago
maang-h	f58c40b4e3	docs: Update QianfanChatEndpoint ChatModel docstring (#23337 ) - Description: Update QianfanChatEndpoint ChatModel rich docstring - Issue: the issue #22296	3 months ago
Rahul Triptahi	9ef93ecd7c	community[minor]: Added classification_location parameter in PebbloSafeLoader. (#22565 ) Description: Add classifier_location feature flag. This flag enables Pebblo to decide the classifier location, local or pebblo-cloud. Unit Tests: N/A Documentation: N/A --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	3 months ago
Mirna Wong	2115fb76de	Replace llm variable with model (#23280 ) The code snippet under ‘pdfs_qa’ contains an small incorrect code example , resulting in users getting errors. This pr replaces ‘llm’ variable with ‘model’ to help user avoid a NameError message. Resolves #22689 If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	3 months ago
wenngong	af620db9c7	partners: add lint docstrings for azure-dynamic-sessions/together modules (#23303 ) Description: add lint docstrings for azure-dynamic-sessions/together modules Issue: #23188 @baskaryan test: ruff check passed. <img width="782" alt="image" src="https://github.com/langchain-ai/langchain/assets/76683249/bf11783d-65b3-4e56-a563-255eae89a3e4"> --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	3 months ago
yuncliu	398b2b9c51	community[minor]: Add Ascend NPU optimized Embeddings (#20260 ) - Description: Add NPU support for embeddings --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	3 months ago
Ikko Eltociear Ashimine	7b1066341b	docs: update sql_query_checking.ipynb (#23345 ) creat -> create	3 months ago
S M Zia Ur Rashid	d5b2a93c6d	package: security update urllib3 to @1.26.19 (#23366 ) urllib3 version update 1.26.18 to 1.26.19 to address a security vulnerability. Reference: https://security.snyk.io/vuln/SNYK-PYTHON-URLLIB3-7267250	3 months ago
Jacob Lee	57c13b4ef8	docs[patch]: Fix typo in how to guide for message history (#23364 )	3 months ago
Luis Rueda	168e9ed3a5	partners: add custom options to MongoDBChatMessageHistory (#22944 ) Description: Adds options for configuring MongoDBChatMessageHistory (no breaking changes): - session_id_key: name of the field that stores the session id - history_key: name of the field that stores the chat history - create_index: whether to create an index on the session id field - index_kwargs: additional keyword arguments to pass to the index creation Discussion: https://github.com/langchain-ai/langchain/discussions/22918 Twitter handle: @userlerueda --------- Co-authored-by: Jib <Jibzade@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	3 months ago
Eugene Yurtsev	1e750f12f6	standard-tests[minor]: Add standard read write test suite for vectorstores (#23355 ) Add standard read write test suite for vectorstores	3 months ago
Eugene Yurtsev	3b3ed72d35	standard-tests[minor]: Add standard tests for BaseStore (#23360 ) Add standard tests to base store abstraction. These only work on [str, str] right now. We'll need to check if it's possible to add encoder/decoders to generalize	3 months ago
ccurme	e1190c8f3c	mongodb[patch]: fix CI for python 3.12 (#23369 )	3 months ago
RUO	2b87e330b0	community: fix issue with nested field extraction in MongodbLoader (#22801 ) Description: This PR addresses an issue in the `MongodbLoader` where nested fields were not being correctly extracted. The loader now correctly handles nested fields specified in the `field_names` parameter. Issue: Fixes an issue where attempting to extract nested fields from MongoDB documents resulted in `KeyError`. Dependencies: No new dependencies are required for this change. Twitter handle: (Optional, your Twitter handle if you'd like a mention when the PR is announced) ### Changes 1. Field Name Parsing: - Added logic to parse nested field names and safely extract their values from the MongoDB documents. 2. Projection Construction: - Updated the projection dictionary to include nested fields correctly. 3. Field Extraction: - Updated the `aload` method to handle nested field extraction using a recursive approach to traverse the nested dictionaries. ### Example Usage Updated usage example to demonstrate how to specify nested fields in the `field_names` parameter: ```python loader = MongodbLoader( connection_string=MONGO_URI, db_name=MONGO_DB, collection_name=MONGO_COLLECTION, filter_criteria={"data.job.company.industry_name": "IT", "data.job.detail": { "$exists": True }}, field_names=[ "data.job.detail.id", "data.job.detail.position", "data.job.detail.intro", "data.job.detail.main_tasks", "data.job.detail.requirements", "data.job.detail.preferred_points", "data.job.detail.benefits", ], ) docs = loader.load() print(len(docs)) for doc in docs: print(doc.page_content) ``` ### Testing Tested with a MongoDB collection containing nested documents to ensure that the nested fields are correctly extracted and concatenated into a single page_content string. ### Note This change ensures backward compatibility for non-nested fields and improves functionality for nested field extraction. ### Output Sample ```python print(docs[:3]) ``` ```shell # output sample: [ Document( # Here in this example, page_content is the combined text from the fields below # "position", "intro", "main_tasks", "requirements", "preferred_points", "benefits" page_content='all combined contents from the requested fields in the document', metadata={'database': 'Your Database name', 'collection': 'Your Collection name'} ), ... ] ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	3 months ago
Tomaz Bratanic	aeeda370aa	Sanitize backticks from neo4j labels and types for import (#23367 )	3 months ago
Jacob Lee	d2db561347	docs[patch]: Adds callout in LLM concept docs, remove deprecated code (#23361 ) CC @baskaryan @hwchase17	3 months ago
Rave Harpaz	f5ff7f178b	Add OCI Generative AI new model support (#22880 ) - [x] PR title: community: Add OCI Generative AI new model support - [x] PR message: - Description: adding support for new models offered by OCI Generative AI services. This is a moderate update of our initial integration PR 16548 and includes a new integration for our chat models under /langchain_community/chat_models/oci_generative_ai.py - Issue: NA - Dependencies: No new Dependencies, just latest version of our OCI sdk - Twitter handle: NA - [x] Add tests and docs: 1. we have updated our unit tests 2. we have updated our documentation including a new ipynb for our new chat integration - [x] Lint and test: `make format`, `make lint`, and `make test` run successfully --------- Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com>	3 months ago
Jacob Lee	753edf9c80	docs[patch]: Update chatbot tools how-to guide (#23362 )	3 months ago
Baur	aa358f2be4	community: Add ZenGuard tool (#22959 ) Description This is the community integration of ZenGuard AI - the fastest guardrails for GenAI applications. ZenGuard AI protects against: - Prompts Attacks - Veering of the pre-defined topics - PII, sensitive info, and keywords leakage. - Toxicity - Etc. Twitter Handle : @zenguardai - [x] Add tests and docs: If you're adding a new integration, please include 1. Added an integration test 2. Added colab - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. --------- Co-authored-by: Nuradil <nuradil.maksut@icloud.com> Co-authored-by: Nuradil <133880216+yaksh0nti@users.noreply.github.com>	3 months ago
Mathis Joffre	60103fc4a5	community: Fix OVHcloud 401 Unauthorized on embedding. (#23260 ) They are now rejecting with code 401 calls from users with expired or invalid tokens (while before they were being considered anonymous). Thus, the authorization header has to be removed when there is no token. Related to: #23178 --------- Signed-off-by: Joffref <mariusjoffre@gmail.com>	3 months ago
Baskar Gopinath	4964ba74db	Update multimodal_prompts.ipynb (#23301 ) fixes #23294 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	3 months ago
Eugene Yurtsev	d90379210a	standard-tests[minor]: Add standard tests for cache (#23357 ) Add standard tests for cache abstraction	3 months ago
Leonid Ganeline	987099cfcd	community: `toolkits` docstrings (#23286 ) Added missed docstrings. Formatted docstrings to the consistent form. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	3 months ago
Rahul Triptahi	0cd3f93361	Enhance metadata of sharepointLoader. (#22248 ) Description: 2 feature flags added to SharePointLoader in this PR: 1. load_auth: if set to True, adds authorised identities to metadata 2. load_extended_metadata, adds source, owner and full_path to metadata Unit tests:N/A Documentation: To be done. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	3 months ago
Yuki Watanabe	5d4133d82f	community: Overhaul Databricks provider documentation (#23203 ) Description: Update [Databricks Provider](https://python.langchain.com/v0.2/docs/integrations/providers/databricks/) documentations to the latest component notebooks and draw better navigation path to related notebooks. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	3 months ago
Bagatur	bcac6c3aff	openai[patch]: temp fix ignore lint (#23290 )	3 months ago

... 2 3 4 5 6 ...

10211 Commits (2be66a38d849cd81296e5dad9353a2d246fa158f) All Branches Search

10211 Commits (2be66a38d849cd81296e5dad9353a2d246fa158f)

All Branches