langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
Nuno Campos	a99eb3abf4	openai[patch]: Assign message id in ChatOpenAI (#17837 )	2024-02-27 17:32:54 -08:00
Isaac Francisco	733367b795	docs: deprecation of OpenAI functions agent, astream_events docstring (#18164 ) Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-27 09:14:53 -08:00
Harrison Chase	b0ccaf5917	Harrison/add structured output (#18165 )	2024-02-27 08:25:09 -08:00
Bagatur	242af4b5a4	openai[patch], mistral[patch], fireworks[patch]: releases 0.0.8, 0.0.5, 0.0.2 (#18186 )	2024-02-27 04:22:24 -08:00
Bagatur	7e66d964c6	core[patch]: Release 0.1.27 (#18159 )	2024-02-26 17:27:38 -08:00
Harrison Chase	d7c607ca00	core[minor]: move document compressor base (#17910 )	2024-02-26 17:20:50 -08:00
Bagatur	b3f4de38ae	mistral[minor]: Function calling and with_structured_output (#18150 ) ![Screenshot 2024-02-26 at 2 07 06 PM](https://github.com/langchain-ai/langchain/assets/22008038/20cacb47-3b24-45b5-871b-dd169f1acd37)	2024-02-26 16:22:30 -08:00
Bagatur	c53aa5cd37	core[patch]: support JS message serial namespaces (#18151 )	2024-02-26 16:19:46 -08:00
Harrison Chase	c673717c2b	add optimization notebook (#18155 )	2024-02-26 16:09:31 -08:00
Max Jakob	5ab69f907f	partners: add Elasticsearch package (#17467 ) ### Description This PR moves the Elasticsearch classes to a partners package. Note that we will not move (and later remove) `ElasticKnnSearch`. It were previously deprecated. `ElasticVectorSearch` is going to stay in the community package since it is used quite a lot still. Also note that I left the `ElasticsearchTranslator` for self query untouched because it resides in main `langchain` package. ### Dependencies There will be another PR that updates the notebooks (potentially pulling them into the partners package) and templates and removes the classes from the community package, see https://github.com/langchain-ai/langchain/pull/17468 #### Open question How to make the transition smooth for users? Do we move the import aliases and require people to install `langchain-elasticsearch`? Or do we remove the import aliases from the `langchain` package all together? What has worked well for other partner packages? --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-26 23:19:47 +00:00
matt haigh	a4896da2a0	Experimental: Add other threshold types to SemanticChunker (#16807 ) Description Adding different threshold types to the semantic chunker. I’ve had much better and predictable performance when using standard deviations instead of percentiles. ![image](https://github.com/langchain-ai/langchain/assets/44395485/066e84a8-460e-4da5-9fa1-4ff79a1941c5) For all the documents I’ve tried, the distribution of distances look similar to the above: positively skewed normal distribution. All skews I’ve seen are less than 1 so that explains why standard deviations perform well, but I’ve included IQR if anyone wants something more robust. Also, using the percentile method backwards, you can declare the number of clusters and use semantic chunking to get an ‘optimal’ splitting. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 13:50:48 -08:00
Jaskirat Singh	ce682f5a09	community: vectorstores.kdbai - Added support for when no docs are present (#18103 ) - Description: By default it expects a list but that's not the case in corner scenarios when there is no document ingested(use case: Bootstrap application). \ Hence added as check, if the instance is panda Dataframe instead of list then it will procced with return immediately. - Issue: NA - Dependencies: NA - Twitter handle: jaskiratsingh1 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 12:47:06 -08:00
am-kinetica	9b8f6455b1	Langchain vectorstore integration with Kinetica (#18102 ) - Description: New vectorstore integration with the Kinetica database - Issue: - Dependencies: the Kinetica Python API `pip install gpudb==7.2.0.1`, - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: --------- Co-authored-by: Chad Juliano <cjuliano@kinetica.com>	2024-02-26 12:46:48 -08:00
Bagatur	1e8ab83d7b	langchain[patch], core[patch], openai[patch], fireworks[minor]: ChatFireworks.with_structured_output (#18078 ) <img width="1192" alt="Screenshot 2024-02-24 at 3 39 39 PM" src="https://github.com/langchain-ai/langchain/assets/22008038/1cf74774-a23f-4b06-9b9b-85dfa2f75b63">	2024-02-26 12:46:39 -08:00
GoodBai	3589a135ef	community: make `SET allow_experimental_[engine]_index` configurabe in vectorstores.clickhouse (#18107 ) ## Description & Issue While following the official doc to use clickhouse as a vectorstore, I found only the default `annoy` index is properly supported. But I want to try another engine `usearch` for `annoy` is not properly supported on ARM platforms. Here is the settings I prefer: ``` python settings = ClickhouseSettings( table="wiki_Ethereum", index_type="usearch", # annoy by default index_param=[], ) ``` The above settings do not work for the command `set allow_experimental_annoy_index=1` is hard-coded. This PR will make sure the experimental feature follow the `index_type` which is also consistent with Clickhouse's naming conventions.	2024-02-26 12:39:17 -08:00
Dan Stambler	69344a0661	community: Add Laser Embedding Integration (#18111 ) - Description: Added Integration with Meta AI's LASER Language-Agnostic SEntence Representations embedding library, which supports multilingual embedding for any of the languages listed here: https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200, including several low resource languages - Dependencies: laser_encoders	2024-02-26 12:16:37 -08:00
Erick Friis	257879e98d	infra: api docs setup action location (#18148 )	2024-02-26 11:50:21 -08:00
Erick Friis	28cf3aab45	infra: api docs build commit dir (#18147 )	2024-02-26 11:47:04 -08:00
Heidi Steen	166f3d8351	Docs: azuresearch.ipynb (in docs/docs/integrations/vectorstores) -- fixed headings and comments (#18135 ) This PR updates azuresearch.ipynb with an edit to the introduction sentence, consistent heading levels, and disambiguation in code comments.	2024-02-26 11:46:55 -08:00
Luan Fernandes	e867557936	[docs] Update doc-string for buffer_as_messages method in ConversationBufferWindowMemory (#18136 ) minor fix stated in #18080	2024-02-26 11:46:43 -08:00
Barun Amalkumar Halder	23fc7c8c90	docs [patch] : fix import to use community path for handler in fiddler notebook (#18140 ) Description: Update the example fiddler notebook to use community path, instead of langchain.callback Dependencies: None Twitter handle: @bhalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-26 11:41:07 -08:00
Bagatur	767523f364	core[patch], langchain[patch], templates: move openai functions parsers to core (#18060 ) ![Screenshot 2024-02-23 at 7 48 03 PM](https://github.com/langchain-ai/langchain/assets/22008038/e5540c4d-0020-4ece-869f-ae19db2a1f3f)	2024-02-26 11:12:53 -08:00
Bagatur	96bff0ed5d	infra: create api rst for specific pkg (#18144 ) Example: create rst for libs/core only ```bash poetry run python docs/api_reference/create_api_rst.py core ```	2024-02-26 11:04:22 -08:00
Nuno Campos	cd3ab3703b	Improve runnable generator error messages (#18142 ) h/t @hinthornw Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-26 18:54:25 +00:00
Nuno Campos	62a30efb12	Fix bug with using configurable_fields after configurable_alternatives (#18139 ) Closes #17915	2024-02-26 10:27:07 -08:00
Erick Friis	f5cf6975ba	docs: anthropic partner package docs (#18109 )	2024-02-26 17:51:44 +00:00
Nuno Campos	b1d9ce541d	Add BaseMessage.id (#17835 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-26 09:27:47 -08:00
Harrison Chase	935aefa8db	add run name for query constructor (#18101 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-26 08:17:05 -08:00
Mohammad Mohtashim	719a1cde75	langchain[patch]: Update doc-string for a method in ConversationBufferWindowMemory (#18090 ) A minor doc fix stated in #18080	2024-02-26 10:15:02 -05:00
Simon Schmidt	2716d58603	langchain: Import from langchain_core in langchain.smith to avoid deprecation warning (#18129 ) Avoids deprecation warning that triggered at import time, e.g. with `python -c 'import langchain.smith'` /opt/venv/lib/python3.12/site-packages/langchain/callbacks/__init__.py:37: LangChainDeprecationWarning: Importing this callback from langchain is deprecated. Importing it from langchain will no longer be supported as of langchain==0.2.0. Please import from langchain-community instead: `from langchain_community.callbacks import base`. To install langchain-community run `pip install -U langchain-community`.	2024-02-26 10:14:10 -05:00
rongchenlin	9147a437f1	docs: Fix the bug in MongoDBChatMessageHistory notebook (#18128 ) I tried to configure MongoDBChatMessageHistory using the code from the original documentation to store messages based on the passed session_id in MongoDB. However, this configuration did not take effect, and the session id in the database remained as 'test_session'. To resolve this issue, I found that when configuring MongoDBChatMessageHistory, it is necessary to set session_id=session_id instead of session_id=test_session. Issue: DOC: Ineffective Configuration of MongoDBChatMessageHistory for Custom session_id Storage previous code： ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id="test_session", connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` ![image](https://github.com/langchain-ai/langchain/assets/83388493/c372f785-1ec1-43f5-8d01-b7cc07b806b7) Modified code: ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id=session_id, # here is my modify code connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` Effect after modification (it works)： ![image](https://github.com/langchain-ai/langchain/assets/83388493/5776268c-9098-4da3-bf41-52825be5fafb)	2024-02-26 15:02:56 +00:00
Erick Friis	e3b7779926	docs: api docs for external repos (#17904 ) Stacked on google removal PR. Will make google continue to show up in API docs even from external repo	2024-02-26 06:19:09 +00:00
Erick Friis	248c5b84ee	google-genai, google-vertexai: move to langchain-google (#17899 ) These packages have moved to https://github.com/langchain-ai/langchain-google Left tombstone readmes incase anyone ends up at the "Source Code" link from old pypi releases. Can keep these around for a few months.	2024-02-25 21:58:05 -08:00
Erick Friis	3b5bdbfee8	anthropic[minor]: package move (#17974 )	2024-02-25 21:57:26 -08:00
Christophe Bornet	a2d5fa7649	community[patch]: Fix GenericRequestsWrapper _aget_resp_content must be async (#18065 ) There are existing tests in `libs/community/tests/unit_tests/tools/requests/test_tool.py`	2024-02-25 19:07:07 -08:00
Neli Hateva	a01e8473f8	community[patch]: Fix GraphSparqlQAChain so that it works with Ontotext GraphDB (#15009 ) - Description: Introduce a new parameter `graph_kwargs` to `RdfGraph` - parameters used to initialize the `rdflib.Graph` if `query_endpoint` is set. Also, do not set `rdflib.graph.DATASET_DEFAULT_GRAPH_ID` as default value for the `rdflib.Graph` `identifier` if `query_endpoint` is set. - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-02-25 19:05:21 -08:00
Christophe Bornet	4d6cd5b46a	astradb[patch]: Use astrapy's upsert_one method in AstraDBStore (#18063 ) As `upsert` is deprecated	2024-02-25 19:04:18 -08:00
Danny McAteer	e42110f720	docs: Additional examples for partners/exa README (#18081 ) Description: Add additional examples for other modules to partners/exa README Issue: #17545 Dependencies: None Twitter handle: @DannyMcAteer8 --------- Co-authored-by: Daniel McAteer <danielmcateer@Daniels-MBP.attlocal.net> Co-authored-by: Daniel McAteer <danielmcateer@Daniels-MacBook-Pro.local>	2024-02-25 18:53:47 -08:00
dokato	5afb242161	langchain[patch]: Make BooleanOutputParser more robust to non-binary responses (#17810 ) - Description: I encountered this error when I tried to use LLMChainFilter. Even if the message slightly differs, like `Not relevant (NO)` this results in an error. It has been reported already here: https://github.com/langchain-ai/langchain/issues/. This change hopefully makes it more robust. - Issue: #11408 - Dependencies: No - Twitter handle: dokatox	2024-02-25 18:48:33 -08:00
Matt	3b08617a89	docs: update azure search langchain notebook (#18053 ) Description: Update the azure search notebook to have more descriptive comments, and an option to choose between OpenAI and AzureOpenAI Embeddings --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:48:13 -08:00
kYLe	17ecf6e119	community[patch]: Remove model limitation on Anyscale LLM (#17662 ) Description: Llama Guard is deprecated from Anyscale public endpoint. Issue: Change the default model. and remove the limitation of only use Llama Guard with Anyscale LLMs Anyscale LLM can also works with all other Chat model hosted on Anyscale. Also added `async_client` for Anyscale LLM	2024-02-25 18:21:19 -08:00
Barun Amalkumar Halder	cc69976860	community[minor] : adds callback handler for Fiddler AI (#17708 ) Description: Callback handler to integrate fiddler with langchain. This PR adds the following - 1. `FiddlerCallbackHandler` implementation into langchain/community 2. Example notebook `fiddler.ipynb` for usage documentation [Internal Tracker : FDL-14305] Issue: NA Dependencies: - Installation of langchain-community is unaffected. - Usage of FiddlerCallbackHandler requires installation of latest fiddler-client (2.5+) Twitter handle: @fiddlerlabs @behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-25 18:17:03 -08:00
Christophe Bornet	b8b5ce0c8c	astradb: Add AstraDBChatMessageHistory to langchain-astradb package (#17732 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:14:49 -08:00
Maxime Perrin	c06a8732aa	community[patch]: fix llama index imports and fields access (#17870 ) - Description: Fixing outdated imports after v0.10 llama index update and updating metadata and source text access - Issue: #17860 - Twitter handle: @maximeperrin_ --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-02-25 18:14:23 -08:00
BeatrixCohere	5d2d80a9a8	docs: Add Cohere examples in documentation (#17794 ) - Description: Add cohere examples to documentation - Issue:N/A - Dependencies: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:10:09 -08:00
Jacob Lee	c9eac3287e	docs[patch]: Remove redundant Pinecone import (#18079 ) CC @efriis	2024-02-24 19:27:54 -08:00
2jimoo	7fc903464a	community: Add document manager and mongo document manager (#17320 ) - Description: - Add DocumentManager class, which is a nosql record manager. - In order to use index and aindex in libs/langchain/langchain/indexes/_api.py, DocumentManager inherits RecordManager. - Also I added the MongoDB implementation of Document Manager too. - Dependencies: pymongo, motor <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: Add DocumentManager class, which is a no sql record manager. To use index method and aindex method in indexes._api.py, Document Manager inherits RecordManager.Add the MongoDB implementation of Document Manager. - Dependencies: pymongo, motor Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-23 21:32:52 -05:00
Leonid Ganeline	3f6bf852ea	experimental: docstrings update (#18048 ) Added missed docstrings. Formatted docsctrings to the consistent format.	2024-02-23 21:24:16 -05:00
kYLe	56b955fc31	community[minor]: Add async_client for Anyscale Chat model (#18050 ) Add `async_client` for Anyscale Chat_model	2024-02-23 21:22:54 -05:00
Eugene Yurtsev	68527b809d	core[patch]: Runnable with message history to use add_messages (#17958 ) This PR updates RunnableWithMessageHistory to use add_messages which will save on round-trips for any chat history abstractions that implement the optimization. If the optimization isn't implemented, add_messages automatically invokes add_message serially.	2024-02-23 21:19:38 -05:00

... 5 6 7 8 9 ...

8026 Commits