langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-06 03:20:49 +00:00

Author	SHA1	Message	Date
Leonid Ganeline	d937fa4f9c	docs: `Tutorials` update (#18230 ) A big update of the `Tutorials` page. Cleaned it up. Added several new resources.	2024-03-01 11:07:39 -08:00
Yujie Qian	cbb65741a7	community[patch]: Voyage AI updates default model and batch size (#17655 ) - Description: update the default model and batch size in VoyageEmbeddings - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: fodizoltan <zoltan@conway.expert>	2024-03-01 10:22:24 -08:00
Shengsheng Huang	ae471a7dcb	community[minor]: add BigDL-LLM integrations (#17953 ) - Description: [`bigdl-llm`](https://github.com/intel-analytics/BigDL) is a library for running LLM on Intel XPU (from Laptop to GPU to Cloud) using INT4/FP4/INT8/FP8 with very low latency (for any PyTorch model). This PR adds bigdl-llm integrations to langchain. - Issue: NA - Dependencies: `bigdl-llm` library - Contribution maintainer: @shane-huang Examples added: - docs/docs/integrations/llms/bigdl.ipynb	2024-03-01 10:04:53 -08:00
Ethan Yang	f61cb8d407	community[minor]: Add openvino backend support (#11591 ) - Description: add openvino backend support by HuggingFace Optimum Intel, - Dependencies: “optimum[openvino]”, --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-01 10:04:24 -08:00
Leonid Ganeline	6d0af4e805	docs: nvidia: provider page update (#18054 ) Nvidia provider page is missing a Triton Inference Server package reference. Changes: - added the Triton Inference Server reference - copied the example notebook from the package into the doc files. - added the Triton Inference Server description and links, the link to the above example notebook - formatted page to the consistent format NOTE: It seems that the [example notebook](https://github.com/langchain-ai/langchain/blob/master/libs/partners/nvidia-trt/docs/llms.ipynb) was originally created in wrong place. It should be in the LangChain docs [here](https://github.com/langchain-ai/langchain/tree/master/docs/docs/integrations/llms). So, I've created a copy of this example. The original example is still in the nvidia-trt package.	2024-03-01 10:00:42 -08:00
Jacob Lee	590d47bff4	docs[patch]: Add Neo4j GraphAcademy to tutorials section (#18353 )	2024-02-29 20:50:24 -07:00
Bagatur	4730ee2766	docs: update api ref nav (#18362 )	2024-02-29 19:04:56 -08:00
Bagatur	12f19b8a6a	infra: update create_api_rst (#18361 )	2024-02-29 19:04:44 -08:00
Bagatur	5efb5c099f	text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346 )	2024-02-29 18:33:21 -08:00
Erick Friis	bce0684327	docs: airbyte deps note (#18243 )	2024-02-29 16:02:13 -08:00
Jib	72bfc1d3db	mongodb[minor]: MongoDB Partner Package -- Porting MongoDBAtlasVectorSearch (#17652 ) This PR migrates the existing MongoDBAtlasVectorSearch abstraction from the `langchain_community` section to the partners package section of the codebase. - [x] Run the partner package script as advised in the partner-packages documentation. - [x] Add Unit Tests - [x] Migrate Integration Tests - [x] Refactor `MongoDBAtlasVectorStore` (autogenerated) to `MongoDBAtlasVectorSearch` - [x] ~Remove~ deprecate the old `langchain_community` VectorStore references. ## Additional Callouts - Implemented the `delete` method - Included any missing async function implementations - `amax_marginal_relevance_search_by_vector` - `adelete` - Added new Unit Tests that test for functionality of `MongoDBVectorSearch` methods - Removed [`del res[self._embedding_key]`](`e0c81e1cb0/libs/community/langchain_community/vectorstores/mongodb_atlas.py (L218)`) in `_similarity_search_with_score` function as it would make the `maximal_marginal_relevance` function fail otherwise. The `Document` needs to store the embedding key in metadata to work. Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [x] Add tests and docs: If you're adding a new integration, please include 1. Existing tests supplied in docs/docs do not change. Updated docstrings for new functions like `delete` 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. (This already exists) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Steven Silvester <steven.silvester@ieee.org> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 23:09:48 +00:00
Bagatur	a6f0506aaf	docs: query analysis use case (#17766 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-29 12:33:49 -08:00
Filip Schouwenaars	4c62362eab	Add links to relevant DataCamp code alongs (#18332 ) This PR adds links to some more free resources for people to get acquainted with Langhchain without having to configure their system. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --> Co-authored-by: Filip Schouwenaars <filipsch@users.noreply.github.com>	2024-02-29 11:25:01 -08:00
Virat Singh	cd926ac3dd	community: Add PolygonFinancials Tool (#18324 ) Description: In this PR, I am adding a `PolygonFinancials` tool, which can be used to get financials data for a given ticker. The financials data is the fundamental data that is found in income statements, balance sheets, and cash flow statements of public US companies. Twitter: [@virattt](https://twitter.com/virattt)	2024-02-29 10:56:05 -08:00
Leonid Ganeline	d43fa2eab1	docs `providers` update (#18336 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-02-29 10:53:12 -08:00
Bagatur	6a5b084704	docs: update func calling doc (#18300 )	2024-02-29 09:45:07 -08:00
Averi Kitsch	1b63530274	docs: update Google documentation (#18297 ) Description: update Google documentation Issue: Dependencies:	2024-02-29 01:42:44 +00:00
Leonid Ganeline	1d865a7e86	docs: `google` provider page fixes (#18290 ) Several URL-s were broken (in the yesterday PR). Like [Integrations/platforms/google/Document Loaders](https://python.langchain.com/docs/integrations/platforms/google#document-loaders) page, Example link to "Document Loaders / Cloud SQL for PostgreSQL" and most of the new example links in the Document Loaders, Vectorstores, Memory sections. - fixed URL-s (manually verified all example links) - sorted sections in page to follow the "integrations/components" menu item order. - fixed several page titles to fix Navbar item order --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 00:45:03 +00:00
aditya thomas	eb0c178d75	docs: update to the list of partner packages in the list of providers (#18252 ) Description: Update to the list of partner packages in the list of providers Issue: Google & Nvidia had two entries each, both pointing to the same page Dependencies: None	2024-02-28 15:40:14 -08:00
ccurme	9bf58ec7dd	update extraction use-case docs (#17979 ) Update extraction use-case docs to showcase and explain all modes of `create_structured_output_runnable`.	2024-02-28 17:32:04 -05:00
kkdamowang	4899a72b56	docs: remove duplicate word in lcel/streaming (#18249 ) - Description: Remove duplicate word in lcel/streaming. - Issue: No. - Dependencies: No.	2024-02-28 21:50:26 +00:00
Eugene Yurtsev	cd52433ba0	community[minor]: Add `SQLDatabaseLoader` document loader (#18281 ) - Description: A generic document loader adapter for SQLAlchemy on top of LangChain's `SQLDatabaseLoader`. - Needed by: https://github.com/crate-workbench/langchain/pull/1 - Depends on: GH-16655 - Addressed to: @baskaryan, @cbornet, @eyurtsev Hi from CrateDB again, in the same spirit like GH-16243 and GH-16244, this patch breaks out another commit from https://github.com/crate-workbench/langchain/pull/1, in order to reduce the size of this patch before submitting it, and to separate concerns. To accompany the SQLAlchemy adapter implementation, the patch includes integration tests for both SQLite and PostgreSQL. Let me know if corresponding utility resources should be added at different spots. With kind regards, Andreas. ### Software Tests ```console docker compose --file libs/community/tests/integration_tests/document_loaders/docker-compose/postgresql.yml up ``` ```console cd libs/community pip install psycopg2-binary pytest -vvv tests/integration_tests -k sqldatabase ``` ``` 14 passed ``` ![image](https://github.com/langchain-ai/langchain/assets/453543/42be233c-eb37-4c76-a830-474276e01436) --------- Co-authored-by: Andreas Motl <andreas.motl@crate.io>	2024-02-28 21:02:28 +00:00
Jack Wotherspoon	92c34d4803	docs: update documentation for Google Cloud database integrations (#18265 ) Description: Fixing typos and rendering issues for Google Cloud database integrations. Issue: NA Dependencies: NA	2024-02-28 15:32:43 +00:00
Averi Kitsch	76eb553084	docs: add documentation for Google Cloud database integrations (#18225 ) Description: add documentation for Google Cloud database integrations Issue: NA Dependencies: NA	2024-02-27 21:17:30 -08:00
Erick Friis	be8d2ff5f7	airbyte[patch]: init pkg (#18236 )	2024-02-27 19:37:53 -08:00
Ayo Ayibiowu	ac1d7d9de8	community[feat]: Adds LLMLingua as a document compressor (#17711 ) Description: This PR adds support for using the [LLMLingua project ](https://github.com/microsoft/LLMLingua) especially the LongLLMLingua (Enhancing Large Language Model Inference via Prompt Compression) as a document compressor / transformer. The LLMLingua project is an interesting project that can greatly improve RAG system by compressing prompts and contexts while keeping their semantic relevance. Issue: https://github.com/microsoft/LLMLingua/issues/31 Dependencies: [llmlingua](https://pypi.org/project/llmlingua/) @baskaryan --------- Co-authored-by: Ayodeji Ayibiowu <ayodeji.ayibiowu@getinge.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-27 19:23:56 -08:00
Isaac Francisco	733367b795	docs: deprecation of OpenAI functions agent, astream_events docstring (#18164 ) Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-27 09:14:53 -08:00
Harrison Chase	b0ccaf5917	Harrison/add structured output (#18165 )	2024-02-27 08:25:09 -08:00
Max Jakob	5ab69f907f	partners: add Elasticsearch package (#17467 ) ### Description This PR moves the Elasticsearch classes to a partners package. Note that we will not move (and later remove) `ElasticKnnSearch`. It were previously deprecated. `ElasticVectorSearch` is going to stay in the community package since it is used quite a lot still. Also note that I left the `ElasticsearchTranslator` for self query untouched because it resides in main `langchain` package. ### Dependencies There will be another PR that updates the notebooks (potentially pulling them into the partners package) and templates and removes the classes from the community package, see https://github.com/langchain-ai/langchain/pull/17468 #### Open question How to make the transition smooth for users? Do we move the import aliases and require people to install `langchain-elasticsearch`? Or do we remove the import aliases from the `langchain` package all together? What has worked well for other partner packages? --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-26 23:19:47 +00:00
matt haigh	a4896da2a0	Experimental: Add other threshold types to SemanticChunker (#16807 ) Description Adding different threshold types to the semantic chunker. I’ve had much better and predictable performance when using standard deviations instead of percentiles. ![image](https://github.com/langchain-ai/langchain/assets/44395485/066e84a8-460e-4da5-9fa1-4ff79a1941c5) For all the documents I’ve tried, the distribution of distances look similar to the above: positively skewed normal distribution. All skews I’ve seen are less than 1 so that explains why standard deviations perform well, but I’ve included IQR if anyone wants something more robust. Also, using the percentile method backwards, you can declare the number of clusters and use semantic chunking to get an ‘optimal’ splitting. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 13:50:48 -08:00
am-kinetica	9b8f6455b1	Langchain vectorstore integration with Kinetica (#18102 ) - Description: New vectorstore integration with the Kinetica database - Issue: - Dependencies: the Kinetica Python API `pip install gpudb==7.2.0.1`, - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: --------- Co-authored-by: Chad Juliano <cjuliano@kinetica.com>	2024-02-26 12:46:48 -08:00
Dan Stambler	69344a0661	community: Add Laser Embedding Integration (#18111 ) - Description: Added Integration with Meta AI's LASER Language-Agnostic SEntence Representations embedding library, which supports multilingual embedding for any of the languages listed here: https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200, including several low resource languages - Dependencies: laser_encoders	2024-02-26 12:16:37 -08:00
Heidi Steen	166f3d8351	Docs: azuresearch.ipynb (in docs/docs/integrations/vectorstores) -- fixed headings and comments (#18135 ) This PR updates azuresearch.ipynb with an edit to the introduction sentence, consistent heading levels, and disambiguation in code comments.	2024-02-26 11:46:55 -08:00
Barun Amalkumar Halder	23fc7c8c90	docs [patch] : fix import to use community path for handler in fiddler notebook (#18140 ) Description: Update the example fiddler notebook to use community path, instead of langchain.callback Dependencies: None Twitter handle: @bhalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-26 11:41:07 -08:00
Bagatur	96bff0ed5d	infra: create api rst for specific pkg (#18144 ) Example: create rst for libs/core only ```bash poetry run python docs/api_reference/create_api_rst.py core ```	2024-02-26 11:04:22 -08:00
Erick Friis	f5cf6975ba	docs: anthropic partner package docs (#18109 )	2024-02-26 17:51:44 +00:00
rongchenlin	9147a437f1	docs: Fix the bug in MongoDBChatMessageHistory notebook (#18128 ) I tried to configure MongoDBChatMessageHistory using the code from the original documentation to store messages based on the passed session_id in MongoDB. However, this configuration did not take effect, and the session id in the database remained as 'test_session'. To resolve this issue, I found that when configuring MongoDBChatMessageHistory, it is necessary to set session_id=session_id instead of session_id=test_session. Issue: DOC: Ineffective Configuration of MongoDBChatMessageHistory for Custom session_id Storage previous code： ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id="test_session", connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` ![image](https://github.com/langchain-ai/langchain/assets/83388493/c372f785-1ec1-43f5-8d01-b7cc07b806b7) Modified code: ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id=session_id, # here is my modify code connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` Effect after modification (it works)： ![image](https://github.com/langchain-ai/langchain/assets/83388493/5776268c-9098-4da3-bf41-52825be5fafb)	2024-02-26 15:02:56 +00:00
Matt	3b08617a89	docs: update azure search langchain notebook (#18053 ) Description: Update the azure search notebook to have more descriptive comments, and an option to choose between OpenAI and AzureOpenAI Embeddings --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:48:13 -08:00
Barun Amalkumar Halder	cc69976860	community[minor] : adds callback handler for Fiddler AI (#17708 ) Description: Callback handler to integrate fiddler with langchain. This PR adds the following - 1. `FiddlerCallbackHandler` implementation into langchain/community 2. Example notebook `fiddler.ipynb` for usage documentation [Internal Tracker : FDL-14305] Issue: NA Dependencies: - Installation of langchain-community is unaffected. - Usage of FiddlerCallbackHandler requires installation of latest fiddler-client (2.5+) Twitter handle: @fiddlerlabs @behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-25 18:17:03 -08:00
Christophe Bornet	b8b5ce0c8c	astradb: Add AstraDBChatMessageHistory to langchain-astradb package (#17732 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:14:49 -08:00
BeatrixCohere	5d2d80a9a8	docs: Add Cohere examples in documentation (#17794 ) - Description: Add cohere examples to documentation - Issue:N/A - Dependencies: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:10:09 -08:00
Jacob Lee	c9eac3287e	docs[patch]: Remove redundant Pinecone import (#18079 ) CC @efriis	2024-02-24 19:27:54 -08:00
Erick Friis	e85948d46b	docs: fireworks tool calling docs (#18057 )	2024-02-24 00:49:11 +00:00
Erick Friis	1a3383fba1	docs: fireworks fixes (#18056 )	2024-02-23 15:58:53 -08:00
Yufei (Benny) Chen	ee6a773456	fireworks[patch]: Add Fireworks partner packages (#17694 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-23 20:45:47 +00:00
Erick Friis	11cf95e810	docs: recommend lambdas over runnablebranch (#18033 )	2024-02-23 11:34:27 -08:00
Reid Falconer	0534ba5a7d	langchain[patch]: return formatted SPARQL query on demand (#11263 ) - Description: Added the `return_sparql_query` feature to the `GraphSparqlQAChain` class, allowing users to get the formatted SPARQL query along with the chain's result. - Issue: NA - Dependencies: None Note: I've ensured that the PR passes linting and testing by running make format, make lint, and make test locally. I have added a test for the integration (which relies on network access) and I have added an example to the notebook showing its use.	2024-02-22 17:03:26 -08:00
Issac	46505742eb	Update quickstart.mdx (#17659 ) https://github.com/langchain-ai/langchain/issues/17657 Thank you for contributing to LangChain! Checklist: - [ ] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: Delete this entire template message and replace it with the following bulleted list - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-22 17:01:40 -08:00
Stan Duprey	15e42f1799	docs: Added `langchainhub` install and fixed typo (#17985 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-22 16:03:40 -08:00
Chad Juliano	50ba3c68bb	community[minor]: add Kinetica LLM wrapper (#17879 ) Description: Initial pull request for Kinetica LLM wrapper Issue: N/A Dependencies: No new dependencies for unit tests. Integration tests require gpudb, typeguard, and faker Twitter handle: @chad_juliano Note: There is another pull request for Kinetica vectorstore. Ultimately we would like to make a partner package but we are starting with a community contribution.	2024-02-22 16:02:00 -08:00

1 2 3 4 5 ...

3105 Commits