langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
David Lee	0394c6e126	community[minor]: add allow_dangerous_requests for OpenAPI toolkits (#19493 ) OpenAPI allow_dangerous_requests: community: add allow_dangerous_requests for OpenAPI toolkits Description: a description of the change Due to BaseRequestsTool changes, we need to pass allow_dangerous_requests manually. `b617085af0/libs/community/langchain_community/tools/requests/tool.py (L26-L46)` While OpenAPI toolkits didn't pass it in the arguments. `b617085af0/libs/community/langchain_community/agent_toolkits/openapi/planner.py (L262-L269)` Issue: the issue # it fixes, if applicable https://github.com/langchain-ai/langchain/issues/19440 If not passing allow_dangerous_requests, it won't be able to do requests. Dependencies: any dependencies required for this change Not much --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-09 17:14:02 +00:00
Guangdong Liu	301dc3dfd2	docs: Get rid of ZeroShotAgent and use create_react_agent instead (#20157 ) - Issue: #20122 - @baskaryan, @eyurtsev.	2024-04-09 12:00:29 -05:00
jeff kit	ac42e96e4c	community[patch], langchain[minor]: Enhance Tencent Cloud VectorDB, langchain: make Tencent Cloud VectorDB self query retrieve compatible (#19651 ) - make Tencent Cloud VectorDB support metadata filtering. - implement delete function for Tencent Cloud VectorDB. - support both Langchain Embedding model and Tencent Cloud VDB embedding model. - Tencent Cloud VectorDB support filter search keyword, compatible with langchain filtering syntax. - add Tencent Cloud VectorDB TranslationVisitor, now work with self query retriever. - more documentations. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-09 16:50:48 +00:00
Haris Ali	1b480914b4	docs: Fix the class links in openai_tools and openai_functions description in output parser documentations (#20197 ) - Description: In this PR I fixed the links which points to the API docs for classes in OpenAI functions and OpenAI tools section of output parsers. - Issue: It fixed the issue #19969 Co-authored-by: Haris Ali <haris.ali@formulatrix.com>	2024-04-09 16:07:19 +00:00
Piyush Jain	cd7abc495a	community[minor]: add neptune analytics graph (#20047 ) Replacement for PR [#19772](https://github.com/langchain-ai/langchain/pull/19772). --------- Co-authored-by: Dave Bechberger <dbechbe@amazon.com> Co-authored-by: bechbd <bechbd@users.noreply.github.com>	2024-04-09 09:20:59 -05:00
Prince Canuma	1f9f4d8742	community[minor]: Add support for MLX models (chat & llm) (#18152 ) Description: This PR adds support for MLX models both chat (i.e., instruct) and llm (i.e., pretrained) types/ Dependencies: mlx, mlx_lm, transformers Twitter handle: @Prince_Canuma --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-09 14:17:07 +00:00
aditya thomas	6baeaf4802	docs: TogetherAI as a drop-in replacement for OpenAI (#19900 ) Description: TogetherAI as a drop-in replacement for OpenAI Issue: None Dependencies: None @baskaryan apropos #20032	2024-04-09 09:12:52 -05:00
Bagatur	1af7133828	docs: add vertexai to structured output (#20171 )	2024-04-08 16:09:49 -05:00
Alex Sherstinsky	5f563e040a	community: extend Predibase integration to support fine-tuned LLM adapters (#19979 ) - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: Langchain-Predibase integration was failing, because it was not current with the Predibase SDK; in addition, Predibase integration tests were instantiating the Langchain Community `Predibase` class with one required argument (`model`) missing. This change updates the Predibase SDK usage and fixes the integration tests. - Twitter handle: `@alexsherstinsky` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-08 18:54:29 +00:00
Bagatur	a27d88f12a	anthropic[patch]: standardize init args (#20161 ) Related to #20085	2024-04-08 12:09:06 -05:00
Bagatur	3490d70238	mistralai[patch]: standardize model params (#20163 ) Related to #20085	2024-04-08 11:48:38 -05:00
Bagatur	17182406f3	docs: standardize fireworks params (#20162 ) Related to #20085	2024-04-08 10:57:56 -05:00
Bagatur	5ae0e687b3	docs: use standard openai params (#20160 ) Part of #20085	2024-04-08 10:56:53 -05:00
david02871	e1a24d09c5	community: Add PHP language parser to document_loaders (#19850 ) Description: Added a PHP language parser to document_loaders Issue: N/A Dependencies: N/A Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-08 11:30:28 -04:00
Marlene	2f03bc397e	Community: Updating Azure Retriever and Docs to be Azure AI Search instead of Azure Cognitive Search (#19925 ) Last year Microsoft [changed the name](https://learn.microsoft.com/en-us/azure/search/search-what-is-azure-search) of Azure Cognitive Search to Azure AI Search. This PR updates the Langchain Azure Retriever API and it's associated docs to reflect this change. It may be confusing for users to see the name Cognitive here and AI in the Microsoft documentation which is why this is needed. I've also added a more detailed example to the Azure retriever doc page. There are more places that need a similar update but I'm breaking it up so the PRs are not too big 😄 Fixing my errors from the previous PR. Twitter: @marlene_zw Two new tests added to test backward compatibility in `libs/community/tests/integration_tests/retrievers/test_azure_cognitive_search.py` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-08 11:12:41 -04:00
Rahul Triptahi	820b713086	community[minor]: Add support for Pebblo cloud_api_key in PebbloSafeLoader (#19855 ) Description: _PebbloSafeLoader_: Add support for pebblo's cloud api-key in PebbloSafeLoader - This Pull request enables PebbloSafeLoader to accept pebblo's cloud api-key and send the semantic classification data to pebblo cloud. Documentation: Updated Unit test: Added Issue: NA Dependencies: - None Twitter handle: @rahul_tripathi2 Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-08 11:10:04 -04:00
Chris Germann	ba602dc562	Documentation: Fixed the typo of Discord -> Telegram (#20008 ) Description: Just fixed one string Issues: None Dependencies: None Twitter handle: @epu9byj Co-authored-by: gere <gere@kapo.zh.ch>	2024-04-06 20:00:03 +00:00
Jacob Lee	58a2123ca0	docs[patch]: Add missing redirects (#20076 )	2024-04-05 12:54:00 -07:00
Erick Friis	ebd24bb5d6	docs: fix title cap (#20048 )	2024-04-05 02:36:33 +00:00
Eugene Yurtsev	1ee8cf7b20	Docs: Update custom chat model (#19967 ) * Clean up in the existing tutorial * Add model_name to identifying params * Add table to summarize messages	2024-04-04 22:36:03 -04:00
Erick Friis	5fc7bb01e9	docs: weaviate docs (#20042 )	2024-04-04 19:01:02 -07:00
Bagatur	38fb1429fe	docs: fix together model tab (#20032 )	2024-04-04 15:33:43 -07:00
Jacob Lee	b69af26717	docs[patch]: Fix Model I/O quickstart (#20031 ) @baskaryan	2024-04-04 15:28:58 -07:00
Usama Ahmed	94ac42c573	docs: fixing typo in argument name (#20028 ) it's "mode" instead of "model", I fixed it	2024-04-04 22:28:28 +00:00
Bagatur	07eeeb84f3	docs: hide experimental anthropic (#20030 )	2024-04-04 15:27:52 -07:00
Leonid Ganeline	3856dedff4	docs: `integrations/providers` update 9 (#19941 ) - Added missed providers - Added links, descriptions in related examples - Formatted in a consistent format Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-04 21:37:48 +00:00
Bagatur	644ff46100	docs: mark anthropic tools wrapper as deprecated (#20024 )	2024-04-04 21:33:55 +00:00
Leonid Ganeline	69bf6262aa	docs: `integrations/providers/unstructured` update (#19892 ) Updated a page with existing document loaders with links to examples. Fixed formatting of one example. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-04 21:31:27 +00:00
Bagatur	1b7ed6071a	anthropic[patch]: Release 0.1.6 (#20026 )	2024-04-04 14:29:50 -07:00
Bagatur	6860450e48	anthropic[patch]: use anthropic 0.23 (#20022 )	2024-04-04 14:23:53 -07:00
Leonid Ganeline	4c969286fe	docs `integrations/providers` update 10 (#19970 ) Fixed broken links. Formatted to get consistent forms. Added missed imports in the example code	2024-04-04 14:22:45 -07:00
Leonid Ganeline	82f0198be2	docs: `graphs` update (#19675 ) Issue: The `graph` code was moved into the `community` package a long ago. But the related documentation is still in the [use_cases](https://python.langchain.com/docs/use_cases/graph/integrations/diffbot_graphtransformer) section and not in the `integrations`. Changes: - moved the `use_cases/graph/integrations` notebooks into the `integrations/graphs` - renamed files and changed titles to follow the consistent format - redirected old page URLs to new URLs in `vercel.json` and in several other pages - added descriptions and links when necessary - formatted into the consistent format	2024-04-04 14:13:22 -07:00
Bagatur	209de0a561	anthropic[minor]: tool use (#20016 )	2024-04-04 13:22:48 -07:00
Jacob Lee	7f0cb3bfba	docs[patch]: Make Docusaurus and Vercel add trailing slashes when navigating by default (#20014 ) Should hopefully avoid weird broken link edge cases. Relative links now trip up the Docusaurus broken link checker, so this PR also removes them. Also snuck in a small addition about asyncio	2024-04-04 12:49:15 -07:00
Christophe Bornet	02152d3909	[docs][minor]: Fix typo in Custom Document Loader doc (#20003 )	2024-04-04 10:59:33 -04:00
Jacob Lee	605c3f23e1	docs: reorg and visual refresh (#19765 ) - put use cases in main sidebar - move modules to own sidebar, rename components - cleanup lcel section - cleanup guides - update font, cell highlighting --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-04 00:58:36 -07:00
Graden Rea	88cf8a2905	groq: Add tool calling support (#19971 ) Description: Add with_structured_output to groq chat models Issue: Dependencies: N/A Twitter handle: N/A	2024-04-03 14:40:20 -07:00
Eugene Yurtsev	ea276d6547	docs: Custom Document Loaders (#19935 ) Add information that shows how to create custom document loaders	2024-04-03 15:34:01 -04:00
happy-go-lucky	c6432abdbe	community[patch]: Implement delete method and all async methods in opensearch_vector_search (#17321 ) - Description: In order to use index and aindex in libs/langchain/langchain/indexes/_api.py, I implemented delete method and all async methods in opensearch_vector_search - Dependencies: No changes	2024-04-03 09:40:49 -07:00
Cheng, Penghui	cc407e8a1b	community[minor]: weight only quantization with intel-extension-for-transformers. (#14504 ) Support weight only quantization with intel-extension-for-transformers. [Intel® Extension for Transformers](https://github.com/intel/intel-extension-for-transformers) is an innovative toolkit to accelerate Transformer-based models on Intel platforms, in particular effective on 4th Intel Xeon Scalable processor [Sapphire Rapids](https://www.intel.com/content/www/us/en/products/docs/processors/xeon-accelerated/4th-gen-xeon-scalable-processors.html) (codenamed Sapphire Rapids). The toolkit provides the below key features: * Seamless user experience of model compressions on Transformer-based models by extending [Hugging Face transformers](https://github.com/huggingface/transformers) APIs and leveraging [Intel® Neural Compressor](https://github.com/intel/neural-compressor) * Advanced software optimizations and unique compression-aware runtime. * Optimized Transformer-based model packages. * [NeuralChat](https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/neural_chat), a customizable chatbot framework to create your own chatbot within minutes by leveraging a rich set of plugins and SOTA optimizations. * [Inference](https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/llm/runtime/graph) of Large Language Model (LLM) in pure C/C++ with weight-only quantization kernels. This PR is an integration of weight only quantization feature with intel-extension-for-transformers. Unit test is in lib/langchain/tests/integration_tests/llm/test_weight_only_quantization.py The notebook is in docs/docs/integrations/llms/weight_only_quantization.ipynb. The document is in docs/docs/integrations/providers/weight_only_quantization.mdx. --------- Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-03 16:21:34 +00:00
aditya thomas	73ebe78249	docs: update cohere documentation (#19700 ) Description: Update of Cohere documentation (main provider page) Issue: After addition of the Cohere partner package, the documentation was out of date Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-02 18:16:48 -04:00
Wang Guan	8638029a37	docs: mention caveats with CacheBackedEmbeddings.embed_query (#19926 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: mention not-caching methods in CacheBackedEmbeddings - Issue: n/a I almost created one until I read the code - Dependencies: n/a - Twitter handle: `tarsylia` - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-02 19:19:29 +00:00
billytrend-cohere	de6c0cf248	cohere, docs: update imports and installs to langchain_cohere (#19918 ) cohere: update imports and installs to langchain_cohere --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-02 09:47:58 -07:00
Jamsheed Mistri	4f70bc119d	community[minor]: add Layerup Security integration (#19787 ) Description: adds integration with [Layerup Security](https://uselayerup.com). Docs can be found [here](https://docs.uselayerup.com). Integrates directly with our Python SDK. Dependencies: [LayerupSecurity](https://pypi.org/project/LayerupSecurity/) Note: all methods for our product require a paid API key, so I only included 1 test which checks for an invalid API key response. I have tested extensively locally. Twitter handle: [@layerup_](https://twitter.com/layerup_) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 23:49:00 +00:00
Brace Sproul	22f78c37c8	docs[patch]: Hide google from function calling docs (#19887 )	2024-04-01 14:26:31 -07:00
Mahdi Setayesh	c28efb878c	text-splitters[minor]: Adding a new section aware splitter to langchain (#16526 ) - Description: the layout of html pages can be variant based on the bootstrap framework or the styles of the pages. So we need to have a splitter to transform the html tags to a proper layout and then split the html content based on the provided list of tags to determine its html sections. We are using BS4 library along with xslt structure to split the html content using an section aware approach. - Dependencies: No new dependencies - Twitter handle: @m_setayesh Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 20:32:26 +00:00
northern-64bit	dfbc10c943	docs: Fix link in Unstructured notebook (#19851 ) Description: This PR fixes the link to the Unstructured documentation in the docs.	2024-04-01 15:26:48 -04:00
Brace Sproul	7538c4de19	docs[patch]: Revert quarto update (#19880 )	2024-04-01 12:11:27 -07:00
Anıl Berk Altuner	4384fa8e49	community[minor]: Add Dria retriever (#17098 ) [Dria](https://dria.co/) is a hub of public RAG models for developers to both contribute and utilize a shared embedding lake. This PR adds a retriever that can retrieve documents from Dria.	2024-04-01 12:04:19 -07:00
Ethan Yang	48f84e253e	community[minor]: Add OpenVINO rerank model support (#19791 ) @eaidova @AlexKoff88 Could you help to review, thanks --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 18:27:23 +00:00
Chenhui Zhang	a1f3e9f537	community[minor]: Update ChatZhipuAI to support GLM-4 model (#16695 ) Description: Update `ChatZhipuAI` to support the latest `glm-4` model. Issue: N/A Dependencies: httpx, httpx-sse, PyJWT The previous `ChatZhipuAI` implementation requires the `zhipuai` package, and cannot call the latest GLM model. This is because - The old version `zhipuai==1.` doesn't support the latest model. - `zhipuai==2.` requires `pydantic V2`, which is incompatible with 'langchain-community'. This re-implementation invokes the GLM model by sending HTTP requests to [open.bigmodel.cn](https://open.bigmodel.cn/dev/api) via the `httpx` package, and uses the `httpx-sse` package to handle stream events. --------- Co-authored-by: zR <2448370773@qq.com>	2024-04-01 18:11:21 +00:00
Jacob Lee	f06229bbf1	👥 Update LangChain people data (#19858 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-04-01 09:57:31 -07:00
Ikko Eltociear Ashimine	8711a05a51	Update cross_encoder_reranker.ipynb (#19846 ) HuggingFace -> Hugging Face	2024-04-01 10:49:54 -04:00
Vardhaman	039f314f20	docs: remove unnecessary args from the pip install (#19823 ) Description: An additional `U` argument was added for the instructions to install the pip packages for the MediaWiki Dump Document loader which was leading to error in installing the package. Removing the argument fixed the command to install. Issue: #19820 Dependencies: No dependency change requierd Twitter handle: [@vardhaman722](https://twitter.com/vardhaman722)	2024-04-01 10:47:26 -04:00
Kenneth Choe	f98d7f7494	langchain[minor], community[minor]: add CrossEncoderReranker with HuggingFaceCrossEncoder and SagemakerEndpointCrossEncoder (#13687 ) - Description: Support reranking based on cross encoder models available from HuggingFace. - Added `CrossEncoder` schema - Implemented `HuggingFaceCrossEncoder` and `SagemakerEndpointCrossEncoder` - Implemented `CrossEncoderReranker` that performs similar functionality to `CohereRerank` - Added `cross-encoder-reranker.ipynb` to demonstrate how to use it. Please let me know if anything else needs to be done to make it visible on the table-of-contents navigation bar on the left, or on the card list on [retrievers documentation page](https://python.langchain.com/docs/integrations/retrievers). - Issue: N/A - Dependencies: None other than the existing ones. --------- Co-authored-by: Kenny Choe <kchoe@amazon.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-31 20:51:31 +00:00
cxumol	3f7da03dd8	docs: fix a dead link (#19814 ) Description Google Colab returned 404 when trying to click an "Open In Colab" button from document. This PR corrected the link.	2024-03-31 10:28:51 -04:00
aditya thomas	b8271bbc4a	docs: (minor) updates to voyage ai documentation (#19819 ) Description: Updates to Voyage AI documentation Issue: Not Applicable Dependencies: None	2024-03-31 10:27:19 -04:00
aditya thomas	765d6762bc	docs[minor]: include tab info for togetherai (#19796 ) Description: Included information for the TogetherAI tab Issue: The tab for TogetherAI information was not correct Dependencies: None	2024-03-30 09:23:45 -04:00
anshaneel	0884e5de7f	community[minor]: Add Alpha Vantage API Tool (#14332 ) ### Description This implementation adds functionality from the AlphaVantage API, renowned for its comprehensive financial data. The class encapsulates various methods, each dedicated to fetching specific types of financial information from the API. ### Implemented Functions - `search_symbols`: - Searches the AlphaVantage API for financial symbols using the provided keywords. - `_get_market_news_sentiment`: - Retrieves market news sentiment for a specified stock symbol from the AlphaVantage API. - `_get_time_series_daily`: - Fetches daily time series data for a specific symbol from the AlphaVantage API. - `_get_quote_endpoint`: - Obtains the latest price and volume information for a given symbol from the AlphaVantage API. - `_get_time_series_weekly`: - Gathers weekly time series data for a particular symbol from the AlphaVantage API. - `_get_top_gainers_losers`: - Provides details on top gainers, losers, and most actively traded tickers in the US market from the AlphaVantage API. ### Issue: - #11994 ### Dependencies: - 'requests' library for HTTP requests. (import requests) - 'pytest' library for testing. (import pytest) --------- Co-authored-by: Adam Badar <94140103+adam-badar@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-30 00:44:01 +00:00
Snehil Kumar	b36f4147b0	docs: Google Drive Loader always set the env var (#14791 ) - Description: Code written by following, the official documentation of [Google Drive Loader](https://python.langchain.com/docs/integrations/document_loaders/google_drive), gives errors. I have opened an issue regarding this. See #14725. This is a pull request for modifying the documentation to use an approach that makes the code work. Basically, the change is that we need to always set the GOOGLE_APPLICATION_CREDENTIALS env var to an emtpy string, rather than only in case of RefreshError. Also, rewrote 2 paragraphs to make the instructions more clear. - Issue: See this related [issue # 14725](https://github.com/langchain-ai/langchain/issues/14725) - Dependencies: NA - Tag maintainer: @baskaryan - Twitter handle: NA Co-authored-by: Snehil <snehil@example.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 23:19:37 +00:00
M.Abdulrahman Alnaseer	ba54f1577f	community[minor]: add support for llmsherpa (#19741 ) Thank you for contributing to LangChain! - [x] PR title: "community: added support for llmsherpa library" - [x] Add tests and docs: 1. Integration test: 'docs/docs/integrations/document_loaders/test_llmsherpa.py'. 2. an example notebook: `docs/docs/integrations/document_loaders/llmsherpa.ipynb`. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 16:04:57 -07:00
Naveenkhasyap	a99bd098ac	docs: fix for #16702 and #16703 (#16705 ) - Description: Quickstart Documentation updates for missing dependency installation steps. - Issue: the issue # it prompts users to install required dependency. - Dependencies: no, - Twitter handle: @naveenkashyap_ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 15:57:51 -07:00
Brace Sproul	6d93a03bef	docs[patch]: Fix or remove broken mdx links (#19777 ) this pr also drops the community added action for checking broken links in mdx. It does not work well for our use case, throwing errors for local paths, plus the rest of the errors our in house solution had.	2024-03-29 15:25:08 -07:00
Brace Sproul	ce0a588ae6	docs[minor]: Add chat model tabs to docs pages (#19589 )	2024-03-29 14:23:55 -07:00
Nisarg Trivedi	1252ccce6f	text-splitters[minor]: Added Haskell support in langchain.text_splitter module (#16191 ) - Description: Haskell language support added in text_splitter module - Dependencies: No - Twitter handle: @nisargtr If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 20:17:50 +00:00
Hrvoje Milković	b7344e3347	community[minor]: Infobip tool integration (#16805 ) Description: Adding Tool that wraps Infobip API for sending sms or emails and email validation. Dependencies: None, Twitter handle: @hmilkovic Implementation: ``` libs/community/langchain_community/utilities/infobip.py ``` Integration tests: ``` libs/community/tests/integration_tests/utilities/test_infobip.py ``` Example notebook: ``` docs/docs/integrations/tools/infobip.ipynb ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 19:01:27 +00:00
Jan Chorowski	b8b42ccbc5	community[minor]: Pathway vectorstore(#14859 ) - Description: Integration with pathway.com data processing pipeline acting as an always updated vectorstore - Issue: not applicable - Dependencies: optional dependency on [`pathway`](https://pypi.org/project/pathway/) - Twitter handle: pathway_com The PR provides and integration with `pathway` to provide an easy to use always updated vector store: ```python import pathway as pw from langchain.embeddings.openai import OpenAIEmbeddings from langchain.text_splitter import CharacterTextSplitter from langchain.vectorstores import PathwayVectorClient, PathwayVectorServer data_sources = [] data_sources.append( pw.io.gdrive.read(object_id="17H4YpBOAKQzEJ93xmC2z170l0bP2npMy", service_user_credentials_file="credentials.json", with_metadata=True)) text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0) embeddings_model = OpenAIEmbeddings(openai_api_key=os.environ["OPENAI_API_KEY"]) vector_server = PathwayVectorServer( *data_sources, embedder=embeddings_model, splitter=text_splitter, ) vector_server.run_server(host="127.0.0.1", port="8765", threaded=True, with_cache=False) client = PathwayVectorClient( host="127.0.0.1", port="8765", ) query = "What is Pathway?" docs = client.similarity_search(query) ``` The `PathwayVectorServer` builds a data processing pipeline which continusly scans documents in a given source connector (google drive, s3, ...) and builds a vector store. The `PathwayVectorClient` implements LangChain's `VectorStore` interface and connects to the server to retrieve documents. --------- Co-authored-by: Mateusz Lewandowski <lewymati@users.noreply.github.com> Co-authored-by: mlewandowski <mlewandowski@MacBook-Pro-mlewandowski.local> Co-authored-by: Berke <berkecanrizai1@gmail.com> Co-authored-by: Adrian Kosowski <adrian@pathway.com> Co-authored-by: mlewandowski <mlewandowski@macbook-pro-mlewandowski.home> Co-authored-by: berkecanrizai <63911408+berkecanrizai@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: mlewandowski <mlewandowski@MBPmlewandowski.ht.home> Co-authored-by: Szymon Dudycz <szymond@pathway.com> Co-authored-by: Szymon Dudycz <szymon.dudycz@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 10:50:39 -07:00
ccurme	0dbd5f5012	add script to check imports (#19611 )	2024-03-29 13:30:20 -04:00
高璟琦	ec7a59c96c	community[minor]: Add solar embedding (#19761 ) Solar is a large language model developed by [Upstage](https://upstage.ai/). It's a powerful and purpose-trained LLM. You can visit the embedding service provided by Solar within this pr. You may get SOLAR_API_KEY from https://console.upstage.ai/services/embedding You can refer to more details about accepted llm integration at https://python.langchain.com/docs/integrations/llms/solar.	2024-03-29 09:36:05 -07:00
Leonid Ganeline	5f814820f6	docs: providers pinecone fix (#19737 ) Current providers page use link to the old package. - Fixed installation instructions - Added a reference to the Pinecone retriever	2024-03-29 08:30:30 -04:00
Bob Lin	53a74ad12b	docs: use markdown cell instead of code block (#19740 ) I found that the code of async and async batch was divided into two blocks: <img width="823" alt="Screenshot 2024-03-29 at 7 45 59 AM" src="https://github.com/langchain-ai/langchain/assets/10000925/0fa59d29-a692-4309-afb8-2260f03242ec"> so I changed it to unified.	2024-03-29 08:27:48 -04:00
Ekaterina Aidova	4ce36af335	docs: fix link in openvino integration doc (#19749 ) - Description: fix incorrect link in docs - Dependencies: None	2024-03-29 12:24:07 +00:00
Jialei	f7c903e24a	community[minor]: add support for Moonshot llm and chat model (#17100 )	2024-03-29 08:54:23 +00:00
Gustavo Isturiz	824dccf5e2	docs: fixed xml URL on sitemap docs exmaple, issue #17236 (#17304 )	2024-03-29 01:36:54 -07:00
Ethan Yang	7164015135	community[minor]: Add Openvino embedding support (#19632 ) This PR is used to support both HF and BGE embeddings with openvino --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>	2024-03-29 01:34:51 -07:00
kYLe	124ab79c23	community[minor]: Add Anyscale embedding support (#17605 ) Description: Add embedding model support for Anyscale Endpoint Dependencies: openai --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:53:53 +00:00
Jiaming	3d3cc71287	community[patch]: fix bugs for bilibili Loader (#18036 ) - Description: 1. Fix the BiliBiliLoader that can receive cookie parameters, it requires 3 other parameters to run. The change is backward compatible. 2. Add test; 3. Add example in docs - Issue: [#14213] Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 16:39:38 -07:00
Ethan Knights	1ef3fa0411	docs: improve readability of Langchain Expression Language get_started.ipynb (#18157 ) Description: A few grammatical changes to improve readability of the LCEL .ipynb and tidy some null characters. Issue: N/A Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 23:38:30 +00:00
Sachin Paryani	25c9f3d1d1	community[patch]: Support Streaming in Azure Machine Learning (#18246 ) - [x] PR title: "community: Support streaming in Azure ML and few naming changes" - [x] PR message: - Description: Added support for streaming for azureml_endpoint. Also, renamed and AzureMLEndpointApiType.realtime to AzureMLEndpointApiType.dedicated. Also, added new classes CustomOpenAIChatContentFormatter and CustomOpenAIContentFormatter and updated the classes LlamaChatContentFormatter and LlamaContentFormatter to now show a deprecated warning message when instantiated. --------- Co-authored-by: Sachin Paryani <saparan@microsoft.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:38:20 +00:00
Isaac Francisco	f5e84c8858	docs: fixing markdown for tips (#18199 ) Previous markdown code was not working as intended, new code should add green box around the tip so it is highlighted Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:37:37 +00:00
Hayden Wolff	85deee521a	docs: Nvidia Riva Runnables Documentation (#18237 ) - Description: Documents how to use the Riva runnables to add streamed automatic-speech-recognition (ASR) and text-to-speech (TTS) to chains. - Issue: None - Dependencies: None - Twitter handle: @HaydenWolff1 --------- Co-authored-by: Hayden Wolff <hwolff@Haydens-Laptop.local> Co-authored-by: Hayden Wolff <hwolff@MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:35:00 +00:00
Bob Lin	aba4bd0d13	docs: Add async batch case (#19686 )	2024-03-28 14:00:46 -07:00
Alessandro Rossi	665f15bd48	docs: fix typos and make quickstart more readable (#19712 ) Description: minor docs changes to make it more readable. Issue: N/A Dependencies: N/A Twitter handle: _kubealex	2024-03-28 20:10:32 +00:00
高璟琦	75173d31db	community[minor]: Add solar model chat model (#18556 ) Add our solar chat models, available model choices: * solar-1-mini-chat * solar-1-mini-translate-enko * solar-1-mini-translate-koen More documents and pricing can be found at https://console.upstage.ai/services/solar. The references to our solar model can be found at * https://arxiv.org/abs/2402.17032 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 12:31:11 -07:00
ccurme	b35e68c41f	docs: update use_cases/question_answering/chat_history (#19349 ) Update following https://github.com/langchain-ai/langchain/issues/19344	2024-03-28 12:51:01 -04:00
Christian Galo	1adaa3c662	community[minor]: Update Azure Cognitive Services to Azure AI Services (#19488 ) This is a follow up to #18371. These are the changes: - New Azure AI Services toolkit and tools to replace those of Azure Cognitive Services. - Updated documentation for Microsoft platform. - The image analysis tool has been rewritten to use the new package `azure-ai-vision-imageanalysis`, doing a proper replacement of `azure-ai-vision`. These changes: - Update outdated naming from "Azure Cognitive Services" to "Azure AI Services". - Update documentation to use non-deprecated methods to create and use agents. - Removes need to depend on yanked python package (`azure-ai-vision`) There is one new dependency that is needed as a replacement to `azure-ai-vision`: - `azure-ai-vision-imageanalysis`. This is optional and declared within a function. There is a new `azure_ai_services.ipynb` notebook showing usage; Changes have been linted and formatted. I am leaving the actions of adding deprecation notices and future removal of Azure Cognitive Services up to the LangChain team, as I am not sure what the current practice around this is. --- If this PR makes it, my handle is @galo@mastodon.social --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-03-28 03:19:02 +00:00
Shengsheng Huang	ac1dd8ad94	community[minor]: migrate `bigdl-llm` to `ipex-llm` (#19518 ) - Description: `bigdl-llm` library has been renamed to [`ipex-llm`](https://github.com/intel-analytics/ipex-llm). This PR migrates the `bigdl-llm` integration to `ipex-llm` . - Issue: N/A. The original PR of `bigdl-llm` is https://github.com/langchain-ai/langchain/pull/17953 - Dependencies: `ipex-llm` library - Contribution maintainer: @shane-huang Updated doc: docs/docs/integrations/llms/ipex_llm.ipynb Updated test: libs/community/tests/integration_tests/llms/test_ipex_llm.py	2024-03-27 20:12:59 -07:00
Chaunte W. Lacewell	a31f692f4e	community[minor]: Add VDMS vectorstore (#19551 ) - Description: Add support for Intel Lab's [Visual Data Management System (VDMS)](https://github.com/IntelLabs/vdms) as a vector store - Dependencies: `vdms` library which requires protobuf = "4.24.2". There is a conflict with dashvector in `langchain` package but conflict is resolved in `community`. - Contribution maintainer: [@cwlacewe](https://github.com/cwlacewe) - Added tests: libs/community/tests/integration_tests/vectorstores/test_vdms.py - Added docs: docs/docs/integrations/vectorstores/vdms.ipynb - Added cookbook: cookbook/multi_modal_RAG_vdms.ipynb --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 03:12:11 +00:00
yongheng.liu	7e29b6061f	community[minor]: integrate China Mobile Ecloud vector search (#15298 ) - Description: integrate China Mobile Ecloud vector search, - Dependencies: elasticsearch==7.10.1 Co-authored-by: liuyongheng <liuyongheng@cmss.chinamobile.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 23:02:40 +00:00
CaroFG	cf96060ab7	community[patch]: update for compatibility with latest Meilisearch version (#18970 ) - Description: Updates Meilisearch vectorstore for compatibility with v1.6 and above. Adds embedders settings and embedder_name which are now required. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 22:08:27 +00:00
Bagatur	b901649032	docs: move extraction up (#19667 )	2024-03-27 14:55:16 -07:00
Kangmoon Seo	d0accc3275	docs: fix error output in XMLOutputParser documentation (#19569 ) - Description: I've made a fix to a ParseError call in the XMLOutputParser documentation. - Issue: None - Dependencies: None Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-27 18:29:00 +00:00
Bagatur	5fc6531c74	docs: use first_tool_only instead of return_single (#19666 )	2024-03-27 18:19:39 +00:00
ccurme	4e9b358ed8	docs: Fix broken imports in documentation (#19655 ) Found via script in https://github.com/langchain-ai/langchain/pull/19611	2024-03-27 13:54:05 -04:00
yuwenzho	3a7d2cf443	community[minor]: Add ITREX optimized Embeddings (#18474 ) Introduction [Intel® Extension for Transformers](https://github.com/intel/intel-extension-for-transformers) is an innovative toolkit designed to accelerate GenAI/LLM everywhere with the optimal performance of Transformer-based models on various Intel platforms Description adding ITREX runtime embeddings using intel-extension-for-transformers. added mdx documentation and example notebooks added embedding import testing. --------- Signed-off-by: yuwenzho <yuwen.zhou@intel.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 07:22:06 +00:00
Fabrizio Ruocco	f12cb0bea4	community[patch]: Microsoft Azure Document Intelligence updates (#16932 ) - Description: Update Azure Document Intelligence implementation by Microsoft team and RAG cookbook with Azure AI Search --------- Co-authored-by: Lu Zhang (AI) <luzhan@microsoft.com> Co-authored-by: Yateng Hong <yatengh@microsoft.com> Co-authored-by: teethache <hongyateng2006@126.com> Co-authored-by: Lu Zhang <44625949+luzhang06@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 23:36:59 -07:00
Leonid Ganeline	3a978a4bdc	docs: `output_parsers` page fix (#19623 ) Issue with this [page](https://python.langchain.com/docs/modules/model_io/output_parsers/): Table: "Input Type" columns: strings `str \\| Message` (the escape char "\" doesn't work inside backticked text).	2024-03-26 22:17:41 -07:00
Ethan Yang	28cd5522c2	docs: fix typo in openvino document (#19627 )	2024-03-26 22:13:54 -07:00
xsai9101	1c27de6ce2	docs: Fix oracle doc loader format issue (#19628 )	2024-03-26 22:13:36 -07:00
Timothy	ad77fa15ee	community[patch]: Adding try-except block for GCSDirectoryLoader (#19591 ) - Description: Implemented try-except block for `GCSDirectoryLoader`. Reason: Users processing large number of unstructured files in a folder may experience many different errors. A try-exception block is added to capture these errors. A new argument `use_try_except=True` is added to enable silent failure so that error caused by processing one file does not break the whole function. - Issue: N/A - Dependencies: no new dependencies - Twitter handle: timothywong731 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 00:12:24 +00:00
fzowl	aea2be5bf3	voyageai[patch]: VoyageAI rerank (#19521 ) Adding VoyageAI reranking --------- Co-authored-by: fodizoltan <zoltan@conway.expert> Co-authored-by: Yujie Qian <thomasq0809@gmail.com>	2024-03-26 17:07:23 -07:00
Leonid Ganeline	4d85485e71	docs: `PromptTemplate` import from `core` (#19616 ) Changed import of `PromptTemplate` from `langchain` to `langchain_core` in all examples (notebooks)	2024-03-26 17:03:36 -07:00
xsai9101	160a8eb178	community[minor]: add oracle autonomous database doc loader integration (#19536 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: Adding oracle autonomous database document loader integration. This will allow users to connect to oracle autonomous database through connection string or TNS configuration. https://www.oracle.com/autonomous-database/ - Issue: None - Dependencies: oracledb python package https://pypi.org/project/oracledb/ - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Unit test and doc are added. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 17:02:18 -07:00
Ethan Yang	5784dfed00	docs: update openvino documents (#19543 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 22:15:30 +00:00
Leonid Ganeline	a3d24bc10b	docs: release date fix (#19585 ) Replaced the overdue release promise.	2024-03-26 14:51:09 -07:00
Raghav Rawat	b5640a0883	docs: Update apify.ipynb for Document class import (#19598 ) - Description: Update to correctly import Document class - from langchain_core.documents import Document - Issue: Fixes the notebook and the hosted documentation [here](https://python.langchain.com/docs/integrations/tools/apify) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 21:46:29 +00:00
Christophe Bornet	6f477e3cb6	docs: Remove chromadb from required dependency in examples with VectorstoreIndexCreator (#19578 )	2024-03-26 11:12:21 -04:00
Piyush Jain	72ba738bf5	community[minor]: Improvements for NeptuneRdfGraph, Improve discovery of graph schema using database statistics (#19546 ) Fixes linting for PR [19244](https://github.com/langchain-ai/langchain/pull/19244) --------- Co-authored-by: mhavey <mchavey@gmail.com>	2024-03-26 10:36:51 -04:00
aditya thomas	fc6b92bb9a	docs: add cohere to the list of partners (#19552 ) Description: Add Cohere to the list of LangChain partners Issue: The Cohere partner package was recently added [#19049](https://github.com/langchain-ai/langchain/pull/19049) Dependencies: None	2024-03-26 10:22:03 -04:00
Aayush Kataria	03c38005cb	community[patch]: Fixing some caching issues for AzureCosmosDBSemanticCache (#18884 ) Fixing some issues for AzureCosmosDBSemanticCache - Added the entry for "AzureCosmosDBSemanticCache" which was missing in langchain/cache.py - Added application name when creating the MongoClient for the AzureCosmosDBVectorSearch, for tracking purposes. @baskaryan, can you please review this PR, we need this to go in asap. These are just small fixes which we found today in our testing.	2024-03-25 19:06:17 -07:00
miri-bar	55db737302	ai21[minor]: AI21 Labs Semantic Text Splitter support (#19510 ) Description: Added support for AI21 Labs model - Segmentation, as a Text Splitter Dependencies: ai21, langchain-text-splitter Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 01:39:37 +00:00
Anindyadeep	b2a11ce686	community[minor]: Prem AI langchain integration (#19113 ) ### Prem SDK integration in LangChain This PR adds the integration with [PremAI's](https://www.premai.io/) prem-sdk with langchain. User can now access to deployed models (llms/embeddings) and use it with langchain's ecosystem. This PR adds the following: ### This PR adds the following: - [x] Add chat support - [X] Adding embedding support - [X] writing integration tests - [X] writing tests for chat - [X] writing tests for embedding - [X] writing unit tests - [X] writing tests for chat - [X] writing tests for embedding - [X] Adding documentation - [X] writing documentation for chat - [X] writing documentation for embedding - [X] run `make test` - [X] run `make lint`, `make lint_diff` - [X] Final checks (spell check, lint, format and overall testing) --------- Co-authored-by: Anindyadeep Sannigrahi <anindyadeepsannigrahi@Anindyadeeps-MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 01:37:19 +00:00
Alessandro D'Armiento	37eb3a4a9e	docs: Some import nits (#19130 ) - Description: fixes some minor issues in the documentation --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 01:25:44 +00:00
Anthony Shaw	6c9b0f96f3	docs: Add guidance for splitting Chinese, Japanese, and Thai (#19295 ) The existing default list of separators for the `RecursiveTextSplitter` assumes spaces are word boundaries. Some languages [don't use spaces between words](https://en.wikipedia.org/wiki/Category:Writing_systems_without_word_boundaries) (Chinese, Japanese, Thai, Burmese). This PR extends the documentation to explain how to cater for those languages by adding additional punctuation to the separators and zero-width spaces which are used by some typesetters and will assist the splitter to not split in words. Ideally, these separators could be a constant in the module but for now, defining them in the documentation is a start.	2024-03-26 00:34:00 +00:00
Ian	d5415dbd68	docs: improve tidb integrations documents (#19321 ) This PR aims to enhance the documentation for TiDB integration, driven by feedback from our users. It provides detailed introductions to key features, ensuring developers can fully leverage TiDB for AI application development.	2024-03-25 17:08:23 -07:00
Dmitry Tyumentsev	08b769d539	community[patch]: YandexGPT Use recent yandexcloud sdk version (#19341 ) Fixed inability to work with [yandexcloud SDK](https://pypi.org/project/yandexcloud/) version higher 0.265.0	2024-03-25 17:05:57 -07:00
Tridib Roy Arjo	d667b1ea8f	docs: Update async_chromium.ipynb (#19514 ) In Jupyter, asyncio would throw an error before `.load()` unless `nest_asyncio` is applied (Issue #8494 mentioned this) +Minor typo fixes..	2024-03-26 00:02:50 +00:00
Bob Lin	5b6b1f9e1d	docs: Fix several sample code errors (#19382 )	2024-03-25 16:59:52 -07:00
Hamid Ali	c281ec8887	docs: Fix broken link in semantic-chunker.ipynb (#19464 ) Corrected a broken link within the semantic-chunker.ipynb notebook, ensuring that users can access the referenced resource. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 23:39:32 +00:00
Ikko Eltociear Ashimine	980658cb47	docs: Update streaming.ipynb (#19500 ) Fixed typo. occuring -> occurring	2024-03-25 16:21:45 -07:00
Leonid Kuligin	91f4c80143	docs: fixed links (#19503 ) - [ ] PR title: "docs: fixed broken links" - [ ] PR message: - Description: fixed links in the documentation	2024-03-25 16:19:28 -07:00
Mikelarg	dac2e0165a	community[minor]: Added GigaChat Embeddings support + updated previous GigaChat integration (#19516 ) - Description: Added integration with [GigaChat](https://developers.sber.ru/portal/products/gigachat) embeddings. Also added support for extra fields in GigaChat LLM and fixed docs.	2024-03-25 16:08:37 -07:00
Erica Clark	a1ff21f90f	docs: Update local llms article to use invoke instead of deprecated __call__ (#19528 ) - Description: Since the implicit `__call__` has been deprecated in favor of `invoke`, the local_llms article also needed to be updated. This article was my introduction to Lanchain, and as it was helpful in getting me setup with running LLMs locally, it is nice to not have any warnings when running the example code. With this change, the warnings go away when running the example code. - Issue: N/A - Dependencies: N/A - Twitter handle: clarkerican	2024-03-25 15:51:39 -07:00
billytrend-cohere	63343b4987	cohere[patch]: add cohere as a partner package (#19049 ) Description: adds support for langchain_cohere --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-25 20:23:47 +00:00
Igor Muniz Soares	743f888580	community[minor]: Dappier chat model integration (#19370 ) Description: This PR adds [Dappier](https://dappier.com/) for the chat model. It supports generate, async generate, and batch functionalities. We added unit and integration tests as well as a notebook with more details about our chat model. Dependencies: No extra dependencies are needed.	2024-03-25 07:29:05 +00:00
Hugoberry	96dc180883	community[minor]: Add `DuckDB` as a vectorstore (#18916 ) DuckDB has a cosine similarity function along list and array data types, which can be used as a vector store. - Description: The latest version of DuckDB features a cosine similarity function, which can be used with its support for list or array column types. This PR surfaces this functionality to langchain. - Dependencies: duckdb 0.10.0 - Twitter handle: @igocrite --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 07:02:35 +00:00
Ethan Yang	fa6397d76a	docs: Add OpenVINO llms docs (#19489 ) Add OpenVINOpipeline instructions in docs. OpenVINO users can find more details in this page.	2024-03-24 23:57:30 -07:00
Lance Martin	db7403d667	docs: Remove non-rendering images & output spamming from doc ntbks (#19475 ) Looking at tokens / page of our docs, we see a few outliers: <img width="761" alt="image" src="https://github.com/langchain-ai/langchain/assets/122662504/677aa2d6-0a29-45e4-882a-db2bbf46d02b"> It is due to non-rendering images in one case, and output spamming. Clean these, along with other cases of excessing output spamming in docs. All get sucked into chat-langchain for retrieval.	2024-03-24 23:47:38 -07:00
aditya thomas	b43a9d5808	docs: adding voyageai to the list of partner packages (#19376 ) Description: Adding VoyageAI to the list of partners Issue: A standalone langchain-voyageai package has been added Dependencies: None	2024-03-22 17:08:15 -07:00
Zeeland	2549df00cd	docs: fix error bilibili url (#19375 ) Thank you for contributing to LangChain! bilibili-api-python use https://github.com/Nemo2011/bilibili-api repo. Change to the correct address. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-22 17:06:17 -07:00
aditya thomas	375ab7bf59	docs: update module imports for fireworks documentation (#19377 ) Description: Update module imports for Fireworks documentation Issue: Module imports not present or in incorrect location Dependencies: None	2024-03-22 17:05:27 -07:00
aditya thomas	0cc0467267	docs: update import paths and move to lcel for llama.cpp examples (#19391 ) Description: Update import paths and move to lcel for llama.cpp examples Issue: Update import paths to reflect package refactoring and move chains to LCEL in examples Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-23 00:04:12 +00:00
fengjial	3b52ee05d1	community[patch]: fix bugs in baiduvectordb as vectorstore (#19380 ) fix small bugs in vectorstore/baiduvectordb	2024-03-22 17:03:59 -07:00
Cailin Wang	5402aef32e	docs: Add `partition` parameter to DashVector (#19385 ) Description: Add `partition` parameter to DashVector dashvector.ipynb Related PR: https://github.com/langchain-ai/langchain/pull/19023 Twitter handle: @CailinWang_ --------- Co-authored-by: root <root@Bluedot-AI>	2024-03-22 17:00:29 -07:00
aditya thomas	16ef88a87d	docs: moving FireworksEmbeddings documentation to docs folder (#19398 ) Description: Moving FireworksEmbeddings documentation to the location docs/integration/text_embedding/ from langchain_fireworks/docs/ Issue: FireworksEmbeddings documentation was not in the correct location Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-22 23:24:22 +00:00
Ray Bell	7d36ee38b7	docs: point to titantic dataset on web (#19455 ) Updated `pd.read_csv("titantic.csv")` to `pd.read_csv("https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/titanic.csv")` i.e. it will read it https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/titanic.csv and allow anyone to run the code. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 22:22:41 +00:00
Ray Bell	f959fad56e	docs: use invoke instead of run (#19457 ) Updated the deprecated run with invoke Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 15:08:26 -07:00
老阿張	9dfce56b31	docs: Fix typo in infino.ipynb (#18640 ) Description: "conquerer should be conqueror "? 🤔 Issue: Typo Dependencies: Nope Twitter handle: laoazhang	2024-03-20 07:51:58 -07:00
aditya thomas	e46419c851	docs: contribute / integrations code examples update (#19319 ) Description: Update to make the code examples consistent with the actual use Issue: Code examples were different from actual use in the LangChain code Dependencies: Changes on top of https://github.com/langchain-ai/langchain/pull/19294 Note: If these changes are acceptable, please merge them after https://github.com/langchain-ai/langchain/pull/19294.	2024-03-20 09:27:53 -04:00
Brace Sproul	40f846e65d	docs[minor]: Add chat model selection tabs component (#19296 ) <img width="1728" alt="image" src="https://github.com/langchain-ai/langchain/assets/46789226/45e70a92-c2ee-48c8-9964-100eed22687b">	2024-03-19 18:12:46 -07:00
Nithish Raghunandanan	7ad0a3f2a7	community: add Couchbase Vector Store (#18994 ) - Description: Added support for Couchbase Vector Search to LangChain. - Dependencies: couchbase>=4.1.12 - Twitter handle: @nithishr --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com>	2024-03-19 12:39:51 -07:00
Chris Papademetrious	305d74c67a	core: implement a batch_size parameter for CacheBackedEmbeddings (#18070 ) Description: Currently, `CacheBackedEmbeddings` computes vectors for all uncached documents before updating the store. This pull request updates the embedding computation loop to compute embeddings in batches, updating the store after each batch. I noticed this when I tried `CacheBackedEmbeddings` on our 30k document set and the cache directory hadn't appeared on disk after 30 minutes. The motivation is to minimize compute/data loss when problems occur: * If there is a transient embedding failure (e.g. a network outage at the embedding endpoint triggers an exception), at least the completed vectors are written to the store instead of being discarded. * If there is an issue with the store (e.g. no write permissions), the condition is detected early without computing (and discarding!) all the vectors. Issue: Implements enhancement #18026. Testing: I was unable to run unit tests; details in [this post](https://github.com/langchain-ai/langchain/discussions/15019#discussioncomment-8576684). --------- Signed-off-by: chrispy <chrispy@synopsys.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-19 18:55:43 +00:00
Christophe Bornet	30e4a35d7a	community: Use langchain-astradb for AstraDB caches (#18419 ) - [x] Needs https://github.com/langchain-ai/langchain-datastax/pull/4 - [x] Needs a new release of langchain-astradb	2024-03-19 14:04:36 -04:00
Brace Sproul	17c62e0f3a	ci[minor]: Bump LC scripts package, add retry option (#19285 ) The `retryFailed` option will retry all failed links, once at a time with the goal of not triggering bot protection `microsoft.com` is now hard coded into the whitelist	2024-03-19 10:42:59 -07:00
Erick Friis	7eb376d5fc	docs: integration deprecation docs (#19283 )	2024-03-19 17:11:15 +00:00
HatsuneMK00	4761c09e94	docs: update slack toolkit ipynb in integration (#19219 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - PR message: - Description: Update the slack toolkit doc to use an agent that support multiple inputs. Using ReAct agent will cause a ValidationError when invoking the slack tools. This is because the agent return a string like `'{"channel": "C05LDF54S21", "message": "Hello, world!"}'` but the ReAct agent does not support multiple inputs. - Issue: This is related to this [Discussion#18083](https://github.com/langchain-ai/langchain/discussions/18083) - Dependencies: No dependencies required Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-19 10:39:09 -04:00
Vittorio Rigamonti	9b2f9ee952	community: VectorStore Infinispan, adding autoconfiguration (#18967 ) Description: this PR enable VectorStore autoconfiguration for Infinispan: if metadatas are only of basic types, protobuf config will be automatically generated for the user.	2024-03-18 21:33:45 -07:00
Anthony Shaw	bb0dd8f82f	docs: Embellish article on splitting by tokens with more examples and missing details (#18997 ) Description This PR adds some missing details from the "Split by tokens" page in the documentation. Specifically: - The `.from_tiktoken_encoder()` class methods for both the `CharacterTextSplitter` and `RecursiveCharacterTextSplitter` default to the old `gpt-2` encoding. I've added a comment to suggest specifying `model_name` or `encoding` - The docs didn't mention that the `from_tiktoken_encoder()` class method passes additional kwargs down to the constructor of the splitter. I only discovered this by reading the source code - Added an example of using the `.from_tiktoken_encoder()` class method with `RecursiveCharacterTextSplitter` which is the recommended approach for most scenarios above `CharacterTextSplitter` - Added a warning that `TokenTextSplitter` can split characters which have multiple tokens (e.g. 猫 has 3 cl100k_base tokens) between multiple chunks which creates malformed Unicode strings and should not be used in these situations. Side note: I think the default argument of `gpt2` for `.from_tiktoken_encoder()` should be updated? Twitter handle anthonypjshaw --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-18 21:28:17 -07:00
Simon Stone	58c7687174	langchain: preserve document metadata in `FlashrankRerank` (#19148 ) Description: Preserves document metadata in `FlashrankRerank` - Issue: #19142 - Dependencies: None - Twitter handle: n/a --------- Co-authored-by: Simon Stone <simon.stone@dartmouth.edu>	2024-03-19 04:15:18 +00:00
Simon Stone	dc4ce82ddd	docs: fix import path for `FlashrankRerank` example notebook (#19146 ) Description: Fixes the import paths for the `FlashrankRerank` example notebook. Issue: #19139 Dependencies: None Twitter handle: n/a --------- Co-authored-by: Simon Stone <simon.stone@dartmouth.edu> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-18 21:03:00 -07:00
Saurav Kumar	bde199d128	Updating format of pip install (#19198 ) Thank you for contributing to LangChain! - [x] PR title: "Updating format of pip install in two files of docs/cookbook" - pip install is not reflecting properly in some of the files in cookbook - Example: [docs/expression_language/cookbook/sql_db](https://python.langchain.com/docs/expression_language/cookbook/sql_db) - [x] PR message: Updating format of pip install in two files of docs/cookbook - Description: a description of the change - Issue: #19197 - Note - let's do squash merge for the PR If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-19 04:01:24 +00:00
HowardChan	ae3c7f702c	docs:Make url as a markdown link (#19212 ) Description: same as the title Co-authored-by: ChenZhengHao <chenzhenghao@mail.teletraan.io>	2024-03-19 03:47:52 +00:00
Estephania Calvo Carvajal	94e58dd827	docs:Fix links to LangSmith docs on Evaluation page (#19210 ) (#19216 ) - Description: Same as the title - Issue: #19210	2024-03-18 22:27:43 +00:00
Kenzie Mihardja	21f75991d4	deprecate community docugami loader (#19230 ) Thank you for contributing to LangChain! - [x] PR title: "community: deprecate DocugamiLoader" - [x] PR message: Deprecate the langchain_community and use the docugami_langchain DocugamiLoader --------- Co-authored-by: Kenzie Mihardja <kenzie28@cs.washington.edu>	2024-03-18 12:56:47 -07:00
Anubhav Madhav	9235dade90	docs: provided hyperlinks to text and fixed grammar (#19092 ) 1) Provided links to text in the prompt (Refer Page Link 1, Page Link 2 and Page Link 3) 2) Fixed Grammar in Considerations of Model I/O Concepts documentation page - Update concepts.mdx (Page Link 4) Issues are on the following pages: Page Link 1: https://python.langchain.com/docs/modules/model_io/concepts#prompttemplate Page Link 2: https://python.langchain.com/docs/modules/model_io/concepts#messageprompttemplate Page Link 3: https://python.langchain.com/docs/modules/model_io/concepts#chatprompttemplate Page Link 4: https://python.langchain.com/docs/modules/model_io/concepts#considerations Fix 1: Description: Fixed Grammar in Considerations of Model I/O Documentation Page Issue: "to work well with the model are you using" # "to work well with the model you are using" Dependencies: None Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) Fix 2: Description: Provided links to text in the prompt (Refer Page Link 1, Page Link 2 and Page Link 3) Issue: links not provided # links have been provided to the text Dependencies: None Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) baskaryan, efriis, eyurtsev, hwchase17. For Fix 1 Refer to the first word 'This" word in the image attached with this PR. PFA <img width="839" alt="Screenshot 2024-03-15 at 3 04 17 AM" src="https://github.com/langchain-ai/langchain/assets/42323737/94e8db16-249f-48c3-a1d1-dee8d36067fa"> If no one reviews your PR within a few days, please @-mention one of --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-17 01:37:42 +00:00
inpyeong	7c092f479f	docs: Update why.ipynb (#19173 ) I think that cell type for pip command may be 'code'. Please check, thank you :) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-16 22:21:51 +00:00
Vitalii Korsakov	d96e0b2de7	docs: Remove duplicated line in Get Started section (#19182 ) Line `from langchain_openai import ChatOpenAI` is put twice in Get Started / Serving with LangServe section. Imports on lines 559 and 566 are identical Co-authored-by: Vitalii <vitalii@localhost>	2024-03-16 22:21:25 +00:00
Rodrigo Nogueira	e64cf1aba4	community: Add model argument for maritalk models and better error handling (#19187 )	2024-03-16 15:18:56 -07:00
samanhappy	ff94f86ce1	docs: fix link to interface TextSplitter (#19177 )	2024-03-16 15:16:34 -07:00
aditya thomas	05008c4f94	docs: update stale links in Together AI documentation (#19011 ) Description: Update stales link in Together AI documentation Issue: Some links pointed to legacy webpages on the Together AI website Dependencies: None Lint and test: `make format`, `make lint` were run	2024-03-15 16:38:04 -07:00
wulixuan	f79d0cb9fb	docs: update docs for yuan2 in LLMs and Chat models integration. (#19028 ) update yuan2.0 notebook in LLMs and Chat models. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-15 16:03:18 -07:00
Taraka Nithin Vankala	eec023766e	docs: Corrected error (#19030 ) - [ ] PR title: "docs: correction in "https://github.com/langchain-ai/langchain/blob/master/docs/docs/get_started/quickstart.mdx", line 289". - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: - Corrected the spelling mistake - #18981	2024-03-15 16:02:33 -07:00
Christophe Bornet	f2a7dda4bd	community[patch]: Use langchain-astradb for AstraDB doc loader (#19071 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:57:25 +00:00
Leonid Ganeline	a49ac55964	docs: `providers` update 8 (#19053 ) Added missed providers. Added missed integrations. Fixed format.	2024-03-15 15:49:14 -07:00
Holt Skinner	cee03630d9	community[patch]: Add Blended Search Support to `GoogleVertexAISearchRetriever` (#19082 ) https://cloud.google.com/generative-ai-app-builder/docs/create-data-store-es#multi-data-stores --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:39:31 +00:00
William W Wang	0a784074d1	docs: Update llm_caching.ipynb (#19085 )	2024-03-15 22:35:48 +00:00
William W Wang	6327be9048	docsUpdate azure_cosmos_db.ipynb (#19087 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:33:26 +00:00
Anubhav Madhav	553a520ab6	docs: Fixed Grammar in Considerations of Model I/O Concepts (#19091 ) Fixed Grammar in Considerations of Model I/O Concepts documentation page - Update concepts.mdx Page Link: https://python.langchain.com/docs/modules/model_io/concepts#considerations - Description: Fixed Grammar in Considerations of Model I/O Documentation Page - Issue: "to work well with the model are you using" # "to work well with the model you are using" - Dependencies: None - Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:31:39 +00:00
Shotaro Sano	d647ff1a9a	docs: Fix execution results of `docs/docs/modules/data_connection/indexing.ipynb` (#19112 ) ## Description This PR addresses a documentation issue in the [Indexing](https://python.langchain.com/docs/modules/data_connection/indexing) page. Specifically, it corrects the execution results of the Jupyter notebook under the [Source](https://python.langchain.com/docs/modules/data_connection/indexing#source) section, which were broken as detailed below. ## Problem The execution results following the statement, `This should delete the old versions of documents associated with doggy.txt source and replace them with the new versions.`, appear to be incorrect, as described below. ### Current Behavior - For some reason, the `index` function fails to add the new content of `doggy.txt`. Although it deletes the document objects associated with the `doggy.txt` source, it does not add the objects in `changed_doggy_docs`. Consequently, the execution result displays `num_added: 0`. - This unexpected behavior also impacts the results of `vectorstore.similarity_search("dog", k=30)`, showing only the contents of `kitty.txt`. It appears as though the contents of `doggy.txt` have been completely removed from the index: ``` Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ### Expected Behavior - The `index` function should successfully add the objects in `changed_doggy_docs` after removing the old content of `doggy.txt`. The anticipated execution result is `num_added: 2`. - Subsequently, the modified content of `doggy.txt` should appear in the results of `vectorstore.similarity_search("dog", k=30)` as follows: ``` [Document(page_content='woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='woof woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ## Fix I reran `docs/docs/modules/data_connection/indexing.ipynb` and have included the diff in this PR.	2024-03-15 22:27:15 +00:00
Guangdong Liu	cced3eb9bc	community[patch]: Fix sparkllm embeddings api bug. (#19122 ) - Description: Fix sparkllm embeddings api bug. @baskaryan PTAL	2024-03-15 15:08:49 -07:00
samanhappy	b9c62fb905	docs: fix API link for BaseLoader (#19128 ) The link to the BaseLoader API requires an update as it has been moved into the `langchain_core` package.	2024-03-15 14:46:05 -07:00
Kostas Botsas	527676a753	docs: Fix source column xata.ipynb (#19137 ) Docs fix: replace column name search with source. The Xata integration expects metadata column named "source". The docs suggest the name "search", which if used, yields the following error: ``` File "/usr/local/lib/python3.11/site-packages/langchain_community/vectorstores/xata.py", line 95, in _add_vectors raise Exception(f"Error adding vectors to Xata: {r.status_code} {r}") Exception: Error adding vectors to Xata: 400 {'errors': [{'status': 400, 'message': 'invalid record: column [source]: column not found'}]} ```	2024-03-15 14:06:18 -07:00
fengjial	c922ea36cb	community[minor]: Add Baidu VectorDB as vector store (#17997 ) Co-authored-by: fengjialin <fengjialin@MacBook-Pro.local>	2024-03-15 19:01:58 +00:00
aditya thomas	190887c5cd	docs: update the list of providers (#19012 ) Description: Update the list of LangChain providers Issue: Make the list of LangChain providers current Dependencies: None	2024-03-15 12:00:24 -07:00
Erick Friis	bbe164ad28	docs: voyageai as provider (#19154 )	2024-03-15 10:12:37 -07:00
Erick Friis	781aee0068	community, langchain, infra: revert store extended test deps outside of poetry (#19153 ) Reverts langchain-ai/langchain#18995 Because it makes installing dependencies in python 3.11 extended testing take 80 minutes	2024-03-15 17:10:47 +00:00
Leonid Kuligin	e3ff107e4f	docs: updated google integration related imports in the documentation (#19131 ) updated imports in the documentation for google vertex	2024-03-15 09:30:50 -04:00
Erick Friis	9e569d85a4	community, langchain, infra: store extended test deps outside of poetry (#18995 ) poetry can't reliably handle resolving the number of optional "extended test" dependencies we have. If we instead just rely on pip to install extended test deps in CI, this isn't an issue.	2024-03-15 05:55:30 +00:00
Erick Friis	7ce81eb6f4	voyageai[patch]: init package (#19098 ) Co-authored-by: fodizoltan <zoltan@conway.expert> Co-authored-by: Yujie Qian <thomasq0809@gmail.com> Co-authored-by: fzowl <160063452+fzowl@users.noreply.github.com>	2024-03-15 00:56:10 +00:00
Brace Sproul	98cd8f673b	docs[minor]ci[minor]: Add script & CI to check recurring links daily (#19100 )	2024-03-14 17:42:22 -07:00
billytrend-cohere	7253b816cc	community: Add support for cohere SDK v5 (keeps v4 backwards compatibility) (#19084 ) - Description: Add support for cohere SDK v5 (keeps v4 backwards compatibility) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-14 15:53:24 -07:00
Bagatur	e276817e1d	docs: fix vercel build script (#19090 ) amazon linux 2023 doesn't have `amazon-linux-extras` but shoudl have python3.9 by default	2024-03-14 20:53:43 +00:00
Anthony Yang	688a5bd106	docs:fixed typo in streaming document (#19045 ) Fixed typo in line 661 - from 'mimimize' to 'minimize - [ ] PR message: - Description: Fixed typo in streaming document - change 'mimimize' to 'minimize If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-14 19:38:53 +00:00
Bagatur	0ae39ab30e	docs: make links internal (#19063 ) So they can be properly link checked	2024-03-14 16:22:56 +00:00
Erick Friis	2999d06938	docs: deprecate old airbyte loader docs (#19048 )	2024-03-13 23:18:30 +00:00
Prakul	4c53e31377	docs: Updated index definition and reference to LangChain-MongoDB (#19047 ) Description: Updates to LangChain-MongoDB documentation: updates to the Atlas vector search index definition Issue: NA Dependencies: NA Twitter handle: iprakul	2024-03-13 15:44:13 -07:00
Tomaz Bratanic	e5e15c8d59	docs: Add graph construction docs (#18904 )	2024-03-13 12:27:58 -07:00
Max Jakob	911ccf9aa6	docs: elasticsearch retriever (#18965 ) Add documentation notebook for `ElasticsearchRetriever`. ## Dependencies - [ ] Release new `langchain-elasticsearch` version 0.2.0 that includes `ElasticsearchRetriever`	2024-03-12 09:42:36 -07:00
Tymofii	0bec1f6877	commnity[patch]: refactor code for faiss vectorstore, update faiss vectorstore documentation (#18092 ) Description: Refactor code of FAISS vectorcstore and update the related documentation. Details: - replace `.format()` with f-strings for strings formatting; - refactor definition of a filtering function to make code more readable and more flexible; - slightly improve efficiency of `max_marginal_relevance_search_with_score_by_vector` method by removing unnecessary looping over the same elements; - slightly improve efficiency of `delete` method by using set data structure for checking if the element was already deleted; Issue: fix small inconsistency in the documentation (the old example was incorrect and unappliable to faiss vectorstore) Dependencies: basic langchain-community dependencies and `faiss` (for CPU or for GPU) Twitter handle: antonenkodev	2024-03-11 22:33:03 -07:00
Bagatur	e0e688a277	core[minor]: generation info on msg (#18592 ) related to #16403 #17188	2024-03-12 04:43:17 +00:00
Leonid Ganeline	fad308a764	docs: `providers` update 2 (#18407 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-03-11 18:35:37 -07:00
Brace Sproul	578e67c017	docs[patch]: properly load/use env vars (#18942 )	2024-03-11 15:38:05 -07:00
Brace Sproul	4ff6aa5c78	docs[minor]: Swap gtag for supabase (#18937 ) Added deps: - `@supabase/supabase-js` - for sending inserts - `supabase` - dev dep, for generating types via cli - `dotenv` for loading env vars Added script: - `yarn gen` - will auto generate the database schema types using the supabase CLI. Not necessary for development, but is useful. Requires authing with the supabase CLI (will error out w/ instructions if you're not authed). Added functionality: - pulls users IP address (using a free endpoint: `https://api.ipify.org` so we can filter out abuse down the line) TODO: - [x] add env vars to vercel	2024-03-11 14:23:12 -07:00
fjk	a7fc731720	docs: change sparkllm spark_app_url to spark_api_url (#18000 ) community: fix - change sparkllm spark_app_url to spark_api_url - Description: - Change the variable name from `sparkllm spark_app_url` to `spark_api_url` in the community package. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-11 20:01:30 +00:00
Sevin F. Varoglu	8639624d40	docs: update OctoAI doc (#18913 ) This PR updates the OctoAI LLM doc.	2024-03-11 13:01:10 -07:00
Alexander Kozlov	a7500ab0fb	docs: Update huggingface pipelines notebook (#18801 )	2024-03-11 20:00:31 +00:00
Conroy Whitney	96d7fe0f85	docs: Change saved/configured chain variable name (#18863 ) Description: Variable name was `openai_poem` but it didn't pass in the `"prompt": "poem"` config, so the examples were showing a joke being returned from a variable called `_poem`. We could have gone one of two ways: 1. Updating the config line and the output line, or 2. Updating the variable name The latter seemed simpler, so that's what I went with. But I'd be glad to re-do this PR if you prefer the former. Thanks for everything, y'all. You rock 🤘 Issue:* N/A Dependencies: N/A Twitter handle: `conroywhitney`	2024-03-11 12:59:24 -07:00
Virat Singh	cafffe8a21	community: Add PolygonAggregates tool (#18882 ) Description: In this PR, I am adding a `PolygonAggregates` tool, which can be used to get historical stock price data (called aggregates by Polygon) for a given ticker. Polygon [docs](https://polygon.io/docs/stocks/get_v2_aggs_ticker__stocksticker__range__multiplier___timespan___from___to) for this endpoint. Twitter: [@virattt](https://twitter.com/virattt)	2024-03-11 11:58:10 -07:00
Bagatur	34284c25d4	docs: turn on link check (#18924 )	2024-03-11 10:50:39 -07:00
Mohammad Mohtashim	43db4cd20e	core[major]: On Tool End Observation Casting Fix (#18798 ) This PR updates the on_tool_end handlers to return the raw output from the tool instead of casting it to a string. This is technically a breaking change, though it's impact is expected to be somewhat minimal. It will fix behavior in `astream_events` as well. Fixes the following issue #18760 raised by @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-11 10:59:04 -04:00

... 2 3 4 5 6 ...

3547 Commits