langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-11 19:11:02 +00:00

Author	SHA1	Message	Date
baichuan-assistant	f8f2649f12	community: Add Baichuan LLM to community (#16724 ) Replace this entire comment with: - Description: Add Baichuan LLM to integration/llm, also updated related docs. Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-29 20:08:24 -08:00
thiswillbeyourgithub	1d082359ee	community: add support for callable filters in FAISS (#16190 ) - Description: Filtering in a FAISS vectorstores is very inflexible and doesn't allow that many use case. I think supporting callable like this enables a lot: regular expressions, condition on multiple keys etc. Note I had to manually alter a test. I don't understand if it was falty to begin with or if there is something funky going on. - Issue: None - Dependencies: None - Twitter handle: None Signed-off-by: thiswillbeyourgithub <26625900+thiswillbeyourgithub@users.noreply.github.com>	2024-01-29 20:05:56 -08:00
Killinsun - Ryota Takeuchi	52f4ad8216	community: Add new fields in metadata for qdrant vector store (#16608 ) ## Description The PR is to return the ID and collection name from qdrant client to metadata field in `Document` class. ## Issue The motivation is almost same to [11592](https://github.com/langchain-ai/langchain/issues/11592) Returning ID is useful to update existing records in a vector store, but we cannot know them if we use some retrievers. In order to avoid any conflicts, breaking changes, the new fields in metadata have a prefix `_` ## Dependencies N/A ## Twitter handle @kill_in_sun <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-29 19:59:54 -08:00
hulitaitai	32cad38ec6	<langchain_community\llms\chatglm.py>: <Correcting "history"> (#16729 ) Use the real "history" provided by the original program instead of putting "None" in the history. - Description: I change one line in the code to make it return the "history" of the chat model. - Issue: At the moment it returns only the answers of the chat model. However the chat model himself provides a history more complet with the questions of the user. - Dependencies: no dependencies required for this change,	2024-01-29 19:50:31 -08:00
Bassem Yacoube	85e93e05ed	community[minor]: Update OctoAI LLM, Embedding and documentation (#16710 ) This PR includes updates for OctoAI integrations: - The LLM class was updated to fix a bug that occurs with multiple sequential calls - The Embedding class was updated to support the new GTE-Large endpoint released on OctoAI lately - The documentation jupyter notebook was updated to reflect using the new LLM sdk Thank you!	2024-01-29 13:57:17 -08:00
Volodymyr Machula	32c5be8b73	community[minor]: Connery Tool and Toolkit (#14506 ) ## Summary This PR implements the "Connery Action Tool" and "Connery Toolkit". Using them, you can integrate Connery actions into your LangChain agents and chains. Connery is an open-source plugin infrastructure for AI. With Connery, you can easily create a custom plugin with a set of actions and seamlessly integrate them into your LangChain agents and chains. Connery will handle the rest: runtime, authorization, secret management, access management, audit logs, and other vital features. Additionally, Connery and our community offer a wide range of ready-to-use open-source plugins for your convenience. Learn more about Connery: - GitHub: https://github.com/connery-io/connery-platform - Documentation: https://docs.connery.io - Twitter: https://twitter.com/connery_io ## TODOs - [x] API wrapper - [x] Integration tests - [x] Connery Action Tool - [x] Docs - [x] Example - [x] Integration tests - [x] Connery Toolkit - [x] Docs - [x] Example - [x] Formatting (`make format`) - [x] Linting (`make lint`) - [x] Testing (`make test`)	2024-01-29 12:45:03 -08:00
Harrison Chase	8457c31c04	community[patch]: activeloop ai tql deprecation (#14634 ) Co-authored-by: AdkSarsen <adilkhan@activeloop.ai>	2024-01-29 12:43:54 -08:00
Neli Hateva	c95facc293	langchain[minor], community[minor]: Implement Ontotext GraphDB QA Chain (#16019 ) - Description: Implement Ontotext GraphDB QA Chain - Issue: N/A - Dependencies: N/A - Twitter handle: @OntotextGraphDB	2024-01-29 12:25:53 -08:00
Elliot	39eb00d304	community[patch]: Adapt more parameters related to MemorySearchPayload for the search method of ZepChatMessageHistory (#15441 ) - Description: To adapt more parameters related to MemorySearchPayload for the search method of ZepChatMessageHistory, - Issue: None, - Dependencies: None, - Twitter handle: None	2024-01-29 11:45:55 -08:00
Jael Gu	a1aa3a657c	community[patch]: Milvus supports add & delete texts by ids (#16256 ) # Description To support [langchain indexing](https://python.langchain.com/docs/modules/data_connection/indexing) as requested by users, vectorstore Milvus needs to support: - document addition by id (`add_documents` method with `ids` argument) - delete by id (`delete` method with `ids` argument) Example usage: ```python from langchain.indexes import SQLRecordManager, index from langchain.schema import Document from langchain_community.vectorstores import Milvus from langchain_openai import OpenAIEmbeddings collection_name = "test_index" embedding = OpenAIEmbeddings() vectorstore = Milvus(embedding_function=embedding, collection_name=collection_name) namespace = f"milvus/{collection_name}" record_manager = SQLRecordManager( namespace, db_url="sqlite:///record_manager_cache.sql" ) record_manager.create_schema() doc1 = Document(page_content="kitty", metadata={"source": "kitty.txt"}) doc2 = Document(page_content="doggy", metadata={"source": "doggy.txt"}) index( [doc1, doc1, doc2], record_manager, vectorstore, cleanup="incremental", # None, "incremental", or "full" source_id_key="source", ) ``` # Fix issues Fix https://github.com/milvus-io/milvus/issues/30112 --------- Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-29 11:19:50 -08:00
Michard Hugo	e9d3527b79	community[patch]: Add missing async similarity_distance_threshold handling in RedisVectorStoreRetriever (#16359 ) Add missing async similarity_distance_threshold handling in RedisVectorStoreRetriever - Description: added method `_aget_relevant_documents` to `RedisVectorStoreRetriever` that overrides parent method to add support of `similarity_distance_threshold` in async mode (as for sync mode) - Issue: #16099 - Dependencies: N/A - Twitter handle: N/A	2024-01-29 11:19:30 -08:00
Anthony Bernabeu	2db79ab111	community[patch]: Implement TTL for DynamoDBChatMessageHistory (#15478 ) - Description: Implement TTL for DynamoDBChatMessageHistory, - Issue: see #15477, - Dependencies: N/A, --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-01-29 10:22:46 -08:00
taimo	d3d9244fee	langchain-community: fix unicode escaping issue with SlackToolkit (#16616 ) - Description: fix unicode escaping issue with SlackToolkit - Issue: #16610	2024-01-29 08:38:12 -08:00
Benito Geordie	f3fdc5c5da	community: Added integrations for ThirdAI's NeuralDB with Retriever and VectorStore frameworks (#15280 ) Description: Adds ThirdAI NeuralDB retriever and vectorstore integration. NeuralDB is a CPU-friendly and fine-tunable text retrieval engine.	2024-01-29 08:35:42 -08:00
Pashva Mehta	22d90800c8	community: Fixed schema discrepancy in from_texts function for weaviate vectorstore (#16693 ) * Description: Fixed schema discrepancy in from_texts function for weaviate vectorstore which created a redundant property "key" inside a class. * Issue: Fixed: https://github.com/langchain-ai/langchain/issues/16692 * Twitter handle: @pashvamehta1	2024-01-28 16:53:31 -08:00
Daniel Erenrich	0600998f38	community: Wikidata tool support (#16691 ) - Description: Adds Wikidata support to langchain. Can read out documents from Wikidata. - Issue: N/A - Dependencies: Adds implicit dependencies for `wikibase-rest-api-client` (for turning items into docs) and `mediawikiapi` (for hitting the search endpoint) - Twitter handle: @derenrich You can see an example of this tool used in a chain [here](https://nbviewer.org/urls/d.erenrich.net/upload/Wikidata_Langchain.ipynb) or [here](https://nbviewer.org/urls/d.erenrich.net/upload/Wikidata_Lars_Kai_Hansen.ipynb) <!-- Thank you for contributing to LangChain! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-28 16:45:21 -08:00
Christophe Bornet	2e3af04080	Use Postponed Evaluation of Annotations in Astra and Cassandra doc loaders (#16694 ) Minor/cosmetic change	2024-01-28 16:39:27 -08:00
Christophe Bornet	36e432672a	community[minor]: Add async methods to AstraDBLoader (#16652 )	2024-01-27 17:05:41 -08:00
Rashedul Hasan Rijul	481493dbce	community[patch]: apply embedding functions during query if defined (#16646 ) Description: This update ensures that the user-defined embedding function specified during vector store creation is applied during queries. Previously, even if a custom embedding function was defined at the time of store creation, Bagel DB would default to using the standard embedding function during query execution. This pull request addresses this issue by consistently using the user-defined embedding function for queries if one has been specified earlier.	2024-01-27 16:46:33 -08:00
Serena Ruan	f01fb47597	community[patch]: MLflowCallbackHandler -- Move textstat and spacy as optional dependency (#16657 ) Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-01-27 16:15:07 -08:00
Zhuoyun(John) Xu	508bde7f40	community[patch]: Ollama - Pass headers to post request in async method (#16660 ) # Description A previous PR (https://github.com/langchain-ai/langchain/pull/15881) added option to pass headers to ollama endpoint, but headers are not pass to the async method.	2024-01-27 16:11:32 -08:00
João Carlos Ferra de Almeida	3e87b67a3c	community[patch]: Add Cookie Support to Fetch Method (#16673 ) - Description: This change allows the `_fetch` method in the `WebBaseLoader` class to utilize cookies from an existing `requests.Session`. It ensures that when the `fetch` method is used, any cookies in the provided session are included in the request. This enhancement maintains compatibility with existing functionality while extending the utility of the `fetch` method for scenarios where cookie persistence is necessary. - Issue: Not applicable (new feature), - Dependencies: Requires `aiohttp` and `requests` libraries (no new dependencies introduced), - Twitter handle: N/A Co-authored-by: Joao Almeida <joao.almeida@mercedes-benz.io>	2024-01-27 16:03:53 -08:00
Harrison Chase	27665e3546	[community] fix anthropic streaming (#16682 )	2024-01-27 15:16:22 -08:00
Christophe Bornet	4915c3cd86	[Fix] Fix Cassandra Document loader default page content mapper (#16273 ) We can't use `json.dumps` by default as many types returned by the cassandra driver are not serializable. It's safer to use `str` and let users define their own custom `page_content_mapper` if needed.	2024-01-27 11:23:02 -08:00
Pasha	4e189cd89a	community[patch]: youtube loader transcript format (#16625 ) - Description: YoutubeLoader right now returns one document that contains the entire transcript. I think it would be useful to add an option to return multiple documents, where each document would contain one line of transcript with the start time and duration in the metadata. For example, [AssemblyAIAudioTranscriptLoader](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/document_loaders/assemblyai.py) is implemented in a similar way, it allows you to choose between the format to use for the document loader.	2024-01-26 15:26:09 -08:00
yin1991	a936472512	docs: Update documentation to use 'model_id' rather than 'model_name' to match actual API (#16615 ) - Description: Replace 'model_name' with 'model_id' for accuracy - Issue: [link-to-issue](https://github.com/langchain-ai/langchain/issues/16577) - Dependencies: - Twitter handle:	2024-01-26 15:01:12 -08:00
Micah Parker	6543e585a5	community[patch]: Added support for Ollama's num_predict option in ChatOllama (#16633 ) Just a simple default addition to the options payload for a ollama generate call to support a max_new_tokens parameter. Should fix issue: https://github.com/langchain-ai/langchain/issues/14715	2024-01-26 15:00:19 -08:00
baichuan-assistant	70ff54eace	community[minor]: Add Baichuan Text Embedding Model and Baichuan Inc introduction (#16568 ) - Description: Adding Baichuan Text Embedding Model and Baichuan Inc introduction. Baichuan Text Embedding ranks #1 in C-MTEB leaderboard: https://huggingface.co/spaces/mteb/leaderboard Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-26 12:57:26 -08:00
Ghani	e30c6662df	Langchain-community : EdenAI chat integration. (#16377 ) - Description: This PR adds [EdenAI](https://edenai.co/) for the chat model (already available in LLM & Embeddings). It supports all [ChatModel] functionality: generate, async generate, stream, astream and batch. A detailed notebook was added. - Dependencies: No dependencies are added as we call a rest API. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-26 09:56:43 -05:00
Jatin Chawda	a79345f199	community[patch]: Fixed tool names snake_case (#16397 ) #16396 Fixed 1. golden_query 2. google_lens 3. memorize 4. merriam_webster 5. open_weather_map 6. pub_med 7. stack_exchange 8. generate_image 9. wikipedia	2024-01-25 15:24:19 -08:00
Bagatur	61e876aad8	openai[patch]: Explicitly support embedding dimensions (#16596 )	2024-01-25 15:16:04 -08:00
Bagatur	5df8ab574e	infra: move indexing documentation test (#16595 )	2024-01-25 14:46:50 -08:00
Bagatur	61b200947f	community[patch]: Release 0.0.16 (#16591 )	2024-01-25 14:19:09 -08:00
Bagatur	ef42d9d559	core[patch], community[patch], openai[patch]: consolidate openai tool… (#16485 ) … converters One way to convert anything to an OAI function: convert_to_openai_function One way to convert anything to an OAI tool: convert_to_openai_tool Corresponding bind functions on OAI models: bind_functions, bind_tools	2024-01-25 13:18:46 -08:00
Brian Burgin	148347e858	community[minor]: Add LiteLLM Router Integration (#15588 ) community: - Description: - Add new ChatLiteLLMRouter class that allows a client to use a LiteLLM Router as a LangChain chat model. - Note: The existing ChatLiteLLM integration did not cover the LiteLLM Router class. - Add tests and Jupyter notebook. - Issue: None - Dependencies: Relies on existing ChatLiteLLM integration - Twitter handle: @bburgin_0 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-25 11:03:05 -08:00
Dmitry Tyumentsev	e86e66bad7	community[patch]: YandexGPT models - add sleep_interval (#16566 ) Added sleep between requests to prevent errors associated with simultaneous requests.	2024-01-25 09:07:19 -08:00
Erick Friis	adc008407e	exa: init pkg (#16553 )	2024-01-24 20:57:17 -07:00
Rave Harpaz	c4e9c9ca29	community[minor]: Add OCI Generative AI integration (#16548 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: Adding Oracle Cloud Infrastructure Generative AI integration. Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service that provides a set of state-of-the-art, customizable large language models (LLMs) that cover a wide range of use cases, and which is available through a single API. Using the OCI Generative AI service you can access ready-to-use pretrained models, or create and host your own fine-tuned custom models based on your own data on dedicated AI clusters. https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm - Issue: None, - Dependencies: OCI Python SDK, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. Passed See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. we provide unit tests. However, we cannot provide integration tests due to Oracle policies that prohibit public sharing of api keys. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 18:23:50 -08:00
Harel Gal	a91181fe6d	community[minor]: add support for Guardrails for Amazon Bedrock (#15099 ) Added support for optionally supplying 'Guardrails for Amazon Bedrock' on both types of model invocations (batch/regular and streaming) and for all models supported by the Amazon Bedrock service. @baskaryan @hwchase17 ```python llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, guardrails={"id": " <guardrail_id>", "version": "<guardrail_version>", "trace": True}, callbacks=[BedrockAsyncCallbackHandler()]) class BedrockAsyncCallbackHandler(AsyncCallbackHandler): """Async callback handler that can be used to handle callbacks from langchain.""" async def on_llm_error( self, error: BaseException, **kwargs: Any, ) -> Any: reason = kwargs.get("reason") if reason == "GUARDRAIL_INTERVENED": # kwargs contains additional trace information sent by 'Guardrails for Bedrock' service. print(f"""Guardrails: {kwargs}""") # streaming llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, streaming=True, guardrails={"id": "<guardrail_id>", "version": "<guardrail_version>"}) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:44:19 -08:00
Martin Kolb	04651f0248	community[minor]: VectorStore integration for SAP HANA Cloud Vector Engine (#16514 ) - Description: This PR adds a VectorStore integration for SAP HANA Cloud Vector Engine, which is an upcoming feature in the SAP HANA Cloud database (https://blogs.sap.com/2023/11/02/sap-hana-clouds-vector-engine-announcement/). - Issue: N/A - Dependencies: [SAP HANA Python Client](https://pypi.org/project/hdbcli/) - Twitter handle: @sapopensource Implementation of the integration: `libs/community/langchain_community/vectorstores/hanavector.py` Unit tests: `libs/community/tests/unit_tests/vectorstores/test_hanavector.py` Integration tests: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Example notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb` Access credentials for execution of the integration tests can be provided to the maintainers. --------- Co-authored-by: sascha <sascha.stoll@sap.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:05:07 -08:00
Unai Garay Maestre	fdbfa6b2c8	Adds progress bar to VertexAIEmbeddings (#14542 ) - Description: Adds progress bar to VertexAIEmbeddings - Issue: related issue https://github.com/langchain-ai/langchain/issues/13637 Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com>	2024-01-24 11:16:16 -07:00
Jeremi Joslin	9e95699277	community[patch]: Fix error message when litellm is not installed (#16316 ) The error message was mentioning the wrong package. I updated it to the correct one.	2024-01-23 21:42:29 -08:00
bachr	b3ed98dec0	community[patch]: avoid KeyError when language not in LANGUAGE_SEGMENTERS (#15212 ) Description: Handle unsupported languages in same way as when none is provided Issue: The following line will throw a KeyError if the language is not supported. ```python self.Segmenter = LANGUAGE_SEGMENTERS[language] ``` E.g. when using `Language.CPP` we would get `KeyError: <Language.CPP: 'cpp'>` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 21:09:43 -08:00
chyroc	61da2ff24c	community[patch]: use SecretStr for yandex model secrets (#15463 )	2024-01-23 20:08:53 -08:00
Alessio Serra	d628a80a5d	community[patch]: added 'conversational' as a valid task for hugginface endopoint models (#15761 ) - Description: added the conversational task to hugginFace endpoint in order to use models designed for chatbot programming. - Dependencies: None --------- Co-authored-by: Alessio Serra (ext.) <alessio.serra@partner.bmw.de> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 20:04:15 -08:00
Karim Lalani	4c7755778d	community[patch]: SurrealDB fix for asyncio (#16092 ) Code fix for asyncio	2024-01-23 19:46:19 -08:00
Raunak	476bf8b763	community[patch]: Load list of files using UnstructuredFileLoader (#16216 ) - Description: Updated `_get_elements()` function of `UnstructuredFileLoader `class to check if the argument self.file_path is a file or list of files. If it is a list of files then it iterates over the list of file paths, calls the partition function for each one, and appends the results to the elements list. If self.file_path is not a list, it calls the partition function as before. - Issue: Fixed #15607, - Dependencies: NA - Twitter handle: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-23 19:37:37 -08:00
Xudong Sun	019b6ebe8d	community[minor]: Add iFlyTek Spark LLM chat model support (#13389 ) - Description: This PR enables LangChain to access the iFlyTek's Spark LLM via the chat_models wrapper. - Dependencies: websocket-client ^1.6.1 - Tag maintainer: @baskaryan ### SparkLLM chat model usage Get SparkLLM's app_id, api_key and api_secret from [iFlyTek SparkLLM API Console](https://console.xfyun.cn/services/bm3) (for more info, see [iFlyTek SparkLLM Intro](https://xinghuo.xfyun.cn/sparkapi) ), then set environment variables `IFLYTEK_SPARK_APP_ID`, `IFLYTEK_SPARK_API_KEY` and `IFLYTEK_SPARK_API_SECRET` or pass parameters when using it like the demo below: ```python3 from langchain.chat_models.sparkllm import ChatSparkLLM client = ChatSparkLLM( spark_app_id="<app_id>", spark_api_key="<api_key>", spark_api_secret="<api_secret>" ) ```	2024-01-23 19:23:46 -08:00
Serena Ruan	5c6e123757	community[patch]: Fix MlflowCallback with none artifacts_dir (#16487 )	2024-01-23 19:09:02 -08:00
bu2kx	ff3163297b	community[minor]: Add KDBAI vector store (#12797 ) Addition of KDBAI vector store (https://kdb.ai). Dependencies: `kdbai_client` v0.1.2 Python package. Sample notebook: `docs/docs/integrations/vectorstores/kdbai.ipynb` Tag maintainer: @bu2kx Twitter handle: @kxsystems	2024-01-23 18:37:01 -08:00

1 2 3 4 5 ...

317 Commits