langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
Micah Parker	6543e585a5	community[patch]: Added support for Ollama's num_predict option in ChatOllama (#16633 ) Just a simple default addition to the options payload for a ollama generate call to support a max_new_tokens parameter. Should fix issue: https://github.com/langchain-ai/langchain/issues/14715	2024-01-26 15:00:19 -08:00
baichuan-assistant	70ff54eace	community[minor]: Add Baichuan Text Embedding Model and Baichuan Inc introduction (#16568 ) - Description: Adding Baichuan Text Embedding Model and Baichuan Inc introduction. Baichuan Text Embedding ranks #1 in C-MTEB leaderboard: https://huggingface.co/spaces/mteb/leaderboard Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-26 12:57:26 -08:00
Bagatur	5b5115c408	google-vertexai[patch]: streaming bug (#16603 ) Fixes errors seen here https://github.com/langchain-ai/langchain/actions/runs/7661680517/job/20881556592#step:9:229	2024-01-26 09:45:34 -08:00
ccurme	a989f82027	core: expand docstring for RunnableParallel (#16600 ) - Description: expand docstring for RunnableParallel - Issue: https://github.com/langchain-ai/langchain/issues/16462 Feel free to modify this or let me know how it can be improved!	2024-01-26 10:03:32 -05:00
Ghani	e30c6662df	Langchain-community : EdenAI chat integration. (#16377 ) - Description: This PR adds [EdenAI](https://edenai.co/) for the chat model (already available in LLM & Embeddings). It supports all [ChatModel] functionality: generate, async generate, stream, astream and batch. A detailed notebook was added. - Dependencies: No dependencies are added as we call a rest API. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-26 09:56:43 -05:00
Antonio Lanza	08d3fd7f2e	langchain[patch]: inconsistent results with `RecursiveCharacterTextSplitter`'s `add_start_index=True` (#16583 ) This PR fixes issue #16579	2024-01-25 15:50:06 -08:00
Eugene Yurtsev	42db96477f	docs: Update in code documentation for runnable with message history (#16585 ) Update the in code documentation for Runnable With Message History	2024-01-25 15:26:34 -08:00
Jatin Chawda	a79345f199	community[patch]: Fixed tool names snake_case (#16397 ) #16396 Fixed 1. golden_query 2. google_lens 3. memorize 4. merriam_webster 5. open_weather_map 6. pub_med 7. stack_exchange 8. generate_image 9. wikipedia	2024-01-25 15:24:19 -08:00
Bagatur	bcc71d1a57	openai[patch]: Release 0.0.5 (#16598 )	2024-01-25 15:20:28 -08:00
Bagatur	68f7468754	google-vertexai[patch]: Release 0.0.3 (#16597 )	2024-01-25 15:19:00 -08:00
Bagatur	61e876aad8	openai[patch]: Explicitly support embedding dimensions (#16596 )	2024-01-25 15:16:04 -08:00
Bagatur	5df8ab574e	infra: move indexing documentation test (#16595 )	2024-01-25 14:46:50 -08:00
Bagatur	f3d61a6e47	langchain[patch]: Release 0.1.4 (#16592 )	2024-01-25 14:19:18 -08:00
Bagatur	61b200947f	community[patch]: Release 0.0.16 (#16591 )	2024-01-25 14:19:09 -08:00
Bagatur	75ad0bba2d	openai[patch]: Release 0.0.4 (#16590 )	2024-01-25 14:08:46 -08:00
Bagatur	1e3ce338ca	core[patch]: Release 0.1.16 (#16589 )	2024-01-25 13:56:00 -08:00
Bagatur	6c89507988	docs: add rag citations page (#16549 )	2024-01-25 13:51:41 -08:00
Bagatur	31790d15ec	openai[patch]: accept function_call dict in bind_functions (#16483 ) Confusing that you can't pass in a dict	2024-01-25 13:47:44 -08:00
Bagatur	ef42d9d559	core[patch], community[patch], openai[patch]: consolidate openai tool… (#16485 ) … converters One way to convert anything to an OAI function: convert_to_openai_function One way to convert anything to an OAI tool: convert_to_openai_tool Corresponding bind functions on OAI models: bind_functions, bind_tools	2024-01-25 13:18:46 -08:00
Brian Burgin	148347e858	community[minor]: Add LiteLLM Router Integration (#15588 ) community: - Description: - Add new ChatLiteLLMRouter class that allows a client to use a LiteLLM Router as a LangChain chat model. - Note: The existing ChatLiteLLM integration did not cover the LiteLLM Router class. - Add tests and Jupyter notebook. - Issue: None - Dependencies: Relies on existing ChatLiteLLM integration - Twitter handle: @bburgin_0 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-25 11:03:05 -08:00
JongRok BAEK	3b8eba32f9	anthropic[patch]: Fix message type lookup in Anthropic Partners (#16563 ) - Description: The parameters for user and assistant in Anthropic should be 'ai -> assistant,' but they are reversed to 'assistant -> ai.' Below is error code. ```python anthropic.BadRequestError: Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'messages: Unexpected role "ai". Allowed roles are "user" or "assistant"'}} ``` [anthropic](`7177f3a71f/src/anthropic/types/beta/message_param.py (L13)`) - Issue: : #16561 - Dependencies: : None - Twitter handle: : None	2024-01-25 09:17:59 -08:00
Dmitry Tyumentsev	e86e66bad7	community[patch]: YandexGPT models - add sleep_interval (#16566 ) Added sleep between requests to prevent errors associated with simultaneous requests.	2024-01-25 09:07:19 -08:00
Bagatur	e510cfaa23	core[patch]: passthrough BaseRetriever.invoke(**kwargs) (#16551 ) Fix for #16547	2024-01-25 08:58:39 -08:00
Anders Åhsman	355ef2a4a6	langchain[patch]: Fix doc-string grammar (#16543 ) - Description: Small grammar fix in docstring for class `BaseCombineDocumentsChain`.	2024-01-25 10:00:06 -05:00
Aditya	9dd7cbb447	google-genai: added logic for method get_num_tokens() (#16205 ) <!-- Thank you for contributing to LangChain! Please title your PR "partners: google-genai", Replace this entire comment with: - Description: : added logic for method get_num_tokens() for ChatGoogleGenerativeAI , GoogleGenerativeAI, - Issue: : https://github.com/langchain-ai/langchain/issues/16204, - Dependencies: : None, - Twitter handle: @Aditya_Rane --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru>	2024-01-24 21:43:16 -07:00
James Braza	0785432e7b	langchain-google-vertexai: perserving grounding metadata (#16309 ) Revival of https://github.com/langchain-ai/langchain/pull/14549 that closes https://github.com/langchain-ai/langchain/issues/14548.	2024-01-24 21:37:43 -07:00
Erick Friis	adc008407e	exa: init pkg (#16553 )	2024-01-24 20:57:17 -07:00
Rave Harpaz	c4e9c9ca29	community[minor]: Add OCI Generative AI integration (#16548 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: Adding Oracle Cloud Infrastructure Generative AI integration. Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service that provides a set of state-of-the-art, customizable large language models (LLMs) that cover a wide range of use cases, and which is available through a single API. Using the OCI Generative AI service you can access ready-to-use pretrained models, or create and host your own fine-tuned custom models based on your own data on dedicated AI clusters. https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm - Issue: None, - Dependencies: OCI Python SDK, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. Passed See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. we provide unit tests. However, we cannot provide integration tests due to Oracle policies that prohibit public sharing of api keys. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 18:23:50 -08:00
Bagatur	c173a69908	langchain[patch]: oai tools output parser nit (#16540 ) allow positional init args	2024-01-24 16:57:16 -08:00
arnob-sengupta	f9976b9630	core[patch]: consolidate conditional in BaseTool (#16530 ) - Description: Refactor contradictory conditional to single line - Issue: #16528	2024-01-24 16:56:58 -08:00
Bagatur	5c2538b9f7	anthropic[patch]: allow pop by field name (#16544 ) allow `ChatAnthropicMessages(model=...)`	2024-01-24 15:48:31 -07:00
Harel Gal	a91181fe6d	community[minor]: add support for Guardrails for Amazon Bedrock (#15099 ) Added support for optionally supplying 'Guardrails for Amazon Bedrock' on both types of model invocations (batch/regular and streaming) and for all models supported by the Amazon Bedrock service. @baskaryan @hwchase17 ```python llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, guardrails={"id": " <guardrail_id>", "version": "<guardrail_version>", "trace": True}, callbacks=[BedrockAsyncCallbackHandler()]) class BedrockAsyncCallbackHandler(AsyncCallbackHandler): """Async callback handler that can be used to handle callbacks from langchain.""" async def on_llm_error( self, error: BaseException, **kwargs: Any, ) -> Any: reason = kwargs.get("reason") if reason == "GUARDRAIL_INTERVENED": # kwargs contains additional trace information sent by 'Guardrails for Bedrock' service. print(f"""Guardrails: {kwargs}""") # streaming llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, streaming=True, guardrails={"id": "<guardrail_id>", "version": "<guardrail_version>"}) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:44:19 -08:00
Martin Kolb	04651f0248	community[minor]: VectorStore integration for SAP HANA Cloud Vector Engine (#16514 ) - Description: This PR adds a VectorStore integration for SAP HANA Cloud Vector Engine, which is an upcoming feature in the SAP HANA Cloud database (https://blogs.sap.com/2023/11/02/sap-hana-clouds-vector-engine-announcement/). - Issue: N/A - Dependencies: [SAP HANA Python Client](https://pypi.org/project/hdbcli/) - Twitter handle: @sapopensource Implementation of the integration: `libs/community/langchain_community/vectorstores/hanavector.py` Unit tests: `libs/community/tests/unit_tests/vectorstores/test_hanavector.py` Integration tests: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Example notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb` Access credentials for execution of the integration tests can be provided to the maintainers. --------- Co-authored-by: sascha <sascha.stoll@sap.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:05:07 -08:00
Leonid Kuligin	1113700b09	google-genai[patch]: better error message when location is not supported (#16535 ) Replace this entire comment with: - Description: a better error message when location is not supported	2024-01-24 13:58:46 -08:00
Unai Garay Maestre	fdbfa6b2c8	Adds progress bar to VertexAIEmbeddings (#14542 ) - Description: Adds progress bar to VertexAIEmbeddings - Issue: related issue https://github.com/langchain-ai/langchain/issues/13637 Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com>	2024-01-24 11:16:16 -07:00
James Braza	643fb3ab50	langchain-google-vertexai[patch]: more verbose mypy config (#16307 ) Flushing out the `mypy` config in `langchain-google-vertexai` to show error codes and other warnings This PR also bumps `mypy` to above version 1's stable release	2024-01-24 11:10:45 -07:00
Jeremi Joslin	9e95699277	community[patch]: Fix error message when litellm is not installed (#16316 ) The error message was mentioning the wrong package. I updated it to the correct one.	2024-01-23 21:42:29 -08:00
bachr	b3ed98dec0	community[patch]: avoid KeyError when language not in LANGUAGE_SEGMENTERS (#15212 ) Description: Handle unsupported languages in same way as when none is provided Issue: The following line will throw a KeyError if the language is not supported. ```python self.Segmenter = LANGUAGE_SEGMENTERS[language] ``` E.g. when using `Language.CPP` we would get `KeyError: <Language.CPP: 'cpp'>` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 21:09:43 -08:00
Nuno Campos	3f38e1a457	Remove double line (#16426 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-23 20:22:37 -08:00
chyroc	61da2ff24c	community[patch]: use SecretStr for yandex model secrets (#15463 )	2024-01-23 20:08:53 -08:00
Alessio Serra	d628a80a5d	community[patch]: added 'conversational' as a valid task for hugginface endopoint models (#15761 ) - Description: added the conversational task to hugginFace endpoint in order to use models designed for chatbot programming. - Dependencies: None --------- Co-authored-by: Alessio Serra (ext.) <alessio.serra@partner.bmw.de> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 20:04:15 -08:00
Karim Lalani	4c7755778d	community[patch]: SurrealDB fix for asyncio (#16092 ) Code fix for asyncio	2024-01-23 19:46:19 -08:00
Raunak	476bf8b763	community[patch]: Load list of files using UnstructuredFileLoader (#16216 ) - Description: Updated `_get_elements()` function of `UnstructuredFileLoader `class to check if the argument self.file_path is a file or list of files. If it is a list of files then it iterates over the list of file paths, calls the partition function for each one, and appends the results to the elements list. If self.file_path is not a list, it calls the partition function as before. - Issue: Fixed #15607, - Dependencies: NA - Twitter handle: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-23 19:37:37 -08:00
Xudong Sun	019b6ebe8d	community[minor]: Add iFlyTek Spark LLM chat model support (#13389 ) - Description: This PR enables LangChain to access the iFlyTek's Spark LLM via the chat_models wrapper. - Dependencies: websocket-client ^1.6.1 - Tag maintainer: @baskaryan ### SparkLLM chat model usage Get SparkLLM's app_id, api_key and api_secret from [iFlyTek SparkLLM API Console](https://console.xfyun.cn/services/bm3) (for more info, see [iFlyTek SparkLLM Intro](https://xinghuo.xfyun.cn/sparkapi) ), then set environment variables `IFLYTEK_SPARK_APP_ID`, `IFLYTEK_SPARK_API_KEY` and `IFLYTEK_SPARK_API_SECRET` or pass parameters when using it like the demo below: ```python3 from langchain.chat_models.sparkllm import ChatSparkLLM client = ChatSparkLLM( spark_app_id="<app_id>", spark_api_key="<api_key>", spark_api_secret="<api_secret>" ) ```	2024-01-23 19:23:46 -08:00
Ali Zendegani	80fcc50c65	langchain[patch]: Minor Fix: Enable Passing custom_headers for Authentication in GraphQL Agent/Tool (#16413 ) - Description: This PR aims to enhance the `langchain` library by enabling the support for passing `custom_headers` in the `GraphQLAPIWrapper` usage within `langchain/agents/load_tools.py`. While the `GraphQLAPIWrapper` from the `langchain_community` module is inherently capable of handling `custom_headers`, its current invocation in `load_tools.py` does not facilitate this functionality. This limitation restricts the use of the `graphql` tool with databases or APIs that require token-based authentication. The absence of support for `custom_headers` in this context also leads to a lack of error messages when attempting to interact with secured GraphQL endpoints, making debugging and troubleshooting more challenging. This update modifies the `load_tools` function to correctly handle `custom_headers`, thereby allowing secure and authenticated access to GraphQL services requiring tokens. Example usage after the proposed change: ```python tools = load_tools( ["graphql"], graphql_endpoint="https://your-graphql-endpoint.com/graphql", custom_headers={"Authorization": f"Token {api_token}"}, ) ``` - Issue: None, - Dependencies: None, - Twitter handle: None	2024-01-23 19:19:53 -08:00
Serena Ruan	5c6e123757	community[patch]: Fix MlflowCallback with none artifacts_dir (#16487 )	2024-01-23 19:09:02 -08:00
Krista Pratico	0e2e7d8b83	langchain[patch]: allow passing client with OpenAIAssistantRunnable (#16486 ) - Description: This addresses the issue tagged below where if you try to pass your own client when creating an OpenAI assistant, a pydantic error is raised: Example code: ```python import openai from langchain.agents.openai_assistant import OpenAIAssistantRunnable client = openai.OpenAI() interpreter_assistant = OpenAIAssistantRunnable.create_assistant( name="langchain assistant", instructions="You are a personal math tutor. Write and run code to answer math questions.", tools=[{"type": "code_interpreter"}], model="gpt-4-1106-preview", client=client ) ``` Error: `pydantic.v1.errors.ConfigError: field "client" not yet prepared, so the type is still a ForwardRef. You might need to call OpenAIAssistantRunnable.update_forward_refs()` It additionally updates type hints and docstrings to indicate that an AzureOpenAI client is permissible as well. - Issue: https://github.com/langchain-ai/langchain/issues/15948 - Dependencies: N/A	2024-01-23 18:48:29 -08:00
bu2kx	ff3163297b	community[minor]: Add KDBAI vector store (#12797 ) Addition of KDBAI vector store (https://kdb.ai). Dependencies: `kdbai_client` v0.1.2 Python package. Sample notebook: `docs/docs/integrations/vectorstores/kdbai.ipynb` Tag maintainer: @bu2kx Twitter handle: @kxsystems	2024-01-23 18:37:01 -08:00
Shivani Modi	4e160540ff	community[minor]: Adding Konko Completion endpoint (#15570 ) This PR introduces update to Konko Integration with LangChain. 1. New Endpoint Addition: Integration of a new endpoint to utilize completion models hosted on Konko. 2. Chat Model Updates for Backward Compatibility: We have updated the chat models to ensure backward compatibility with previous OpenAI versions. 4. Updated Documentation: Comprehensive documentation has been updated to reflect these new changes, providing clear guidance on utilizing the new features and ensuring seamless integration. Thank you to the LangChain team for their exceptional work and for considering this PR. Please let me know if any additional information is needed. --------- Co-authored-by: Shivani Modi <shivanimodi@Shivanis-MacBook-Pro.local> Co-authored-by: Shivani Modi <shivanimodi@Shivanis-MBP.lan>	2024-01-23 18:22:32 -08:00
Gianfranco Demarco	c69f599594	langchain[patch]: Extract _aperform_agent_action from _aiter_next_step from AgentExecutor (#15707 ) - Description: extreact the _aperform_agent_action in the AgentExecutor class to allow for easier overriding. Extracted logic from _iter_next_step into a new method _perform_agent_action for consistency and easier overriding. - Issue: #15706 Closes #15706	2024-01-23 18:22:09 -08:00
i-w-a	95ee69a301	langchain[patch]: In HTMLHeaderTextSplitter set default encoding to utf-8 (#16372 ) - Description: The HTMLHeaderTextSplitter Class now explicitly specifies utf-8 encoding in the part of the split_text_from_file method that calls the HTMLParser. - Issue: Prevent garbled characters due to differences in encoding of html files (except for English in particular, I noticed that problem with Japanese). - Dependencies: No dependencies, - Twitter handle: @i_w__a	2024-01-23 18:20:29 -08:00
Noah Stapp	e135e5257c	community[patch]: Include scores in MongoDB Atlas QA chain results (#14666 ) Adds the ability to return similarity scores when using `RetrievalQA.from_chain_type` with `MongoDBAtlasVectorSearch`. Requires that `return_source_documents=True` is set. Example use: ``` vector_search = MongoDBAtlasVectorSearch.from_documents(...) qa = RetrievalQA.from_chain_type( llm=OpenAI(), chain_type="stuff", retriever=vector_search.as_retriever(search_kwargs={"additional": ["similarity_score"]}), return_source_documents=True ) ... docs = qa({"query": "..."}) docs["source_documents"][0].metadata["score"] # score will be here ``` I've tested this feature locally, using a MongoDB Atlas Cluster with a vector search index.	2024-01-23 18:18:28 -08:00
Serena Ruan	90f5a1c40e	community[minor]: Improve mlflow callback (#15691 ) - Description: Allow passing run_id to MLflowCallbackHandler to resume a run instead of creating a new run. Support recording retriever relevant metrics. Refactor the code to fix some bugs. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-01-23 18:16:51 -08:00
Facundo Santiago	92e6a641fd	feat: adding paygo api support for Azure ML / Azure AI Studio (#14560 ) - Description: Introducing support for LLMs and Chat models running in Azure AI studio and Azure ML using the new deployment mode pay-as-you-go (model as a service). - Issue: NA - Dependencies: None. - Tag maintainer: @prakharg-msft @gdyre - Twitter handle: @santiagofacundo Examples added: * [docs/docs/integrations/llms/azure_ml.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_endpoint.ipynb) * [docs/docs/integrations/chat/azureml_chat_endpoint.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_chat_endpoint.ipynb) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 17:08:51 -08:00
Davide Menini	9ce177580a	community: normalize bedrock embeddings (#15103 ) In this PR I added a post-processing function to normalize the embeddings. This happens only if the new `normalize` flag is `True`. --------- Co-authored-by: taamedag <Davide.Menini@swisscom.com>	2024-01-23 17:05:24 -08:00
baichuan-assistant	20fcd49348	community: Fix Baichuan Chat. (#15207 ) - Description: Baichuan Chat (with both Baichuan-Turbo and Baichuan-Turbo-192K models) has updated their APIs. There are breaking changes. For example, BAICHUAN_SECRET_KEY is removed in the latest API but is still required in Langchain. Baichuan's Langchain integration needs to be updated to the latest version. - Issue: #15206 - Dependencies: None, - Twitter handle: None @hwchase17. Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-23 17:01:57 -08:00
gcheron	cfc225ecb3	community: SQLStrStore/SQLDocStore provide an easy SQL alternative to `InMemoryStore` to persist data remotely in a SQL storage (#15909 ) Description: - Implement `SQLStrStore` and `SQLDocStore` classes that inherits from `BaseStore` to allow to persist data remotely on a SQL server. - SQL is widely used and sometimes we do not want to install a caching solution like Redis. - Multiple issues/comments complain that there is no easy remote and persistent solution that are not in memory (users want to replace InMemoryStore), e.g., https://github.com/langchain-ai/langchain/issues/14267, https://github.com/langchain-ai/langchain/issues/15633, https://github.com/langchain-ai/langchain/issues/14643, https://stackoverflow.com/questions/77385587/persist-parentdocumentretriever-of-langchain - This is particularly painful when wanting to use `ParentDocumentRetriever ` - This implementation is particularly useful when: * it's expensive to construct an InMemoryDocstore/dict * you want to retrieve documents from remote sources * you just want to reuse existing objects - This implementation integrates well with PGVector, indeed, when using PGVector, you already have a SQL instance running. `SQLDocStore` is a convenient way of using this instance to store documents associated to vectors. An integration example with ParentDocumentRetriever and PGVector is provided in docs/docs/integrations/stores/sql.ipynb or [here](https://github.com/gcheron/langchain/blob/sql-store/docs/docs/integrations/stores/sql.ipynb). - It persists `str` and `Document` objects but can be easily extended. Issue: Provide an easy SQL alternative to `InMemoryStore`. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 16:50:48 -08:00
Massimiliano Pronesti	e529939c54	feat(llms): support more tasks in HuggingFaceHub LLM and remove deprecated dep (#14406 ) - Description: this PR upgrades the `HuggingFaceHub` LLM: * support more tasks (`translation` and `conversational`) * replaced the deprecated `InferenceApi` with `InferenceClient` * adjusted the overall logic to use the "recommended" model for each task when no model is provided, and vice-versa. - Tag mainter(s): @baskaryan @hwchase17	2024-01-23 16:48:56 -08:00
Erick Friis	afb25eeec4	cli[patch]: add integration tests to default makefile (#16479 )	2024-01-23 16:09:16 -07:00
Bagatur	ba326b98d0	langchain[patch]: Release 0.1.3 (#16475 )	2024-01-23 11:50:25 -08:00
Bagatur	54149292f8	community[patch]: Release 0.0.15 (#16474 )	2024-01-23 11:50:10 -08:00
Bagatur	ef6a335570	core[patch]: Release 0.1.15 (#16473 )	2024-01-23 11:31:50 -08:00
Erick Friis	1f4ac62dee	cli[patch], google-vertexai[patch]: readme template (#16470 )	2024-01-23 12:08:17 -07:00
Tomaz Bratanic	d0a8082188	Fix neo4j sanitize (#16439 ) Fix the sanitization bug and add an integration test	2024-01-23 10:56:28 -05:00
William FH	5de59f9236	Core[Patch] Parse tool input after on_start (#16430 ) For tracing, if a validation error occurs, currently it is attributed to the previous step of the chain. It would be nice to have the on_start and on_error callbacks called for tools when there is a validation error that occurs to more easily attribute the root-cause	2024-01-23 10:54:47 -05:00
Nuno Campos	226fe645f1	core[patch] Do not try to access attribute of None (#16321 )	2024-01-22 22:10:03 -08:00
Florian MOREL	4b7969efc5	community[minor]: New documents loader for visio files (with extension .vsdx) (#16171 ) Description : New documents loader for visio files (with extension .vsdx) A [visio file](https://fr.wikipedia.org/wiki/Microsoft_Visio) (with extension .vsdx) is associated with Microsoft Visio, a diagram creation software. It stores information about the structure, layout, and graphical elements of a diagram. This format facilitates the creation and sharing of visualizations in areas such as business, engineering, and computer science. A Visio file can contain multiple pages. Some of them may serve as the background for others, and this can occur across multiple layers. This loader extracts the textual content from each page and its associated pages, enabling the extraction of all visible text from each page, similar to what an OCR algorithm would do. Dependencies : xmltodict package	2024-01-22 22:07:03 -08:00
Boris Feld	404abf139a	community: Add CometLLM tracing context var (#15765 ) I also added LANGCHAIN_COMET_TRACING to enable the CometLLM tracing integration similar to other tracing integrations. This is easier for end-users to enable it rather than importing the callback and pass it manually. (This is the same content as https://github.com/langchain-ai/langchain/pull/14650 but rebased and squashed as something seems to confuse Github Action).	2024-01-22 15:17:16 -08:00
Nicolò Boschi	a500527030	infra: google-vertexai relax types-requests deps range (#16264 ) - Description: At the moment it's not possible to include in the same project langchain-google-vertexai and boto3 (e.g. use bedrock and vertex in the same application) because of the dependency resolutions conflict. boto3 is still using urllib3 1.x, meanwhile langchain-google-vertexai -> types-requests depends on urllib3 2.x. [the last version of types-requests that allows urllib3 1.x is 2.31.0.6](https://pypi.org/project/types-requests/#description). In this PR I allow the vertexai package to get that version also. - Twitter handle: nicoloboschi	2024-01-22 14:54:41 -08:00
DL	b9e7f6f38a	community[minor]: Bedrock async methods (#12477 ) Description: Added support for asynchronous streaming in the Bedrock class and corresponding tests. Primarily: async def aprepare_output_stream async def _aprepare_input_and_invoke_stream async def _astream async def _acall I've ensured that the code adheres to the project's linting and formatting standards by running make format, make lint, and make test. Issue: #12054, #11589 Dependencies: None Tag maintainer: @baskaryan Twitter handle: @dominic_lovric --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-01-22 14:44:49 -08:00
Frank995	5694728816	community[patch]: Implement vector length definition at init time in PGVector for indexing (#16133 ) Replace this entire comment with: - Description: allow user to define tVector length in PGVector when creating the embedding store, this allows for later indexing - Issue: #16132 - Dependencies: None	2024-01-22 14:32:44 -08:00
Chase VanSteenburg	1011b681dc	core[patch]: Fix f-string formatting in error message for configurable_fields (#16411 ) - Description: Simple fix to f-string formatting. Allows more informative ValueError output. - Issue: None needed. - Dependencies: None. - Twitter handle: @FlightP1an	2024-01-22 14:08:44 -08:00
parkererickson-tg	b26a22f307	community[minor]: add TigerGraph support (#16280 ) Description: Add support for querying TigerGraph databases through the InquiryAI service. Issue: N/A Dependencies: N/A Twitter handle: @TigerGraphDB	2024-01-22 14:07:44 -08:00
Alireza Kashani	d1b4ead87c	community[patch]: Update grobid.py (#16298 ) there is a case where "coords" does not exist in the "sentence" therefore, the "split(";")" will lead to error. we can fix that by adding "if sentence.get("coords") is not None:" the resulting empty "sbboxes" from this scenario will raise error at "sbboxes[0]["page"]" because sbboxes are empty. the PDF from https://pubmed.ncbi.nlm.nih.gov/23970373/ can replicate those errors.	2024-01-22 14:03:58 -08:00
s-g-1	fbe592a5ce	community[patch]: fix typo in pgvecto_rs debug msg (#16318 ) fixes typo in pip install message for the pgvecto_rs community vector store no issues found mentioning this no dependents changed	2024-01-22 14:01:33 -08:00
James Braza	d511366dd3	infra: absolute `EXAMPLE_DIR` path in core unit tests (#16325 ) If you invoked testing from places besides `core/`, this `EXAMPLE_DIR` path won't work. This PR makes`EXAMPLE_DIR` robust against invocation location	2024-01-22 14:00:23 -08:00
Ian	b9f5104e6c	communty[minor]: Store Message History to TiDB Database (#16304 ) This pull request integrates the TiDB database into LangChain for storing message history, marking one of several steps towards a comprehensive integration of TiDB with LangChain. A simple usage ```python from datetime import datetime from langchain_community.chat_message_histories import TiDBChatMessageHistory history = TiDBChatMessageHistory( connection_string="mysql+pymysql://<host>:<PASSWORD>@<host>:4000/<db>?ssl_ca=/etc/ssl/cert.pem&ssl_verify_cert=true&ssl_verify_identity=true", session_id="code_gen", earliest_time=datetime.utcnow(), # Optional to set earliest_time to load messages after this time point. ) history.add_user_message("hi! How's feature going?") history.add_ai_message("It's almot done") ```	2024-01-22 13:56:56 -08:00
Erick Friis	35ec0bbd3b	cli[patch]: pypi fields (#16410 )	2024-01-22 14:28:30 -07:00
Erick Friis	2ac3a82d85	cli[patch]: new fields in integration template, release 0.0.21 (#16398 )	2024-01-22 14:26:47 -07:00
Erick Friis	cfe95ab085	multiple: update langsmith dep (#16407 )	2024-01-22 14:23:11 -07:00
Eli Lucherini	6b2a57161a	community[patch]: allow additional kwargs in MlflowEmbeddings for compatibility with Cohere API (#15242 ) - Description: add support for kwargs in`MlflowEmbeddings` `embed_document()` and `embed_query()` so that all the arguments required by Cohere API (and others?) can be passed down to the server. - Issue: #15234 - Dependencies: MLflow with MLflow Deployments (`pip install mlflow[genai]`) Tests Now this code [adapted from the docs](https://python.langchain.com/docs/integrations/providers/mlflow#embeddings-example) for the Cohere API works locally. ```python """ Setup ----- export COHERE_API_KEY=... mlflow deployments start-server --config-path examples/deployments/cohere/config.yaml Run --- python /path/to/this/file.py """ embeddings = MlflowCohereEmbeddings(target_uri="http://127.0.0.1:5000", endpoint="embeddings") print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) ``` Output ``` [0.060455322, 0.028793335, -0.025848389] [0.031707764, 0.021057129, -0.009361267] ```	2024-01-22 11:38:11 -08:00
Guillem Orellana Trullols	aad2aa7188	community[patch]: BedrockChat -> Support Titan express as chat model (#15408 ) Titan Express model was not supported as a chat model because LangChain messages were not "translated" to a text prompt. Co-authored-by: Guillem Orellana Trullols <guillem.orellana_trullols@siemens.com>	2024-01-22 11:37:23 -08:00
Piotr Mardziel	1b9001db47	core[patch]: preserve inspect.iscoroutinefunction with @deprecated decorator (#16295 ) Adjusted `deprecate` decorator to make sure decorated async functions are still recognized as "coroutinefunction" by `inspect`. Before change, functions such as `LLMChain.acall` which are decorated as deprecated are not recognized as coroutine functions. After the change, they are recognized: ```python import inspect from langchain import LLMChain # Is false before change but true after. inspect.iscoroutinefunction(LLMChain.acall) ```	2024-01-22 11:34:13 -08:00
Katarina Supe	01c2f27ffa	community[patch]: Update Memgraph support (#16360 ) - Description: I removed two queries to the database and left just one whose results were formatted afterward into other type of schema (avoided two calls to DB) - Issue: / - Dependencies: / - Twitter handle: @supe_katarina	2024-01-22 11:33:28 -08:00
Max Jakob	8569b8f680	community[patch]: ElasticsearchStore enable max inner product (#16393 ) Enable max inner product for approximate retrieval strategy. For exact strategy we lack the necessary `maxInnerProduct` function in the Painless scripting language, this is why we do not add it there. Similarity docs: https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Joe McElroy <joseph.mcelroy@elastic.co>	2024-01-22 11:26:18 -08:00
Iskren Ivov Chernev	fc196cab12	community[minor]: DeepInfra support for chat models (#16380 ) Add deepinfra chat models support. This is https://github.com/langchain-ai/langchain/pull/14234 re-opened from my branch (so maintainers can edit).	2024-01-22 11:22:17 -08:00
Bagatur	85e8423312	community[patch]: Update bing results tool name (#16395 ) Make BingSearchResults tool name OpenAI functions compatible (can't have spaces). Fixes #16368	2024-01-22 11:11:03 -08:00
Max Jakob	de209af533	community[patch]: ElasticsearchStore: add relevance function selector (#16378 ) Implement similarity function selector for ElasticsearchStore. The scores coming back from Elasticsearch are already similarities (not distances) and they are already normalized (see [docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params)). Hence we leave the scores untouched and just forward them. This fixes #11539. However, in hybrid mode (when keyword search and vector search are involved) Elasticsearch currently returns no scores. This PR adds an error message around this fact. We need to think a bit more to come up with a solution for this case. This PR also corrects a small error in the Elasticsearch integration test. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-22 11:52:20 -07:00
y2noda	54f90fc6bc	langchain_google_vertexai:Enable the use of langchain's built-in tools in Gemini's function calling (#16341 ) - Issue: This is a PR about #16340 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: yuhei.tsunoda <yuhei.tsunoda@brainpad.co.jp>	2024-01-22 11:16:36 -07:00
Tom Jorquera	1445ac95e8	community[patch]: Enable streaming for GPT4all (#16392 ) `streaming` param was never passed to model	2024-01-22 09:54:18 -08:00
Bagatur	af9f1738ca	langchain[patch]: Release 0.1.2 (#16388 )	2024-01-22 09:32:24 -08:00
Bagatur	8779013847	community[patch]: Release 0.0.14 (#16384 )	2024-01-22 08:50:19 -08:00
Bagatur	9cf0f5eb78	core[patch]: Release 0.1.14 (#16382 )	2024-01-22 08:28:03 -08:00
Bagatur	1dc6c1ce06	core[patch], community[patch], langchain[patch], docs: Update SQL chains/agents/docs (#16168 ) Revamp SQL use cases docs. In the process update SQL chains and agents.	2024-01-22 08:19:08 -08:00
Bob Lin	acc14802d1	Fix `conn` field definition in SQLiteEntityStore (#15440 )	2024-01-22 07:53:49 -08:00
James Braza	e1c59779ad	core[patch]: Remove `print` statement on missing `grandalf` dependency in favor of more explicit ImportError (#16326 ) After this PR an ImportError will be raised without a print if grandalf is missing when using grandalf related code for printing runnable graphs.	2024-01-22 10:48:54 -05:00
Nuno Campos	971a68d04f	Docs: Update README.md in core (#16329 ) Docs: Update README.md in core	2024-01-22 10:42:31 -05:00
Eugene Yurtsev	89372fca22	core[patch]: Update sys info information (#16297 ) Update information collected in sys info. python -m langchain_core.sys_info System Information ------------------ > OS: Linux > OS Version: #14~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Mon Nov 20 18:15:30 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.1.10 > langchain: 0.1.0 > langchain_community: 0.0.11 > langchain_cli: 0.0.20 > langchain_experimental: 0.0.36 > langchain_openai: 0.0.2 > langchainhub: 0.1.14 > langserve: 0.0.19 Packages not installed (Not Necessarily a Problem) -------------------------------------------------- The following packages were not found: > langgraph	2024-01-22 10:18:04 -05:00
Luke	5396604ef4	community: Handling missing key in Google Trends API response. (#15864 ) - Description: Handing response where _interest_over_time_ is missing. - Issue: #15859 - Dependencies: None	2024-01-21 18:11:45 -08:00
Virat Singh	c2a614eddc	community: Add PolygonLastQuote Tool and Toolkit (#15990 ) Description: In this PR, I am adding a `PolygonLastQuote` Tool, which can be used to get the latest price quote for a given ticker / stock. Additionally, I've added a Polygon Toolkit, which we can use to encapsulate future tools that we build for Polygon. Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-21 15:08:55 -08:00
Nuno Campos	ef75bb63ce	core[patch] Fix tracer output of streamed runs with non-addable output (#16324 ) - Used to be None, now is just the last chunk <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-20 18:52:26 -08:00
Ryan French	3d23a5eb36	langchain[patch]: Allow OpenSearch Query Translator to correctly work with Date types (#16022 ) Description: Fixes an issue where the Date type in an OpenSearch Self Querying Retriever would fail to generate a valid query Issue: https://github.com/langchain-ai/langchain/issues/14225	2024-01-19 17:57:18 -08:00
Ofer Mendelevitch	ffae98d371	template: Update Vectara templates (#15363 ) fixed multi-query template for Vectara added self-query template for Vectara Also added prompt_name parameter to summarization CC @efriis Twitter handle: @ofermend	2024-01-19 17:32:33 -08:00
Bagatur	1e29b676d5	core[patch]: simple fallback streaming (#16055 )	2024-01-19 16:31:54 -08:00
Eugene Yurtsev	4ef0ed4ddc	astream_events: Add version parameter while method is in beta (#16290 ) Add a version parameter while the method is in beta phase. The idea is to make it possible to minimize making breaking changes for users while we're iterating on schema. Once the API is stable we can assign a default version requirement.	2024-01-19 13:20:02 -05:00
Bagatur	91230ef5d1	openai[patch]: Release 0.0.3 (#16289 )	2024-01-19 10:15:08 -08:00
Hamza Kyamanywa	39b3c6d94c	langchain[patch]: Add konlpy based text splitting for Korean (#16003 ) - Description: Adds a text splitter based on [Konlpy](https://konlpy.org/en/latest/#start) which is a Python package for natural language processing (NLP) of the Korean language. (It is like Spacy or NLTK for Korean) - Dependencies: Konlpy would have to be installed before this splitter is used, - Twitter handle: @untilhamza	2024-01-19 09:44:56 -08:00
Bagatur	e3828bee43	core[patch]: Release 0.1.13 (#16287 )	2024-01-19 09:28:31 -08:00
Bagatur	2454fefc53	docs: agent prompt docs (#16105 )	2024-01-19 09:19:22 -08:00
Bagatur	84bf5787a7	core[patch], openai[patch]: Chat openai stream logprobs (#16218 )	2024-01-19 09:16:09 -08:00
Carey	021b0484a8	community[patch]: add skipped test for inner product normalization (#14989 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 23:03:15 -08:00
Christophe Bornet	3ccbe11363	community[minor]: Add Cassandra document loader (#16215 ) - Description: document loader for Apache Cassandra - Twitter handle: cbornet_	2024-01-18 18:49:02 -08:00
mikeFore4	9d32af72ce	community[patch]: huggingface hub character removal bug fix (#16233 ) - Description: Some text-generation models on huggingface repeat the prompt in their generated response, but not all do! The tests use "gpt2" which DOES repeat the prompt and as such, the HuggingFaceHub class is hardcoded to remove the first few characters of the response (to match the len(prompt)). However, if you are using a model (such as the very popular "meta-llama/Llama-2-7b-chat-hf") that DOES NOT repeat the prompt in it's generated text, then the beginning of the generated text will be cut off. This code change fixes that bug by first checking whether the prompt is repeated in the generated response and removing it conditionally. - Issue: #16232 - Dependencies: N/A - Twitter handle: N/A	2024-01-18 18:44:10 -08:00
Andreas Motl	3613d8a2ad	community[patch]: Use SQLAlchemy's `bulk_save_objects` method to improve insert performance (#16244 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to save embeddings in batches, to improve its performance. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 Hi again from the CrateDB team, following up on GH-16243, this is another minor patch to the pgvector vector store adapter. Inserting embeddings in batches, using [SQLAlchemy's `bulk_save_objects`](https://docs.sqlalchemy.org/en/20/orm/session_api.html#sqlalchemy.orm.Session.bulk_save_objects) method, can deliver substantial performance gains. With kind regards, Andreas. NB: As I am seeing just now that this method is a legacy feature of SA 2.0, it will need to be reworked on a future iteration. However, it is not deprecated yet, and I haven't been able to come up with a different implementation, yet.	2024-01-18 18:35:39 -08:00
Eugene Yurtsev	177af65dc4	core[minor]: RFC Add astream_events to Runnables (#16172 ) This PR adds `astream_events` method to Runnables to make it easier to stream data from arbitrary chains. * Streaming only works properly in async right now * One should use `astream()` with if mixing in imperative code as might be done with tool implementations * Astream_log has been modified with minimal additive changes, so no breaking changes are expected * Underlying callback code / tracing code should be refactored at some point to handle things more consistently (OK for now) - ~~[ ] verify event for on_retry~~ does not work until we implement streaming for retry - ~~[ ] Any rrenaming? Should we rename "event" to "hook"?~~ - [ ] Any other feedback from community? - [x] throw NotImplementedError for `RunnableEach` for now ## Example See this [Example Notebook](`dbbc7fa0d6/docs/docs/modules/agents/how_to/streaming_events.ipynb`) for an example with streaming in the context of an Agent ## Event Hooks Reference Here is a reference table that shows some events that might be emitted by the various Runnable objects. Definitions for some of the Runnable are included after the table. \| event \| name \| chunk \| input \| output \| \|----------------------\|------------------\|---------------------------------\|-----------------------------------------------\|-------------------------------------------------\| \| on_chat_model_start \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| \| \| on_chat_model_stream \| [model name] \| AIMessageChunk(content="hello") \| \| \| \| on_chat_model_end \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| {"generations": [...], "llm_output": None, ...} \| \| on_llm_start \| [model name] \| \| {'input': 'hello'} \| \| \| on_llm_stream \| [model name] \| 'Hello' \| \| \| \| on_llm_end \| [model name] \| \| 'Hello human!' \| \| on_chain_start \| format_docs \| \| \| \| \| on_chain_stream \| format_docs \| "hello world!, goodbye world!" \| \| \| \| on_chain_end \| format_docs \| \| [Document(...)] \| "hello world!, goodbye world!" \| \| on_tool_start \| some_tool \| \| {"x": 1, "y": "2"} \| \| \| on_tool_stream \| some_tool \| {"x": 1, "y": "2"} \| \| \| \| on_tool_end \| some_tool \| \| \| {"x": 1, "y": "2"} \| \| on_retriever_start \| [retriever name] \| \| {"query": "hello"} \| \| \| on_retriever_chunk \| [retriever name] \| {documents: [...]} \| \| \| \| on_retriever_end \| [retriever name] \| \| {"query": "hello"} \| {documents: [...]} \| \| on_prompt_start \| [template_name] \| \| {"question": "hello"} \| \| \| on_prompt_end \| [template_name] \| \| {"question": "hello"} \| ChatPromptValue(messages: [SystemMessage, ...]) \| Here are declarations associated with the events shown above: `format_docs`: ```python def format_docs(docs: List[Document]) -> str: '''Format the docs.''' return ", ".join([doc.page_content for doc in docs]) format_docs = RunnableLambda(format_docs) ``` `some_tool`: ```python @tool def some_tool(x: int, y: str) -> dict: '''Some_tool.''' return {"x": x, "y": y} ``` `prompt`: ```python template = ChatPromptTemplate.from_messages( [("system", "You are Cat Agent 007"), ("human", "{question}")] ).with_config({"run_name": "my_template", "tags": ["my_template"]}) ```	2024-01-18 21:27:01 -05:00
SN	f175bf7d7b	Use env for revision id if not passed in as param; use `git describe` as backup (#16227 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-01-18 16:15:26 -08:00
Erick Friis	b9495da92d	langchain[patch]: fix stuff documents chain api docs render (#16159 )	2024-01-18 14:07:44 -08:00
Erick Friis	0e76d84137	google-vertexai[patch]: more integration test fixes (#16234 )	2024-01-18 13:59:23 -08:00
Erick Friis	aa35b43bcd	docs, google-vertex[patch]: function docs (#16231 )	2024-01-18 13:15:09 -08:00
Harrison Chase	f60f59d69f	google-vertexai[patch]: Harrison/vertex function calling (#16223 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 12:17:40 -08:00
Rajesh Thallam	6bc6d64a12	langchain_google_vertexai[patch]: Add support for SystemMessage for Gemini chat model (#15933 ) - Description: In Google Vertex AI, Gemini Chat models currently doesn't have a support for SystemMessage. This PR adds support for it only if a user provides additional convert_system_message_to_human flag during model initialization (in this case, SystemMessage would be prepended to the first HumanMessage). NOTE: The implementation is similar to #14824 - Twitter handle: rajesh_thallam --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 10:22:07 -08:00
Erick Friis	65b231d40b	mistralai[patch]: async integration tests (#16214 )	2024-01-18 09:45:44 -08:00
Eugene Zapolsky	6b9e3ed9e9	google-vertexai[minor]: added safety_settings property to gemini wrapper (#15344 ) Description: Gemini model has quite annoying default safety_settings settings. In addition, current VertexAI class doesn't provide a property to override such settings. So, this PR aims to - add safety_settings property to VertexAI - fix issue with incorrect LLM output parsing when LLM responds with appropriate 'blocked' response - fix issue with incorrect parsing LLM output when Gemini API blocks prompt itself as inappropriate - add safety_settings related tests I'm not enough familiar with langchain code base and guidelines. So, any comments and/or suggestions are very welcome. Issue: it will likely fix #14841 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 08:54:30 -08:00
Eugene Yurtsev	ecd4f0a7ec	core[patch]: testing add chat model for unit-tests (#16209 ) This PR adds a fake chat model for testing purposes. Used in this PR: https://github.com/langchain-ai/langchain/pull/16172	2024-01-18 11:30:53 -05:00
SN	7d444724d7	Add revision identifier to run_on_dataset (#16167 ) Allow specifying revision identifier for better project versioning	2024-01-17 20:27:43 -08:00
Eugene Yurtsev	5d8c147332	docs: Document and test PydanticOutputFunctionsParser (#15759 ) This PR adds documentation and testing to `PydanticOutputFunctionsParser(OutputFunctionsParser)`.	2024-01-17 18:21:18 -08:00
Christophe Bornet	3502a407d9	infra: Use dotenv in langchain-community's integration tests (#16137 ) * Removed some env vars not used in langchain package IT * Added Astra DB env vars in langchain package, used for cache tests * Added conftest.py to load env vars in langchain_community IT * Added .env.example in langchain_community IT	2024-01-17 18:18:26 -08:00
Nuno Campos	ca014d5b04	Update readme (#16160 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-17 13:56:07 -08:00
Tomaz Bratanic	1e80113ac9	community[patch]: Add neo4j timeout and value sanitization option (#16138 ) The timeout function comes in handy when you want to kill longrunning queries. The value sanitization removes all lists that are larger than 128 elements. The idea here is to remove embedding properties from results.	2024-01-17 13:22:19 -08:00
Krishna Shedbalkar	f238217cea	community[patch]: Basic Logging and Human input to ShellTool (#15932 ) - Description: As Shell tool is very versatile, while integrating it into applications as openai functions, developers have no clue about what command is being executed using the ShellTool. All one can see is: ![image](https://github.com/langchain-ai/langchain/assets/60742358/540e274a-debc-4564-9027-046b91424df3) Summarising my feature request: 1. There's no visibility about what command was executed. 2. There's no mechanism to prevent a command to be executed using ShellTool, like a y/n human input which can be accepted from user to proceed with executing the command., - Issue: the issue #15931 it fixes if applicable, - Dependencies: There isn't any dependancy, - Twitter handle: @krishnashed	2024-01-17 12:57:51 -08:00
Bagatur	679a3ae933	openai[patch]: clarify azure error (#16157 )	2024-01-17 12:43:14 -08:00
Bagatur	7ad9eba8f4	core[patch]: Release 0.1.12 (#16161 )	2024-01-17 12:39:45 -08:00
Leonid Kuligin	58f0ba306b	changed default params for gemini (#16044 ) Replace this entire comment with: - Description: changed default values for Vertex LLMs (to be handled on the SDK's side)	2024-01-17 12:19:18 -08:00
Bagatur	5c73fd5bba	core[patch]: support old core namespaces (#16155 )	2024-01-17 11:26:25 -08:00
Christophe Bornet	fb940d11df	community[patch]: Use newer MetadataVectorCassandraTable in Cassandra vector store (#15987 ) as VectorTable is deprecated Tested manually with `test_cassandra.py` vector store integration test.	2024-01-17 10:37:07 -08:00
Mohammad Mohtashim	1fa056c324	community[patch]: Don't set search path for unknown SQL dialects (#16047 ) - Description: Made a small fix for the `SQLDatabase` highlighted in an issue. The issue pertains to switching schema for different SQL engines. - Issue: #16023 @baskaryan	2024-01-17 10:31:11 -08:00
Erick Friis	11327e6b64	google-vertexai[patch]: typing, release 0.0.2 (#16153 )	2024-01-17 10:16:59 -08:00
Leonid Ganeline	2709d3e5f2	langchain[patch]: updated imports for `langchain.callbacks` (#16060 ) Updated imports from 'langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:06:59 -08:00
Leonid Ganeline	c5f6b828ad	langchain[patch], community[minor]: move `output_parsers.ernie_functions` (#16057 ) `output_parsers.ernie_functions` moved into `community`	2024-01-17 10:06:18 -08:00
Leonid Ganeline	49aff3ea5b	langchain[patch]: updated `agents` imports (#16061 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:02:29 -08:00
Leonid Ganeline	60b1bd02d7	langchain[patch]: updated imports for `output_parsers` (#16059 ) Updated imports from `langchain` to `core` where it is possible	2024-01-17 10:02:12 -08:00
Leonid Ganeline	9e9ad9b0e9	langchain[patch]: updated `retrievers` imports (#16062 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:01:06 -08:00
Leonid Ganeline	d350be959d	langchain[patch]: updated `chains` imports (#16064 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 09:58:42 -08:00
Fei Wang	d0e101e4e0	community[patch]: fix ollama astream (#16070 ) Update ollama.py	2024-01-17 09:42:41 -08:00
ChengZi	8597484195	langchain[patch]: support more comparators in Milvus self-querying retriever (#16076 ) - Description: Support IN and LIKE comparators in Milvus self-querying retriever, based on [Boolean Expression Rules](https://milvus.io/docs/boolean.md) - Issue: No - Dependencies: No - Twitter handle: No Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-01-17 09:41:23 -08:00
Kapil Sachdeva	f406dc3872	docs: in RunnableRetry, correct the example snippet that uses with_retry method on Runnable (#16108 ) The example code snippet for with_retry is using incorrect argument names. This PR fixes that	2024-01-17 09:11:27 -08:00
BeatrixCohere	b0c3e3db2b	community[patch]: Handle when documents are not provided in the Cohere response (#16144 ) - Description: This handles the cohere response when documents aren't included in the response - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-01-17 09:11:00 -08:00
Felix Krones	d91126fc64	community[patch]: missing unpack operator for or_clause in pgvector document filter (#16148 ) - Fix for #16146 - Adding unpack operation to "or" and "and" filter for pgvector retriever. #	2024-01-17 09:10:43 -08:00
Erick Friis	06fe2f4fb0	partners: add license field (#16117 ) - bumps package post versions for packages without current unreleased updates - will bump package version in release prs associated with packages that do have changes (mistral, vertex)	2024-01-17 08:37:13 -08:00
Erick Friis	ce10fe0c2f	mistralai[patch]: release 0.0.3 (#16116 ) embeddings	2024-01-17 08:36:05 -08:00
William FH	e5cf1e2414	Community[patch]use secret str in Tavily and HuggingFaceInferenceEmbeddings (#16109 ) So the api keys don't show up in repr's Still need to do tests	2024-01-17 00:30:07 -08:00
William FH	f3601b0aaf	Community[Patch] Remove docs form bm25 repr (#16110 ) Resolves: https://github.com/langchain-ai/langsmith-sdk/issues/356	2024-01-17 00:00:55 -08:00
David	c323742f4f	mistralai[minor]: Add embeddings (#15282 ) - Description: Adds MistralAIEmbeddings class for embeddings, using the new official API. - Dependencies: mistralai - Tag maintainer: @efriis, @hwchase17 - Twitter handle: @LMS_David_RS Create `integrations/text_embedding/mistralai.ipynb`: an example notebook for MistralAIEmbeddings class Modify `embeddings/__init__.py`: Import the class Create `embeddings/mistralai.py`: The embedding class Create `integration_tests/embeddings/test_mistralai.py`: The test file. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:48:37 -08:00
Leonid Kuligin	4df14a61fc	google-vertexai[minor]: add function calling on VertexAI (#15822 ) Replace this entire comment with: - Description: Description: added support for tools on VertexAI - Issue: #15073 - Twitter handle: lkuligin --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:01:26 -08:00
Bagatur	8840a8cc95	docs: tool-use use case (#15783 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-16 10:41:14 -08:00
Bagatur	3d34347a85	langchain[patch]: bump core dep to 0.1.9 (#16104 )	2024-01-16 10:39:07 -08:00
Bagatur	62a2e9ee19	langchain[patch]: Release 0.1.1 (#16103 )	2024-01-16 10:17:38 -08:00
Bagatur	076593382a	core[patch]: Release 0.1.11 (#16100 )	2024-01-16 09:46:04 -08:00
Bagatur	c5656a4905	core[patch]: pass exceptions to fallbacks (#16048 )	2024-01-16 09:36:43 -08:00
Nuno Campos	770f57196e	Add unit test for overridden lc_namespace (#16093 )	2024-01-16 09:22:52 -08:00
Erick Friis	52114bdfac	community[patch]: release 0.0.13 (#16087 )	2024-01-16 06:25:28 -08:00
James Briggs	ca288d8f2c	community[patch]: add vector param to index query for pinecone vec store (#16054 )	2024-01-16 06:12:19 -08:00
Antonio Morales	476fb328ee	community[patch]: implement adelete from VectorStore in Qdrant (#16005 ) Description: Implement `adelete` function from `VectorStore` in `Qdrant` to support other asynchronous flows such as async indexing (`aindex`) which requires `adelete` to be implemented. Since `Qdrant` can be passed an async qdrant client, this can be supported easily. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 19:57:09 -08:00
Bagatur	697a6f2c80	langchain[patch]: fix requests lint (#16049 )	2024-01-15 12:54:30 -08:00
高远	061e63eef2	community[minor]: add vikingdb vecstore (#15155 ) --------- Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-01-15 12:34:01 -08:00
andrijdavid	d196646811	community[patch]: Refactor OpenAIWhisperParserLocal (#15150 ) This PR addresses an issue in OpenAIWhisperParserLocal where requesting CUDA without availability leads to an AttributeError #15143 Changes: - Refactored Logic for CUDA Availability: The initialization now includes a check for CUDA availability. If CUDA is not available, the code falls back to using the CPU. This ensures seamless operation without manual intervention. - Parameterizing Batch Size and Chunk Size: The batch_size and chunk_size are now configurable parameters, offering greater flexibility and optimization options based on the specific requirements of the use case. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 12:29:14 -08:00
Zhichao HAN	5cf06db3b3	community[minor]: add JsonRequestsWrapper tool (#15374 ) Description: This new feature enhances the flexibility of pipeline integration, particularly when working with RESTful APIs. ``JsonRequestsWrapper`` allows for the decoding of JSON output, instead of the only option for text output. --------- Co-authored-by: Zhichao HAN <hanzhichao2000@hotmail.com>	2024-01-15 12:27:19 -08:00
chyroc	d334efc848	community[patch]: fix top_p type hint (#15452 ) fix: https://github.com/langchain-ai/langchain/issues/15341 @efriis	2024-01-15 11:59:39 -08:00
Mateusz Szewczyk	251afda549	community[patch]: fix stop (stop_sequences) param on WatsonxLLM (#15541 ) - Description: Fix to IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM provider (stop (`stop_sequences`) param on watsonxLLM) - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/),	2024-01-15 11:44:57 -08:00
Funkeke	7220124368	community[patch]: fix tongyi completion and params error (#15544 ) fix tongyi completion json parse error and prompt's params error --------- Co-authored-by: fangkeke <3339698829@qq.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 11:43:13 -08:00
盐粒 Yanli	ddf4e7c633	community[minor]: Update pgvecto_rs to use its high level sdk (#15574 ) - Description: Update pgvecto_rs to use its high level sdk, - Issue: fix #15173	2024-01-15 11:41:59 -08:00
YHW	ce21392a21	community: add a flag that determines whether to load the milvus collection (#15693 ) fix https://github.com/langchain-ai/langchain/issues/15694 --------- Co-authored-by: hyungwookyang <hyungwookyang@worksmobile.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:25:23 -08:00
Mohammad Mohtashim	9e779ca846	community[patch]: Fixing the SlackGetChannel Tool Input Error (#15725 ) Fixed the issue mentioned in #15698 for SlackGetChannel Tool. @baskaryan. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:23:55 -08:00
axiangcoding	daa9ccae52	community[patch]: deprecate ErnieBotChat and ErnieEmbeddings classes (#15862 ) - Description: add deprecated warning for ErnieBotChat and ErnieEmbeddings. - These two classes lack maintenance and do not use the sdk provided by qianfan, which means hard to implement some key feature like streaming. - The alternative `langchain_community.chat_models.QianfanChatEndpoint` and `langchain_community.embeddings.QianfanEmbeddingsEndpoint` can completely replace these two classes, only need to change configuration items. - Issue: None, - Dependencies: None, - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:14:44 -08:00
JaguarDB	b11fd3bedc	community[patch]: jaguar vector store fix integer-element error when joining metadata values (#15939 ) - Description: some document loaders add integer-type metadata values which cause error - Issue: 15937 - Dependencies: none --------- Co-authored-by: JY <jyjy@jaguardb>	2024-01-15 11:13:45 -08:00
Neo Zhao	21e0df937f	community[patch]: fix a bug that mistakenly handle zip iterator in FAISS.from_embeddings (#16020 ) Description: `zip` is iterator that will only produce result once, so the previous code will cause the `embeddings` to be an empty list. Issue: I could not find a related issue. Dependencies: this PR does not introduce or affect dependencies. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:13:14 -08:00
Christophe Bornet	15c2b4a47e	community[minor]: Add AstraDB self query retriever (#15738 ) - Description: this change adds a self-query retriever for AstraDB - Twitter handle: cbornet_	2024-01-15 11:04:11 -08:00
Leonid Ganeline	fb676d8a9b	community[minor], langchain[minor]: refactor `output_parsers` Rail (#15852 ) Moved Rail parser to `community` package.	2024-01-15 10:54:49 -08:00
Massimiliano Pronesti	e80aab2275	docs(community): update Amadeus toolkit to langchain v0.1 (#15976 ) - Description: docs update following the changes introduced in #15879 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-15 10:50:47 -08:00
Ashley Xu	ce7723c1e5	community[minor]: add additional support for `BigQueryVectorSearch` (#15904 ) BigQuery vector search lets you use GoogleSQL to do semantic search, using vector indexes for fast but approximate results, or using brute force for exact results. This PR: 1. Add `metadata[_job_ib]` in Document returned by any similarity search 2. Add `explore_job_stats` to enable users to explore job statistics and better the debuggability 3. Set the minimum row limit for running create vector index.	2024-01-15 10:45:15 -08:00
Mohammed Naqi	8799b028a6	community[minor]: Adding asynchronous function implementation for Doctran (#15941 ) ## Description In this update, I addressed the missing implementation for atransform_document, which is the asynchronous counterpart of transform_document in Doctran. ### Usage Example: ```py # Instantiate DoctranPropertyExtractor with specified properties property_extractor = DoctranPropertyExtractor(properties=properties) # Asynchronously extract properties from a list of documents extracted_document = await property_extractor.atransform_documents( documents, properties=properties ) # Display metadata of the first extracted document print(json.dumps(extracted_document[0].metadata, indent=2)) ``` ## Issue - Pull request #14525 has caused a break in the aforementioned code. Instead of removing an asynchronous implementation of a function, consider implementing a synchronous version alongside it.	2024-01-15 10:39:25 -08:00
Raunak	c0773ab329	community[patch]: Fixed 'coroutine' object is not subscriptable error (#15986 ) - Description: Added parenthesis in return statement of aembed_query() funtion to fix 'coroutine' object is not subscriptable error. - Dependencies: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-15 10:34:10 -08:00
Karim Lalani	14244bd7e5	community[minor]: Added document loader for SurrealDB (#15995 ) Added a simple document loader to work with SurrealDB.	2024-01-15 10:32:42 -08:00
Karim Lalani	768e5e33bc	community[minor]: Fix to match SurrealDB 0.3.2 SDK (#15996 ) New version of SurrealDB python sdk was causing the integration to break. This fix addresses that change.	2024-01-15 10:31:59 -08:00
shahrin014	86321a949f	community: Ollama - Parameter structure to follow official documentation (#16035 ) ## Feature - Follow parameter structure as per official documentation - top level parameters (e.g. model, system, template) will be passed as top level parameters - other parameters will be sent in options unless options is provided ![image](https://github.com/langchain-ai/langchain/assets/17451563/d14715d9-9701-4ee3-b44b-89fffea62389) ## Tests - Test if top level parameters handled properly - Test if parameters that are not top level parameters are handled as options - Test if options is provided, it will be passed as is	2024-01-15 10:17:58 -08:00
Nir Kopler	0fa06732b7	community: add new gpt-3.5-turbo-1106 finetuned for cost calculation (#16039 ) Description: Added the new gpt-3.5-turbo-1106 for finetuned cost calculation, Issue: no issue found open By the information in OpenAI the pricing is the same as the older model (0613)	2024-01-15 08:36:54 -08:00
Bagatur	bccb07f93e	core[patch]: simple prompt pretty printing (#15968 )	2024-01-12 21:08:51 -05:00
Virat Singh	eb6e385dc5	community: Add PolygonAPIWrapper and get_last_quote endpoint (#15971 ) - Description: Added a `PolygonAPIWrapper` and an initial `get_last_quote` endpoint, which allows us to get the last price quote for a given `ticker`. Once merged, I can add a Polygon tool in `tools/` for agents to use. - Twitter handle: [@virattt](https://twitter.com/virattt) The Polygon.io Stocks API provides REST endpoints that let you query the latest market data from all US stock exchanges.	2024-01-12 17:52:09 -08:00
Erick Friis	74bac7bda1	community[patch]: core min 0.1.9 (#15974 )	2024-01-12 15:32:06 -08:00
Erick Friis	845e407e08	community[patch]: release 0.0.12 (#15973 )	2024-01-12 15:27:05 -08:00
Jonathan Algar	a74f3a4979	Batch update of alt text and title attributes for images in md/mdx files across repo (#15357 ) Description: Batch update of alt text and title attributes for images in `md` & `mdx` files across the repo using [alttexter](https://github.com/jonathanalgar/alttexter)/[alttexter-ghclient](https://github.com/jonathanalgar/alttexter-ghclient) (built using LangChain/LangSmith). Limitation: cannot update `ipynb` files because of [this issue](https://github.com/langchain-ai/langchain/pull/15357#issuecomment-1885037250). Can revisit when Docusaurus is bumped to v3. I checked all the generated alt texts and titles and didn't find any technical inaccuracies. That's not to say they're _perfect_, but a lot better than what's there currently. [Deployed](https://langchain-819yf1tbk-langchain.vercel.app/docs/modules/model_io/) image example: ![chrome_yZQ7BF2GTj](https://github.com/langchain-ai/langchain/assets/93204286/43a9a4d4-70fd-41c4-8978-b6240ff63ffa) You can see LangSmith traces for all the calls out to the LLM in the PRs merged into this one: * https://github.com/jonathanalgar/langchain/pull/6 * https://github.com/jonathanalgar/langchain/pull/4 * https://github.com/jonathanalgar/langchain/pull/3 I didn't add the following files to the PR as the images already have OK alt texts: * `27dca2d92f/docs/docs/integrations/providers/argilla.mdx (L3)` * `27dca2d92f/docs/docs/integrations/providers/apify.mdx (L11)` --------- Co-authored-by: github-actions <github-actions@github.com>	2024-01-12 14:37:48 -08:00
Varik Matevosyan	efe6cfafe2	community: Added Lantern as VectorStore (#12951 ) Support [Lantern](https://github.com/lanterndata/lantern) as a new VectorStore type. - Added Lantern as VectorStore. It will support 3 distance functions `l2 squared`, `cosine` and `hamming` and will use `HNSW` index. - Added tests - Added example notebook	2024-01-12 12:00:16 -08:00
Harrison Chase	1afac77439	stop making copies of inputs (#15926 )	2024-01-12 11:49:26 -08:00
Edwin Wenink	9fb09c1c30	community: fix the "page" mode in the AzureAIDocumentIntelligenceParser (bug) (#15958 ) Description: the "page" mode in the AzureAIDocumentIntelligenceParser is not accessible due to a wrong membership test. The mode argument can only be a string (also see the assertion in the `__init__`: `assert self.mode in ["single", "page", "object", "markdown"]`, so the check `elif self.mode == ["page"]:` always fails. As a result, effectively the "object" mode is used when selecting the "page" mode, which may lead to errors. The docstring of the `AzureAIDocumentIntelligenceLoader` also ommitted the `mode` parameter alltogether, so I added it. Issue: I could not find a related issue (this class is only 3 weeks old anyways) Dependencies: this PR does not introduce or affect dependencies. The current demo notebook and examples are not affected because they all use the default markdown mode.	2024-01-12 11:01:28 -08:00
Mahdi Setayesh	eb76f9c9fe	community: Fixing a performance issue with AzureSearch to perform batch embedding (#15594 ) - Description: Azure Cognitive Search vector DB store performs slow embedding as it does not utilize the batch embedding functionality. This PR provide a fix to improve the performance of Azure Search class when adding documents to the vector search, - Issue: #11313 , - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-12 10:58:55 -08:00
Bagatur	c697c89ca4	docs: add agent prompt creation examples (#15957 )	2024-01-12 10:26:12 -08:00
Erick Friis	69533c8628	multiple[patch]: .post releases and pyproject metadata (#15962 )	2024-01-12 10:09:02 -08:00
Erick Friis	95020637bc	openai[patch]: 0.0.2.post1, urls (#15961 )	2024-01-12 09:36:37 -08:00
ChengZi	d5808f786c	community: Support milvus partition key. (#15740 ) - Description: Milvus's partition key is an important feature. It can support multi-tenancy. We hope to introduce this feature. https://milvus.io/docs/partition_key.md - Issue: No - Dependencies: No - Twitter handle: No --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-12 09:15:03 -08:00
enfeng	13b90232c1	langchain-google-genai[patch]: Add support for end_point and transport parameters to the Gemini API (#15532 ) Add support for end_point and transport parameters to the Gemini API --------- Co-authored-by: yangenfeng <yangenfeng@xiaoniangao.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-12 08:52:00 -08:00

... 2 3 4 5 6 ...

2801 Commits