langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
Nan LI	f506b4cfd2	community: Integration of New Chat Model Based on ChatGLM3 via ZhipuAI API (#15105 ) - Description: - This PR introduces a significant enhancement to the LangChain project by integrating a new chat model powered by the third-generation base large model, ChatGLM3, via the zhipuai API. - This advanced model supports functionalities like function calls, code interpretation, and intelligent Agent capabilities. - The additions include the chat model itself, comprehensive documentation in the form of Python notebook docs, and thorough testing with both unit and integrated tests. - Dependencies: This update relies on the ZhipuAI package as a key dependency. - Twitter handle: If this PR receives spotlight attention, we would be honored to receive a mention for our integration of the advanced ChatGLM3 model via the ZhipuAI API. Kindly tag us at @kaiwu. To ensure quality and standards, we have performed extensive linting and testing. Commands such as make format, make lint, and make test have been run from the root of the modified package to ensure compliance with LangChain's coding standards. TO DO: Continue refining and enhancing both the unit tests and integrated tests. --------- Co-authored-by: jing <jingguo92@gmail.com> Co-authored-by: hyy1987 <779003812@qq.com> Co-authored-by: jianchuanqi <qijianchuan@hotmail.com> Co-authored-by: lirq <whuclarence@gmail.com> Co-authored-by: whucalrence <81530213+whucalrence@users.noreply.github.com> Co-authored-by: Jing Guo <48378126+JaneCrystall@users.noreply.github.com>	2024-01-01 15:17:03 -08:00
Hin	2cf1e73d12	Feat add volcano embedding (#14693 ) Description: Volcano Ark is an enterprise-grade large-model service platform for developers, providing a full range of functions and services such as model training, inference, evaluation, fine-tuning. You can visit its homepage at https://www.volcengine.com/docs/82379/1099455 for details. This change could help developers use the platform for embedding. Issue: None Dependencies: volcengine Tag maintainer: @baskaryan Twitter handle: @hinnnnnnnnnnnns --------- Co-authored-by: lujingxuansc <lujingxuansc@bytedance.com>	2024-01-01 14:37:35 -08:00
David Křístek	a010f29013	fix: call correct stream method in ollama (#15104 ) Co-authored-by: David Kristek <david@David--MacBook-Pro.local>	2024-01-01 14:03:53 -08:00
Christian Janiake	be578f32be	community:Lazy load wikipedia dump file (#15111 ) Description: the MWDumpLoader implementation currently does not support the lazy_load method, and the files are usually very large. We are proposing refactoring the load function, extracting two private functions with the functionality of loading the dump file and parsing a single page, to reuse the code in the lazy_load implementation.	2024-01-01 14:02:56 -08:00
chyroc	a4ae4bc361	feat: mask api_key for konko (#14010 ) for https://github.com/langchain-ai/langchain/issues/12165	2024-01-01 13:42:49 -08:00
joel-teratis	62d32bd214	fix(minor): added missing kwargs parameter to chroma query function (#14919 ) Description: This PR adds the `kwargs` parameter to six calls in the `chroma.py` package. All functions already were able to receive `kwargs` but they were discarded before. Issue: When passing `kwargs` to functions in the `chroma.py` package they are being ignored. For example: ``` chroma_instance.similarity_search_with_score( query, k=100, include=["metadatas", "documents", "distances", "embeddings"], # this parameter gets ignored ) ``` The `include` parameter does not get passed on to the next function and does not have any effect. Dependencies: None	2024-01-01 13:40:29 -08:00
chyroc	0665a7da19	Docs: add param comment for `tracing_v2_enabled` (#15308 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-01 13:38:44 -08:00
NuODaniel	7773943a51	community:qianfan endpoint support init params & remove useless params definietion (#15381 ) - Description: - support custom kwargs in object initialization. For instantance, QPS differs from multiple object(chat/completion/embedding with diverse models), for which global env is not a good choice for configuration. - Issue: no - Dependencies: no - Twitter handle: no @baskaryan PTAL	2024-01-01 13:12:31 -08:00
Nuno Campos	b9636e5c98	Catch type errors in dumps/dumpd (#15336 ) These can happen for edge cases not covered by `default` handler (eg. "strange" keys in dicts) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-29 17:37:12 -08:00
Nuno Campos	99000c612e	Propagate context vars in all classes/methods (#15329 ) - Any direct usage of ThreadPoolExecutor or asyncio.run_in_executor needs manual handling of context vars <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-29 15:59:00 -08:00
Ankush Gola	7eec8f2487	Delete V1 tracer and refactor tracer tests to core (#15326 )	2023-12-29 15:55:56 -08:00
Nuno Campos	4e4b119614	Fix executor	2023-12-29 15:50:45 -08:00
chyroc	7ce338201c	Patch: improve check openai version (#15301 )	2023-12-29 13:44:19 -08:00
Jon Nolen	27ee61645d	core: Update messages/__init__.py to account for AIMessageChunk which breaks message history runnable. (#15327 ) - Description: fix parse issue for AIMessageChunk when using - Issue: https://github.com/langchain-ai/langchain/issues/14511 - Dependencies: none - Twitter handle: none Taken from this fix: https://github.com/gpt-engineer-org/gpt-engineer/issues/804#issuecomment-1769853850 Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-29 13:41:47 -08:00
Nuno Campos	9bb1fbcadf	Lint	2023-12-29 12:43:55 -08:00
Nuno Campos	f7313adf2a	old py compat	2023-12-29 12:38:58 -08:00
Nuno Campos	eb5e250188	Propagate context vars in all classes/methods - Any direct usage of ThreadPoolExecutor or asyncio.run_in_executor needs manual handling of context vars	2023-12-29 12:34:03 -08:00
Shuai Liu	4b53440e70	Upgrades the Tongyi LLM and ChatTongyi Model (#14793 ) - Description: fixes and upgrades for the Tongyi LLM and ChatTongyi Model - Fixed typos; it should be `Tongyi`, not `OpenAI`. - Fixed a bug in `stream_generate_with_retry`; it's a real stream generator now. - Fixed a bug in `validate_environment`; the `dashscope_api_key` should be properly handled when set by environment variables or initialization parameters. - Changed the `dashscope` response to incremental output by setting the parameter `incremental_output`, which eliminates the need for the prefix-removal trick. - Removed some unused parameters, like `n`, `prefix_messages`. - Added `_stream` method. - Added async methods support, such as `_astream`, `_agenerate`, `_abatch`. - Dependencies: No new dependencies. - Tag maintainer: @hwchase17 > PS: Some may be confused about the terms `dashscope`, `tongyi`, and `Qwen`: > - `dashscope`: A platform to deploy LLMs and provide APIs to invoke the LLM. > - `tongyi`: A brand name or overall term about Alibaba Cloud's LLM/AI. > - `Qwen`: An LLM that is open-sourced and deployed in `dashscope`. > > We use the `dashscope` SDK to interact with the `tongyi`-`Qwen` LLM. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-29 12:06:12 -08:00
Romain Fouilland	6f15cc64b8	langchain: minor changes to StuffDocumentsChain._get_inputs (#15321 ) Correcting a small typo ('the' instead of 'then') and changing another 'the' (instead of 'then' too, it was a hard day for the 'n' key :D) to 'also' to match better with what is done in the code <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-29 11:53:30 -08:00
Bagatur	8bfac1a319	community[patch]: Release 0.0.7 (#15320 )	2023-12-29 13:10:23 -05:00
Harrison Chase	c3b3b77a11	[core] add test for json parser (#15297 ) this should fail, but isnt --------- Co-authored-by: Nuno Campos <nuno@langchain.dev>	2023-12-29 09:59:39 -08:00
Nuno Campos	ec090745a6	Improve markdown list parser (#15295 ) - do not match text after - in the middle of a sentence <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-29 09:59:21 -08:00
Bagatur	50e99ec601	langchain[patch]: Release 0.0.353 (#15322 )	2023-12-29 12:02:51 -05:00
Bagatur	80ceed6da5	core[patch]: Release 0.1.4 (#15319 )	2023-12-29 11:33:06 -05:00
Nuno Campos	36ceffd2cd	Strip code block fences and extra test from xml when doing streaming … (#15293 ) …parse <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-28 16:37:15 -08:00
Diego Rani Mazine	ec72225265	refactor: enable connection pool usage in PGVector (#11514 ) - Description: `PGVector` refactored to use connection pool. - Issue: #11433, - Tag maintainer: @hwchase17 @eyurtsev, --------- Co-authored-by: Diego Rani Mazine <diego.mazine@mercadolivre.com> Co-authored-by: Nuno Campos <nuno@langchain.dev>	2023-12-28 15:07:16 -08:00
chyroc	507c195a4b	Patch: improve openai functions call parser compatibility (#15197 ) ```shell Python 3.11.6 (main, Nov 2 2023, 04:39:43) [Clang 14.0.3 (clang-1403.0.22.14.1)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> s = {'name': 'gc', 'arguments': '{"prompt":"hi\nbob."}'} >>> import json >>> json.loads(s['arguments']) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/opt/homebrew/Cellar/python@3.11/3.11.6_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/json/__init__.py", line 346, in loads return _default_decoder.decode(s) ^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.6_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/json/decoder.py", line 337, in decode obj, end = self.raw_decode(s, idx=_w(s, 0).end()) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.6_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/json/decoder.py", line 353, in raw_decode obj, end = self.scan_once(s, idx) ^^^^^^^^^^^^^^^^^^^^^^ json.decoder.JSONDecodeError: Invalid control character at: line 1 column 14 (char 13) >>> json.loads(s['arguments'].replace('\n', '\\n')) {'prompt': 'hi\nbob.'} >>> ``` --------- Co-authored-by: Nuno Campos <nuno@langchain.dev>	2023-12-28 15:06:27 -08:00
joshy-deshaw	bf5385592e	core, community: propagate context between threads (#15171 ) While using `chain.batch`, the default implementation uses a `ThreadPoolExecutor` and run the chains in separate threads. An issue with this approach is that that [the token counting callback](https://python.langchain.com/docs/modules/callbacks/token_counting) fails to work as a consequence of the context not being propagated between threads. This PR adds context propagation to the new threads and adds some thread synchronization in the OpenAI callback. With this change, the token counting callback works as intended. Having the context propagation change would be highly beneficial for those implementing custom callbacks for similar functionalities as well. --------- Co-authored-by: Nuno Campos <nuno@langchain.dev>	2023-12-28 14:51:22 -08:00
Nuno Campos	f74151b4e4	Make all json parsing less strict by default (#15287 ) - Enables strict=False by default - Uses partial json recovery logic by default <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-28 14:48:53 -08:00
Harrison Chase	bc5a0ef6ca	remove chat-history (#15286 )	2023-12-28 14:22:16 -08:00
Harrison Chase	90aa26a90e	[langchain] agents code changes (#15278 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out!	2023-12-28 13:39:08 -08:00
Harrison Chase	b86803153e	[core, langchain] modelio code improvements (#15277 )	2023-12-28 12:56:20 -08:00
shroominic	694bbb14cd	community: fix typo in async ollama chat (#15276 ) Made a stupid typo in the last PR which got already merged😅	2023-12-28 09:56:55 -08:00
triThirty	fea4888e72	community: Enhance Github error prompt (#15248 ) - Description: The Github error prompt is confused because of JWT enctrypt to somebody not familiar with Github connection method. This PR is to add some useful error prompt to help users troubleshooting. - Issue: https://github.com/langchain-ai/langchain/issues/14550#issuecomment-1867445049 - Dependencies: None, - Twitter handle: None	2023-12-28 08:25:19 -08:00
Christopher Queen	d5e1725ace	langchain: Fix for issue #14631 - .devcontainer doesnt build (#15251 ) - Description: Fix for issue #14631 - Issue: This fixes [Issue #14631](https://github.com/langchain-ai/langchain/issues/14631) - Twitter handle: [@consultchrisq ](https://twitter.com/consultchrisq?lang=en)	2023-12-28 08:25:03 -08:00
Bob Lin	a464eb4394	community: Make doctran synchronous (#15264 ) ### Description I found that the methods in [the doctran library](https://github.com/psychic-api/doctran) have been restructured into [synchronized versions](`14944a59f7`), And [the example ipynb](https://github.com/psychic-api/doctran/blob/main/examples.ipynb) also shows that the code is synchronized, but the README has not been updated yet. so we need to modify the code and update the documentation. ### Issue https://github.com/langchain-ai/langchain/issues/14645	2023-12-28 08:05:24 -08:00
Brendan Smith	9a16590aa9	langchain: Fix class name in RetryOutputParser docstring (#15268 ) `OutputFixingParser` -> `RetryOutputParser` ![i'm-helping](https://github.com/langchain-ai/langchain/assets/5986636/68f1b8ce-8a6e-4e75-9cf8-e3c93ac562c2)	2023-12-28 08:03:46 -08:00
Nuno Campos	22b3a233b8	Update passthrough.py (#15252 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-27 22:12:32 -08:00
chyroc	6fb3cc6f27	Fix: Use `Union` instead of `\|` to improve compatibility, fix #15244 (#15245 )	2023-12-27 22:06:42 -08:00
Nuno Campos	6a5a2fb9c8	Add .pick and .assign methods to Runnable (#15229 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-27 13:35:34 -08:00
Nuno Campos	0252a24471	Implement nicer runnable seq constructor, Propagate name through Runn… (#15226 ) …ableBinding <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-27 11:24:32 -08:00
Nuno Campos	f36ef0739d	Add create_conv_retrieval_chain func (#15084 ) ``` +----------+ \| MapInput \| +----------+ * +------------------------------------+ \| Lambda(itemgetter('chat_history')) \| * +------------------------------------+ * * * * * * * +---------------------------+ +--------------------------------+ \| Lambda(_get_chat_history) \| \| Lambda(itemgetter('question')) \| +---------------------------+ +--------------------------------+ * * * * * * +----------------------------+ +------------------------+ \| ContextSet('chat_history') \| \| ContextSet('question') \| +----------------------------+ +------------------------+ ** ** +-----------+ \| MapOutput \| +-----------+ * * * +----------------+ \| PromptTemplate \| +----------------+ * * * +-------------+ \| FakeListLLM \| +-------------+ * * * +-----------------+ \| StrOutputParser \| +-----------------+ * * * +----------------------------+ \| ContextSet('new_question') \| +----------------------------+ * * * +---------------------+ \| SequentialRetriever \| +---------------------+ * * * +------------------------------------+ \| Lambda(_reduce_tokens_below_limit) \| +------------------------------------+ * * * +-------------------------------+ \| ContextSet('input_documents') \| +-------------------------------+ * * * +----------+ *\| MapInput \| *** +----------+ **** ****** * ***** ***** * ****** ** * ** +-------------------------------+ +----------------------------+ +----------------------------+ \| ContextGet('input_documents') \| \| ContextGet('chat_history') \| \| ContextGet('new_question') \| +-------------------------------+ +----------------------------+ +----------------------------+ ******* * ***** ****** * **** *** * **** +-----------+ \| MapOutput \| +-----------+ * * * +-------------+ \| FakeListLLM \| +-------------+ * * * +----------+ *\| MapInput \|* ****** +----------+ ** ***** * *** ****** * **** ** * * +-------------------------------+ +----------------------------+ +-------------+ \| ContextGet('input_documents') \| \| ContextGet('new_question') \| \| Passthrough \| +-------------------------------+ +----------------------------+ ***** +-------------+ ***** * **** **** * ***** ** * **** +-----------+ \| MapOutput \| +-----------+ ``` --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-26 17:28:10 -08:00
Harrison Chase	4ad77f777e	[core] prompt changes (#15186 ) change it to pass all variables through all the way in invoke	2023-12-26 15:52:17 -08:00
Nuno Campos	ccf9c8e0be	Better input and output schemas for chains that start or end with a R… (#15185 ) …unnableAssign or RunnablePick <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-26 15:21:13 -08:00
Nuno Campos	8cdc633465	Implement RunnablePassthrough.pick() (#15184 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-26 14:01:20 -08:00
chyroc	1abcf441ae	Refactor: use SecretStr for Predibase llms (#15119 )	2023-12-26 13:01:42 -08:00
chyroc	0a9a73a9c9	Refactor: use SecretStr for PipelineAI llms (#15120 )	2023-12-26 13:00:58 -08:00
chyroc	d63ceb65b3	Refactor: use SecretStr for StochasticAI llms (#15118 )	2023-12-26 12:59:51 -08:00
chyroc	674fde87d2	Refactor: use SecretStr for VolcEngineMaas llms (#15117 )	2023-12-26 12:59:08 -08:00
chyroc	3cc1da2b38	Refactor: use SecretStr for Petals llms (#15121 )	2023-12-26 12:57:37 -08:00
Quy Tang	7ef25a3c1b	Implement stream and astream for RunnableLambda (#14794 ) Description: Implement stream and astream methods for RunnableLambda to make streaming work for functions returning Runnable - Issue: https://github.com/langchain-ai/langchain/issues/11998 - Dependencies: No new dependencies - Twitter handle: https://twitter.com/qtangs --------- Co-authored-by: Nuno Campos <nuno@langchain.dev>	2023-12-26 12:49:02 -08:00
Nuno Campos	7e26559256	Fix runnable vistitor for funcs without pos args (#15182 )	2023-12-26 12:42:24 -08:00
Harrison Chase	b4a0d206d9	[core: minor] fix getters (#15181 )	2023-12-26 12:32:55 -08:00
Bagatur	56fad2e8ff	langchain[minor]: Add stuff docs runnable (#15178 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-26 12:20:00 -08:00
Harrison Chase	63916cfe35	[core] langauge model like (#15180 )	2023-12-26 12:19:50 -08:00
shroominic	e6f0cee896	community: Async Ollama + ChatOllama (#15169 ) Description: Adding async methods to booth OllamaLLM and ChatOllama to enable async streaming and async .on_llm_new_token callbacks. Issue: ChatOllama is not working in combination with an AsyncCallbackManager because the .on_llm_new_token method is not awaited.	2023-12-26 12:08:04 -08:00
Harrison Chase	33e024ad10	[core] print ascii (#15179 )	2023-12-26 11:43:14 -08:00
Phill Zarfos	35896faab7	community: correct spelling mistakes of "Suffle" and "reporoducibility" (#15172 ) - Description: Correct spelling mistakes of "Suffle" and "reporoducibility" in `DirectoryLoader` class - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2023-12-26 11:22:59 -08:00
chyroc	3a3f880e5a	Patch: improve ollama 404 api error message, fix #15147 (#15156 ) Make this issue more clearly exposed to developers	2023-12-26 11:07:39 -08:00
Nuno Campos	a2d3042823	Improve graph repr for runnable passthrough and itemgetter (#15083 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-22 16:05:48 -08:00
Nuno Campos	0d0901ea18	Nc/dec22/runnable graph lambda (#15078 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-22 14:36:46 -08:00
Ivan	59d4b80a92	[community]: Elasticsearch chat history encoding (#15055 ) - Added ensure_ascii property to ElasticsearchChatMessageHistory <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Ivan Chetverikov <ivan.chetverikov@raftds.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-22 13:21:34 -08:00
Corey Brown	9e492620d4	Don't reassign chunk_type (#14923 ) Description: The parameter chunk_type was being hard coded to "extractive_answers", so that when "snippet" was being passed, it was being ignored. This change simply doesn't do that.	2023-12-22 13:20:53 -08:00
Takuya Igei	6da2246215	Add support Vertex AI Gemini uses a public image URL (#14949 ) ## What Since `langchain_google_genai.ChatGoogleGenerativeAI` supported A public image URL, we add to support it in `langchain.chat_models.ChatVertexAI` as well. ### Example ```py from langchain.chat_models.vertexai import ChatVertexAI from langchain_core.messages import HumanMessage llm = ChatVertexAI(model_name="gemini-pro-vision") image_message = { "type": "image_url", "image_url": { "url": "https://python.langchain.com/assets/images/cell-18-output-1-0c7fb8b94ff032d51bfe1880d8370104.png", }, } text_message = { "type": "text", "text": "What is shown in this image?", } message = HumanMessage(content=[text_message, image_message]) output = llm([message]) print(output.content) ``` ## Refs - https://python.langchain.com/docs/integrations/llms/google_vertex_ai_palm - https://python.langchain.com/docs/integrations/chat/google_generative_ai	2023-12-22 13:19:09 -08:00
Archan Ghosh	affa3e755a	Update arxiv.py with get_summaries_as_docs inside of Arxivloader (#14953 ) Added the call function get_summaries_as_docs inside of Arxivloader - Description: Added a function that returns the documents from get_summaries_as_docs, as the call signature is present in the parent file but never used from Arxivloader, this can be used from Arxivloader itself just like .load() as both the signatures are same. - Issue: Reduces time to load papers as no pdf is processed only metadata is pulled from Arxiv allowing users for faster load times on bulk loads. Users can then choose one or more paper and use ID directly with .load() to load pdf thereby loading all the contents of the paper.	2023-12-22 13:14:22 -08:00
Sypherd	d4f45b1421	core(minor): Allow explicit types for ChatMessageHistory adds (#14967 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> ## Description Changes the behavior of `add_user_message` and `add_ai_message` to allow for messages of those types to be passed in. Currently, if you want to use the `add_user_message` or `add_ai_message` methods, you have to pass in a string. For `add_message` on `ChatMessageHistory`, however, you have to pass a `BaseMessage`. This behavior seems a bit inconsistent. Personally, I'd love to be able to be explicit that I want to `add_user_message` and pass in a `HumanMessage` without having to grab the `content` attribute. This PR allows `add_user_message` to accept `HumanMessage`s or `str`s and `add_ai_message` to accept `AIMessage`s or `str`s to add that functionality and ensure backwards compatibility. ## Issue * None ## Dependencies * None ## Tag maintainer @hinthornw @baskaryan ## Note `make test` results in `make: *** No rule to make target 'test'. Stop.`	2023-12-22 13:12:01 -08:00
ccurme	f2782f4c86	community: add args_schema to GmailSendMessage (#14973 ) - Description: `tools.gmail.send_message` implements a `SendMessageSchema` that is not used anywhere. `GmailSendMessage` also does not have an `args_schema` attribute (this led to issues when invoking the tool with an OpenAI functions agent, at least for me). Here we add the missing attribute and a minimal test for the tool. - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: Chester Curme <chestercurme@microsoft.com>	2023-12-22 13:07:44 -08:00
Philip Kiely - Baseten	6342da333a	community: refactor Baseten integration with new API endpoints & docs (#15017 ) - Description: In response to user feedback, this PR refactors the Baseten integration with updated model endpoints, as well as updates relevant documentation. This PR has been tested by end users in production and works as expected. - Issue: N/A - Dependencies: This PR actually removes the dependency on the `baseten` package! - Twitter handle: https://twitter.com/basetenco	2023-12-22 12:46:24 -08:00
Blane Honeycutt	3fc1b3553b	Community: Adds ability to pass a Config to the boto3 client used by Bedrock (#15029 ) # Description This PR adds the ability to pass a `botocore.config.Config` instance to the boto3 client instantiated by the Bedrock LLM. Currently, the Bedrock LLM doesn't support a way to pass a Config, which means that some settings (e.g., timeouts and retry configuration) require instantiating a new boto3 client with a Config and then replacing the LLM's client: ```python llm = Bedrock( region_name='us-west-2', model_id="anthropic.claude-v2", model_kwargs={'max_tokens_to_sample': 4096, 'temperature': 0}, ) llm.client = boto_client('bedrock-runtime', region_name='us-west-2', config=Config({'read_timeout': 300})) ``` # Issue N/A # Dependencies N/A	2023-12-22 12:42:56 -08:00
Grzegorz Sajko	dc71fcfabf	corrected outdated link (#15053 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-22 12:39:38 -08:00
chyroc	0e149bbb4c	Improve: remove extra spaces in get_from_env error (#15064 )	2023-12-22 11:50:03 -08:00
Ran	c3f8733aef	fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647 ) fix spellings seperate -> separate: found more occurrences, see https://github.com/langchain-ai/langchain/pull/14602 initialise -> intialize: the latter is more common in the repo pre-defined > predefined: adding a comma after a prefix is a delicate matter, but this is a generally accepted word also, another word that appears in the repo is "fs" (stands for filesystem), e.g., in `libs/core/langchain_core/prompts/loading.py` ` """Unified method for loading a prompt from LangChainHub or local fs."""` Isn't "filesystem" better?	2023-12-22 11:49:35 -08:00
chyroc	86d27fd684	Fix: fix partners name typo in tests (#15066 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Ran <rccalman@gmail.com>	2023-12-22 11:48:39 -08:00
Harrison Chase	2e159931ac	add defaults for tavily (#15075 )	2023-12-22 11:48:26 -08:00
chyroc	4440ec5ab3	Refactor: use SecretStr for minimax embeddings (#15067 )	2023-12-22 11:43:23 -08:00
chyroc	aa19ca9723	Refactor: use SecretStr for jina embeddings (#15068 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-22 11:42:29 -08:00
Nuno Campos	7d5800ee51	Add Runnable.get_graph() to get a graph representation of a Runnable (#15040 ) It can be drawn in ascii with Runnable.get_graph().draw()	2023-12-22 11:40:45 -08:00
Eugene Yurtsev	aad3d8bd47	langchain(patch): Restrict paths in LocalFileStore cache (#15065 ) This PR restricts the paths that can be resolve using the local file system cache so that all paths must be contained within the root path.	2023-12-22 11:20:17 -05:00
Michael Goin	501cc8311d	community[patch]: Fix generation_config not setting properly for DeepSparse (#15036 ) - Description: Tiny but important bugfix to use a more stable interface for specifying generation_config parameters for DeepSparse LLM	2023-12-22 01:39:22 -05:00
QIAN Zifei	2460f977c5	community[minor]: Azure DocumentIntelligenceLoader/Parser support update with latest SDK (#14389 ) - Description: Add DocumentIntelligenceLoader & DocumentIntelligenceParser implementation using the latest Azure Document Intelligence SDK with markdown support. The core logic resides in DocumentIntelligenceParser and DocumentIntelligenceLoader is a mere wrapper of the parser. The parser will takes api_endpoint and api_key and creates DocumentIntelligenceClient for the user. 4 parsing modes are supported: 1. Markdown (default) 2. Single 3. Page 4. Object UT and notebook are also updated accordingly. - Dependencies: Azure Document Intelligence SDK: azure-ai-documentintelligence [azure-sdk-for-python/sdk/documentintelligence/azure-ai-documentintelligence at 7c42462ac662522a6fd21b17d2a20f4cd40d0356 · Azure/azure-sdk-for-python (github.com)](https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FAzure%2Fazure-sdk-for-python%2Ftree%2F7c42462ac662522a6fd21b17d2a20f4cd40d0356%2Fsdk%2Fdocumentintelligence%2Fazure-ai-documentintelligence&data=05%7C01%7CZifei.Qian%40microsoft.com%7C298225aa3e31468a863108dbf07374ff%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C638368150928704292%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=oE0Sl4HERnMKdbkV9KgBV46Z2xytcQAShdTWf7ZNl%2Bs%3D&reserved=0). --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-21 16:40:27 -08:00
Ran	129a929d69	infra: Fix test filesystem paths incompatible with windows (#14388 ) - Description: This PR fixes test failures on Windows caused by path handling differences and unescaped special characters in regex. The failing tests are: ``` FAILED tests/unit_tests/storage/test_filesystem.py::test_yield_keys - AssertionError: assert ['key1', 'subdir\\key2'] == ['key1', 'subdir/key2'] FAILED tests/unit_tests/test_imports.py::test_importable_all - ModuleNotFoundError: No module named 'langchain_community.langchain_community\\adapters' FAILED tests/unit_tests/tools/file_management/test_utils.py::test_get_validated_relative_path_errs_on_absolute - re.error: incomplete escape \U at position 53 FAILED tests/unit_tests/tools/file_management/test_utils.py::test_get_validated_relative_path_errs_on_parent_dir - re.error: incomplete escape \U at position 69 FAILED tests/unit_tests/tools/file_management/test_utils.py::test_get_validated_relative_path_errs_for_symlink_outside_root - re.error: incomplete escape \U at position 64 ``` - Issue: fixes https://github.com/langchain-ai/langchain/issues/11775 (partially) - Dependencies: none	2023-12-21 13:45:42 -08:00
Nuno Campos	71076cceaf	Move json and xml parsers to core (#15026 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-21 12:36:56 -08:00
Nuno Campos	d5533b7081	Add option to make messages placeholder optional (#15031 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-21 12:36:37 -08:00
Bagatur	40f42b8947	community[patch]: Release 0.0.6 (#15023 )	2023-12-21 14:37:44 -05:00
Bagatur	7eb1100925	core[patch]: Release 0.1.3 (#15022 )	2023-12-21 14:35:15 -05:00
Nuno Campos	63e512b680	Implement streaming for all list output parsers (#14981 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-21 11:30:35 -08:00
Nuno Campos	b471166df7	Implement streaming for xml output parser (#14984 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-21 11:30:18 -08:00
Jacob Lee	1b01ee0e3c	community[minor]: add hf chat wrapper (#14736 ) Builds on #14040 with community refactor merged and notebook updated. Note that with this refactor, models will be imported from `langchain_community.chat_models.huggingface` rather than the main `langchain` repo. --------- Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> Signed-off-by: Yuchen Liang <yuchenl3@andrew.cmu.edu> Co-authored-by: Andrew Reed <andrew.reed.r@gmail.com> Co-authored-by: Andrew Reed <areed1242@gmail.com> Co-authored-by: A-Roucher <aymeric.roucher@gmail.com> Co-authored-by: Aymeric Roucher <69208727+A-Roucher@users.noreply.github.com>	2023-12-21 12:28:30 -05:00
Leonid Kuligin	b99274c9d8	community[patch]: changed default for VertexAIEmbeddings (#14614 ) Replace this entire comment with: - Description: @kurtisvg has raised a point that it's a good idea to have a fixed version for embeddings (since otherwise a user might run a query with one version vs a vectorstore where another version was used). In order to avoid breaking changes, I'd suggest to give users a warning, and make a `model_name` a required argument in 1.5 months.	2023-12-21 12:15:19 -05:00
Karim Lalani	228ddabc3b	community: fix for surrealdb client 0.3.2 update + store and retrieve metadata (#14997 ) Surrealdb client changes from 0.3.1 to 0.3.2 broke the surrealdb vectore integration. This PR updates the code to work with the updated client. The change is backwards compatible with previous versions of surrealdb client. Also expanded the vector store implementation to store and retrieve metadata that's included with the document object.	2023-12-21 12:04:57 -05:00
JaguarDB	ca0a75e1fc	community[patch]: JaguarHttpClient conditional import (#14985 ) - Description: Fixed jaguar.py to import JaguarHttpClient with try and catch - Issue: the issue # Unable to use the JaguarHttpClient at run time - Dependencies: It requires "pip install -U jaguardb-http-client" - Twitter handle: workbot --------- Co-authored-by: JY <jyjy@jaguardb> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 19:11:57 -08:00
Michael Landis	1c934fff0e	community[patch]: support momento vector index filter expressions (#14978 ) Description For the Momento Vector Index (MVI) vector store implementation, pass through `filter_expression` kwarg to the MVI client, if specified. This change will enable the MVI self query implementation in a future PR. Also fixes some integration tests.	2023-12-20 19:11:43 -08:00
Yacine	300c1cbf92	community[patch]: Fix typo in class Docstring (#14982 ) - Description: Fix typo in class Docstring to replace AZURE_OPENAI_API_ENDPOINT by AZURE_OPENAI_ENDPOINT - Issue: the issue #14901 - Dependencies: NA - Twitter handle: Co-authored-by: Yacine Bouakkaz <Yacine.Bouakkaz@evokegroup.com>	2023-12-20 19:03:45 -08:00
chyroc	57d1eb733f	core[patch]: update langchain-core runtime library name (#14884 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-20 14:35:48 -08:00
Quy Tang	42822484ef	core(minor): Implement stream and astream for RunnableBranch (#14805 ) * This PR adds `stream` implementations to Runnable Branch. * Runnable Branch still does not support `transform` so it'll break streaming if it happens in middle or end of sequence, but will work if happens at beginning of sequence. * Fixes use the async callback manager for async methods * Handle BaseException rather than Exception, so more errors could be logged as errors when they are encountered --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-12-20 15:37:56 -05:00
MING KANG	ed5e0cfe57	community: add OCI Endpoint (#14250 ) - Description: - [OCI Data Science](https://docs.oracle.com/en-us/iaas/data-science/using/home.htm) is a fully managed and serverless platform for data science teams to build, train, and manage machine learning models in the Oracle Cloud Infrastructure. This PR add integration for using LangChain with an LLM hosted on a [OCI Data Science Model Deployment](https://docs.oracle.com/en-us/iaas/data-science/using/model-dep-about.htm). To authenticate, [oracle-ads](https://accelerated-data-science.readthedocs.io/en/latest/user_guide/cli/authentication.html) has been used to automatically load credentials for invoking endpoint. - Issue: None - Dependencies: `oracle-ads` - Tag maintainer: @baskaryan - Twitter handle: None --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-20 11:52:20 -08:00
Erick Friis	75ba22793f	community: Vectara summarization (#14970 ) Description: Adding Summarization to Vectara, to reflect it provides not only vector-store type functionality but also can return a summary. Also added: MMR capability (in the Vectara platform side) Updated templates Updated documentation and IPYNB examples Tag maintainer: @baskaryan Twitter handle: @ofermend --------- Co-authored-by: Ofer Mendelevitch <ofermend@gmail.com>	2023-12-20 11:51:33 -08:00
Liang Zhang	6479aab74f	community[patch]: Add param "task" to Databricks LLM to work around serialization of transform_output_fn (#14933 ) What is the reproduce code? ```python from langchain.chains import LLMChain, load_chain from langchain.llms import Databricks from langchain.prompts import PromptTemplate def transform_output(response): # Extract the answer from the responses. return str(response["candidates"][0]["text"]) def transform_input(request): full_prompt = f"""{request["prompt"]} Be Concise. """ request["prompt"] = full_prompt return request chat_model = Databricks( endpoint_name="llama2-13B-chat-Brambles", transform_input_fn=transform_input, transform_output_fn=transform_output, verbose=True, ) print(f"Test chat model: {chat_model('What is Apache Spark')}") # This works llm_chain = LLMChain(llm=chat_model, prompt=PromptTemplate.from_template("{chat_input}")) llm_chain("colorful socks") # this works llm_chain.save("databricks_llm_chain.yaml") # transform_input_fn and transform_output_fn are not serialized into the model yaml file loaded_chain = load_chain("databricks_llm_chain.yaml") # The Databricks LLM is recreated with transform_input_fn=None, transform_output_fn=None. loaded_chain("colorful socks") # Thus this errors. The transform_output_fn is needed to produce the correct output ``` Error: ``` File "/local_disk0/.ephemeral_nfs/envs/pythonEnv-6c34afab-3473-421d-877f-1ef18930ef4d/lib/python3.10/site-packages/pydantic/v1/main.py", line 341, in __init__ raise validation_error pydantic.v1.error_wrappers.ValidationError: 1 validation error for Generation text str type expected (type=type_error.str) request payload: {'query': 'What is a databricks notebook?'}'} ``` What does the error mean? When the LLM generates an answer, represented by a Generation data object. The Generation data object takes a str field called text, e.g. Generation(text=”blah”). However, the Databricks LLM tried to put a non-str to text, e.g. Generation(text={“candidates”:[{“text”: “blah”}]}) Thus, pydantic errors. Why the output format becomes incorrect after saving and loading the Databricks LLM? Databrick LLM does not support serializing transform_input_fn and transform_output_fn, so they are not serialized into the model yaml file. When the Databricks LLM is loaded, it is recreated with transform_input_fn=None, transform_output_fn=None. Without transform_output_fn, the output text is not unwrapped, thus errors. Missing transform_output_fn causes this error. Missing transform_input_fn causes the additional prompt “Be Concise.” to be lost after saving and loading. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle:** we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 12:50:23 -05:00
Bagatur	1ea6d83188	langchain[patch]: Release 0.0.352 (#14961 )	2023-12-20 10:27:03 -05:00
Bagatur	b03845e069	community[patch]: Release 0.0.5 (#14960 )	2023-12-20 10:25:15 -05:00
Bagatur	a841f62791	core[patch]: 0.1.2 (#14959 )	2023-12-20 10:13:54 -05:00
Anush	60c70effe9	community[minor]: Qdrant sparse vector retriever (#14814 ) ## Description This PR intends to add support for Qdrant's new [sparse vector retrieval](https://qdrant.tech/articles/sparse-vectors/) by introducing a new retriever class, `QdrantSparseVectorRetriever`. Necessary usage docs and integration tests have been added for the retriever. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 02:22:19 -05:00
mogith-pn	c53fab63a3	community[patch]: Fixed duplicate input id issue in clarifai vectorstore (#14914 ) - Description: This PR fixes the issue faces with duplicate input id in Clarifai vectorstore class when ingesting documents into the vectorstore more than the batch size. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 02:21:36 -05:00
Sypherd	5642132c0c	community[patch]: Add safe lookup to OpenAI response adapter (#14765 ) ## Description Similar to https://github.com/langchain-ai/langchain/issues/5861, I've experienced `KeyError`s resulting from unsafe lookups in the `convert_dict_to_message` function in [this file](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/adapters/openai.py). While that issue focused on `KeyError 'content'`, I've opened another issue (#14764) about how the problem still exists in the same function but with `KeyError 'role'`. The fix for #5861 only added a safe lookup to the specific line that was giving them trouble.. This PR fixes the unsafe lookup in the rest of the function but the problem still exists across the repo. ## Issues * #14764 * #5861 ## Dependencies * None ## Checklist [x] make format [x] make lint [ ] make test - Results in `make: *** No rule to make target 'test'. Stop.` ## Maintainers * @hinthornw --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 01:17:23 -05:00
AlpinDale	b0588774f1	community[minor]: Add Aphrodite Engine support (#14759 ) This PR adds support for PygmalionAI's [Aphrodite Engine](https://github.com/PygmalionAI/aphrodite-engine), based on vLLM's attention mechanism. At the moment, this PR does not include support for the API servers, but they will be added in a later PR. The only dependency as of now is `aphrodite-engine==0.4.2`. We pin the version to prevent breakage due to changes in the aphrodite-engine library. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 01:16:57 -05:00
Dmitry Tyumentsev	d21f44b484	community[minor]: Add YandexGPT embeddings (#14767 ) - Description: Introducing an ability to work with the [YandexGPT](https://cloud.yandex.com/en/services/yandexgpt) embeddings models. --------- Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-20 01:11:07 -05:00
Nicolas Suzor	529144649e	community[patch]: add png support for vertexai._parse_chat_history_gemini() (#14788 ) - Description: Modify community chat model vertexai to handle png and other image types encoded in base64 - Dependencies: added `import re` but no new dependencies. This addresses a problem where the vertexai method _parse_chat_history_gemini() was only recognizing image uris in jpeg format. I made a simple change to cover other extension types.	2023-12-20 00:58:39 -05:00
Liu Jun	b0c48dc983	community[patch]: make ak and sk optional in qianfan endpoint (#14835 ) - Description: The Qianfan SDK offers multiple authentication methods, but in the `QianfanEndpoint` of Langchain, it currently only supports authentication through AK and SK. In order to accommodate users who wish to use alternative authentication methods, this pull request makes AK and SK optional. This change should not impact existing users, while allowing users to configure other authentication methods as per the Qianfan SDK documentation. - Issue: / - Dependencies: No - Tag maintainer: No - Twitter handle:	2023-12-20 00:49:33 -05:00
Archan Ghosh	65678b3816	community[patch]: Update arxiv.py with Entry ID as a return value (#14915 ) Added Entry ID as a return value inside get_summaries_as_docs - Description: Added the Entry ID as a return, so it's easier to track the IDs of the papers that are being returned. With the addition return of the entry ID in functions like ArxivRetriever, it will be easier to reference the ID of the paper itself.	2023-12-20 00:30:24 -05:00
thehunmonkgroup	dc20766513	docs: readme for langchain-mistralai (#14917 ) - Description: Add README doc for MistralAI partner package. - Tag maintainer: @baskaryan	2023-12-20 00:22:43 -05:00
Bagatur	345acb26ac	community[patch]: Matching engine, return doc id (#14930 )	2023-12-20 00:03:11 -05:00
Erick Friis	8a3360edf6	anthropic: beta messages integration (#14928 )	2023-12-19 18:55:19 -08:00
Erick Friis	795cf2ddda	together: package and embedding model (#14936 )	2023-12-19 18:48:32 -08:00
Erick Friis	8b29b31554	cli: test_integration group (#14924 )	2023-12-19 12:09:04 -08:00
Erick Friis	4d48aedea3	cli: 0.0.20 (#14920 )	2023-12-19 11:56:21 -08:00
Erick Friis	9ef2feb674	cli[patch]: add embedding to integration template (#14881 )	2023-12-19 09:58:21 -08:00
Michael Feil	7b96de3d5d	community[patch]: update Gradient embeddings (#14846 ) - Description: Going forward, we have a own API `pip install gradientai`. Therefore gradually removing the self-build packages in llamaindex, haystack and langchain. - Issue: None. - Dependencies: `pip install gradientai` - Tag maintainer: @michaelfeil	2023-12-19 11:46:33 -05:00
Igor Dvorkin	6cc3c2452c	community[patch]: Enhance iMessage chat loader with timestamp parsing and message ownership (#14804 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 11:09:01 -05:00
Mohammad Mohtashim	e3abe12243	community[patch]: helpful error message for GitHubAPIWrapper (#14803 ) Very simple change in relation to the issue https://github.com/langchain-ai/langchain/issues/14550 @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 11:08:06 -05:00
Dmitry Tyumentsev	50381abc42	community[patch]: Add retry logic to Yandex GPT API Calls (#14907 ) Description: Added logic for re-calling the YandexGPT API in case of an error --------- Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-19 10:51:42 -05:00
Sirjanpreet Singh Banga	425e5e1791	community[minor]: rename ChatGPTRouter to GPTRouter (#14913 ) Description:: Rename integration to GPTRouter Tag maintainer: @Gupta-Anubhav12 @samanyougarg @sirjan-ws-ext Twitter handle: [@SamanyouGarg](https://twitter.com/SamanyouGarg)	2023-12-19 10:48:52 -05:00
JaguarDB	992b04e475	community[minor]: added jaguar vector store (#14838 ) Description: A new vector store Jaguar is being added. Class, test scripts, and documentation is added. Issue: None -- This is the first PR contributing to LangChain Dependencies: This depends on "pip install -U jaguardb-http-client" client http package Tag maintainer: @baskaryan, @eyurtsev, @hwchase1 Twitter handle: @workbot --------- Co-authored-by: JY <jyjy@jaguardb> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 10:40:18 -05:00
Bagatur	a5be9f9475	mistralai: Add langchain-mistralai partner package (#14783 ) Co-authored-by: Chad Phillips <chad@apartmentlines.com>	2023-12-19 10:34:19 -05:00
Sirjanpreet Singh Banga	44cb899a93	community[minor]: Integrating GPTRouter (#14900 ) Description: Adding a langchain integration for [GPTRouter](https://gpt-router.writesonic.com/) 🚀 , Tag maintainer: @Gupta-Anubhav12 @samanyougarg @sirjan-ws-ext Twitter handle: [@SamanyouGarg](https://twitter.com/SamanyouGarg) Integration Tests Passing: <img width="1137" alt="Screenshot 2023-12-19 at 5 45 31 PM" src="https://github.com/Writesonic/langchain/assets/151817113/4a59df9a-ee30-47aa-9df9-b8c4eeb9dc76">	2023-12-19 10:08:36 -05:00
Bagatur	1069a93d18	langchain[patch]: export sagemaker LLMContentHandler (#14906 ) Resolves #14904	2023-12-19 10:00:32 -05:00
Leonid Ganeline	b2fd41331e	docs: docstrings `langchain_community` update (#14889 ) Addded missed docstrings. Fixed inconsistency in docstrings. Note CC @efriis There were PR errors on `langchain_experimental/prompt_injection_identifier/hugging_face_identifier.py` But, I didn't touch this file in this PR! Can it be some cache problems? I fixed this error.	2023-12-19 08:58:24 -05:00
William FH	583696732c	[Partner] NVIDIA TRT Package (#14733 ) Simplify #13976 and add as a separate package. - [] Add README - [X] Add doc notebook - [X] Add simple LLM integration --------- Co-authored-by: Jeremy Dyer <jdye64@gmail.com>	2023-12-18 19:08:25 -08:00
William FH	0d4cbbcc85	[Partner] Update google integration test (#14883 ) Gemini has decided that pickle rick is unsafe: https://github.com/langchain-ai/langchain/actions/runs/7256642294/job/19769249444#step:8:189 ![image](https://github.com/langchain-ai/langchain/assets/13333726/cfbf4312-53b6-4290-84ee-6ce0742e739e)	2023-12-18 18:46:24 -08:00
William FH	f88af1f1cd	[Partner] Google GenAi new release (#14882 ) to support the system message merging Also fix integration tests that weren't passing	2023-12-18 18:35:57 -08:00
Leonid Kuligin	2d0f1cae8c	added history and support for system_message as param (#14824 ) - Description: added support for chat_history for Google GenerativeAI (to actually use the `chat` API) plus since Gemini currently doesn't have a support for SystemMessage, added support for it only if a user provides additional `convert_system_message_to_human` flag during model initialization (in this case, SystemMessage would be prepanded to the first HumanMessage) - Issue: #14710 - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: lkuligin --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2023-12-18 18:23:14 -08:00
Oleksandr Yaremchuk	d82a3828f2	Improve prompt injection detection (#14842 ) - Description: This is addition to [my previous PR](https://github.com/langchain-ai/langchain/pull/13930) with improvements to flexibility allowing different models and notebook to use ONNX runtime for faster speed. Since the last PR, [our model](https://huggingface.co/laiyer/deberta-v3-base-prompt-injection) got more than 660k downloads, and with the [public benchmark](https://huggingface.co/spaces/laiyer/prompt-injection-benchmark) showed much fewer false-positives than the previous one from deepset. Additionally, on the ONNX runtime, it can be running 3x faster on the CPU, which might be handy for builders using Langchain. Issue: N/A - Dependencies: N/A - Tag maintainer: N/A - Twitter handle: `@laiyer_ai`	2023-12-18 17:50:21 -08:00
abhjaw	6fbd068b3f	Update kendra.py to avoid Kendra query ValidationException (#14866 ) Fixing issue - https://github.com/langchain-ai/langchain/issues/14494 to avoid Kendra query ValidationException <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: Update kendra.py to avoid Kendra query ValidationException, - Issue: the issue #https://github.com/langchain-ai/langchain/issues/14494, - Dependencies: None, - Tag maintainer: , - Twitter handle: If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-18 17:46:18 -08:00
Leonid Ganeline	6577b0d987	docstrings `langchain` update (#14870 ) Added missed docstrings	2023-12-18 17:16:08 -08:00
Kane Sweet	ea331f3136	Fix token text splitter duplicates (#14848 ) - Description: - Add a break case to `text_splitter.py::split_text_on_tokens()` to avoid unwanted item at the end of result. - Add a testcase to enforce the behavior. - Issue: - #14649 - #5897 - Dependencies: n/a, --- Quick illustration of change: ``` text = "foo bar baz 123" tokenizer = Tokenizer( chunk_overlap=3, tokens_per_chunk=7 ) output = split_text_on_tokens(text=text, tokenizer=tokenizer) ``` output before change: `["foo bar", "bar baz", "baz 123", "123"]` output after change: `["foo bar", "bar baz", "baz 123"]`	2023-12-18 17:15:57 -08:00
Leonid Ganeline	14d04180eb	docstrings `core` update (#14871 ) Added missed docstrings	2023-12-18 17:13:35 -08:00
Erick Friis	5f839beab9	community: replace deprecated davinci models (#14860 ) This is technically a breaking change because it'll switch out default models from `text-davinci-003` to `gpt-3.5-turbo-instruct`, but OpenAI is shutting off those endpoints on 1/4 anyways. Feels less disruptive to switch out the default instead.	2023-12-18 13:49:46 -08:00
Harrison Chase	193f107cb5	add methods to deserialize prompts that were old (#14857 )	2023-12-18 13:45:08 -08:00
Bagatur	714bef0cb6	langchain[patch]: Release 0.0.351 (#14867 )	2023-12-18 16:41:48 -05:00
Bagatur	61ad0e8be9	community[patch]: Release 0.0.4 (#14864 )	2023-12-18 16:08:08 -05:00
Bob Lin	5de1dc72b9	community[patch]: Update Tongyi default model_name (#14844 ) <img width="1305" alt="Screenshot 2023-12-18 at 9 54 01 PM" src="https://github.com/langchain-ai/langchain/assets/10000925/c943fd81-cd48-46eb-8dff-4680424d9ba9"> The current model is no longer available.	2023-12-18 11:35:53 -05:00
William FH	5fc2c578cf	[Bugfix] Ensure tool output is a str, for OAI Assistant (#14830 ) Tool outputs have to be strings apparently. Ensure they are formatted correctly before passing as intermediate steps. ``` BadRequestError: Error code: 400 - {'error': {'message': '1 validation error for Request\nbody -> tool_outputs -> 0 -> output\n str type expected (type=type_error.str)', 'type': 'invalid_request_error', 'param': None, 'code': None}} ```	2023-12-17 20:02:18 -08:00
William FH	bbc98a234d	Update parser (#14831 ) Gpt-3.5 sometimes calls with empty string arguments instead of `{}` I'd assume it's because the typescript representation on their backend makes it a bit ambiguous.	2023-12-17 20:02:07 -08:00
Vlad Kolesnikov	11fda490ca	community[minor]: New model parameters and dynamic batching for VertexAIEmbeddings (#13999 ) - Description: VertexAIEmbeddings performance improvements - Twitter handle: @vladkol ## Improvements - Dynamic batch size, starting from 250, lowering down to 5. Batch size varies across regions. Some regions support larger batches, and it significantly improves performance. When running large batches of texts in `us-central1`, performance gain can be up to 3.5x. The dynamic batching also makes sure every batch is below 20K token limit. - New model parameter `embeddings_type` that translates to `task_type` parameter of the API. Newer model versions support [different embeddings task types](https://cloud.google.com/vertex-ai/docs/generative-ai/embeddings/get-text-embeddings#api_changes_to_models_released_on_or_after_august_2023).	2023-12-17 22:24:22 -05:00
William FH	2d91d2b978	community: Add logprobs in gen output (#14826 ) Now that it's supported again for OAI chat models . Shame this wouldn't include it in the `.invoke()` output though (it's not included in the message itself). Would need to do a follow-up for that to be the case	2023-12-17 20:59:27 -05:00
Dmitry Tyumentsev	78ae276df7	community[patch]: fix agenerate return value (#14815 ) Fixed: - `_agenerate` return value in the YandexGPT Chat Model - duplicate line in the documentation Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-17 16:40:59 -05:00
sujeet	f1d3f29bc4	community[patch]: support for Sybase SQL anywhere added. (#14821 ) - Description: support for Sybase SQL anywhere added in sql_database.py file at path langchain\libs\community\langchain_community\utilities - Issue: It will resolve default schema setting for Sybase SQL anywhere - Dependencies: No, - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17, - Twitter handle: NA --------- Co-authored-by: learn360sujeet <121271779+learn360sujeet@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-17 16:39:44 -05:00
Erick Friis	8a07c56313	docs: developer docs (#14776 ) Builds out a developer documentation section in the docs - Links it from contributing.md - Adds an initial guide on how to contribute an integration --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-17 12:55:49 -08:00
William FH	01693b291e	Permit updates in indexing (#14482 )	2023-12-16 13:34:33 -08:00
Noah Stapp	34e6f3ff72	community[patch]: Implement similarity_score_threshold for MongoDB Vector Store (#14740 ) Adds the option for `similarity_score_threshold` when using `MongoDBAtlasVectorSearch` as a vector store retriever. Example use: ``` vector_search = MongoDBAtlasVectorSearch.from_documents(...) qa_retriever = vector_search.as_retriever( search_type="similarity_score_threshold", search_kwargs={ "score_threshold": 0.5, } ) qa = RetrievalQA.from_chain_type( llm=OpenAI(), chain_type="stuff", retriever=qa_retriever, ) docs = qa({"query": "..."}) ``` I've tested this feature locally, using a MongoDB Atlas Cluster with a vector search index.	2023-12-15 16:49:21 -08:00
Dmitry Tyumentsev	dcead816df	community[patch]: Update YandexGPT API (#14773 ) Update LLMand Chat model to use new api version --------- Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-15 16:25:09 -08:00
Lance Martin	42421860bc	Add image support for Ollama (#14713 ) Support [LLaVA](https://ollama.ai/library/llava): * Upgrade Ollama * `ollama pull llava` Ensure compatibility with [image prompt template](https://github.com/langchain-ai/langchain/pull/14263) --------- Co-authored-by: jacoblee93 <jacoblee93@gmail.com>	2023-12-15 16:00:55 -08:00
Harrison Chase	16399fd61d	langchain[patch]: remove unused imports (#14680 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-15 14:12:02 -08:00
Karim Lalani	a0064330b1	community[minor]: Add SurrealDB vectorstore (#13331 ) Description: Vectorstore implementation around [SurrealDB](https://www.surrealdb.com) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-15 13:34:51 -08:00
William FH	c5296fd42c	[Documentation] Updates to NVIDIA Playground/Foundation Model naming.… (#14770 ) … (#14723) - Description: Minor updates per marketing requests. Namely, name decisions (AI Foundation Models / AI Playground) - Tag maintainer: @hinthornw Do want to pass around the PR for a bit and ask a few more marketing questions before merge, but just want to make sure I'm not working in a vacuum. No major changes to code functionality intended; the PR should be for documentation and only minor tweaks. Note: QA model is a bit borked across staging/prod right now. Relevant teams have been informed and are looking into it, and I'm placeholdered the response to that of a working version in the notebook. Co-authored-by: Vadim Kudlay <32310964+VKudlay@users.noreply.github.com>	2023-12-15 12:21:59 -08:00
William FH	4855964332	Fix OAI Tool Message (#14746 ) See format here: https://platform.openai.com/docs/guides/function-calling/parallel-function-calling It expects a "name" argument, which we aren't providing by default. ![image](https://github.com/langchain-ai/langchain/assets/13333726/7cd82978-337c-40a1-b099-3bb25cd57eb4) Alternative is to add the 'name' field directly to the message if people prefer.	2023-12-15 06:45:09 -08:00
William FH	e3132a7efc	[Evals] End project (#14324 ) Also does some cleanup. Now that we support updating/ending projects, do this automatically. Then you can edit the name of the project in the app.	2023-12-15 00:05:34 -08:00
William FH	93c7eb4e6b	[Tracing] String Stacktrace (#14131 ) Add full stacktrace	2023-12-14 22:15:07 -08:00
Leonid Kuligin	7f42811e14	google-genai[patch], community[patch]: Added support for new Google GenerativeAI models (#14530 ) Replace this entire comment with: - Description: added support for new Google GenerativeAI models - Twitter handle: lkuligin --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-14 20:56:46 -08:00
Bagatur	b802dd96f2	core[patch]: Release 0.1.1 (#14738 )	2023-12-14 16:02:19 -08:00
William FH	9d4100f915	Revert "[Hub\|tracing] Tag hub prompts" (#14735 ) Reverts langchain-ai/langchain#14720	2023-12-14 14:39:58 -08:00
Erick Friis	9fb26a2a71	community[patch]: fix pgvector sqlalchemy (#14726 ) Fixes #14699	2023-12-14 13:27:30 -08:00
Bagatur	1cec0afc62	google-genai[patch]: add google-genai integration deps and extras (#14731 )	2023-12-14 13:20:10 -08:00
William FH	852b9ca494	[Hub\|tracing] Tag hub prompts (#14720 ) If you're using the hub, you'll likely be interested in tracking the commit/object when tracing. This PR adds it to the config	2023-12-14 10:04:18 -08:00
William FH	451c5d1d8c	[Integration] NVIDIA AI Playground (#14648 ) Description: Added NVIDIA AI Playground Initial support for a selection of models (Llama models, Mistral, etc.) Dependencies: These models do depend on the AI Playground services in NVIDIA NGC. API keys with a significant amount of trial compute are available (10K queries as of the time of writing). H/t to @VKudlay	2023-12-13 19:46:37 -08:00
William FH	1e21a3f7ed	[Partner] Gemini Embeddings (#14690 ) Add support for Gemini embeddings in the langchain-google-genai package	2023-12-13 17:05:31 -08:00
Funkeke	ea99612caa	community[patch]: fix dashvector endpoint params error (#14484 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: fangkeke <3339698829@qq.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-13 14:38:27 -08:00
Bob Lin	dce3c74905	community[patch]: Correct type annotation for azure_ad_token_provider Closed: #14402 (#14432 ) Description Fix https://github.com/langchain-ai/langchain/issues/14402, Similar changes: https://github.com/langchain-ai/langchain/pull/14166 Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-13 14:37:39 -08:00
Fran Cirka	8a4162d15e	community[patch]: Fixed issue with importing Row from sqlalchemy (#14488 ) - Description: Fixed import of Row in cache.py, - Issue: the issue # #13464 https://creditone.us.to/langchain-ai/langchain/issues/13464, - Dependencies: None, - Twitter handle: @frankybridman Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-13 14:36:08 -08:00
Bagatur	47451951a1	core[patch]: Fix runnable with message history (#14629 ) Fix bug shown in #14458. Namely, that saving inputs to history fails when the input to base runnable is a list of messages	2023-12-13 14:25:35 -08:00
Bagatur	73382a579f	google-genai[patch]: Release 0.0.2 (#14677 )	2023-12-13 12:59:19 -08:00
Nuno Campos	a16f4a318f	\Fix tool_calls message merge (#14613 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-13 12:37:40 -08:00
William FH	405d111da6	[Partner] Add langchain-google-genai package (gemini) (#14621 ) Add a new ChatGoogleGenerativeAI class in a `langchain-google-genai` package. Still todo: add a deprecation warning in PALM --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-13 11:57:59 -08:00
Bagatur	4574749147	communty[patch]: Release 0.0.3 (#14673 )	2023-12-13 11:21:00 -08:00
Erick Friis	c5250f12c2	cli[patch]: unicode issue (#14672 ) Some operating systems compile template, resulting in unicode decode errors	2023-12-13 11:14:51 -08:00
William FH	75b8891399	Update Vertex AI to include Gemini (#14670 ) h/t to @lkuligin - Description: added new models on VertexAI - Twitter handle: @lkuligin --------- Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-13 10:45:02 -08:00
Erick Friis	858f4cbce4	cli[patch]: rc (#14667 )	2023-12-13 10:00:04 -08:00
Tomaz Bratanic	ea2616ae23	Fix RRF and lucene escape characters for neo4j vector store (#14646 ) * Remove Lucene special characters (fixes https://github.com/langchain-ai/langchain/issues/14232) * Fixes RRF normalization for hybrid search	2023-12-13 09:09:50 -08:00
Erick Friis	7e6ca3c2b9	cli[patch]: integration template (#14571 )	2023-12-13 08:55:30 -08:00
James Braza	b9ef92f2f4	Fixed `DeprecationWarning` for `PromptTemplate.from_file` module-level calls (#14468 ) Resolves https://github.com/langchain-ai/langchain/issues/14467	2023-12-12 17:43:27 -08:00
Chengzu Ou	df95abb7e7	docs: Add Databricks Vector Search example notebook (#14158 ) This PR adds an example notebook for the Databricks Vector Search vector store. It also adds an introduction to the Databricks Vector Search product on the Databricks's provider page. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-12 17:40:29 -08:00
葛尧	e780433f6b	Fix token_usage None issue in ChatOpenAI with local Chatglm2-6B (#14493 ) When using local Chatglm2-6B by changing OPENAI_BASE_URL to localhost, the token_usage in ChatOpenAI becomes None. This leads to an AttributeError when trying to access token_usage.items(). This commit adds a check to ensure token_usage is not None before accessing its items. This change prevents the AttributeError and allows ChatOpenAI to work seamlessly with a local Chatglm2-6B model, aligning with the way it operates with the OpenAI API. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-12 17:30:37 -08:00
Massimiliano Pronesti	6080c98108	fix(embeddings): huggingface hub embeddings and TEI (#14489 ) Description: This PR fixes `HuggingFaceHubEmbeddings` by making the API token optional (as in the client beneath). Most models don't require one. I also updated the notebook for TEI (text-embeddings-inference) accordingly as requested here #14288. In addition, I fixed a mistake in the POST call parameters. Tag maintainers: @baskaryan	2023-12-12 17:21:52 -08:00
Thomas B	b4e3e47c92	feat: Yaml output parser (#14496 ) ## Description New YAML output parser as a drop-in replacement for the Pydantic output parser. Yaml is a much more token-efficient format than JSON, proving to be ~35% faster and using the same percentage fewer completion tokens. ☑️ Formatted ☑️ Linted ☑️ Tested (analogous to the existing`test_pydantic_parser.py`) The YAML parser excels in situations where a list of objects is required, where the root object needs no key: ```python class Products(BaseModel): __root__: list[Product] ``` I ran the prompt `Generate 10 healthy, organic products` 10 times on one chain using the `PydanticOutputParser`, the other one using the`YamlOutputParser` with `Products` (see below) being the targeted model to be created. LLMs used were Fireworks' `lama-v2-34b-code-instruct` and OpenAI `gpt-3.5-turbo`. All runs succeeded without validation errors. ```python class Nutrition(BaseModel): sugar: int = Field(description="Sugar in grams") fat: float = Field(description="% of daily fat intake") class Product(BaseModel): name: str = Field(description="Product name") stats: Nutrition class Products(BaseModel): """A list of products""" products: list[Product] # Used `__root__` for the yaml chain ``` Stats after 10 runs reach were as follows: ### JSON ø time: 7.75s ø tokens: 380.8 ### YAML ø time: 5.12s ø tokens: 242.2 Looking forward to feedback, tips and contributions!	2023-12-12 17:04:31 -08:00
Bob Lin	a019183a01	create mypy cache dir if it doesn't exist (#14579 ) ### Description When running `make lint` multiple times, i can see the error `mkdir: .mypy_cache: File exists`. Use `mkdir -p` to solve this problem. <img width="1512" alt="Screenshot 2023-12-12 at 11 22 01 AM" src="https://github.com/langchain-ai/langchain/assets/10000925/1429383d-3283-4e22-8882-5693bc50b502">	2023-12-12 15:34:50 -08:00
dandanwei	e5bd88383f	fix a bug in RedisNum filter againt value 0 (#14587 ) - Description: There is a bug in RedisNum filter that filter towards value 0 will be parsed as "". This is a fix to it. - Issue:* NA - Dependencies: NA - Tag maintainer: NA - Twitter handle: NA	2023-12-12 15:34:45 -08:00
Lance Martin	282362382c	Minor update to ensemble retriever to handle a mix of Documents or str (#14552 )	2023-12-12 15:16:49 -08:00
Bagatur	ca7da8f7ef	docs: fix links in readme (#14624 )	2023-12-12 12:59:09 -08:00
Bagatur	2a10cabf66	docs: core and community readme (#14623 )	2023-12-12 12:52:32 -08:00
Bagatur	b72b19b593	experimental[patch]: Release 0.0.47 (#14617 )	2023-12-12 11:09:39 -08:00
Bagatur	57337b4862	langchain[patch]: Release 0.0.350 (#14612 )	2023-12-12 10:10:34 -08:00
Bagatur	d388863a3b	community[patch]: Release 0.0.2 (#14610 )	2023-12-12 09:58:04 -08:00
Bagatur	5d1deddbfb	core[minor]: Release 0.1.0 (#14607 )	2023-12-12 09:33:11 -08:00
Harrison Chase	ad8d8f71aa	allow other namespaces (#14606 )	2023-12-12 09:09:59 -08:00
Eugene Yurtsev	76905aa043	Update RunnableWithMessageHistory (#14351 ) This PR updates RunnableWithMessage history to support user specific configuration for the factory. It extends support to passing multiple named arguments into the factory if the factory takes more than a single argument.	2023-12-11 21:34:49 -05:00
Erick Friis	0a9d933bb2	infra: import checking bugfix (#14569 )	2023-12-11 15:53:51 -08:00
Bagatur	8bdaf55e92	experimental[patch]: Release 0.0.46 (#14572 )	2023-12-11 15:46:14 -08:00
Bagatur	14bfc5f9f4	langchain[patch]: Release 0.0.349 (#14570 )	2023-12-11 15:30:14 -08:00
Erick Friis	482e2b94fa	infra: import CI speed (#14566 ) Was taking 10 mins. Now a few seconds.	2023-12-11 15:19:21 -08:00
Bagatur	6a828e60ee	community[patch]: Release 0.0.1 (#14565 )	2023-12-11 15:18:55 -08:00
Erick Friis	5418d8bfd6	infra: import CI fix (#14562 ) TIL `**` globstar doesn't work in make Makefile changes fix that. `__getattr__` changes allow import of all files, but raise error when accessing anything from the module. file deletions were corresponding libs change from #14559	2023-12-11 14:59:10 -08:00
Bagatur	9cb128e6e2	core[patch]: Release 0.0.13 (#14558 )	2023-12-11 14:36:28 -08:00
Bagatur	a844b495c4	community[patch]: Fix agenttoolkits imports (#14559 )	2023-12-11 14:19:25 -08:00
Nuno Campos	3b5b0f16c6	Move runnable context to beta (#14507 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-11 13:58:30 -08:00
Bagatur	ed58eeb9c5	community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463 ) Moved the following modules to new package langchain-community in a backwards compatible fashion: ``` mv langchain/langchain/adapters community/langchain_community mv langchain/langchain/callbacks community/langchain_community/callbacks mv langchain/langchain/chat_loaders community/langchain_community mv langchain/langchain/chat_models community/langchain_community mv langchain/langchain/document_loaders community/langchain_community mv langchain/langchain/docstore community/langchain_community mv langchain/langchain/document_transformers community/langchain_community mv langchain/langchain/embeddings community/langchain_community mv langchain/langchain/graphs community/langchain_community mv langchain/langchain/llms community/langchain_community mv langchain/langchain/memory/chat_message_histories community/langchain_community mv langchain/langchain/retrievers community/langchain_community mv langchain/langchain/storage community/langchain_community mv langchain/langchain/tools community/langchain_community mv langchain/langchain/utilities community/langchain_community mv langchain/langchain/vectorstores community/langchain_community mv langchain/langchain/agents/agent_toolkits community/langchain_community mv langchain/langchain/cache.py community/langchain_community mv langchain/langchain/adapters community/langchain_community mv langchain/langchain/callbacks community/langchain_community/callbacks mv langchain/langchain/chat_loaders community/langchain_community mv langchain/langchain/chat_models community/langchain_community mv langchain/langchain/document_loaders community/langchain_community mv langchain/langchain/docstore community/langchain_community mv langchain/langchain/document_transformers community/langchain_community mv langchain/langchain/embeddings community/langchain_community mv langchain/langchain/graphs community/langchain_community mv langchain/langchain/llms community/langchain_community mv langchain/langchain/memory/chat_message_histories community/langchain_community mv langchain/langchain/retrievers community/langchain_community mv langchain/langchain/storage community/langchain_community mv langchain/langchain/tools community/langchain_community mv langchain/langchain/utilities community/langchain_community mv langchain/langchain/vectorstores community/langchain_community mv langchain/langchain/agents/agent_toolkits community/langchain_community mv langchain/langchain/cache.py community/langchain_community ``` Moved the following to core ``` mv langchain/langchain/utils/json_schema.py core/langchain_core/utils mv langchain/langchain/utils/html.py core/langchain_core/utils mv langchain/langchain/utils/strings.py core/langchain_core/utils cat langchain/langchain/utils/env.py >> core/langchain_core/utils/env.py rm langchain/langchain/utils/env.py ``` See .scripts/community_split/script_integrations.sh for all changes	2023-12-11 13:53:30 -08:00
Eugene Yurtsev	c0f4b95aa9	RunnableWithMessageHistory: Fix input schema (#14516 ) Input schema should not have history key	2023-12-10 23:33:02 -05:00
Harrison Chase	f5befe3b89	manual mapping (#14422 )	2023-12-08 16:29:33 -08:00
Erick Friis	c24f277b7c	langchain[patch], docs[patch]: use byte store in multivectorretriever (#14474 )	2023-12-08 16:26:11 -08:00
Anish Nag	6da0cfea0e	experimental[patch]: SmartLLMChain Output Key Customization (#14466 ) Description The `SmartLLMChain` was was fixed to output key "resolution". Unfortunately, this prevents the ability to use multiple `SmartLLMChain` in a `SequentialChain` because of colliding output keys. This change simply gives the option the customize the output key to allow for sequential chaining. The default behavior is the same as the current behavior. Now, it's possible to do the following: ``` from langchain.chat_models import ChatOpenAI from langchain.prompts import PromptTemplate from langchain_experimental.smart_llm import SmartLLMChain from langchain.chains import SequentialChain joke_prompt = PromptTemplate( input_variables=["content"], template="Tell me a joke about {content}.", ) review_prompt = PromptTemplate( input_variables=["scale", "joke"], template="Rate the following joke from 1 to {scale}: {joke}" ) llm = ChatOpenAI(temperature=0.9, model_name="gpt-4-32k") joke_chain = SmartLLMChain(llm=llm, prompt=joke_prompt, output_key="joke") review_chain = SmartLLMChain(llm=llm, prompt=review_prompt, output_key="review") chain = SequentialChain( chains=[joke_chain, review_chain], input_variables=["content", "scale"], output_variables=["review"], verbose=True ) response = chain.run({"content": "chickens", "scale": "10"}) print(response) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-08 13:55:51 -08:00
Erick Friis	b3f226e8f8	core[patch], langchain[patch], experimental[patch]: import CI (#14414 )	2023-12-08 11:28:55 -08:00
Eugene Yurtsev	37bee92b8a	Use deepcopy in RunLogPatch (#14244 ) This PR adds deepcopy usage in RunLogPatch. I included a unit-test that shows an issue that was caused in LangServe in the RemoteClient. ```python import jsonpatch s1 = {} s2 = {'value': []} s3 = {'value': ['a']} ops0 = list(jsonpatch.JsonPatch.from_diff(None, s1)) ops1 = list(jsonpatch.JsonPatch.from_diff(s1, s2)) ops2 = list(jsonpatch.JsonPatch.from_diff(s2, s3)) ops = ops0 + ops1 + ops2 jsonpatch.apply_patch(None, ops) {'value': ['a']} jsonpatch.apply_patch(None, ops) {'value': ['a', 'a']} jsonpatch.apply_patch(None, ops) {'value': ['a', 'a', 'a']} ```	2023-12-08 14:09:36 -05:00
Erick Friis	1d7e5c51aa	langchain[patch]: xfail unstable vertex test (#14462 )	2023-12-08 11:00:37 -08:00
Harrison Chase	02ee0073cf	revoke serialization (#14456 )	2023-12-08 10:31:05 -08:00
Erick Friis	1d725327eb	langchain[patch]: Fix scheduled testing (#14428 ) - integration tests in pyproject - integration test fixes	2023-12-08 10:23:02 -08:00
Harrison Chase	7be3eb6fbd	fix imports from core (#14430 )	2023-12-08 09:33:35 -08:00
Bagatur	52052cc7b9	experimental[patch]: Release 0.0.45 (#14418 )	2023-12-07 15:01:39 -08:00
Bagatur	e4d6e55c5e	langchain[patch]: Release 0.0.348 (#14417 )	2023-12-07 14:52:43 -08:00
Bagatur	eb209e7ee3	core[patch]: Release 0.0.12 (#14415 )	2023-12-07 14:37:00 -08:00
Bagatur	b2280fd874	core[patch], langchain[patch]: fix required deps (#14373 )	2023-12-07 14:24:58 -08:00
Kacper Łukawski	76f30f5297	langchain[patch]: Rollback multiple keys in Qdrant (#14390 ) This reverts commit `38813d7090`. This is a temporary fix, as I don't see a clear way on how to use multiple keys with `Qdrant.from_texts`. Context: #14378	2023-12-07 11:13:19 -08:00
Erick Friis	54040b00a4	langchain[patch]: fix ChatVertexAI streaming (#14369 )	2023-12-07 09:46:11 -08:00
Bagatur	db6bf8b022	langchain[patch]: Release 0.0.347 (#14368 )	2023-12-06 16:13:29 -08:00
Bagatur	a7271cf5bd	core[patch]: Release 0.0.11 (#14367 )	2023-12-06 15:53:49 -08:00
Nuno Campos	77c38df36c	[core/minor] Runnables: Implement a context api (#14046 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Brace Sproul <braceasproul@gmail.com>	2023-12-06 15:02:29 -08:00
Erick Friis	8f95a8206b	core[patch]: message history error typo (#14361 )	2023-12-06 14:20:10 -08:00
William FH	e5bd32ff6d	Include run_id (#14331 ) in the test run outputs	2023-12-06 14:07:45 -08:00
Bagatur	cc76f0e834	langchain[patch]: import nits (#14354 ) import from core instead of langchain.schema	2023-12-06 11:45:05 -08:00
Jacob Lee	867ca6d0be	Fix multi vector retriever subclassing (#14350 ) Fixes #14342 @eyurtsev @baskaryan --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-06 11:12:50 -08:00
Erick Friis	7bdfc43766	core[patch], langchain[patch]: ByteStore (#14312 )	2023-12-06 10:05:43 -08:00
Eugene Yurtsev	0dea8cc62d	Update doc-string in RunnableWithMessageHistory (#14262 ) Update doc-string in RunnableWithMessageHistory	2023-12-06 12:31:46 -05:00
Jean-Baptiste dlb	38813d7090	Qdrant metadata payload keys (#13001 ) - Description: In Qdrant allows to input list of keys as the content_payload_key to retrieve multiple fields (the generated document will contain the dictionary {field: value} in a string), - Issue: Previously we were able to retrieve only one field from the vector database when making a search - Dependencies: - Tag maintainer: - Twitter handle: @jb_dlb --------- Co-authored-by: Jean Baptiste De La Broise <jeanbaptiste.delabroise@mdpi.com>	2023-12-06 09:12:54 -08:00
Yuchen Liang	ad6dfb6220	feat: mask api key for cerebriumai llm (#14272 ) - Description: Masking API key for CerebriumAI LLM to protect user secrets. - Issue: #12165 - Dependencies: None - Tag maintainer: @eyurtsev --------- Signed-off-by: Yuchen Liang <yuchenl3@andrew.cmu.edu> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-06 09:06:00 -08:00
newfinder	d4d64daa1e	Mask API key for baidu qianfan (#14281 ) Description: This PR masked baidu qianfan - Chat_Models API Key and added unit tests. Issue: the issue langchain-ai#12165. Tag maintainer: @eyurtsev --------- Co-authored-by: xiayi <xiayi@bytedance.com>	2023-12-06 08:47:09 -08:00
cxumol	06e3316f54	feat(add): LLM integration of Cloudflare Workers AI (#14322 ) Add [Text Generation by Cloudflare Workers AI](https://developers.cloudflare.com/workers-ai/models/text-generation/). It's a new LLM integration. - Dependencies: N/A	2023-12-06 08:24:19 -08:00
Harutaka Kawamura	5efaedf488	Exclude `max_tokens` from request if it's None (#14334 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> We found a request with `max_tokens=None` results in the following error in Anthropic: ``` HTTPError: 400 Client Error: Bad Request for url: https://oregon.staging.cloud.databricks.com/serving-endpoints/corey-anthropic/invocations. Response text: {"error_code":"INVALID_PARAMETER_VALUE","message":"INVALID_PARAMETER_VALUE: max_tokens was not of type Integer: null"} ``` This PR excludes `max_tokens` if it's None.	2023-12-06 08:23:17 -08:00
MinjiK	a1a11ffd78	Amadeus toolkit minor update (#13002 ) - update `Amadeus` toolkit with ability to switch Amadeus environments - update minor code explanations --------- Co-authored-by: MinjiK <minji.kim@amadeus.com>	2023-12-05 20:08:34 -08:00
Alexandre Dumont	b05c46074b	OpenAIEmbeddings: retry_min_seconds/retry_max_seconds parameters (#13138 ) - Description: new parameters in OpenAIEmbeddings() constructor (retry_min_seconds and retry_max_seconds) that allow parametrization by the user of the former min_seconds and max_seconds that were hidden in _create_retry_decorator() and _async_retry_decorator() - Issue: #9298, #12986 - Dependencies: none - Tag maintainer: @hwchase17 - Twitter handle: @adumont make format ✅ make lint ✅ make test ✅ Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-05 20:08:17 -08:00
mogith-pn	9e5d146409	Updated integration with Clarifai python SDK functions (#13671 ) Description : Updated the functions with new Clarifai python SDK. Enabled initialisation of Clarifai class with model URL. Updated docs with new functions examples.	2023-12-05 20:08:00 -08:00
dudub12	8f403ea2d7	info sql tool remove whitespaces in table names (#13712 ) Remove whitespaces from the input of the ListSQLDatabaseTool for better support. for example, the input "table1,table2,table3" will throw an exception whiteout the change although it's a valid input. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-05 20:07:38 -08:00
balaba-max	64d5108f99	Feature: GitLab url from ENV (#14221 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: add gitlab url from env, - Issue: no issue, - Dependencies: no, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-05 19:41:36 -08:00
kavinraj A S	ab6b41937a	Fixed a typo in smart_llm prompt (#13052 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-12-05 19:16:18 -08:00
jeffpezzone	7c2ef06136	Adds "NIN" metadata filter for pgvector to all checking for set absence (#14205 ) This PR adds support for metadata filters of the form: `{"filter": {"key": { "NIN" : ["list", "of", "values"]}}}` "IN" is already supported, so this is a quick & related update to add "NIN"	2023-12-05 19:07:33 -08:00
lif	20d2b4a6ba	feat: Increased compatibility with new and old versions for dalle (#14222 ) - Description: Increased compatibility with all versions openai for dalle, This pr add support for openai version from 0 ~ 1.3.	2023-12-05 17:31:28 -08:00
Wang Wei	7205bfdd00	feat: 1. Add system parameters, 2. Align with the QianfanChatEndpoint for function calling (#14275 ) - Description: 1. Add system parameters to the ERNIE LLM API to set the role of the LLM. 2. Add support for the ERNIE-Bot-turbo-AI model according from the document https://cloud.baidu.com/doc/WENXINWORKSHOP/s/Alp0kdm0n. 3. For the function call of ErnieBotChat, align with the QianfanChatEndpoint. With this PR, the `QianfanChatEndpoint()` can use the `function calling` ability with `create_ernie_fn_chain()`. The example is as the following: ``` from langchain.prompts import ChatPromptTemplate import json from langchain.prompts.chat import ( ChatPromptTemplate, ) from langchain.chat_models import QianfanChatEndpoint from langchain.chains.ernie_functions import ( create_ernie_fn_chain, ) def get_current_news(location: str) -> str: """Get the current news based on the location.' Args: location (str): The location to query. Returs: str: Current news based on the location. """ news_info = { "location": location, "news": [ "I have a Book.", "It's a nice day, today." ] } return json.dumps(news_info) def get_current_weather(location: str, unit: str="celsius") -> str: """Get the current weather in a given location Args: location (str): location of the weather. unit (str): unit of the tempuature. Returns: str: weather in the given location. """ weather_info = { "location": location, "temperature": "27", "unit": unit, "forecast": ["sunny", "windy"], } return json.dumps(weather_info) template = ChatPromptTemplate.from_messages([ ("user", "{user_input}"), ]) chat = QianfanChatEndpoint(model="ERNIE-Bot-4") chain = create_ernie_fn_chain([get_current_weather, get_current_news], chat, template, verbose=True) res = chain.run("北京今天的新闻是什么？") print(res) ``` The result of the above code: ``` > Entering new LLMChain chain... Prompt after formatting: Human: 北京今天的新闻是什么？ > Finished chain. {'name': 'get_current_news', 'arguments': {'location': '北京'}} ``` For the `ErnieBotChat`, now can use the `system` parameter to set the role of the LLM. ``` from langchain.prompts import ChatPromptTemplate from langchain.chains import LLMChain from langchain.chat_models import ErnieBotChat llm = ErnieBotChat(model_name="ERNIE-Bot-turbo-AI", system="你是一个能力很强的机器人，你的名字叫小叮当。无论问你什么问题，你都可以给出答案。") prompt = ChatPromptTemplate.from_messages( [ ("human", "{query}"), ] ) chain = LLMChain(llm=llm, prompt=prompt, verbose=True) res = chain.run(query="你是谁？") print(res) ``` The result of the above code: ``` > Entering new LLMChain chain... Prompt after formatting: Human: 你是谁？ > Finished chain. 我是小叮当，一个智能机器人。我可以为你提供各种服务，包括回答问题、提供信息、进行计算等。如果你需要任何帮助，请随时告诉我，我会尽力为你提供最好的服务。 ```	2023-12-05 17:28:31 -08:00
Leonid Kuligin	fd5be55a7b	added get_num_tokens to GooglePalm (#14282 ) added get_num_tokens to GooglePalm + a little bit of refactoring	2023-12-05 17:24:19 -08:00
Massimiliano Pronesti	c215a4c9ec	feat(embeddings): text-embeddings-inference (#14288 ) - Description: Added a notebook to illustrate how to use `text-embeddings-inference` from huggingface. As `HuggingFaceHubEmbeddings` was using a deprecated client, I made the most of this PR updating that too. - Issue: #13286 - Dependencies: None - Tag maintainer: @baskaryan	2023-12-05 17:22:05 -08:00
Tim Van Wassenhove	85b88c33f3	Fixes issue-14295: Correctly pass along the kwargs (#14296 ) - Description: Update code to correctly pass the kwargs - Issue: #14295 - Dependencies: - - Tag maintainer: <-- If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> #issue-14295	2023-12-05 17:14:00 -08:00
Jarkko Lagus	667ad6a5de	Add support for CORS options for AzureSearch (#14305 ) - Description: Add support for setting the CORS options when using AzureSearch indexes	2023-12-05 16:05:40 -08:00
Karim Assi	9401539e43	Allow not enforcing function usage when a single function is passed to openai function executable (#14308 ) - Description: allows not enforcing function usage when a single function is passed to an openAI function executable (or corresponding legacy chain). This is a desired feature in the case where the model does not have enough information to call a function, and needs to get back to the user. - Issue: N/A - Dependencies: N/A - Tag maintainer: N/A	2023-12-05 15:56:31 -08:00
Ran	d22c13ec48	Mask API key for Minimax LLM (#14309 ) - Description: Added masking for the API key for Minimax LLM + tests inspired by https://github.com/langchain-ai/langchain/pull/12418. - Issue: the issue # fixes https://github.com/langchain-ai/langchain/issues/12165 - Dependencies: this fix is dependent on Minimax instantiation fix which is introduced in https://github.com/langchain-ai/langchain/pull/13439, so merge this one after. - Tag maintainer: @eyurtsev --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-05 15:42:00 -08:00
Eugene Yurtsev	a74c03da3c	Add metadata to blob (#14162 ) Add metadata to the blob object. This makes it easier to make a pipeline that properly propagates metadata information from raw content to the derived content.	2023-12-05 17:17:41 -05:00

... 3 4 5 6 7 ...

2502 Commits