langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Pengcheng Liu	144f2821af	docs: add example for loading data from LarkSuite wiki. (#21311 ) Description: Update LarkSuite loader doc to give an example for loading data from LarkSuite wiki. Issue: None Dependencies: None Twitter handle: None	2024-05-06 09:56:12 -07:00
Mateusz Szewczyk	682d21c3de	ibm: Add support for ibm-watsonx-ai new major version (#21313 ) Thank you for contributing to LangChain! - [x] PR title: "langchain-ibm: Add support for ibm-watsonx-ai new major version" - [x] PR message: - Description: Add support for ibm-watsonx-ai new major version - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 16:48:26 +00:00
Chris Papademetrious	ee6c922c91	langchain[minor]: enhance `LocalFileStore` to offer `update_atime` parameter that updates access times on read (#20951 ) Description: The `LocalFileStore` class can be used to create an on-disk `CacheBackedEmbeddings` cache. The number of files in these embeddings caches can grow to be quite large over time (hundreds of thousands) as embeddings are computed for new versions of content, but the embeddings for old/deprecated content are not removed. A least-recently-used (LRU) cache policy could be applied to the `LocalFileStore` directory to delete cache entries that have not been referenced for some time: ```bash # delete files that have not been accessed in the last 90 days find embeddings_cache_dir/ -atime 90 -print0 \| xargs -0 rm ``` However, most filesystems in enterprise environments disable access time modification on read to improve performance. As a result, the access times of these cache entry files are not updated when their values are read. To resolve this, this pull request updates the `LocalFileStore` constructor to offer an `update_atime` parameter that causes access times to be updated when a cache entry is read. For example, ```python file_store = LocalFileStore(temp_dir, update_atime=True) ``` The default is `False`, which retains the original behavior. Testing: I updated the LocalFileStore unit tests to test the access time update.	2024-05-06 11:52:29 -04:00
Tomaz Bratanic	5b6d1a907d	Add the extract types to diffbot graph transformer (#21315 ) Before you could only extract triples (diffbot calls it facts) from diffbot to avoid isolated nodes. However, sometimes isolated nodes can still be useful like for prefiltering, so we want to allow users to extract them if they want. Default behaviour is unchanged.	2024-05-06 09:19:52 -04:00
Jagadish Krishnamoorthy	c038991590	docs: Update pandas.ipynb (#21289 ) Remove the redundant comment.	2024-05-05 20:22:17 +00:00
aditya thomas	b868c78a12	partners[anthropic]: update unit test for key passed in from the environment (#21290 ) Description: Update unit test for ChatAnthropic Issue: Test for key passed in from the environment should not have the key initialized in the constructor Dependencies: None	2024-05-05 16:19:10 -04:00
tanersekmen	d310f9c71e	docs:update code structure (#21302 ) update the structure of llm_chain variables Co-authored-by: tanersemenn <0418>	2024-05-05 17:18:15 +00:00
Christophe Bornet	ba9dc04ffa	docs: Add doc for hybrid search (#21245 ) See [preview](https://langchain-git-fork-cbornet-doc-hybrid-search-langchain.vercel.app/docs/use_cases/question_answering/hybrid/) In the model of [per user retrieval](https://python.langchain.com/docs/use_cases/question_answering/per_user/)	2024-05-04 08:22:56 -04:00
Rohan Aggarwal	8021d2a2ab	community[minor]: Oraclevs integration (#21123 ) Thank you for contributing to LangChain! - Oracle AI Vector Search Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. - Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. This Pull Requests Adds the following functionalities Oracle AI Vector Search : Vector Store Oracle AI Vector Search : Document Loader Oracle AI Vector Search : Document Splitter Oracle AI Vector Search : Summary Oracle AI Vector Search : Oracle Embeddings - We have added unit tests and have our own local unit test suite which verifies all the code is correct. We have made sure to add guides for each of the components and one end to end guide that shows how the entire thing runs. - We have made sure that make format and make lint run clean. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: skmishraoracle <shailendra.mishra@oracle.com> Co-authored-by: hroyofc <harichandan.roy@oracle.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-04 03:15:35 +00:00
ccurme	c9e9470c5a	langchain: fix deprecation decorators on extraction chains (#21276 ) Calling any of these raises ``` ValueError: A pending deprecation cannot have a scheduled removal ```	2024-05-03 18:29:40 -04:00
Wickes Wong	ee1adaacaa	langchain[patch]: Fix summary buffer memory with return message flag (#21115 ) ## Description Memory return could be set as `str` or `message` by `return_messages` flag as mentioned in https://python.langchain.com/docs/modules/memory/#whether-memory-is-a-string-or-a-list-of-messages, where `langchain.chains.conversation.memory.ConversationSummaryBufferMemory` did not implement that. This commit added `buffer_as_str` and `buffer_as_messages` function, and `buffer` now affected by `return_messages` flag. ## Example Test Code and Output ```python # Fix: ConversationSummaryBufferMemory with return_messages flag function # Test code from langchain.chains.conversation.memory import ConversationSummaryBufferMemory from langchain_community.llms.ollama import Ollama llm = Ollama() # Create an instance of ConversationSummaryBufferMemory with return_messages set to True memory = ConversationSummaryBufferMemory(return_messages=True, llm=llm) # Add user and AI messages to the chat memory memory.chat_memory.add_user_message("hi!") memory.chat_memory.add_ai_message("what's up?") # Print the buffer print("Buffer:") print(map(type, memory.buffer), sep="\n") print(memory.buffer, "\n") # Print the buffer as a string print("Buffer as String:") print(type(memory.buffer_as_str)) print(memory.buffer_as_str, "\n") # Print the buffer as messages print("Buffer as Messages:") print(map(type, memory.buffer_as_messages), sep="\n") print(memory.buffer_as_messages, "\n") # Print the buffer after setting return_messages to False memory.return_messages = False print("Buffer after setting return_messages to False:") print(type(memory.buffer)) print(memory.buffer, "\n") ``` ```plaintext Buffer: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer as String: <class 'str'> Human: hi! AI: what's up? Buffer as Messages: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer after setting return_messages to False: <class 'str'> Human: hi! AI: what's up? ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-03 17:25:09 -04:00
Leonid Ganeline	9639457222	community[patch]: `tools` imports (#21156 ) Issue: we have several helper functions to import third-party libraries like tools.gmail.utils.import_google in [community.tools](https://api.python.langchain.com/en/latest/community_api_reference.html#id37). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-03 17:22:45 -04:00
Leonid Ganeline	3ef8b24277	core[patch]: `utils.guard_import` fix (#21133 ) Issues (nit): 1. `utils.guard_import` prints wrong error message when there is an import `error.` It prints the whole `module_name` but should be only the first part as the pip package name. E.i. `langchain_core.utils` -> print not `langchain-core` but `langchain_core.utils`. Also replace '_' with '-' in the pip package name. 2. it does not handle the `ModuleNotFoundError` which raised if `guard_import("wrong_module")` Fixed issues; added ut-s. Controversial: I've reraised `ModuleNotFoundError` as `ImportError`, since in case of the error, the proposed action is the same - we need to install a missed package.	2024-05-03 17:21:36 -04:00
Erick Friis	36c2ca3c8b	mistralai: relax tokenizers dep (#21277 )	2024-05-03 14:16:22 -07:00
Nuno Campos	6e1e0c7d5c	fix: core: draw_mermaid() would create subgroup for edges with same src and tgt (#21275 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-03 13:51:08 -07:00
Eugene Yurtsev	26a37dce0a	langchain[patch]: Remove jsonpatch from poetry file (#21272 ) jsonpatch is only used in langchain-core not in langchain	2024-05-03 15:46:05 -04:00
Eugene Yurtsev	335bd01e45	langchain[patch]: Update deprecation warning (#21268 ) Update deprecation warning	2024-05-03 15:31:29 -04:00
Leonid Ganeline	23a05c3986	langchain: `summarize` chain fix (#21266 ) Issue: `load_summarize_chain` is placed in the __init__.py file. As a result, it doesn't listed in the API Reference docs. Change: moved code from __init__.py into a new file.	2024-05-03 14:44:39 -04:00
ccurme	6da3d92b42	(all): update removal in deprecation warnings from 0.2 to 0.3 (#21265 ) We are pushing out the removal of these to 0.3. `find . -type f -name "*.py" -exec sed -i '' 's/removal="0\.2/removal="0.3/g' {} +`	2024-05-03 14:29:36 -04:00
Eugene Yurtsev	d6e34f9ee5	langchain[patch]: Improve deprecation warnings (#21262 ) * Remove spurious derprecation warning * Make deprecation warnings consistent with 0.1 namespaces that were announced as deprecated	2024-05-03 13:40:16 -04:00
Eugene Yurtsev	487aff7e46	langchain[patch]: Revert 20794 until 0.2 release (#21257 ) PR of 2079 was already released as part of 0.1.17rc. Issue for 0.2 release: https://github.com/langchain-ai/langchain/issues/21080	2024-05-03 17:02:48 +00:00
Eugene Yurtsev	ba4a309d98	langchain[patch]: Revert breaking change until 0.2 release (#21256 ) Reverts a minor breaking change until 0.2 release	2024-05-03 09:42:27 -07:00
Eugene Yurtsev	66a1e3f083	langchain[patch]: Fix flaky unit test (#21258 ) Should sort the results of the import test since it depends on import order	2024-05-03 15:55:46 +00:00
Eugene Yurtsev	0989c48028	langchain[minor]: Re-add deleted ainetwork tool (#21254 ) * Adding __init__.py to turn it into a package in community * Adding proxy imports that assume that langchain_community is optional	2024-05-03 11:39:40 -04:00
Christophe Bornet	2fbe82f5e6	community[minor]: Relax constraints on CassandraChatMessageHistory constructor (#21241 )	2024-05-03 10:20:39 -04:00
Chris Germann	3a8d1d8838	Hotfix RetrievalQA Docs: docs: Fix formatting (#21183 ) # Newline Characters breaking formatting Description: As you can see in the image below, the formatting in the documentation is broken. As far as I can see the two added `\n` characters are breaking the documentation. Therefore I would propose to remove those ![image](https://github.com/langchain-ai/langchain/assets/88305668/23b6e726-71b2-4812-91ea-3e8600683733) Dependencies: None Twitter Handle - epu9byj --------- Co-authored-by: gere <gere@kapo.zh.ch> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-03 12:46:29 +00:00
andyjessen	64e17bd793	docs: Fix comment within "handle long text" example (#21248 ) The current doc-string comment is referring to the wrong schema.	2024-05-03 12:36:53 +00:00
Daniel Glogowski	c3d169ab00	docs: Update Nvidia documentation (#21240 ) Updating Nvidia docs ahead for 5/15 competition. Thanks!	2024-05-03 12:29:03 +00:00
Bagatur	70bde15480	docs: add tool choice to tool calling (#21229 )	2024-05-03 03:10:22 -04:00
Bagatur	67a5cc34c6	openai[patch]: Release 0.1.6 (#21236 )	2024-05-03 04:10:39 +00:00
Erick Friis	c1eb95b967	core: release 0.1.50 (#21230 )	2024-05-02 22:44:18 +00:00
Nuno Campos	47ce8d5a57	core: tracer: remove numeric execution order (#21220 ) - this hasn't been used in a long time and requires some additional bookkeeping i'm going to streamline in the next pr	2024-05-02 15:38:55 -07:00
Bagatur	6ac6158a07	openai[patch]: support tool_choice="required" (#21216 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-02 18:33:25 -04:00
Erick Friis	aa9faa8512	docs: model table keywords, remove tool calling from llm (#21225 )	2024-05-02 21:04:29 +00:00
xindoo	c1aa237bc2	langchain: fix syntax error in code comment for create_tool_calling_agent (#21205 ) PR message: - Description: Corrected a syntax error in the code comments within the `create_tool_calling_agent` function in the langchain package. - Issue: N/A - Dependencies: No additional dependencies required. - Twitter handle: N/A	2024-05-02 19:17:23 +00:00
ccurme	eb0a2fd53a	mistral: release 0.1.6 (#21214 )	2024-05-02 13:59:19 -04:00
ccurme	2d77e5e3a1	(standard tests): add test for basic conversation sequence (#21213 )	2024-05-02 13:47:10 -04:00
Maxime Perrin	1ebb5a70ad	partners(mistralai): Removing unused variable in completion request (using tool_calls or content) (#21201 ) This PR fixes #21196. The error was occurring when calling chat completion API with a chat history. Indeed, the Mistral API does not accept both `content` and `tool_calls` in the same body. This PR removes one of theses variables depending on the necessity. --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-02 13:20:14 -04:00
Christophe Bornet	683fb45c6b	community[patch]: Refactor CassandraDatabase wrapper (#21075 ) * Introduce individual `fetch_` methods for easier typing. * Rework some docstrings to google style * Move some logic to the tool * Merge the 2 cassandra utility files	2024-05-02 13:13:08 -04:00
Bagatur	b00fd1dbde	infra: Undo gh cache removal (#21210 ) Co-authored-by: Nuno Campos <nuno@langchain.dev>	2024-05-02 17:12:32 +00:00
Aditya	ee2c55ca09	docs: Added documentation on Anthropic models on vertex (#21070 ) Description:Added documentation on Anthropic models on Vertex @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-05-02 13:12:01 -04:00
Raghav Dixit	7d451d0041	community[patch]: Update lancedb.py (#21192 ) very minor update in LanceDB integration, 'metric' argument was missing.	2024-05-02 17:06:39 +00:00
Bagatur	d297d90ad9	core[patch]: Release 0.1.49 (#21211 )	2024-05-02 17:06:27 +00:00
Nuno Campos	663747b730	core[patch]: Fixes for convert_messages (#21207 ) - support two-tuples of any sequence type (eg. json.loads never produces tuples) - support type alias for role key - if id is passed in in dict form use it - if tool_calls passed in in dict form use them --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-02 16:55:42 +00:00
Eugene Yurtsev	df49404794	langchain[patch]: Make more memory code handle community dependency as optional (#21199 )	2024-05-02 11:05:26 -04:00
ccurme	bd5d2c2674	langchain: import InMemoryChatMessageHistory from core (#21198 )	2024-05-02 14:53:07 +00:00
Eugene Yurtsev	3cd7fced5f	langchain[patch],community[minor]: Migrate memory implementations to community (#20845 ) Migrates memory implementations to community	2024-05-02 10:46:50 -04:00
Eugene Yurtsev	b5c3a04e4b	langchain[patch]: chat histories to handle optional community dependence (#21194 )	2024-05-02 10:36:08 -04:00
Eugene Yurtsev	c9119b0e75	langchain[patch],community[minor]: Move some unit tests from langchain to community, use core for fake models (#21190 )	2024-05-02 09:57:52 -04:00
Eugene Yurtsev	c306364b06	langchain[patch]: Update more code to use langchain community as an optional dependency (#21170 ) More code to use langchain community as an optional dependency	2024-05-02 09:05:48 -04:00

1 2 3 4 5 ...

9154 Commits