langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-02 09:40:22 +00:00

Author	SHA1	Message	Date
East Agile	2a6f78a53f	community[minor]: Rememberizer retriever (#20052 ) Description: This pull request introduces a new feature for LangChain: the integration with the Rememberizer API through a custom retriever. This enables LangChain applications to allow users to load and sync their data from Dropbox, Google Drive, Slack, their hard drive into a vector database that LangChain can query. Queries involve sending text chunks generated within LangChain and retrieving a collection of semantically relevant user data for inclusion in LLM prompts. User knowledge dramatically improved AI applications. The Rememberizer integration will also allow users to access general purpose vectorized data such as Reddit channel discussions and US patents. Issue: N/A Dependencies: N/A Twitter handle: https://twitter.com/Rememberizer	2024-05-01 10:41:44 -04:00
Eugene Yurtsev	1ce1a10f2b	langchain[patch],community[minor]: Move graph index creator (#20795 ) Move graph index creator to community	2024-05-01 10:04:30 -04:00
Eugene Yurtsev	aa0bc7467c	langchain[patch]: Migrate agents module into optional imports for community (#21088 )	2024-05-01 09:36:03 -04:00
Eugene Yurtsev	86ff8a3fb4	langchain[patch]: Update docstore module to use optional imports from community (#21091 )	2024-05-01 09:35:05 -04:00
Eugene Yurtsev	d640605694	langchain[patch]: Migrate chat loaders to optional community imports (#21089 ) Migrate chat loaders to optional community imports	2024-05-01 09:34:44 -04:00
Charlie Marsh	2b10c4dd52	ci: Use `ruff check` in Makefile (#21138 ) ## Summary `ruff /path/to/file.py` works but is deprecated, and we now recommend `ruff check /path/to/file.py` (to match `ruff format /path/to/file.py`).	2024-05-01 09:34:15 -04:00
Eugene Yurtsev	2fcab9acd9	langchain[patch]: Upgrade storage to treat langchain community as optional (#21105 )	2024-05-01 09:33:31 -04:00
William FH	ab55f6996d	[Core] Tracing: update parent run_tree's child_runs (#21049 )	2024-05-01 06:33:08 -07:00
Abhishek Bhagwat	86fe484e24	docs: Docs (sample notebook) for Vertex DIY RAG Ranking API (#21054 ) Vertex DIY RAG APIs helps to build complex RAG systems and provide more granular control, and are suited for custom use cases. The Ranking API takes in a list of documents and reranks those documents based on how relevant the documents are to a given query. Compared to embeddings that look purely at the semantic similarity of a document and a query, the ranking API can give you a more precise score for how well a document answers a given query. [Reference](https://cloud.google.com/generative-ai-app-builder/docs/ranking) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:39:39 +00:00
Stuart Leeks	8a01760a0f	infra: Sync devcontainer.json and compose file mount location (#20461 ) Sync the config in `devcontainer.json` and `docker-compose.yml` Issue: when opening the current `master` branch in a dev container in VS Code, I get the following message as VS Code cannot find the mounted source folder: ![image](https://github.com/langchain-ai/langchain/assets/1824461/41cf20c0-d1e0-4648-9578-edf80b99c2db) Opening in a GitHub Codespace works (it seems to ignore the mounts in the `docker-compose.yml`. This PR updates the mount in `docker-compose.yml` and the config in `devcontainer.json` so that the two align. I have tested these changes in GitHub Codespaces and a VS Code dev container and both loaded successfully. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-01 01:32:12 -04:00
aditya thomas	12b1caf295	openai[patch]: add tests for secret_str for keys (#20982 ) Description: Add tests to check API keys and Active Directory tokens are masked Issue: Resolves #12165 for OpenAI and Azure OpenAI models Dependencies: None Also resolves #12473 which may be closed. Additional contributors @alex4321 (#12473) and @onesolpark (#12542)	2024-05-01 01:26:20 -04:00
Noah	45ddf4d26f	community[patch]: Update comments for lazy_load method (#21063 ) - [ ] PR message: - Description: Refactored the lazy_load method to use asynchronous execution for improved performance. The method now initiates scraping of all URLs simultaneously using asyncio.gather, enhancing data fetching efficiency. Each Document object is yielded immediately once its content becomes available, streamlining the entire process. - Issue: N/A - Dependencies: Requires the asyncio library for handling asynchronous tasks, which should already be part of standard Python libraries in Python 3.7 and above. - Email: [r73327118@gmail.com](mailto:r73327118@gmail.com) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 01:20:57 -04:00
Liu Xiaodong	3b473d10f2	experimental: clean python repl input（experimental：Added code for PythonREPL） (#20930 ) Update python.py（experimental：Added code for PythonREPL） Added code for PythonREPL, defining a static method 'sanitize_input' that takes the string 'query' as input and returns a sanitizing string. The purpose of this method is to remove unwanted characters from the input string, Specifically: 1. Delete the whitespace at the beginning and end of the string (' \s'). 2. Remove the quotation marks (`` ` ``) at the beginning and end of the string. 3. Remove the keyword "python" at the beginning of the string (case insensitive) because the user may have typed it. This method uses regular expressions (regex) to implement sanitizing. It all started with this code： from langchain.agents import Tool from langchain_experimental.utilities import PythonREPL python_repl = PythonREPL() repl_tool = Tool( name="python_repl", description="Remove redundant formatting marks at the beginning and end of source code from input.Use a Python shell to execute python commands. If you want to see the output of a value, you should print it out with `print(...)`.", func=python_repl.run, ) When I call the agent to write a piece of code for me and execute it with the defined code, I must get an error: SyntaxError('invalid syntax', ('<string>', 1, 1,'In', 1, 2)) After checking, I found that pythonREPL has less formatting of input code than the soon-to-be deprecated pythonREPL tool, so I added this step to it, so that no matter what code I ask the agent to write for me, it can be executed smoothly and get the output result. I have tried modifying the prompt words to solve this problem before, but it did not work, and by adding a simple format check, the problem is well resolved. <img width="1271" alt="image" src="https://github.com/langchain-ai/langchain/assets/164149097/c49a685f-d246-4b11-b655-fd952fc2f04c"> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:19:09 +00:00
Ismail Hossain Polas	1fdf63fa6c	community[patch]: update package name to bagelML (#19948 ) Description This pull request updates the Bagel Network package name from "betabageldb" to "bagelML" to align with the latest changes made by the Bagel Network team. The following modifications have been made: - Updated all references to the old package name ("betabageldb") with the new package name ("bagelML") throughout the codebase. - Modified the documentation, and any relevant scripts to reflect the package name change. - Tested the changes to ensure that the functionality remains intact and no breaking changes were introduced. By merging this pull request, our project will stay up to date with the latest Bagel Network package naming convention, ensuring compatibility and smooth integration with their updated library. Please review the changes and provide any feedback or suggestions. Thank you!	2024-05-01 01:17:33 -04:00
Tomaz Bratanic	7860e4c649	experimental[patch]: Add support for non-function calling LLMs in llm graph transformers (#21014 )	2024-05-01 01:16:07 -04:00
Erick Friis	67e6744e0f	docs: fix some notebook formatting (#21136 )	2024-04-30 21:39:03 -07:00
tianzedavid	5a8909440b	docs: remove repetitive words (#21058 ) remove repetitive words Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-01 01:10:42 +00:00
Leonid Kuligin	a36935b520	docs: updated docs on langchain_google_community (#21064 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: updated docs on langchain_google_community" - [ ] PR message: - Description: updated docs on langchain_google_community	2024-04-30 20:20:49 -04:00
Tomaz Bratanic	c9e96bb5e2	community[patch]: Fix neo4j enhanced schema bugs (#21072 )	2024-04-30 20:16:26 -04:00
junkeon	8d2909ee25	upstage[minor]: Update few codes and add upstage loader in pdf section (#21085 ) Description: Update UpstageLayoutAnalysisParser and Loader and add upstage loader example in pdf section Dependencies: langchain_community Twitter handle: [@upstageai](https://twitter.com/upstageai) - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-30 20:15:49 -04:00
Bagatur	bef50ded63	openai[patch]: fix special token default behavior (#21131 ) By default handle special sequences as regular text	2024-04-30 20:08:24 -04:00
MacanPN	0f7f448603	community[patch]: add delete() method to AzureSearch vector store (#21127 ) Issue: Currently `AzureSearch` vector store does not implement `delete` method. This PR implements it. This also makes it compatible with LangChain indexer. Dependencies: None Twitter handle: @martintriska1 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:46:18 +00:00
Jorge Piedrahita Ortiz	3441a11b21	docs: minor changes in sambanova community integration docs (#21129 ) - Description: minor changes in sambanova community integration notebook docs --------- Co-authored-by: Renate Kempf <165940384+renate-snova@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:44:26 +00:00
Bagatur	6d3e9eaf84	docs: format (#21132 )	2024-04-30 23:32:41 +00:00
Erick Friis	14422a4220	langchain: fix core dep (#21128 )	2024-04-30 14:55:12 -07:00
Erick Friis	6c938da302	langchain: release 0.1.17 (#21125 )	2024-04-30 14:43:59 -07:00
Erick Friis	5f8a307565	infra: same tagging for langchain (#21126 )	2024-04-30 14:43:45 -07:00
Eugene Yurtsev	bf95414758	langchain[minor]: enhance unit test to test imports recursively (#21122 )	2024-04-30 17:05:53 -04:00
Eugene Yurtsev	e4f51f59a2	langchain[patch]: Migrate tools to treat community imports as optional (#21117 ) Migrate tools to treat community imports as optional	2024-04-30 16:26:18 -04:00
Eugene Yurtsev	9e788f09c6	langchain[patch]: Migrate output parsers to support optional community imports (#21103 ) Migrate output parsers	2024-04-30 16:24:29 -04:00
Eugene Yurtsev	3853fe9f64	langchain[patch]: Migrate graphs to use optional community imports (#21100 ) Migrate graphs to use optional community imports.	2024-04-30 16:24:06 -04:00
Eugene Yurtsev	8658d52587	langchain[patch]: Upgrade prompts to optional imports (#21078 ) Upgrades prompts module to use optional imports. This code was generated with a migration script, but had to be adjusted manually a bit. Testing in preparation for applying this code modification across the rest of the modules in langchain package to reverse the dependency between langchain community and langchain.	2024-04-30 16:23:39 -04:00
Eugene Yurtsev	9b6d04a187	langchain[patch]: Migrate document transformers (#21098 ) Migrate document transformers	2024-04-30 16:20:02 -04:00
Eugene Yurtsev	aec13a6123	langchain[patch]: Migrate callbacks module to use optional imports for community (#21086 )	2024-04-30 16:19:13 -04:00
Erick Friis	8a62fb0570	community: release 0.0.36 (#21118 )	2024-04-30 13:18:44 -07:00
Erick Friis	2407c353be	core: release 0.1.48 (#21113 )	2024-04-30 19:52:36 +00:00
Erick Friis	dbdfa3d34e	infra: fix minimum version install to force pypi install (#21112 )	2024-04-30 12:41:26 -07:00
Charlie Marsh	fd94aa8366	partner[patch]: Upgrade to Ruff v0.4.2 (#21108 ) ## Summary No new diagnostics (given that the set of enabled rules hasn't changed), but gains access to our new parser (much faster) and reduced false positives all around.	2024-04-30 15:06:42 -04:00
Jamsheed Mistri	3e749369ef	community[minor]: bump version of LayerupSecurity, add support for untrusted_input parameter (#19985 ) Description: update version of LayerupSecurity package for the Layerup Security integration. Add untrusted_input parameter.	2024-04-30 14:55:26 -04:00
fubuki8087	f1c3687aa5	community[patch]: Using the right encoding to parse the web page in RecursiveUrlLoader (#20632 ) As shown in #13749 , `RecursiveUrlLoader` has encoding issue. This PR is to solve this. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 18:41:36 +00:00
Jakub Pawłowski	b0b1a67771	community[patch]: Skip unexpected 404 HTTP Error in Arxiv download (#21042 ) ### Description: When attempting to download PDF files from arXiv, an unexpected 404 error frequently occurs. This error halts the operation, regardless of whether there are additional documents to process. As a solution, I suggest implementing a mechanism to ignore and communicate this error and continue processing the next document from the list. Proposed Solution: To address the issue of unexpected 404 errors during PDF downloads from arXiv, I propose implementing the following solution: - Error Handling: Implement error handling mechanisms to catch and handle 404 errors gracefully. - Communication: Inform the user or logging system about the occurrence of the 404 error. - Continued Processing: After encountering a 404 error, continue processing the remaining documents from the list without interruption. This solution ensures that the application can handle unexpected errors without terminating the entire operation. It promotes resilience and robustness in the face of intermittent issues encountered during PDF downloads from arXiv. ### Issue: #20909 ### Dependencies: none --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-30 18:29:22 +00:00
Erick Friis	b9c53e95b7	community: release 0.0.35 (#21104 )	2024-04-30 17:48:56 +00:00
Eugene Yurtsev	3c064a757f	core[minor],langchain[patch],community[patch]: Move storage interfaces to core (#20750 ) * Move storage interface to core * Move in memory and file system implementation to core	2024-04-30 13:14:26 -04:00
Charlie Marsh	8f38b7a725	multiple: Remove unnecessary Ruff suppression comments (#21050 ) ## Summary I ran `ruff check --extend-select RUF100 -n` to identify `# noqa` comments that weren't having any effect in Ruff, and then `ruff check --extend-select RUF100 -n --fix` on select files to remove all of the unnecessary `# noqa: F401` violations. It's possible that these were needed at some point in the past, but they're not necessary in Ruff v0.1.15 (used by LangChain) or in the latest release. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-30 17:13:48 +00:00
Erick Friis	748f2ba9ea	core: release 0.1.47 (#21094 )	2024-04-30 09:22:05 -07:00
Erick Friis	efe27ef849	infra: tag non-langchain releases (#20805 )	2024-04-30 16:15:46 +00:00
Eugene Yurtsev	c8f18a2524	langchain[patch]: Update import handling in `adapters` (#21079 )	2024-04-30 10:55:29 -04:00
William FH	5c63ac3dd7	[Patch] Dedent docstring (#20959 ) Technically a slight prompt breaking change, but I think positive EV in that it saves tokens and results in more sane / in-distribution prompts	2024-04-30 07:40:57 -07:00
Eugene Yurtsev	845d8e0025	langchain[patch]: Update handling of deprecation warnings (#21083 ) Chains should not be emitting deprecation warnings.	2024-04-30 10:30:23 -04:00
Christophe Bornet	5c77f45b06	community[minor]: Add async methods to CassandraCache and CassandraSemanticCache (#20654 )	2024-04-30 10:27:44 -04:00

1 2 3 4 5 ...

9080 Commits