langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
Nat Noordanus	8a3b74fe1f	community[patch]: Fix pydantic ForwardRef error in BedrockBase (#17416 ) - Description: Fixes a type annotation issue in the definition of BedrockBase. This issue was that the annotation for the `config` attribute includes a ForwardRef to `botocore.client.Config` which is only imported when `TYPE_CHECKING`. This can cause pydantic to raise an error like `pydantic.errors.ConfigError: field "config" not yet prepared so type is still a ForwardRef, ...`. - Issue: N/A - Dependencies: N/A - Twitter handle: `@__nat_n__`	2024-02-13 16:15:55 -08:00
Bagatur	2c076bebc9	docs: fix self query redirect (#17490 )	2024-02-13 15:44:56 -08:00
Ashley Xu	f746a73e26	Add the BQ job usage tracking from LangChain (#17123 ) - Description: Add the BQ job usage tracking from LangChain --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-13 14:47:57 -08:00
Bagatur	5dca107621	docs: update providers (#17488 )	2024-02-13 14:00:15 -08:00
JongRok BAEK	8d6cc90fc5	langchain.core : Use shallow copy for schema manipulation in JsonOutputParser.get_format_instructions (#17162 ) - Description : Fix: Use shallow copy for schema manipulation in get_format_instructions Prevents side effects on the original schema object by using a dictionary comprehension for a safer and more controlled manipulation of schema key-value pairs, enhancing code reliability. - Issue: #17161 - Dependencies: None - Twitter handle: None	2024-02-13 13:30:53 -08:00
Rave Harpaz	90f55e6bd1	Documentation/add update documentation for oci (#17473 ) Thank you for contributing to LangChain! Checklist: - PR title: docs: add & update docs for Oracle Cloud Infrastructure (OCI) integrations - Description: adding and updating documentation for two integrations - OCI Generative AI & OCI Data Science (1) adding integration page for OCI Generative AI embeddings (@baskaryan request, docs/docs/integrations/text_embedding/oci_generative_ai.ipynb) (2) updating integration page for OCI Generative AI llms (docs/docs/integrations/llms/oci_generative_ai.ipynb) (3) adding platform documentation for OCI (@baskaryan request, docs/docs/integrations/platforms/oci.mdx). this combines the integrations of OCI Generative AI & OCI Data Science (4) if possible, requesting to be added to 'Featured Community Providers' so supplying a modified docs/docs/integrations/platforms/index.mdx to reflect the addition - Issue: none - Dependencies: no new dependencies - Twitter handle: --------- Co-authored-by: MING KANG <ming.kang@oracle.com>	2024-02-13 13:26:23 -08:00
Bagatur	b5d3416563	experimental[patch]: Release 0.0.51 (#17484 )	2024-02-13 13:14:38 -08:00
Bagatur	de7c4b277c	langchain[patch]: Release 0.1.7 (#17482 )	2024-02-13 13:13:04 -08:00
Bagatur	39342d98d6	community[patch]: Release 0.0.20 (#17480 )	2024-02-13 13:01:51 -08:00
Bagatur	89b765ec27	core[patch]: Release 0.1.23 (#17479 )	2024-02-13 12:55:45 -08:00
Max Jakob	ab3d944667	community[patch]: ElasticsearchStore: preserve user headers (#16830 ) Users can provide an Elasticsearch connection with custom headers. This PR makes sure these headers are preserved when adding the langchain user agent header.	2024-02-13 12:37:35 -08:00
Erick Friis	112e10e933	infra: azure release integration testing secrets (#17476 )	2024-02-13 12:17:06 -08:00
Erick Friis	9eb1b56e73	pinecone[patch]: release 0.0.2 (#17477 )	2024-02-13 12:01:45 -08:00
Erick Friis	37678471c4	openai[patch]: relax tiktoken constraint, release 0.0.6 (#17472 )	2024-02-13 11:25:55 -08:00
Wendy H. Chun	2df7387c91	langchain[patch]: Fix to avoid infinite loop during collapse chain in map reduce (#16253 ) - Description: Depending on `token_max` used in `load_summarize_chain`, it could cause an infinite loop when documents cannot collapse under `token_max`. This change would not affect the existing feature, but it also gives an option to users to avoid the situation. - Issue: https://github.com/langchain-ai/langchain/issues/16251 - Dependencies: None - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:55:32 -08:00
wulixuan	5d06797905	community[minor]: integrate chat models with Yuan2.0 (#16575 ) 1. integrate chat models with [`Yuan2.0`](https://github.com/IEIT-Yuan/Yuan-2.0/blob/main/README-EN.md) 2. add a new doc for [Yuan2.0 integration](docs/docs/integrations/llms/yuan2.ipynb) Yuan2.0 is a new generation Fundamental Large Language Model developed by IEIT System. We have published all three models, Yuan 2.0-102B, Yuan 2.0-51B, and Yuan 2.0-2B. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:55:14 -08:00
Taha Khabouss	15baffc484	langchain[patch]: Ensure that the Elasticsearch Query Translator functions accurately w… (#17044 ) Description: Addresses a problem where the Date type within an Elasticsearch SelfQueryRetriever would encounter difficulties in generating a valid query. Issue: #17042 --------- Co-authored-by: Max Jakob <max.jakob@elastic.co> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:54:24 -08:00
Erick Friis	e5c76f9dbd	pinecone[patch]: poetry update (#17471 )	2024-02-13 10:32:29 -08:00
Erick Friis	10bdf2422c	pinecone[patch]: release 0.0.2rc0, remove simsimd dep (#17469 )	2024-02-13 10:02:16 -08:00
Erick Friis	065cde69b1	google-genai[patch]: release 0.0.9, safety settings docs (#17432 )	2024-02-13 10:01:25 -08:00
Sergey Kozlov	db6f266d97	core: improve None value processing in merge_dicts() (#17462 ) - Description: fix `None` and `0` merging in `merge_dicts()`, add tests. ```python from langchain_core.utils._merge import merge_dicts assert merge_dicts({"a": None}, {"a": 0}) == {"a": 0} ``` --------- Co-authored-by: Sergey Kozlov <sergey.kozlov@ludditelabs.io>	2024-02-13 08:48:02 -08:00
Ian Gregory	e5472b5eb8	Framework for supporting more languages in LanguageParser (#13318 ) ## Description I am submitting this for a school project as part of a team of 5. Other team members are @LeilaChr, @maazh10, @Megabear137, @jelalalamy. This PR also has contributions from community members @Harrolee and @Mario928. Initial context is in the issue we opened (#11229). This pull request adds: - Generic framework for expanding the languages that `LanguageParser` can handle, using the [tree-sitter](https://github.com/tree-sitter/py-tree-sitter#py-tree-sitter) parsing library and existing language-specific parsers written for it - Support for the following additional languages in `LanguageParser`: - C - C++ - C# - Go - Java (contributed by @Mario928 https://github.com/ThatsJustCheesy/langchain/pull/2) - Kotlin - Lua - Perl - Ruby - Rust - Scala - TypeScript (contributed by @Harrolee https://github.com/ThatsJustCheesy/langchain/pull/1) Here is the [design document](https://docs.google.com/document/d/17dB14cKCWAaiTeSeBtxHpoVPGKrsPye8W0o_WClz2kk) if curious, but no need to read it. ## Issues - Closes #11229 - Closes #10996 - Closes #8405 ## Dependencies `tree_sitter` and `tree_sitter_languages` on PyPI. We have tried to add these as optional dependencies. ## Documentation We have updated the list of supported languages, and also added a section to `source_code.ipynb` detailing how to add support for additional languages using our framework. ## Maintainer - @hwchase17 (previously reviewed https://github.com/langchain-ai/langchain/pull/6486) Thanks!! ## Git commits We will gladly squash any/all of our commits (esp merge commits) if necessary. Let us know if this is desirable, or if you will be squash-merging anyway. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Maaz Hashmi <mhashmi373@gmail.com> Co-authored-by: LeilaChr <87657694+LeilaChr@users.noreply.github.com> Co-authored-by: Jeremy La <jeremylai511@gmail.com> Co-authored-by: Megabear137 <zubair.alnoor27@gmail.com> Co-authored-by: Lee Harrold <lhharrold@sep.com> Co-authored-by: Mario928 <88029051+Mario928@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-13 08:45:49 -08:00
merlin-quix	729c6d6827	docs: add use case for managing chat messages via Apache Kafka (#16771 ) Adding a new notebook that demonstrates how to use LangChain's standard chat features while passing the chat messages back and forth via Apache Kafka. This goal is to simulate an architecture where the chat front end and the LLM are running as separate services that need to communicate with one another over an internal nework. It's an alternative to typical pattern of requesting a reponse from the model via a REST API (there's more info on why you would want to do this at the end of the notebook). NOTE: Assuming "uses cases" is the right place for this but feel free to propose another location. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-13 08:09:15 -08:00
Bagatur	3925071dd6	langchain[patch], templates[patch]: fix multi query retriever, web re… (#17434 ) …search retriever Fixes #17352	2024-02-12 22:52:07 -08:00
Bagatur	c0ce93236a	experimental[patch]: fix zero-shot pandas agent (#17442 )	2024-02-12 21:58:35 -08:00
Abhishek Jain	37e1275f9e	community[patch]: Fixed the 'aembed' method of 'CohereEmbeddings'. (#16497 ) Description: - The existing code was trying to find a `.embeddings` property on the `Coroutine` returned by calling `cohere.async_client.embed`. - Instead, the `.embeddings` property is present on the value returned by the `Coroutine`. - Also, it seems that the original cohere client expects a value of `max_retries` to not be `None`. Hence, setting the default value of `max_retries` to `3`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 21:57:27 -08:00
Sridhar Ramaswamy	9f1cbbc6ed	community[minor]: Add pebblo safe document loader (#16862 ) - Description: Pebblo opensource project enables developers to safely load data to their Gen AI apps. It identifies semantic topics and entities found in the loaded data and summarizes them in a developer-friendly report. - Dependencies: none - Twitter handle: srics @hwchase17	2024-02-12 21:56:12 -08:00
Preetam D'Souza	0834457f28	docs: Fix broken link in summarization use-case (#16554 ) - Description: Fix broken link to `StuffDocumentsChain` - Issue: N/A - Dependencies: None - Twitter handle: [@preetamdsouza](https://twitter.com/preetamdsouza)	2024-02-12 21:40:57 -08:00
Sheil Naik	d70a5bbf15	docs: Fix broken link in LLMs index.mdx (#16557 ) - Description: The [LLMs](https://python.langchain.com/docs/modules/model_io/llms/) page has a broken link. This fixes the link. - Issue: N/A - Dependencies: N/A - Twitter handle: @sheilnaik	2024-02-12 21:39:56 -08:00
mhavey	1bbb64d956	community[minor], langchian[minor]: Add Neptune Rdf graph and chain (#16650 ) Description: This PR adds a chain for Amazon Neptune graph database RDF format. It complements the existing Neptune Cypher chain. The PR also includes a Neptune RDF graph class to connect to, introspect, and query a Neptune RDF graph database from the chain. A sample notebook is provided under docs that demonstrates the overall effect: invoking the chain to make natural language queries against Neptune using an LLM. Issue: This is a new feature Dependencies: The RDF graph class depends on the AWS boto3 library if using IAM authentication to connect to the Neptune database. --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 21:30:20 -08:00
Michael Feil	e1cfd0f3e7	community[patch]: infinity embeddings update incorrect default url (#16759 ) The default url has always been incorrect (7797 instead 7997). Here is a update to the correct url.	2024-02-12 20:05:08 -08:00
Massimiliano Pronesti	df7cbd6fbb	community[minor]: add FlashRank ranker (#16785 ) Description: This PR adds support for [flashrank](https://github.com/PrithivirajDamodaran/FlashRank) for reranking as alternative to Cohere. I'm not sure `libs/langchain` is the right place for this change. At first, I wanted to put it under `libs/community`. All the compressors were under `libs/langchain/retrievers/document_compressors` though. Hope this makes sense!	2024-02-12 20:00:52 -08:00
Andreas Motl	1fdd9bd980	community/SQLDatabase: Generalize and trim software tests (#16659 ) - Description: Improve test cases for `SQLDatabase` adapter component, see [suggestion](https://github.com/langchain-ai/langchain/pull/16655#pullrequestreview-1846749474). - Depends on: GH-16655 - Addressed to: @baskaryan, @cbornet, @eyurtsev _Remark: This PR is stacked upon GH-16655, so that one will need to go in first._ Edit: Thank you for bringing in GH-17191, @eyurtsev. This is a little aftermath, improving/streamlining the corresponding test cases.	2024-02-12 22:58:34 -05:00
Theo / Taeyoon Kang	1987f905ed	core[patch]: Support .yml extension for YAML (#16783 ) - Description: [AS-IS] When dealing with a yaml file, the extension must be .yaml. [TO-BE] In the absence of extension length constraints in the OS, the extension of the YAML file is yaml, but control over the yml extension must still be made. It's as if it's an error because it's a .jpg extension in jpeg support. - Issue: - - Dependencies: no dependencies required for this change,	2024-02-12 19:57:20 -08:00
Kapil Sachdeva	cd00a87db7	community[patch] - in FAISS vector store, support passing custom DocStore implementation when using from_xxx methods (#16801 ) - Description: The from__xx methods of FAISS class have hardcoded InMemoryStore implementation and thereby not let users pass a custom DocStore implementation, - Issue: no referenced issue, - Dependencies: none, - Twitter handle: ksachdeva	2024-02-12 19:51:55 -08:00
Chris	f9f5626ca4	community[patch]: Fix github search issues and PRs PaginatedList has no len() error (#16806 ) Description: Bugfix: Langchain_community's GitHub Api wrapper throws a TypeError when searching for issues and/or PRs (the `search_issues_and_prs` method). This is because PyGithub's PageinatedList type does not support the len() method. See https://github.com/PyGithub/PyGithub/issues/1476 ![image](https://github.com/langchain-ai/langchain/assets/8849021/57390b11-ed41-4f48-ba50-f3028610789c) Dependencies: None Twitter handle: @ChrisKeoghNZ I haven't registered an issue as it would take me longer to fill the template out than to make the fix, but I'm happy to if that's deemed essential. I've added a simple integration test to cover this as there were no existing unit tests and it was going to be tricky to set them up. Co-authored-by: Chris Keogh <chris.keogh@xero.com>	2024-02-12 19:50:59 -08:00
morgana	722aae4fd1	community: add delete method to rocksetdb vectorstore to support recordmanager (#17030 ) - Description: This adds a delete method so that rocksetdb can be used with `RecordManager`. - Issue: N/A - Dependencies: N/A - Twitter handle: `@_morgan_adams_` --------- Co-authored-by: Rockset API Bot <admin@rockset.io>	2024-02-12 19:50:20 -08:00
yin1991	c454dc36fc	community[proxy]: Enhancement/add proxy support playwrighturlloader 16751 (#16822 ) - Description: Enhancement/add proxy support playwrighturlloader 16751 - Issue: [Enhancement: Add Proxy Support to PlaywrightURLLoader Class](https://github.com/langchain-ai/langchain/issues/16751) - Dependencies: - Twitter handle: @ootR77013489 --------- Co-authored-by: root <root@ip-172-31-46-160.ap-southeast-1.compute.internal> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 19:48:29 -08:00
Bhupesh Varshney	e3b775e035	infra: make `.gitignore` consistent with standard python gitignore (#16828 ) - The new .gitignore version is inherited from the one maintained by the github community over at https://github.com/github/gitignore/blob/main/Python.gitignore - This should cover all the cases of how a langchain app can be used.	2024-02-12 19:43:41 -08:00
James Braza	64938ae6f2	infra: unit testing `check_package_version` (#16825 ) Wrote a unit test for `check_package_version` in the core package. Note that this is a revival of https://github.com/langchain-ai/langchain/pull/16387 after GitHub incident (see https://github.com/langchain-ai/langchain/discussions/16796).	2024-02-12 19:39:58 -08:00
Max Jakob	604e117411	docs: another auth method for ElasticsearchStore (#16831 ) Users can also use their own Elasticsearch client object to configure the connection.	2024-02-12 19:29:54 -08:00
Zeeland	4986e7227e	docs: rm unnecessary imports (#16876 ) - Description: optimize the document of memory usage - Issue: it lose some install guide	2024-02-12 19:25:54 -08:00
Lingzhen Chen	30af711c34	community[patch]: update AzureSearch class to work with azure-search-documents=11.4.0 (#15659 ) - Description: Updates `libs/community/langchain_community/vectorstores/azuresearch.py` to support the stable version `azure-search-documents=11.4.0` - Issue: https://github.com/langchain-ai/langchain/issues/14534, https://github.com/langchain-ai/langchain/issues/15039, https://github.com/langchain-ai/langchain/issues/15355 - Dependencies: azure-search-documents>=11.4.0 --------- Co-authored-by: Clément Tamines <Skar0@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 19:23:35 -08:00
Robby	e135dc70c3	community[patch]: Invoke callback prior to yielding token (#17348 ) Description: Invoke callback prior to yielding token in stream method for Ollama. Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Co-authored-by: Robby <h0rv@users.noreply.github.com>	2024-02-12 19:22:55 -08:00
Christophe Bornet	ab025507bc	community[patch]: Add async methods to VectorStoreQATool (#16949 )	2024-02-12 19:19:50 -08:00
Christophe Bornet	fb7552bfcf	Add async methods to InMemoryCache (#17425 ) Add async methods to InMemoryCache	2024-02-12 22:02:38 -05:00
Eugene Yurtsev	93472ee9e6	core[patch]: Replace memory stream implementation used by LogStreamCallbackHandler (#17185 ) This PR replaces the memory stream implementation used by the LogStreamCallbackHandler. This implementation resolves an issue in which streamed logs and streamed events originating from sync code would arrive only after the entire sync code would finish execution (rather than arriving in real time as they're generated). One example is if trying to stream tokens from an llm within a tool. If the tool was an async tool, but the llm was invoked via stream (sync variant) rather than astream (async variant), then the tokens would fail to stream in real time and would all arrived bunched up after the tool invocation completed.	2024-02-12 21:57:38 -05:00
yin1991	37ef6ac113	community[patch]: Add Pagination to GitHubIssuesLoader for Efficient GitHub Issues Retrieval (#16934 ) - Description: Add Pagination to GitHubIssuesLoader for Efficient GitHub Issues Retrieval - Issue: [the issue # it fixes if applicable,](https://github.com/langchain-ai/langchain/issues/16864) --------- Co-authored-by: root <root@ip-172-31-46-160.ap-southeast-1.compute.internal> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 18:30:36 -08:00
Leonid Ganeline	b87d6f9f48	docs: `Redis` page update (#16906 ) - Reordered sections - Applied consistent formatting - Fixed headers (there were 2 H1 headers; this breaks CoT) - Added `Settings` header and moved all related sections under it	2024-02-12 18:23:35 -08:00
Bagatur	22638e5927	community[patch]: give reranker default client val (#17289 )	2024-02-12 17:21:53 -08:00

1 2 3 4 5 ...

7493 Commits