**Description:** Update the list of partner packages on the providers page
**Issue:** Google & Nvidia had two entries each, both pointing to the
same page
**Dependencies:** None
**Description**
This PR sets the "caller identity" of the Astra DB clients used by the
integration plugins (`AstraDBChatMessageHistory`, `AstraDBStore`,
`AstraDBByteStore` and, pending #17767, `AstraDBVectorStore`). This way,
requests to the Astra DB Data API coming from within LangChain are
identified as such (the purpose is anonymous usage statistics, to best
improve the Astra DB service).
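For illustration, a sketch of how such a caller identity can be attached to an astrapy client (parameter names assumed from astrapy's client API; credentials and version string are placeholders):

```python
from astrapy.db import AstraDB

# Requests issued by this client carry the caller identity, so the
# Data API can attribute them to LangChain (anonymous usage stats).
client = AstraDB(
    token="AstraCS:...",  # placeholder credentials
    api_endpoint="https://<db-id>-<region>.apps.astra.datastax.com",
    caller_name="langchain",
    caller_version="0.1.0",  # placeholder version string
)
```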
- **Description:** A generic document loader adapter for SQLAlchemy on
top of LangChain's `SQLDatabaseLoader`.
- **Needed by:** https://github.com/crate-workbench/langchain/pull/1
- **Depends on:** GH-16655
- **Addressed to:** @baskaryan, @cbornet, @eyurtsev
Hi from CrateDB again,
in the same spirit as GH-16243 and GH-16244, this patch breaks out
another commit from https://github.com/crate-workbench/langchain/pull/1,
in order to reduce the size of the main patch before submitting it, and
to separate concerns.
To accompany the SQLAlchemy adapter implementation, the patch includes
integration tests for both SQLite and PostgreSQL. Let me know if
corresponding utility resources should be added at different spots.
With kind regards,
Andreas.
### Software Tests
```console
docker compose --file libs/community/tests/integration_tests/document_loaders/docker-compose/postgresql.yml up
```
```console
cd libs/community
pip install psycopg2-binary
pytest -vvv tests/integration_tests -k sqldatabase
```
```
14 passed
```
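For reference, a minimal usage sketch of the loader, assuming the interface proposed in this patch (module path and parameter names are illustrative):

```python
from langchain_community.document_loaders.sql_database import SQLDatabaseLoader
from langchain_community.utilities.sql_database import SQLDatabase

# Connect through SQLAlchemy and turn each result row into a Document.
db = SQLDatabase.from_uri("sqlite:///example.db")
loader = SQLDatabaseLoader(query="SELECT * FROM mytable", db=db)
documents = loader.load()
```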
![image](https://github.com/langchain-ai/langchain/assets/453543/42be233c-eb37-4c76-a830-474276e01436)
---------
Co-authored-by: Andreas Motl <andreas.motl@crate.io>
Some emails from Flipkart and Amazon are encoded in a different
plain-text format; to handle the resulting UnicodeDecodeError, this adds
an exception handler with a Latin-1 decoder fallback.
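A minimal sketch of the fallback (function and variable names hypothetical):

```python
def decode_payload(payload: bytes) -> str:
    # Try UTF-8 first; on failure fall back to Latin-1, which maps
    # every byte value and therefore never raises UnicodeDecodeError.
    try:
        return payload.decode("utf-8")
    except UnicodeDecodeError:
        return payload.decode("latin-1")
```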
Thank you for contributing to LangChain!
@hwchase17
## PR title
langchain_nvidia_ai_endpoints[patch]: Invoke callback prior to yielding
## PR message
**Description:** Invoke the callback prior to yielding the token in the
`_stream` and `_astream` methods for `nvidia_ai_endpoints`.
**Issue:** https://github.com/langchain-ai/langchain/issues/16913
**Dependencies:** None
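An illustrative sketch of the ordering change (simplified; not the actual `nvidia_ai_endpoints` code):

```python
from typing import Callable, Iterator, Optional

def stream_tokens(
    tokens: Iterator[str],
    on_new_token: Optional[Callable[[str], None]] = None,
) -> Iterator[str]:
    for token in tokens:
        if on_new_token is not None:
            on_new_token(token)  # invoke the callback first ...
        yield token              # ... then yield the token
```

Firing the callback before `yield` means handlers observe every token even if the consumer stops iterating early.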
- **Description:** Add the possibility to pass a `ModelInference` or
`Model` object to the `WatsonxLLM` class
- **Dependencies:**
[ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/)
- **Tag maintainer:**
Please make sure your PR is passing linting and testing before
submitting. Run `make format`, `make lint` and `make test` to check this
locally. ✅
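A construction sketch, under the assumption that the new parameter is named `watsonx_model` (adjust to the merged API):

```python
from ibm_watsonx_ai.foundation_models import ModelInference
from langchain_ibm import WatsonxLLM

# Build the client with ibm-watsonx-ai directly, then hand the ready
# object to WatsonxLLM instead of passing individual credentials.
model = ModelInference(
    model_id="google/flan-ul2",
    credentials={"url": "https://us-south.ml.cloud.ibm.com", "apikey": "..."},
    project_id="<project-id>",
)
llm = WatsonxLLM(watsonx_model=model)  # `watsonx_model` is an assumption
```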
**Description**: This PR adds support for using the
[LLMLingua project](https://github.com/microsoft/LLMLingua), especially
LongLLMLingua (Enhancing Large Language Model Inference via Prompt
Compression), as a document compressor / transformer.
The LLMLingua project can greatly improve RAG systems by compressing
prompts and contexts while keeping their semantic relevance.
**Issue**: https://github.com/microsoft/LLMLingua/issues/31
**Dependencies**: [llmlingua](https://pypi.org/project/llmlingua/)
@baskaryan
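Below is a short usage sketch, assuming the compressor lands in `langchain_community` as `LLMLinguaCompressor` (class name, module path, and parameters are illustrative):

```python
from langchain.retrievers import ContextualCompressionRetriever
from langchain_community.document_compressors import LLMLinguaCompressor

# Wrap an existing retriever so retrieved context is compressed by
# LLMLingua before it reaches the LLM prompt.
compressor = LLMLinguaCompressor(model_name="openai-community/gpt2", device_map="cpu")
retriever = ContextualCompressionRetriever(
    base_compressor=compressor,
    base_retriever=base_retriever,  # placeholder: any existing retriever
)
```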
---------
Co-authored-by: Ayodeji Ayibiowu <ayodeji.ayibiowu@getinge.com>
Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
### Description
This PR moves the Elasticsearch classes to a partners package.
Note that we will not move (and will later remove) `ElasticKnnSearch`;
it was previously deprecated.
`ElasticVectorSearch` is going to stay in the community package since it
is still used quite a lot.
Also note that I left the `ElasticsearchTranslator` for self-query
untouched because it resides in the main `langchain` package.
### Dependencies
There will be another PR that updates the notebooks (potentially pulling
them into the partners package) and templates and removes the classes
from the community package, see
https://github.com/langchain-ai/langchain/pull/17468
#### Open question
How to make the transition smooth for users? Do we keep the import
aliases and require people to install `langchain-elasticsearch`? Or do
we remove the import aliases from the `langchain` package altogether?
What has worked well for other partner packages?
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
**Description**
Adding different threshold types to the semantic chunker. I’ve had much
better and more predictable performance when using standard deviations
instead of percentiles.
![image](https://github.com/langchain-ai/langchain/assets/44395485/066e84a8-460e-4da5-9fa1-4ff79a1941c5)
For all the documents I’ve tried, the distribution of distances looks
similar to the above: a positively skewed normal distribution. All skews
I’ve seen are less than 1, which explains why standard deviations
perform well, but I’ve included IQR if anyone wants something more
robust.
Also, using the percentile method backwards, you can declare the number
of clusters and use semantic chunking to get an ‘optimal’ splitting.
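A short usage sketch with names as proposed in this PR (treat as illustrative):

```python
from langchain_experimental.text_splitter import SemanticChunker
from langchain_openai import OpenAIEmbeddings

# Pick the breakpoint statistic that matches your distance distribution:
# "percentile" (default), "standard_deviation", or "interquartile".
chunker = SemanticChunker(
    OpenAIEmbeddings(),
    breakpoint_threshold_type="standard_deviation",
)
chunks = chunker.split_text("First topic ... second topic ...")
```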
---------
Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
- **Description:** By default it expects a list, but that's not the case
in corner scenarios where no document has been ingested (use case:
bootstrapping an application). Hence added a check: if the instance is a
pandas DataFrame instead of a list, it returns immediately.
- **Issue:** NA
- **Dependencies:** NA
- **Twitter handle:** jaskiratsingh1
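A minimal sketch of the guard described above (function and variable names hypothetical):

```python
import pandas as pd

def _coerce_result(result):
    # When no documents have been ingested, the store may hand back a
    # pandas DataFrame instead of a list; return it immediately.
    if isinstance(result, pd.DataFrame):
        return result
    # Otherwise proceed with the usual list handling.
    return list(result)
```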
---------
Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
## Description & Issue
While following the official doc to use ClickHouse as a vectorstore, I
found that only the default `annoy` index is properly supported. But I
want to try another engine, `usearch`, because `annoy` is not properly
supported on ARM platforms.
Here are the settings I prefer:
```python
settings = ClickhouseSettings(
    table="wiki_Ethereum",
    index_type="usearch",  # annoy by default
    index_param=[],
)
```
The above settings do not work because the command `SET
allow_experimental_annoy_index=1` is hard-coded.
This PR makes sure the experimental-feature setting follows `index_type`,
which is also consistent with ClickHouse's naming conventions.
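A sketch of the proposed behavior (helper name hypothetical):

```python
def enable_experimental_index(client, index_type: str) -> None:
    # Derive the experimental-feature switch from the configured index
    # type instead of hard-coding annoy; ClickHouse names these settings
    # allow_experimental_<engine>_index, e.g. allow_experimental_usearch_index.
    client.command(f"SET allow_experimental_{index_type}_index=1")
```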
**Description:** Update the example Fiddler notebook to use the
community import path instead of `langchain.callbacks`
**Dependencies:** None
**Twitter handle:** @bhalder
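The gist of the change is the import path; the handler should now come from the community package (path hedged as expected):

```python
# New import path (community package); the old location under
# `langchain.callbacks` is deprecated.
from langchain_community.callbacks.fiddler_callback import FiddlerCallbackHandler
```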
Co-authored-by: Barun Halder <barun@fiddler.ai>