langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
Luis Antonio Vieira Junior	67c880af74	community[patch]: adding linearization config to AmazonTextractPDFLoader (#17489 ) - Description: Adding an optional parameter `linearization_config` to the `AmazonTextractPDFLoader` so the caller can define how the output will be linearized, instead of forcing a predefined set of linearization configs. It will still have a default configuration as this will be an optional parameter. - Issue: #17457 - Dependencies: The same ones that already exist for `AmazonTextractPDFLoader` - Twitter handle: [@lvieirajr19](https://twitter.com/lvieirajr19) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 17:25:22 -08:00
Erick Friis	4f4300723b	docs: pinecone client version note (#17491 )	2024-03-08 17:09:17 -08:00
AtomicVar	23e62f8f8d	docs: fix lists display issue (#17911 ) Description: Fix lists display issues in Docs > Use Cases > Q&A with RAG > Quickstart. In essence, this PR changes: ```markdown Some paragraph. - Item a. - Item b. ``` to: ```markdown Some paragraph. - Item a. - Item b. ``` There needs an extra empty line to make the list rendered properly. FYI, the old version is displayed not properly as: <img width="856" alt="image" src="https://github.com/langchain-ai/langchain/assets/22856433/65202577-8ea2-47c6-b310-39bf42796fac"> - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:52:16 -08:00
Brace Sproul	9c218d0154	docs[patch]: Update how GA4 is collected (#18821 ) There's some issue/setting with the current python GA4 app. I created a new one just for feedback.	2024-03-08 14:32:40 -08:00
Ishani Vyas	2b0cbd65ba	community[patch]: Add Passio Nutrition AI Food Search Tool to Community Package (#18278 ) ## Add Passio Nutrition AI Food Search Tool to Community Package ### Description We propose adding a new tool to the `community` package, enabling integration with Passio Nutrition AI for food search functionality. This tool will provide a simple interface for retrieving nutrition facts through the Passio Nutrition AI API, simplifying user access to nutrition data based on food search queries. ### Implementation Details - Class Structure: Implement `NutritionAI`, extending `BaseTool`. It includes an `_run` method that accepts a query string and, optionally, a `CallbackManagerForToolRun`. - API Integration: Use `NutritionAIAPI` for the API wrapper, encapsulating all interactions with the Passio Nutrition AI and providing a clean API interface. - Error Handling: Implement comprehensive error handling for API request failures. ### Expected Outcome - User Benefits: Enable easy querying of nutrition facts from Passio Nutrition AI, enhancing the utility of the `langchain_community` package for nutrition-related projects. - Functionality: Provide a straightforward method for integrating nutrition information retrieval into users' applications. ### Dependencies - `langchain_core` for base tooling support - `pydantic` for data validation and settings management - Consider `requests` or another HTTP client library if not covered by `NutritionAIAPI`. ### Tests and Documentation - Unit Tests: Include tests that mock network interactions to ensure tool reliability without external API dependency. - Documentation: Create an example notebook in `docs/docs/integrations/tools/passio_nutrition_ai.ipynb` showing usage, setup, and example queries. ### Contribution Guidelines Compliance - Adhere to the project's linting and formatting standards (`make format`, `make lint`, `make test`). - Ensure compliance with LangChain's contribution guidelines, particularly around dependency management and package modifications. ### Additional Notes - Aim for the tool to be a lightweight, focused addition, not introducing significant new dependencies or complexity. - Potential future enhancements could include caching for common queries to improve performance. ### Twitter Handle - Here is our Passio AI [twitter handle](https://twitter.com/@passio_ai) where we announce our products. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-08 20:33:22 +00:00
Aaron Jimenez	bd9f98a20b	docs: Fix typo in modules/chains.ipynb (#18808 ) Description: Fix a minor typo in `modules/chains.ipynb`. - Issue: fixes #17851	2024-03-08 12:09:20 -08:00
Tomaz Bratanic	c0bdd4d45b	docs: Add main graph documentation (#18021 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 20:03:03 +00:00
Leonid Ganeline	7c8c4e5743	docs: `providers` update 7 (#18620 ) Added missed providers. Added missed integrations. Formatted to the consistent form. Fixed outdated imports.	2024-03-08 12:00:27 -08:00
kAIto47802	ff70cc4e80	docs: fix typo (#18810 ) Fixed typo in docs	2024-03-08 13:28:17 -05:00
Leonid Ganeline	3624f56ccb	docs: update imports of `retrievers` to use `langchain_community` (#18707 ) Updated `langchain` imports to `langchain_community`.	2024-03-08 13:04:38 -05:00
Leonid Ganeline	48eed86931	docs: update imports of `memory` to use `langchain_community` (#18689 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-08 13:02:31 -05:00
aditya thomas	a35203b164	docs: (minor) update to anthropic doc (#18794 ) Description: Minor update to Anthropic documentation Issue: Not applicable Dependencies: None Lint and test: `make format` and `make lint` was done	2024-03-08 09:48:04 -08:00
Paul Sanders	93b87f2bfb	docs: Fix typo (#18545 ) Fixing a minor typo in the package name. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-07 19:40:42 -08:00
Aaron Jimenez	fcf6213c22	docs: Fix link to HF TEI in text_embeddings_inference.ipynb (#18682 ) - [ ] PR title: docs: Fix link to HF TEI in text_embeddings_inference.ipynb - [ ] PR message: - Description: Fix the link to [Hugging Face Text Embeddings Inference (TEI)](https://huggingface.co/docs/text-embeddings-inference/index) in text_embeddings_inference.ipynb - Issue: Fix #18576	2024-03-07 19:38:39 -08:00
Averi Kitsch	8accee57a9	docs: update Google Cloud database integration docs (#18711 ) Description: update Google Cloud database integration docs Issue: NA Dependencies: NA	2024-03-07 19:36:00 -08:00
Ian	7f504c1f81	docs: Improve the tidb vector store notebook (#18773 ) Remove redundant useless content, and fix some minor oversight	2024-03-07 19:15:55 -08:00
Yunmo Koo	fee6f983ef	community[minor]: Integration for `Friendli` LLM and `ChatFriendli` ChatModel. (#17913 ) ## Description - Add [Friendli](https://friendli.ai/) integration for `Friendli` LLM and `ChatFriendli` chat model. - Unit tests and integration tests corresponding to this change are added. - Documentations corresponding to this change are added. ## Dependencies - Optional dependency [`friendli-client`](https://pypi.org/project/friendli-client/) package is added only for those who use `Frienldi` or `ChatFriendli` model. ## Twitter handle - https://twitter.com/friendliai	2024-03-08 02:20:47 +00:00
Ian	390ef6abe3	community[minor]: Add Initial Support for TiDB Vector Store (#15796 ) This pull request introduces initial support for the TiDB vector store. The current version is basic, laying the foundation for the vector store integration. While this implementation provides the essential features, we plan to expand and improve the TiDB vector store support with additional enhancements in future updates. Upcoming Enhancements: * Support for Vector Index Creation: To enhance the efficiency and performance of the vector store. * Support for max marginal relevance search. * Customized Table Structure Support: Recognizing the need for flexibility, we plan for more tailored and efficient data store solutions. Simple use case exmaple ```python from typing import List, Tuple from langchain.docstore.document import Document from langchain_community.vectorstores import TiDBVectorStore from langchain_openai import OpenAIEmbeddings db = TiDBVectorStore.from_texts( embedding=embeddings, texts=['Andrew like eating oranges', 'Alexandra is from England', 'Ketanji Brown Jackson is a judge'], table_name="tidb_vector_langchain", connection_string=tidb_connection_url, distance_strategy="cosine", ) query = "Can you tell me about Alexandra?" docs_with_score: List[Tuple[Document, float]] = db.similarity_search_with_score(query) for doc, score in docs_with_score: print("-" * 80) print("Score: ", score) print(doc.page_content) print("-" * 80) ```	2024-03-07 17:18:20 -08:00
Eugene Yurtsev	1e1cac50d8	Docs: remove sales from security (#18762 ) Remove sales from security	2024-03-07 17:35:46 -05:00
Eugene Yurtsev	ca299a8e08	Docs: Add custom parsing documentation and extending langchain (#18331 ) * Added extending langchain.mdx -- we'll need to add links as we add more custom documentation * Added partial documentation about parsers	2024-03-07 16:30:57 -05:00
Leonid Ganeline	dad949eb99	docs: update imports of `adapters` to use langchain_community (#18751 ) Updated imports from `langchain` to `langchain_community`	2024-03-07 15:04:25 -05:00
Leonid Ganeline	1af2130ff7	docs: update imports of tools to use langchain_community (#18705 ) Updated imports from `langchain` to `langchain_community`.	2024-03-07 11:46:09 -05:00
Sam Khano	1b4dcf22f3	community[minor]: Add DocumentDBVectorSearch VectorStore (#17757 ) Description: - Added Amazon DocumentDB Vector Search integration (HNSW index) - Added integration tests - Updated AWS documentation with DocumentDB Vector Search instructions - Added notebook for DocumentDB integration with example usage --------- Co-authored-by: EC2 Default User <ec2-user@ip-172-31-95-226.ec2.internal>	2024-03-06 15:11:34 -08:00
Vittorio Rigamonti	51f3902bc4	community[minor]: Adding support for Infinispan as VectorStore (#17861 ) Description: This integrates Infinispan as a vectorstore. Infinispan is an open-source key-value data grid, it can work as single node as well as distributed. Vector search is supported since release 15.x For more: [Infinispan Home](https://infinispan.org) Integration tests are provided as well as a demo notebook	2024-03-06 15:11:02 -08:00
Max Jakob	cca0167917	elasticsearch[patch], community[patch]: update references, deprecate community classes (#18506 ) Follow up on https://github.com/langchain-ai/langchain/pull/17467. - Update all references to the Elasticsearch classes to use the partners package. - Deprecate community classes. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-06 15:09:12 -08:00
Djordje	12b4a4d860	community[patch]: Opensearch delete method added - indexing supported (#18522 ) - Description: Added delete method for OpenSearchVectorSearch, therefore indexing supported - Issue: No - Dependencies: No - Twitter handle: stkbmf	2024-03-06 15:08:47 -08:00
Leonid Ganeline	81cbf0f2fd	docs: update import paths for callbacks to use langchain_community callbacks where applicable (#18691 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-06 14:49:06 -05:00
Leonid Ganeline	fb686333ac	docs: fix `streamlit` provider (#18606 ) There is a wrong python package import. Fixed it.	2024-03-06 11:42:26 -08:00
aditya thomas	97de498d39	docs: update to the streaming tutorial notebook in the lcel documentation (#18378 ) Description: Update to the streaming tutorial notebook in the LCEL documentation Issue: Fixed an import and (minor) changes in documentation language Dependencies: None	2024-03-06 10:47:22 -08:00
Guangdong Liu	32db9e74e4	docs: Fix some issues with sparkllm use cases (#17674 )	2024-03-06 10:46:51 -08:00
Eugene Yurtsev	b9f3c7a0c9	Use Case: Extraction set temperature to 0, qualify a statement (#18672 ) Minor changes: 1) Set temperature to 0 (important) 2) Better qualify one of the statements with confidence	2024-03-06 12:35:45 -05:00
Eugene Yurtsev	a4a6978224	Docs: Revamp Extraction Use Case (#18588 ) Revamp the extraction use case documentation --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-06 09:18:25 -05:00
Leonid Ganeline	114d64d4a7	docs: `providers` update (#18527 ) Added missed pages. Added links and descriptions. Foratted to the consistent form.	2024-03-05 17:32:59 -08:00
PSV	d7dd3cd248	docs: structured_output (#18608 ) - Description: Fixed some typos and copy errors in the Beta Structured Output docs - Issue: N/A - Dependencies: Docs only - Twitter handle: @psvann Co-authored-by: P.S. Vann <psvann@yahoo.com>	2024-03-05 17:20:06 -08:00
Bagatur	29f1619d61	docs: why lcel nit (#18616 )	2024-03-05 17:10:47 -08:00
Bagatur	080904689c	docs: text splitters install (#18589 )	2024-03-05 16:19:37 -08:00
Sunchao Wang	dc81dba6cf	community[patch]: Improve amadeus tool and doc (#18509 ) Description: This pull request addresses two key improvements to the langchain repository: Fix for Crash in Flight Search Interface: Previously, the code would crash when encountering a failure scenario in the flight ticket search interface. This PR resolves this issue by implementing a fix to handle such scenarios gracefully. Now, the code handles failures in the flight search interface without crashing, ensuring smoother operation. Documentation Update for Amadeus Toolkit: Prior to this update, examples provided in the documentation for the Amadeus Toolkit were unable to run correctly due to outdated information. This PR includes an update to the documentation, ensuring that all examples can now be executed successfully. With this update, users can effectively utilize the Amadeus Toolkit with accurate and functioning examples. These changes aim to enhance the reliability and usability of the langchain repository by addressing issues related to error handling and ensuring that documentation remains up-to-date and actionable. Issue: https://github.com/langchain-ai/langchain/issues/17375 Twitter Handle: SingletonYxx	2024-03-05 16:17:22 -08:00
Utkarsh Kapil	539a13dbda	docs: minor spelling errors (#18429 ) Description: Noticed spelling errors. 'Colab' mispelt as 'Collab'. https://python.langchain.com/docs/use_cases Dependencies: n/a	2024-03-05 15:54:15 -08:00
Dounx	ad48f55357	community[minor]: add Yuque document loader (#17924 ) This pull request support loading documents from Yuque with Langchain. Yuque is a professional cloud-based knowledge base for team collaboration in documentation. Website: https://www.yuque.com OpenAPI: https://www.yuque.com/yuque/developer/openapi	2024-03-05 15:54:07 -08:00
Kazuki Maeda	60c5d964a8	community[minor]: use jq schema for content_key in json_loader (#18003 ) ### Description Changed the value specified for `content_key` in JSONLoader from a single key to a value based on jq schema. I created [similar PR](https://github.com/langchain-ai/langchain/pull/11255) before, but it has several conflicts because of the architectural change associated stable version release, so I re-create this PR to fit new architecture. ### Why For json data like the following, specify `.data[].attributes.message` for page_content and `.data[].attributes.id` or `.data[].attributes.attributes. tags`, etc., the `content_key` must also parse the json structure. <details> <summary>sample json data</summary> ```json { "data": [ { "attributes": { "message": "message1", "tags": [ "tag1" ] }, "id": "1" }, { "attributes": { "message": "message2", "tags": [ "tag2" ] }, "id": "2" } ] } ``` </details> <details> <summary>sample code</summary> ```python def metadata_func(record: dict, metadata: dict) -> dict: metadata["source"] = None metadata["id"] = record.get("id") metadata["tags"] = record["attributes"].get("tags") return metadata sample_file = "sample1.json" loader = JSONLoader( file_path=sample_file, jq_schema=".data[]", content_key=".attributes.message", ## content_key is parsable into jq schema is_content_key_jq_parsable=True, ## this is added parameter metadata_func=metadata_func ) data = loader.load() data ``` </details> ### Dependencies none ### Twitter handle [kzk_maeda](https://twitter.com/kzk_maeda)	2024-03-05 15:51:24 -08:00
Rodrigo Nogueira	f4bb33bbf3	docs: fix link and missing package (#18405 ) Issue: fix broken links and missing package on colab example	2024-03-05 15:50:06 -08:00
Max Jakob	81e9ab6e3a	docs: Update elasticsearch README (#18497 ) Update Elasticsearch README with information on how to start a deployment. Also make some cosmetic changes to the [Elasticsearch docs](https://python.langchain.com/docs/integrations/vectorstores/elasticsearch). Follow up on https://github.com/langchain-ai/langchain/pull/17467	2024-03-05 15:49:16 -08:00
Hech	6a08134661	community[patch], langchain[minor]: Add retriever self_query and score_threshold in DingoDB (#18106 )	2024-03-05 15:47:29 -08:00
Bagatur	1569b19191	docs: query analysis links (#18614 )	2024-03-05 15:05:44 -08:00
Asaf Joseph Gardin	27441555d0	ai21[patch]: AI21 Labs Contextual Answers support (#18270 ) Description: Added support for AI21 Labs model - Contextual Answers Dependencies: ai21, ai21-tokenizer Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 22:42:04 +00:00
Leonid Ganeline	bd4993141d	docs: `providers` update 5 (#18550 ) Added missed sections. Added descriptions.	2024-03-05 12:55:13 -08:00
Reuben Zotz-Wilson	96cd50938a	community:update telegram notebook (#18569 ) Description: modified the user_name to username to conform with the expected inputs to TelegramChatApiLoader Issue: Current code fails in langchain-community 0.0.24 <loader = TelegramChatApiLoader( chat_entity="<CHAT_URL>", # recommended to use Entity here api_hash="<API HASH >", api_id="<API_ID>", user_name="", # needed only for caching the session. )>	2024-03-05 11:47:17 -08:00
Jib	9da1e0cf34	mongodb[patch]: Migrate MongoDBChatMessageHistory (#18590 ) ## Description Migrate the `MongoDBChatMessageHistory` to the managed `langchain-mongodb` partner-package ## Dependencies None ## Twitter handle @mongodb ## tests and docs - [x] Migrate existing integration test - [x ]~ Convert existing integration test to a unit test~ Creation is out of scope for this ticket - [x ] ~Considering delaying work until #17470 merges to leverage the `MockCollection` object. ~ - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 18:53:02 +00:00
Jib	f92f7d2e03	mongodb[minor]: Add MongoDB LLM Cache (#17470 ) # Description - Description: Adding MongoDB LLM Caching Layer abstraction - Issue: N/A - Dependencies: None - Twitter handle: @mongodb Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR Message (above) - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @efriis, @eyurtsev, @hwchase17. --------- Co-authored-by: Jib <jib@byblack.us>	2024-03-05 10:38:39 -08:00
Erick Friis	07f23c2d45	docs: anthropic multimodal (#18586 )	2024-03-05 16:58:06 +00:00

1 2 3 4 5 ...

3186 Commits