langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Taraka Nithin Vankala	eec023766e	docs: Corrected error (#19030 ) - [ ] PR title: "docs: correction in "https://github.com/langchain-ai/langchain/blob/master/docs/docs/get_started/quickstart.mdx", line 289". - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: - Corrected the spelling mistake - #18981	2024-03-15 16:02:33 -07:00
Christophe Bornet	f2a7dda4bd	community[patch]: Use langchain-astradb for AstraDB doc loader (#19071 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:57:25 +00:00
Leonid Ganeline	a49ac55964	docs: `providers` update 8 (#19053 ) Added missed providers. Added missed integrations. Fixed format.	2024-03-15 15:49:14 -07:00
Holt Skinner	cee03630d9	community[patch]: Add Blended Search Support to `GoogleVertexAISearchRetriever` (#19082 ) https://cloud.google.com/generative-ai-app-builder/docs/create-data-store-es#multi-data-stores --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:39:31 +00:00
William W Wang	0a784074d1	docs: Update llm_caching.ipynb (#19085 )	2024-03-15 22:35:48 +00:00
William W Wang	6327be9048	docsUpdate azure_cosmos_db.ipynb (#19087 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:33:26 +00:00
Anubhav Madhav	553a520ab6	docs: Fixed Grammar in Considerations of Model I/O Concepts (#19091 ) Fixed Grammar in Considerations of Model I/O Concepts documentation page - Update concepts.mdx Page Link: https://python.langchain.com/docs/modules/model_io/concepts#considerations - Description: Fixed Grammar in Considerations of Model I/O Documentation Page - Issue: "to work well with the model are you using" # "to work well with the model you are using" - Dependencies: None - Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:31:39 +00:00
Shotaro Sano	d647ff1a9a	docs: Fix execution results of `docs/docs/modules/data_connection/indexing.ipynb` (#19112 ) ## Description This PR addresses a documentation issue in the [Indexing](https://python.langchain.com/docs/modules/data_connection/indexing) page. Specifically, it corrects the execution results of the Jupyter notebook under the [Source](https://python.langchain.com/docs/modules/data_connection/indexing#source) section, which were broken as detailed below. ## Problem The execution results following the statement, `This should delete the old versions of documents associated with doggy.txt source and replace them with the new versions.`, appear to be incorrect, as described below. ### Current Behavior - For some reason, the `index` function fails to add the new content of `doggy.txt`. Although it deletes the document objects associated with the `doggy.txt` source, it does not add the objects in `changed_doggy_docs`. Consequently, the execution result displays `num_added: 0`. - This unexpected behavior also impacts the results of `vectorstore.similarity_search("dog", k=30)`, showing only the contents of `kitty.txt`. It appears as though the contents of `doggy.txt` have been completely removed from the index: ``` Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ### Expected Behavior - The `index` function should successfully add the objects in `changed_doggy_docs` after removing the old content of `doggy.txt`. The anticipated execution result is `num_added: 2`. - Subsequently, the modified content of `doggy.txt` should appear in the results of `vectorstore.similarity_search("dog", k=30)` as follows: ``` [Document(page_content='woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='woof woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ## Fix I reran `docs/docs/modules/data_connection/indexing.ipynb` and have included the diff in this PR.	2024-03-15 22:27:15 +00:00
Guangdong Liu	cced3eb9bc	community[patch]: Fix sparkllm embeddings api bug. (#19122 ) - Description: Fix sparkllm embeddings api bug. @baskaryan PTAL	2024-03-15 15:08:49 -07:00
samanhappy	b9c62fb905	docs: fix API link for BaseLoader (#19128 ) The link to the BaseLoader API requires an update as it has been moved into the `langchain_core` package.	2024-03-15 14:46:05 -07:00
Kostas Botsas	527676a753	docs: Fix source column xata.ipynb (#19137 ) Docs fix: replace column name search with source. The Xata integration expects metadata column named "source". The docs suggest the name "search", which if used, yields the following error: ``` File "/usr/local/lib/python3.11/site-packages/langchain_community/vectorstores/xata.py", line 95, in _add_vectors raise Exception(f"Error adding vectors to Xata: {r.status_code} {r}") Exception: Error adding vectors to Xata: 400 {'errors': [{'status': 400, 'message': 'invalid record: column [source]: column not found'}]} ```	2024-03-15 14:06:18 -07:00
fengjial	c922ea36cb	community[minor]: Add Baidu VectorDB as vector store (#17997 ) Co-authored-by: fengjialin <fengjialin@MacBook-Pro.local>	2024-03-15 19:01:58 +00:00
aditya thomas	190887c5cd	docs: update the list of providers (#19012 ) Description: Update the list of LangChain providers Issue: Make the list of LangChain providers current Dependencies: None	2024-03-15 12:00:24 -07:00
Erick Friis	bbe164ad28	docs: voyageai as provider (#19154 )	2024-03-15 10:12:37 -07:00
Erick Friis	781aee0068	community, langchain, infra: revert store extended test deps outside of poetry (#19153 ) Reverts langchain-ai/langchain#18995 Because it makes installing dependencies in python 3.11 extended testing take 80 minutes	2024-03-15 17:10:47 +00:00
Leonid Kuligin	e3ff107e4f	docs: updated google integration related imports in the documentation (#19131 ) updated imports in the documentation for google vertex	2024-03-15 09:30:50 -04:00
Erick Friis	9e569d85a4	community, langchain, infra: store extended test deps outside of poetry (#18995 ) poetry can't reliably handle resolving the number of optional "extended test" dependencies we have. If we instead just rely on pip to install extended test deps in CI, this isn't an issue.	2024-03-15 05:55:30 +00:00
Erick Friis	7ce81eb6f4	voyageai[patch]: init package (#19098 ) Co-authored-by: fodizoltan <zoltan@conway.expert> Co-authored-by: Yujie Qian <thomasq0809@gmail.com> Co-authored-by: fzowl <160063452+fzowl@users.noreply.github.com>	2024-03-15 00:56:10 +00:00
Brace Sproul	98cd8f673b	docs[minor]ci[minor]: Add script & CI to check recurring links daily (#19100 )	2024-03-14 17:42:22 -07:00
billytrend-cohere	7253b816cc	community: Add support for cohere SDK v5 (keeps v4 backwards compatibility) (#19084 ) - Description: Add support for cohere SDK v5 (keeps v4 backwards compatibility) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-14 15:53:24 -07:00
Bagatur	e276817e1d	docs: fix vercel build script (#19090 ) amazon linux 2023 doesn't have `amazon-linux-extras` but shoudl have python3.9 by default	2024-03-14 20:53:43 +00:00
Anthony Yang	688a5bd106	docs:fixed typo in streaming document (#19045 ) Fixed typo in line 661 - from 'mimimize' to 'minimize - [ ] PR message: - Description: Fixed typo in streaming document - change 'mimimize' to 'minimize If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-14 19:38:53 +00:00
Bagatur	0ae39ab30e	docs: make links internal (#19063 ) So they can be properly link checked	2024-03-14 16:22:56 +00:00
Erick Friis	2999d06938	docs: deprecate old airbyte loader docs (#19048 )	2024-03-13 23:18:30 +00:00
Prakul	4c53e31377	docs: Updated index definition and reference to LangChain-MongoDB (#19047 ) Description: Updates to LangChain-MongoDB documentation: updates to the Atlas vector search index definition Issue: NA Dependencies: NA Twitter handle: iprakul	2024-03-13 15:44:13 -07:00
Tomaz Bratanic	e5e15c8d59	docs: Add graph construction docs (#18904 )	2024-03-13 12:27:58 -07:00
Max Jakob	911ccf9aa6	docs: elasticsearch retriever (#18965 ) Add documentation notebook for `ElasticsearchRetriever`. ## Dependencies - [ ] Release new `langchain-elasticsearch` version 0.2.0 that includes `ElasticsearchRetriever`	2024-03-12 09:42:36 -07:00
Tymofii	0bec1f6877	commnity[patch]: refactor code for faiss vectorstore, update faiss vectorstore documentation (#18092 ) Description: Refactor code of FAISS vectorcstore and update the related documentation. Details: - replace `.format()` with f-strings for strings formatting; - refactor definition of a filtering function to make code more readable and more flexible; - slightly improve efficiency of `max_marginal_relevance_search_with_score_by_vector` method by removing unnecessary looping over the same elements; - slightly improve efficiency of `delete` method by using set data structure for checking if the element was already deleted; Issue: fix small inconsistency in the documentation (the old example was incorrect and unappliable to faiss vectorstore) Dependencies: basic langchain-community dependencies and `faiss` (for CPU or for GPU) Twitter handle: antonenkodev	2024-03-11 22:33:03 -07:00
Bagatur	e0e688a277	core[minor]: generation info on msg (#18592 ) related to #16403 #17188	2024-03-12 04:43:17 +00:00
Leonid Ganeline	fad308a764	docs: `providers` update 2 (#18407 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-03-11 18:35:37 -07:00
Brace Sproul	578e67c017	docs[patch]: properly load/use env vars (#18942 )	2024-03-11 15:38:05 -07:00
Brace Sproul	4ff6aa5c78	docs[minor]: Swap gtag for supabase (#18937 ) Added deps: - `@supabase/supabase-js` - for sending inserts - `supabase` - dev dep, for generating types via cli - `dotenv` for loading env vars Added script: - `yarn gen` - will auto generate the database schema types using the supabase CLI. Not necessary for development, but is useful. Requires authing with the supabase CLI (will error out w/ instructions if you're not authed). Added functionality: - pulls users IP address (using a free endpoint: `https://api.ipify.org` so we can filter out abuse down the line) TODO: - [x] add env vars to vercel	2024-03-11 14:23:12 -07:00
fjk	a7fc731720	docs: change sparkllm spark_app_url to spark_api_url (#18000 ) community: fix - change sparkllm spark_app_url to spark_api_url - Description: - Change the variable name from `sparkllm spark_app_url` to `spark_api_url` in the community package. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-11 20:01:30 +00:00
Sevin F. Varoglu	8639624d40	docs: update OctoAI doc (#18913 ) This PR updates the OctoAI LLM doc.	2024-03-11 13:01:10 -07:00
Alexander Kozlov	a7500ab0fb	docs: Update huggingface pipelines notebook (#18801 )	2024-03-11 20:00:31 +00:00
Conroy Whitney	96d7fe0f85	docs: Change saved/configured chain variable name (#18863 ) Description: Variable name was `openai_poem` but it didn't pass in the `"prompt": "poem"` config, so the examples were showing a joke being returned from a variable called `_poem`. We could have gone one of two ways: 1. Updating the config line and the output line, or 2. Updating the variable name The latter seemed simpler, so that's what I went with. But I'd be glad to re-do this PR if you prefer the former. Thanks for everything, y'all. You rock 🤘 Issue:* N/A Dependencies: N/A Twitter handle: `conroywhitney`	2024-03-11 12:59:24 -07:00
Virat Singh	cafffe8a21	community: Add PolygonAggregates tool (#18882 ) Description: In this PR, I am adding a `PolygonAggregates` tool, which can be used to get historical stock price data (called aggregates by Polygon) for a given ticker. Polygon [docs](https://polygon.io/docs/stocks/get_v2_aggs_ticker__stocksticker__range__multiplier___timespan___from___to) for this endpoint. Twitter: [@virattt](https://twitter.com/virattt)	2024-03-11 11:58:10 -07:00
Bagatur	34284c25d4	docs: turn on link check (#18924 )	2024-03-11 10:50:39 -07:00
Mohammad Mohtashim	43db4cd20e	core[major]: On Tool End Observation Casting Fix (#18798 ) This PR updates the on_tool_end handlers to return the raw output from the tool instead of casting it to a string. This is technically a breaking change, though it's impact is expected to be somewhat minimal. It will fix behavior in `astream_events` as well. Fixes the following issue #18760 raised by @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-11 10:59:04 -04:00
Prashanth Rao	a96a6e0f2c	docs: Fix typo and add KùzuDB to graphs docs (#18915 ) - Description: Adding Kùzu (an embedded graph DB that uses Cypher) to the graph docs, and fixing a typo - Issue: docs update	2024-03-11 14:42:46 +00:00
aditya thomas	3d15498612	docs: Update callbacks documentation (#18899 ) Description: Update callbacks documentation Issue: Change some module imports and a method invocation to reflect the current LangChainAPI Dependencies: None	2024-03-11 10:40:11 -04:00
Leonid Ganeline	dee256ef5a	docs: `platforms/google` fixed broken links (#18878 ) Several links are broken. Fixed them.	2024-03-10 18:19:43 -07:00
Kushagra	5fcbe9dd2a	community[patch]: documented the feature to filter documents in MongoDBloader (#18842 ) "community[docs]: documented the feature to filter documents in MongoDBloader" - Description: documented the feature to filter documents in MongoDBloader - Feature: the feature https://github.com/langchain-ai/langchain/discussions/18251 - Dependencies: No - Twitter handle: https://twitter.com/im_Kushagra	2024-03-09 13:41:34 -08:00
Ikko Eltociear Ashimine	c3580d3c64	docs: fix typo in google_cloud_sql_mysql.ipynb (#18847 ) arbitary -> arbitrary	2024-03-09 13:39:36 -08:00
Luan Fernandes	5a006f7264	docs: update typo in docs about agent tools (#18850 ) fixes #18849	2024-03-09 13:39:18 -08:00
Leonid Ganeline	3dabd3f214	docs: platform pages update (#17836 ) `Integrations` platform page ToC-s: sections there are placed without order. For example, the [google](https://python.langchain.com/docs/integrations/platforms/google) page. The `LLM` section is not the first section, as it is in the [Components](https://python.langchain.com/docs/integrations/components) menu. Updates: * reorganized the page sections so they follow the Component menu order. * fixed names for the section names: "Text Embedding Models" -> "Embedding Models"	2024-03-09 13:34:33 -08:00
Leonid Ganeline	07c518ad3e	docs: `providers` update 4 (#18540 ) Created the `facebook` page from `facebook_faiss` and `facebook_chat` pages. Added another Facebook integrations into this page. Updated `discord` page.	2024-03-09 13:30:48 -08:00
Leonid Ganeline	9c0f84ae95	docs: `providers` update 6 (#18610 ) Cleaned up the `Integrations/Components/Memory` navbar by shortening the page titles. Updated page titles and file names to consistent formats.	2024-03-09 13:29:44 -08:00
Tomaz Bratanic	e778d60aec	Fix broken link in graph docs (#18837 )	2024-03-09 10:40:33 -08:00
Leonid Ganeline	5d65b47e41	docs: chat menu item as icon (#18806 ) Update chat icon in docs	2024-03-08 21:00:21 -05:00
Luis Antonio Vieira Junior	67c880af74	community[patch]: adding linearization config to AmazonTextractPDFLoader (#17489 ) - Description: Adding an optional parameter `linearization_config` to the `AmazonTextractPDFLoader` so the caller can define how the output will be linearized, instead of forcing a predefined set of linearization configs. It will still have a default configuration as this will be an optional parameter. - Issue: #17457 - Dependencies: The same ones that already exist for `AmazonTextractPDFLoader` - Twitter handle: [@lvieirajr19](https://twitter.com/lvieirajr19) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 17:25:22 -08:00
Erick Friis	4f4300723b	docs: pinecone client version note (#17491 )	2024-03-08 17:09:17 -08:00
AtomicVar	23e62f8f8d	docs: fix lists display issue (#17911 ) Description: Fix lists display issues in Docs > Use Cases > Q&A with RAG > Quickstart. In essence, this PR changes: ```markdown Some paragraph. - Item a. - Item b. ``` to: ```markdown Some paragraph. - Item a. - Item b. ``` There needs an extra empty line to make the list rendered properly. FYI, the old version is displayed not properly as: <img width="856" alt="image" src="https://github.com/langchain-ai/langchain/assets/22856433/65202577-8ea2-47c6-b310-39bf42796fac"> - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:52:16 -08:00
Brace Sproul	9c218d0154	docs[patch]: Update how GA4 is collected (#18821 ) There's some issue/setting with the current python GA4 app. I created a new one just for feedback.	2024-03-08 14:32:40 -08:00
Ishani Vyas	2b0cbd65ba	community[patch]: Add Passio Nutrition AI Food Search Tool to Community Package (#18278 ) ## Add Passio Nutrition AI Food Search Tool to Community Package ### Description We propose adding a new tool to the `community` package, enabling integration with Passio Nutrition AI for food search functionality. This tool will provide a simple interface for retrieving nutrition facts through the Passio Nutrition AI API, simplifying user access to nutrition data based on food search queries. ### Implementation Details - Class Structure: Implement `NutritionAI`, extending `BaseTool`. It includes an `_run` method that accepts a query string and, optionally, a `CallbackManagerForToolRun`. - API Integration: Use `NutritionAIAPI` for the API wrapper, encapsulating all interactions with the Passio Nutrition AI and providing a clean API interface. - Error Handling: Implement comprehensive error handling for API request failures. ### Expected Outcome - User Benefits: Enable easy querying of nutrition facts from Passio Nutrition AI, enhancing the utility of the `langchain_community` package for nutrition-related projects. - Functionality: Provide a straightforward method for integrating nutrition information retrieval into users' applications. ### Dependencies - `langchain_core` for base tooling support - `pydantic` for data validation and settings management - Consider `requests` or another HTTP client library if not covered by `NutritionAIAPI`. ### Tests and Documentation - Unit Tests: Include tests that mock network interactions to ensure tool reliability without external API dependency. - Documentation: Create an example notebook in `docs/docs/integrations/tools/passio_nutrition_ai.ipynb` showing usage, setup, and example queries. ### Contribution Guidelines Compliance - Adhere to the project's linting and formatting standards (`make format`, `make lint`, `make test`). - Ensure compliance with LangChain's contribution guidelines, particularly around dependency management and package modifications. ### Additional Notes - Aim for the tool to be a lightweight, focused addition, not introducing significant new dependencies or complexity. - Potential future enhancements could include caching for common queries to improve performance. ### Twitter Handle - Here is our Passio AI [twitter handle](https://twitter.com/@passio_ai) where we announce our products. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-08 20:33:22 +00:00
Aaron Jimenez	bd9f98a20b	docs: Fix typo in modules/chains.ipynb (#18808 ) Description: Fix a minor typo in `modules/chains.ipynb`. - Issue: fixes #17851	2024-03-08 12:09:20 -08:00
Tomaz Bratanic	c0bdd4d45b	docs: Add main graph documentation (#18021 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 20:03:03 +00:00
Leonid Ganeline	7c8c4e5743	docs: `providers` update 7 (#18620 ) Added missed providers. Added missed integrations. Formatted to the consistent form. Fixed outdated imports.	2024-03-08 12:00:27 -08:00
kAIto47802	ff70cc4e80	docs: fix typo (#18810 ) Fixed typo in docs	2024-03-08 13:28:17 -05:00
Leonid Ganeline	3624f56ccb	docs: update imports of `retrievers` to use `langchain_community` (#18707 ) Updated `langchain` imports to `langchain_community`.	2024-03-08 13:04:38 -05:00
Leonid Ganeline	48eed86931	docs: update imports of `memory` to use `langchain_community` (#18689 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-08 13:02:31 -05:00
aditya thomas	a35203b164	docs: (minor) update to anthropic doc (#18794 ) Description: Minor update to Anthropic documentation Issue: Not applicable Dependencies: None Lint and test: `make format` and `make lint` was done	2024-03-08 09:48:04 -08:00
Paul Sanders	93b87f2bfb	docs: Fix typo (#18545 ) Fixing a minor typo in the package name. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-07 19:40:42 -08:00
Aaron Jimenez	fcf6213c22	docs: Fix link to HF TEI in text_embeddings_inference.ipynb (#18682 ) - [ ] PR title: docs: Fix link to HF TEI in text_embeddings_inference.ipynb - [ ] PR message: - Description: Fix the link to [Hugging Face Text Embeddings Inference (TEI)](https://huggingface.co/docs/text-embeddings-inference/index) in text_embeddings_inference.ipynb - Issue: Fix #18576	2024-03-07 19:38:39 -08:00
Averi Kitsch	8accee57a9	docs: update Google Cloud database integration docs (#18711 ) Description: update Google Cloud database integration docs Issue: NA Dependencies: NA	2024-03-07 19:36:00 -08:00
Ian	7f504c1f81	docs: Improve the tidb vector store notebook (#18773 ) Remove redundant useless content, and fix some minor oversight	2024-03-07 19:15:55 -08:00
Yunmo Koo	fee6f983ef	community[minor]: Integration for `Friendli` LLM and `ChatFriendli` ChatModel. (#17913 ) ## Description - Add [Friendli](https://friendli.ai/) integration for `Friendli` LLM and `ChatFriendli` chat model. - Unit tests and integration tests corresponding to this change are added. - Documentations corresponding to this change are added. ## Dependencies - Optional dependency [`friendli-client`](https://pypi.org/project/friendli-client/) package is added only for those who use `Frienldi` or `ChatFriendli` model. ## Twitter handle - https://twitter.com/friendliai	2024-03-08 02:20:47 +00:00
Ian	390ef6abe3	community[minor]: Add Initial Support for TiDB Vector Store (#15796 ) This pull request introduces initial support for the TiDB vector store. The current version is basic, laying the foundation for the vector store integration. While this implementation provides the essential features, we plan to expand and improve the TiDB vector store support with additional enhancements in future updates. Upcoming Enhancements: * Support for Vector Index Creation: To enhance the efficiency and performance of the vector store. * Support for max marginal relevance search. * Customized Table Structure Support: Recognizing the need for flexibility, we plan for more tailored and efficient data store solutions. Simple use case exmaple ```python from typing import List, Tuple from langchain.docstore.document import Document from langchain_community.vectorstores import TiDBVectorStore from langchain_openai import OpenAIEmbeddings db = TiDBVectorStore.from_texts( embedding=embeddings, texts=['Andrew like eating oranges', 'Alexandra is from England', 'Ketanji Brown Jackson is a judge'], table_name="tidb_vector_langchain", connection_string=tidb_connection_url, distance_strategy="cosine", ) query = "Can you tell me about Alexandra?" docs_with_score: List[Tuple[Document, float]] = db.similarity_search_with_score(query) for doc, score in docs_with_score: print("-" * 80) print("Score: ", score) print(doc.page_content) print("-" * 80) ```	2024-03-07 17:18:20 -08:00
Eugene Yurtsev	1e1cac50d8	Docs: remove sales from security (#18762 ) Remove sales from security	2024-03-07 17:35:46 -05:00
Eugene Yurtsev	ca299a8e08	Docs: Add custom parsing documentation and extending langchain (#18331 ) * Added extending langchain.mdx -- we'll need to add links as we add more custom documentation * Added partial documentation about parsers	2024-03-07 16:30:57 -05:00
Leonid Ganeline	dad949eb99	docs: update imports of `adapters` to use langchain_community (#18751 ) Updated imports from `langchain` to `langchain_community`	2024-03-07 15:04:25 -05:00
Leonid Ganeline	1af2130ff7	docs: update imports of tools to use langchain_community (#18705 ) Updated imports from `langchain` to `langchain_community`.	2024-03-07 11:46:09 -05:00
Sam Khano	1b4dcf22f3	community[minor]: Add DocumentDBVectorSearch VectorStore (#17757 ) Description: - Added Amazon DocumentDB Vector Search integration (HNSW index) - Added integration tests - Updated AWS documentation with DocumentDB Vector Search instructions - Added notebook for DocumentDB integration with example usage --------- Co-authored-by: EC2 Default User <ec2-user@ip-172-31-95-226.ec2.internal>	2024-03-06 15:11:34 -08:00
Vittorio Rigamonti	51f3902bc4	community[minor]: Adding support for Infinispan as VectorStore (#17861 ) Description: This integrates Infinispan as a vectorstore. Infinispan is an open-source key-value data grid, it can work as single node as well as distributed. Vector search is supported since release 15.x For more: [Infinispan Home](https://infinispan.org) Integration tests are provided as well as a demo notebook	2024-03-06 15:11:02 -08:00
Max Jakob	cca0167917	elasticsearch[patch], community[patch]: update references, deprecate community classes (#18506 ) Follow up on https://github.com/langchain-ai/langchain/pull/17467. - Update all references to the Elasticsearch classes to use the partners package. - Deprecate community classes. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-06 15:09:12 -08:00
Djordje	12b4a4d860	community[patch]: Opensearch delete method added - indexing supported (#18522 ) - Description: Added delete method for OpenSearchVectorSearch, therefore indexing supported - Issue: No - Dependencies: No - Twitter handle: stkbmf	2024-03-06 15:08:47 -08:00
Leonid Ganeline	81cbf0f2fd	docs: update import paths for callbacks to use langchain_community callbacks where applicable (#18691 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-06 14:49:06 -05:00
Leonid Ganeline	fb686333ac	docs: fix `streamlit` provider (#18606 ) There is a wrong python package import. Fixed it.	2024-03-06 11:42:26 -08:00
aditya thomas	97de498d39	docs: update to the streaming tutorial notebook in the lcel documentation (#18378 ) Description: Update to the streaming tutorial notebook in the LCEL documentation Issue: Fixed an import and (minor) changes in documentation language Dependencies: None	2024-03-06 10:47:22 -08:00
Guangdong Liu	32db9e74e4	docs: Fix some issues with sparkllm use cases (#17674 )	2024-03-06 10:46:51 -08:00
Eugene Yurtsev	b9f3c7a0c9	Use Case: Extraction set temperature to 0, qualify a statement (#18672 ) Minor changes: 1) Set temperature to 0 (important) 2) Better qualify one of the statements with confidence	2024-03-06 12:35:45 -05:00
Eugene Yurtsev	a4a6978224	Docs: Revamp Extraction Use Case (#18588 ) Revamp the extraction use case documentation --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-06 09:18:25 -05:00
Leonid Ganeline	114d64d4a7	docs: `providers` update (#18527 ) Added missed pages. Added links and descriptions. Foratted to the consistent form.	2024-03-05 17:32:59 -08:00
PSV	d7dd3cd248	docs: structured_output (#18608 ) - Description: Fixed some typos and copy errors in the Beta Structured Output docs - Issue: N/A - Dependencies: Docs only - Twitter handle: @psvann Co-authored-by: P.S. Vann <psvann@yahoo.com>	2024-03-05 17:20:06 -08:00
Bagatur	29f1619d61	docs: why lcel nit (#18616 )	2024-03-05 17:10:47 -08:00
Bagatur	080904689c	docs: text splitters install (#18589 )	2024-03-05 16:19:37 -08:00
Sunchao Wang	dc81dba6cf	community[patch]: Improve amadeus tool and doc (#18509 ) Description: This pull request addresses two key improvements to the langchain repository: Fix for Crash in Flight Search Interface: Previously, the code would crash when encountering a failure scenario in the flight ticket search interface. This PR resolves this issue by implementing a fix to handle such scenarios gracefully. Now, the code handles failures in the flight search interface without crashing, ensuring smoother operation. Documentation Update for Amadeus Toolkit: Prior to this update, examples provided in the documentation for the Amadeus Toolkit were unable to run correctly due to outdated information. This PR includes an update to the documentation, ensuring that all examples can now be executed successfully. With this update, users can effectively utilize the Amadeus Toolkit with accurate and functioning examples. These changes aim to enhance the reliability and usability of the langchain repository by addressing issues related to error handling and ensuring that documentation remains up-to-date and actionable. Issue: https://github.com/langchain-ai/langchain/issues/17375 Twitter Handle: SingletonYxx	2024-03-05 16:17:22 -08:00
Utkarsh Kapil	539a13dbda	docs: minor spelling errors (#18429 ) Description: Noticed spelling errors. 'Colab' mispelt as 'Collab'. https://python.langchain.com/docs/use_cases Dependencies: n/a	2024-03-05 15:54:15 -08:00
Dounx	ad48f55357	community[minor]: add Yuque document loader (#17924 ) This pull request support loading documents from Yuque with Langchain. Yuque is a professional cloud-based knowledge base for team collaboration in documentation. Website: https://www.yuque.com OpenAPI: https://www.yuque.com/yuque/developer/openapi	2024-03-05 15:54:07 -08:00
Kazuki Maeda	60c5d964a8	community[minor]: use jq schema for content_key in json_loader (#18003 ) ### Description Changed the value specified for `content_key` in JSONLoader from a single key to a value based on jq schema. I created [similar PR](https://github.com/langchain-ai/langchain/pull/11255) before, but it has several conflicts because of the architectural change associated stable version release, so I re-create this PR to fit new architecture. ### Why For json data like the following, specify `.data[].attributes.message` for page_content and `.data[].attributes.id` or `.data[].attributes.attributes. tags`, etc., the `content_key` must also parse the json structure. <details> <summary>sample json data</summary> ```json { "data": [ { "attributes": { "message": "message1", "tags": [ "tag1" ] }, "id": "1" }, { "attributes": { "message": "message2", "tags": [ "tag2" ] }, "id": "2" } ] } ``` </details> <details> <summary>sample code</summary> ```python def metadata_func(record: dict, metadata: dict) -> dict: metadata["source"] = None metadata["id"] = record.get("id") metadata["tags"] = record["attributes"].get("tags") return metadata sample_file = "sample1.json" loader = JSONLoader( file_path=sample_file, jq_schema=".data[]", content_key=".attributes.message", ## content_key is parsable into jq schema is_content_key_jq_parsable=True, ## this is added parameter metadata_func=metadata_func ) data = loader.load() data ``` </details> ### Dependencies none ### Twitter handle [kzk_maeda](https://twitter.com/kzk_maeda)	2024-03-05 15:51:24 -08:00
Rodrigo Nogueira	f4bb33bbf3	docs: fix link and missing package (#18405 ) Issue: fix broken links and missing package on colab example	2024-03-05 15:50:06 -08:00
Max Jakob	81e9ab6e3a	docs: Update elasticsearch README (#18497 ) Update Elasticsearch README with information on how to start a deployment. Also make some cosmetic changes to the [Elasticsearch docs](https://python.langchain.com/docs/integrations/vectorstores/elasticsearch). Follow up on https://github.com/langchain-ai/langchain/pull/17467	2024-03-05 15:49:16 -08:00
Hech	6a08134661	community[patch], langchain[minor]: Add retriever self_query and score_threshold in DingoDB (#18106 )	2024-03-05 15:47:29 -08:00
Bagatur	1569b19191	docs: query analysis links (#18614 )	2024-03-05 15:05:44 -08:00
Asaf Joseph Gardin	27441555d0	ai21[patch]: AI21 Labs Contextual Answers support (#18270 ) Description: Added support for AI21 Labs model - Contextual Answers Dependencies: ai21, ai21-tokenizer Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 22:42:04 +00:00
Leonid Ganeline	bd4993141d	docs: `providers` update 5 (#18550 ) Added missed sections. Added descriptions.	2024-03-05 12:55:13 -08:00
Reuben Zotz-Wilson	96cd50938a	community:update telegram notebook (#18569 ) Description: modified the user_name to username to conform with the expected inputs to TelegramChatApiLoader Issue: Current code fails in langchain-community 0.0.24 <loader = TelegramChatApiLoader( chat_entity="<CHAT_URL>", # recommended to use Entity here api_hash="<API HASH >", api_id="<API_ID>", user_name="", # needed only for caching the session. )>	2024-03-05 11:47:17 -08:00
Jib	9da1e0cf34	mongodb[patch]: Migrate MongoDBChatMessageHistory (#18590 ) ## Description Migrate the `MongoDBChatMessageHistory` to the managed `langchain-mongodb` partner-package ## Dependencies None ## Twitter handle @mongodb ## tests and docs - [x] Migrate existing integration test - [x ]~ Convert existing integration test to a unit test~ Creation is out of scope for this ticket - [x ] ~Considering delaying work until #17470 merges to leverage the `MockCollection` object. ~ - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 18:53:02 +00:00
Jib	f92f7d2e03	mongodb[minor]: Add MongoDB LLM Cache (#17470 ) # Description - Description: Adding MongoDB LLM Caching Layer abstraction - Issue: N/A - Dependencies: None - Twitter handle: @mongodb Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR Message (above) - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @efriis, @eyurtsev, @hwchase17. --------- Co-authored-by: Jib <jib@byblack.us>	2024-03-05 10:38:39 -08:00
Erick Friis	07f23c2d45	docs: anthropic multimodal (#18586 )	2024-03-05 16:58:06 +00:00
Erick Friis	4ac2cb4adc	anthropic[minor]: add tool calling (#18554 )	2024-03-05 08:30:16 -08:00
Brace Sproul	328a498a78	docs[minor]: Add thumbs up/down to all docs pages (#18526 )	2024-03-04 15:14:28 -08:00
Erick Friis	10874d5002	docs: update stack graphic (#18532 )	2024-03-04 23:07:28 +00:00
aditya thomas	7803b973c7	docs: update documentation of stackexchange component (#18486 ) Description: Update documentation of the StackExchange component Issue: None Dependencies: None	2024-03-04 10:45:29 -08:00
Martin Kolb	63702a2044	docs: Improved notebook for vector store "HANA Cloud" (#18496 ) - Description: This PR fixes some issues in the Jupyter notebook for the VectorStore "SAP HANA Cloud Vector Engine": * Slight textual adaptations * Fix of wrong column name VEC_META (was: VEC_METADATA) - Issue: N/A - Dependencies: no new dependecies added - Twitter handle: @sapopensource path to notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb`	2024-03-04 10:44:16 -08:00
Bagatur	1c1a3a7415	docs: quickstart models (#18511 )	2024-03-04 08:33:19 -08:00
aditya thomas	a727eec6ed	docs: add groq to list of providers (#18503 ) Description: Add Groq to the list of providers Issue: None Dependencies: None	2024-03-04 08:20:40 -08:00
Erick Friis	24f9c700f2	anthropic[minor]: claude 3 (#18508 )	2024-03-04 15:03:51 +00:00
William De Vena	172499404a	Docs: Updated callbacks/index.mdx adding example on invoke method (#18403 ) ## PR title Docs: Updated callbacks/index.mdx adding example on runnable methods ## PR message - Description: Updated callbacks/index.mdx adding an example on how to pass callbacks to the runnable methods (invoke, batch, ...) - Issue: #16379 - Dependencies: None	2024-03-04 09:11:48 -05:00
Jacob Lee	de2d9447c6	👥 Update LangChain people data (#18473 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-03-03 19:58:58 -08:00
William FH	1cdb813196	Improve notebook wording (#18472 )	2024-03-03 18:31:15 -08:00
William FH	55b69d5ad1	Update Notebook Image (#18470 )	2024-03-03 17:22:59 -08:00
Harrison Chase	73d653324f	[Evals] Session-level feedback (#18463 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-03-03 17:18:29 -08:00
Scott Nath	b051bba1a9	community: Add you.com tool, add async to retriever, add async testing, add You tool doc (#18032 ) - Description: finishes adding the you.com functionality including: - add async functions to utility and retriever - add the You.com Tool - add async testing for utility, retriever, and tool - add a tool integration notebook page - Dependencies: any dependencies required for this change - Twitter handle: @scottnath	2024-03-03 14:30:05 -08:00
Harrison Chase	7ce2f32c64	improve query analysis docs (#18426 )	2024-03-03 14:24:33 -08:00
Aayush Kataria	7c2f3f6f95	community[minor]: Adding Azure Cosmos Mongo vCore Vector DB Cache (#16856 ) Description: This pull request introduces several enhancements for Azure Cosmos Vector DB, primarily focused on improving caching and search capabilities using Azure Cosmos MongoDB vCore Vector DB. Here's a summary of the changes: - AzureCosmosDBSemanticCache: Added a new cache implementation called AzureCosmosDBSemanticCache, which utilizes Azure Cosmos MongoDB vCore Vector DB for efficient caching of semantic data. Added comprehensive test cases for AzureCosmosDBSemanticCache to ensure its correctness and robustness. These tests cover various scenarios and edge cases to validate the cache's behavior. - HNSW Vector Search: Added HNSW vector search functionality in the CosmosDB Vector Search module. This enhancement enables more efficient and accurate vector searches by utilizing the HNSW (Hierarchical Navigable Small World) algorithm. Added corresponding test cases to validate the HNSW vector search functionality in both AzureCosmosDBSemanticCache and AzureCosmosDBVectorSearch. These tests ensure the correctness and performance of the HNSW search algorithm. - LLM Caching Notebook - The notebook now includes a comprehensive example showcasing the usage of the AzureCosmosDBSemanticCache. This example highlights how the cache can be employed to efficiently store and retrieve semantic data. Additionally, the example provides default values for all parameters used within the AzureCosmosDBSemanticCache, ensuring clarity and ease of understanding for users who are new to the cache implementation. @hwchase17,@baskaryan, @eyurtsev,	2024-03-03 14:04:15 -08:00
Bagatur	db47b5deee	docs: anthropic quickstart (#18440 )	2024-03-03 13:59:28 -08:00
Bagatur	74f3908182	docs: anthropic qa quickstart (#18459 )	2024-03-03 13:33:24 -08:00
Harrison Chase	bc768a12ed	more query analysis docs (#18358 )	2024-03-02 08:44:22 -08:00
Erick Friis	9fda6ac7e6	docs: stop copying source (#18404 )	2024-03-01 13:57:53 -08:00
Kate Silverstein	b7c71e2e07	community[minor]: llamafile embeddings support (#17976 ) * Description: adds `LlamafileEmbeddings` class implementation for generating embeddings using [llamafile](https://github.com/Mozilla-Ocho/llamafile)-based models. Includes related unit tests and notebook showing example usage. * Issue: N/A * Dependencies: N/A	2024-03-01 13:49:18 -08:00
Massimiliano Pronesti	c3c987dd70	docs: update Azure OpenAI to v1 and langchain API to 0.1 (#18005 ) Description: Updated Azure OpenAI docs to OpenAI API v1 and LLM invocation to langchain 0.1	2024-03-01 13:47:00 -08:00
Kate Silverstein	c9153a3fd4	docs: add llamafile info to 'Local LLMs' guides (#18049 ) - Description: add information about [llamafile](https://github.com/Mozilla-Ocho/llamafile) (setup, example usage) to ['Run LLMs locally'](https://python.langchain.com/docs/guides/local_llms) and ['Using local models for Q&A with RAG'](https://python.langchain.com/docs/use_cases/question_answering/local_retrieval_qa) guides. - Issue: N/A - Dependencies: N/A	2024-03-01 12:44:31 -08:00
aditya thomas	e6e60e2492	docs: ChatOpenAI update module import path and calling method (#18169 ) Description: (a) Update to the module import path to reflect the splitting up of langchain into separate packages (b) Update to the documentation to include the new calling method (invoke)	2024-03-01 12:32:20 -08:00
Ryan Meinzer	d883fd4a37	docs: Correct WebBaseLoader URL: docs: python.langchain.com/docs/get_started/quickstartQuickstart (#17981 ) Description: The URL of the data to index, specified to `WebBaseLoader` to import is incorrect, causing the `langsmith_search` retriever to return a `404: NOT_FOUND`. Incorrect URL: https://docs.smith.langchain.com/overview Correct URL: https://docs.smith.langchain.com Issue: This commit corrects the URL and prevents the LangServe Playground from returning an error from its inability to use the retriever when inquiring, "how can langsmith help with testing?". Dependencies: None. Twitter Handle: @ryanmeinzer	2024-03-01 12:21:53 -08:00
Petteri Johansson	6c1989d292	community[minor], langchain[minor], docs: Gremlin Graph Store and QA Chain (#17683 ) - Description: New feature: Gremlin graph-store and QA chain (including docs). Compatible with Azure CosmosDB. - Dependencies: no changes	2024-03-01 12:21:14 -08:00
Ather Fawaz	a5ccf5d33c	community[minor]: Add support for Perplexity chat model(#17024 ) - Description: This PR adds support for [Perplexity AI APIs](https://blog.perplexity.ai/blog/introducing-pplx-api). - Issues: None - Dependencies: None - Twitter handle: [@atherfawaz](https://twitter.com/AtherFawaz) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 12:19:23 -08:00
Rodrigo Nogueira	3438d2cbcc	community[minor]: add maritalk chat (#17675 ) Description: Adds the MariTalk chat that is based on a LLM specially trained for Portuguese. Twitter handle: @MaritacaAI	2024-03-01 12:18:23 -08:00
老阿張	1701f7b8e9	docs: Fix typo in baidu_qianfan_endpoint.ipynb & baidu_qianfan_endpoint.ipynb (#18176 ) Description: "sucessfully should be successfully "? 🤔 Issue: Typo Dependencies: Nope Twitter handle: laoazhang	2024-03-01 12:10:23 -08:00
Ikko Eltociear Ashimine	31b4e78174	docs: fix typo in milvus.ipynb (#18373 ) retreival -> retrieval	2024-03-01 11:22:39 -08:00
Tabby	dd6f85caf1	docs: Update Google El Carro for Oracle Workload Documentation. (#18394 ) In this commit we update the documentation for Google El Carro for Oracle Workloads. We amend the documentation in the Google Providers page to use the correct name which is El Carro for Oracle Workloads. We also add changes to the document_loaders and memory pages to reflect changes we made in our repo.	2024-03-01 11:21:35 -08:00
Leonid Ganeline	d937fa4f9c	docs: `Tutorials` update (#18230 ) A big update of the `Tutorials` page. Cleaned it up. Added several new resources.	2024-03-01 11:07:39 -08:00
Yujie Qian	cbb65741a7	community[patch]: Voyage AI updates default model and batch size (#17655 ) - Description: update the default model and batch size in VoyageEmbeddings - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: fodizoltan <zoltan@conway.expert>	2024-03-01 10:22:24 -08:00
Shengsheng Huang	ae471a7dcb	community[minor]: add BigDL-LLM integrations (#17953 ) - Description: [`bigdl-llm`](https://github.com/intel-analytics/BigDL) is a library for running LLM on Intel XPU (from Laptop to GPU to Cloud) using INT4/FP4/INT8/FP8 with very low latency (for any PyTorch model). This PR adds bigdl-llm integrations to langchain. - Issue: NA - Dependencies: `bigdl-llm` library - Contribution maintainer: @shane-huang Examples added: - docs/docs/integrations/llms/bigdl.ipynb	2024-03-01 10:04:53 -08:00
Ethan Yang	f61cb8d407	community[minor]: Add openvino backend support (#11591 ) - Description: add openvino backend support by HuggingFace Optimum Intel, - Dependencies: “optimum[openvino]”, --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-01 10:04:24 -08:00
Leonid Ganeline	6d0af4e805	docs: nvidia: provider page update (#18054 ) Nvidia provider page is missing a Triton Inference Server package reference. Changes: - added the Triton Inference Server reference - copied the example notebook from the package into the doc files. - added the Triton Inference Server description and links, the link to the above example notebook - formatted page to the consistent format NOTE: It seems that the [example notebook](https://github.com/langchain-ai/langchain/blob/master/libs/partners/nvidia-trt/docs/llms.ipynb) was originally created in wrong place. It should be in the LangChain docs [here](https://github.com/langchain-ai/langchain/tree/master/docs/docs/integrations/llms). So, I've created a copy of this example. The original example is still in the nvidia-trt package.	2024-03-01 10:00:42 -08:00
Jacob Lee	590d47bff4	docs[patch]: Add Neo4j GraphAcademy to tutorials section (#18353 )	2024-02-29 20:50:24 -07:00
Bagatur	4730ee2766	docs: update api ref nav (#18362 )	2024-02-29 19:04:56 -08:00
Bagatur	12f19b8a6a	infra: update create_api_rst (#18361 )	2024-02-29 19:04:44 -08:00
Bagatur	5efb5c099f	text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346 )	2024-02-29 18:33:21 -08:00
Erick Friis	bce0684327	docs: airbyte deps note (#18243 )	2024-02-29 16:02:13 -08:00
Jib	72bfc1d3db	mongodb[minor]: MongoDB Partner Package -- Porting MongoDBAtlasVectorSearch (#17652 ) This PR migrates the existing MongoDBAtlasVectorSearch abstraction from the `langchain_community` section to the partners package section of the codebase. - [x] Run the partner package script as advised in the partner-packages documentation. - [x] Add Unit Tests - [x] Migrate Integration Tests - [x] Refactor `MongoDBAtlasVectorStore` (autogenerated) to `MongoDBAtlasVectorSearch` - [x] ~Remove~ deprecate the old `langchain_community` VectorStore references. ## Additional Callouts - Implemented the `delete` method - Included any missing async function implementations - `amax_marginal_relevance_search_by_vector` - `adelete` - Added new Unit Tests that test for functionality of `MongoDBVectorSearch` methods - Removed [`del res[self._embedding_key]`](`e0c81e1cb0/libs/community/langchain_community/vectorstores/mongodb_atlas.py (L218)`) in `_similarity_search_with_score` function as it would make the `maximal_marginal_relevance` function fail otherwise. The `Document` needs to store the embedding key in metadata to work. Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [x] Add tests and docs: If you're adding a new integration, please include 1. Existing tests supplied in docs/docs do not change. Updated docstrings for new functions like `delete` 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. (This already exists) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Steven Silvester <steven.silvester@ieee.org> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 23:09:48 +00:00
Bagatur	a6f0506aaf	docs: query analysis use case (#17766 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-29 12:33:49 -08:00
Filip Schouwenaars	4c62362eab	Add links to relevant DataCamp code alongs (#18332 ) This PR adds links to some more free resources for people to get acquainted with Langhchain without having to configure their system. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --> Co-authored-by: Filip Schouwenaars <filipsch@users.noreply.github.com>	2024-02-29 11:25:01 -08:00
Virat Singh	cd926ac3dd	community: Add PolygonFinancials Tool (#18324 ) Description: In this PR, I am adding a `PolygonFinancials` tool, which can be used to get financials data for a given ticker. The financials data is the fundamental data that is found in income statements, balance sheets, and cash flow statements of public US companies. Twitter: [@virattt](https://twitter.com/virattt)	2024-02-29 10:56:05 -08:00
Leonid Ganeline	d43fa2eab1	docs `providers` update (#18336 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-02-29 10:53:12 -08:00
Bagatur	6a5b084704	docs: update func calling doc (#18300 )	2024-02-29 09:45:07 -08:00
Averi Kitsch	1b63530274	docs: update Google documentation (#18297 ) Description: update Google documentation Issue: Dependencies:	2024-02-29 01:42:44 +00:00
Leonid Ganeline	1d865a7e86	docs: `google` provider page fixes (#18290 ) Several URL-s were broken (in the yesterday PR). Like [Integrations/platforms/google/Document Loaders](https://python.langchain.com/docs/integrations/platforms/google#document-loaders) page, Example link to "Document Loaders / Cloud SQL for PostgreSQL" and most of the new example links in the Document Loaders, Vectorstores, Memory sections. - fixed URL-s (manually verified all example links) - sorted sections in page to follow the "integrations/components" menu item order. - fixed several page titles to fix Navbar item order --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 00:45:03 +00:00
aditya thomas	eb0c178d75	docs: update to the list of partner packages in the list of providers (#18252 ) Description: Update to the list of partner packages in the list of providers Issue: Google & Nvidia had two entries each, both pointing to the same page Dependencies: None	2024-02-28 15:40:14 -08:00
ccurme	9bf58ec7dd	update extraction use-case docs (#17979 ) Update extraction use-case docs to showcase and explain all modes of `create_structured_output_runnable`.	2024-02-28 17:32:04 -05:00
kkdamowang	4899a72b56	docs: remove duplicate word in lcel/streaming (#18249 ) - Description: Remove duplicate word in lcel/streaming. - Issue: No. - Dependencies: No.	2024-02-28 21:50:26 +00:00
Eugene Yurtsev	cd52433ba0	community[minor]: Add `SQLDatabaseLoader` document loader (#18281 ) - Description: A generic document loader adapter for SQLAlchemy on top of LangChain's `SQLDatabaseLoader`. - Needed by: https://github.com/crate-workbench/langchain/pull/1 - Depends on: GH-16655 - Addressed to: @baskaryan, @cbornet, @eyurtsev Hi from CrateDB again, in the same spirit like GH-16243 and GH-16244, this patch breaks out another commit from https://github.com/crate-workbench/langchain/pull/1, in order to reduce the size of this patch before submitting it, and to separate concerns. To accompany the SQLAlchemy adapter implementation, the patch includes integration tests for both SQLite and PostgreSQL. Let me know if corresponding utility resources should be added at different spots. With kind regards, Andreas. ### Software Tests ```console docker compose --file libs/community/tests/integration_tests/document_loaders/docker-compose/postgresql.yml up ``` ```console cd libs/community pip install psycopg2-binary pytest -vvv tests/integration_tests -k sqldatabase ``` ``` 14 passed ``` ![image](https://github.com/langchain-ai/langchain/assets/453543/42be233c-eb37-4c76-a830-474276e01436) --------- Co-authored-by: Andreas Motl <andreas.motl@crate.io>	2024-02-28 21:02:28 +00:00
Jack Wotherspoon	92c34d4803	docs: update documentation for Google Cloud database integrations (#18265 ) Description: Fixing typos and rendering issues for Google Cloud database integrations. Issue: NA Dependencies: NA	2024-02-28 15:32:43 +00:00
Averi Kitsch	76eb553084	docs: add documentation for Google Cloud database integrations (#18225 ) Description: add documentation for Google Cloud database integrations Issue: NA Dependencies: NA	2024-02-27 21:17:30 -08:00
Erick Friis	be8d2ff5f7	airbyte[patch]: init pkg (#18236 )	2024-02-27 19:37:53 -08:00
Ayo Ayibiowu	ac1d7d9de8	community[feat]: Adds LLMLingua as a document compressor (#17711 ) Description: This PR adds support for using the [LLMLingua project ](https://github.com/microsoft/LLMLingua) especially the LongLLMLingua (Enhancing Large Language Model Inference via Prompt Compression) as a document compressor / transformer. The LLMLingua project is an interesting project that can greatly improve RAG system by compressing prompts and contexts while keeping their semantic relevance. Issue: https://github.com/microsoft/LLMLingua/issues/31 Dependencies: [llmlingua](https://pypi.org/project/llmlingua/) @baskaryan --------- Co-authored-by: Ayodeji Ayibiowu <ayodeji.ayibiowu@getinge.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-27 19:23:56 -08:00
Isaac Francisco	733367b795	docs: deprecation of OpenAI functions agent, astream_events docstring (#18164 ) Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-27 09:14:53 -08:00
Harrison Chase	b0ccaf5917	Harrison/add structured output (#18165 )	2024-02-27 08:25:09 -08:00
Max Jakob	5ab69f907f	partners: add Elasticsearch package (#17467 ) ### Description This PR moves the Elasticsearch classes to a partners package. Note that we will not move (and later remove) `ElasticKnnSearch`. It were previously deprecated. `ElasticVectorSearch` is going to stay in the community package since it is used quite a lot still. Also note that I left the `ElasticsearchTranslator` for self query untouched because it resides in main `langchain` package. ### Dependencies There will be another PR that updates the notebooks (potentially pulling them into the partners package) and templates and removes the classes from the community package, see https://github.com/langchain-ai/langchain/pull/17468 #### Open question How to make the transition smooth for users? Do we move the import aliases and require people to install `langchain-elasticsearch`? Or do we remove the import aliases from the `langchain` package all together? What has worked well for other partner packages? --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-26 23:19:47 +00:00
matt haigh	a4896da2a0	Experimental: Add other threshold types to SemanticChunker (#16807 ) Description Adding different threshold types to the semantic chunker. I’ve had much better and predictable performance when using standard deviations instead of percentiles. ![image](https://github.com/langchain-ai/langchain/assets/44395485/066e84a8-460e-4da5-9fa1-4ff79a1941c5) For all the documents I’ve tried, the distribution of distances look similar to the above: positively skewed normal distribution. All skews I’ve seen are less than 1 so that explains why standard deviations perform well, but I’ve included IQR if anyone wants something more robust. Also, using the percentile method backwards, you can declare the number of clusters and use semantic chunking to get an ‘optimal’ splitting. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 13:50:48 -08:00
am-kinetica	9b8f6455b1	Langchain vectorstore integration with Kinetica (#18102 ) - Description: New vectorstore integration with the Kinetica database - Issue: - Dependencies: the Kinetica Python API `pip install gpudb==7.2.0.1`, - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: --------- Co-authored-by: Chad Juliano <cjuliano@kinetica.com>	2024-02-26 12:46:48 -08:00
Dan Stambler	69344a0661	community: Add Laser Embedding Integration (#18111 ) - Description: Added Integration with Meta AI's LASER Language-Agnostic SEntence Representations embedding library, which supports multilingual embedding for any of the languages listed here: https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200, including several low resource languages - Dependencies: laser_encoders	2024-02-26 12:16:37 -08:00
Heidi Steen	166f3d8351	Docs: azuresearch.ipynb (in docs/docs/integrations/vectorstores) -- fixed headings and comments (#18135 ) This PR updates azuresearch.ipynb with an edit to the introduction sentence, consistent heading levels, and disambiguation in code comments.	2024-02-26 11:46:55 -08:00
Barun Amalkumar Halder	23fc7c8c90	docs [patch] : fix import to use community path for handler in fiddler notebook (#18140 ) Description: Update the example fiddler notebook to use community path, instead of langchain.callback Dependencies: None Twitter handle: @bhalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-26 11:41:07 -08:00
Bagatur	96bff0ed5d	infra: create api rst for specific pkg (#18144 ) Example: create rst for libs/core only ```bash poetry run python docs/api_reference/create_api_rst.py core ```	2024-02-26 11:04:22 -08:00
Erick Friis	f5cf6975ba	docs: anthropic partner package docs (#18109 )	2024-02-26 17:51:44 +00:00
rongchenlin	9147a437f1	docs: Fix the bug in MongoDBChatMessageHistory notebook (#18128 ) I tried to configure MongoDBChatMessageHistory using the code from the original documentation to store messages based on the passed session_id in MongoDB. However, this configuration did not take effect, and the session id in the database remained as 'test_session'. To resolve this issue, I found that when configuring MongoDBChatMessageHistory, it is necessary to set session_id=session_id instead of session_id=test_session. Issue: DOC: Ineffective Configuration of MongoDBChatMessageHistory for Custom session_id Storage previous code： ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id="test_session", connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` ![image](https://github.com/langchain-ai/langchain/assets/83388493/c372f785-1ec1-43f5-8d01-b7cc07b806b7) Modified code: ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id=session_id, # here is my modify code connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` Effect after modification (it works)： ![image](https://github.com/langchain-ai/langchain/assets/83388493/5776268c-9098-4da3-bf41-52825be5fafb)	2024-02-26 15:02:56 +00:00
Matt	3b08617a89	docs: update azure search langchain notebook (#18053 ) Description: Update the azure search notebook to have more descriptive comments, and an option to choose between OpenAI and AzureOpenAI Embeddings --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:48:13 -08:00
Barun Amalkumar Halder	cc69976860	community[minor] : adds callback handler for Fiddler AI (#17708 ) Description: Callback handler to integrate fiddler with langchain. This PR adds the following - 1. `FiddlerCallbackHandler` implementation into langchain/community 2. Example notebook `fiddler.ipynb` for usage documentation [Internal Tracker : FDL-14305] Issue: NA Dependencies: - Installation of langchain-community is unaffected. - Usage of FiddlerCallbackHandler requires installation of latest fiddler-client (2.5+) Twitter handle: @fiddlerlabs @behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-25 18:17:03 -08:00
Christophe Bornet	b8b5ce0c8c	astradb: Add AstraDBChatMessageHistory to langchain-astradb package (#17732 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:14:49 -08:00
BeatrixCohere	5d2d80a9a8	docs: Add Cohere examples in documentation (#17794 ) - Description: Add cohere examples to documentation - Issue:N/A - Dependencies: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:10:09 -08:00
Jacob Lee	c9eac3287e	docs[patch]: Remove redundant Pinecone import (#18079 ) CC @efriis	2024-02-24 19:27:54 -08:00
Erick Friis	e85948d46b	docs: fireworks tool calling docs (#18057 )	2024-02-24 00:49:11 +00:00
Erick Friis	1a3383fba1	docs: fireworks fixes (#18056 )	2024-02-23 15:58:53 -08:00
Yufei (Benny) Chen	ee6a773456	fireworks[patch]: Add Fireworks partner packages (#17694 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-23 20:45:47 +00:00
Erick Friis	11cf95e810	docs: recommend lambdas over runnablebranch (#18033 )	2024-02-23 11:34:27 -08:00
Reid Falconer	0534ba5a7d	langchain[patch]: return formatted SPARQL query on demand (#11263 ) - Description: Added the `return_sparql_query` feature to the `GraphSparqlQAChain` class, allowing users to get the formatted SPARQL query along with the chain's result. - Issue: NA - Dependencies: None Note: I've ensured that the PR passes linting and testing by running make format, make lint, and make test locally. I have added a test for the integration (which relies on network access) and I have added an example to the notebook showing its use.	2024-02-22 17:03:26 -08:00
Issac	46505742eb	Update quickstart.mdx (#17659 ) https://github.com/langchain-ai/langchain/issues/17657 Thank you for contributing to LangChain! Checklist: - [ ] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: Delete this entire template message and replace it with the following bulleted list - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-22 17:01:40 -08:00
Stan Duprey	15e42f1799	docs: Added `langchainhub` install and fixed typo (#17985 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-22 16:03:40 -08:00
Chad Juliano	50ba3c68bb	community[minor]: add Kinetica LLM wrapper (#17879 ) Description: Initial pull request for Kinetica LLM wrapper Issue: N/A Dependencies: No new dependencies for unit tests. Integration tests require gpudb, typeguard, and faker Twitter handle: @chad_juliano Note: There is another pull request for Kinetica vectorstore. Ultimately we would like to make a partner package but we are starting with a community contribution.	2024-02-22 16:02:00 -08:00
Matt	6ef12fdfd2	docs: Update Azure Search vector store notebook (#17901 ) - Description: Update the Azure Search vector store notebook for the latest version of the SDK --------- Co-authored-by: Matt Gotteiner <[email protected]>	2024-02-22 15:59:43 -08:00
Averi Kitsch	c05cbf0533	docs: Update Google Provider documentation (#17970 ) Description: Clean up Google product names and fix document loader section Issue: NA Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 15:58:52 -08:00
Erick Friis	ed789be8f4	docs, templates: update schema imports to core (#17885 ) - chat models, messages - documents - agentaction/finish - baseretriever,document - stroutputparser - more messages - basemessage - format_document - baseoutputparser --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 15:58:44 -08:00
Leonid Ganeline	f685d2f50c	docs: partner package list (#17978 ) Updated partner package list	2024-02-22 18:23:07 -05:00
Erick Friis	29660f8918	docs: logo (#17972 )	2024-02-22 15:20:34 -08:00
bear	e8633e53c4	docs: Rerun the Tongyi Qwen model to fix incorrect responses. (#17693 ) This PR updates the docs of Tongyi Qwen model. 1. fix the previously incorrect responses of the Tongyi Qwen. 2. rewrite the case with LCEL.	2024-02-22 13:20:04 -08:00
Mateusz Szewczyk	f6e3aa9770	docs: update IBM watsonx.ai docs (#17932 ) - Description: Update IBM watsonx.ai docs and add IBM as a provider docs - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅	2024-02-22 10:22:18 -08:00
Erick Friis	a53370a060	pinecone[patch], docs: PineconeVectorStore, release 0.0.3 (#17896 )	2024-02-22 08:24:08 -08:00
Graden Rea	e5e38e89ce	partner: Add groq partner integration and chat model (#17856 ) Description: Add a Groq chat model issue: TODO Dependencies: groq Twitter handle: N/A	2024-02-22 07:36:16 -08:00
William FH	da957a22cc	Redirect the expression language guides (#17914 )	2024-02-22 00:39:57 -08:00
Leonid Ganeline	919b8a387f	docs: sorting `Examples using ...` section (#17588 ) The API Reference docs. If the class has a long list of the examples that works with this class, then the `Examples using` list is [hard to comprehend](https://api.python.langchain.com/en/latest/llms/langchain_community.llms.openai.OpenAI.html#langchain-community-llms-openai-openai). If this list is sorted it would be much easier. - sorting the `Examples using <ClassName>` list	2024-02-21 17:04:23 -08:00
Raunak	1ec8199c8e	community[patch]: Added more functions in NetworkxEntityGraph class (#17624 ) - Description: 1. Added add_node(), remove_node(), has_node(), remove_edge(), has_edge() and get_neighbors() functions in NetworkxEntityGraph class. 2. Added the above functions in graph_networkx_qa.ipynb documentation.	2024-02-21 17:02:56 -08:00
William FH	42f158c128	docs: typo (#17710 )	2024-02-21 16:53:41 -08:00
Neli Hateva	66e1005898	docs: Update Links to resources in the GraphDB QA Chain documentation (#17720 ) - Description: Update Links to resources in the GraphDB QA Chain documentation - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-02-21 16:51:32 -08:00
Ian	3019a594b7	community[minor]: Add tidb loader support (#17788 ) This pull request support loading data from TiDB database with Langchain. A simple usage: ``` from langchain_community.document_loaders import TiDBLoader CONNECTION_STRING = "mysql+pymysql://root@127.0.0.1:4000/test" QUERY = "select id, name, description from items;" loader = TiDBLoader( connection_string=CONNECTION_STRING, query=QUERY, page_content_columns=["name", "description"], metadata_columns=["id"], ) documents = loader.load() print(documents) ```	2024-02-21 16:42:33 -08:00
Jacob Lee	375051a64e	👥 Update LangChain people data (#17900 ) 👥 Update LangChain people data --------- Co-authored-by: github-actions <github-actions@github.com>	2024-02-21 16:38:28 -08:00
Bagatur	762f49162a	docs: fix api build (#17898 )	2024-02-21 16:34:37 -08:00
Michael Feil	242981b8f0	community[minor]: infinity embedding local option (#17671 ) drop-in-replacement for sentence-transformers inference. https://github.com/langchain-ai/langchain/discussions/17670 tldr from the discussion above -> around a 4x-22x speedup over using SentenceTransformers / huggingface embeddings. For more info: https://github.com/michaelfeil/infinity (pure-python dependency) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-21 16:33:13 -08:00
Aymen EL Amri	581095b9b5	docs: fix a small typo (#17859 ) Just a small typo	2024-02-21 16:31:31 -08:00

... 2 3 4 5 6 ...

3386 Commits