langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
kYLe	467b082c34	Modify Anyscale integration to work with Anyscale Endpoint (#11569 ) Description: Modify Anyscale integration to work with [Anyscale Endpoint](https://docs.endpoints.anyscale.com/) and it supports invoke, async invoke, stream and async invoke features --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 08:41:25 -07:00
Shreyas S	70a793ca9d	Update zep_memory.ipynb (#11713 ) fixed minor typos; the your > your on > upon	2023-10-12 10:41:19 -04:00
Surav Shrestha	e61b528c0e	Fix typos in docs/docs/use_cases/question_answering/code_understandin… (#11710 ) herarchy -> hierarchy	2023-10-12 10:17:23 -04:00
Surav Shrestha	f386ac3bef	Fix typos in docs/docs/use_cases/tagging.ipynb (#11712 ) funtion -> function	2023-10-12 10:17:10 -04:00
Surav Shrestha	ac73154005	Fix typos in docs/docs/use_cases/question_answering/conversational_re… (#11709 ) neccessary -> necessary	2023-10-12 10:16:52 -04:00
Surav Shrestha	af9ce3c224	Fix typos in docs/docs/use_cases/chatbots.ipynb (#11707 ) implemet -> implement	2023-10-12 10:16:34 -04:00
Surav Shrestha	77fcaa410a	Fix typos in docs/docs/use_cases/extraction.ipynb (#11708 ) This PR has a number of typos correction. I kindly request the repo maintainers to review this PR and merge it.	2023-10-12 10:16:17 -04:00
nuric	44da27c07b	Add SemaDB VST wrapper (#11484 ) - Description: Adding vectorstore wrapper for [SemaDB](https://rapidapi.com/semafind-semadb/api/semadb). - Issue: None - Dependencies: None - Twitter handle: semafind Checks performed: - [x] `make format` - [x] `make lint` - [x] `make test` - [x] `make spell_check` - [x] `make docs_build` Documentation added: - SemaDB vectorstore wrapper tutorial	2023-10-11 19:09:38 -07:00
Leonid Kuligin	2aba9ab47e	Retriever based on GCP DocAI Warehouse (#11400 ) - Description: implements a retriever on top of DocAI Warehouse (to interact with existing enterprise documents) https://cloud.google.com/document-ai-warehouse?hl=en - Issue: new functionality @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 19:08:53 -07:00
mvhensbergen	629d9b78fa	Make example work during pydantic transition (#11498 ) Description: Make the example extraction code on https://python.langchain.com/docs/use_cases/extraction work again by importing the langchain.pydantic_v1 lib instead of the v2. Issue: Solves issue https://github.com/langchain-ai/langchain/issues/11468 Co-authored-by: Martin van Hensbergen <martin@mvhensbergen.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 18:44:47 -07:00
ElliotKetchup	7ae8b7f065	Llama doc: add 'language' to the response message (#11543 ) - Description: add 'language' to the response message in the Llama doc, - Issue: None, - Dependencies: None, - Tag maintainer: None, - Twitter handle: None Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 17:06:04 -07:00
Shinya Maeda	1f7edcd08b	doc: Fix documentation about n-gram overlap (#11549 ) Fix the documentation in https://python.langchain.com/docs/modules/model_io/prompts/example_selectors/ngram_overlap. It's currently declaring unrelated variables, for example, `examples` local variable is declared twice and the first one is overwritten immediately. - Issue: N/A - Dependencies: N/A - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: @dosuken123	2023-10-11 16:26:56 -07:00
maks-operlejn-ds	3c83779661	Qa with anonymization (#11658 ) Added demo for QA system with anonymization. It will be part of LangChain's privacy webinar. @hwchase17 @baskaryan @nfcampos Twitter handle: @MaksOpp --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 15:38:08 -07:00
Shreyas S	3cd0827785	Update kay.ipynb (#11676 ) Fixed title display	2023-10-11 14:02:11 -07:00
Vinay Kakade	dd0cd98861	Add support for ChatOpenAI models in Infino callback handler (#11608 ) Description: This PR adds support for ChatOpenAI models in the Infino callback handler. In particular, this PR implements `on_chat_model_start` callback, so that ChatOpenAI models are supported. With this change, Infino callback handler can be used to track latency, errors, and prompt tokens for ChatOpenAI models too (in addition to the support for OpenAI and other non-chat models it has today). The existing example notebook is updated to show how to use this integration as well. cc/ @naman-modi @savannahar68 Issue: https://github.com/langchain-ai/langchain/issues/11607 Dependencies: None Tag maintainer: @hwchase17 Twitter handle: [@vkakade](https://twitter.com/vkakade)	2023-10-11 14:00:54 -07:00
Israel Ekpo	d0603c86b6	Add Support for Azure Cosmos DB MongoDB vCore Vector Store #11627 (#11632 ) This PR adds support for the Azure Cosmos DB MongoDB vCore Vector Store https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/ https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/vector-search Summary: - Description: added vector store integration for Azure Cosmos DB MongoDB vCore Vector Store, - Issue: the issue # it fixes #11627, - Dependencies: pymongo dependency, - Tag maintainer: @hwchase17, - Twitter handle: @izzyacademy --------- Co-authored-by: Israel Ekpo <israel.ekpo@gmail.com> Co-authored-by: Israel Ekpo <44282278+izzyacademy@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 13:56:46 -07:00
Erick Friis	2c1e735403	Fix runnable docs link (#11675 )	2023-10-11 13:11:23 -07:00
Ikko Eltociear Ashimine	7d0dda7e41	Fix typo in baidu_qianfan_endpoint.ipynb (#11667 ) enviroment -> environment	2023-10-11 16:01:18 -04:00
Bagatur	cf86447623	Start cookbook and move stuff from use cases (#11636 )	2023-10-11 12:27:13 -07:00
Bassem Yacoube	5451b724fc	Adds support for llama2 and fixes MPT-7b url (#11465 ) - Description: This is an update to OctoAI LLM provider that adds support for llama2 endpoints hosted on OctoAI and updates MPT-7b url with the current one. @baskaryan Thanks! --------- Co-authored-by: ML Wiz <bassemgeorgi@gmail.com>	2023-10-10 20:34:35 -07:00
Bagatur	78b4c7d5a0	collapse sidebar peer items (#11639 )	2023-10-10 19:56:21 -07:00
Bagatur	0ca8d4449c	add ls guide redirect (#11623 )	2023-10-10 12:58:04 -07:00
Bagatur	eedfddac2d	Restructure docs (#11620 )	2023-10-10 12:55:19 -07:00
ElliotKetchup	683f4a93b9	Update azureml_chat_endpoint code example (#11602 ) - Description: azureml_chat_endpoint code example now takes endpoint_url and endpoint_api_key parameter into consideration, - Issue: None, - Dependencies: None, - Tag maintainer: None, - Twitter handle: @ElliotAlladaye	2023-10-10 10:27:28 -07:00
Yong woo Song	fca34eb122	Fix: invalid link to chat model in openai platform docs (#11609 ) There are some invalid links in the OpenAI platform [docs](https://python.langchain.com/docs/integrations/platforms/openai), so I replaced them with valid links. - `/docs/integrations/chat_models/openai` -> `/docs/integrations/chat/openai` - `/docs/integrations/chat_models/azure_openai` -> `/docs/integrations/chat/azure_chat_openai` Thanks! ☺️	2023-10-10 10:22:39 -07:00
Shubham Kushwaha	49de862076	Arcee.ai LLM & Retriever integration (#11579 ) - Description: This PR introduces a new LLM and Retriever API to https://arcee.ai for the python client - Issue: implements the integrations as requested in #11578 , - Dependencies: no dependencies are required, - Tag maintainer: @hwchase17 - Twitter handle: shwooobham ✅ `make format`, `make lint` and `make test` runs locally. ```shell =========== 1245 passed, 277 skipped, 20 warnings in 16.26s =========== ./scripts/check_pydantic.sh . ./scripts/check_imports.sh poetry run ruff . [ "." = "" ] || poetry run black . --check All done! ✨ 🍰 ✨ 1818 files would be left unchanged. [ "." = "" ] || poetry run mypy . Success: no issues found in 1815 source files [ "." = "" ] || poetry run black . All done! ✨ 🍰 ✨ 1818 files left unchanged. [ "." = "" ] || poetry run ruff --select I --fix . poetry run codespell --toml pyproject.toml poetry run codespell --toml pyproject.toml -w ``` Contributions 1. Arcee (langchain/llms), ArceeRetriever (langchain/retrievers), ArceeWrapper (langchain/utilities) 2. docs for Arcee (llms/arcee.py) and ArceeRetriever(retrievers/arcee.py) 3. cc: @jacobsolawetz @ben-epstein --------- Co-authored-by: Shubham <shubham@sORo.local>	2023-10-10 10:20:45 -07:00
Eugene Yurtsev	b6a2507794	Docs to use LLMSymbolicMath and LLMBash + utilities from experimental (#11614 ) Update docs in lieu of: https://github.com/langchain-ai/langchain/discussions/11352	2023-10-10 13:11:46 -04:00
Leonid Ganeline	59adeaddb3	docs: update `dependents` (#11502 ) A regular update of dependents.	2023-10-10 09:31:23 -07:00
Bagatur	b642d00f9f	rm slack from community.md (#11610 )	2023-10-10 07:55:26 -07:00
unifyh	fd7f129f10	Docs: Fix broken line breaks in snippets (#11523 ) Description: This PR fixes some code snippets that have raw `\n`'s instead of actual line breaks. Issue: Currently some snippets look like this: ![image](https://github.com/langchain-ai/langchain/assets/18213435/355b4911-38e9-4ba4-8570-f928557b6c13) Affected pages: - https://python.langchain.com/docs/integrations/providers/predictionguard#example-usage - https://python.langchain.com/docs/modules/agents/how_to/custom_llm_agent#set-up-environment - https://python.langchain.com/docs/modules/chains/foundational/llm_chain#get-started - https://python.langchain.com/docs/integrations/providers/shaleprotocol#how-to Tag maintainer: @hwchase17	2023-10-09 15:40:27 -07:00
Michael Landis	8e45f720a8	feat: add momento vector index as a vector store provider (#11567 ) Description: - Added Momento Vector Index (MVI) as a vector store provider. This includes an implementation with docstrings, integration tests, a notebook, and documentation on the docs pages. - Updated the Momento dependency in pyproject.toml and the lock file to enable access to MVI. - Refactored the Momento cache and chat history session store to prefer using "MOMENTO_API_KEY" over "MOMENTO_AUTH_TOKEN" for consistency with MVI. This change is backwards compatible with the previous "auth_token" variable usage. Updated the code and tests accordingly. Dependencies: - Updated Momento dependency in pyproject.toml. Testing: - Run the integration tests with a Momento API key. Get one at the [Momento Console](https://console.gomomento.com) for free. MVI is available in AWS us-west-2 with a superuser key. - `MOMENTO_API_KEY=<your key> poetry run pytest tests/integration_tests/vectorstores/test_momento_vector_index.py` Tag maintainer: @eyurtsev Twitter handle: Please mention @momentohq for this addition to langchain. With the integration of Momento Vector Index, Momento caching, and session store, Momento provides serverless support for the core langchain data needs. Also mention @mlonml for the integration.	2023-10-09 14:02:59 -07:00
MSFTeegarden	923e9f9596	Add Azure Redis example (#11570 ) Description This PR adds an additional Example to the Redis integration documentation. [The example](https://learn.microsoft.com/azure/azure-cache-for-redis/cache-tutorial-vector-similarity) is a step-by-step walkthrough of using Azure Cache for Redis and Azure OpenAI for vector similarity search, using LangChain extensively throughout. Issue Nothing specific, just adding an additional example. Dependencies None. Tag Maintainer Tagging @hwchase17 :)	2023-10-09 13:27:03 -07:00
maks-operlejn-ds	4d62def9ff	Better deanonymizer matching strategy (#11557 ) @baskaryan, @hwchase17	2023-10-09 11:10:29 -07:00
Bagatur	0a754fa286	redirect langsmith guides (#11562 )	2023-10-09 09:58:03 -07:00
Holt Skinner	09c66fe04f	feat: Update Google Document AI Parser (#11413 ) - Description: Code Refactoring, Documentation Improvements for Google Document AI PDF Parser - Adds Online (synchronous) processing option. - Adds default field mask to limit payload size. - Skips Human review by default. - Issue: Fixes #10589 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-09 08:04:25 -07:00
Nuno Campos	628cc4cce8	Rename RunnableMap to RunnableParallel (#11487 ) - keep alias for RunnableMap - update docs to use RunnableParallel and RunnablePassthrough.assign	2023-10-09 11:22:03 +01:00
William FH	eb572f41a6	Add LangSmith Run Chat Loader (#11458 )	2023-10-06 17:02:18 -07:00
Leonid Ganeline	c3d2b01adf	docs: `integrations/retrievers` cleanup (#11388 ) fixed several notebooks: - headers - formats --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-06 13:40:46 -07:00
Maciej Dzieżyc	bcd308c368	Fix Open in Colab link for ClearML docs 2 (#11491 ) Description: Fixed the Open in Colab link for ClearML docs Issue: https://github.com/allegroai/clearml/issues/1125 Twitter handle: DziezycMaciej	2023-10-06 12:01:47 -07:00
Bagatur	88ab69c288	mv docs extras (#11399 )	2023-10-06 10:09:41 -07:00
Bagatur	1bf8ef1a4f	rm brave (#11482 )	2023-10-06 07:44:19 -07:00
Theron Tau	35297ca0d3	Add feature for extracting images from pdf and recognizing text from images. (#10653 ) Description It is for #10423 that it will be a useful feature if we can extract images from pdf and recognize text on them. I have implemented it with `PyPDFLoader`, `PyPDFium2Loader`, `PyPDFDirectoryLoader`, `PyMuPDFLoader`, `PDFMinerLoader`, and `PDFPlumberLoader`. [RapidOCR](https://github.com/RapidAI/RapidOCR.git) is used to recognize text on extracted images. OCR is time-consuming, so a boolean parameter `extract_images` is set to control whether to extract and recognize. I have tested the time usage for each parser on my own laptop thinkbook 14+ with AMD R7-6800H by unit test and the result is: | extract_images | PyPDFParser | PDFMinerParser | PyMuPDFParser | PyPDFium2Parser | PDFPlumberParser | | ------------- | ------------- | ------------- | ------------- | ------------- | ------------- | | False | 0.27s | 0.39s | 0.06s | 0.08s | 1.01s | | True | 17.01s | 20.67s | 20.32s | 19.75s | 20.55s | Issue #10423 Dependencies rapidocr_onnxruntime in [RapidOCR](https://github.com/RapidAI/RapidOCR/tree/main) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 18:51:59 -07:00
Bagatur	a3a2ce623e	Revise vowpal_wabbit notebook	2023-10-05 18:18:19 -07:00
Bagatur	8fafa1af91	merge	2023-10-05 18:09:35 -07:00
rodrigo-clickup	5944c1851b	Add ClickUp Toolkit (#10662 ) - Description: Adds a toolkit to interact with the [ClickUp](https://clickup.com/) [Public API](https://clickup.com/api/) - Dependencies: None - Tag maintainer: @rodrigo-georgian, @rodrigo-clickup, @aiswaryasankarwork - Twitter handle: - Aiswarya (https://twitter.com/Aiswarya_Sankar, https://www.linkedin.com/in/sankaraiswarya/) - Rodrigo (https://www.linkedin.com/in/rodrigo-ceballos-lentini/) --------- Co-authored-by: Aiswarya Sankar <aiswaryasankar@Aiswaryas-MacBook-Pro.local> Co-authored-by: aiswaryasankarwork <143119412+aiswaryasankarwork@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 16:33:05 -07:00
Beck Bekmyradov	f9df55f7d2	Fix a Typo in Documentation (#11453 ) - Description: This commit corrects a minor typo in the documentation. It changes "frum" to "from" in the sentence: "The results from search are passed back to the LLM for synthesis into an answer" in the file `docs/extras/use_cases/more/agents/agents.ipynb`. This typo fix enhances the clarity and accuracy of the documentation. - Tag maintainer: @baskaryan	2023-10-05 15:34:06 -07:00
Bagatur	f5ce286932	fix api docs build (#11445 )	2023-10-05 15:33:11 -07:00
mrbean	9903a70379	Add youdotcom retriever (#11304 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 13:48:11 -07:00
Syed Ather Rizvi	bfd48925e5	Feature/csharp text splitter doc (#10571 ) - Description: Just docs related to csharp code splitter - Issue: It's related to a request made by @baskaryan in a comment on my previous PR #10350 - Dependencies: None - Twitter handle: @ather19 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 12:22:54 -07:00
maks-operlejn-ds	2aae1102b0	Instance anonymization (#10501 ) ### Description Add instance anonymization - if `John Doe` will appear twice in the text, it will be treated as the same entity. The difference between `PresidioAnonymizer` and `PresidioReversibleAnonymizer` is that only the second one has a built-in memory, so it will remember anonymization mapping for multiple texts: ``` >>> anonymizer = PresidioAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Brett Russell. Hi Brett Russell!' ``` ``` >>> anonymizer = PresidioReversibleAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' ``` ### Twitter handle @deepsense_ai / @MaksOpp ### Tag maintainer @baskaryan @hwchase17 @hinthornw --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 11:23:02 -07:00

2214 Commits