langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	3be76ee2fa	Add security.md (#11881 ) Add security markdown file	2023-10-16 17:41:21 -04:00
Bagatur	e3664272f0	Add LCEL to output parser doc (#11880 )	2023-10-16 12:35:18 -07:00
Bagatur	049a0357e7	Add LCEL to prompt doc (#11875 )	2023-10-16 11:34:31 -07:00
Bagatur	ece22b6b6a	Add LCEL to LLM intro (#11835 )	2023-10-15 14:59:45 -07:00
Bagatur	ffa1b3a758	Add LCEL to chat model intro (#11834 )	2023-10-15 14:59:36 -07:00
Bagatur	6c5bb1b2e1	RM snippets (#11798 )	2023-10-15 12:20:58 -07:00
Harrison Chase	a506302772	bearly tool (#11812 )	2023-10-14 16:03:58 -07:00
Bagatur	d6e34ca2ee	fix recent docs integrations file loc (#11782 )	2023-10-13 13:58:26 -07:00
Michael Feil	233a904f2e	GradientLLM Docs update and model_id renaming. (#10963 ) Related to #10800 - Errors in the Docstring of GradientLLM / Gradient.ai LLM - Renamed the `model_id` to `model` and adapting this in all tests. Reason to so is to be in Sync with `GradientEmbeddings` and other LLM's. - inmproving tests so they check the headers in the sent request. - making the aiosession a private attribute in the docs, as in the future `pip install gradientai` will be replacing aiosession. - adding a example how to fine-tune on the Prompt Template as suggested in #10800	2023-10-13 13:57:58 -07:00
David	6876b02c87	Move EverlyAI python notebook to the right location (#11779 ) Hi, After submitting https://github.com/langchain-ai/langchain/pull/11357, we realized that the notebooks are moved to a new location. Sending a new PR to update the doc. --------- Co-authored-by: everly-studio <127131037+everly-studio@users.noreply.github.com>	2023-10-13 13:34:27 -07:00
David	9d200e6cbe	Create ChatEverlyAI (#11357 ) - Description: Adds the ChatEverlyAI class with llama-2 7b on [EverlyAI Hosted Endpoints](https://everlyai.xyz/) - It inherits from ChatOpenAI and requires openai (probably unnecessary but it made for a quick and easy implementation) --------- Co-authored-by: everly-studio <127131037+everly-studio@users.noreply.github.com>	2023-10-13 12:25:11 -07:00
Nuno Campos	17c69678ab	Revert "New add Baichuan Model" (#11761 ) Reverts langchain-ai/langchain#11714 This has linting and formatting issues, plus it's added to chat models folder but doesn't subclass Chat Model base class	2023-10-13 08:23:15 -07:00
cloudscool	56653c53aa	New add Baichuan Model (#11714 ) Motivation and Context At present, the Baichuan Large Language Model is relatively popular and efficient in performance. Due to widespread market recognition, this model has been added to enhance the scalability of Langchain's ability to access the big language model, so as to facilitate application access and usage for interested users. System Info langchain： 0.0.295 python：3.8.3 IDE：vs code Description Add the following files: 1. Add baichuan_baichuaninc_endpoint.py in the libs/langchain/langchain/chat_models 2. Modify the __init__.py file,which is located in the libs/langchain/langchain/chat_models/__init__.py： a. Add "from langchain.chat_models.baichuan_baichuaninc_endpoint import BaichuanChatEndpoint" b. Add "BaichuanChatEndpoint" In the file's __ All__ method Your contribution I am willing to help implement this feature and submit a PR, but I would appreciate guidance from the maintainers or community to ensure the changes are made correctly and in line with the project's standards and practices.	2023-10-12 23:04:28 -07:00
Shreyas S	694d768174	Minor fix (#11748 ) changed > to over	2023-10-12 22:36:31 -07:00
Bagatur	8e6fa5f1d7	mv self-query docs to integrations (#11744 )	2023-10-12 22:36:07 -07:00
Burak Yılmaz	63e516c2b0	Upstash redis integration (#10871 ) - Description: Introduced Upstash provider with following wrappers: UpstashRedisCache, UpstashRedisEntityStore, UpstashRedisChatMessageHistory, UpstashRedisStore - Issue: -, - Dependencies: upstash-redis python package is needed, - Tag maintainer: @baskaryan - Twitter handle: @BurakY744 --------- Co-authored-by: Burak Yılmaz <burakyilmaz@Buraks-MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 17:36:51 -07:00
Richy Wang	11cdfe44af	Implement Alibaba Tongyi chat model apis. (#10922 ) Hi there This PR is aim to implement chat model for Alibaba Tongyi LLM model. It contains work below: 1.Implement ChatTongyi chat model in langchain.chat_models.tongyi. Note this is different with tongyi llm model to another PR https://github.com/langchain-ai/langchain/pull/10878. For detail it implements _generate() and _stream() function in ChatTongyi. 2. Add some examples in chat/tongyi.ipynb. 3. Add integration test in chat_models/test_tongyi.py Note async completion for the Text API is not yet supported. Dependencies: dashscope. It will be installed manually cause it is not need by everyone.	2023-10-12 16:59:37 -07:00
Adam Demjen	008348ce71	Add ElasticsearchChatMessageHistory (#10932 ) Description This PR adds the `ElasticsearchChatMessageHistory` implementation that stores chat message history in the configured [Elasticsearch](https://www.elastic.co/elasticsearch/) deployment. ```python from langchain.memory.chat_message_histories import ElasticsearchChatMessageHistory history = ElasticsearchChatMessageHistory( es_url="https://my-elasticsearch-deployment-url:9200", index="chat-history-index", session_id="123" ) history.add_ai_message("This is me, the AI") history.add_user_message("This is me, the human") ``` Dependencies - [elasticsearch client](https://elasticsearch-py.readthedocs.io/) required Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 16:51:38 -07:00
Bagatur	d3a5090e12	mv semadb docs (#11743 )	2023-10-12 16:31:09 -07:00
Bagatur	acdbdbddb1	clean up doc (#11742 ) committed old doc in wrong place	2023-10-12 16:26:55 -07:00
Mateusz Kozak	e42a576cb2	update Qdrant documentation (#3105 ) fix `from_documents` method usage for Qdrant in documentation as previous example doesn't work --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 16:20:18 -07:00
Predrag Gruevski	9e32120cbb	Deprecate direct access to globals like `debug` and `verbose`. (#11311 ) Instead of accessing `langchain.debug`, `langchain.verbose`, or `langchain.llm_cache`, please use the new getter/setter functions in `langchain.globals`: - `langchain.globals.set_debug()` and `langchain.globals.get_debug()` - `langchain.globals.set_verbose()` and `langchain.globals.get_verbose()` - `langchain.globals.set_llm_cache()` and `langchain.globals.get_llm_cache()` Using the old globals directly will now raise a warning. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-12 15:48:04 -07:00
Bagatur	01b7b46908	reorder eval docs (#11738 ) cc @leo-gan	2023-10-12 15:46:55 -07:00
Richard Adams	35965df20d	Rspace doc loader (#11511 ) Description: Add a document loader for the RSpace Electronic Lab Notebook (www.researchspace.com), so that scientific documents and research notes can be easily pulled into Langchain pipelines. Issue This is an new contribution, rather than an issue fix. Dependencies: There are no new required dependencies. In order to use the loader, clients will need to install rspace_client SDK using `pip install rspace_client` --------- Co-authored-by: richarda23 <richard.c.adams@infinityworks.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 15:05:38 -07:00
Ryan Zotti	9d1867c77f	Update docs to specify Indexing-API-compatible vectorstores (#11581 ) Description: Update Indexing API docs to specify vectorstores that are compatible with the Indexing API. I add a unit test to remind developers to update the documentation whenever they add or change a vectorstore in a way that affects compatibility. For the unit test I repurposed existing code from [here](https://github.com/langchain-ai/langchain/blob/v0.0.311/libs/langchain/langchain/indexes/_api.py#L245-L257). This is my first PR to an open source project. This is a trivially simple PR whose main purpose is to make me more comfortable submitting Langchain PRs. If this PR goes through I plan to submit PRs with more substantive changes in the near future. Issue: Resolves [10482](https://github.com/langchain-ai/langchain/discussions/10482). Dependencies: No new dependencies. Twitter handle: None.	2023-10-12 15:17:44 -04:00
Tomaz Bratanic	3759a34229	Add graph construction to neo4j docs (#11716 ) Add graph construction section to Neo4j provider docs	2023-10-12 11:37:42 -07:00
Nuno Campos	b54727fbad	Nc/why lcel (#11717 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-10-12 17:52:20 +01:00
Johnny Deuss	bb2ed4615c	Fix typos (#11663 )	2023-10-12 11:44:03 -04:00
kYLe	467b082c34	Modify Anyscale integration to work with Anyscale Endpoint (#11569 ) Description: Modify Anyscale integration to work with [Anyscale Endpoint](https://docs.endpoints.anyscale.com/) and it supports invoke, async invoke, stream and async invoke features --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-12 08:41:25 -07:00
Shreyas S	70a793ca9d	Update zep_memory.ipynb (#11713 ) fixed minor typos; the your > your on > upon	2023-10-12 10:41:19 -04:00
Surav Shrestha	e61b528c0e	Fix typos in docs/docs/use_cases/question_answering/code_understandin… (#11710 ) herarchy -> hierarchy	2023-10-12 10:17:23 -04:00
Surav Shrestha	f386ac3bef	Fix typos in docs/docs/use_cases/tagging.ipynb (#11712 ) funtion -> function	2023-10-12 10:17:10 -04:00
Surav Shrestha	ac73154005	Fix typos in docs/docs/use_cases/question_answering/conversational_re… (#11709 ) neccessary -> necessary	2023-10-12 10:16:52 -04:00
Surav Shrestha	af9ce3c224	Fix typos in docs/docs/use_cases/chatbots.ipynb (#11707 ) implemet -> implement	2023-10-12 10:16:34 -04:00
Surav Shrestha	77fcaa410a	Fix typos in docs/docs/use_cases/extraction.ipynb (#11708 ) This PR has a number of typos correction. I kindly request the repo maintainers to review this PR and merge it.	2023-10-12 10:16:17 -04:00
nuric	44da27c07b	Add SemaDB VST wrapper (#11484 ) - Description: Adding vectorstore wrapper for [SemaDB](https://rapidapi.com/semafind-semadb/api/semadb). - Issue: None - Dependencies: None - Twitter handle: semafind Checks performed: - [x] `make format` - [x] `make lint` - [x] `make test` - [x] `make spell_check` - [x] `make docs_build` Documentation added: - SemaDB vectorstore wrapper tutorial	2023-10-11 19:09:38 -07:00
Leonid Kuligin	2aba9ab47e	Retriever based on GCP DocAI Warehouse (#11400 ) - Description: implements a retriever on top of DocAI Warehouse (to interact with existing enterprise documents) https://cloud.google.com/document-ai-warehouse?hl=en - Issue: new functionality @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 19:08:53 -07:00
mvhensbergen	629d9b78fa	Make example work during pydantic transition (#11498 ) Description: Make the example extraction code on https://python.langchain.com/docs/use_cases/extraction work again by importing the langchain.pydantic_v1 lib instead of the v2. Issue: Solves issue https://github.com/langchain-ai/langchain/issues/11468 Co-authored-by: Martin van Hensbergen <martin@mvhensbergen.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 18:44:47 -07:00
ElliotKetchup	7ae8b7f065	Llama doc: add 'language' to the response message (#11543 ) - Description: add 'language' to the reponse message in the Llama doc, - Issue: None, - Dependencies: None, - Tag maintainer: None, - Twitter handle: None Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 17:06:04 -07:00
Shinya Maeda	1f7edcd08b	doc: Fix documentation about n-gram overlap (#11549 ) Fix the documentation in https://python.langchain.com/docs/modules/model_io/prompts/example_selectors/ngram_overlap. It's currently declaring unrelated variables, for example, `examples` local variable is declared twice and the first one is overwritten immediately. - Issue: N/A - Dependencies: N/A - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: @dosuken123	2023-10-11 16:26:56 -07:00
maks-operlejn-ds	3c83779661	Qa with anonymization (#11658 ) Added demo for QA system with anonymization. It will be part of LangChain's privacy webinar. @hwchase17 @baskaryan @nfcampos Twitter handle: @MaksOpp --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 15:38:08 -07:00
Shreyas S	3cd0827785	Update kay.ipynb (#11676 ) Fixed title display	2023-10-11 14:02:11 -07:00
Vinay Kakade	dd0cd98861	Add support for ChatOpenAI models in Infino callback handler (#11608 ) Description: This PR adds support for ChatOpenAI models in the Infino callback handler. In particular, this PR implements `on_chat_model_start` callback, so that ChatOpenAI models are supported. With this change, Infino callback handler can be used to track latency, errors, and prompt tokens for ChatOpenAI models too (in addition to the support for OpenAI and other non-chat models it has today). The existing example notebook is updated to show how to use this integration as well. cc/ @naman-modi @savannahar68 Issue: https://github.com/langchain-ai/langchain/issues/11607 Dependencies: None Tag maintainer: @hwchase17 Twitter handle: [@vkakade](https://twitter.com/vkakade)	2023-10-11 14:00:54 -07:00
Israel Ekpo	d0603c86b6	Add Support for Azure Cosmos DB MongoDB vCore Vector Store #11627 (#11632 ) This PR adds support for the Azure Cosmos DB MongoDB vCore Vector Store https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/ https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/vector-search Summary: - Description: added vector store integration for Azure Cosmos DB MongoDB vCore Vector Store, - Issue: the issue # it fixes #11627, - Dependencies: pymongo dependency, - Tag maintainer: @hwchase17, - Twitter handle: @izzyacademy --------- Co-authored-by: Israel Ekpo <israel.ekpo@gmail.com> Co-authored-by: Israel Ekpo <44282278+izzyacademy@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-11 13:56:46 -07:00
Erick Friis	2c1e735403	Fix runnable docs link (#11675 )	2023-10-11 13:11:23 -07:00
Ikko Eltociear Ashimine	7d0dda7e41	Fix typo in baidu_qianfan_endpoint.ipynb (#11667 ) enviroment -> environment	2023-10-11 16:01:18 -04:00
Bagatur	cf86447623	Start cookbook and move stuff from use cases (#11636 )	2023-10-11 12:27:13 -07:00
Bassem Yacoube	5451b724fc	Adds support for llama2 and fixes MPT-7b url (#11465 ) - Description: This is an update to OctoAI LLM provider that adds support for llama2 endpoints hosted on OctoAI and updates MPT-7b url with the current one. @baskaryan Thanks! --------- Co-authored-by: ML Wiz <bassemgeorgi@gmail.com>	2023-10-10 20:34:35 -07:00
Bagatur	78b4c7d5a0	collapse sidebar peer items (#11639 )	2023-10-10 19:56:21 -07:00
Bagatur	0ca8d4449c	add ls guide redirect (#11623 )	2023-10-10 12:58:04 -07:00
Bagatur	eedfddac2d	Restructure docs (#11620 )	2023-10-10 12:55:19 -07:00
ElliotKetchup	683f4a93b9	Update azureml_chat_endpoint code exemple (#11602 ) - Description: azureml_chat_endpoint code exemple now takes endpoint_url and endpoint_api_key parameter into consideration, - Issue: None), - Dependencies: None, - Tag maintainer: None, - Twitter handle: @ElliotAlladaye	2023-10-10 10:27:28 -07:00
Yong woo Song	fca34eb122	Fix: invalid link to chat model in openai platform docs (#11609 ) There is some invalid link in open ai platform [docs](https://python.langchain.com/docs/integrations/platforms/openai). So i fixed it to valid links. - `/docs/integrations/chat_models/openai` -> `/docs/integrations/chat/openai` - `/docs/integrations/chat_models/azure_openai` -> `/docs/integrations/chat/azure_chat_openai` Thanks! ☺️	2023-10-10 10:22:39 -07:00
Shubham Kushwaha	49de862076	Arcee.ai LLM & Retriever integration (#11579 ) - Description: This PR introduces a new LLM and Retriever API to https://arcee.ai for the python client - Issue: implements the integrations as requested in #11578 , - Dependencies: no dependencies are required, - Tag maintainer: @hwchase17 - Twitter handle: shwooobham ✅ `make format`, `make lint` and `make test` runs locally. ```shell =========== 1245 passed, 277 skipped, 20 warnings in 16.26s =========== ./scripts/check_pydantic.sh . ./scripts/check_imports.sh poetry run ruff . [ "." = "" ] \|\| poetry run black . --check All done! ✨ 🍰 ✨ 1818 files would be left unchanged. [ "." = "" ] \|\| poetry run mypy . Success: no issues found in 1815 source files [ "." = "" ] \|\| poetry run black . All done! ✨ 🍰 ✨ 1818 files left unchanged. [ "." = "" ] \|\| poetry run ruff --select I --fix . poetry run codespell --toml pyproject.toml poetry run codespell --toml pyproject.toml -w ``` Contributions 1. Arcee (langchain/llms), ArceeRetriever (langchain/retrievers), ArceeWrapper (langchain/utilities) 2. docs for Arcee (llms/arcee.py) and ArceeRetriever(retrievers/arcee.py) 3. cc: @jacobsolawetz @ben-epstein --------- Co-authored-by: Shubham <shubham@sORo.local>	2023-10-10 10:20:45 -07:00
Eugene Yurtsev	b6a2507794	Docs to use LLMSymbolicMath and LLMBash + utilities from experimental (#11614 ) Update docs in lieu of: https://github.com/langchain-ai/langchain/discussions/11352	2023-10-10 13:11:46 -04:00
Leonid Ganeline	59adeaddb3	docs: update `dependents` (#11502 ) A regular update of dependents.	2023-10-10 09:31:23 -07:00
Bagatur	b642d00f9f	rm slack from community.md (#11610 )	2023-10-10 07:55:26 -07:00
unifyh	fd7f129f10	Docs: Fix broken line breaks in snippets (#11523 ) Description: This PR fix some code snippets that have raw `\n`'s instead of actual line breaks. Issue: Currently some snippets look like this: ![image](https://github.com/langchain-ai/langchain/assets/18213435/355b4911-38e9-4ba4-8570-f928557b6c13) Affected pages: - https://python.langchain.com/docs/integrations/providers/predictionguard#example-usage - https://python.langchain.com/docs/modules/agents/how_to/custom_llm_agent#set-up-environment - https://python.langchain.com/docs/modules/chains/foundational/llm_chain#get-started - https://python.langchain.com/docs/integrations/providers/shaleprotocol#how-to Tag maintainer: @hwchase17	2023-10-09 15:40:27 -07:00
Michael Landis	8e45f720a8	feat: add momento vector index as a vector store provider (#11567 ) Description: - Added Momento Vector Index (MVI) as a vector store provider. This includes an implementation with docstrings, integration tests, a notebook, and documentation on the docs pages. - Updated the Momento dependency in pyproject.toml and the lock file to enable access to MVI. - Refactored the Momento cache and chat history session store to prefer using "MOMENTO_API_KEY" over "MOMENTO_AUTH_TOKEN" for consistency with MVI. This change is backwards compatible with the previous "auth_token" variable usage. Updated the code and tests accordingly. Dependencies: - Updated Momento dependency in pyproject.toml. Testing: - Run the integration tests with a Momento API key. Get one at the [Momento Console](https://console.gomomento.com) for free. MVI is available in AWS us-west-2 with a superuser key. - `MOMENTO_API_KEY=<your key> poetry run pytest tests/integration_tests/vectorstores/test_momento_vector_index.py` Tag maintainer: @eyurtsev Twitter handle: Please mention @momentohq for this addition to langchain. With the integration of Momento Vector Index, Momento caching, and session store, Momento provides serverless support for the core langchain data needs. Also mention @mlonml for the integration.	2023-10-09 14:02:59 -07:00
MSFTeegarden	923e9f9596	Add Azure Redis example (#11570 ) Description This PR adds an additional Example to the Redis integration documentation. [The example](https://learn.microsoft.com/azure/azure-cache-for-redis/cache-tutorial-vector-similarity) is a step-by-step walkthrough of using Azure Cache for Redis and Azure OpenAI for vector similarity search, using LangChain extensively throughout. Issue Nothing specific, just adding an additional example. Dependencies None. Tag Maintainer Tagging @hwchase17 :)	2023-10-09 13:27:03 -07:00
maks-operlejn-ds	4d62def9ff	Better deanonymizer matching strategy (#11557 ) @baskaryan, @hwchase17	2023-10-09 11:10:29 -07:00
Bagatur	0a754fa286	redirect langsmith guides (#11562 )	2023-10-09 09:58:03 -07:00
Holt Skinner	09c66fe04f	feat: Update Google Document AI Parser (#11413 ) - Description: Code Refactoring, Documentation Improvements for Google Document AI PDF Parser - Adds Online (synchronous) processing option. - Adds default field mask to limit payload size. - Skips Human review by default. - Issue: Fixes #10589 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-09 08:04:25 -07:00
Nuno Campos	628cc4cce8	Rename RunnableMap to RunnableParallel (#11487 ) - keep alias for RunnableMap - update docs to use RunnableParallel and RunnablePassthrough.assign <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-10-09 11:22:03 +01:00
William FH	eb572f41a6	Add LangSmith Run Chat Loader (#11458 )	2023-10-06 17:02:18 -07:00
Leonid Ganeline	c3d2b01adf	docs: `integrations/retrievers` cleanup (#11388 ) fixed several notebooks: - headers - formats --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-06 13:40:46 -07:00
Maciej Dzieżyc	bcd308c368	Fix Open in Colab link for ClearML docs 2 (#11491 ) Description: Fixed the Open in Colab link for ClearML docs Issue: https://github.com/allegroai/clearml/issues/1125 Twitter handle: DziezycMaciej	2023-10-06 12:01:47 -07:00
Bagatur	88ab69c288	mv docs extras (#11399 )	2023-10-06 10:09:41 -07:00
Bagatur	1bf8ef1a4f	rm brave (#11482 )	2023-10-06 07:44:19 -07:00
Theron Tau	35297ca0d3	Add feature for extracting images from pdf and recognizing text from images. (#10653 ) Description It is for #10423 that it will be a useful feature if we can extract images from pdf and recognize text on them. I have implemented it with `PyPDFLoader`, `PyPDFium2Loader`, `PyPDFDirectoryLoader`, `PyMuPDFLoader`, `PDFMinerLoader`, and `PDFPlumberLoader`. [RapidOCR](https://github.com/RapidAI/RapidOCR.git) is used to recognize text on extracted images. It is time-consuming for ocr so a boolen parameter `extract_images` is set to control whether to extract and recognize. I have tested the time usage for each parser on my own laptop thinkbook 14+ with AMD R7-6800H by unit test and the result is: \| extract_images \| PyPDFParser \| PDFMinerParser \| PyMuPDFParser \| PyPDFium2Parser \| PDFPlumberParser \| \| ------------- \| ------------- \| ------------- \| ------------- \| ------------- \| ------------- \| \| False \| 0.27s \| 0.39s \| 0.06s \| 0.08s \| 1.01s \| \| True \| 17.01s \| 20.67s \| 20.32s \| 19,75s \| 20.55s \| Issue #10423 Dependencies rapidocr_onnxruntime in [RapidOCR](https://github.com/RapidAI/RapidOCR/tree/main) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 18:51:59 -07:00
Bagatur	a3a2ce623e	Revise vowpal_wabbit notebook	2023-10-05 18:18:19 -07:00
Bagatur	8fafa1af91	merge	2023-10-05 18:09:35 -07:00
rodrigo-clickup	5944c1851b	Add ClickUp Toolkit (#10662 ) - Description: Adds a toolkit to interact with the [ClickUp](https://clickup.com/) [Public API](https://clickup.com/api/) - Dependencies: None - Tag maintainer: @rodrigo-georgian, @rodrigo-clickup, @aiswaryasankarwork - Twitter handle: - Aiswarya (https://twitter.com/Aiswarya_Sankar, https://www.linkedin.com/in/sankaraiswarya/) - Rodrigo (https://www.linkedin.com/in/rodrigo-ceballos-lentini/) --------- Co-authored-by: Aiswarya Sankar <aiswaryasankar@Aiswaryas-MacBook-Pro.local> Co-authored-by: aiswaryasankarwork <143119412+aiswaryasankarwork@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 16:33:05 -07:00
Beck Bekmyradov	f9df55f7d2	Fix a Typo in Documentation (#11453 ) - Description: This commit corrects a minor typo in the documentation. It changes "frum" to "from" in the sentence: "The results from search are passed back to the LLM for synthesis into an answer" in the file `docs/extras/use_cases/more/agents/agents.ipynb`. This typo fix enhances the clarity and accuracy of the documentation. - Tag maintainer: @baskaryan	2023-10-05 15:34:06 -07:00
Bagatur	f5ce286932	fix api docs build (#11445 )	2023-10-05 15:33:11 -07:00
mrbean	9903a70379	Add youdotcom retriever (#11304 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 13:48:11 -07:00
Syed Ather Rizvi	bfd48925e5	Feature/csharp text splitter doc (#10571 ) - Description: Just docs related to csharp code splitter - Issue: It's related to a request made by @baskaryan in a comment on my previous PR #10350 - Dependencies: None - Twitter handle: @ather19 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 12:22:54 -07:00
maks-operlejn-ds	2aae1102b0	Instance anonymization (#10501 ) ### Description Add instance anonymization - if `John Doe` will appear twice in the text, it will be treated as the same entity. The difference between `PresidioAnonymizer` and `PresidioReversibleAnonymizer` is that only the second one has a built-in memory, so it will remember anonymization mapping for multiple texts: ``` >>> anonymizer = PresidioAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Brett Russell. Hi Brett Russell!' ``` ``` >>> anonymizer = PresidioReversibleAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' ``` ### Twitter handle @deepsense_ai / @MaksOpp ### Tag maintainer @baskaryan @hwchase17 @hinthornw --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 11:23:02 -07:00
Holt Skinner	9f73fec057	fix: Update Google Cloud Enterprise Search to Vertex AI Search (#10513 ) - Description: Google Cloud Enterprise Search was renamed to Vertex AI Search - https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-search-and-conversation-is-now-generally-available - This PR updates the documentation and Retriever class to use the new terminology. - Changed retriever class from `GoogleCloudEnterpriseSearchRetriever` to `GoogleVertexAISearchRetriever` - Updated documentation to specify that `extractive_segments` requires the new [Enterprise edition](https://cloud.google.com/generative-ai-app-builder/docs/about-advanced-features#enterprise-features) to be enabled. - Fixed spelling errors in documentation. - Change parameter for Retriever from `search_engine_id` to `data_store_id` - When this retriever was originally implemented, there was no distinction between a data store and search engine, but now these have been split. - Fixed an issue blocking some users where the api_endpoint can't be set	2023-10-05 10:47:47 -07:00
Mateusz Wosinski	656480feb6	Add language detection example (#10540 ) ### Description Adds language detection examples based on [langdetect](https://github.com/Mimino666/langdetect/tree/master/langdetect) and [fasttext](https://github.com/facebookresearch/fastText/) libraries. These frameworks can be especially useful together with components that require selection of the language (e.g. data-anonymizer) ### Twitter handle @deepsense_ai, @matt_wosinski	2023-10-05 10:39:08 -07:00
billytrend-cohere	2ff91a46c0	Add cohere /chat integration (#11389 ) Add cohere /chat integration and an iPython notebook to demonstrate the addition. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-05 09:20:47 -07:00
ElliotKetchup	53d4f1554a	Update aws.mdx (#11431 )	2023-10-05 09:07:16 -07:00
Lance Martin	211a74941a	Update QA doc w/ Runnables (#11401 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-10-05 08:07:38 -07:00
Nuno Campos	1e59c44d36	Nc/5oct/runnable release (#11428 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-10-05 14:27:50 +01:00
William FH	940b9ae30a	Normalize Option in Scoring Chain (#11412 )	2023-10-04 15:59:28 -07:00
bholagabbar	b9fad28f5e	Fix typing imports in extraction usecase (#11402 ) The person class here: https://python.langchain.com/docs/use_cases/extraction#pydantic-1 has attributes `dog_breed` and `dog_name` that use `Optional` from typing, but it hasn't been imported. Fixed the import here	2023-10-04 13:55:02 -07:00
Leonid Ganeline	22165cb2fc	merge pages into `google` and `AWS` pages (#11312 ) There are several pages in `integrations/providers/more` that belongs to Google and AWS `integrations/providers`. - moved content of these pages into the Google and AWS `integrations/providers` pages - removed these individual pages	2023-10-04 13:44:23 -07:00
Bagatur	91941d1f19	mv LCEL up in docs (#11395 )	2023-10-04 15:34:06 -04:00
Lester Solbakken	a30f98f534	Add Vespa vector store (#11329 ) Addition of Vespa vector store integration including notebook showing its use. Maintainer: @lesters Twitter handle: LesterSolbakken	2023-10-04 14:59:11 -04:00
Tomaz Bratanic	71290315cf	Add optional Cypher validation tool (#11078 ) LLMs have trouble with consistently getting the relationship direction accurately. That's why I organized a competition how to best and most simple to fix it based on the existing schema as a post-processing step. https://github.com/tomasonjo/cypher-direction-competition I am adding the winner's code in this PR: https://github.com/sakusaku-rich/cypher-direction-competition	2023-10-04 12:54:37 -04:00
Anatolii Kmetiuk	34a64101cc	Add explanations to GoogleDriveLoader how to avoid errors (#11335 ) - Description: add a paragraph to the GoogleDriveLoader doc on how to bypass errors on authentication. For some reason, specifying credential path via `credentials_path` constructor parameter when creating `GoogleDriveLoader` makes it so that the oAuth screen is never showing up when first using GoogleDriveLoader. Instead, the `RefreshError: ('invalid_grant: Bad Request', {'error': 'invalid_grant', 'error_description': 'Bad Request'})` error happens. Setting it via `os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = ...` solves the problem. Also, `token_path` constructor parameter is mandatory, otherwise another error happens when trying to `load()` for the first time. These errors are tricky and time-consuming to figure out, so I believe it's good to mention them in the docs. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-04 11:12:54 -04:00
MattiaSangermano	cdf5259ca9	Fixed import typo (#11278 ) Fixed small import typo in react_docstore documentation --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-10-04 10:18:10 -04:00
mziru	9e3c1d4463	add HTMLHeaderTextSplitter (#11039 ) Description: Similar in concept to the `MarkdownHeaderTextSplitter`, the `HTMLHeaderTextSplitter` is a "structure-aware" chunker that splits text at the element level and adds metadata for each header "relevant" to any given chunk. It can return chunks element by element or combine elements with the same metadata, with the objectives of (a) keeping related text grouped (more or less) semantically and (b) preserving context-rich information encoded in document structures. It can be used with other text splitters as part of a chunking pipeline. Dependency: lxml python package Maintainer: @hwchase17 Twitter handle: @MartinZirulnik --------- Co-authored-by: PresidioVantage <github@presidiovantage.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-04 09:24:25 -04:00
Isaac Chung	1165767df2	Clarifai integration doc improvements (#11251 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: Doc corrections and resolve notebook rendering issue on GH - Issue: N/A - Dependencies: N/A - Tag maintainer: @baskaryan - Twitter handle: `@isaacchung1217` --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-03 21:47:57 -04:00
Oleg Sinavski	1ca62b232b	Docs: improve similarity search examples (#11298 ) Description: Examples in the "Select by similarity" section were not really highlighting capabilities of similarity search. E.g. "# Input is a measurement, so should select the tall/short example" was still outputting the "mood" example. I tweaked the inputs a bit and fixed the examples (checking that those are indeed what the search outputs). Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-03 21:47:08 -04:00
LeeJongBeom	92683262f4	Fix documents for RetrievalQAWithSourcesChain (#11292 ) - Description: Fix typo about `RetrievalQAWithSourceChain` -> `RetrievalQAWithSourcesChain` <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-10-03 17:36:16 -07:00
Fynn Flügge	0a4baca291	chore: add kotlin code splitter (#11364 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: Adds Kotlin language to `TextSplitter` --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-10-03 18:35:36 -04:00
Ofer Mendelevitch	b93a08079e	Updates to Vectara Implementation (#11366 ) Replace this entire comment with: - Description: updates to documentation and API headers - Tag maintainer: @baskarya - Twitter handle: @ofermend	2023-10-03 18:34:39 -04:00
Vicente Reyes	f3e13e7e5a	Use term keyword according to the official python doc glossary (#11338 ) - Description: use term keyword according to the official python doc glossary, see https://docs.python.org/3/glossary.html - Issue: not applicable - Dependencies: not applicable - Tag maintainer: @hwchase17 - Twitter handle: vreyespue	2023-10-03 12:56:08 -07:00
Leonid Ganeline	39316314fa	`fallback` definition (#10504 ) I've added a definition to `fallback` and fixed couple misspells. It was not really clear what is the "fallback".	2023-10-03 12:38:59 -07:00

1 2 3 4 5 ...

2292 Commits