langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Lance Martin	4339d21cf1	Code LLaMA in code understanding use case (#9779 ) Update Code Understanding use case doc w/ Code-llama.	2023-08-25 14:24:38 -07:00
Lance Martin	2ab04a4e32	Update agent docs, move to use-case sub-directory (#9344 ) Re-structure and add new agent page	2023-08-25 11:28:55 -07:00
Lance Martin	985873c497	Update RAG use case (move to ntbk) (#9340 )	2023-08-25 11:27:27 -07:00
Harrison Chase	709a67d9bf	multivector notebook (#9740 )	2023-08-25 07:07:27 -07:00
Fabrizio Ruocco	cacaf487c3	Azure Cognitive Search - update sdk b8, mod user agent, search with scores (#9191 ) Description: Update Azure Cognitive Search SDK to version b8 (breaking change) Customizable User Agent. Implemented Similarity search with scores @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-25 02:34:09 -07:00
Margaret Qian	30151c99c7	Update Mosaic endpoint input/output api (#7391 ) As noted in prior PRs (https://github.com/hwchase17/langchain/pull/6060, https://github.com/hwchase17/langchain/pull/7348), the input/output format has changed a few times as we've stabilized our inference API. This PR updates the API to the latest stable version as indicated in our docs: https://docs.mosaicml.com/en/latest/inference.html The input format looks like this: `{"inputs": [<prompt>]} ` The output format looks like this: ` {"outputs": [<output_text>]} ` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-24 22:13:17 -07:00
Harrison Chase	ade482c17e	add twitter chat loader doc (#9737 )	2023-08-24 21:55:22 -07:00
Leonid Kuligin	87da56fb1e	Added a pdf parser based on DocAI (#9579 ) #9578 --------- Co-authored-by: Leonid Kuligin <kuligin@google.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-24 21:44:49 -07:00
Tudor Golubenco	dc30edf51c	Xata as a chat message memory store (#9719 ) This adds Xata as a memory store also to the python version of LangChain, similar to the [one for LangChain.js](https://github.com/hwchase17/langchainjs/pull/2217). I have added a Jupyter Notebook with a simple and a more complex example using an agent. To run the integration test, you need to execute something like: ``` XATA_API_KEY='xau_...' XATA_DB_URL="https://demo-uni3q8.eu-west-1.xata.sh/db/langchain" poetry run pytest tests/integration_tests/memory/test_xata.py ``` Where `langchain` is the database you create in Xata.	2023-08-24 17:37:46 -07:00
William FH	dff00ea91e	Chat Loaders (#9708 ) Still working out interface/notebooks + need discord data dump to test out things other than copy+paste Update: - Going to remove the 'user_id' arg in the loaders themselves and just standardize on putting the "sender" arg in the extra kwargs. Then can provide a utility function to map these to ai and human messages - Going to move the discord one into just a notebook since I don't have a good dump to test on and copy+paste maybe isn't the greatest thing to support in v0 - Need to do more testing on slack since it seems the dump only includes channels and NOT 1 on 1 convos - --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-08-24 17:23:27 -07:00
Tomaz Bratanic	dacf96895a	Add the option to use separate LLMs for GraphCypherQA chain (#9689 ) The Graph Chains differ from the retrievalQA chains in that they use two LLMChains instead of one. Therefore, sometimes you want to use different LLMs to generate the database query and to generate the final answer. This feature makes it more convenient to use different LLMs in the same chain. I have also renamed the Graph DB QA Chain to Neo4j DB QA Chain in the documentation only, as it is used only for Neo4j. The naming was ambiguous as it was the first graphQA chain added, and I wasn't sure how you wanted to spin it.	2023-08-24 11:50:38 -07:00
Lance Martin	c37be7f5fb	Add Code LLaMA to code QA use case (#9713 ) Use [Ollama integration](https://ollama.ai/blog/run-code-llama-locally).	2023-08-24 11:03:35 -07:00
Patrick Loeber	6bedfdf25a	Fix docs for AssemblyAIAudioTranscriptLoader (shorter import path) (#9687 ) Uses the shorter import path `from langchain.document_loaders import` instead of the full path `from langchain.document_loaders.assemblyai` Applies those changes to the docs and the unit test. See #9667 that adds this new loader.	2023-08-24 07:24:53 -07:00
了空	7cf5c582d2	Added a link to the dependencies document (#9703 )	2023-08-24 07:23:48 -07:00
Harrison Chase	9963b32e59	Harrison/multi vector (#9700 )	2023-08-24 06:42:42 -07:00
Leonid Ganeline	b048236c1a	📖 docs: `integrations/agent_toolkits` (#9333 ) Note: There are no changes in the file names! - The group name on the main navbar changed: `Agent toolkits` -> `Agents & Toolkits`. Examples here are the mix of the Agent and Toolkit examples because Agents and Toolkits in examples are always used together. - Titles changed: removed "Agent" and "Toolkit" suffixes. The reason is the same. - Formatting: mostly cleaning the header structure, so it could be better on the right-side navbar. Main navbar is looking much cleaner now.	2023-08-23 23:17:47 -07:00
Patrick Loeber	5990651070	Add new document_loader: AssemblyAIAudioTranscriptLoader (#9667 ) This PR adds a new document loader `AssemblyAIAudioTranscriptLoader` that allows to transcribe audio files with the [AssemblyAI API](https://www.assemblyai.com) and loads the transcribed text into documents. - Add new document_loader with class `AssemblyAIAudioTranscriptLoader` - Add optional dependency `assemblyai` - Add unit tests (using a Mock client) - Add docs notebook This is the equivalent to the JS integration already available in LangChain.js. See the [LangChain JS docs AssemblyAI page](https://js.langchain.com/docs/modules/data_connection/document_loaders/integrations/web_loaders/assemblyai_audio_transcription). At its simplest, you can use the loader to get a transcript back from an audio file like this: ```python from langchain.document_loaders.assemblyai import AssemblyAIAudioTranscriptLoader loader = AssemblyAIAudioTranscriptLoader(file_path="./testfile.mp3") docs = loader.load() ``` To use it, it needs the `assemblyai` python package installed, and the environment variable `ASSEMBLYAI_API_KEY` set with your API key. Alternatively, the API key can also be passed as an argument. Twitter handles to shout out if so kindly 🙇 [@AssemblyAI](https://twitter.com/AssemblyAI) and [@patloeber](https://twitter.com/patloeber) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-23 22:51:19 -07:00
seamusp	25f2c82ae8	docs:misc fixes (#9671 ) Improve internal consistency in LangChain documentation - Change occurrences of eg and eg. to e.g. - Fix headers containing unnecessary capital letters. - Change instances of "few shot" to "few-shot". - Add periods to end of sentences where missing. - Minor spelling and grammar fixes.	2023-08-23 22:36:54 -07:00
Eugene Yurtsev	b88dfcb42a	Add indexing support (#9614 ) This PR introduces a persistence layer to help with indexing workflows into vectorstores. The indexing code helps users to: 1. Avoid writing duplicated content into the vectorstore 2. Avoid over-writing content if it's unchanged Importantly, this keeps on working even if the content being written is derived via a set of transformations from some source content (e.g., indexing children documents that were derived from parent documents by chunking.) The two main components are: 1. Persistence layer that keeps track of which keys were updated and when. Keeping track of the timestamp of updates allows old content to be cleaned up safely and with minimal complexity. 2. HashedDocument which is used to hash the contents (including metadata) of the documents. We rely on the hashes for identifying duplicates. The indexing code works with ANY document loader. To add transformations to the documents, users for now can add a custom document loader that composes an existing loader together with document transformers. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-23 21:41:38 -04:00
Lakshay Kansal	a8c916955f	Updates to Nomic Atlas and GPT4All documentation (#9414 ) Description: Updates for Nomic AI Atlas and GPT4All integrations documentation. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-23 17:49:44 -07:00
Keras Conv3d	cbaea8d63b	tair fix distance_type error, and add hybrid search (#9531 ) - fix: distance_type error, - feature: Tair add hybrid search --------- Co-authored-by: thw <hanwen.thw@alibaba-inc.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-23 16:38:31 -07:00
Jacob Lee	278ef0bdcf	Adds ChatOllama (#9628 ) @rlancemartin --------- Co-authored-by: Adilkhan Sarsen <54854336+adolkhan@users.noreply.github.com> Co-authored-by: Kim Minjong <make.dirty.code@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Lance Martin <lance@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-23 13:02:26 -07:00
Bagatur	80dd162e0d	mv embedding cache docs (#9664 )	2023-08-23 11:46:04 -07:00
Bagatur	a40c12bb88	Update the nlpcloud connector after some changes on the NLP Cloud API (#9586 ) - Description: remove some text generation deprecated parameters and update the embeddings doc, - Tag maintainer: @rlancemartin	2023-08-23 11:35:08 -07:00
Bagatur	d8e2dd4c89	mv	2023-08-23 11:30:44 -07:00
Bagatur	e2e582f1f6	Fixed source key name for docugami loader (#8598 ) The Docugami loader was not returning the source metadata key. This was triggering this exception when used with retrievers, per https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/schema/prompt_template.py#L193C1-L195C41 The fix is simple and just updates the metadata key name for the document each chunk is sourced from, from "name" to "source" as expected. I tested by running the python notebook that has an end to end scenario in it. Tagging DataLoader maintainers @rlancemartin @eyurtsev	2023-08-23 11:24:55 -07:00
Zizhong Zhang	8a03836160	docs: fix PromptGuard docs (#9659 ) Fix PromptGuard docs. Noticed several trivial issues on the docs when integrating the new class. cc @baskaryan	2023-08-23 10:04:53 -07:00
Yong woo Song	f0ae10a20e	Fix typo in tigris (#9637 ) The link has a typo in [tigirs docs](https://python.langchain.com/docs/integrations/providers/tigris), so I couldn't access it. So, I have corrected it. Thanks! ☺️	2023-08-23 07:15:18 -07:00
Junlin Zhou	5b9bdcac1b	docs: fix link url (#9643 ) This pull request corrects the URL links in the Async API documentation to align with the updated project layout. The links had not been updated despite the changes in layout.	2023-08-23 07:05:02 -07:00
Joseph McElroy	2a06e7b216	ElasticsearchStore: improve error logging for adding documents (#9648 ) Not obvious what the error is when you cannot index. This pr adds the ability to log the first errors reason, to help the user diagnose the issue. Also added some more documentation for when you want to use the vectorstore with an embedding model deployed in elasticsearch. Credit: @elastic and @phoey1	2023-08-23 07:04:09 -07:00
Julien Salinas	f1072cc31f	Merge branch 'master' into master	2023-08-23 14:42:40 +02:00
Leonid Ganeline	e1f4f9ac3e	docs: `integrations/providers` (#9631 ) Added missed pages for `integrations/providers` from `vectorstores`. Updated several `vectorstores` notebooks.	2023-08-22 20:28:11 -07:00
anifort	900c1f3e8d	Add support for structured data sources with google enterprise search (#9037 ) - Description: Added the capability to handle structured data from google enterprise search, - Issue: Retriever failed when the underlying search engine was integrated with structured data, - Dependencies: google-api-core - Tag maintainer: @jarokaz - Twitter handle: anifort --------- Co-authored-by: Christos Aniftos <aniftos@google.com> Co-authored-by: Holt Skinner <13262395+holtskinner@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-22 23:18:10 -04:00
Jacob Lee	632a83c48e	Update ChatOpenAI docs with fine-tuning example (#9632 )	2023-08-22 16:56:53 -07:00
Adilkhan Sarsen	f29312eb84	Fixing deeplake.mdx file as it uses outdated links (#9602 ) deeplake.mdx was using old links and was not working properly; this PR fixes the issue.	2023-08-22 15:12:24 -07:00
klae01	b868ef23bc	Add AINetwork blockchain toolkit integration (#9527 ) # Description This PR introduces a new toolkit for interacting with the AINetwork blockchain. The toolkit provides a set of tools for performing various operations on the AINetwork blockchain, such as transferring AIN, reading and writing values to the blockchain database, managing apps, setting rules and owners. # Dependencies [ain-py](https://github.com/ainblockchain/ain-py) >= 1.0.2 # Misc The example notebook (langchain/docs/extras/integrations/toolkits/ainetwork.ipynb) is in the PR --------- Co-authored-by: kriii <kriii@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-22 08:03:33 -07:00
Vanessa Arndorfer	1ea2f9adf4	Document AzureML Deployment Example (#9571 ) Description: Link an example of deploying a Langchain app to an AzureML online endpoint to the deployments documentation page. Co-authored-by: Vanessa Arndorfer <vaarndor@microsoft.com>	2023-08-22 07:36:47 -07:00
toddkim95	fba29f203a	Add to support polars (#9610 ) ### Description Polars is a DataFrame interface on top of an OLAP Query Engine implemented in Rust. Polars is faster to read than pandas, so I'm looking forward to seeing it added to the document loader. ### Dependencies polars (https://pola-rs.github.io/polars-book/user-guide/) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-22 07:36:24 -07:00
Julien Salinas	4d0b7bb8e1	Remove Dolphin and GPT-J from the embeddings docs. These models are not proposed anymore.	2023-08-22 09:28:22 +02:00
Jacob Lee	0fea987dd2	Add missing param to parent document retriever notebook (#9569 )	2023-08-21 15:02:12 -07:00
Zizhong Zhang	00eff8c4a7	feat: Add PromptGuard integration (#9481 ) Add PromptGuard integration ------- There are two approaches to integrate PromptGuard with a LangChain application. 1. PromptGuardLLMWrapper 2. functions that can be used in LangChain expression. ----- - Dependencies `promptguard` python package, which is a runtime requirement if you'd try out the demo. - @baskaryan @hwchase17 Thanks for the ideas and suggestions along the development process. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 14:59:36 -07:00
Oleksandr Ichenskyi	8bc1a3dca8	docs: Add memgraph notebook (#9448 ) - Description: added graph_memgraph_qa.ipynb which shows how to use LLMs to provide a natural language interface to a Memgraph database using [MemgraphGraph](https://github.com/langchain-ai/langchain/pull/8591) class. - Dependencies: given that the notebook utilizes the MemgraphGraph class, it relies on both this class and several Python packages that are installed in the notebook using pip (langchain, openai, neo4j, gqlalchemy). The notebook is dependent on having a functional Memgraph instance running, as it requires this instance to establish a connection.	2023-08-21 13:45:04 -07:00
Matthew Zeiler	949b2cf177	Improvements to the Clarifai integration (#9290 ) - Improved docs - Improved performance in multiple ways through batching, threading, etc. - fixed error message - Added support for metadata filtering during similarity search. @baskaryan PTAL	2023-08-21 12:53:36 -07:00
ricki-epsilla	66a47d9a61	add Epsilla vectorstore (#9239 ) [Epsilla](https://github.com/epsilla-cloud/vectordb) vectordb is an open-source vector database that leverages the advanced academic parallel graph traversal techniques for vector indexing. This PR adds basic integration with [pyepsilla](https://github.com/epsilla-cloud/epsilla-python-client)(Epsilla vectordb python client) as a vectorstore. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 12:51:15 -07:00
axiangcoding	05aa02005b	feat(llms): support ERNIE Embedding-V1 (#9370 ) - Description: support [ERNIE Embedding-V1](https://cloud.baidu.com/doc/WENXINWORKSHOP/s/alj562vvu), which is part of ERNIE ecology - Issue: None - Dependencies: None - Tag maintainer: @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 07:52:25 -07:00
José Ferraz Neto	f116e10d53	Add SharePoint Loader (#4284 ) - Added a loader (`SharePointLoader`) that can pull documents (`pdf`, `docx`, `doc`) from the [SharePoint Document Library](https://support.microsoft.com/en-us/office/what-is-a-document-library-3b5976dd-65cf-4c9e-bf5a-713c10ca2872). - Added a Base Loader (`O365BaseLoader`) to be used for all Loaders that use [O365](https://github.com/O365/python-o365) Package - Code refactoring on `OneDriveLoader` to use the new `O365BaseLoader`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 07:49:07 -07:00
Utku Ege Tuluk	bb4f7936f9	feat(llms): add streaming support to textgen (#9295 ) - Description: Added streaming support to the textgen component in the llms module. - Dependencies: websocket-client = "^1.6.1"	2023-08-21 07:39:14 -07:00
Leonid Ganeline	fdbeb52756	`Qwen` model example (#9516 ) added an example for `Qwen-7B` model on `HuggingFaceHub` 🤗	2023-08-20 17:21:45 -07:00
Martin Schade	0c8a88b3fa	AmazonTextractPDFLoader documentation updates (#9415 ) Description: Updating documentation to add AmazonTextractPDFLoader according to [comment](https://github.com/langchain-ai/langchain/pull/8661#issuecomment-1666572992) from [baskaryan](https://github.com/baskaryan) Adding one notebook and instructions to the modules/data_connection/document_loaders/pdf.mdx --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-08-20 16:40:15 -07:00
Asif Ahmad	08feed3332	Changed the NIBittensorLLM API URL to the correct one (#9419 ) Changed https://api.neuralinterent.ai/ to https://api.neuralinternet.ai/ which is the valid URL for the API of NIBittensorLLM.	2023-08-20 16:25:19 -07:00
EpixMan	103094286e	Fixing class calling error in the documentation of connecting_to_a_feature_store.ipynb (#9508 )	2023-08-20 15:59:40 -07:00
IlyaKIS1	fd8fe209cb	Added In-Depth Langchain Agent Execution Guide (#9507 ) Made the notion document of how Langchain executes agents method by method in the codebase. Can be helpful for developers that just started working with the Langchain codebase.	2023-08-20 15:59:01 -07:00
Rosário P. Fernandes	09a92bb9bf	chatbots use case - fix broken Colab URL (#9491 ) The current Colab URL returns a 404, since there is no `chatbots` directory under `use_cases`.	2023-08-19 14:53:54 -07:00
bsenst	a956b69720	fix typo in huggingface_hub.ipynb (#9499 )	2023-08-19 14:50:05 -07:00
Bagatur	d87cfd33e8	Update pydantic compatibility guide (#9496 )	2023-08-19 14:44:19 -07:00
Taqi Jaffri	069c0a041f	comment update for poetry install	2023-08-19 13:50:16 -07:00
Taqi Jaffri	5cd244e9b7	CR feedback	2023-08-19 13:48:15 -07:00
Ikko Eltociear Ashimine	0808949e54	Fix typo in apis.ipynb (#9490 ) funtions -> functions	2023-08-19 09:26:08 -04:00
RajneeshSinghShorthillsAI	129d056085	fixed spelling mistake and added missing bracket in parent_document_r… (#9380 ) …etriever.ipynb Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-18 21:36:56 -07:00
Matt Robinson	83d2a871eb	fix: apply unstructured preprocess functions (#9473 ) ### Summary Fixes a bug from #7850 where post processing functions in Unstructured loaders were not applied. Adds an assertion to the test to verify the post processing function was applied and also updates the explanation in the example notebook.	2023-08-18 18:54:28 -07:00
NavanitDubeyShorthillsAI	b58d492e05	Update pydantic_compatibility.md (#9382 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-18 13:03:15 -07:00
bsenst	083726ecda	fix small typo (#9464 )	2023-08-18 11:55:46 -07:00
Leonid Ganeline	99e5eaa9b1	`InternLM` example (#9465 ) Added `InternLM` model example to the HuggingFace Hub notebook	2023-08-18 11:17:17 -07:00
William FH	d4f790fd40	Fix imports in notebook (#9458 )	2023-08-18 10:08:47 -07:00
AmitSinghShorthillsAI	2b06792c81	Fixing spelling mistakes in fallbacks.ipynb (#9376 ) Fix spelling errors in the text: 'Therefore' and 'Retrying'. I want to stress that your feedback is invaluable to us and is genuinely cherished. With gratitude, @baskaryan @hwchase17	2023-08-18 10:33:47 -04:00
PuneetDhimanShorthillsAI	61e4a06447	Corrected Sentence in router.ipynb (#9377 ) Added missing question marks in the lines in the router.ipynb @baskaryan @hwchase17	2023-08-18 10:32:17 -04:00
Leonid Ganeline	edcb03943e	👀 docs: updated `dependents` (#9426 ) Updated statistics (the previous statistics were taken 1+ month ago). A lot of new dependents and more stars.	2023-08-18 10:15:39 -04:00
Holmodi	89a8121eaa	Fix a dead loop bug caused by assigning two variables with opposite values. (#9447 ) - Description: Fix a dead loop bug caused by assigning two variables with opposite values.	2023-08-18 10:12:53 -04:00
Bagatur	5d60ced7b3	pydantic compatibility guide fix (#9418 )	2023-08-17 12:33:20 -07:00
Bagatur	0c4683ebcc	Revert "Update compatibility guide for pydantic (#9396 )" (#9417 )	2023-08-17 12:14:32 -07:00
Eugene Yurtsev	b11c233304	Update compatibility guide for pydantic (#9396 ) Use langchain.pydantic_v1 instead of pydantic_v1	2023-08-17 12:09:18 -07:00
Leonid Kuligin	019aa04b06	fixed a pal chain reference (#9387 ) #9386 Co-authored-by: Leonid Kuligin <kuligin@google.com>	2023-08-17 13:02:49 -04:00
Sanskar Tanwar	c194828be0	Fixed Typo in Fallbacks.ipynb (#9373 ) Removed extra "the" in the sentence about the chicken crossing the road in fallbacks.ipynb. The sentence now reads correctly: "Why did the chicken cross the road?" This resolves the grammatical error and improves the overall quality of the content. @baskaryan , @hinthornw , @hwchase17	2023-08-17 02:06:49 -07:00
AashutoshPathakShorthillsAI	c71afb46d1	Corrected Sentence in .ipynb File (#9372 ) Fixed grammatical errors in the sentence by repositioning the word "are" for improved clarity and readability. @baskaryan @hwchase17 @hinthornw	2023-08-17 02:06:43 -07:00
Akshay Tripathi	de8dfde7f7	Corrected Grammatical errors in tutorials.mdx (#9358 ) I want to extend my heartfelt gratitude to the creator for masterfully crafting this remarkable application. 🙌 I am truly impressed by the meticulous attention to grammar and spelling in the documentation, which undoubtedly contributes to a polished and seamless reader experience. As always, your feedback holds immense value and is greatly appreciated. @baskaryan , @hwchase17	2023-08-17 01:55:21 -07:00
Md Nazish Arman	e842131425	Fixed Grammatical errors in tutorials.mdx (#9359 ) I want to convey my deep appreciation to the creator for their expert craftsmanship in developing this exceptional application. 👏 The remarkable dedication to upholding impeccable grammar and spelling in the documentation significantly enhances the polished and seamless experience for readers. I want to stress that your feedback is invaluable to us and is genuinely cherished. With gratitude, @baskaryan, @hwchase17	2023-08-17 01:55:11 -07:00
AnujMauryaShorthillsAI	6dedd94ba4	Update "Langchain" to "LangChain" in the tutorials.mdx file (#9361 ) In this commit, I have made a modification to the term "Langchain" to correctly reflect the project's name as "LangChain". This change ensures consistency and accuracy throughout the codebase and documentation. @baskaryan , @hwchase17	2023-08-17 01:54:57 -07:00
Adarsh Shrivastav	c5e23293f8	Corrected Typo in MultiPromptChain Example in router.ipynb (#9362 ) Refined the example in router.ipynb by addressing a minor typographical error. The typo "rins" has been corrected to "rains" in the code snippet that demonstrates the usage of the MultiPromptChain. This change ensures accuracy and consistency in the provided code example. This improvement enhances the readability and correctness of the notebook, making it easier for users to understand and follow the demonstration. The commit aims to maintain the quality and accuracy of the content within the repository. Thank you for your attention to detail, and please review the change at your convenience. @baskaryan , @hwchase17	2023-08-17 01:54:43 -07:00
Angel Luis	2e8733cf54	Fix typo in huggingface_textgen_inference.ipynb (#9313 ) Replaced incorrect `stream` parameter by `streaming` on Integrations docs.	2023-08-16 16:22:21 -07:00
Lance Martin	b04e472acf	Open source LLM guide (#9266 ) Guide for using open source LLMs locally.	2023-08-16 16:18:31 -07:00
Eugene Yurtsev	0f9f213833	Pydantic Compatibility (#9327 ) Pydantic Compatibility Guidelines for migration plan + debugging	2023-08-16 15:55:53 -07:00
Chandler May	15f1af8ed6	Fix variable case in code snippet in docs (#9311 ) - Description: Fix a minor variable naming inconsistency in a code snippet in the docs - Issue: N/A - Dependencies: none - Tag maintainer: N/A - Twitter handle: N/A	2023-08-16 13:34:46 -07:00
axiangcoding	63601551b1	fix(llms): improve the ernie chat model (#9289 ) - Description: improve the ernie chat model. - fix missing kwargs to payload - new test cases - add some debug level log - improve description - Issue: None - Dependencies: None - Tag maintainer: @baskaryan	2023-08-16 00:48:42 -07:00
Daniel Chalef	1d55141c50	zep/new ZepVectorStore (#9159 ) - new ZepVectorStore class - ZepVectorStore unit tests - ZepVectorStore demo notebook - update zep-python to ~1.0.2 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-16 00:23:07 -07:00
Bagatur	afba2be3dc	update openai functions docs (#9278 )	2023-08-15 17:00:56 -07:00
Bagatur	9abf60acb6	Bagatur/vectara regression (#9276 ) Co-authored-by: Ofer Mendelevitch <ofer@vectara.com> Co-authored-by: Ofer Mendelevitch <ofermend@gmail.com>	2023-08-15 16:19:46 -07:00
Xiaoyu Xee	b30f449dae	Add dashvector vectorstore (#9163 ) ## Description Add `Dashvector` vectorstore for langchain - [dashvector quick start](https://help.aliyun.com/document_detail/2510223.html) - [dashvector package description](https://pypi.org/project/dashvector/) ## How to use ```python from langchain.vectorstores.dashvector import DashVector dashvector = DashVector.from_documents(docs, embeddings) ``` --------- Co-authored-by: smallrain.xuxy <smallrain.xuxy@alibaba-inc.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-15 16:19:30 -07:00
Bagatur	bfbb97b74c	Bagatur/deeplake docs fixes (#9275 ) Co-authored-by: adilkhan <adilkhan.sarsen@nu.edu.kz>	2023-08-15 15:56:36 -07:00
Kunj-2206	1b3942ba74	Added BittensorLLM (#9250 ) Description: Adding NIBittensorLLM via Validator Endpoint to langchain llms Tag maintainer: @Kunj-2206 Maintainer responsibilities: Models / Prompts: @hwchase17, @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-15 15:40:52 -07:00
Toshish Jawale	852722ea45	Improvements in Nebula LLM (#9226 ) - Description: Added improvements in Nebula LLM to perform auto-retry; more generation parameters supported. Conversation is no longer required to be passed in the LLM object. Examples are updated. - Issue: N/A - Dependencies: N/A - Tag maintainer: @baskaryan - Twitter handle: symbldotai --------- Co-authored-by: toshishjawale <toshish@symbl.ai>	2023-08-15 15:33:07 -07:00
Bagatur	1aae77f26f	fix context nb (#9267 )	2023-08-15 12:53:37 -07:00
Alex Gamble	cf17c58b47	Update documentation for the Context integration with new URL and features (#9259 ) Update documentation and URLs for the Langchain Context integration. We've moved from getcontext.ai to context.ai \o/ Thanks in advance for the review!	2023-08-15 11:38:34 -07:00
Joseph McElroy	5e9687a196	Elasticsearch self-query retriever (#9248 ) Now with ElasticsearchStore VectorStore merged, i've added support for the self-query retriever. I've added a notebook also to demonstrate capability. I've also added unit tests. Credit @elastic and @phoey1 on twitter.	2023-08-15 10:53:43 -04:00
Anthony Mahanna	0a04e63811	docs: Update ArangoDB Links (#9251 ) ready for review - mdx link update - colab link update	2023-08-15 07:43:47 -07:00
Hech	4b505060bd	fix: max_marginal_relevance_search and docs in Dingo (#9244 )	2023-08-15 01:06:06 -07:00
axiangcoding	664ff28cba	feat(llms): support ernie chat (#9114 ) Description: support ernie (文心一言) chat model Related issue: #7990 Dependencies: None Tag maintainer: @baskaryan	2023-08-15 01:05:46 -07:00
fanyou-wbd	5e43768f61	docs: update LlamaCpp max_tokens args (#9238 ) This PR updates documentations only, `max_length` should be `max_tokens` according to latest LlamaCpp API doc: https://api.python.langchain.com/en/latest/llms/langchain.llms.llamacpp.LlamaCpp.html	2023-08-15 00:50:20 -07:00
Joshua Sundance Bailey	ef0664728e	ArcGISLoader update (#9240 ) Small bug fixes and added metadata based on user feedback. This PR is from the author of https://github.com/langchain-ai/langchain/pull/8873 .	2023-08-14 23:44:29 -07:00
Joseph McElroy	eac4ddb4bb	Elasticsearch Store Improvements (#8636 ) Todo: - [x] Connection options (cloud, localhost url, es_connection) support - [x] Logging support - [x] Customisable field support - [x] Distance Similarity support - [x] Metadata support - [x] Metadata Filter support - [x] Retrieval Strategies - [x] Approx - [x] Approx with Hybrid - [x] Exact - [x] Custom - [x] ELSER (excluding hybrid as we are working on RRF support) - [x] integration tests - [x] Documentation 👋 this is a contribution to improve Elasticsearch integration with Langchain. Its based loosely on the changes that are in master but with some notable changes: ## Package name & design improvements The import name is now `ElasticsearchStore`, to aid discoverability of the VectorStore. ```py ## Before from langchain.vectorstores.elastic_vector_search import ElasticVectorSearch, ElasticKnnSearch ## Now from langchain.vectorstores.elasticsearch import ElasticsearchStore ``` ## Retrieval Strategy support Before we had a number of classes, depending on the strategy you wanted. `ElasticKnnSearch` for approx, `ElasticVectorSearch` for exact / brute force. With `ElasticsearchStore` we have retrieval strategies: ### Approx Example Default strategy for the vast majority of developers who use Elasticsearch will be inferring the embeddings from outside of Elasticsearch. Uses KNN functionality of _search. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index" ) output = docsearch.similarity_search("foo", k=1) ``` ### Approx, with hybrid Developers who want to search, using both the embedding and the text bm25 match. Its simple to enable. 
```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ApproxRetrievalStrategy(hybrid=True) ) output = docsearch.similarity_search("foo", k=1) ``` ### Approx, with `query_model_id` Developers who want to infer within Elasticsearch, using the model loaded in the ml node. This relies on the developer to setup the pipeline and index if they wish to embed the text in Elasticsearch. Example of this in the test. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ApproxRetrievalStrategy( query_model_id="sentence-transformers__all-minilm-l6-v2" ), ) output = docsearch.similarity_search("foo", k=1) ``` ### I want to provide my own custom Elasticsearch Query You might want to have more control over the query, to perform multi-phase retrieval such as LTR, linearly boosting on document parameters like recently updated or geo-distance. You can do this with `custom_query_fn` ```py def my_custom_query(query_body: dict, query: str) -> dict: return {"query": {"match": {"text": {"query": "bar"}}}} texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), **elasticsearch_connection, index_name=index_name ) docsearch.similarity_search("foo", k=1, custom_query=my_custom_query) ``` ### Exact Example Developers who have a small dataset in Elasticsearch, dont want the cost of indexing the dims vs tradeoff on cost at query time. Uses script_score. 
```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ExactRetrievalStrategy(), ) output = docsearch.similarity_search("foo", k=1) ``` ### ELSER Example Elastic provides its own sparse vector model called ELSER. With these changes, its really easy to use. The vector store creates a pipeline and index thats setup for ELSER. All the developer needs to do is configure, ingest and query via langchain tooling. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.SparseVectorStrategy(), ) output = docsearch.similarity_search("foo", k=1) ``` ## Architecture In future, we can introduce new strategies and allow us to not break bwc as we evolve the index / query strategy. ## Credit On release, could you credit @elastic and @phoey1 please? Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 23:42:35 -07:00
Harrison Chase	71d5b7c9bf	Harrison/fallbacks (#9233 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 18:27:38 -07:00
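
Note on commit b88dfcb42a (Add indexing support): the message describes hashing each document's contents, including metadata, and recording when each key was last written so that duplicates are skipped and old content can be cleaned up. The following is only a minimal sketch of that idea, not LangChain's implementation. The names `Doc`, `content_hash`, `index_docs`, and `stale_keys` are hypothetical, and an in-memory dict stands in for the persistence layer.

```python
import hashlib
import json
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical stand-in for a document: text plus metadata."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def content_hash(doc: Doc) -> str:
    """Hash the text and the metadata together (keys sorted for stability)."""
    payload = json.dumps(
        {"page_content": doc.page_content, "metadata": doc.metadata},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def index_docs(docs, seen: dict, now: float) -> list:
    """Return only the docs whose hash is not yet recorded in `seen`.

    `seen` maps hash -> timestamp of the last run that saw the document;
    every document passed in gets its timestamp refreshed to `now`.
    """
    to_write = []
    for doc in docs:
        key = content_hash(doc)
        if key not in seen:
            to_write.append(doc)
        seen[key] = now
    return to_write


def stale_keys(seen: dict, cutoff: float) -> list:
    """Keys last seen before `cutoff`; candidates for cleanup."""
    return [key for key, ts in seen.items() if ts < cutoff]
```

In this sketch, two documents with the same text but different metadata hash differently, so both are written. Re-indexing an unchanged document writes nothing and only refreshes its timestamp.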


614 Commits