langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
Yong woo Song	f0ae10a20e	Fix typo in tigris (#9637 ) The link has a typo in [tigirs docs](https://python.langchain.com/docs/integrations/providers/tigris), so I couldn't access it. So, I have corrected it. Thanks! ☺️	2023-08-23 07:15:18 -07:00
Junlin Zhou	5b9bdcac1b	docs: fix link url (#9643 ) This pull request corrects the URL links in the Async API documentation to align with the updated project layout. The links had not been updated despite the changes in layout.	2023-08-23 07:05:02 -07:00
Aashish Saini	eb92da84a1	Fixings grammatical errors in Doc Files (#9647 ) Fixing some typos and grammatical error is doc file. @eyurtsev , @baskaryan Thanks --------- Co-authored-by: Aayush <142384656+AayushShorthillsAI@users.noreply.github.com> Co-authored-by: Ishita Chauhan <136303787+IshitaChauhanShortHillsAI@users.noreply.github.com>	2023-08-23 07:04:29 -07:00
Joseph McElroy	2a06e7b216	ElasticsearchStore: improve error logging for adding documents (#9648 ) Not obvious what the error is when you cannot index. This pr adds the ability to log the first errors reason, to help the user diagnose the issue. Also added some more documentation for when you want to use the vectorstore with an embedding model deployed in elasticsearch. Credit: @elastic and @phoey1	2023-08-23 07:04:09 -07:00
Julien Salinas	f1072cc31f	Merge branch 'master' into master	2023-08-23 14:42:40 +02:00
Leonid Ganeline	e1f4f9ac3e	docs: `integrations/providers` (#9631 ) Added missed pages for `integrations/providers` from `vectorstores`. Updated several `vectorstores` notebooks.	2023-08-22 20:28:11 -07:00
anifort	900c1f3e8d	Add support for structured data sources with google enterprise search (#9037 ) <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: Added the capability to handles structured data from google enterprise search, - Issue: Retriever failed when underline search engine was integrated with structured data, - Dependencies: google-api-core - Tag maintainer: @jarokaz - Twitter handle: anifort Please make sure you're PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @baskaryan - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @baskaryan - Memory: @hwchase17 - Agents / Tools / Toolkits: @hinthornw - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --> --------- Co-authored-by: Christos Aniftos <aniftos@google.com> Co-authored-by: Holt Skinner <13262395+holtskinner@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-22 23:18:10 -04:00
Jacob Lee	632a83c48e	Update ChatOpenAI docs with fine-tuning example (#9632 )	2023-08-22 16:56:53 -07:00
Adilkhan Sarsen	f29312eb84	Fixing deeplake.mdx file as it uses outdates links (#9602 ) deeplake.mdx was using old links and was not working properly, in the PR we fix the issue.	2023-08-22 15:12:24 -07:00
klae01	b868ef23bc	Add AINetwork blockchain toolkit integration (#9527 ) # Description This PR introduces a new toolkit for interacting with the AINetwork blockchain. The toolkit provides a set of tools for performing various operations on the AINetwork blockchain, such as transferring AIN, reading and writing values to the blockchain database, managing apps, setting rules and owners. # Dependencies [ain-py](https://github.com/ainblockchain/ain-py) >= 1.0.2 # Misc The example notebook (langchain/docs/extras/integrations/toolkits/ainetwork.ipynb) is in the PR --------- Co-authored-by: kriii <kriii@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-22 08:03:33 -07:00
Vanessa Arndorfer	1ea2f9adf4	Document AzureML Deployment Example (#9571 ) Description: Link an example of deploying a Langchain app to an AzureML online endpoint to the deployments documentation page. Co-authored-by: Vanessa Arndorfer <vaarndor@microsoft.com>	2023-08-22 07:36:47 -07:00
toddkim95	fba29f203a	Add to support polars (#9610 ) ### Description Polars is a DataFrame interface on top of an OLAP Query Engine implemented in Rust. Polars is faster to read than pandas, so I'm looking forward to seeing it added to the document loader. ### Dependencies polars (https://pola-rs.github.io/polars-book/user-guide/) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-22 07:36:24 -07:00
Julien Salinas	4d0b7bb8e1	Remove Dolphin and GPT-J from the embeddings docs. These models are not proposed anymore.	2023-08-22 09:28:22 +02:00
Jeremy Suriel	0fa4516ce4	Fix typo (#9565 ) Corrected a minor documentation typo here: https://python.langchain.com/docs/modules/model_io/models/llms/#generate-batch-calls-richer-outputs	2023-08-21 15:54:38 -07:00
Jacob Lee	0fea987dd2	Add missing param to parent document retriever notebook (#9569 )	2023-08-21 15:02:12 -07:00
Zizhong Zhang	00eff8c4a7	feat: Add PromptGuard integration (#9481 ) Add PromptGuard integration ------- There are two approaches to integrate PromptGuard with a LangChain application. 1. PromptGuardLLMWrapper 2. functions that can be used in LangChain expression. ----- - Dependencies `promptguard` python package, which is a runtime requirement if you'd try out the demo. - @baskaryan @hwchase17 Thanks for the ideas and suggestions along the development process. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 14:59:36 -07:00
Oleksandr Ichenskyi	8bc1a3dca8	docs: Add memgraph notebook (#9448 ) - Description: added graph_memgraph_qa.ipynb which shows how to use LLMs to provide a natural language interface to a Memgraph database using [MemgraphGraph](https://github.com/langchain-ai/langchain/pull/8591) class. - Dependencies: given that the notebook utilizes the MemgraphGraph class, it relies on both this class and several Python packages that are installed in the notebook using pip (langchain, openai, neo4j, gqlalchemy). The notebook is dependent on having a functional Memgraph instance running, as it requires this instance to establish a connection.	2023-08-21 13:45:04 -07:00
Bagatur	d09cdb4880	update data connection -> retrieval (#9561 )	2023-08-21 13:03:29 -07:00
Matthew Zeiler	949b2cf177	Improvements to the Clarifai integration (#9290 ) - Improved docs - Improved performance in multiple ways through batching, threading, etc. - fixed error message - Added support for metadata filtering during similarity search. @baskaryan PTAL	2023-08-21 12:53:36 -07:00
ricki-epsilla	66a47d9a61	add Epsilla vectorstore (#9239 ) [Epsilla](https://github.com/epsilla-cloud/vectordb) vectordb is an open-source vector database that leverages the advanced academic parallel graph traversal techniques for vector indexing. This PR adds basic integration with [pyepsilla](https://github.com/epsilla-cloud/epsilla-python-client)(Epsilla vectordb python client) as a vectorstore. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 12:51:15 -07:00
Bagatur	4999e8af7e	pin pydantic api ref build (#9556 )	2023-08-21 12:11:49 -07:00
axiangcoding	05aa02005b	feat(llms): support ERNIE Embedding-V1 (#9370 ) - Description: support [ERNIE Embedding-V1](https://cloud.baidu.com/doc/WENXINWORKSHOP/s/alj562vvu), which is part of ERNIE ecology - Issue: None - Dependencies: None - Tag maintainer: @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 07:52:25 -07:00
José Ferraz Neto	f116e10d53	Add SharePoint Loader (#4284 ) - Added a loader (`SharePointLoader`) that can pull documents (`pdf`, `docx`, `doc`) from the [SharePoint Document Library](https://support.microsoft.com/en-us/office/what-is-a-document-library-3b5976dd-65cf-4c9e-bf5a-713c10ca2872). - Added a Base Loader (`O365BaseLoader`) to be used for all Loaders that use [O365](https://github.com/O365/python-o365) Package - Code refactoring on `OneDriveLoader` to use the new `O365BaseLoader`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-21 07:49:07 -07:00
Utku Ege Tuluk	bb4f7936f9	feat(llms): add streaming support to textgen (#9295 ) - Description: Added streaming support to the textgen component in the llms module. - Dependencies: websocket-client = "^1.6.1"	2023-08-21 07:39:14 -07:00
Harrison Chase	9930ddc555	beef up retrieval docs (#9518 )	2023-08-21 07:22:22 -07:00
Leonid Ganeline	fdbeb52756	`Qwen` model example (#9516 ) added an example for `Qwen-7B` model on `HugginfFaceHub` 🤗	2023-08-20 17:21:45 -07:00
Martin Schade	0c8a88b3fa	AmazonTextractPDFLoader documentation updates (#9415 ) Description: Updating documentation to add AmazonTextractPDFLoader according to [comment](https://github.com/langchain-ai/langchain/pull/8661#issuecomment-1666572992) from [baskaryan](https://github.com/baskaryan) Adding one notebook and instructions to the modules/data_connection/document_loaders/pdf.mdx --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-08-20 16:40:15 -07:00
Asif Ahmad	08feed3332	Changed the NIBittensorLLM API URL to the correct one (#9419 ) Changed https://api.neuralinterent.ai/ to https://api.neuralinternet.ai/ which is the valid URL for the API of NIBittensorLLM.	2023-08-20 16:25:19 -07:00
EpixMan	103094286e	Fixing class calling error in the documentation of connecting_to_a_feature_store.ipynb (#9508 )	2023-08-20 15:59:40 -07:00
IlyaKIS1	fd8fe209cb	Added In-Depth Langchain Agent Execution Guide (#9507 ) Made the notion document of how Langchain executes agents method by method in the codebase. Can be helpful for developers that just started working with the Langchain codebase.	2023-08-20 15:59:01 -07:00
Rosário P. Fernandes	09a92bb9bf	chatbots use case - fix broken collab URL (#9491 ) The current Collab URL returns a 404, since there is no `chatbots` directory under `use_cases`. <!-- Thank you for contributing to LangChain! If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17, @rlancemartin. -->	2023-08-19 14:53:54 -07:00
bsenst	a956b69720	fix typo in huggingface_hub.ipynb (#9499 )	2023-08-19 14:50:05 -07:00
Bagatur	d87cfd33e8	Update pydantic compatibility guide (#9496 )	2023-08-19 14:44:19 -07:00
Taqi Jaffri	069c0a041f	comment update for poetry install	2023-08-19 13:50:16 -07:00
Taqi Jaffri	5cd244e9b7	CR feedback	2023-08-19 13:48:15 -07:00
Ikko Eltociear Ashimine	0808949e54	Fix typo in apis.ipynb (#9490 ) funtions -> functions	2023-08-19 09:26:08 -04:00
RajneeshSinghShorthillsAI	129d056085	fixed spelling mistake and added missing bracket in parent_document_r… (#9380 ) …etriever.ipynb Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-18 21:36:56 -07:00
Matt Robinson	83d2a871eb	fix: apply unstructured preprocess functions (#9473 ) ### Summary Fixes a bug from #7850 where post processing functions in Unstructured loaders were not apply. Adds a assertion to the test to verify the post processing function was applied and also updates the explanation in the example notebook.	2023-08-18 18:54:28 -07:00
NavanitDubeyShorthillsAI	b58d492e05	Update pydantic_compatibility.md (#9382 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-18 13:03:15 -07:00
bsenst	083726ecda	fix small typo (#9464 )	2023-08-18 11:55:46 -07:00
Leonid Ganeline	99e5eaa9b1	`InternLM` example (#9465 ) Added `InternML` model example to the HubbingFace Hub notebook	2023-08-18 11:17:17 -07:00
William FH	d4f790fd40	Fix imports in notebook (#9458 )	2023-08-18 10:08:47 -07:00
AmitSinghShorthillsAI	2b06792c81	Fixing spelling mistakes in fallbacks.ipynb (#9376 ) Fix spelling errors in the text: 'Therefore' and 'Retrying I want to stress that your feedback is invaluable to us and is genuinely cherished. With gratitude, @baskaryan @hwchase17	2023-08-18 10:33:47 -04:00
PuneetDhimanShorthillsAI	61e4a06447	Corrected Sentence in router.ipynb (#9377 ) Added missing question marks in the lines in the router.ipynb @baskaryan @hwchase17	2023-08-18 10:32:17 -04:00
呂安	ead04487fd	doc: make install from source more clearer (#9433 ) Description: if just `pip install -e .` it will not install anything, we have to find the right directory to do `pip install -e .`	2023-08-18 10:30:55 -04:00
Leonid Ganeline	edcb03943e	👀 docs: updated `dependents` (#9426 ) Updated statistics (the previous statistics was taken 1+month ago). A lot of new dependents and more starts.	2023-08-18 10:15:39 -04:00
Holmodi	89a8121eaa	Fix a dead loop bug caused by assigning two variables with opposite values. (#9447 ) - Description: Fix a dead loop bug caused by assigning two variables with opposite values.	2023-08-18 10:12:53 -04:00
Lance Martin	589927e9e1	Update figure in OSS model guide (#9399 )	2023-08-17 15:09:21 -07:00
Bagatur	5d60ced7b3	pydantic compatibility guide fix (#9418 )	2023-08-17 12:33:20 -07:00
Bagatur	0c4683ebcc	Revert "Update compatibility guide for pydantic (#9396 )" (#9417 )	2023-08-17 12:14:32 -07:00
Eugene Yurtsev	b11c233304	Update compatibility guide for pydantic (#9396 ) Use langchain.pydantic_v1 instead of pydantic_v1	2023-08-17 12:09:18 -07:00
Leonid Kuligin	019aa04b06	fixed a pal chain reference (#9387 ) #9386 Co-authored-by: Leonid Kuligin <kuligin@google.com>	2023-08-17 13:02:49 -04:00
Sanskar Tanwar	c194828be0	Fixed Typo in Fallbacks.ipynb (#9373 ) Removed extra "the" in the sentence about the chicken crossing the road in fallbacks.ipynb. The sentence now reads correctly: "Why did the chicken cross the road?" This resolves the grammatical error and improves the overall quality of the content. @baskaryan , @hinthornw , @hwchase17	2023-08-17 02:06:49 -07:00
AashutoshPathakShorthillsAI	c71afb46d1	Corrected Sentence in .ipynb File (#9372 ) Fixed grammatical errors in the sentence by repositioning the word "are" for improved clarity and readability. @baskaryan @hwchase17 @hinthornw	2023-08-17 02:06:43 -07:00
Akshay Tripathi	de8dfde7f7	Corrected Grammatical errors in tutorials.mdx (#9358 ) I want to extend my heartfelt gratitude to the creator for masterfully crafting this remarkable application. 🙌 I am truly impressed by the meticulous attention to grammar and spelling in the documentation, which undoubtedly contributes to a polished and seamless reader experience. As always, your feedback holds immense value and is greatly appreciated. @baskaryan , @hwchase17	2023-08-17 01:55:21 -07:00
Md Nazish Arman	e842131425	Fixed Grammatical errors in tutorials.mdx (#9359 ) I want to convey my deep appreciation to the creator for their expert craftsmanship in developing this exceptional application. 👏 The remarkable dedication to upholding impeccable grammar and spelling in the documentation significantly enhances the polished and seamless experience for readers. I want to stress that your feedback is invaluable to us and is genuinely cherished. With gratitude, @baskaryan, @hwchase17	2023-08-17 01:55:11 -07:00
AnujMauryaShorthillsAI	6dedd94ba4	Update "Langchain" to "LangChain" in the tutorials.mdx file (#9361 ) In this commit, I have made a modification to the term "Langchain" to correctly reflect the project's name as "LangChain". This change ensures consistency and accuracy throughout the codebase and documentation. @baskaryan , @hwchase17	2023-08-17 01:54:57 -07:00
Adarsh Shrivastav	c5e23293f8	Corrected Typo in MultiPromptChain Example in router.ipynb (#9362 ) Refined the example in router.ipynb by addressing a minor typographical error. The typo "rins" has been corrected to "rains" in the code snippet that demonstrates the usage of the MultiPromptChain. This change ensures accuracy and consistency in the provided code example. This improvement enhances the readability and correctness of the notebook, making it easier for users to understand and follow the demonstration. The commit aims to maintain the quality and accuracy of the content within the repository. Thank you for your attention to detail, and please review the change at your convenience. @baskaryan , @hwchase17	2023-08-17 01:54:43 -07:00
AbhishekYadavShorthillsAI	90d7c55343	Fix Typo in "community.md" (#9360 ) Corrected a typographical error in the "community.md" file by removing an extra word from the sentence. @baskaryan , @hwchase17	2023-08-17 01:54:13 -07:00
Angel Luis	2e8733cf54	Fix typo in huggingface_textgen_inference.ipynb (#9313 ) Replaced incorrect `stream` parameter by `streaming` on Integrations docs.	2023-08-16 16:22:21 -07:00
Lance Martin	b04e472acf	Open source LLM guide (#9266 ) Guide for using open source LLMs locally.	2023-08-16 16:18:31 -07:00
Eugene Yurtsev	090411842e	Fix API reference docs (#9321 ) Do not document members nested within any private component	2023-08-16 15:56:54 -07:00
Eugene Yurtsev	0f9f213833	Pydantic Compatibility (#9327 ) Pydantic Compatibility Guidelines for migration plan + debugging	2023-08-16 15:55:53 -07:00
Chandler May	15f1af8ed6	Fix variable case in code snippet in docs (#9311 ) - Description: Fix a minor variable naming inconsistency in a code snippet in the docs - Issue: N/A - Dependencies: none - Tag maintainer: N/A - Twitter handle: N/A	2023-08-16 13:34:46 -07:00
Michael Bianco	23928a3311	docs: remove multiple code blocks from comma-separated docs (#9323 )	2023-08-16 11:51:58 -07:00
Navanit Dubey	3e6cea46e2	Guide import readable json (#9291 )	2023-08-16 00:49:01 -07:00
axiangcoding	63601551b1	fix(llms): improve the ernie chat model (#9289 ) - Description: improve the ernie chat model. - fix missing kwargs to payload - new test cases - add some debug level log - improve description - Issue: None - Dependencies: None - Tag maintainer: @baskaryan	2023-08-16 00:48:42 -07:00
Daniel Chalef	1d55141c50	zep/new ZepVectorStore (#9159 ) - new ZepVectorStore class - ZepVectorStore unit tests - ZepVectorStore demo notebook - update zep-python to ~1.0.2 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-16 00:23:07 -07:00
Bagatur	b9ca5cc5ea	update guide import (#9279 )	2023-08-15 17:01:06 -07:00
Bagatur	afba2be3dc	update openai functions docs (#9278 )	2023-08-15 17:00:56 -07:00
Bagatur	9abf60acb6	Bagatur/vectara regression (#9276 ) Co-authored-by: Ofer Mendelevitch <ofer@vectara.com> Co-authored-by: Ofer Mendelevitch <ofermend@gmail.com>	2023-08-15 16:19:46 -07:00
Xiaoyu Xee	b30f449dae	Add dashvector vectorstore (#9163 ) ## Description Add `Dashvector` vectorstore for langchain - [dashvector quick start](https://help.aliyun.com/document_detail/2510223.html) - [dashvector package description](https://pypi.org/project/dashvector/) ## How to use ```python from langchain.vectorstores.dashvector import DashVector dashvector = DashVector.from_documents(docs, embeddings) ``` --------- Co-authored-by: smallrain.xuxy <smallrain.xuxy@alibaba-inc.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-15 16:19:30 -07:00
Bagatur	bfbb97b74c	Bagatur/deeplake docs fixes (#9275 ) Co-authored-by: adilkhan <adilkhan.sarsen@nu.edu.kz>	2023-08-15 15:56:36 -07:00
Kunj-2206	1b3942ba74	Added BittensorLLM (#9250 ) Description: Adding NIBittensorLLM via Validator Endpoint to langchain llms Tag maintainer: @Kunj-2206 Maintainer responsibilities: Models / Prompts: @hwchase17, @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-15 15:40:52 -07:00
Toshish Jawale	852722ea45	Improvements in Nebula LLM (#9226 ) - Description: Added improvements in Nebula LLM to perform auto-retry; more generation parameters supported. Conversation is no longer required to be passed in the LLM object. Examples are updated. - Issue: N/A - Dependencies: N/A - Tag maintainer: @baskaryan - Twitter handle: symbldotai --------- Co-authored-by: toshishjawale <toshish@symbl.ai>	2023-08-15 15:33:07 -07:00
Bagatur	1aae77f26f	fix context nb (#9267 )	2023-08-15 12:53:37 -07:00
Alex Gamble	cf17c58b47	Update documentation for the Context integration with new URL and features (#9259 ) Update documentation and URLs for the Langchain Context integration. We've moved from getcontext.ai to context.ai \o/ Thanks in advance for the review!	2023-08-15 11:38:34 -07:00
Joseph McElroy	5e9687a196	Elasticsearch self-query retriever (#9248 ) Now with ElasticsearchStore VectorStore merged, i've added support for the self-query retriever. I've added a notebook also to demonstrate capability. I've also added unit tests. Credit @elastic and @phoey1 on twitter.	2023-08-15 10:53:43 -04:00
Anthony Mahanna	0a04e63811	docs: Update ArangoDB Links (#9251 ) ready for review - mdx link update - colab link update	2023-08-15 07:43:47 -07:00
Hech	4b505060bd	fix: max_marginal_relevance_search and docs in Dingo (#9244 )	2023-08-15 01:06:06 -07:00
axiangcoding	664ff28cba	feat(llms): support ernie chat (#9114 ) Description: support ernie (文心一言) chat model Related issue: #7990 Dependencies: None Tag maintainer: @baskaryan	2023-08-15 01:05:46 -07:00
fanyou-wbd	5e43768f61	docs: update LlamaCpp max_tokens args (#9238 ) This PR updates documentations only, `max_length` should be `max_tokens` according to latest LlamaCpp API doc: https://api.python.langchain.com/en/latest/llms/langchain.llms.llamacpp.LlamaCpp.html	2023-08-15 00:50:20 -07:00
Bagatur	a8aa1aba1c	nit (#9243 )	2023-08-15 00:49:12 -07:00
Bagatur	68d8f73698	consolidate redirects (#9242 )	2023-08-15 00:48:23 -07:00
Joshua Sundance Bailey	ef0664728e	ArcGISLoader update (#9240 ) Small bug fixes and added metadata based on user feedback. This PR is from the author of https://github.com/langchain-ai/langchain/pull/8873 .	2023-08-14 23:44:29 -07:00
Joseph McElroy	eac4ddb4bb	Elasticsearch Store Improvements (#8636 ) Todo: - [x] Connection options (cloud, localhost url, es_connection) support - [x] Logging support - [x] Customisable field support - [x] Distance Similarity support - [x] Metadata support - [x] Metadata Filter support - [x] Retrieval Strategies - [x] Approx - [x] Approx with Hybrid - [x] Exact - [x] Custom - [x] ELSER (excluding hybrid as we are working on RRF support) - [x] integration tests - [x] Documentation 👋 this is a contribution to improve Elasticsearch integration with Langchain. Its based loosely on the changes that are in master but with some notable changes: ## Package name & design improvements The import name is now `ElasticsearchStore`, to aid discoverability of the VectorStore. ```py ## Before from langchain.vectorstores.elastic_vector_search import ElasticVectorSearch, ElasticKnnSearch ## Now from langchain.vectorstores.elasticsearch import ElasticsearchStore ``` ## Retrieval Strategy support Before we had a number of classes, depending on the strategy you wanted. `ElasticKnnSearch` for approx, `ElasticVectorSearch` for exact / brute force. With `ElasticsearchStore` we have retrieval strategies: ### Approx Example Default strategy for the vast majority of developers who use Elasticsearch will be inferring the embeddings from outside of Elasticsearch. Uses KNN functionality of _search. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index" ) output = docsearch.similarity_search("foo", k=1) ``` ### Approx, with hybrid Developers who want to search, using both the embedding and the text bm25 match. Its simple to enable. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ApproxRetrievalStrategy(hybrid=True) ) output = docsearch.similarity_search("foo", k=1) ``` ### Approx, with `query_model_id` Developers who want to infer within Elasticsearch, using the model loaded in the ml node. This relies on the developer to setup the pipeline and index if they wish to embed the text in Elasticsearch. Example of this in the test. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ApproxRetrievalStrategy( query_model_id="sentence-transformers__all-minilm-l6-v2" ), ) output = docsearch.similarity_search("foo", k=1) ``` ### I want to provide my own custom Elasticsearch Query You might want to have more control over the query, to perform multi-phase retrieval such as LTR, linearly boosting on document parameters like recently updated or geo-distance. You can do this with `custom_query_fn` ```py def my_custom_query(query_body: dict, query: str) -> dict: return {"query": {"match": {"text": {"query": "bar"}}}} texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), **elasticsearch_connection, index_name=index_name ) docsearch.similarity_search("foo", k=1, custom_query=my_custom_query) ``` ### Exact Example Developers who have a small dataset in Elasticsearch, dont want the cost of indexing the dims vs tradeoff on cost at query time. Uses script_score. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.ExactRetrievalStrategy(), ) output = docsearch.similarity_search("foo", k=1) ``` ### ELSER Example Elastic provides its own sparse vector model called ELSER. With these changes, its really easy to use. The vector store creates a pipeline and index thats setup for ELSER. All the developer needs to do is configure, ingest and query via langchain tooling. ```py texts = ["foo", "bar", "baz"] docsearch = ElasticsearchStore.from_texts( texts, FakeEmbeddings(), es_url="http://localhost:9200", index_name="sample-index", strategy=ElasticsearchStore.SparseVectorStrategy(), ) output = docsearch.similarity_search("foo", k=1) ``` ## Architecture In future, we can introduce new strategies and allow us to not break bwc as we evolve the index / query strategy. ## Credit On release, could you credit @elastic and @phoey1 please? Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 23:42:35 -07:00
Harrison Chase	71d5b7c9bf	Harrison/fallbacks (#9233 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 18:27:38 -07:00
Lance Martin	41279a3ae1	Move self-check use case to "more" section (#9137 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 18:27:28 -07:00
Lance Martin	22858d99b5	Move code-writing use case to "more" section (#9134 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 18:27:19 -07:00
Bagatur	249d7d06a2	adapter doc nit (#9234 )	2023-08-14 18:26:37 -07:00
Lance Martin	969e1683de	Move graph use case to "more" section (#8997 ) Clean `use_cases` by moving the `GraphDB` to `integrations`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 17:20:38 -07:00
Lance Martin	d0a0d560ad	Minor formatting on Web Research Use Case (#9221 )	2023-08-14 16:29:36 -07:00
Lance Martin	17ae2998e7	Update Ollama docs (#9220 ) Based on discussion w/ team.	2023-08-14 13:56:16 -07:00
Krish Dholakia	49f1d8477c	Adding ChatLiteLLM model (#9020 ) Description: Adding a langchain integration for the LiteLLM library Tag maintainer: @hwchase17, @baskaryan Twitter handle: @krrish_dh / @Berri_AI --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-14 07:43:40 -07:00
Emmanuel Gautier	f11e5442d6	docs: update LlamaCpp input args (#9173 ) This PR only updates the LlamaCpp args documentation. The input arg has been flattened.	2023-08-14 07:42:03 -07:00
Massimiliano Pronesti	d95eeaedbe	feat(llms): support vLLM's OpenAI-compatible server (#9179 ) This PR aims at supporting [vLLM's OpenAI-compatible server feature](https://vllm.readthedocs.io/en/latest/getting_started/quickstart.html#openai-compatible-server), i.e. allowing to call vLLM's LLMs like if they were OpenAI's. I've also udpated the related notebook providing an example usage. At the moment, vLLM only supports the `Completion` API.	2023-08-13 23:03:05 -07:00
Michael Goin	621da3c164	Adds DeepSparse as an LLM (#9184 ) Adds [DeepSparse](https://github.com/neuralmagic/deepsparse) as an LLM backend. DeepSparse supports running various open-source sparsified models hosted on [SparseZoo](https://sparsezoo.neuralmagic.com/) for performance gains on CPUs. Twitter handles: @mgoin_ @neuralmagic --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-13 22:35:58 -07:00
Bagatur	0fa69d8988	Bagatur/zep python 1.0 (#9186 ) Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com>	2023-08-13 21:52:53 -07:00
Harrison Chase	8d69dacdf3	multiple retreival in parralel (#9174 )	2023-08-13 10:03:54 -07:00
Eugene Yurtsev	aca8cb5fba	API Reference: Do not document private modules (#9042 ) This PR prevents documentation of private modules in the API reference	2023-08-11 15:58:14 -07:00
UmerHA	8aab39e3ce	Added SmartGPT workflow (issue #4463 ) (#4816 ) # Added SmartGPT workflow by providing SmartLLM wrapper around LLMs Edit: As @hwchase17 suggested, this should be a chain, not an LLM. I have adapted the PR. It is used like this: ``` from langchain.prompts import PromptTemplate from langchain.chains import SmartLLMChain from langchain.chat_models import ChatOpenAI hard_question = "I have a 12 liter jug and a 6 liter jug. I want to measure 6 liters. How do I do it?" hard_question_prompt = PromptTemplate.from_template(hard_question) llm = ChatOpenAI(model_name="gpt-4") prompt = PromptTemplate.from_template(hard_question) chain = SmartLLMChain(llm=llm, prompt=prompt, verbose=True) chain.run({}) ``` Original text: Added SmartLLM wrapper around LLMs to allow for SmartGPT workflow (as in https://youtu.be/wVzuvf9D9BU). SmartLLM can be used wherever LLM can be used. E.g: ``` smart_llm = SmartLLM(llm=OpenAI()) smart_llm("What would be a good company name for a company that makes colorful socks?") ``` or ``` smart_llm = SmartLLM(llm=OpenAI()) prompt = PromptTemplate( input_variables=["product"], template="What is a good name for a company that makes {product}?", ) chain = LLMChain(llm=smart_llm, prompt=prompt) chain.run("colorful socks") ``` SmartGPT consists of 3 steps: 1. Ideate - generate n possible solutions ("ideas") to user prompt 2. Critique - find flaws in every idea & select best one 3. Resolve - improve upon best idea & return it Fixes #4463 ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: - @hwchase17 - @agola11 Twitter: [@UmerHAdil](https://twitter.com/@UmerHAdil) \| Discord: RicChilligerDude#7589 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 15:44:27 -07:00
Bagatur	45741bcc1b	Bagatur/vectara nit (#9140 ) Co-authored-by: Ofer Mendelevitch <ofer@vectara.com>	2023-08-11 15:32:03 -07:00
Dominick DEV	9b64932e55	Add LangChain utility for real-time crypto exchange prices (#4501 ) This commit adds the LangChain utility which allows for the real-time retrieval of cryptocurrency exchange prices. With LangChain, users can easily access up-to-date pricing information by running the command ".run(from_currency, to_currency)". This new feature provides a convenient way to stay informed on the latest exchange rates and make informed decisions when trading crypto. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 14:45:06 -07:00
Joshua Sundance Bailey	eaa505fb09	Create ArcGISLoader & example notebook (#8873 ) - Description: Adds the ArcGISLoader class to `langchain.document_loaders` - Allows users to load data from ArcGIS Online, Portal, and similar - Users can authenticate with `arcgis.gis.GIS` or retrieve public data anonymously - Uses the `arcgis.features.FeatureLayer` class to retrieve the data - Defines the most relevant keywords arguments and accepts `**kwargs` - Dependencies: Using this class requires `arcgis` and, optionally, `bs4.BeautifulSoup`. Tagging maintainers: - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 14:33:40 -07:00
Aashish Saini	0aabded97f	Updating interactive walkthrough link in index.md to resolve 404 error (#9063 ) Updated interactive walkthrough link in index.md to resolve 404 error. Also, expressing deep gratitude to LangChain library developers for their exceptional efforts 🥇 . --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 13:08:56 -07:00
Hai The Dude	e4418d1b7e	Added new use case docs for Web Scraping, Chromium loader, BS4 transformer (#8732 ) - Description: Added a new use case category called "Web Scraping", and a tutorial to scrape websites using OpenAI Functions Extraction chain to the docs. - Tag maintainer:@baskaryan @hwchase17 , - Twitter handle: https://www.linkedin.com/in/haiphunghiem/ (I'm on LinkedIn mostly) --------- Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 11:46:59 -07:00
niklub	16af5f8690	Add LabelStudio integration (#8880 ) This PR introduces [Label Studio](https://labelstud.io/) integration with LangChain via `LabelStudioCallbackHandler`: - sending data to the Label Studio instance - labeling dataset for supervised LLM finetuning - rating model responses - tracking and displaying chat history - support for custom data labeling workflow ### Example ``` chat_llm = ChatOpenAI(callbacks=[LabelStudioCallbackHandler(mode="chat")]) chat_llm([ SystemMessage(content="Always use emojis in your responses."), HumanMessage(content="Hey AI, how's your day going?"), AIMessage(content="🤖 I don't have feelings, but I'm running smoothly! How can I help you today?"), HumanMessage(content="I'm feeling a bit down. Any advice?"), AIMessage(content="🤗 I'm sorry to hear that. Remember, it's okay to seek help or talk to someone if you need to. 💬"), HumanMessage(content="Can you tell me a joke to lighten the mood?"), AIMessage(content="Of course! 🎭 Why did the scarecrow win an award? Because he was outstanding in his field! 🌾"), HumanMessage(content="Haha, that was a good one! Thanks for cheering me up."), AIMessage(content="Always here to help! 😊 If you need anything else, just let me know."), HumanMessage(content="Will do! By the way, can you recommend a good movie?"), ]) ``` <img width="906" alt="image" src="https://github.com/langchain-ai/langchain/assets/6087484/0a1cf559-0bd3-4250-ad96-6e71dbb1d2f3"> ### Dependencies - [label-studio](https://pypi.org/project/label-studio/) - [label-studio-sdk](https://pypi.org/project/label-studio-sdk/) https://twitter.com/labelstudiohq --------- Co-authored-by: nik <nik@heartex.net>	2023-08-11 11:24:10 -07:00
Bagatur	8cb2594562	Bagatur/dingo (#9079 ) Co-authored-by: gary <1625721671@qq.com>	2023-08-11 10:54:45 -07:00
Manuel Soria	31cfc00845	Code understanding use case (#8801 ) Code understanding docs --------- Co-authored-by: Manuel Soria <manuel.soria@greyscaleai.com> Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 10:16:05 -07:00
Alvaro Bartolome	f7ae183f40	`ArgillaCallbackHandler` to properly use default values for `api_url` and `api_key` (#9113 ) As of the recent PR at #9043, after some testing we've realised that the default values were not being used for `api_key` and `api_url`. Besides that, the default for `api_key` was set to `argilla.apikey`, but since the default values are intended for people using the Argilla Quickstart (easy to run and setup), the defaults should be instead `owner.apikey` if using Argilla 1.11.0 or higher, or `admin.apikey` if using a lower version of Argilla. Additionally, we've removed the f-string replacements from the docstrings. --------- Co-authored-by: Gabriel Martin <gabriel@argilla.io>	2023-08-11 09:37:06 -07:00
Bagatur	0e5d09d0da	dalle nb fix (#9125 )	2023-08-11 08:21:48 -07:00
Francisco Ingham	9249d305af	tagging docs refactor (#8722 ) refactor of tagging use case according to new format --------- Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 08:06:07 -07:00
Aayush Shah	a429145420	Minor grammatical error (#9102 ) Have corrected a grammatical error in: https://python.langchain.com/docs/modules/model_io/models/llms/ document 😄	2023-08-11 01:01:40 -07:00
Ashutosh Sanzgiri	991b448dfc	minor edits (#9093 ) Description: Minor edit to PR#845 Thanks!	2023-08-10 23:40:36 -07:00
Chenyu Zhao	c0acbdca1b	Update Fireworks model names (#9085 )	2023-08-10 19:23:42 -07:00
Charles Lanahan	a2588d6c57	Update openai embeddings notebook with correct embedding model in section 2 (#5831 ) In second section it looks like a copy/paste from the first section and doesn't include the specific embedding model mentioned in the example so I added it for clarity. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 19:02:10 -07:00
Josh Phillips	5fc07fa524	change id column type to uuid to match function (#7456 ) The table creation process in these examples commands do not match what the recently updated functions in these example commands is looking for. This change updates the type in the table creation command. Issue Number for my report of the doc problem #7446 @rlancemartin and @eyurtsev I believe this is your area Twitter: @j1philli Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 16:57:19 -07:00
Bidhan Roy	02430e25b6	BagelDB (bageldb.ai), VectorStore integration. (#8971 ) - Description: [BagelDB](bageldb.ai) a collaborative vector database. Integrated the bageldb PyPi package with langchain with related tests and code. - Issue: Not applicable. - Dependencies: `betabageldb` PyPi package. - Tag maintainer: @rlancemartin, @eyurtsev, @baskaryan - Twitter handle: bageldb_ai (https://twitter.com/BagelDB_ai) We ran `make format`, `make lint` and `make test` locally. Followed the contribution guideline thoroughly https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --------- Co-authored-by: Towhid1 <nurulaktertowhid@gmail.com>	2023-08-10 16:48:36 -07:00
Harrison Chase	bb6fbf4c71	openai adapters (#8988 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Nuno Campos <nuno@boringbits.io>	2023-08-10 16:08:50 -07:00
Piyush Jain	8eea46ed0e	Bedrock embeddings async methods (#9024 ) ## Description This PR adds the `aembed_query` and `aembed_documents` async methods for improving the embeddings generation for large documents. The implementation uses asyncio tasks and gather to achieve concurrency as there is no bedrock async API in boto3. ### Maintainers @agola11 @aarora79 ### Open questions To avoid throttling from the Bedrock API, should there be an option to limit the concurrency of the calls?	2023-08-10 14:21:03 -07:00
Nicolas	e3fb11bc10	docs: (Mendable Search) Fixes stuck when tabbing out issue (#9074 ) This fixes Mendable not completing when tabbing out and fixes the duplicate message issue as well.	2023-08-10 13:46:06 -07:00
Bagatur	1edead28b8	Add docs community page (#8992 ) Co-authored-by: briannawolfson <brianna.wolfson@gmail.com>	2023-08-10 13:41:35 -07:00
Eugene Yurtsev	a5a4c53280	RedisStore: Update init and Documentation updates (#9044 ) * Update Redis Store to support init from parameters * Update notebook to show how to use redis store, and some fixes in documentation	2023-08-10 15:30:29 -04:00
Bagatur	f3f5853e9f	update api ref exampels (#9065 ) manually update for now	2023-08-10 11:28:24 -07:00
Blake (Yung Cher Ho)	8d351bfc20	Takeoff integration (#9045 ) ## Description: This PR adds the Titan Takeoff Server to the available LLMs in LangChain. Titan Takeoff is an inference server created by [TitanML](https://www.titanml.co/) that allows you to deploy large language models locally on your hardware in a single command. Most generative model architectures are included, such as Falcon, Llama 2, GPT2, T5 and many more. Read more about Titan Takeoff here: - [Blog](https://medium.com/@TitanML/introducing-titan-takeoff-6c30e55a8e1e) - [Docs](https://docs.titanml.co/docs/titan-takeoff/getting-started) #### Testing As Titan Takeoff runs locally on port 8000 by default, no network access is needed. Responses are mocked for testing. - [x] Make Lint - [x] Make Format - [x] Make Test #### Dependencies No new dependencies are introduced. However, users will need to install the titan-iris package in their local environment and start the Titan Takeoff inferencing server in order to use the Titan Takeoff integration. Thanks for your help and please let me know if you have any questions. cc: @hwchase17 @baskaryan	2023-08-10 10:56:06 -07:00
Aashish Saini	8a320e55a0	Corrected grammatical errors and spelling mistakes in the index.mdx file. (#9026 ) Expressing gratitude to the creator for crafting this remarkable application. 🙌, Would like to Enhance grammar and spelling in the documentation for a polished reader experience. Your feedback is valuable as always @baskaryan , @hwchase17 , @eyurtsev	2023-08-10 10:17:09 -07:00
Eugene Yurtsev	5e05ba2140	Add embeddings cache (#8976 ) This PR adds the ability to temporarily cache or persistently store embeddings. A notebook has been included showing how to set up the cache and how to use it with a vectorstore.	2023-08-10 11:15:30 -04:00
Lance Martin	2380492c8e	API use case (#8546 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 07:52:54 -07:00
Luca Foppiano	dfb93dd2b5	Improved grobid documentation (#9025 ) - Description: Improvement in the Grobid loader documentation, typos and suggesting to use the docker image instead of installing Grobid in local (the documentation was also limited to Mac, while docker allow running in any platform) - Tag maintainer: @rlancemartin, @eyurtsev - Twitter handle: @whitenoise	2023-08-10 10:47:22 -04:00
Hiroshige Umino	2c7297d243	Fix a broken code block display (#9034 ) - Description: Fix a broken code block in this page: https://python.langchain.com/docs/modules/model_io/prompts/prompt_templates/ - Issue: N/A - Dependencies: None - Tag maintainer: @baskaryan - Twitter handle: yaotti	2023-08-10 10:39:01 -04:00
Piyush Jain	3b51817706	Updating port and ssl use in sample notebook (#8995 ) ## Description This PR updates the sample notebook to use the default port (8182) and the ssl for the Neptune database connection.	2023-08-09 17:08:48 -07:00
Michael Shen	c2f46b2cdb	Fixed wrong paper reference (#8970 ) The ReAct reference references to MRKL paper. Corrected so that it points to the actual ReAct paper #8964.	2023-08-09 16:17:46 -04:00
Jerzy Czopek	539672a7fd	Feature/fix azureopenai model mappings (#8621 ) This pull request aims to ensure that the `OpenAICallbackHandler` can properly calculate the total cost for Azure OpenAI chat models. The following changes have resolved this issue: - The `model_name` has been added to the ChatResult llm_output. Without this, the default values of `gpt-35-turbo` were applied. This was causing the total cost for Azure OpenAI's GPT-4 to be significantly inaccurate. - A new parameter `model_version` has been added to `AzureChatOpenAI`. Azure does not include the model version in the response. With the addition of `model_name`, this is not a significant issue for GPT-4 models, but it's an issue for GPT-3.5-Turbo. Version 0301 (default) of GPT-3.5-Turbo on Azure has a flat rate of 0.002 per 1k tokens for both prompt and completion. However, version 0613 introduced a split in pricing for prompt and completion tokens. - The `OpenAICallbackHandler` implementation has been updated with the proper model names, versions, and cost per 1k tokens. Unit tests have been added to ensure the functionality works as expected; the Azure ChatOpenAI notebook has been updated with examples. Maintainers: @hwchase17, @baskaryan Twitter handle: @jjczopek --------- Co-authored-by: Jerzy Czopek <jerzy.czopek@avanade.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-09 10:56:15 -07:00
Harrison Chase	7de6a1b78e	parent document retriever (#8941 )	2023-08-08 22:39:08 -07:00
Taqi Jaffri	5919c0f4a2	notebook cleanup	2023-08-08 21:38:55 -07:00
Taqi Jaffri	bcdf3be530	Merge branch 'master' into tjaffri/docugami_loader_source	2023-08-08 20:59:13 -07:00
arjunbansal	a2681f950d	add instructions on integrating Log10 (#8938 ) - Description: Instruction for integration with Log10: an [open source](https://github.com/log10-io/log10) proxiless LLM data management and application development platform that lets you log, debug and tag your Langchain calls - Tag maintainer: @baskaryan - Twitter handle: @log10io @coffeephoenix Several examples showing the integration included [here](https://github.com/log10-io/log10/tree/main/examples/logging) and in the PR	2023-08-08 19:15:31 -07:00
Aarav Borthakur	3f64b8a761	Integrate Rockset as a chat history store (#8940 ) Description: Adds Rockset as a chat history store Dependencies: no changes Tag maintainer: @hwchase17 This PR passes linting and testing. I added a test for the integration and an example notebook showing its use.	2023-08-08 18:54:07 -07:00
Bagatur	0a1be1d501	document lcel fallbacks (#8942 )	2023-08-08 18:49:33 -07:00
Molly Cantillon	99b5a7226c	Weaviate: adding auth example + fixing spelling in ReadME (#8939 ) Added basic auth example to Weaviate notebook @baskaryan	2023-08-08 16:24:17 -07:00
Joe Reuter	8f0cd91d57	Airbyte based loaders (#8586 ) This PR adds 8 new loaders: * `AirbyteCDKLoader` This reader can wrap and run all python-based Airbyte source connectors. * Separate loaders for the most commonly used APIs: * `AirbyteGongLoader` * `AirbyteHubspotLoader` * `AirbyteSalesforceLoader` * `AirbyteShopifyLoader` * `AirbyteStripeLoader` * `AirbyteTypeformLoader` * `AirbyteZendeskSupportLoader` ## Documentation and getting started I added the basic shape of the config to the notebooks. This increases the maintenance effort a bit, but I think it's worth it to make sure people can get started quickly with these important connectors. This is also why I linked the spec and the documentation page in the readme as these two contain all the information to configure a source correctly (e.g. it won't suggest using oauth if that's avoidable even if the connector supports it). ## Document generation The "documents" produced by these loaders won't have a text part (instead, all the record fields are put into the metadata). If a text is required by the use case, the caller needs to do custom transformation suitable for their use case. ## Incremental sync All loaders support incremental syncs if the underlying streams support it. By storing the `last_state` from the reader instance away and passing it in when loading, it will only load updated records. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-08 14:49:25 -07:00
Harrison Chase	7543a3d70e	Harrison/image (#845 ) Co-authored-by: Ashutosh Sanzgiri <sanzgiri@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-08 13:58:27 -07:00
Leonid Ganeline	33a2f58fbf	`tensoflow_datasets` document loader (#8721 ) This PR adds `tensoflow_datasets` document loader	2023-08-08 15:19:28 -04:00
Leonid Ganeline	2d078c7767	`PubMed` document loader (#8893 ) - added `PubMed Document Loader` artifacts; ut-s; examples - fixed `PubMed utility`; ut-s @hwchase17	2023-08-08 14:26:03 -04:00
Jeremy W	c5c0735fc4	Remove Evaluation from Modules page (#8926 ) Remove Evaluation link (which gives 404 now) from Modules page, since it lives under Guides page now	2023-08-08 10:20:24 -07:00
Seif	6327eecdaf	Fix typo in Vectara docs (#8925 ) Fixed a typo in the Vectara docs description.	2023-08-08 10:11:07 -07:00
Chris Pappalardo	beab637f04	added filter kwarg to VectorStoreIndexWrapper query and query_with_so… (#8844 ) - Description: added filter to query methods in VectorStoreIndexWrapper for filtering by metadata (i.e. search_kwargs) - Tag maintainer: @rlancemartin, @eyurtsev Updated the doc snippet on this topic as well. It took me a long while to figure out how to filter the vectorstore by filename, so this might help someone else out. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-08 10:10:45 -07:00
Apurv Agarwal	4a63533216	addition to docs at 'Store and reference chat history' (#8910 ) - Description: I have added an example showing how to pass a custom template to ConversationRetrievalChain. Instead of CONDENSE_QUESTION_PROMPT we can pass any prompt in the argument condense_question_prompt. Look in Use cases -> QA over Documents -> How to -> Store and reference chat history, - Issue: #8864, - Dependencies: NA, - Tag maintainer: @hinthornw, - Twitter handle: --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-08 10:10:11 -07:00
David vonThenen	bf4a112aa6	Fixes to the Nebula LLM Integration (#8918 ) This addresses some issues with introducing the Nebula LLM to LangChain in this PR: https://github.com/langchain-ai/langchain/pull/8876 This fixes the following: - Removes `SYMBLAI` from variable names - Fixes bug with `Bearer` for the API KEY Thanks again in advance for your help! cc: @hwchase17, @baskaryan --------- Co-authored-by: dvonthenen <david.vonthenen@gmail.com>	2023-08-08 10:04:43 -07:00
Jacob Lee	d1e305028f	Automatically set docs appearance to system default (#8924 ) @baskaryan	2023-08-08 09:54:18 -07:00

1 2 3 4 5 ...

1895 Commits