langchain

Commit Graph

Author	SHA1	Message	Date
Harrison Chase	166cda2cc6	Harrison/deeplake (#1316 ) Co-authored-by: Davit Buniatyan <d@activeloop.ai>	1 year ago
Harrison Chase	aaad6cc954	Harrison/atlas db (#1315 ) Co-authored-by: Brandon Duderstadt <brandonduderstadt@gmail.com>	1 year ago
Marc Puig	3989c793fd	Making it possible to use "certainty" as a parameter for the weaviate similarity_search (#1218 ) Checking if weaviate similarity_search kwargs contains "certainty" and use it accordingly. The minimal level of certainty must be a float, and it is computed by normalized distance.	1 year ago
Harrison Chase	0c84ce1082	Harrison/add documents (#1197 ) Co-authored-by: OmriNach <32659330+OmriNach@users.noreply.github.com>	1 year ago
Anton Troynikov	d2ef5d6167	Default Chroma collection name (#1198 ) For persistence, it's convenient to have a default collection name which gets used everywhere.	1 year ago
Naveen Tatikonda	0118706fd6	Add Support for OpenSearch Vector database (#1191 ) ### Description This PR adds a wrapper which adds support for the OpenSearch vector database. Using opensearch-py client we are ingesting the embeddings of given text into opensearch cluster using Bulk API. We can perform the `similarity_search` on the index using the 3 popular searching methods of OpenSearch k-NN plugin: - `Approximate k-NN Search` use approximate nearest neighbor (ANN) algorithms from the [nmslib](https://github.com/nmslib/nmslib), [faiss](https://github.com/facebookresearch/faiss), and [Lucene](https://lucene.apache.org/) libraries to power k-NN search. - `Script Scoring` extends OpenSearch’s script scoring functionality to execute a brute force, exact k-NN search. - `Painless Scripting` adds the distance functions as painless extensions that can be used in more complex combinations. Also, supports brute force, exact k-NN search like Script Scoring. ### Issues Resolved https://github.com/hwchase17/langchain/issues/1054 --------- Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	1 year ago
Andrew White	c5015d77e2	Allow k to be higher than doc size in max_marginal_relevance_search (#1187 ) Fixes issue #1186. For some reason, #1117 didn't seem to fix it.	1 year ago
Tom Bocklisch	47c3221fda	Max marginal relecance search fails if there are not enough docs (#1117 ) Implementation fails if there are not enough documents. Added the same check as used for similarity search. Current implementation raises ``` File ".venv/lib/python3.9/site-packages/langchain/vectorstores/faiss.py", line 160, in max_marginal_relevance_search _id = self.index_to_docstore_id[i] KeyError: -1 ```	1 year ago
Kacper Łukawski	ab1a3cccac	Hotfix: Qdrant content retrieval (revert: #1088 ) (#1093 ) The #1088 introduced a bug in Qdrant integration. That PR reverts those changes and provides class attributes to ensure consistent payload keys. In addition to that, an exception will be thrown if any of texts is None (that could have been an issue reported in #1087)	1 year ago
Rishabh Raizada	5d11e5da40	Update qdrant.py (#1088 ) Fixes #1087	1 year ago
seanaedmiston	f0a258555b	Support similarity search by vector (in FAISS) (#961 ) Alternate implementation to PR #960 Again - only FAISS is implemented. If accepted can add this to other vectorstores or leave as NotImplemented? Suggestions welcome...	1 year ago
Jeff Huber	34cba2da32	Fix typo in integration with Chroma (#1070 ) We introduced a breaking change but missed this call. This PR fixes `langchain` to work with upstream `chroma`.	1 year ago
Anton Troynikov	d43d430d86	Chroma persistence (#1028 ) This PR adds persistence to the Chroma vector store. Users can supply a `persist_directory` with any of the `Chroma` creation methods. If supplied, the store will be automatically persisted at that directory. If a user creates a new `Chroma` instance with the same persistence directory, it will get loaded up automatically. If they use `from_texts` or `from_documents` in this way, the documents will be loaded into the existing store. There is the chance of some funky behavior if the user passes a different embedding function from the one used to create the collection - we will make this easier in future updates. For now, we log a warning.	1 year ago
Harrison Chase	7fb33fca47	chroma docs (#1012 )	1 year ago
Anton Troynikov	78abd277ff	Chroma in LangChain (#1010 ) Chroma is a simple to use, open-source, zero-config, zero setup vectorstore. Simply `pip install chromadb`, and you're good to go. Out-of-the-box Chroma is suitable for most LangChain workloads, but is highly flexible. I tested to 1M embs on my M1 mac, with out issues and reasonably fast query times. Look out for future releases as we integrate more Chroma features with LangChain!	1 year ago
Harrison Chase	1e56879d38	Harrison/save faiss (#916 ) Co-authored-by: Shrey Joshi <shreyjoshi2004@gmail.com>	1 year ago
James Briggs	3aa53b44dd	added i_end in batch extraction (#907 ) Fix for issue #906 Switches `[i : i + batch_size]` to `[i : i_end]` in Pinecone `from_texts` method	1 year ago
Harrison Chase	3f48eed5bd	Harrison/milvus (#856 ) Signed-off-by: Filip Haltmayer <filip.haltmayer@zilliz.com> Signed-off-by: Frank Liu <frank.liu@zilliz.com> Co-authored-by: Filip Haltmayer <81822489+filip-halt@users.noreply.github.com> Co-authored-by: Frank Liu <frank@frankzliu.com>	1 year ago
Francisco Ingham	f9ddcb5705	Hotfix: distance_func and collection_name must not be in kwargs (#735 ) If `distance_func` and `collection_name` are in `kwargs` they are sent to the `QdrantClient` which results in an error being raised. Co-authored-by: Francisco Ingham <>	1 year ago
Feynman Liang	2824f36401	Add namespace to Pinecone.from_index (#716 ) Resolves https://github.com/hwchase17/langchain/issues/718	1 year ago
Kacper Łukawski	97c3544a1e	Hotfix: Qdrant.from_text embeddings (#713 ) I'm providing a hotfix for Qdrant integration. Calculating a single embedding to obtain the vector size was great idea. However, that change introduced a bug trying to put only that single embedding into the database. It's fixed. Right now all the embeddings will be pushed to Qdrant.	1 year ago
dham	e04b063ff4	add faiss local saving/loading (#676 ) - This uses the faiss built-in `write_index` and `load_index` to save and load faiss indexes locally - Also fixes #674 - The save/load functions also use the faiss library, so I refactored the dependency into a function	1 year ago
Harrison Chase	0b204d8c21	Harrison/quadrant (#665 ) Co-authored-by: Kacper Łukawski <kacperlukawski@users.noreply.github.com>	1 year ago
Harrison Chase	983b73f47c	add search kwargs (#664 )	1 year ago
iocuydi	69998b5fad	Add ids parameter for pinecone from_texts / add_texts (#659 ) Allow optionally specifying a list of ids for pinecone rather than having them randomly generated. This also permits editing the embedding/metadata of existing pinecone entries, by id.	1 year ago
Harrison Chase	052c361031	pinecone docstring (#654 )	1 year ago
babbldev	b5eb91536a	Added filter argument to pinecone queries, fixes #600 (#601 ) Added filter argument to similarity_search() and similarity_search_with_score() Co-authored-by: Sam Cartford (MBP) <cartford@hey.com>	1 year ago
Harrison Chase	a5ee7de650	pinecone changes (#590 ) Co-authored-by: Smit Shah <who828@gmail.com> Co-authored-by: iocuydi <46613640+iocuydi@users.noreply.github.com>	1 year ago
Harrison Chase	2aa08631cb	add similarity score method to faiss (#574 ) adds `similarity_search_with_score` to faiss wrapper	1 year ago
Harrison Chase	5ba46f6d0c	Harrison/namespace pinecone (#581 ) Co-authored-by: mmorzywolek <89693033+mmorzywolek@users.noreply.github.com>	1 year ago
Harrison Chase	150b67de10	Harrison/weaviate improvements (#433 ) Co-authored-by: Connor Shorten <connorshorten300@gmail.com>	1 year ago
Harrison Chase	0d7aa1ee99	Harrison/docs to index (#419 ) Add method for going directly from documents to VectorStores Update notebook to showcase this functionality	1 year ago
Harrison Chase	c104d507bf	Harrison/improve data augmented generation docs (#390 ) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com>	1 year ago
Harrison Chase	46c428234f	MMR example selector (#377 ) implement max marginal relevance example selector	1 year ago
Harrison Chase	2163d064f3	add return of ids (#254 ) not actually sure the desired return in add_example to example selector is actually general/good - whats the use case?	1 year ago
Xupeng (Tony) Tong	bb4bf9d6d0	chore: minor clean up / formatting (#233 ) to get familiarize with the project	1 year ago
Samantha Whitmore	09f301cd38	Add add_example method to all ExampleSelector classes, with tests (#178 ) Also updated docs, and noticed an issue with the add_texts method on VectorStores that I had missed before -- the metadatas arg should be required to match the classmethod which initializes the VectorStores (the add_example methods break otherwise in the ExampleSelectors)	2 years ago
Samantha Whitmore	315b0c09c6	wip: add method for both docstore and embeddings (#119 ) this will break atm but wanted to get thoughts on implementation. 1. should add() be on docstore interface? 2. should InMemoryDocstore change to take a list of documents as init? (makes this slightly easier to implement in FAISS -- if we think it is less clean then could expose a method to get the number of documents currently in the dict, and perform the logic of creating the necessary dictionary in the FAISS.add_texts method. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2 years ago
Harrison Chase	c02eb199b6	add few shot example (#148 )	2 years ago
Harrison Chase	b504cd739f	Harrison/cleanup env check (#144 )	2 years ago
Harrison Chase	9f223e6ccc	Harrison/fix lint (#138 )	2 years ago
Delip Rao	76cecf8165	A fix for Jupyter environment variable issue (#135 ) - fixes the Jupyter environment variable issues mentioned in issue #134 - fixes format/lint issues in some unrelated files (from make format/lint) ![image](https://user-images.githubusercontent.com/347398/201599322-090af858-362d-4d69-bf59-208aea65419a.png)	2 years ago
Eugene Yurtsev	2910f50a3c	Fix a few typos and wrapped f-strings (#128 ) Fix a few typos and wrapped f-strings	2 years ago
Harrison Chase	2179ea3103	remove unnecc variables (#113 ) i dont think either of these variables are used?	2 years ago
Samantha Whitmore	2ddab88c06	Update VectorStore interface to contain from_texts, enforce common in… (#97 ) …terface	2 years ago
Samantha Whitmore	61f12229df	Create VectorStore interface (#92 )	2 years ago

46 Commits (main)