https://github.com/hwchase17/langchain/issues/1100
When FAISS data and doc.index were created with past versions, an error
occurs saying an attribute does not exist, so I added a hasattr check
as a simple fix.
However, accumulating such checks is not good for maintainability, so I
think there is a better solution.
Also, the batch-processing code had been left out, so I put it back in.
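A minimal sketch of the kind of guard added (the attribute name here is
illustrative, not necessarily the one the PR checks):

```python
def check_index_compat(faiss_store) -> None:
    # Hypothetical backward-compatibility guard: indexes saved by older
    # versions may lack attributes that newer code expects.
    if not hasattr(faiss_store, "index_to_docstore_id"):
        raise ValueError(
            "This index was saved with an older version; "
            "please re-create and re-save it."
        )
```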
Currently, the 'truncate' parameter of the cohere API is not supported.
This means that by default, if trying to generate an embedding that is
too big, the call will just fail with an error (which is frustrating
when using this embedding source, e.g. with GPT-Index, because it's
hard to handle properly when generating a lot of embeddings).
With the parameter, one can decide to either truncate the START or END
of the text to fit the max token length and still generate an embedding
without throwing the error.
In this PR, I added this parameter to the class.
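A minimal sketch of the parameter in use (values follow the Cohere
embed API: "NONE", "START", or "END"):

```python
from langchain.embeddings import CohereEmbeddings

# Truncate over-long inputs from the end instead of failing the call.
embeddings = CohereEmbeddings(
    cohere_api_key="...",
    truncate="END",  # or "START"
)
vector = embeddings.embed_query("a very long document " * 1000)
```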
_Arguably, there should be a better way to handle this error, e.g. by
optionally calling a function that gets triggered when the token limit
is reached and can split the document or some such. Especially in the
GPT-Index use case, it's often hard to estimate the token counts for
each document, and I'd rather sort out the troublemakers or simply
split them than interrupt the whole execution.
Thoughts?_
---------
Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
Currently, the CohereEmbeddings class does not support the parameter
'model_name', only 'model'. The class documentation is inconsistent
with this, though, so I propose to either fix the documentation (what
this PR does) or fix the parameter.
Passing 'model_name' produces the following error:
```
ValidationError: 1 validation error for CohereEmbeddings
model_name
extra fields not permitted (type=value_error.extra)
```
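For illustration, the failing call versus the accepted one (model name
per the Cohere docs):

```python
from langchain.embeddings import CohereEmbeddings

# Raises the ValidationError above; extra fields are not permitted:
# embeddings = CohereEmbeddings(model_name="large", cohere_api_key="...")

# Works: the accepted parameter is `model`.
embeddings = CohereEmbeddings(model="large", cohere_api_key="...")
```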
`SentenceTransformer` returns a NumPy array, not a `List[List[float]]`
or `List[float]` as specified in the interface of `Embeddings`. This PR
makes it consistent with the interface.
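A minimal sketch of the mismatch and the conversion (not the PR's exact
code; the model name is illustrative):

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
array = model.encode(["This is a test document."])  # NumPy ndarray
as_lists = array.tolist()  # List[List[float]], per the Embeddings interface
```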
Now that OpenAI has deprecated all embeddings models except
text-embedding-ada-002, we should stop specifying a legacy embedding
model in the example. This will also avoid confusion from people (like
me) trying to specify model="text-embedding-ada-002" and having that
erroneously expanded to text-search-text-embedding-ada-002-query-001.
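A sketch of the updated example (assuming the class accepts a `model`
keyword, as current versions do):

```python
from langchain.embeddings import OpenAIEmbeddings

# Use the one non-deprecated model directly, with no expansion into
# the legacy text-search-*-query-001 names.
embeddings = OpenAIEmbeddings(model="text-embedding-ada-002")
query_result = embeddings.embed_query("This is a test document.")
```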
Running the Cohere embeddings example from the docs:
```python
from langchain.embeddings import CohereEmbeddings
embeddings = CohereEmbeddings(cohere_api_key=cohere_api_key)
text = "This is a test document."
query_result = embeddings.embed_query(text)
doc_result = embeddings.embed_documents([text])
```
I get the error:
```bash
CohereError(message=res['message'], http_status=response.status_code, headers=response.headers)
cohere.error.CohereError: embed is not an available endpoint on this model
```
This is because the `model` string is set to `medium`, which is not
currently available.
From the Cohere docs:
> Currently available models are small and large (default)
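A working variant of the example, with an available model set
explicitly:

```python
from langchain.embeddings import CohereEmbeddings

# "small" and "large" are the currently available models per the docs.
embeddings = CohereEmbeddings(model="large", cohere_api_key=cohere_api_key)
query_result = embeddings.embed_query("This is a test document.")
```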
Add support for calling HuggingFace embedding models
using the HuggingFaceHub Inference API. New class mirrors
the existing HuggingFaceHub LLM implementation. Currently
only supports 'sentence-transformers' models.
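A minimal sketch of the new class in use (the repo_id is illustrative;
any 'sentence-transformers' model should work):

```python
from langchain.embeddings import HuggingFaceHubEmbeddings

embeddings = HuggingFaceHubEmbeddings(
    repo_id="sentence-transformers/all-mpnet-base-v2",
    huggingfacehub_api_token="...",
)
query_result = embeddings.embed_query("This is a test document.")
```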
Closes #86
Addresses the issue in #76 by using the relevant environment variable
if set, or a string passed in the constructor. The constructor string
is preferred over the environment variable, which seemed like the
natural choice to me.
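A minimal sketch of that precedence (the helper name and environment
variable are illustrative):

```python
import os
from typing import Optional

def resolve_api_key(constructor_key: Optional[str] = None) -> str:
    # An explicitly passed key wins; otherwise fall back to the
    # environment variable.
    key = constructor_key or os.environ.get("OPENAI_API_KEY")
    if key is None:
        raise ValueError("No API key passed and OPENAI_API_KEY is not set.")
    return key
```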