langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
mrkhalil6	4e7d0c115b	Add support for filters and namespaces in similarity search in Pinecone similarity_score_threshold (#7301 ) At the moment, pinecone vectorStore does not support filters and namespaces when using similarity_score_threshold search type. In this PR, I've implemented that. It passes all the kwargs except "score_threshold" as that is not a supported argument for method "similarity_search_with_score". --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-07 15:03:59 -04:00
Manuel Saelices	01dca1e438	Add context to an output parsing error on Pydantic schema to improve exception handling (#7344 ) ## Changes - [X] Fill the `llm_output` param when there is an output parsing error in a Pydantic schema so that we can get the original text that failed to parse when handling the exception ## Background With this change, we could do something like this: ``` output_parser = PydanticOutputParser(pydantic_object=pydantic_obj) chain = ConversationChain(..., output_parser=output_parser) try: response: PydanticSchema = chain.predict(input=input) except OutputParserException as exc: logger.error( 'OutputParserException while parsing chatbot response: %s', exc.llm_output, ) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-07 14:49:37 -04:00
Raouf Chebri	1ac6deda89	update extension name (#7359 ) hi @rlancemartin , We had a new deployment and the `pg_extension` creation command was updated from `CREATE EXTENSION pg_embedding` to `CREATE EXTENSION embedding`. https://github.com/neondatabase/neon/pull/4646 The extension not made public yet. No users will be affected by this. Will be public next week. Please let me know if you have any questions. Thank you in advance 🙏	2023-07-07 11:35:51 -07:00
William FH	4e180dc54e	Unset Cache in Tests (#7362 ) This is impacting other unit tests that use callbacks since the cache is still set (just empty)	2023-07-07 11:05:09 -07:00
German Martin	3ce4e46c8c	The Fellowship of the Vectors: New Embeddings Filter using clustering. (#7015 ) Continuing with Tolkien inspired series of langchain tools. I bring to you: The Fellowship of the Vectors, AKA EmbeddingsClusteringFilter. This document filter uses embeddings to group vectors together into clusters, then allows you to pick an arbitrary number of documents vector based on proximity to the cluster centers. That's a representative sample of the cluster. The original idea is from [Greg Kamradt](https://github.com/gkamradt) from this video (Level4): https://www.youtube.com/watch?v=qaPMdcCqtWk&t=365s I added few tricks to make it a bit more versatile, so you can parametrize what to do with duplicate documents in case of cluster overlap: replace the duplicates with the next closest document or remove it. This allow you to use it as an special kind of redundant filter too. Additionally you can choose 2 diff orders: grouped by cluster or respecting the original retriever scores. In my use case I was using the docs grouped by cluster to run refine chains per cluster to generate summarization over a large corpus of documents. Let me know if you want to change anything! @rlancemartin, @eyurtsev, @hwchase17, --------- Co-authored-by: rlm <pexpresss31@gmail.com>	2023-07-07 10:28:17 -07:00
Leonid Ganeline	b489466488	docs: `dependents` update 4 (#7360 ) Updated links and counters of the `dependents` page.	2023-07-07 13:22:30 -04:00
William FH	38ca5c84cb	Explicitly list requires_reference in function (#7357 )	2023-07-07 10:04:03 -07:00
Harrison Chase	49b2b0e3c0	change embedding to None (#7355 )	2023-07-07 12:33:03 -04:00
imaprogrammer	a2830e3056	Update chroma.py: Persist directory from client_settings if provided there (#7087 ) Change details: - Description: When calling db.persist(), a check prevents from it proceeding as the constructor only sets member `_persist_directory` from parameters. But the ChromaDB client settings also has this parameter, and if the client_settings parameter is used without passing the persist_directory (which is optional), the `persist` method raises `ValueError` for not setting `_persist_directory`. This change fixes it by setting the member `_persist_directory` variable from client_settings if it is set, else uses the constructor parameter. - Issue: I didn't find any github issue of this, but I discovered it after calling the persist method - Dependencies: None - Tag maintainer: vectorstore related change - @rlancemartin, @eyurtsev - Twitter handle: Don't have one :( Additional discussion: We may need to discuss the way I implemented the fallback using `or`. --------- Co-authored-by: rlm <pexpresss31@gmail.com>	2023-07-07 09:20:27 -07:00
Bagatur	cb4e88e4fb	bump 227 (#7354 )	2023-07-07 11:52:35 -04:00
Bagatur	d1c7237034	openai fn update nb (#7352 )	2023-07-07 11:52:21 -04:00
Bagatur	0ed2da7020	bump 226 (#7335 )	2023-07-07 05:59:13 -04:00
Bagatur	1c8cff32f1	Generic OpenAI fn chain (#7270 ) Add loading functions for openai function chains and add docs page	2023-07-07 05:44:53 -04:00
Bagatur	fd7145970f	Output parser redirect (#7330 ) Related to ##7311	2023-07-07 04:26:34 -04:00
OwenElliott	3074306ae1	Marqo Vector Store Examples & Type Hints (#7326 ) This PR improves the example notebook for the Marqo vectorstore implementation by adding a new RetrievalQAWithSourcesChain example. The `embedding` parameter in `from_documents` has its type updated to `Union[Embeddings, None]` and a default parameter of None because this is ignored in Marqo. This PR also upgrades the Marqo version to 0.11.0 to remove the device parameter after a breaking change to the API. Related to #7068 @tomhamer @hwchase17 --------- Co-authored-by: Tom Hamer <tom@marqo.ai>	2023-07-07 04:11:20 -04:00
Nayjest	5809c3d29d	Pack of small fixes and refactorings that don't affect functionality (#6990 ) Description: Pack of small fixes and refactorings that don't affect functionality, just making code prettier & fixing some misspelling (hand-filtered improvements proposed by SeniorAi.online, prototype of code improving tool based on gpt4), agents and callbacks folders was covered. Dependencies: Nothing changed Twitter: https://twitter.com/nayjest Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-07 03:40:49 -04:00
Bagatur	87f75cb322	Add base Chain docstrings (#7114 )	2023-07-07 03:06:33 -04:00
Leonid Ganeline	284d40b7af	docstrings top level update (#7173 ) Updated docstrings so, that [API Reference](https://api.python.langchain.com/en/latest/api_reference.html) page has text in the second column (class/function/... description.	2023-07-07 02:42:28 -04:00
Stav Sapir	8d961b9e33	add preset ability to textgen llm (#7196 ) add an ability for textgen llm to work with preset provided by text gen webui API. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-07 02:41:24 -04:00
Bagatur	a9c5b4bcea	Bagatur/clarifai update (#7324 ) This PR improves upon the Clarifai LangChain integration with improved docs, errors, args and the addition of embedding model support in LancChain for Clarifai's embedding models and an overview of the various ways you can integrate with Clarifai added to the docs. --------- Co-authored-by: Matthew Zeiler <zeiler@clarifai.com>	2023-07-07 02:23:20 -04:00
Oleg Zabluda	9954eff8fd	Rename prompt_template => _DEFAULT_GRAPH_QA_TEMPLATE and PROMPT => GRAPH_QA_PROMPT to make consistent with the rest of the files (#7250 ) Rename prompt_template => _DEFAULT_GRAPH_QA_TEMPLATE to make consistent with the rest of the file.	2023-07-07 02:17:40 -04:00
Nikhil Kumar Gupta	6095a0a310	Added number_of_head_rows to pandas agent parameters (#7271 ) Description: Added number_of_head_rows as a parameter to pandas agent. number_of_head_rows allows the user to select the number of rows to pass with the prompt when include_df_in_prompt is True. This gives the ability to control the token length and can be helpful in dealing with large dataframe. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-07 02:17:26 -04:00
John Landahl	e047541b5f	Corrected a typo in elasticsearch.ipynb (#7318 ) Simple typo fix	2023-07-07 01:35:32 -04:00
Subsegment	152dc59060	docs : add cnosdb to Ecosystem Integrations (#7316 ) - Implement a `from_cnosdb` method for the `SQLDatabase` class - Write CnosDB documentation and add it to Ecosystem Integrations	2023-07-07 01:35:22 -04:00
Bagatur	927c8eb91a	Refac package version check (#7312 )	2023-07-07 01:21:53 -04:00
Sparsh Jain	bac56618b4	Solving anthropic packaging version issue (#7306 ) - Description: Solving, anthropic packaging version issue by clearing the mixup from package.version that is being confused with version from - importlib.metadata.version. - Issue: it fixes the issue #7283 - Maintainer: @hwchase17 The following change has been explained in the comment - https://github.com/hwchase17/langchain/issues/7283#issuecomment-1624328978	2023-07-06 19:35:42 -04:00
Jason B. Koh	d642609a23	Fix: Recognize `List` at `from_function` (#7178 ) - Description: pydantic's `ModelField.type_` only exposes the native data type but not complex type hints like `List`. Thus, generating a Tool with `from_function` through function signature produces incorrect argument schemas (e.g., `str` instead of `List[str]`) - Issue: N/A - Dependencies: N/A - Tag maintainer: @hinthornw - Twitter handle: `mapped` All the unittest (with an additional one in this PR) passed, though I didn't try integration tests...	2023-07-06 17:22:09 -04:00
Chathura Rathnayake	ec10787bc7	Fixed the confluence loader ".csv" files loading issue (#7195 ) - Description: Sometimes there are csv attachments with the media type "application/vnd.ms-excel". These files failed to be loaded via the xlrd library. It throws a corrupted file error. I fixed it by separately processing excel files using pandas. Excel files will be processed just like before. - Dependencies: pandas, os, io --------- Co-authored-by: Chathura <chathurar@yaalalabs.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-07-06 17:21:43 -04:00
Andre Elizondo	b21c2f8704	Update docs for whylabs (langkit) callback handler (#7293 ) - Description: Update docs for whylabs callback handler - Issue: none - Dependencies: none - Tag maintainer: @agola11 - Twitter handle: @useautomation @whylabs --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Jamie Broomall <jamie@whylabs.ai>	2023-07-06 17:21:28 -04:00
William FH	e736d60516	Load Evaluator (#6942 ) Create a `load_evaluators()` function so you don't have to import all the individual evaluator classes	2023-07-06 13:58:58 -07:00
David Duong	12d14f8947	Fix secrets serialisation for ChatAnthropic (#7300 )	2023-07-06 21:57:12 +01:00
William FH	cb9ff6efb8	Add function call params to invocation params (#7240 )	2023-07-06 13:56:07 -07:00
William FH	1f4a51cb9c	Add Agent Trajectory Interface (#7122 )	2023-07-06 13:33:33 -07:00
Bagatur	a6b39afe0e	rm side nav (#7297 )	2023-07-06 15:19:29 -04:00
Bruno Bornsztein	1a4ca3eff9	handle missing finish_reason (#7296 ) In some cases, the OpenAI response is missing the `finish_reason` attribute. It seems to happen when using Ada or Babbage and `stream=true`, but I can't always reproduce it. This change just gracefully handles the missing key.	2023-07-06 15:13:51 -04:00
Leonid Ganeline	6ff9e9b34a	updated `huggingface_hub` examples (#7292 ) Added examples for models: - Google `Flan` - TII `Falcon` - Salesforce `XGen`	2023-07-06 15:04:37 -04:00
Avinash Raj	09acbb8410	Modified PromptLayerChatOpenAI class to support function call (#6366 ) Introduction of newest function calling feature doesn't work properly with PromptLayerChatOpenAI model since on the `_generate` method, functions argument are not even getting passed to the `ChatOpenAI` base class which results in empty `ai_message.additional_kwargs` Fixes #6365	2023-07-06 13:16:04 -04:00
Dídac Sabatés	e0cb3ea90c	Fix sql_database.ipynb link (#6525 ) Looks like the [SQLDatabaseChain](https://langchain.readthedocs.io/en/latest/modules/chains/examples/sqlite.html) in the SQL Database Agent page was broken I've change it to the SQL Chain page	2023-07-06 13:07:37 -04:00
Leonid Ganeline	4450791edd	docs: tutorials update (#7230 ) updated `tutorials.mdx`: - added a link to new `Deeplearning AI` course on LangChain - added links to other tutorial videos - fixed format @baskaryan, @hwchase17	2023-07-06 12:44:23 -04:00
Diego Machado	a7ae35fe4e	Fix duplicated sentence in documentation's introduction (#6351 ) Fix duplicated sentence in documentation's introduction	2023-07-06 12:12:18 -04:00
Bagatur	681f2678a3	add elasticknn to init (#7284 )	2023-07-06 11:58:24 -04:00
hayao-k	c23e16c459	docs: Fixed typos in Amazon Kendra Retriever documentation (#7261 ) ## Description Fixed to the official service name Amazon Kendra. ## Tag maintainer @baskaryan	2023-07-06 11:56:52 -04:00
zhujiangwei	8c371e12eb	refactor BedrockEmbeddings class (#7266 ) #### Description refactor BedrockEmbeddings class to clean code as below: 1. inline content type and accept 2. rewrite input_body as a dictionary literal 3. no need to declare embeddings variable, so remove it	2023-07-06 11:56:30 -04:00
Chui	c7cf11b8ab	Remove whitespace in filename (#7264 )	2023-07-06 11:55:42 -04:00
Jan Kubica	fed64ae060	Chroma: add vector search with scores (#6864 ) - Description: Adding to Chroma integration the option to run a similarity search by a vector with relevance scores. Fixing two minor typos. - Issue: The "lambda_mult" typo is related to #4861 - Maintainer: @rlancemartin, @eyurtsev	2023-07-06 10:01:55 -04:00
William FH	576880abc5	Re-use Trajectory Evaluator (#7248 ) Use the trajectory eval chain in the run evaluation implementation and update the prepare inputs method to apply to both asynca nd sync	2023-07-06 07:00:24 -07:00
zhaoshengbo	e8f24164f0	Improve the alibaba cloud opensearch vector store documentation (#6964 ) Based on user feedback, we have improved the Alibaba Cloud OpenSearch vector store documentation. Co-authored-by: zhaoshengbo <shengbo.zsb@alibaba-inc.com>	2023-07-06 09:47:49 -04:00
Eduard van Valkenburg	ae5aa496ee	PowerBI updates (#7143 ) <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @baskaryan - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @baskaryan - Memory: @hwchase17 - Agents / Tools / Toolkits: @hinthornw - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --> Several updates for the PowerBI tools: - Handle 0 records returned by requesting redo with different filtering - Handle too large results by optionally tokenizing the result and comparing against a max (change in signature, non-breaking) - Implemented LLMChain with Chat for chat models for the tools. - Updates to the main prompt including tables - Update to Tool prompt with TOPN function - Split the tool prompt to allow the LLMChain with ChatPromptTemplate Smaller fixes for stability. For visibility: @hinthornw	2023-07-06 09:39:23 -04:00
emarco177	b9d6d4cd4c	added template repo for CI/CD deployment on Google Cloud Run (#7218 ) Replace this comment with: - Description: added documentation for a template repo that helps dockerizing and deploying a LangChain using a Cloud Build CI/CD pipeline to Google Cloud build serverless - Issue: None, - Dependencies: None, - Tag maintainer: @baskaryan, - Twitter handle: EdenEmarco177 If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use.	2023-07-06 09:38:38 -04:00
Leonid Kuligin	8b19f6a0da	Added retries for Vertex LLM (#7219 ) #7217 --------- Co-authored-by: Leonid Kuligin <kuligin@google.com>	2023-07-06 09:38:01 -04:00

... 4 5 6 7 8 ...

3227 Commits