langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Djordje	22f9ae489f	community: Opensearch - added score function for similarity_score_threshold (#23928 ) This PR resolves the NotImplemented error for the similarity_score_threshold search type for OpenSearch.	2024-08-23 11:30:04 -04:00
ZhangShenao	b38c83ff93	patch[Community] Optimize methods in several ChatLoaders (#24806 ) There are some static methods in ChatLoaders, try to add @staticmethod decorator for them.	2024-08-23 11:00:41 -04:00
James Espichan Vilca	644e0d3463	Use extend method for embeddings concatenation in mlflow_gateway (#14358 ) ## Description There is a bug in the concatenation of embeddings obtained from MLflow that does not conform to the type hint requested by the function. ``` python def _query(self, texts: List[str]) -> List[List[float]]: ``` It is logical to expect a List[List[float]] for a List[str]. However, the append method encapsulates the response in a global List. To avoid this, the extend method should be used, which will add the embeddings of all strings at the same list level. ## Testing I have tried using OpenAI-ADA to obtain the embeddings, and the result of executing this snippet is as follows: ``` python embeds = await MlflowAIGatewayEmbeddings().aembed_documents(texts=["hi", "how are you?"]) print(embeds) ``` ``` python [[[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]]] ``` When in reality, the expected result should be: ``` python [[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]] ``` The above result complies with the expected type hint: List[List[float]] . As I mentioned, we can achieve that by using the extend method instead of the append method. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 14:43:43 +00:00
Christophe Bornet	7f1e444efa	partners: Use simsimd types (#25299 ) The simsimd package [now has types](https://github.com/ashvardanian/SimSIMD/releases/tag/v5.0.0)	2024-08-23 10:41:39 -04:00
clement.l	642f9530cd	community: add supported blockchains to Blockchain Document Loader (#25428 ) - Remove deprecated chains. - Add more supported chains. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-23 14:39:42 +00:00
conjuncts	818267bbc3	community: allow chroma DB delete() to use "where" argument (#19826 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Simply pass kwargs to allow arguments like "where" to be propagated Issue: Previously, db.delete(where={}) wouldn't work for chroma vectorstores Dependencies: N/A Twitter handle: N/A - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-08-23 10:10:57 -04:00
Kevin Engelke	3c7f12cbf5	community[minor]: Fix missing 'keep_newlines' parameter forward-pass to 'process_pages' function in confluence loader (#20086 ) (#20087 ) - Description: Fixed missing `keep_newlines` parameter forward-pass in confluence-loader - Issue: #20086 - Dependencies: None --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 12:59:38 +00:00
Erik Lindgren	583b0449eb	community[patch]: Fix Hybrid Search for non-Databricks managed embeddings (#25590 ) Description: Send both the query and query_embedding to the Databricks index for hybrid search. Issue: When using hybrid search with non-Databricks managed embedding we currently don't pass both the embedding and query_text to the index. Hybrid search requires both of these. This change fixes this issue for both `similarity_search` and `similarity_search_by_vector`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 08:57:13 +00:00
Alejandro Companioni	bcd5842b5d	community[patch]: Updating default PPLX model to supported llama-3.1 model. (#25643 ) # Issue As of late July, Perplexity [no longer supports Llama 3 models](https://docs.perplexity.ai/changelog/introducing-new-and-improved-sonar-models). # Description This PR updates the default model and doc examples to reflect their latest supported model. (Mostly updating the same places changed by #23723.) # Twitter handle `@acompa_` on behalf of the team at Not Diamond. Check us out [here](https://notdiamond.ai). --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-08-23 08:33:30 +00:00
Leonid Ganeline	163ef35dd1	docs: `templates` updated titles (#25646 ) Updated titles into a consistent format. Fixed links to the diagrams. Fixed typos. Note: The Templates menu in the navbar is now sorted by the file names. I'll try sorting the navbar menus by the page titles, not the page file names.	2024-08-23 01:19:38 -07:00
Parsa Abbasi	1b2ae40d45	docs: Updated WikipediaLoader documentation (#25647 ) - Output of the cells was not included in the documentation. I have added them. - There is another parameter in the `WikipediaLoader` class called `doc_content_chars_max` (Based on [this](https://api.python.langchain.com/en/latest/document_loaders/langchain_community.document_loaders.wikipedia.WikipediaLoader.html)). I have included this in the list of parameters. - I put the list of parameters under a new section called "Parameters" in the documentation. - I also included the `langchain_community` package in the installation command. - Some minor formatting/spelling issues were fixed.	2024-08-23 01:19:03 -07:00
Jakub W.	b865ee49a0	community[patch]: Dynamodb history messages key (#25658 ) - Description: adding the history_messages_key so you don't have to use "History" as a key in langchain	2024-08-23 08:05:28 +00:00
Erick Friis	b28bc252c4	core[patch]: mmr util (#25689 )	2024-08-22 21:31:17 -07:00
ZhangShenao	ba89933c2c	Doc[Embeddings] Add docs for `ZhipuAIEmbeddings` (#25662 ) - Add docs for `ZhipuAIEmbeddings`. - Using integration doc template. - Source api reference: https://bigmodel.cn/dev/api#vector --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 01:33:43 +00:00
Erick Friis	6096c80b71	core: pydantic output parser streaming fix (#24415 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 18:00:09 -07:00
Eugene Yurtsev	c316361115	core[patch]: Add _api.rename_parameter to support renaming of parameters in functions (#25101 ) Add ability to rename paramerters in function signatures ```python @rename_parameter(since="2.0.0", removal="3.0.0", old="old_name", new="new_name") def foo(new_name: str) -> str: """original doc""" return new_name ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-22 17:16:31 -07:00
Yusuke Fukasawa	0258cb96fa	core[patch]: add additionalProperties recursively to oai function if strict (#25169 ) Hello. First of all, thank you for maintaining such a great project. ## Description In https://github.com/langchain-ai/langchain/pull/25123, support for structured_output is added. However, `"additionalProperties": false` needs to be included at all levels when a nested object is generated. error from current code: https://gist.github.com/fufufukakaka/e9b475300e6934853d119428e390f204 ``` BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for response_format 'JokeWithEvaluation': In context=('properties', 'self_evaluation'), 'additionalProperties' is required to be supplied and to be false", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}} ``` Reference: [Introducing Structured Outputs in the API](https://openai.com/index/introducing-structured-outputs-in-the-api/) ```json { "model": "gpt-4o-2024-08-06", "messages": [ { "role": "system", "content": "You are a helpful math tutor." }, { "role": "user", "content": "solve 8x + 31 = 2" } ], "response_format": { "type": "json_schema", "json_schema": { "name": "math_response", "strict": true, "schema": { "type": "object", "properties": { "steps": { "type": "array", "items": { "type": "object", "properties": { "explanation": { "type": "string" }, "output": { "type": "string" } }, "required": ["explanation", "output"], "additionalProperties": false } }, "final_answer": { "type": "string" } }, "required": ["steps", "final_answer"], "additionalProperties": false } } } } ``` In the current code, `"additionalProperties": false` is only added at the last level. This PR introduces the `_add_additional_properties_key` function, which recursively adds `"additionalProperties": false` to the entire JSON schema for the request. Twitter handle: `@fukkaa1225` Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-23 00:08:58 +00:00
Bagatur	b35ee09b3f	infra: xfail pydantic v2 arg to py function (#25686 ) Issue to track: #25687	2024-08-22 23:52:57 +00:00
Christophe Bornet	ee98da4f4e	core[patch]: Add UP(upgrade) ruff rules (#25358 )	2024-08-22 16:29:22 -07:00
William FH	294f7fcb38	core[patch]: Remove different parent run id warning (#25683 )	2024-08-22 16:10:35 -07:00
Vadym Barda	46d344c33d	core[patch]: support drawing nested subgraphs in draw_mermaid (#25581 ) Previously the code was able to only handle a single level of nesting for subgraphs in mermaid. This change adds support for arbitrary nesting of subgraphs.	2024-08-22 16:08:49 -07:00
Manuel Jaiczay	1c31234eed	community: fix HuggingFacePipeline pipeline_kwargs (#19920 ) Fix handling of pipeline_kwargs to prioritize class attribute defaults. #19770 Co-authored-by: jaizo <manuel.jaiczay@polygons.at> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:29:46 -04:00
Nobuhiko Otoba	4b63a217c2	"community: Fix GithubFileLoader source code", "docs: Fix GithubFileLoader code sample" (#19943 ) This PR adds tiny improvements to the `GithubFileLoader` document loader and its code sample, addressing the following issues: 1. Currently, the `file_extension` argument of `GithubFileLoader` does not change its behavior at all. 1. The `GithubFileLoader` sample code in `docs/docs/integrations/document_loaders/github.ipynb` does not work as it stands. The respective solutions I propose are the following: 1. Remove `file_extension` argument from `GithubFileLoader`. 1. Specify the branch as `master` (not the default `main`) and rename `documents` as `document`. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:24:57 -04:00
Leonid Ganeline	e7abee034e	docs: `integrations` reference updates 4 (#25118 ) Added missed references; missed provider pages. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 22:16:45 +00:00
Erick Friis	5fb8aa82b9	docs: api ref to new site links in featuretable (#25678 )	2024-08-22 21:52:50 +00:00
Bagatur	cf9c484715	standard-tests[patch]: test Message.name (#25677 ) Tests: https://github.com/langchain-ai/langchain/actions/runs/10516092584	2024-08-22 14:47:31 -07:00
Nada Amin	ac7b71e0d7	langchain_community.graphs: Neo4JGraph: prop min_size might be None (#23944 ) When I used the Neo4JGraph enhanced_schema=True option, I ran into an error because a prop min_size of None was compared numerically with an int. The fix I applied is similar to the pattern of skipping embeddings elsewhere in the file. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-22 20:29:52 +00:00
CastaChick	7d13a2f958	core[patch]: add option to specify the chunk separator in `merge_message_runs` (#24783 ) Description: LLM will stop generating text even in the middle of a sentence if `finish_reason` is `length` (for OpenAI) or `stop_reason` is `max_tokens` (for Anthropic). To obtain longer outputs from LLM, we should call the message generation API multiple times and merge the results into the text to circumvent the API's output token limit. The extra line breaks forced by the `merge_message_runs` function when seamlessly merging messages can be annoying, so I added the option to specify the chunk separator. Issue: No corresponding issues. Dependencies: No dependencies required. Twitter handle: @hanama_chem https://x.com/hanama_chem --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 19:46:25 +00:00
basirsedighi	0f3fe44e44	parsed_json is expected to be a list of dictionaries, but it seems to… (#24018 ) parsed_json is expected to be a list of dictionaries, but it seems to… be a single dictionary instead. This is at libs/experimental/langchain_experimental/graph_transformers/llm.py process process_response Thank you for contributing to LangChain! - [ ] Bugfix: "experimental: bugfix" --------- Co-authored-by: based <basir.sedighi@nris.no> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 19:09:43 +00:00
ZhangShenao	8bde04079b	patch[experimental] Fix start_index in `SemanticChunker` (#24761 ) - Cause chunks are joined by space, so they can't be found in text, and the final `start_index` is very possibility to be -1. - The simplest way is to use the natural index of the chunk as `start_index`.	2024-08-22 14:59:40 -04:00
Sanjay Parajuli	6fbd53bc60	docs: Update tool_calling.ipynb (#25434 ) Description: This part of the documentation didn't explain about the `required` property of function calling. I added additional line as a note. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 18:55:24 +00:00
William FH	fad6fc866a	Rm DeepInfra Breakpoint Comment (#25206 ) tbh should rm the print staement too	2024-08-22 14:43:44 -04:00
yahya-mouman	e5bb4cb646	lagchain-pinecone: add id to similarity documents results (#25630 ) - Description: This change adds the ID field that's required in Pinecone to the result documents of the similarity search method. - Issue: Lack of document metadata namely the ID field - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 18:33:26 +00:00
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:27:16 -04:00
Ivan	5b9290a449	Fix UnionType type var replacement (#25566 ) [langchain_core] Fix UnionType type var replacement - Added types.UnionType to typing.Union mapping Type replacement cause `TypeError: 'type' object is not subscriptable` if any of union type comes as function `_py_38_safe_origin` return `types.UnionType` instead of `typing.Union` ```python >>> from types import UnionType >>> from typing import Union, get_origin >>> type_ = get_origin(str \| None) >>> type_ <class 'types.UnionType'> >>> UnionType[(str, None)] Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'type' object is not subscriptable >>> Union[(str, None)] typing.Optional[str] ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:22:09 -04:00
William FH	8230ba47f3	core[patch]: Improve some error messages and add another test for checking RunnableWithMessageHistory (#25209 ) Also add more useful error messages. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-22 18:14:27 +00:00
Hasan Kumar	b4fcda7657	langchain: Fix type warnings when passing Runnable as agent to AgentExecutor (#24750 ) Fix for https://github.com/langchain-ai/langchain/issues/13075 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:02:02 -04:00
sslee	61228da1c4	fix typo (#25673 )	2024-08-22 17:33:53 +00:00
Leonid Ganeline	d886f4e107	docs: `integrations` reference update 9 (#25511 ) Added missed provider pages. Added missed references and descriptions.	2024-08-22 10:25:41 -07:00
Maurits Bos	3da752c7bb	Update pyproject.toml of package`openai-functions-agent-gmail` to prevent `ModuleOrPackageNotFound` error (#25597 ) I was trying to add this package using langchain-cli: `langchain app add openai-functions-agent-gmail`, but when then try to build the whole project using poetry or pip, it fails with the following error:`poetry.core.masonry.utils.module.ModuleOrPackageNotFound: No file/folder found for package openai-functions-agent-gmail` This was fixed by modifying the pyproject.toml as in this commit Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-22 17:22:50 +00:00
Leonid Ganeline	624e0747b9	docs: `integrations` reference updates 10 (#25556 ) Added missed provider pages. Added descriptions, links.	2024-08-22 10:21:54 -07:00
Erick Friis	9447925d94	cli: release 0.0.30 (#25672 )	2024-08-22 10:21:19 -07:00
Leonid Ganeline	47adc7f32b	docs: `integrations` reference updates 11 (#25598 ) Added missed provider pages and links.	2024-08-22 10:19:17 -07:00
Dylan	16fc0a866e	docs: Change Pull Request to Merge Request in GitLab notebook (#25649 ) - Description: In GitLab we call these "merge requests" rather than "pull requests" so I thought I'd go ahead and update the notebook. - Issue: N/A - Dependencies: none - Twitter handle: N/A Thanks for creating the tools and notebook to help people work with GitLab. I thought I'd contribute some minor docs updates here.	2024-08-22 17:15:45 +00:00
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	2024-08-22 10:10:51 -07:00
Brian Sam-Bodden	29c873dd69	[docs]: update Redis (langchain-redis) documentation notebooks (vectorstore, llm caching, chat message history) (#25113 ) - Description: Adds notebooks for Redis Partner Package (langchain-redis) - Issue: N/A - Dependencies: None - Twitter handle: `@bsbodden` and `@redis` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 11:53:02 -04:00
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	2024-08-22 11:51:21 -04:00
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	2024-08-22 11:46:52 -04:00
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	2024-08-22 10:34:41 -04:00
Krishna Kulkarni	820da64983	limit the most recent documents to fetch from MongoDB database. (#25435 ) limit the most recent documents to fetch from MongoDB database. Thank you for contributing to LangChain! - [ ] limit the most recent documents to fetch from MongoDB database.: "langchain_mongodb: limit the most recent documents to fetch from MongoDB database." - [ ] PR message: *Delete this entire checklist* and replace with - Description: Added a doc_limit parameter which enables the limit for the documents to fetch from MongoDB database - Issue: - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 10:33:45 -04:00

... 4 5 6 7 8 ...

11319 Commits