langchain

Commit Graph

Author	SHA1	Message	Date
Cillian Berragan	754f3c41f9	community: add score to PineconeHybridSearchRetriever (#25781 ) Description: Adds the 'score' returned by Pinecone to the `PineconeHybridSearchRetriever` list of returned Documents. There is currently no way to return the score when using Pinecone hybrid search, so in this PR I include it by default. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	3 weeks ago
ZhangShenao	3f1d652f15	Improvement[Community] Improve api doc for `PineconeHybridSearchRetriever` (#25803 ) - Complete missing args in api doc	3 weeks ago
Moritz Schlager	555f97becb	community[patch]: fix model initialization bug for deepinfra (#25727 ) ### Description adds an init method to ChatDeepInfra to set the model_name attribute accordings to the argument ### Issue currently, the model_name specified by the user during initialization of the ChatDeepInfra class is never set. Therefore, it always chooses the default model (meta-llama/Llama-2-70b-chat-hf, however probably since this is deprecated it always uses meta-llama/Llama-3-70b-Instruct). We stumbled across this issue and fixed it as proposed in this pull request. Feel free to change the fix according to your coding guidelines and style, this is just a proposal and we want to draw attention to this problem. ### Dependencies no additional dependencies required Feel free to contact me or @timo282 and @finitearth if you have any questions. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	3 weeks ago
Bagatur	b0ac6fe8d3	community[patch]: Release 0.2.13 (#25806 )	3 weeks ago
zysoong	25a6790e1a	community[patch]: Minor Improvement of extract hyperlinks tool output (#25728 ) Description: Make the hyperlink only appear once in the extract_hyperlinks tool output. (for some websites output contains meaningless '#' hyperlinks multiple times which will extend the tokens of context window without any advantage) Issue: None Dependencies: None	3 weeks ago
Isaac Francisco	d5ddaac1fc	docs minor fix (#25794 )	3 weeks ago
Tomaz Bratanic	f359e6b0a5	Add mmr to neo4j vector (#25765 )	3 weeks ago
Luis Valencia	99f9a664a5	community: Azure Search Vector Store is missing Access Token Authentication (#24330 ) Added Azure Search Access Token Authentication instead of API KEY auth. Fixes Issue: https://github.com/langchain-ai/langchain/issues/24263 Dependencies: None Twitter: @levalencia @baskaryan Could you please review? First time creating a PR that fixes some code. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	4 weeks ago
ZhangShenao	44e3e2391c	Improvement[Community] Improve methods in `IMessageChatLoader` (#25746 ) - Add @staticmethod to static methods in `IMessageChatLoader`. - Format args name.	4 weeks ago
maang-h	a566a15930	Fix MoonshotChat instantiate with alias (#25755 ) - Description: - Fix `MoonshotChat` instantiate with alias - Add `MoonshotChat` to `__init__.py` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	4 weeks ago
Ashvin	af3b3a4474	Update endpoint for AzureMLEndpointApiType class. (#25725 ) This addresses the issue mentioned in #25702 I have updated the endpoint used in validating the endpoint API type in the AzureMLBaseEndpoint class from `/v1/completions` to `/completions` and `/v1/chat/completions` to `/chat/completions`. Co-authored-by: = <=>	4 weeks ago
Dristy Srivastava	7205057c3e	[Community][minor]: Added langchain_version while calling discover API (#24428 ) - Description: Added langchain version while calling discover API during both ingestion and retrieval - Issue: NA - Dependencies: NA - Tests: NA - Docs NA --------- Co-authored-by: dristy.cd <dristy@clouddefense.io>	4 weeks ago
Dristy Srivastava	fbb4761199	[Community][minor]: Updating source path, and file path for SharePoint loader in PebbloSafeLoader (#25592 ) - Description: Updating source path and file path in Pebblo safe loader for SharePoint apps during loading - Issue: NA - Dependencies: NA - Tests: NA - Docs NA --------- Co-authored-by: dristy.cd <dristy@clouddefense.io>	4 weeks ago
Rajendra Kadam	745d1c2b8d	community[minor]: [Pebblo] Fix URL construction in newer Python versions (#25747 ) - PR message: Fix URL construction in newer Python versions - Description: - Update the URL construction logic to use the .value attribute for Routes enum members. - This adjustment resolves an issue where the code worked correctly in Python 3.9 but failed in Python 3.11. - Clean up unused routes. - Issue: NA - Dependencies: NA	4 weeks ago
Rajendra Kadam	58a98c7d8a	community: [PebbloRetrievalQA] Implemented Async support for prompt APIs (#25748 ) - Description: PebbloRetrievalQA: Implemented Async support for prompt APIs (classification and governance) - Issue: NA - Dependencies: NA	4 weeks ago
Christophe Bornet	038c287b3a	all: Improve make lint command (#25344 ) * Removed `ruff check --select I` as `I` is already selected and checked in the main `ruff check` command * Added checks for non-empty `PYTHON_FILES` * Run `ruff check` only on `PYTHON_FILES` Co-authored-by: Erick Friis <erick@langchain.dev>	4 weeks ago
Erick Friis	f6491ceb7d	community: remove integration test deps (#24460 ) they arent used	4 weeks ago
Sharmistha S. Gupta	90439b12f6	Added support for Nebula Chat model (#21925 ) Description: Added support for Nebula Chat model in addition to Nebula Instruct Dependencies: N/A Twitter handle: @Symbldotai --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	4 weeks ago
Ian	64ace25eb8	<Community>: tidb vector support vector index (#19984 ) This PR introduces adjustments to ensure compatibility with the recently released preview version of [TiDB Serverless Vector Search](https://tidb.cloud/ai), aiming to prevent user confusion. - TiDB Vector now supports vector indexing with cosine and l2 distance strategies, although inner_product remains unsupported. - Changing the distance strategy is currently not supported, so the test cased should be adjusted.	4 weeks ago
Austin Burdette	f355a98bb6	community:yuan2[patch]: standardize init args (#21462 ) updated stop and request_timeout so they aliased to stop_sequences, and timeout respectively. Added test that both continue to set the same underlying attributes. Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) Co-authored-by: ccurme <chester.curme@gmail.com>	4 weeks ago
Erick Friis	b365ee996b	community: remove unused verify_ssl kwarg from aiohttp request (#25707 ) it's not a valid kwarg in aiohttp request	4 weeks ago
Ashvin	2cd77a53a3	docs: Add docstrings for CassandraChatMessageHistory class and package namespace function. (#24222 ) - Modified docstring for CassandraChatMessageHistory in libs/community/langchain_community/chat_message_history/cassandra.py. - Added docstring for _package_namespace function in docs/api_reference/create_api_rst.py --------- Co-authored-by: ashvin <ashvin.anilkumar@qburst.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com>	4 weeks ago
Leonid Ganeline	8788a34bfa	community: `NeptuneGraph` fix (#23281 ) Issue: the `service` optional parameter was mentioned but not used. Fix: added this parameter. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	4 weeks ago
Djordje	22f9ae489f	community: Opensearch - added score function for similarity_score_threshold (#23928 ) This PR resolves the NotImplemented error for the similarity_score_threshold search type for OpenSearch.	4 weeks ago
ZhangShenao	b38c83ff93	patch[Community] Optimize methods in several ChatLoaders (#24806 ) There are some static methods in ChatLoaders, try to add @staticmethod decorator for them.	4 weeks ago
James Espichan Vilca	644e0d3463	Use extend method for embeddings concatenation in mlflow_gateway (#14358 ) ## Description There is a bug in the concatenation of embeddings obtained from MLflow that does not conform to the type hint requested by the function. ``` python def _query(self, texts: List[str]) -> List[List[float]]: ``` It is logical to expect a List[List[float]] for a List[str]. However, the append method encapsulates the response in a global List. To avoid this, the extend method should be used, which will add the embeddings of all strings at the same list level. ## Testing I have tried using OpenAI-ADA to obtain the embeddings, and the result of executing this snippet is as follows: ``` python embeds = await MlflowAIGatewayEmbeddings().aembed_documents(texts=["hi", "how are you?"]) print(embeds) ``` ``` python [[[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]]] ``` When in reality, the expected result should be: ``` python [[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]] ``` The above result complies with the expected type hint: List[List[float]] . As I mentioned, we can achieve that by using the extend method instead of the append method. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	4 weeks ago
Christophe Bornet	7f1e444efa	partners: Use simsimd types (#25299 ) The simsimd package [now has types](https://github.com/ashvardanian/SimSIMD/releases/tag/v5.0.0)	4 weeks ago
clement.l	642f9530cd	community: add supported blockchains to Blockchain Document Loader (#25428 ) - Remove deprecated chains. - Add more supported chains. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	4 weeks ago
conjuncts	818267bbc3	community: allow chroma DB delete() to use "where" argument (#19826 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Simply pass kwargs to allow arguments like "where" to be propagated Issue: Previously, db.delete(where={}) wouldn't work for chroma vectorstores Dependencies: N/A Twitter handle: N/A - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	4 weeks ago
Kevin Engelke	3c7f12cbf5	community[minor]: Fix missing 'keep_newlines' parameter forward-pass to 'process_pages' function in confluence loader (#20086 ) (#20087 ) - Description: Fixed missing `keep_newlines` parameter forward-pass in confluence-loader - Issue: #20086 - Dependencies: None --------- Co-authored-by: ccurme <chester.curme@gmail.com>	4 weeks ago
Erik Lindgren	583b0449eb	community[patch]: Fix Hybrid Search for non-Databricks managed embeddings (#25590 ) Description: Send both the query and query_embedding to the Databricks index for hybrid search. Issue: When using hybrid search with non-Databricks managed embedding we currently don't pass both the embedding and query_text to the index. Hybrid search requires both of these. This change fixes this issue for both `similarity_search` and `similarity_search_by_vector`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	4 weeks ago
Alejandro Companioni	bcd5842b5d	community[patch]: Updating default PPLX model to supported llama-3.1 model. (#25643 ) # Issue As of late July, Perplexity [no longer supports Llama 3 models](https://docs.perplexity.ai/changelog/introducing-new-and-improved-sonar-models). # Description This PR updates the default model and doc examples to reflect their latest supported model. (Mostly updating the same places changed by #23723.) # Twitter handle `@acompa_` on behalf of the team at Not Diamond. Check us out [here](https://notdiamond.ai). --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	4 weeks ago
Jakub W.	b865ee49a0	community[patch]: Dynamodb history messages key (#25658 ) - Description: adding the history_messages_key so you don't have to use "History" as a key in langchain	4 weeks ago
Manuel Jaiczay	1c31234eed	community: fix HuggingFacePipeline pipeline_kwargs (#19920 ) Fix handling of pipeline_kwargs to prioritize class attribute defaults. #19770 Co-authored-by: jaizo <manuel.jaiczay@polygons.at> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	4 weeks ago
Nobuhiko Otoba	4b63a217c2	"community: Fix GithubFileLoader source code", "docs: Fix GithubFileLoader code sample" (#19943 ) This PR adds tiny improvements to the `GithubFileLoader` document loader and its code sample, addressing the following issues: 1. Currently, the `file_extension` argument of `GithubFileLoader` does not change its behavior at all. 1. The `GithubFileLoader` sample code in `docs/docs/integrations/document_loaders/github.ipynb` does not work as it stands. The respective solutions I propose are the following: 1. Remove `file_extension` argument from `GithubFileLoader`. 1. Specify the branch as `master` (not the default `main`) and rename `documents` as `document`. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	4 weeks ago
Nada Amin	ac7b71e0d7	langchain_community.graphs: Neo4JGraph: prop min_size might be None (#23944 ) When I used the Neo4JGraph enhanced_schema=True option, I ran into an error because a prop min_size of None was compared numerically with an int. The fix I applied is similar to the pattern of skipping embeddings elsewhere in the file. Co-authored-by: ccurme <chester.curme@gmail.com>	4 weeks ago
William FH	fad6fc866a	Rm DeepInfra Breakpoint Comment (#25206 ) tbh should rm the print staement too	4 weeks ago
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	4 weeks ago
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	4 weeks ago
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	4 weeks ago
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	4 weeks ago
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	4 weeks ago
Noah Mayerhofer	0091947efd	community: add retry for session expired exception in neo4j (#25660 ) Description: The neo4j driver can raise a SessionExpired error, which is considered a retriable error. If a query fails with a SessionExpired error, this change retries every query once. This change will make the neo4j integration less flaky. Twitter handle: noahmay_	4 weeks ago
Dristy Srivastava	b002702af6	[Community][minor]: Updating metadata with full_path in SharePoint loader (#25593 ) - Description: Updating metadata for sharepoint loader with full path i.e., webUrl - Issue: NA - Dependencies: NA - Tests: NA - Docs NA Co-authored-by: dristy.cd <dristy@clouddefense.io> Co-authored-by: ccurme <chester.curme@gmail.com>	1 month ago
Jabir	12e490ea56	Update azuresearch.py (#25577 ) This will allow complextype metadata to be returned. the current implementation throws error when dealing with nested metadata Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	1 month ago
Erick Friis	e01c6789c4	core,community: add beta decorator to missed GraphVectorStore extensions (#25562 )	1 month ago
maang-h	015ab91b83	community[patch]: Add ToolMessage for ChatZhipuAI (#25547 ) - Description: Add ToolMessage for `ChatZhipuAI` to solve the issue #25490	1 month ago
Mohammad Mohtashim	75c3c81b8c	[Community]: Fix - Open AI Whisper `client.audio.transcriptions` returning Text Object which raises error (#25271 ) - Description: The following [line](`fd546196ef/libs/community/langchain_community/document_loaders/parsers/audio.py (L117)`) in `OpenAIWhisperParser` returns a text object for some odd reason despite the official documentation saying it should return `Transcript` Instance which should have the text attribute. But for the example given in the issue and even when I tried running on my own, I was directly getting the text. The small PR accounts for that. - Issue: : #25218 I was able to replicate the error even without the GenericLoader as shown below and the issue was with `OpenAIWhisperParser` ```python parser = OpenAIWhisperParser(api_key="sk-fxxxxxxxxx", response_format="srt", temperature=0) list(parser.lazy_parse(Blob.from_path('path_to_file.m4a'))) ```	1 month ago
maang-h	32f5147523	docs: Fix QianfanLLMEndpoint and Tongyi input text (#25529 ) - Description: Fix `QianfanLLMEndpoint` and `Tongyi` input text.	1 month ago
ZhangShenao	4255a30f20	Improvement[Community] Improve api doc for `SingleFileFacebookMessengerChatLoader` (#25536 ) Delete redundant args in api doc	1 month ago

1 2 3 4 5 ...

1496 Commits (754f3c41f9dd3344b9d79cc9830791bd168422d6)