langchain

Commit Graph

Author	SHA1	Message	Date
Christophe Bornet	999365186b	langchain[major]: Use InMemoryVectorStore by default in VectorstoreIndexCreator (#19575 ) This is a small breaking change but I think it should be done as: * No external dependency needs to be installed anymore for the default to work * It is vendor-neutral	6 months ago
Guangdong Liu	c93d4ea91c	docs: Add in code documentation to core Runnable map methods (docs only) (#19517 ) - Issue: #18804 - @baskaryan, @eyurtsev	6 months ago
Leonid Ganeline	0199b73188	docs: added `partners/package-name` folders (#19290 ) Added references to new integration packages from Google, by adding subfolders to `partners/`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
Aayush Kataria	03c38005cb	community[patch]: Fixing some caching issues for AzureCosmosDBSemanticCache (#18884 ) Fixing some issues for AzureCosmosDBSemanticCache - Added the entry for "AzureCosmosDBSemanticCache" which was missing in langchain/cache.py - Added application name when creating the MongoClient for the AzureCosmosDBVectorSearch, for tracking purposes. @baskaryan, can you please review this PR, we need this to go in asap. These are just small fixes which we found today in our testing.	6 months ago
Clément Tamines	a6cbb755a7	community[patch]: fix semantic answer bug in AzureSearch vector store (#18938 ) - Description: The `semantic_hybrid_search_with_score_and_rerank` method of `AzureSearch` contains a hardcoded field name "metadata" for the document metadata in the Azure AI Search Index. Adding such a field is optional when creating an Azure AI Search Index, as other snippets from `AzureSearch` test for the existence of this field before trying to access it. Furthermore, the metadata field name shouldn't be hardcoded as "metadata"; the method should use the `FIELDS_METADATA` variable that defines this field name instead. In the current implementation, any index without a metadata field named "metadata" will yield an error if a semantic answer is returned by the search in `semantic_hybrid_search_with_score_and_rerank`. - Issue: https://github.com/langchain-ai/langchain/issues/18731 - Prior fix to this bug: This bug was fixed in this PR https://github.com/langchain-ai/langchain/pull/15642 by adding a check for the existence of the metadata field named `FIELDS_METADATA` and retrieving a value for the key called "key" in that metadata if it exists. If the field named `FIELDS_METADATA` was not present, an empty string was returned. This fix was removed in this PR https://github.com/langchain-ai/langchain/pull/15659 (see `ed1ffca911`). @lz-chen: could you confirm this wasn't intentional? - New fix to this bug: I believe there was an oversight in the logic of the fix from [#15642](https://github.com/langchain-ai/langchain/pull/15642) which I explain below. The `semantic_hybrid_search_with_score_and_rerank` method creates a dictionary `semantic_answers_dict` with semantic answers returned by the search as follows.
`5c2f7e6b2b/libs/community/langchain_community/vectorstores/azuresearch.py (L574-L581)` The keys in this dictionary are the unique document ids in the index, if I understand the [documentation of semantic answers](https://learn.microsoft.com/en-us/azure/search/semantic-answers) in Azure AI Search correctly. When the method transforms a search result into a `Document` object, an "answer" key is added to the document's metadata. The value for this "answer" key should be the semantic answer returned by the search from this document, if such an answer is returned. The match between a `Document` object and the semantic answers returned by the search should be done through the unique document id, which is used as a key for the `semantic_answers_dict` dictionary. This id is defined in the search result's field named `FIELDS_ID`. I added a check to avoid any error in case no field named `FIELDS_ID` exists in a search result (which shouldn't happen in theory). A benefit of this approach is that this fix should work whether or not the Azure AI Search Index contains a metadata field. @levalencia could you confirm my analysis and test the fix? @raunakshrivastava7 do you agree with the fix? Thanks for the help!	6 months ago
miri-bar	55db737302	ai21[minor]: AI21 Labs Semantic Text Splitter support (#19510 ) Description: Added support for AI21 Labs model - Segmentation, as a Text Splitter Dependencies: ai21, langchain-text-splitter Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
Anindyadeep	b2a11ce686	community[minor]: Prem AI langchain integration (#19113 ) ### Prem SDK integration in LangChain This PR integrates [PremAI's](https://www.premai.io/) prem-sdk with LangChain. Users can now access deployed models (LLMs/embeddings) and use them within LangChain's ecosystem. ### This PR adds the following: - [x] Add chat support - [X] Adding embedding support - [X] writing integration tests - [X] writing tests for chat - [X] writing tests for embedding - [X] writing unit tests - [X] writing tests for chat - [X] writing tests for embedding - [X] Adding documentation - [X] writing documentation for chat - [X] writing documentation for embedding - [X] run `make test` - [X] run `make lint`, `make lint_diff` - [X] Final checks (spell check, lint, format and overall testing) --------- Co-authored-by: Anindyadeep Sannigrahi <anindyadeepsannigrahi@Anindyadeeps-MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Souhail Hanfi	cbec43afa9	community[patch]: avoid creating extension PGvector while using readOnly Databases (#19268 ) - Description: The PgVector class always runs "create extension" on init, and this statement crashes on read-only databases (read-only replicas). Oddly, the subsequent statements (create collection, etc.) work even on read-only databases. - Dependencies: no new dependencies - Twitter handle: @VenOmaX666 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Mauricio Cruz	fb9ce95184	cli[patch]: Fix Tuple typing problem when create new langchain app (#19141 ) When running the command `langchain app new my-app`, I get this error: File "/home/mauricio/.local/lib/python3.8/site-packages/langchain_cli/utils/pyproject.py", line 15, in <module> pyproject_toml: Path, local_editable_dependencies: Iterable[tuple[str, Path]] TypeError: 'type' object is not subscriptable This PR fixes the error.	6 months ago
Erick Friis	441a8012b3	mistralai[patch]: release 0.1.0 (#19540 )	6 months ago
Barun Amalkumar Halder	9246ec6b36	community[patch] : [Fiddler] ensure dataset is not added if model is present (#19293 ) Description: - minor PR to speed up onboarding by not trying to add a dataset, if a model is already present. - replace batch publish API with streaming when single events are published. Dependencies: any dependencies required for this change Twitter handle: behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	6 months ago
JSDu	6e090280fd	community[patch]: milvus will autoflush, manual flush is slowly (#19300 ) reference: https://milvus.io/docs/configure_quota_limits.md#quotaAndLimitsflushRateenabled https://github.com/milvus-io/milvus/issues/31407 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
mackong	e65dc4b95b	community[patch]: clean warning when delete by ids (#19301 ) * Description: rearrange the code to avoid a variable overwrite, which always caused a warning. * Issue: N/A * Dependencies: N/A	6 months ago
Stefano Mosconi	01fc69c191	community[patch]: expanding version in confluence loader (#19324 ) Description: Expanding version in all the Confluence API calls so to get when the page was last modified/created in all cases. Issue: #12812 Twitter handle: zzste	6 months ago
Dmitry Tyumentsev	08b769d539	community[patch]: YandexGPT Use recent yandexcloud sdk version (#19341 ) Fixed the inability to work with [yandexcloud SDK](https://pypi.org/project/yandexcloud/) versions higher than 0.265.0	6 months ago
Marlene	f1313339ac	community[patch]: Fixing incorrect base URLs for Azure Cognitive Search Retriever (#19352 ) This PR adds code to make sure that the correct base URL is being created for the Azure Cognitive Search retriever. At the moment an incorrect base URL is being generated. I think this is happening because the original code was based on a deprecated API version. No dependencies need to be added. I've also added more context to the test doc strings. I should also note that ACS is now Azure AI Search. I will open a separate PR to make these changes as that would be a breaking change and should potentially be discussed. Twitter: @marlene_zw - No new tests added, however the current ACS retriever tests are now passing when I run them. - Code was linted. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
FinTech秋田	03ba1d4731	community[patch]: Add Support for GPU Index Types in Milvus 2.4 (#19468 ) - Description: This commit introduces support for the newly available GPU index types introduced in Milvus 2.4 within the LangChain project's `milvus.py`. With the release of Milvus 2.4, a range of GPU-accelerated index types have been added, offering enhanced search capabilities and performance optimizations for vector search operations. This update ensures LangChain users can fully utilize the new performance benefits for vector search operations. - Reference: https://milvus.io/docs/gpu_index.md Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Ash Vardanian	d01bad5169	core[patch]: Convert SimSIMD back to NumPy (#19473 ) This patch fixes the #18022 issue, converting the SimSIMD internal zero-copy outputs to NumPy. I've also noticed, that oftentimes `dtype=np.float32` conversion is used before passing to SimSIMD. Which numeric types do LangChain users generally care about? We support `float64`, `float32`, `float16`, and `int8` for cosine distances and `float16` seems reasonable for practically any kind of embeddings and any modern piece of hardware, so we can change that part as well 🤗	6 months ago
Mikelarg	dac2e0165a	community[minor]: Added GigaChat Embeddings support + updated previous GigaChat integration (#19516 ) - Description: Added integration with [GigaChat](https://developers.sber.ru/portal/products/gigachat) embeddings. Also added support for extra fields in GigaChat LLM and fixed docs.	6 months ago
Martin Kolb	e5bdb26f76	community[patch]: More flexible handling for entity names in vector store "HANA Cloud" (#19523 ) - Description: Added support for lower-case and mixed-case names. The names for tables and columns previously had to be UPPER_CASE. With this enhancement, lower_case and MixedCase are also supported. - Issue: N/A - Dependencies: no new dependencies added - Twitter handle: @sapopensource	6 months ago
Orest Xherija	0b1e09029f	openai[patch]: increase max batch size for Azure OpenAI Embeddings API (#19532 ) Description: Azure OpenAI has increased its maximum batch size from 16 to 2048 for the Embeddings API per this How-To [page](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/embeddings?tabs=console#best-practices) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Eugene Yurtsev	56f4c5459b	core[patch]: fix xml output parser transform (#19530 ) The previous PR passed the _parser attribute, which apparently is not meant to be used by user code and causes non-deterministic failures on CI when testing the transform and atransform methods. Reverting this change temporarily.	6 months ago
Erick Friis	e6952b04d5	cohere[patch]: fix release (#19529 )	6 months ago
aditya thomas	aa68fd7e91	core[runnables]: docstring for class runnable, method with_listeners() (#19515 ) Description: Docstring for method with_listeners() of class Runnable Issue: [Add in code documentation to core Runnable methods #18804](https://github.com/langchain-ai/langchain/issues/18804) Dependencies: None	6 months ago
billytrend-cohere	63343b4987	cohere[patch]: add cohere as a partner package (#19049 ) Description: adds support for langchain_cohere --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	6 months ago
Eugene Yurtsev	727d5023ce	core[patch]: Use defusedxml in XMLOutputParser (#19526 ) This mitigates a security concern for users still on older versions of libexpat, where an attacker who manages to surface a malicious payload to this XML parser could compromise the availability of the system.	6 months ago
Zachary Wilkins	e1a6341940	langchain: Passthrough batch_size on index()/aindex() calls (#19443 ) Description: This change passes through `batch_size` to `add_documents()`/`aadd_documents()` on calls to `index()` and `aindex()` such that the documents are processed in the expected batch size. Issue: #19415 Dependencies: N/A Twitter handle: N/A	6 months ago
ccurme	82de8fd6c9	add kwargs (#19519 ) `HanaDB.add_texts` is missing **kwargs.	6 months ago
Nikhil Kumar	3d3b46a782	docs: Update docs for `HuggingFacePipeline` (#19306 ) Updated `HuggingFacePipeline` docs to be in sync with the list of supported tasks, including translation. - Description: Update docs for `HuggingFacePipeline`, which were missing `translation` as a valid task - Issue: N/A - Dependencies: N/A - Twitter handle: None	6 months ago
Igor Muniz Soares	743f888580	community[minor]: Dappier chat model integration (#19370 ) Description: This PR adds [Dappier](https://dappier.com/) for the chat model. It supports generate, async generate, and batch functionalities. We added unit and integration tests as well as a notebook with more details about our chat model. Dependencies: No extra dependencies are needed.	6 months ago
Hugoberry	96dc180883	community[minor]: Add `DuckDB` as a vectorstore (#18916 ) DuckDB has a cosine similarity function along list and array data types, which can be used as a vector store. - Description: The latest version of DuckDB features a cosine similarity function, which can be used with its support for list or array column types. This PR surfaces this functionality to langchain. - Dependencies: duckdb 0.10.0 - Twitter handle: @igocrite --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
preak95	6ea3e57a63	community[minor]: S3FileLoader to use expose mode and post_processors arguments of unstructured loader (#19270 ) Description: Update s3_file.py to use arguments mode and post_processors from the base class UnstructuredBaseLoader to include more metadata about the files from the S3 bucket such as 'page_number', 'languages' etc. Issue: NA Dependencies: None Twitter handle: preak95 --------- Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Guangdong Liu	560e2182d8	docs: docstring Runnable `pipe` and `pick` methods (docs only) (#19395 ) - Issue: #18804 - @eyurtsev @ccurme PTAL --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Christophe Bornet	63898dbda0	langchain[patch]: Use async memory in Chain when needed (#19429 )	6 months ago
Erick Friis	b617085af0	mistralai[patch]: streaming tool calls (#19469 )	6 months ago
fengjial	3b52ee05d1	community[patch]: fix bugs in baiduvectordb as vectorstore (#19380 ) fix small bugs in vectorstore/baiduvectordb	6 months ago
aditya thomas	515aab3312	community[patch]: invoke callback prior to yielding token (openai) (#19389 ) Description: Invoke callback prior to yielding token for BaseOpenAI & OpenAIChat Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	6 months ago
aditya thomas	49e932cd24	community[patch]: invoke callback prior to yielding token (fireworks) (#19388 ) Description: Invoke callback prior to yielding token for Fireworks Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	6 months ago
aditya thomas	16ef88a87d	docs: moving FireworksEmbeddings documentation to docs folder (#19398 ) Description: Moving FireworksEmbeddings documentation to the location docs/integration/text_embedding/ from langchain_fireworks/docs/ Issue: FireworksEmbeddings documentation was not in the correct location Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
Christophe Bornet	1b813fe6fe	langchain[patch]: Add async methods to VectorStoreRetrieverMemory (#19408 )	6 months ago
Tarun Jain	ef6d3d66d6	community[patch]: docarray requires hnsw installation (#19416 ) I have a small dataset, and I tried to use `DocArrayHnswSearch` from docarray. But when I execute it, it raises: `ImportError: Could not import docarray python package. Please install it with pip install "langchain[docarray]".` Instead of `docarray`, the install hint needs to be `docarray[hnswlib]`. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
German Swan	d4dc98a9f9	community[patch]: RecursiveUrlLoader: add base_url option (#19421 ) RecursiveUrlLoader does not currently provide an option to set `base_url` to anything other than the `url`, though it uses a function with such an option. For example, this makes it unable to crawl `https://python.langchain.com/docs`, as that URL returns the 404 page, and `https://python.langchain.com/docs/get_started/introduction` has no child routes to parse. `base_url` allows setting `https://python.langchain.com/docs` as the prefix to filter by, while the starting URL can be any page inside it that contains relevant links to continue crawling. I understand that for this case the docusaurus loader could be used, but it's a common issue with many websites. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
Erick Friis	e71daa7a03	openai[patch]: add test coverage to output (#19462 )	6 months ago
igeni	4babefcb2f	cli[patch]: Modified regular expression (#19449 ) - Description: Modified regular expression to add support for unicode chars and simplify pattern Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Bagatur	d93d49bc43	openai[patch]: tool use integration test (#19460 )	6 months ago
Erick Friis	a99e644913	openai[patch]: integration test structured output (#19459 )	6 months ago
Erick Friis	ac57123f40	openai[patch]: release 0.1.1 (#19458 )	6 months ago
Luca Dorigo	47cfbe7522	openai[patch]: [URGENT REGRESSION FIX] Don't fail if tool message already doesn't contain name (#19435 ) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
aditya thomas	bc028294d0	docs: delete mistralai embeddings doc from incorrect location (#19432 ) Description: Delete MistralAIEmbeddings usage document from folder partners/mistralai/docs Issue: The document is present in the folder docs/docs Dependencies: None	6 months ago
Erick Friis	11e37943ed	mistralai[patch]: fix core version (#19454 )	6 months ago
Erick Friis	3b093160c4	mistralai[patch]: release 0.1.0rc1 (#19453 )	6 months ago
aditya thomas	4856a87261	community[patch]: invoke callback prior to yielding token (llama.cpp) (#19392 ) Description: Invoke callback prior to yielding token for llama.cpp Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	6 months ago
ccurme	c4599444ee	mistralai: update tool calling (#19451 ) ```python from langchain.agents import tool from langchain_mistralai import ChatMistralAI llm = ChatMistralAI(model="mistral-large-latest", temperature=0) @tool def get_word_length(word: str) -> int: """Returns the length of a word.""" return len(word) tools = [get_word_length] llm_with_tools = llm.bind_tools(tools) llm_with_tools.invoke("how long is the word chrysanthemum") ``` currently raises ``` AttributeError: 'dict' object has no attribute 'model_dump' ``` Same with `.with_structured_output` ```python from langchain_mistralai import ChatMistralAI from langchain_core.pydantic_v1 import BaseModel class AnswerWithJustification(BaseModel): """An answer to the user question along with justification for the answer.""" answer: str justification: str llm = ChatMistralAI(model="mistral-large-latest", temperature=0) structured_llm = llm.with_structured_output(AnswerWithJustification) structured_llm.invoke("What weighs more a pound of bricks or a pound of feathers") ``` This appears to fix.	6 months ago
ccurme	8a2528c34a	[langchain] fix OpenAIAssistantRunnable.create_assistant (#19081 ) - Description: OpenAI assistants support some pre-built tools (e.g., `"retrieval"` and `"code_interpreter"`) and expect these as `{"type": "code_interpreter"}`. This may have been upset by https://github.com/langchain-ai/langchain/pull/18935 - Issue: https://github.com/langchain-ai/langchain/issues/19057	6 months ago
Harrison Chase	b40c80007f	core[minor]: Add utility code to create tool examples (#18602 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	6 months ago
Erick Friis	53ac1ebbbc	mistralai[minor]: 0.1.0rc0, remove mistral sdk (#19420 )	6 months ago
William FH	e980c14d6a	core[patch]: allow "placeholder" type in from_messages tuples (#19152 ) Co-authored-by: Erick Friis <erick@langchain.dev>	6 months ago
billytrend-cohere	f6bcd42421	community[patch]: Replace positional argument with text=text for cohere>=5 compatibility (#19407 ) - Description: Replace positional argument with text=text for cohere>=5 compatibility	6 months ago
enfeng	b20c2640da	anthropic[patch]: update base_url of anthropic (#18634 ) A small change ~ - [ ] update base_url: "package: langchain_anthropic" --------- Co-authored-by: yangenfeng <yangenfeng@xiaoniangao.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	6 months ago
Erick Friis	a9cda536ad	openai[patch]: fix core min version (#19366 )	6 months ago
Erick Friis	0b20c098df	openai[patch]: fix name param (#19365 )	6 months ago
Erick Friis	f6c8700326	openai[patch]: release 0.1.0, message id and name support (#19363 )	6 months ago
Bagatur	3fa711dce0	experimental[patch]: Release 0.0.55 (#19353 )	6 months ago
Erick Friis	2bcd760c46	robocorp[patch]: run integration tests on release (#19358 )	6 months ago
Erick Friis	a031c183ae	robocorp[patch]: release 0.0.4 (#19357 )	6 months ago
Bagatur	d95ea3550e	langchain[patch]: Release 0.1.13 (#19351 )	6 months ago
Bagatur	b58b38769d	community[patch]: Release 0.0.29 (#19350 )	6 months ago
Bagatur	5d220975fc	core[patch]: Release 0.1.33 (#19348 )	6 months ago
Eugene Yurtsev	aa9ccca775	langchain[patch]: Add tests for indexing (#19342 ) This PR adds tests for the indexing API	6 months ago
William FH	68298cdc82	[Feat] Accept non-dict if only 1 prompt input variable (#19156 ) For prompt templates with only 1 variable (common in e.g., MessageGraph), it's convenient to wrap the incoming object in the variable before formatting. The downside of this, of course, would be that some number of invocations will successfully format when the user may have intended to format it properly before	6 months ago
mackong	d9396bdec1	langchain[patch]: add stop for various non-openai agents (#19333 ) * Description: add stop for various non-openai agents. * Issue: N/A * Dependencies: N/A --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	6 months ago
Yudhajit Sinha	7d216ad1e1	community[patch]: Invoke callback prior to yielding token (titan_takeoff_pro) (#18624 ) - Description: Invoke callback prior to yielding token in the `_stream` method in llms/titan_takeoff_pro. - Issue: #16913 - Dependencies: None	6 months ago
Yudhajit Sinha	455a74486b	community[patch]: Invoke callback prior to yielding token (sparkllm) (#18625 ) - Description: Invoke callback prior to yielding token in the `_stream` method in llms/sparkllm. - Issue: #16913 - Dependencies: None	6 months ago
Yudhajit Sinha	5ac1860484	community[patch]: Invoke callback prior to yielding token (replicate) (#18626 ) - Description: Invoke callback prior to yielding token in the `_stream` method in llms/replicate. - Issue: #16913 - Dependencies: None	6 months ago
Yudhajit Sinha	9525e392de	community[patch]: Invoke callback prior to yielding token (pai_eas_endpoint) (#18627 ) - Description: Invoke callback prior to yielding token in the `_stream` method in llms/pai_eas_endpoint. - Issue: #16913 - Dependencies: None	6 months ago
Yudhajit Sinha	140f06e59a	community[patch]: Invoke callback prior to yielding token (openai) (#18628 ) - Description: Invoke callback prior to yielding token in the `_stream` method in llms/openai. - Issue: #16913 - Dependencies: None	6 months ago
Yudhajit Sinha	280a914920	community[patch]: Invoke callback prior to yielding token (ollama) (#18629 ) - Description: Invoke callback prior to yielding token in the `_stream` and `_astream` methods in llms/ollama. - Issue: #16913 - Dependencies: None	6 months ago
Christophe Bornet	00614f332a	community[minor]: Add InMemoryVectorStore (#19326 ) This is a basic VectorStore implementation using an in-memory dict to store the documents. It doesn't need any extra/optional dependency as it uses numpy, which is already a dependency of langchain. This is useful for quick testing, demos, and examples. It also makes it possible to write vendor-neutral tutorials, guides, etc.	6 months ago
Devesh Rahatekar	3c4529ac69	core: Updated docstring for RunnablePick (#18832 ) Description: Updated the docstring for RunnablePick. Added an Overview and an Example for the RunnablePick class. Issue: #18803	6 months ago
Leonid Ganeline	8609afbd10	core[patch]: Update `messages` namespace to fix API reference docs (#19161 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: - libs/core/langchain_core/messages/__init__.py : AnyMessage, MessageLikeRepresentation, get_buffer_string(), messages_from_dict(), ... Opinionated: __init__.py is not a typical place to define artifacts. Moved artifacts from __init__ into utils.py. Added `MessageLikeRepresentation` to __all__ since it is used outside of `messages`, for example, in `libs/core/langchain_core/language_models/base.py` Added `_message_from_dict` to __all__ since it is used outside of `messages`(???) I would add `message_from_dict` (without underscore) as an alias. Please, advise.	6 months ago
Christophe Bornet	4c2e887276	core: Simplify astream logic in BaseChatModel and BaseLLM (#19332 ) Covered by tests in `libs/core/tests/unit_tests/language_models/chat_models/test_base.py`, `libs/core/tests/unit_tests/language_models/llms/test_base.py` and `libs/core/tests/unit_tests/runnables/test_runnable_events.py`	6 months ago
Erick Friis	69e9610f62	openai[patch]: pass message name (#17537 )	6 months ago
Guangdong Liu	e5d7e455dc	splitters: Add ensure_ascii parameter (#18485 ) - Description: Add ensure_ascii parameter	6 months ago
Nithish Raghunandanan	7ad0a3f2a7	community: add Couchbase Vector Store (#18994 ) - Description: Added support for Couchbase Vector Search to LangChain. - Dependencies: couchbase>=4.1.12 - Twitter handle: @nithishr --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com>	6 months ago
Chris Papademetrious	305d74c67a	core: implement a batch_size parameter for CacheBackedEmbeddings (#18070 ) Description: Currently, `CacheBackedEmbeddings` computes vectors for all uncached documents before updating the store. This pull request updates the embedding computation loop to compute embeddings in batches, updating the store after each batch. I noticed this when I tried `CacheBackedEmbeddings` on our 30k document set and the cache directory hadn't appeared on disk after 30 minutes. The motivation is to minimize compute/data loss when problems occur: * If there is a transient embedding failure (e.g. a network outage at the embedding endpoint triggers an exception), at least the completed vectors are written to the store instead of being discarded. * If there is an issue with the store (e.g. no write permissions), the condition is detected early without computing (and discarding!) all the vectors. Issue: Implements enhancement #18026. Testing: I was unable to run unit tests; details in [this post](https://github.com/langchain-ai/langchain/discussions/15019#discussioncomment-8576684). --------- Signed-off-by: chrispy <chrispy@synopsys.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	6 months ago
William FH	89af30807b	Permit function eval on llm data type (#19287 )	6 months ago
Jib	f8078e41e5	mongodb[patch]: Added scoring threshold to caching (#19286 ) ## Description The semantic cache can retrieve noisy information if the score threshold is too low. The ability to set a `score_threshold` on cache construction reduces noisy matches. - [x] Add tests and docs 1. Added tests that confirm the `score_threshold` query is valid. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	6 months ago
Christophe Bornet	30e4a35d7a	community: Use langchain-astradb for AstraDB caches (#18419 ) - [x] Needs https://github.com/langchain-ai/langchain-datastax/pull/4 - [x] Needs a new release of langchain-astradb	6 months ago
Guangdong Liu	2c835baae4	code[patch]: Add in code documentation to core Runnable with_retry method (docs only) (#19192 ) - Description: Add in code documentation to core Runnable with_retry method (docs only) - Issue: #18804 @baskaryan @eyurtsev PTAL --------- Co-authored-by: ccurme <chester.curme@gmail.com>	6 months ago
Eugene Yurtsev	4b3dd34544	core[patch]: Pass sync run manager for sync stream fallback in astream (#19280 ) This PR patches the fallback in chat models and language models to pass in the appropriate version of the run manager (sync vs. async)	6 months ago
Leonid Ganeline	d314acb2d5	core[patch]: Move `globals` to a module instead of a package (non breaking change) (#19159 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: libs/core/langchain_core/globals/__init__.py : `set_verbose` `get_llm_cache`, `set_llm_cache`, ... And the whole `langchain_core.globals` namespace is not visible in the API Reference. The refactoring is just file renaming.	6 months ago
Al-Ekram Elahee Hridoy	50f93d86ec	core[minor]: Enhance cache flexibility in BaseChatModel (#17386 ) - Description: Enhanced the `BaseChatModel` to support an `Optional[Union[bool, BaseCache]]` type for the `cache` attribute, allowing for both boolean flags and custom cache implementations. Implemented logic within chat model methods to utilize the provided custom cache implementation effectively. This change aims to provide more flexibility in caching strategies for chat models. - Issue: Implements enhancement request #17242. - Dependencies: No additional dependencies required for this change. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	6 months ago
Zihong	ff31cc1648	experimental: update the notebook link of semantic chunk. (#19253 ) update the notebook link of semantic chunk.	6 months ago
Frederico Wu	f36418a5b0	langchain: creating assistants with file_ids (#19199 ) Changing OpenAIAssistantRunnable.create_assistant to send the `file_ids` parameter to openai.beta.assistants.create Co-authored-by: Frederico Wu <fred.diaswu@coxautoinc.com>	6 months ago
Vittorio Rigamonti	9b2f9ee952	community: VectorStore Infinispan, adding autoconfiguration (#18967 ) Description: this PR enables VectorStore autoconfiguration for Infinispan: if metadatas are only of basic types, the protobuf config will be automatically generated for the user.	6 months ago
Max Jakob	6f544a6a25	elasticsearch: check for deployed models (#18973 ) When creating a new index, if we use a retrieval strategy that expects a model to be deployed in Elasticsearch, check if a model with this name is indeed deployed before creating an index. This lowers the probability of getting into a state in which an index was created with a faulty model ID, which cannot be overwritten anymore (the index has to be deleted manually).	6 months ago
gonvee	b82644078e	community: Add `keep_alive` parameter to control how long the model w… (#19005 ) Add `keep_alive` parameter to control how long the model will stay loaded into memory with Ollama. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	6 months ago
Roshan Santhosh	7afecec280	core: update _rm_titles to account for title argument name bug (#19036 ) Issue: For functions that have an argument named 'title', convert_pydantic_to_openai_function generates incorrect output and omits the argument altogether. This is because the _rm_titles function removes all instances of the key 'title' from the output. Description: Updates the _rm_titles function to check for the presence of the 'type' key as well before removing the 'title' key, as the title key that we wish to omit always has a type key along with it. Potential gap if there is a function defined which has both title and key as argument names, in which case this would fail. Maybe we could set a filter on the function argument names and reject those with keyword argument names. No dependencies. Passed all tests. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	6 months ago
Harrison Chase	efcdf54edd	Josha91 fix docstring (#19249 ) Co-authored-by: Josha van Houdt <josha.van.houdt@sap.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	6 months ago
Simon Stone	58c7687174	langchain: preserve document metadata in `FlashrankRerank` (#19148 ) Description: Preserves document metadata in `FlashrankRerank` - Issue: #19142 - Dependencies: None - Twitter handle: n/a --------- Co-authored-by: Simon Stone <simon.stone@dartmouth.edu>	6 months ago
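
The batching behavior described in commit 305d74c67a (`batch_size` for `CacheBackedEmbeddings`) can be sketched roughly as follows. This is a minimal illustration of the idea, not the LangChain implementation: `embed_with_cache`, `embed_fn`, and the dict-backed `store` are hypothetical names chosen for the example. The point it shows is that the store is written after every batch, so a failure partway through keeps the vectors already computed.

```python
from typing import Callable, Dict, Iterator, List, Sequence

Vector = List[float]


def batched(items: Sequence[str], size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `items`, each holding at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def embed_with_cache(
    texts: Sequence[str],
    embed_fn: Callable[[List[str]], List[Vector]],  # hypothetical embedding call
    store: Dict[str, Vector],                       # hypothetical key-value cache
    batch_size: int,
) -> List[Vector]:
    """Embed only uncached texts, persisting results one batch at a time."""
    if batch_size < 1:
        raise ValueError("batch_size must be a positive integer")

    # Deduplicate while preserving order, then keep only cache misses.
    missing = [t for t in dict.fromkeys(texts) if t not in store]

    for batch in batched(missing, batch_size):
        vectors = embed_fn(list(batch))
        # Write immediately: if a later batch raises, this work is not lost,
        # and a broken store is detected after the first batch.
        store.update(zip(batch, vectors))

    return [store[t] for t in texts]
```

With a small `batch_size`, an exception raised by `embed_fn` on a later batch leaves every earlier batch in `store`, so a retry only has to embed the remaining texts.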

3505 Commits (2fceec37715256d0ce88a171f8482a938f2cd127)
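
The rule described in commit 7afecec280 (drop a `title` key only when a `type` key sits beside it) might look like the sketch below. It is an illustration written from the commit message alone, not the actual `_rm_titles` code in `langchain_core`; the function name `rm_titles` and the sample schema are assumptions made for the example.

```python
from typing import Any


def rm_titles(schema: Any) -> Any:
    """Recursively drop schema-annotation 'title' keys from a JSON-schema-like dict.

    A 'title' key is treated as an annotation only when its dict also has a
    'type' key. A 'title' key inside a 'properties' mapping (an argument that
    happens to be named 'title') normally has no 'type' sibling, so it is kept.
    """
    if isinstance(schema, dict):
        cleaned = {}
        for key, value in schema.items():
            if key == "title" and "type" in schema:
                continue  # annotation title: skip it
            cleaned[key] = rm_titles(value)
        return cleaned
    if isinstance(schema, list):
        return [rm_titles(item) for item in schema]
    return schema


# Hypothetical schema for a function with arguments named 'title' and 'pages'.
example = {
    "title": "Book",
    "type": "object",
    "properties": {
        "title": {"title": "Title", "type": "string"},
        "pages": {"title": "Pages", "type": "integer"},
    },
}
cleaned_example = rm_titles(example)
```

In this sketch the argument named `title` survives because the `properties` mapping has no `type` key. The check is a heuristic: if the arguments included both `title` and `type`, the `title` argument would still be dropped.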