langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
i-w-a	95ee69a301	langchain[patch]: In HTMLHeaderTextSplitter set default encoding to utf-8 (#16372 ) - Description: The HTMLHeaderTextSplitter Class now explicitly specifies utf-8 encoding in the part of the split_text_from_file method that calls the HTMLParser. - Issue: Prevent garbled characters due to differences in encoding of html files (except for English in particular, I noticed that problem with Japanese). - Dependencies: No dependencies, - Twitter handle: @i_w__a	2024-01-23 18:20:29 -08:00
Noah Stapp	e135e5257c	community[patch]: Include scores in MongoDB Atlas QA chain results (#14666 ) Adds the ability to return similarity scores when using `RetrievalQA.from_chain_type` with `MongoDBAtlasVectorSearch`. Requires that `return_source_documents=True` is set. Example use: ``` vector_search = MongoDBAtlasVectorSearch.from_documents(...) qa = RetrievalQA.from_chain_type( llm=OpenAI(), chain_type="stuff", retriever=vector_search.as_retriever(search_kwargs={"additional": ["similarity_score"]}), return_source_documents=True ) ... docs = qa({"query": "..."}) docs["source_documents"][0].metadata["score"] # score will be here ``` I've tested this feature locally, using a MongoDB Atlas Cluster with a vector search index.	2024-01-23 18:18:28 -08:00
Serena Ruan	90f5a1c40e	community[minor]: Improve mlflow callback (#15691 ) - Description: Allow passing run_id to MLflowCallbackHandler to resume a run instead of creating a new run. Support recording retriever relevant metrics. Refactor the code to fix some bugs. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-01-23 18:16:51 -08:00
Facundo Santiago	92e6a641fd	feat: adding paygo api support for Azure ML / Azure AI Studio (#14560 ) - Description: Introducing support for LLMs and Chat models running in Azure AI studio and Azure ML using the new deployment mode pay-as-you-go (model as a service). - Issue: NA - Dependencies: None. - Tag maintainer: @prakharg-msft @gdyre - Twitter handle: @santiagofacundo Examples added: * [docs/docs/integrations/llms/azure_ml.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_endpoint.ipynb) * [docs/docs/integrations/chat/azureml_chat_endpoint.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_chat_endpoint.ipynb) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 17:08:51 -08:00
Davide Menini	9ce177580a	community: normalize bedrock embeddings (#15103 ) In this PR I added a post-processing function to normalize the embeddings. This happens only if the new `normalize` flag is `True`. --------- Co-authored-by: taamedag <Davide.Menini@swisscom.com>	2024-01-23 17:05:24 -08:00
baichuan-assistant	20fcd49348	community: Fix Baichuan Chat. (#15207 ) - Description: Baichuan Chat (with both Baichuan-Turbo and Baichuan-Turbo-192K models) has updated their APIs. There are breaking changes. For example, BAICHUAN_SECRET_KEY is removed in the latest API but is still required in Langchain. Baichuan's Langchain integration needs to be updated to the latest version. - Issue: #15206 - Dependencies: None, - Twitter handle: None @hwchase17. Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-23 17:01:57 -08:00
gcheron	cfc225ecb3	community: SQLStrStore/SQLDocStore provide an easy SQL alternative to `InMemoryStore` to persist data remotely in a SQL storage (#15909 ) Description: - Implement `SQLStrStore` and `SQLDocStore` classes that inherits from `BaseStore` to allow to persist data remotely on a SQL server. - SQL is widely used and sometimes we do not want to install a caching solution like Redis. - Multiple issues/comments complain that there is no easy remote and persistent solution that are not in memory (users want to replace InMemoryStore), e.g., https://github.com/langchain-ai/langchain/issues/14267, https://github.com/langchain-ai/langchain/issues/15633, https://github.com/langchain-ai/langchain/issues/14643, https://stackoverflow.com/questions/77385587/persist-parentdocumentretriever-of-langchain - This is particularly painful when wanting to use `ParentDocumentRetriever ` - This implementation is particularly useful when: * it's expensive to construct an InMemoryDocstore/dict * you want to retrieve documents from remote sources * you just want to reuse existing objects - This implementation integrates well with PGVector, indeed, when using PGVector, you already have a SQL instance running. `SQLDocStore` is a convenient way of using this instance to store documents associated to vectors. An integration example with ParentDocumentRetriever and PGVector is provided in docs/docs/integrations/stores/sql.ipynb or [here](https://github.com/gcheron/langchain/blob/sql-store/docs/docs/integrations/stores/sql.ipynb). - It persists `str` and `Document` objects but can be easily extended. Issue: Provide an easy SQL alternative to `InMemoryStore`. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 16:50:48 -08:00
Massimiliano Pronesti	e529939c54	feat(llms): support more tasks in HuggingFaceHub LLM and remove deprecated dep (#14406 ) - Description: this PR upgrades the `HuggingFaceHub` LLM: * support more tasks (`translation` and `conversational`) * replaced the deprecated `InferenceApi` with `InferenceClient` * adjusted the overall logic to use the "recommended" model for each task when no model is provided, and vice-versa. - Tag mainter(s): @baskaryan @hwchase17	2024-01-23 16:48:56 -08:00
Erick Friis	afb25eeec4	cli[patch]: add integration tests to default makefile (#16479 )	2024-01-23 16:09:16 -07:00
Bagatur	ba326b98d0	langchain[patch]: Release 0.1.3 (#16475 )	2024-01-23 11:50:25 -08:00
Bagatur	54149292f8	community[patch]: Release 0.0.15 (#16474 )	2024-01-23 11:50:10 -08:00
Bagatur	ef6a335570	core[patch]: Release 0.1.15 (#16473 )	2024-01-23 11:31:50 -08:00
Erick Friis	1f4ac62dee	cli[patch], google-vertexai[patch]: readme template (#16470 )	2024-01-23 12:08:17 -07:00
Tomaz Bratanic	d0a8082188	Fix neo4j sanitize (#16439 ) Fix the sanitization bug and add an integration test	2024-01-23 10:56:28 -05:00
William FH	5de59f9236	Core[Patch] Parse tool input after on_start (#16430 ) For tracing, if a validation error occurs, currently it is attributed to the previous step of the chain. It would be nice to have the on_start and on_error callbacks called for tools when there is a validation error that occurs to more easily attribute the root-cause	2024-01-23 10:54:47 -05:00
Nuno Campos	226fe645f1	core[patch] Do not try to access attribute of None (#16321 )	2024-01-22 22:10:03 -08:00
Florian MOREL	4b7969efc5	community[minor]: New documents loader for visio files (with extension .vsdx) (#16171 ) Description : New documents loader for visio files (with extension .vsdx) A [visio file](https://fr.wikipedia.org/wiki/Microsoft_Visio) (with extension .vsdx) is associated with Microsoft Visio, a diagram creation software. It stores information about the structure, layout, and graphical elements of a diagram. This format facilitates the creation and sharing of visualizations in areas such as business, engineering, and computer science. A Visio file can contain multiple pages. Some of them may serve as the background for others, and this can occur across multiple layers. This loader extracts the textual content from each page and its associated pages, enabling the extraction of all visible text from each page, similar to what an OCR algorithm would do. Dependencies : xmltodict package	2024-01-22 22:07:03 -08:00
Boris Feld	404abf139a	community: Add CometLLM tracing context var (#15765 ) I also added LANGCHAIN_COMET_TRACING to enable the CometLLM tracing integration similar to other tracing integrations. This is easier for end-users to enable it rather than importing the callback and pass it manually. (This is the same content as https://github.com/langchain-ai/langchain/pull/14650 but rebased and squashed as something seems to confuse Github Action).	2024-01-22 15:17:16 -08:00
Nicolò Boschi	a500527030	infra: google-vertexai relax types-requests deps range (#16264 ) - Description: At the moment it's not possible to include in the same project langchain-google-vertexai and boto3 (e.g. use bedrock and vertex in the same application) because of the dependency resolutions conflict. boto3 is still using urllib3 1.x, meanwhile langchain-google-vertexai -> types-requests depends on urllib3 2.x. [the last version of types-requests that allows urllib3 1.x is 2.31.0.6](https://pypi.org/project/types-requests/#description). In this PR I allow the vertexai package to get that version also. - Twitter handle: nicoloboschi	2024-01-22 14:54:41 -08:00
DL	b9e7f6f38a	community[minor]: Bedrock async methods (#12477 ) Description: Added support for asynchronous streaming in the Bedrock class and corresponding tests. Primarily: async def aprepare_output_stream async def _aprepare_input_and_invoke_stream async def _astream async def _acall I've ensured that the code adheres to the project's linting and formatting standards by running make format, make lint, and make test. Issue: #12054, #11589 Dependencies: None Tag maintainer: @baskaryan Twitter handle: @dominic_lovric --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-01-22 14:44:49 -08:00
Frank995	5694728816	community[patch]: Implement vector length definition at init time in PGVector for indexing (#16133 ) Replace this entire comment with: - Description: allow user to define tVector length in PGVector when creating the embedding store, this allows for later indexing - Issue: #16132 - Dependencies: None	2024-01-22 14:32:44 -08:00
Chase VanSteenburg	1011b681dc	core[patch]: Fix f-string formatting in error message for configurable_fields (#16411 ) - Description: Simple fix to f-string formatting. Allows more informative ValueError output. - Issue: None needed. - Dependencies: None. - Twitter handle: @FlightP1an	2024-01-22 14:08:44 -08:00
parkererickson-tg	b26a22f307	community[minor]: add TigerGraph support (#16280 ) Description: Add support for querying TigerGraph databases through the InquiryAI service. Issue: N/A Dependencies: N/A Twitter handle: @TigerGraphDB	2024-01-22 14:07:44 -08:00
Alireza Kashani	d1b4ead87c	community[patch]: Update grobid.py (#16298 ) there is a case where "coords" does not exist in the "sentence" therefore, the "split(";")" will lead to error. we can fix that by adding "if sentence.get("coords") is not None:" the resulting empty "sbboxes" from this scenario will raise error at "sbboxes[0]["page"]" because sbboxes are empty. the PDF from https://pubmed.ncbi.nlm.nih.gov/23970373/ can replicate those errors.	2024-01-22 14:03:58 -08:00
s-g-1	fbe592a5ce	community[patch]: fix typo in pgvecto_rs debug msg (#16318 ) fixes typo in pip install message for the pgvecto_rs community vector store no issues found mentioning this no dependents changed	2024-01-22 14:01:33 -08:00
James Braza	d511366dd3	infra: absolute `EXAMPLE_DIR` path in core unit tests (#16325 ) If you invoked testing from places besides `core/`, this `EXAMPLE_DIR` path won't work. This PR makes`EXAMPLE_DIR` robust against invocation location	2024-01-22 14:00:23 -08:00
Ian	b9f5104e6c	communty[minor]: Store Message History to TiDB Database (#16304 ) This pull request integrates the TiDB database into LangChain for storing message history, marking one of several steps towards a comprehensive integration of TiDB with LangChain. A simple usage ```python from datetime import datetime from langchain_community.chat_message_histories import TiDBChatMessageHistory history = TiDBChatMessageHistory( connection_string="mysql+pymysql://<host>:<PASSWORD>@<host>:4000/<db>?ssl_ca=/etc/ssl/cert.pem&ssl_verify_cert=true&ssl_verify_identity=true", session_id="code_gen", earliest_time=datetime.utcnow(), # Optional to set earliest_time to load messages after this time point. ) history.add_user_message("hi! How's feature going?") history.add_ai_message("It's almot done") ```	2024-01-22 13:56:56 -08:00
Erick Friis	35ec0bbd3b	cli[patch]: pypi fields (#16410 )	2024-01-22 14:28:30 -07:00
Erick Friis	2ac3a82d85	cli[patch]: new fields in integration template, release 0.0.21 (#16398 )	2024-01-22 14:26:47 -07:00
Erick Friis	cfe95ab085	multiple: update langsmith dep (#16407 )	2024-01-22 14:23:11 -07:00
Eli Lucherini	6b2a57161a	community[patch]: allow additional kwargs in MlflowEmbeddings for compatibility with Cohere API (#15242 ) - Description: add support for kwargs in`MlflowEmbeddings` `embed_document()` and `embed_query()` so that all the arguments required by Cohere API (and others?) can be passed down to the server. - Issue: #15234 - Dependencies: MLflow with MLflow Deployments (`pip install mlflow[genai]`) Tests Now this code [adapted from the docs](https://python.langchain.com/docs/integrations/providers/mlflow#embeddings-example) for the Cohere API works locally. ```python """ Setup ----- export COHERE_API_KEY=... mlflow deployments start-server --config-path examples/deployments/cohere/config.yaml Run --- python /path/to/this/file.py """ embeddings = MlflowCohereEmbeddings(target_uri="http://127.0.0.1:5000", endpoint="embeddings") print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) ``` Output ``` [0.060455322, 0.028793335, -0.025848389] [0.031707764, 0.021057129, -0.009361267] ```	2024-01-22 11:38:11 -08:00
Guillem Orellana Trullols	aad2aa7188	community[patch]: BedrockChat -> Support Titan express as chat model (#15408 ) Titan Express model was not supported as a chat model because LangChain messages were not "translated" to a text prompt. Co-authored-by: Guillem Orellana Trullols <guillem.orellana_trullols@siemens.com>	2024-01-22 11:37:23 -08:00
Piotr Mardziel	1b9001db47	core[patch]: preserve inspect.iscoroutinefunction with @deprecated decorator (#16295 ) Adjusted `deprecate` decorator to make sure decorated async functions are still recognized as "coroutinefunction" by `inspect`. Before change, functions such as `LLMChain.acall` which are decorated as deprecated are not recognized as coroutine functions. After the change, they are recognized: ```python import inspect from langchain import LLMChain # Is false before change but true after. inspect.iscoroutinefunction(LLMChain.acall) ```	2024-01-22 11:34:13 -08:00
Katarina Supe	01c2f27ffa	community[patch]: Update Memgraph support (#16360 ) - Description: I removed two queries to the database and left just one whose results were formatted afterward into other type of schema (avoided two calls to DB) - Issue: / - Dependencies: / - Twitter handle: @supe_katarina	2024-01-22 11:33:28 -08:00
Max Jakob	8569b8f680	community[patch]: ElasticsearchStore enable max inner product (#16393 ) Enable max inner product for approximate retrieval strategy. For exact strategy we lack the necessary `maxInnerProduct` function in the Painless scripting language, this is why we do not add it there. Similarity docs: https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Joe McElroy <joseph.mcelroy@elastic.co>	2024-01-22 11:26:18 -08:00
Iskren Ivov Chernev	fc196cab12	community[minor]: DeepInfra support for chat models (#16380 ) Add deepinfra chat models support. This is https://github.com/langchain-ai/langchain/pull/14234 re-opened from my branch (so maintainers can edit).	2024-01-22 11:22:17 -08:00
Bagatur	85e8423312	community[patch]: Update bing results tool name (#16395 ) Make BingSearchResults tool name OpenAI functions compatible (can't have spaces). Fixes #16368	2024-01-22 11:11:03 -08:00
Max Jakob	de209af533	community[patch]: ElasticsearchStore: add relevance function selector (#16378 ) Implement similarity function selector for ElasticsearchStore. The scores coming back from Elasticsearch are already similarities (not distances) and they are already normalized (see [docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params)). Hence we leave the scores untouched and just forward them. This fixes #11539. However, in hybrid mode (when keyword search and vector search are involved) Elasticsearch currently returns no scores. This PR adds an error message around this fact. We need to think a bit more to come up with a solution for this case. This PR also corrects a small error in the Elasticsearch integration test. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-22 11:52:20 -07:00
y2noda	54f90fc6bc	langchain_google_vertexai:Enable the use of langchain's built-in tools in Gemini's function calling (#16341 ) - Issue: This is a PR about #16340 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: yuhei.tsunoda <yuhei.tsunoda@brainpad.co.jp>	2024-01-22 11:16:36 -07:00
Tom Jorquera	1445ac95e8	community[patch]: Enable streaming for GPT4all (#16392 ) `streaming` param was never passed to model	2024-01-22 09:54:18 -08:00
Bagatur	af9f1738ca	langchain[patch]: Release 0.1.2 (#16388 )	2024-01-22 09:32:24 -08:00
Bagatur	8779013847	community[patch]: Release 0.0.14 (#16384 )	2024-01-22 08:50:19 -08:00
Bagatur	9cf0f5eb78	core[patch]: Release 0.1.14 (#16382 )	2024-01-22 08:28:03 -08:00
Bagatur	1dc6c1ce06	core[patch], community[patch], langchain[patch], docs: Update SQL chains/agents/docs (#16168 ) Revamp SQL use cases docs. In the process update SQL chains and agents.	2024-01-22 08:19:08 -08:00
Bob Lin	acc14802d1	Fix `conn` field definition in SQLiteEntityStore (#15440 )	2024-01-22 07:53:49 -08:00
James Braza	e1c59779ad	core[patch]: Remove `print` statement on missing `grandalf` dependency in favor of more explicit ImportError (#16326 ) After this PR an ImportError will be raised without a print if grandalf is missing when using grandalf related code for printing runnable graphs.	2024-01-22 10:48:54 -05:00
Nuno Campos	971a68d04f	Docs: Update README.md in core (#16329 ) Docs: Update README.md in core	2024-01-22 10:42:31 -05:00
Eugene Yurtsev	89372fca22	core[patch]: Update sys info information (#16297 ) Update information collected in sys info. python -m langchain_core.sys_info System Information ------------------ > OS: Linux > OS Version: #14~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Mon Nov 20 18:15:30 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.1.10 > langchain: 0.1.0 > langchain_community: 0.0.11 > langchain_cli: 0.0.20 > langchain_experimental: 0.0.36 > langchain_openai: 0.0.2 > langchainhub: 0.1.14 > langserve: 0.0.19 Packages not installed (Not Necessarily a Problem) -------------------------------------------------- The following packages were not found: > langgraph	2024-01-22 10:18:04 -05:00
Luke	5396604ef4	community: Handling missing key in Google Trends API response. (#15864 ) - Description: Handing response where _interest_over_time_ is missing. - Issue: #15859 - Dependencies: None	2024-01-21 18:11:45 -08:00
Virat Singh	c2a614eddc	community: Add PolygonLastQuote Tool and Toolkit (#15990 ) Description: In this PR, I am adding a `PolygonLastQuote` Tool, which can be used to get the latest price quote for a given ticker / stock. Additionally, I've added a Polygon Toolkit, which we can use to encapsulate future tools that we build for Polygon. Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-21 15:08:55 -08:00
Nuno Campos	ef75bb63ce	core[patch] Fix tracer output of streamed runs with non-addable output (#16324 ) - Used to be None, now is just the last chunk <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-20 18:52:26 -08:00
Ryan French	3d23a5eb36	langchain[patch]: Allow OpenSearch Query Translator to correctly work with Date types (#16022 ) Description: Fixes an issue where the Date type in an OpenSearch Self Querying Retriever would fail to generate a valid query Issue: https://github.com/langchain-ai/langchain/issues/14225	2024-01-19 17:57:18 -08:00
Ofer Mendelevitch	ffae98d371	template: Update Vectara templates (#15363 ) fixed multi-query template for Vectara added self-query template for Vectara Also added prompt_name parameter to summarization CC @efriis Twitter handle: @ofermend	2024-01-19 17:32:33 -08:00
Bagatur	1e29b676d5	core[patch]: simple fallback streaming (#16055 )	2024-01-19 16:31:54 -08:00
Eugene Yurtsev	4ef0ed4ddc	astream_events: Add version parameter while method is in beta (#16290 ) Add a version parameter while the method is in beta phase. The idea is to make it possible to minimize making breaking changes for users while we're iterating on schema. Once the API is stable we can assign a default version requirement.	2024-01-19 13:20:02 -05:00
Bagatur	91230ef5d1	openai[patch]: Release 0.0.3 (#16289 )	2024-01-19 10:15:08 -08:00
Hamza Kyamanywa	39b3c6d94c	langchain[patch]: Add konlpy based text splitting for Korean (#16003 ) - Description: Adds a text splitter based on [Konlpy](https://konlpy.org/en/latest/#start) which is a Python package for natural language processing (NLP) of the Korean language. (It is like Spacy or NLTK for Korean) - Dependencies: Konlpy would have to be installed before this splitter is used, - Twitter handle: @untilhamza	2024-01-19 09:44:56 -08:00
Bagatur	e3828bee43	core[patch]: Release 0.1.13 (#16287 )	2024-01-19 09:28:31 -08:00
Bagatur	2454fefc53	docs: agent prompt docs (#16105 )	2024-01-19 09:19:22 -08:00
Bagatur	84bf5787a7	core[patch], openai[patch]: Chat openai stream logprobs (#16218 )	2024-01-19 09:16:09 -08:00
Carey	021b0484a8	community[patch]: add skipped test for inner product normalization (#14989 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 23:03:15 -08:00
Christophe Bornet	3ccbe11363	community[minor]: Add Cassandra document loader (#16215 ) - Description: document loader for Apache Cassandra - Twitter handle: cbornet_	2024-01-18 18:49:02 -08:00
mikeFore4	9d32af72ce	community[patch]: huggingface hub character removal bug fix (#16233 ) - Description: Some text-generation models on huggingface repeat the prompt in their generated response, but not all do! The tests use "gpt2" which DOES repeat the prompt and as such, the HuggingFaceHub class is hardcoded to remove the first few characters of the response (to match the len(prompt)). However, if you are using a model (such as the very popular "meta-llama/Llama-2-7b-chat-hf") that DOES NOT repeat the prompt in it's generated text, then the beginning of the generated text will be cut off. This code change fixes that bug by first checking whether the prompt is repeated in the generated response and removing it conditionally. - Issue: #16232 - Dependencies: N/A - Twitter handle: N/A	2024-01-18 18:44:10 -08:00
Andreas Motl	3613d8a2ad	community[patch]: Use SQLAlchemy's `bulk_save_objects` method to improve insert performance (#16244 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to save embeddings in batches, to improve its performance. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 Hi again from the CrateDB team, following up on GH-16243, this is another minor patch to the pgvector vector store adapter. Inserting embeddings in batches, using [SQLAlchemy's `bulk_save_objects`](https://docs.sqlalchemy.org/en/20/orm/session_api.html#sqlalchemy.orm.Session.bulk_save_objects) method, can deliver substantial performance gains. With kind regards, Andreas. NB: As I am seeing just now that this method is a legacy feature of SA 2.0, it will need to be reworked on a future iteration. However, it is not deprecated yet, and I haven't been able to come up with a different implementation, yet.	2024-01-18 18:35:39 -08:00
Eugene Yurtsev	177af65dc4	core[minor]: RFC Add astream_events to Runnables (#16172 ) This PR adds `astream_events` method to Runnables to make it easier to stream data from arbitrary chains. * Streaming only works properly in async right now * One should use `astream()` with if mixing in imperative code as might be done with tool implementations * Astream_log has been modified with minimal additive changes, so no breaking changes are expected * Underlying callback code / tracing code should be refactored at some point to handle things more consistently (OK for now) - ~~[ ] verify event for on_retry~~ does not work until we implement streaming for retry - ~~[ ] Any rrenaming? Should we rename "event" to "hook"?~~ - [ ] Any other feedback from community? - [x] throw NotImplementedError for `RunnableEach` for now ## Example See this [Example Notebook](`dbbc7fa0d6/docs/docs/modules/agents/how_to/streaming_events.ipynb`) for an example with streaming in the context of an Agent ## Event Hooks Reference Here is a reference table that shows some events that might be emitted by the various Runnable objects. Definitions for some of the Runnable are included after the table. \| event \| name \| chunk \| input \| output \| \|----------------------\|------------------\|---------------------------------\|-----------------------------------------------\|-------------------------------------------------\| \| on_chat_model_start \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| \| \| on_chat_model_stream \| [model name] \| AIMessageChunk(content="hello") \| \| \| \| on_chat_model_end \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| {"generations": [...], "llm_output": None, ...} \| \| on_llm_start \| [model name] \| \| {'input': 'hello'} \| \| \| on_llm_stream \| [model name] \| 'Hello' \| \| \| \| on_llm_end \| [model name] \| \| 'Hello human!' \| \| on_chain_start \| format_docs \| \| \| \| \| on_chain_stream \| format_docs \| "hello world!, goodbye world!" \| \| \| \| on_chain_end \| format_docs \| \| [Document(...)] \| "hello world!, goodbye world!" \| \| on_tool_start \| some_tool \| \| {"x": 1, "y": "2"} \| \| \| on_tool_stream \| some_tool \| {"x": 1, "y": "2"} \| \| \| \| on_tool_end \| some_tool \| \| \| {"x": 1, "y": "2"} \| \| on_retriever_start \| [retriever name] \| \| {"query": "hello"} \| \| \| on_retriever_chunk \| [retriever name] \| {documents: [...]} \| \| \| \| on_retriever_end \| [retriever name] \| \| {"query": "hello"} \| {documents: [...]} \| \| on_prompt_start \| [template_name] \| \| {"question": "hello"} \| \| \| on_prompt_end \| [template_name] \| \| {"question": "hello"} \| ChatPromptValue(messages: [SystemMessage, ...]) \| Here are declarations associated with the events shown above: `format_docs`: ```python def format_docs(docs: List[Document]) -> str: '''Format the docs.''' return ", ".join([doc.page_content for doc in docs]) format_docs = RunnableLambda(format_docs) ``` `some_tool`: ```python @tool def some_tool(x: int, y: str) -> dict: '''Some_tool.''' return {"x": x, "y": y} ``` `prompt`: ```python template = ChatPromptTemplate.from_messages( [("system", "You are Cat Agent 007"), ("human", "{question}")] ).with_config({"run_name": "my_template", "tags": ["my_template"]}) ```	2024-01-18 21:27:01 -05:00
SN	f175bf7d7b	Use env for revision id if not passed in as param; use `git describe` as backup (#16227 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-01-18 16:15:26 -08:00
Erick Friis	b9495da92d	langchain[patch]: fix stuff documents chain api docs render (#16159 )	2024-01-18 14:07:44 -08:00
Erick Friis	0e76d84137	google-vertexai[patch]: more integration test fixes (#16234 )	2024-01-18 13:59:23 -08:00
Erick Friis	aa35b43bcd	docs, google-vertex[patch]: function docs (#16231 )	2024-01-18 13:15:09 -08:00
Harrison Chase	f60f59d69f	google-vertexai[patch]: Harrison/vertex function calling (#16223 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 12:17:40 -08:00
Rajesh Thallam	6bc6d64a12	langchain_google_vertexai[patch]: Add support for SystemMessage for Gemini chat model (#15933 ) - Description: In Google Vertex AI, Gemini Chat models currently doesn't have a support for SystemMessage. This PR adds support for it only if a user provides additional convert_system_message_to_human flag during model initialization (in this case, SystemMessage would be prepended to the first HumanMessage). NOTE: The implementation is similar to #14824 - Twitter handle: rajesh_thallam --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 10:22:07 -08:00
Erick Friis	65b231d40b	mistralai[patch]: async integration tests (#16214 )	2024-01-18 09:45:44 -08:00
Eugene Zapolsky	6b9e3ed9e9	google-vertexai[minor]: added safety_settings property to gemini wrapper (#15344 ) Description: Gemini model has quite annoying default safety_settings settings. In addition, current VertexAI class doesn't provide a property to override such settings. So, this PR aims to - add safety_settings property to VertexAI - fix issue with incorrect LLM output parsing when LLM responds with appropriate 'blocked' response - fix issue with incorrect parsing LLM output when Gemini API blocks prompt itself as inappropriate - add safety_settings related tests I'm not enough familiar with langchain code base and guidelines. So, any comments and/or suggestions are very welcome. Issue: it will likely fix #14841 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 08:54:30 -08:00
Eugene Yurtsev	ecd4f0a7ec	core[patch]: testing add chat model for unit-tests (#16209 ) This PR adds a fake chat model for testing purposes. Used in this PR: https://github.com/langchain-ai/langchain/pull/16172	2024-01-18 11:30:53 -05:00
SN	7d444724d7	Add revision identifier to run_on_dataset (#16167 ) Allow specifying revision identifier for better project versioning	2024-01-17 20:27:43 -08:00
Eugene Yurtsev	5d8c147332	docs: Document and test PydanticOutputFunctionsParser (#15759 ) This PR adds documentation and testing to `PydanticOutputFunctionsParser(OutputFunctionsParser)`.	2024-01-17 18:21:18 -08:00
Christophe Bornet	3502a407d9	infra: Use dotenv in langchain-community's integration tests (#16137 ) * Removed some env vars not used in langchain package IT * Added Astra DB env vars in langchain package, used for cache tests * Added conftest.py to load env vars in langchain_community IT * Added .env.example in langchain_community IT	2024-01-17 18:18:26 -08:00
Nuno Campos	ca014d5b04	Update readme (#16160 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-17 13:56:07 -08:00
Tomaz Bratanic	1e80113ac9	community[patch]: Add neo4j timeout and value sanitization option (#16138 ) The timeout function comes in handy when you want to kill longrunning queries. The value sanitization removes all lists that are larger than 128 elements. The idea here is to remove embedding properties from results.	2024-01-17 13:22:19 -08:00
Krishna Shedbalkar	f238217cea	community[patch]: Basic Logging and Human input to ShellTool (#15932 ) - Description: As Shell tool is very versatile, while integrating it into applications as openai functions, developers have no clue about what command is being executed using the ShellTool. All one can see is: ![image](https://github.com/langchain-ai/langchain/assets/60742358/540e274a-debc-4564-9027-046b91424df3) Summarising my feature request: 1. There's no visibility about what command was executed. 2. There's no mechanism to prevent a command to be executed using ShellTool, like a y/n human input which can be accepted from user to proceed with executing the command., - Issue: the issue #15931 it fixes if applicable, - Dependencies: There isn't any dependancy, - Twitter handle: @krishnashed	2024-01-17 12:57:51 -08:00
Bagatur	679a3ae933	openai[patch]: clarify azure error (#16157 )	2024-01-17 12:43:14 -08:00
Bagatur	7ad9eba8f4	core[patch]: Release 0.1.12 (#16161 )	2024-01-17 12:39:45 -08:00
Leonid Kuligin	58f0ba306b	changed default params for gemini (#16044 ) Replace this entire comment with: - Description: changed default values for Vertex LLMs (to be handled on the SDK's side)	2024-01-17 12:19:18 -08:00
Bagatur	5c73fd5bba	core[patch]: support old core namespaces (#16155 )	2024-01-17 11:26:25 -08:00
Christophe Bornet	fb940d11df	community[patch]: Use newer MetadataVectorCassandraTable in Cassandra vector store (#15987 ) as VectorTable is deprecated Tested manually with `test_cassandra.py` vector store integration test.	2024-01-17 10:37:07 -08:00
Mohammad Mohtashim	1fa056c324	community[patch]: Don't set search path for unknown SQL dialects (#16047 ) - Description: Made a small fix for the `SQLDatabase` highlighted in an issue. The issue pertains to switching schema for different SQL engines. - Issue: #16023 @baskaryan	2024-01-17 10:31:11 -08:00
Erick Friis	11327e6b64	google-vertexai[patch]: typing, release 0.0.2 (#16153 )	2024-01-17 10:16:59 -08:00
Leonid Ganeline	2709d3e5f2	langchain[patch]: updated imports for `langchain.callbacks` (#16060 ) Updated imports from 'langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:06:59 -08:00
Leonid Ganeline	c5f6b828ad	langchain[patch], community[minor]: move `output_parsers.ernie_functions` (#16057 ) `output_parsers.ernie_functions` moved into `community`	2024-01-17 10:06:18 -08:00
Leonid Ganeline	49aff3ea5b	langchain[patch]: updated `agents` imports (#16061 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:02:29 -08:00
Leonid Ganeline	60b1bd02d7	langchain[patch]: updated imports for `output_parsers` (#16059 ) Updated imports from `langchain` to `core` where it is possible	2024-01-17 10:02:12 -08:00
Leonid Ganeline	9e9ad9b0e9	langchain[patch]: updated `retrievers` imports (#16062 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:01:06 -08:00
Leonid Ganeline	d350be959d	langchain[patch]: updated `chains` imports (#16064 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 09:58:42 -08:00
Fei Wang	d0e101e4e0	community[patch]: fix ollama astream (#16070 ) Update ollama.py	2024-01-17 09:42:41 -08:00
ChengZi	8597484195	langchain[patch]: support more comparators in Milvus self-querying retriever (#16076 ) - Description: Support IN and LIKE comparators in Milvus self-querying retriever, based on [Boolean Expression Rules](https://milvus.io/docs/boolean.md) - Issue: No - Dependencies: No - Twitter handle: No Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-01-17 09:41:23 -08:00
Kapil Sachdeva	f406dc3872	docs: in RunnableRetry, correct the example snippet that uses with_retry method on Runnable (#16108 ) The example code snippet for with_retry is using incorrect argument names. This PR fixes that	2024-01-17 09:11:27 -08:00
BeatrixCohere	b0c3e3db2b	community[patch]: Handle when documents are not provided in the Cohere response (#16144 ) - Description: This handles the cohere response when documents aren't included in the response - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-01-17 09:11:00 -08:00
Felix Krones	d91126fc64	community[patch]: missing unpack operator for or_clause in pgvector document filter (#16148 ) - Fix for #16146 - Adding unpack operation to "or" and "and" filter for pgvector retriever. #	2024-01-17 09:10:43 -08:00
Erick Friis	06fe2f4fb0	partners: add license field (#16117 ) - bumps package post versions for packages without current unreleased updates - will bump package version in release prs associated with packages that do have changes (mistral, vertex)	2024-01-17 08:37:13 -08:00
Erick Friis	ce10fe0c2f	mistralai[patch]: release 0.0.3 (#16116 ) embeddings	2024-01-17 08:36:05 -08:00
William FH	e5cf1e2414	Community[patch]use secret str in Tavily and HuggingFaceInferenceEmbeddings (#16109 ) So the api keys don't show up in repr's Still need to do tests	2024-01-17 00:30:07 -08:00
William FH	f3601b0aaf	Community[Patch] Remove docs form bm25 repr (#16110 ) Resolves: https://github.com/langchain-ai/langsmith-sdk/issues/356	2024-01-17 00:00:55 -08:00
David	c323742f4f	mistralai[minor]: Add embeddings (#15282 ) - Description: Adds MistralAIEmbeddings class for embeddings, using the new official API. - Dependencies: mistralai - Tag maintainer: @efriis, @hwchase17 - Twitter handle: @LMS_David_RS Create `integrations/text_embedding/mistralai.ipynb`: an example notebook for MistralAIEmbeddings class Modify `embeddings/__init__.py`: Import the class Create `embeddings/mistralai.py`: The embedding class Create `integration_tests/embeddings/test_mistralai.py`: The test file. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:48:37 -08:00
Leonid Kuligin	4df14a61fc	google-vertexai[minor]: add function calling on VertexAI (#15822 ) Replace this entire comment with: - Description: Description: added support for tools on VertexAI - Issue: #15073 - Twitter handle: lkuligin --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:01:26 -08:00
Bagatur	8840a8cc95	docs: tool-use use case (#15783 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-16 10:41:14 -08:00
Bagatur	3d34347a85	langchain[patch]: bump core dep to 0.1.9 (#16104 )	2024-01-16 10:39:07 -08:00
Bagatur	62a2e9ee19	langchain[patch]: Release 0.1.1 (#16103 )	2024-01-16 10:17:38 -08:00
Bagatur	076593382a	core[patch]: Release 0.1.11 (#16100 )	2024-01-16 09:46:04 -08:00
Bagatur	c5656a4905	core[patch]: pass exceptions to fallbacks (#16048 )	2024-01-16 09:36:43 -08:00
Nuno Campos	770f57196e	Add unit test for overridden lc_namespace (#16093 )	2024-01-16 09:22:52 -08:00
Erick Friis	52114bdfac	community[patch]: release 0.0.13 (#16087 )	2024-01-16 06:25:28 -08:00
James Briggs	ca288d8f2c	community[patch]: add vector param to index query for pinecone vec store (#16054 )	2024-01-16 06:12:19 -08:00
Antonio Morales	476fb328ee	community[patch]: implement adelete from VectorStore in Qdrant (#16005 ) Description: Implement `adelete` function from `VectorStore` in `Qdrant` to support other asynchronous flows such as async indexing (`aindex`) which requires `adelete` to be implemented. Since `Qdrant` can be passed an async qdrant client, this can be supported easily. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 19:57:09 -08:00
Bagatur	697a6f2c80	langchain[patch]: fix requests lint (#16049 )	2024-01-15 12:54:30 -08:00
高远	061e63eef2	community[minor]: add vikingdb vecstore (#15155 ) --------- Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-01-15 12:34:01 -08:00
andrijdavid	d196646811	community[patch]: Refactor OpenAIWhisperParserLocal (#15150 ) This PR addresses an issue in OpenAIWhisperParserLocal where requesting CUDA without availability leads to an AttributeError #15143 Changes: - Refactored Logic for CUDA Availability: The initialization now includes a check for CUDA availability. If CUDA is not available, the code falls back to using the CPU. This ensures seamless operation without manual intervention. - Parameterizing Batch Size and Chunk Size: The batch_size and chunk_size are now configurable parameters, offering greater flexibility and optimization options based on the specific requirements of the use case. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 12:29:14 -08:00
Zhichao HAN	5cf06db3b3	community[minor]: add JsonRequestsWrapper tool (#15374 ) Description: This new feature enhances the flexibility of pipeline integration, particularly when working with RESTful APIs. ``JsonRequestsWrapper`` allows for the decoding of JSON output, instead of the only option for text output. --------- Co-authored-by: Zhichao HAN <hanzhichao2000@hotmail.com>	2024-01-15 12:27:19 -08:00
chyroc	d334efc848	community[patch]: fix top_p type hint (#15452 ) fix: https://github.com/langchain-ai/langchain/issues/15341 @efriis	2024-01-15 11:59:39 -08:00
Mateusz Szewczyk	251afda549	community[patch]: fix stop (stop_sequences) param on WatsonxLLM (#15541 ) - Description: Fix to IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM provider (stop (`stop_sequences`) param on watsonxLLM) - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/),	2024-01-15 11:44:57 -08:00
Funkeke	7220124368	community[patch]: fix tongyi completion and params error (#15544 ) fix tongyi completion json parse error and prompt's params error --------- Co-authored-by: fangkeke <3339698829@qq.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 11:43:13 -08:00
盐粒 Yanli	ddf4e7c633	community[minor]: Update pgvecto_rs to use its high level sdk (#15574 ) - Description: Update pgvecto_rs to use its high level sdk, - Issue: fix #15173	2024-01-15 11:41:59 -08:00
YHW	ce21392a21	community: add a flag that determines whether to load the milvus collection (#15693 ) fix https://github.com/langchain-ai/langchain/issues/15694 --------- Co-authored-by: hyungwookyang <hyungwookyang@worksmobile.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:25:23 -08:00
Mohammad Mohtashim	9e779ca846	community[patch]: Fixing the SlackGetChannel Tool Input Error (#15725 ) Fixed the issue mentioned in #15698 for SlackGetChannel Tool. @baskaryan. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:23:55 -08:00
axiangcoding	daa9ccae52	community[patch]: deprecate ErnieBotChat and ErnieEmbeddings classes (#15862 ) - Description: add deprecated warning for ErnieBotChat and ErnieEmbeddings. - These two classes lack maintenance and do not use the sdk provided by qianfan, which means hard to implement some key feature like streaming. - The alternative `langchain_community.chat_models.QianfanChatEndpoint` and `langchain_community.embeddings.QianfanEmbeddingsEndpoint` can completely replace these two classes, only need to change configuration items. - Issue: None, - Dependencies: None, - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:14:44 -08:00
JaguarDB	b11fd3bedc	community[patch]: jaguar vector store fix integer-element error when joining metadata values (#15939 ) - Description: some document loaders add integer-type metadata values which cause error - Issue: 15937 - Dependencies: none --------- Co-authored-by: JY <jyjy@jaguardb>	2024-01-15 11:13:45 -08:00
Neo Zhao	21e0df937f	community[patch]: fix a bug that mistakenly handle zip iterator in FAISS.from_embeddings (#16020 ) Description: `zip` is iterator that will only produce result once, so the previous code will cause the `embeddings` to be an empty list. Issue: I could not find a related issue. Dependencies: this PR does not introduce or affect dependencies. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:13:14 -08:00
Christophe Bornet	15c2b4a47e	community[minor]: Add AstraDB self query retriever (#15738 ) - Description: this change adds a self-query retriever for AstraDB - Twitter handle: cbornet_	2024-01-15 11:04:11 -08:00
Leonid Ganeline	fb676d8a9b	community[minor], langchain[minor]: refactor `output_parsers` Rail (#15852 ) Moved Rail parser to `community` package.	2024-01-15 10:54:49 -08:00
Massimiliano Pronesti	e80aab2275	docs(community): update Amadeus toolkit to langchain v0.1 (#15976 ) - Description: docs update following the changes introduced in #15879 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-15 10:50:47 -08:00
Ashley Xu	ce7723c1e5	community[minor]: add additional support for `BigQueryVectorSearch` (#15904 ) BigQuery vector search lets you use GoogleSQL to do semantic search, using vector indexes for fast but approximate results, or using brute force for exact results. This PR: 1. Add `metadata[_job_ib]` in Document returned by any similarity search 2. Add `explore_job_stats` to enable users to explore job statistics and better the debuggability 3. Set the minimum row limit for running create vector index.	2024-01-15 10:45:15 -08:00
Mohammed Naqi	8799b028a6	community[minor]: Adding asynchronous function implementation for Doctran (#15941 ) ## Description In this update, I addressed the missing implementation for atransform_document, which is the asynchronous counterpart of transform_document in Doctran. ### Usage Example: ```py # Instantiate DoctranPropertyExtractor with specified properties property_extractor = DoctranPropertyExtractor(properties=properties) # Asynchronously extract properties from a list of documents extracted_document = await property_extractor.atransform_documents( documents, properties=properties ) # Display metadata of the first extracted document print(json.dumps(extracted_document[0].metadata, indent=2)) ``` ## Issue - Pull request #14525 has caused a break in the aforementioned code. Instead of removing an asynchronous implementation of a function, consider implementing a synchronous version alongside it.	2024-01-15 10:39:25 -08:00
Raunak	c0773ab329	community[patch]: Fixed 'coroutine' object is not subscriptable error (#15986 ) - Description: Added parenthesis in return statement of aembed_query() funtion to fix 'coroutine' object is not subscriptable error. - Dependencies: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-15 10:34:10 -08:00
Karim Lalani	14244bd7e5	community[minor]: Added document loader for SurrealDB (#15995 ) Added a simple document loader to work with SurrealDB.	2024-01-15 10:32:42 -08:00
Karim Lalani	768e5e33bc	community[minor]: Fix to match SurrealDB 0.3.2 SDK (#15996 ) New version of SurrealDB python sdk was causing the integration to break. This fix addresses that change.	2024-01-15 10:31:59 -08:00
shahrin014	86321a949f	community: Ollama - Parameter structure to follow official documentation (#16035 ) ## Feature - Follow parameter structure as per official documentation - top level parameters (e.g. model, system, template) will be passed as top level parameters - other parameters will be sent in options unless options is provided ![image](https://github.com/langchain-ai/langchain/assets/17451563/d14715d9-9701-4ee3-b44b-89fffea62389) ## Tests - Test if top level parameters handled properly - Test if parameters that are not top level parameters are handled as options - Test if options is provided, it will be passed as is	2024-01-15 10:17:58 -08:00
Nir Kopler	0fa06732b7	community: add new gpt-3.5-turbo-1106 finetuned for cost calculation (#16039 ) Description: Added the new gpt-3.5-turbo-1106 for finetuned cost calculation, Issue: no issue found open By the information in OpenAI the pricing is the same as the older model (0613)	2024-01-15 08:36:54 -08:00
Bagatur	bccb07f93e	core[patch]: simple prompt pretty printing (#15968 )	2024-01-12 21:08:51 -05:00
Virat Singh	eb6e385dc5	community: Add PolygonAPIWrapper and get_last_quote endpoint (#15971 ) - Description: Added a `PolygonAPIWrapper` and an initial `get_last_quote` endpoint, which allows us to get the last price quote for a given `ticker`. Once merged, I can add a Polygon tool in `tools/` for agents to use. - Twitter handle: [@virattt](https://twitter.com/virattt) The Polygon.io Stocks API provides REST endpoints that let you query the latest market data from all US stock exchanges.	2024-01-12 17:52:09 -08:00
Erick Friis	74bac7bda1	community[patch]: core min 0.1.9 (#15974 )	2024-01-12 15:32:06 -08:00
Erick Friis	845e407e08	community[patch]: release 0.0.12 (#15973 )	2024-01-12 15:27:05 -08:00
Jonathan Algar	a74f3a4979	Batch update of alt text and title attributes for images in md/mdx files across repo (#15357 ) Description: Batch update of alt text and title attributes for images in `md` & `mdx` files across the repo using [alttexter](https://github.com/jonathanalgar/alttexter)/[alttexter-ghclient](https://github.com/jonathanalgar/alttexter-ghclient) (built using LangChain/LangSmith). Limitation: cannot update `ipynb` files because of [this issue](https://github.com/langchain-ai/langchain/pull/15357#issuecomment-1885037250). Can revisit when Docusaurus is bumped to v3. I checked all the generated alt texts and titles and didn't find any technical inaccuracies. That's not to say they're _perfect_, but a lot better than what's there currently. [Deployed](https://langchain-819yf1tbk-langchain.vercel.app/docs/modules/model_io/) image example: ![chrome_yZQ7BF2GTj](https://github.com/langchain-ai/langchain/assets/93204286/43a9a4d4-70fd-41c4-8978-b6240ff63ffa) You can see LangSmith traces for all the calls out to the LLM in the PRs merged into this one: * https://github.com/jonathanalgar/langchain/pull/6 * https://github.com/jonathanalgar/langchain/pull/4 * https://github.com/jonathanalgar/langchain/pull/3 I didn't add the following files to the PR as the images already have OK alt texts: * `27dca2d92f/docs/docs/integrations/providers/argilla.mdx (L3)` * `27dca2d92f/docs/docs/integrations/providers/apify.mdx (L11)` --------- Co-authored-by: github-actions <github-actions@github.com>	2024-01-12 14:37:48 -08:00
Varik Matevosyan	efe6cfafe2	community: Added Lantern as VectorStore (#12951 ) Support [Lantern](https://github.com/lanterndata/lantern) as a new VectorStore type. - Added Lantern as VectorStore. It will support 3 distance functions `l2 squared`, `cosine` and `hamming` and will use `HNSW` index. - Added tests - Added example notebook	2024-01-12 12:00:16 -08:00
Harrison Chase	1afac77439	stop making copies of inputs (#15926 )	2024-01-12 11:49:26 -08:00
Edwin Wenink	9fb09c1c30	community: fix the "page" mode in the AzureAIDocumentIntelligenceParser (bug) (#15958 ) Description: the "page" mode in the AzureAIDocumentIntelligenceParser is not accessible due to a wrong membership test. The mode argument can only be a string (also see the assertion in the `__init__`: `assert self.mode in ["single", "page", "object", "markdown"]`, so the check `elif self.mode == ["page"]:` always fails. As a result, effectively the "object" mode is used when selecting the "page" mode, which may lead to errors. The docstring of the `AzureAIDocumentIntelligenceLoader` also ommitted the `mode` parameter alltogether, so I added it. Issue: I could not find a related issue (this class is only 3 weeks old anyways) Dependencies: this PR does not introduce or affect dependencies. The current demo notebook and examples are not affected because they all use the default markdown mode.	2024-01-12 11:01:28 -08:00
Mahdi Setayesh	eb76f9c9fe	community: Fixing a performance issue with AzureSearch to perform batch embedding (#15594 ) - Description: Azure Cognitive Search vector DB store performs slow embedding as it does not utilize the batch embedding functionality. This PR provide a fix to improve the performance of Azure Search class when adding documents to the vector search, - Issue: #11313 , - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-12 10:58:55 -08:00
Bagatur	c697c89ca4	docs: add agent prompt creation examples (#15957 )	2024-01-12 10:26:12 -08:00
Erick Friis	69533c8628	multiple[patch]: .post releases and pyproject metadata (#15962 )	2024-01-12 10:09:02 -08:00
Erick Friis	95020637bc	openai[patch]: 0.0.2.post1, urls (#15961 )	2024-01-12 09:36:37 -08:00
ChengZi	d5808f786c	community: Support milvus partition key. (#15740 ) - Description: Milvus's partition key is an important feature. It can support multi-tenancy. We hope to introduce this feature. https://milvus.io/docs/partition_key.md - Issue: No - Dependencies: No - Twitter handle: No --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-12 09:15:03 -08:00
enfeng	13b90232c1	langchain-google-genai[patch]: Add support for end_point and transport parameters to the Gemini API (#15532 ) Add support for end_point and transport parameters to the Gemini API --------- Co-authored-by: yangenfeng <yangenfeng@xiaoniangao.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-12 08:52:00 -08:00
ohbeep	9b3962fc25	community: Add support of "http" URI for Milvus (#12710 ) (#15683 ) - Description: Add support of HTTP URI for Milvus - Issue: #12710 - Dependencies: N/A,	2024-01-11 21:55:35 -08:00
Raunak	e26e1f8b37	community: Added functions to make async calls to HuggingFaceHub's embedding endpoint in HuggingFaceHubEmbeddings class (#15737 ) Description: Added aembed_documents() and aembed_query() async functions in HuggingFaceHubEmbeddings class in langchain_community\embeddings\huggingface_hub.py file. It will support to make async calls to HuggingFaceHub's embedding endpoint and generate embeddings asynchronously. Test Cases: Added test_huggingfacehub_embedding_async_documents() and test_huggingfacehub_embedding_async_query() functions in test_huggingface_hub.py file to test the two async functions created in HuggingFaceHubEmbeddings class. Documentation: Updated huggingfacehub.ipynb with steps to install huggingface_hub package and use HuggingFaceHubEmbeddings. Dependencies: None, Twitter handle: I do not have a Twitter account --------- Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-11 21:52:55 -08:00
Tal	eb9b334a6b	Enable customizing the output parser of `OpenAIFunctionsAgent` (#15827 ) - Description: This PR defines the output parser of OpenAIFunctionsAgent as an attribute, enabling customization and subclassing of the parser logic. - Issue: Subclassing is currently impossible as the `OpenAIFunctionsAgentOutputParser` class is hard coded into the `plan` and `aplan` methods - Dependencies: None <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-11 21:52:36 -08:00
Christophe Bornet	81d1ba05dc	Add a BaseStore backed by AstraDB (#15812 ) - Description: this change adds a `BaseStore` backed by AstraDB - Twitter handle: cbornet_	2024-01-11 21:41:24 -08:00
manishsahni2000	74d9fc2f9e	PR community:Removing knn beta content in mongodb atlas vectorstore (#15865 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 21:40:54 -08:00
shahrin014	bdd90ae2ee	community: Ollama - Pass headers to post request (#15881 ) ## Feature - Set additional headers in constructor - Headers will be sent in post request This feature is useful if deploying Ollama on a cloud service such as hugging face, which requires authentication tokens to be passed in the request header. ## Tests - Test if header is passed - Test if header is not passed	2024-01-11 21:40:35 -08:00
Xin Liu	5efec068c9	feat: Implement `stream` interface (#15875 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Major changes: - Rename `wasm_chat.py` to `llama_edge.py` - Rename the `WasmChatService` class to `ChatService` - Implement the `stream` interface for `ChatService` - Add `test_chat_wasm_service_streaming` in the integration test - Update `llama_edge.ipynb` --------- Signed-off-by: Xin Liu <sam@secondstate.io>	2024-01-11 21:32:48 -08:00
Massimiliano Pronesti	ec4dab0449	feat(community): make Amadeus toolkit LLM-agnostic (#15879 ) - Description: `AmadeusToolkit` and `AmadeusClosestAirport` contained a hardcoded call to `ChatOpenAI`. This PR makes it LLM-independent, while guaranteeing backward compatibility. - Issue: #15847 - Dependencies: None @baskaryan <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 21:32:03 -08:00
JanHorcicka	f454e95461	langchain: fix OutputParserException (#15914 ) (#15916 ) Description: Fixes OutputParserException thrown by the output_parser when 'query' is 'Null'. Replace this entire comment with: - Description: Current implentation of output_parser throws OutputParserException if the response from the LLM contains `query: null`. This unfortunately happens for my use case. And since there is no way to modify the prompt used in SelfQueryRetriever, then we have to fix it here, so it doesn't crash. - Issue: https://github.com/langchain-ai/langchain/issues/15914 Didn't run tests. `make test` is not working. There is no `test` rule in the `Makefile`. Co-authored-by: Jan Horcicka <jhorcick@amazon.com>	2024-01-11 21:26:45 -08:00
Yacine	782dd44be9	<langchain_community.vectorstores>:<Fix pinecone.py __init__ docsrting instruction> (#15922 ) - Description: The pinecone docstring instructs to pass the embedding query text causing the warning below. It should be the embeddings object. warning message: UserWarning: Passing in `embedding` as a Callable is deprecated. Please pass in an Embeddings object instead. - Issue: NA - Dependencies: None @baskaryan	2024-01-11 21:26:33 -08:00
Nuno Campos	112208baa5	Passthrough configurable primitive values as tracer metadata (#15915 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 18:47:55 -08:00
William FH	129552e3d6	Rm deprecated (#15920 ) Remove the usage of deprecated methods in the test runner.	2024-01-11 18:10:49 -08:00
Nuno Campos	438beb6c94	Pass config specs through ensemble retriever (#15917 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-11 16:22:17 -08:00
Erick Friis	ebb6ad4f7a	mistralai[patch]: release 0.0.2 (#15912 )	2024-01-11 13:42:04 -08:00
Erick Friis	437cebc955	core[patch]: release 0.1.10 (#15911 )	2024-01-11 13:39:06 -08:00
Harrison Chase	80d41a8da3	add old serializable mapping (#15906 )	2024-01-11 13:03:12 -08:00
Erick Friis	623f87c888	community[patch]: pinecone bug (#15905 )	2024-01-11 11:44:07 -08:00
axiangcoding	d5aa277b94	community: add collection_properties parameter to Milvus (#15788 ) - Description: add collection_properties parameter to Milvus. See [pymilvus set_properties() description](https://milvus.io/api-reference/pymilvus/v2.3.x/Collection/set_properties().md) - Issue: None - Dependencies: None - Twitter handle: None	2024-01-10 20:29:01 -08:00
mogith-pn	9e1ed17bfb	Community : Modified doc strings and example notebook for Clarifai (#15816 ) Community : Modified doc strings and example notebook for Clarifai Description: 1. Modified doc strings inside clarifai vectorstore class and embeddings. 2. Modified notebook examples. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-10 19:33:10 -08:00
Erick Friis	38523d7c57	together[minor]: add llm (#15853 )	2024-01-10 17:55:34 -08:00
Erick Friis	ee708739c3	community[patch]: pinecone v3 support (#15849 ) Info in slack --------- Co-authored-by: Roie Schwaber-Cohen <roie.cohen@gmail.com>	2024-01-10 14:54:50 -08:00
Eugene Yurtsev	a06db53c37	Add unit tests to test openai tools agent (#15843 ) This PR adds unit testing to test openai tools agent.	2024-01-10 17:06:30 -05:00
Bagatur	79124fd71d	experimental[patch]: Release 0.0.49 (#15823 )	2024-01-10 11:23:19 -05:00
Harrison Chase	20abe24819	experimental[minor]: Add semantic chunker (#15799 )	2024-01-10 11:18:30 -05:00
Eugene Yurtsev	feb41c5e28	langchain[patch]: Improve stream_log with AgentExecutor and Runnable Agent (#15792 ) This PR fixes an issue where AgentExecutor with RunnableAgent does not allow users to see individual llm tokens if streaming=True is not set explicitly on the underlying chat model. The majority of this PR is testing code: 1. Create a test chat model that makes it easier to test streaming and supports AIMessages that include function invocation information. 2. Tests for the chat model 3. Tests for RunnableAgent (previously untested) 4. Tests for openai agent (previously untested)	2024-01-10 10:53:01 -05:00
Erick Friis	85a4594ed7	community[patch]: more deprecations (#15782 )	2024-01-09 20:36:16 -08:00
Erick Friis	33dccf0f66	core[patch]: release 0.1.9 (#15794 )	2024-01-09 19:27:19 -08:00
Erick Friis	0c95f3a981	mistralai[patch]: warn on stop token, fix on_llm_new_token (#15787 ) Fixes #15269 Addresses with warning. MistralAI API doesn't support stop token yet. --------- Co-authored-by: Niels Garve <info@nielsgarve.com>	2024-01-09 16:27:20 -08:00
Erick Friis	323941a90a	mistralai[patch]: persist async client (#15786 )	2024-01-09 16:21:39 -08:00
NuODaniel	70b6315b23	community[patch]: fix qianfan chat stream calling caused exception (#13800 ) - Description: `QianfanChatEndpoint` extends `BaseChatModel` as a super class, which has a default stream implement might concat the MessageChunk with `__add__`. When call stream(), a ValueError for duplicated key will be raise. - Issues: * #13546 * #13548 * merge two single test file related to qianfan. - Dependencies: no - Tag maintainer: --------- Co-authored-by: root <liujun45@baidu.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-09 15:29:25 -08:00
Erick Friis	656e87beb9	core[patch]: add alternative_import to deprecated (#15781 )	2024-01-09 14:45:28 -08:00
Erick Friis	04a5a37e92	robocorp[patch]: fix readme, release 0.0.1.post1 (#15777 )	2024-01-09 12:53:57 -08:00
Erick Friis	91ec9da534	openai[patch]: unit test load (#15624 )	2024-01-09 11:54:11 -08:00
Erick Friis	7be72e1103	openai[patch], docs: readme (#15773 )	2024-01-09 11:52:24 -08:00
Bagatur	ee5bd986de	community[patch]: update oai deprecation message (#15681 ) addresses #15674	2024-01-09 14:36:58 -05:00
Erick Friis	7562f70c95	robocorp[minor]: Add robocorp action server toolkit (#15766 ) Co-authored-by: Rihards Gravis <rihards@gravis.lv> Co-authored-by: Mikko Korpela <mikko@robocorp.com>	2024-01-09 11:29:19 -08:00
Erick Friis	4ed3d17c47	community[patch]: release 0.0.11 (#15760 )	2024-01-09 09:44:26 -08:00
Bagatur	da395f3182	experimental[patch]: loosen core max version (#15763 )	2024-01-09 12:10:14 -05:00
William FH	04caf07dee	Make packages optional (#15727 ) So we don't have to instruct people to modify the Dockerfile every time they delete the packages directory. See: https://stackoverflow.com/questions/70096208/dockerfile-copy-folder-if-it-exists-conditional-copy/70096420#70096420 Tested on a new repo	2024-01-08 17:09:21 -08:00
Eugene Yurtsev	3a8ad90509	langchain(patch): Fix output type for pydantic output parser (#15714 ) This PR fixes the output type for the pydantic output parser. Fix for: https://github.com/langchain-ai/langserve/issues/301	2024-01-08 16:53:10 -05:00
Erick Friis	95a2c92e26	experimental[patch]: minimum version bump (#15724 ) - experimental: minimum version bump - actually 0.1.5 - actually 0.1.7	2024-01-08 13:04:57 -08:00
Erick Friis	6c9b7c2cec	experimental: minimum version bump (#15722 ) experimental relies on `from langchain_core.runnables.config import run_in_executor` which was introduced in core 0.1.5. Updated pyproject dependency as well as minimum version test.	2024-01-08 12:58:24 -08:00
Ian	32ec56194b	community: fix myscale delete function bug (#15675 ) Now the SQL used to delete vector doc from myscale is as follow: ```sql DELETE FROM collection WHERE id = '1' AND id = '2' AND id = '3' ``` But the expected one should be ```sql DELETE FROM collection WHERE id IN ('1', '2', '3') ```	2024-01-08 12:26:29 -08:00
Christophe Bornet	a466f79ac9	Fix AstraDB logical operator filtering (#15699 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> This change fixes the AstraDB logical operator filtering (`$and,` `$or`). The `metadata` prefix must not be added if the key is `$and` or `$or`.	2024-01-08 12:23:46 -08:00
Christophe Bornet	1f5f6381ec	Add doc for AstraDB document loader (#15703 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> See preview : https://langchain-git-fork-cbornet-astra-loader-doc-langchain.vercel.app/docs/integrations/document_loaders/astradb	2024-01-08 12:21:46 -08:00
Eugene Yurtsev	b508fcce65	core(minor): Add a way to print out system information for debugging purposes. (#15718 ) To use: ```bash python -m langchain_core.sys_info ```	2024-01-08 12:20:18 -08:00
Erick Friis	94911ae503	community[patch]: Support different Pinecone initializations depending on the version (#15717 ) Co-authored-by: DosticJelena <jelenadostic2@gmail.com>	2024-01-08 11:33:36 -08:00
Bagatur	4c47f39fcb	community[patch]: Release 0.0.10 (#15678 )	2024-01-08 00:24:45 -05:00
Bagatur	60f925d678	core[patch]: Release 0.1.8 (#15677 )	2024-01-08 00:05:12 -05:00
Nuno Campos	7ce4cd0709	Do not issue beta or deprecation warnings on internal calls (#15641 )	2024-01-07 20:54:45 -08:00
Nuno Campos	ef22559f1f	Populate streamed_output for all runs handled by atransform_stream_with_config (#15599 ) This means that users of astream_log() now get streamed output of virtually all requested runs, whereas before the only streamed output would be for the root run and raw llm runs <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-07 19:35:43 -08:00
Earlee	98c6c9603e	community: fix: should flush after inserting data on milvus (#15568 ) The inserted data cannot take effect immediately. We should flush after inserting data on milvus.	2024-01-07 09:33:47 -08:00
chyroc	a17a3638b5	Docs: fix excel document loader typo (#15470 )	2024-01-07 09:33:35 -08:00
chyroc	9ae901c5e6	Feat: add CHM file loader (#15519 ) fix https://github.com/langchain-ai/langchain/issues/15469	2024-01-07 09:28:52 -08:00
Nan LI	0b393315ce	community: Correct Input API Key Name in Notebook and Enhance Readability of Comments for ZhipuAI Chat Model (#15529 ) - Description: This update rectifies an error in the notebook by changing the input variable from `zhipu_api_key` to `api_key`. It also includes revisions to comments to improve program readability. - Issue: The input variable in the notebook example should be `api_key` instead of `zhipu_api_key`. - Dependencies: No additional dependencies are required for this change. To ensure quality and standards, we have performed extensive linting and testing. Commands such as make format, make lint, and make test have been run from the root of the modified package to ensure compliance with LangChain's coding standards.	2024-01-07 09:27:47 -08:00
kursathalat	9ea28ee464	fix: Fix DEFAULT_API_KEY for ArgillaCallbackHandler (#15534 ) - ArgillaCallbackHandler does not properly set the default values while initializing. This PR corrects the line. - Issue: #15531 - Dependencies: Argilla - Also corrected some dead links.	2024-01-07 09:26:51 -08:00
Chad Norvell	d1bfb70bc4	community: Allow deleting by ID and collection in `pgvector` (#15627 ) - Description: The `delete_collection` method deletes an entire collection regardless of custom ID. The `delete` method deletes everything with the provided custom IDs regardless of collection. It can be useful to restrict deletion to both the collection and a set of custom IDs. This change adds support for that by allowing you to optionally specify that `delete` should be restricted to the collection defined on the `PGVector` instance.	2024-01-07 08:33:21 -08:00
Chad Norvell	f6226d464e	community: Include PDF ID in MathPix metadata (#15629 ) - Description: Includes the PDF ID in the MathPix document metadata. This is useful in case you need to re-request a processed PDF from the MathPix API later.	2024-01-07 08:31:53 -08:00
Chad Norvell	d2a686b165	community: Provide more actionable errors in the MathPix PDF loader (#15630 ) - Description: The `error_info['id']` can be cross-referenced with the MathPix API documentation to get very specific information about why an error occurred.	2024-01-07 08:31:09 -08:00
Kai	5d05df4bce	community: Fixed bug of "system message check" in chat_models/tongyi. (#15631 ) - Description: This PR is to fix a bug of "system message check" in langchain_community/ chat_models/tongyi.py - Issue: In term of current logic, if there's no system message in the chat messages, an error of "System message can only be the first message." will be wrongly raised. - Dependencies: No. - Twitter handle: I don't have a Twitter account.	2024-01-07 08:30:18 -08:00
Raunak	64f5968a81	community: Replaced hardcoded "metadata" with FIELDS_METADATA variable in semantic_hybrid_search_with_score_and_rerank (#15642 ) - Description: This PR is to fix a bug in semantic_hybrid_search_with_score_and_rerank() function in langchain_community/vectorstores/azuresearch.py. The hardcoded "metadata" name is replaced with FIELDS_METADATA variable with an if block to check if the metadata column exists or not. - Issue: Fixed #15581 - Dependencies: No - Twitter handle: None Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-06 17:04:59 -08:00
Erick Friis	b1fa726377	docs: langchain-openai (#15513 ) Updates docs and cookbooks to import ChatOpenAI, OpenAI, and OpenAI Embeddings from `langchain_openai` There are likely more --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-06 15:54:48 -08:00
Bagatur	14c5c15958	experimental[patch]: Release 0.0.48 (#15483 )	2024-01-06 12:46:00 -05:00
Erick Friis	d136925c49	community[patch]: fix deprecation warnings on openai subclasses (#15621 )	2024-01-05 18:02:17 -08:00
Bagatur	4ac61670b2	infra: fix langchain openai test dep (#15620 )	2024-01-05 20:14:22 -05:00
Bagatur	81810cec2e	langchain[minor]: Release 0.1.0 (#15619 )	2024-01-05 19:33:35 -05:00
Bagatur	c5226d7a18	docs: update cohere chat integration (#15562 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-05 16:33:29 -08:00
Erick Friis	1bc6b19ea7	openai[patch]: v0.0.2 (#15618 )	2024-01-05 16:33:10 -08:00
Bagatur	46446a100d	core[patch]: deprecate v1 tracer (#15608 )	2024-01-05 19:25:19 -05:00
Bagatur	dbb582d227	infra: community bump min core version (#15617 )	2024-01-05 19:17:48 -05:00
Bagatur	1e4b8f0453	community[patch]: Release 0.0.9 (#15615 )	2024-01-05 19:11:18 -05:00
Erick Friis	7f8baa030b	openai: core version, rc1 (#15614 )	2024-01-05 15:57:23 -08:00
Erick Friis	5ac3a06378	google-vertexai: release 0.0.1 (#15613 )	2024-01-05 15:24:23 -08:00
Bagatur	96b47e18e0	core[patch]: Release 0.1.7 (#15610 )	2024-01-05 18:24:11 -05:00
Erick Friis	b257c7d0ea	google-vertexai, openai: release candidate version (#15611 )	2024-01-05 15:05:27 -08:00
Erick Friis	ebc75c5ca7	openai[minor]: implement langchain-openai package (#15503 ) Todo - [x] copy over integration tests - [x] update docs with new instructions in #15513 - [x] add linear ticket to bump core -> community, community->langchain, and core->openai deps - [ ] (optional): add `pip install langchain-openai` command to each notebook using it - [x] Update docstrings to not need `openai` install - [x] Add serialization - [x] deprecate old models Contributor steps: - [x] Add secret names to manual integrations workflow in .github/workflows/_integration_test.yml - [x] Add secrets to release workflow (for pre-release testing) in .github/workflows/_release.yml Maintainer steps (Contributors should not do these): - [x] set up pypi and test pypi projects - [x] add credential secrets to Github Actions - [ ] add package to conda-forge Functional changes to existing classes: - now relies on openai client v1 (1.6.1) via concrete dep in langchain-openai package Codebase organization - some function calling stuff moved to `langchain_core.utils.function_calling` in order to be used in both community and langchain-openai	2024-01-05 15:03:28 -08:00
Bagatur	a7d023aaf0	core[patch], community[patch]: mark runnable context, lc load as beta (#15603 )	2024-01-05 17:54:26 -05:00
Leonid Kuligin	f73bf4ee54	google-vertexai: added langchain_google_vertexai package (#15218 ) added langchain_google_vertexai package --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-05 10:44:10 -08:00
Bagatur	e1fc4d5b95	core[patch]: add beta decorator (#15589 )	2024-01-05 13:16:27 -05:00
Bagatur	68eb3053e7	langchain[patch]: deprecate old agent classes and methods (#15558 )	2024-01-05 12:42:54 -05:00
Harrison Chase	9b9449750c	update chain docs (#15495 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-05 09:15:00 -08:00
Bagatur	00dfbd2a99	core[minor], langchain[minor]: deprecate old Chain and LLM methods (#15499 )	2024-01-05 11:58:35 -05:00
chyroc	f12b5c1222	Feat: support Milvus more params (#15447 ) fix https://github.com/langchain-ai/langchain/issues/15442	2024-01-04 20:07:23 -08:00
Bagatur	f5e4f0b30b	langchain[minor]: add warnings when importing integrations (#15505 ) Should be imported from community directly	2024-01-04 17:41:45 -05:00
Eugene Yurtsev	bf0b3cc0b5	core[patch]: Further restrict recursive URL loader (#15559 ) Includes code from this PR: https://github.com/langchain-ai/langchain/compare/HEAD...m0kr4n3:security/fix_ssrf with additional fixes Unit tests cover new test cases	2024-01-04 16:33:57 -05:00
Bagatur	817b84de9e	core[patch]: Release 0.1.6 (#15547 )	2024-01-04 11:02:04 -05:00
Bagatur	b2f15738dd	core[patch], langchain[patch], community[patch]: Revert #15326 (#15546 )	2024-01-04 10:39:37 -05:00
Bagatur	6e90b7a91b	langchain[patch]: bump community >=0.0.8,<0.1 (#15492 )	2024-01-03 13:31:48 -05:00
Bagatur	8b7d6531a5	langchain[patch]: Release 0.0.354 (#15482 )	2024-01-03 12:51:55 -05:00
Bagatur	0b579dc623	infra: update community test min reqs (#15490 )	2024-01-03 12:13:29 -05:00
Bagatur	266db0efc8	community[patch]: bump core version >=0.1.5,<0.2 (#15488 )	2024-01-03 12:03:31 -05:00
Bagatur	a2324ee533	community[patch]: Release 0.0.8 (#15481 )	2024-01-03 11:28:50 -05:00
Bagatur	54b58c03db	infra: add minimum deps pre release check (#15485 )	2024-01-03 11:28:35 -05:00
Bagatur	b317ad2472	core[patch]: Release 0.1.5 (#15480 )	2024-01-03 10:26:27 -05:00
Bagatur	baeac236b6	langchain[patch], experimental[patch]: update utilities imports (#15438 )	2024-01-03 02:18:15 -05:00
Harutaka Kawamura	73da8f863c	Remove unused `Params` (#14385 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Removes unused `Params` in `libs/langchain/langchain/llms/mlflow.py`. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-02 22:45:18 -08:00
chyroc	b65e57971e	Patch: improve type hint (#15451 )	2024-01-02 22:39:27 -08:00
Harutaka Kawamura	8ebf55ebbf	Fix `llms.Mlflow` example (#14386 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> The example code for `llms.Mlflow` is outdated. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-02 22:35:13 -08:00
Nolan	6c4b5a4eff	Add option to preserve headers in MarkdownHeaderTextSplitter (#14433 ) - Description: `MarkdownHeaderTextSplitter` currently strips header lines from chunked content. Many applications require these header lines are preserved. This adds an optional parameter to preserve those headers in the chunked content. - Issue: #2836 (relevant) - Dependencies: - - Tag maintainer: @baskaryan - Twitter handle: @finnless Unit tests and new examples in notebook included. cc @rlancemartin --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-02 22:34:52 -08:00
Xin Liu	0a7d360ba4	feat: new integration `wasm_chat` (#14787 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Adds `WasmChat` integration. `WasmChat` runs GGUF models locally or via chat service in lightweight and secure WebAssembly containers. In this PR, `WasmChatService` is introduced as the first step of the integration. `WasmChatService` is driven by [llama-api-server](https://github.com/second-state/llama-utils) and [WasmEdge Runtime](https://wasmedge.org/). --------- Signed-off-by: Xin Liu <sam@secondstate.io>	2024-01-02 22:33:14 -08:00

... 3 4 5 6 7 ...

2801 Commits