langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-16 06:13:16 +00:00

Author	SHA1	Message	Date
Virat Singh	c2a614eddc	community: Add PolygonLastQuote Tool and Toolkit (#15990 ) Description: In this PR, I am adding a `PolygonLastQuote` Tool, which can be used to get the latest price quote for a given ticker / stock. Additionally, I've added a Polygon Toolkit, which we can use to encapsulate future tools that we build for Polygon. Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-21 15:08:55 -08:00
Nuno Campos	ef75bb63ce	core[patch] Fix tracer output of streamed runs with non-addable output (#16324 ) - Used to be None, now is just the last chunk <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-20 18:52:26 -08:00
Ryan French	3d23a5eb36	langchain[patch]: Allow OpenSearch Query Translator to correctly work with Date types (#16022 ) Description: Fixes an issue where the Date type in an OpenSearch Self Querying Retriever would fail to generate a valid query Issue: https://github.com/langchain-ai/langchain/issues/14225	2024-01-19 17:57:18 -08:00
Ofer Mendelevitch	ffae98d371	template: Update Vectara templates (#15363 ) fixed multi-query template for Vectara added self-query template for Vectara Also added prompt_name parameter to summarization CC @efriis Twitter handle: @ofermend	2024-01-19 17:32:33 -08:00
Bagatur	1e29b676d5	core[patch]: simple fallback streaming (#16055 )	2024-01-19 16:31:54 -08:00
Eugene Yurtsev	4ef0ed4ddc	astream_events: Add version parameter while method is in beta (#16290 ) Add a version parameter while the method is in beta phase. The idea is to make it possible to minimize making breaking changes for users while we're iterating on schema. Once the API is stable we can assign a default version requirement.	2024-01-19 13:20:02 -05:00
Bagatur	91230ef5d1	openai[patch]: Release 0.0.3 (#16289 )	2024-01-19 10:15:08 -08:00
Hamza Kyamanywa	39b3c6d94c	langchain[patch]: Add konlpy based text splitting for Korean (#16003 ) - Description: Adds a text splitter based on [Konlpy](https://konlpy.org/en/latest/#start) which is a Python package for natural language processing (NLP) of the Korean language. (It is like Spacy or NLTK for Korean) - Dependencies: Konlpy would have to be installed before this splitter is used, - Twitter handle: @untilhamza	2024-01-19 09:44:56 -08:00
Bagatur	e3828bee43	core[patch]: Release 0.1.13 (#16287 )	2024-01-19 09:28:31 -08:00
Bagatur	2454fefc53	docs: agent prompt docs (#16105 )	2024-01-19 09:19:22 -08:00
Bagatur	84bf5787a7	core[patch], openai[patch]: Chat openai stream logprobs (#16218 )	2024-01-19 09:16:09 -08:00
Carey	021b0484a8	community[patch]: add skipped test for inner product normalization (#14989 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 23:03:15 -08:00
Christophe Bornet	3ccbe11363	community[minor]: Add Cassandra document loader (#16215 ) - Description: document loader for Apache Cassandra - Twitter handle: cbornet_	2024-01-18 18:49:02 -08:00
mikeFore4	9d32af72ce	community[patch]: huggingface hub character removal bug fix (#16233 ) - Description: Some text-generation models on huggingface repeat the prompt in their generated response, but not all do! The tests use "gpt2" which DOES repeat the prompt and as such, the HuggingFaceHub class is hardcoded to remove the first few characters of the response (to match the len(prompt)). However, if you are using a model (such as the very popular "meta-llama/Llama-2-7b-chat-hf") that DOES NOT repeat the prompt in it's generated text, then the beginning of the generated text will be cut off. This code change fixes that bug by first checking whether the prompt is repeated in the generated response and removing it conditionally. - Issue: #16232 - Dependencies: N/A - Twitter handle: N/A	2024-01-18 18:44:10 -08:00
Andreas Motl	3613d8a2ad	community[patch]: Use SQLAlchemy's `bulk_save_objects` method to improve insert performance (#16244 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to save embeddings in batches, to improve its performance. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 Hi again from the CrateDB team, following up on GH-16243, this is another minor patch to the pgvector vector store adapter. Inserting embeddings in batches, using [SQLAlchemy's `bulk_save_objects`](https://docs.sqlalchemy.org/en/20/orm/session_api.html#sqlalchemy.orm.Session.bulk_save_objects) method, can deliver substantial performance gains. With kind regards, Andreas. NB: As I am seeing just now that this method is a legacy feature of SA 2.0, it will need to be reworked on a future iteration. However, it is not deprecated yet, and I haven't been able to come up with a different implementation, yet.	2024-01-18 18:35:39 -08:00
Eugene Yurtsev	177af65dc4	core[minor]: RFC Add astream_events to Runnables (#16172 ) This PR adds `astream_events` method to Runnables to make it easier to stream data from arbitrary chains. * Streaming only works properly in async right now * One should use `astream()` with if mixing in imperative code as might be done with tool implementations * Astream_log has been modified with minimal additive changes, so no breaking changes are expected * Underlying callback code / tracing code should be refactored at some point to handle things more consistently (OK for now) - ~~[ ] verify event for on_retry~~ does not work until we implement streaming for retry - ~~[ ] Any rrenaming? Should we rename "event" to "hook"?~~ - [ ] Any other feedback from community? - [x] throw NotImplementedError for `RunnableEach` for now ## Example See this [Example Notebook](`dbbc7fa0d6/docs/docs/modules/agents/how_to/streaming_events.ipynb`) for an example with streaming in the context of an Agent ## Event Hooks Reference Here is a reference table that shows some events that might be emitted by the various Runnable objects. Definitions for some of the Runnable are included after the table. \| event \| name \| chunk \| input \| output \| \|----------------------\|------------------\|---------------------------------\|-----------------------------------------------\|-------------------------------------------------\| \| on_chat_model_start \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| \| \| on_chat_model_stream \| [model name] \| AIMessageChunk(content="hello") \| \| \| \| on_chat_model_end \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| {"generations": [...], "llm_output": None, ...} \| \| on_llm_start \| [model name] \| \| {'input': 'hello'} \| \| \| on_llm_stream \| [model name] \| 'Hello' \| \| \| \| on_llm_end \| [model name] \| \| 'Hello human!' \| \| on_chain_start \| format_docs \| \| \| \| \| on_chain_stream \| format_docs \| "hello world!, goodbye world!" \| \| \| \| on_chain_end \| format_docs \| \| [Document(...)] \| "hello world!, goodbye world!" \| \| on_tool_start \| some_tool \| \| {"x": 1, "y": "2"} \| \| \| on_tool_stream \| some_tool \| {"x": 1, "y": "2"} \| \| \| \| on_tool_end \| some_tool \| \| \| {"x": 1, "y": "2"} \| \| on_retriever_start \| [retriever name] \| \| {"query": "hello"} \| \| \| on_retriever_chunk \| [retriever name] \| {documents: [...]} \| \| \| \| on_retriever_end \| [retriever name] \| \| {"query": "hello"} \| {documents: [...]} \| \| on_prompt_start \| [template_name] \| \| {"question": "hello"} \| \| \| on_prompt_end \| [template_name] \| \| {"question": "hello"} \| ChatPromptValue(messages: [SystemMessage, ...]) \| Here are declarations associated with the events shown above: `format_docs`: ```python def format_docs(docs: List[Document]) -> str: '''Format the docs.''' return ", ".join([doc.page_content for doc in docs]) format_docs = RunnableLambda(format_docs) ``` `some_tool`: ```python @tool def some_tool(x: int, y: str) -> dict: '''Some_tool.''' return {"x": x, "y": y} ``` `prompt`: ```python template = ChatPromptTemplate.from_messages( [("system", "You are Cat Agent 007"), ("human", "{question}")] ).with_config({"run_name": "my_template", "tags": ["my_template"]}) ```	2024-01-18 21:27:01 -05:00
SN	f175bf7d7b	Use env for revision id if not passed in as param; use `git describe` as backup (#16227 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-01-18 16:15:26 -08:00
Erick Friis	b9495da92d	langchain[patch]: fix stuff documents chain api docs render (#16159 )	2024-01-18 14:07:44 -08:00
Erick Friis	0e76d84137	google-vertexai[patch]: more integration test fixes (#16234 )	2024-01-18 13:59:23 -08:00
Erick Friis	aa35b43bcd	docs, google-vertex[patch]: function docs (#16231 )	2024-01-18 13:15:09 -08:00
Harrison Chase	f60f59d69f	google-vertexai[patch]: Harrison/vertex function calling (#16223 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 12:17:40 -08:00
Rajesh Thallam	6bc6d64a12	langchain_google_vertexai[patch]: Add support for SystemMessage for Gemini chat model (#15933 ) - Description: In Google Vertex AI, Gemini Chat models currently doesn't have a support for SystemMessage. This PR adds support for it only if a user provides additional convert_system_message_to_human flag during model initialization (in this case, SystemMessage would be prepended to the first HumanMessage). NOTE: The implementation is similar to #14824 - Twitter handle: rajesh_thallam --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 10:22:07 -08:00
Erick Friis	65b231d40b	mistralai[patch]: async integration tests (#16214 )	2024-01-18 09:45:44 -08:00
Eugene Zapolsky	6b9e3ed9e9	google-vertexai[minor]: added safety_settings property to gemini wrapper (#15344 ) Description: Gemini model has quite annoying default safety_settings settings. In addition, current VertexAI class doesn't provide a property to override such settings. So, this PR aims to - add safety_settings property to VertexAI - fix issue with incorrect LLM output parsing when LLM responds with appropriate 'blocked' response - fix issue with incorrect parsing LLM output when Gemini API blocks prompt itself as inappropriate - add safety_settings related tests I'm not enough familiar with langchain code base and guidelines. So, any comments and/or suggestions are very welcome. Issue: it will likely fix #14841 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 08:54:30 -08:00
Eugene Yurtsev	ecd4f0a7ec	core[patch]: testing add chat model for unit-tests (#16209 ) This PR adds a fake chat model for testing purposes. Used in this PR: https://github.com/langchain-ai/langchain/pull/16172	2024-01-18 11:30:53 -05:00
SN	7d444724d7	Add revision identifier to run_on_dataset (#16167 ) Allow specifying revision identifier for better project versioning	2024-01-17 20:27:43 -08:00
Eugene Yurtsev	5d8c147332	docs: Document and test PydanticOutputFunctionsParser (#15759 ) This PR adds documentation and testing to `PydanticOutputFunctionsParser(OutputFunctionsParser)`.	2024-01-17 18:21:18 -08:00
Christophe Bornet	3502a407d9	infra: Use dotenv in langchain-community's integration tests (#16137 ) * Removed some env vars not used in langchain package IT * Added Astra DB env vars in langchain package, used for cache tests * Added conftest.py to load env vars in langchain_community IT * Added .env.example in langchain_community IT	2024-01-17 18:18:26 -08:00
Nuno Campos	ca014d5b04	Update readme (#16160 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-17 13:56:07 -08:00
Tomaz Bratanic	1e80113ac9	community[patch]: Add neo4j timeout and value sanitization option (#16138 ) The timeout function comes in handy when you want to kill longrunning queries. The value sanitization removes all lists that are larger than 128 elements. The idea here is to remove embedding properties from results.	2024-01-17 13:22:19 -08:00
Krishna Shedbalkar	f238217cea	community[patch]: Basic Logging and Human input to ShellTool (#15932 ) - Description: As Shell tool is very versatile, while integrating it into applications as openai functions, developers have no clue about what command is being executed using the ShellTool. All one can see is: ![image](https://github.com/langchain-ai/langchain/assets/60742358/540e274a-debc-4564-9027-046b91424df3) Summarising my feature request: 1. There's no visibility about what command was executed. 2. There's no mechanism to prevent a command to be executed using ShellTool, like a y/n human input which can be accepted from user to proceed with executing the command., - Issue: the issue #15931 it fixes if applicable, - Dependencies: There isn't any dependancy, - Twitter handle: @krishnashed	2024-01-17 12:57:51 -08:00
Bagatur	679a3ae933	openai[patch]: clarify azure error (#16157 )	2024-01-17 12:43:14 -08:00
Bagatur	7ad9eba8f4	core[patch]: Release 0.1.12 (#16161 )	2024-01-17 12:39:45 -08:00
Leonid Kuligin	58f0ba306b	changed default params for gemini (#16044 ) Replace this entire comment with: - Description: changed default values for Vertex LLMs (to be handled on the SDK's side)	2024-01-17 12:19:18 -08:00
Bagatur	5c73fd5bba	core[patch]: support old core namespaces (#16155 )	2024-01-17 11:26:25 -08:00
Christophe Bornet	fb940d11df	community[patch]: Use newer MetadataVectorCassandraTable in Cassandra vector store (#15987 ) as VectorTable is deprecated Tested manually with `test_cassandra.py` vector store integration test.	2024-01-17 10:37:07 -08:00
Mohammad Mohtashim	1fa056c324	community[patch]: Don't set search path for unknown SQL dialects (#16047 ) - Description: Made a small fix for the `SQLDatabase` highlighted in an issue. The issue pertains to switching schema for different SQL engines. - Issue: #16023 @baskaryan	2024-01-17 10:31:11 -08:00
Erick Friis	11327e6b64	google-vertexai[patch]: typing, release 0.0.2 (#16153 )	2024-01-17 10:16:59 -08:00
Leonid Ganeline	2709d3e5f2	langchain[patch]: updated imports for `langchain.callbacks` (#16060 ) Updated imports from 'langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:06:59 -08:00
Leonid Ganeline	c5f6b828ad	langchain[patch], community[minor]: move `output_parsers.ernie_functions` (#16057 ) `output_parsers.ernie_functions` moved into `community`	2024-01-17 10:06:18 -08:00
Leonid Ganeline	49aff3ea5b	langchain[patch]: updated `agents` imports (#16061 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:02:29 -08:00
Leonid Ganeline	60b1bd02d7	langchain[patch]: updated imports for `output_parsers` (#16059 ) Updated imports from `langchain` to `core` where it is possible	2024-01-17 10:02:12 -08:00
Leonid Ganeline	9e9ad9b0e9	langchain[patch]: updated `retrievers` imports (#16062 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:01:06 -08:00
Leonid Ganeline	d350be959d	langchain[patch]: updated `chains` imports (#16064 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 09:58:42 -08:00
Fei Wang	d0e101e4e0	community[patch]: fix ollama astream (#16070 ) Update ollama.py	2024-01-17 09:42:41 -08:00
ChengZi	8597484195	langchain[patch]: support more comparators in Milvus self-querying retriever (#16076 ) - Description: Support IN and LIKE comparators in Milvus self-querying retriever, based on [Boolean Expression Rules](https://milvus.io/docs/boolean.md) - Issue: No - Dependencies: No - Twitter handle: No Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-01-17 09:41:23 -08:00
Kapil Sachdeva	f406dc3872	docs: in RunnableRetry, correct the example snippet that uses with_retry method on Runnable (#16108 ) The example code snippet for with_retry is using incorrect argument names. This PR fixes that	2024-01-17 09:11:27 -08:00
BeatrixCohere	b0c3e3db2b	community[patch]: Handle when documents are not provided in the Cohere response (#16144 ) - Description: This handles the cohere response when documents aren't included in the response - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-01-17 09:11:00 -08:00
Felix Krones	d91126fc64	community[patch]: missing unpack operator for or_clause in pgvector document filter (#16148 ) - Fix for #16146 - Adding unpack operation to "or" and "and" filter for pgvector retriever. #	2024-01-17 09:10:43 -08:00
Erick Friis	06fe2f4fb0	partners: add license field (#16117 ) - bumps package post versions for packages without current unreleased updates - will bump package version in release prs associated with packages that do have changes (mistral, vertex)	2024-01-17 08:37:13 -08:00

1 2 3 4 5 ...

2552 Commits