langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Max Jakob	de209af533	community[patch]: ElasticsearchStore: add relevance function selector (#16378 ) Implement similarity function selector for ElasticsearchStore. The scores coming back from Elasticsearch are already similarities (not distances) and they are already normalized (see [docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params)). Hence we leave the scores untouched and just forward them. This fixes #11539. However, in hybrid mode (when keyword search and vector search are involved) Elasticsearch currently returns no scores. This PR adds an error message around this fact. We need to think a bit more to come up with a solution for this case. This PR also corrects a small error in the Elasticsearch integration test. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-22 11:52:20 -07:00
y2noda	54f90fc6bc	langchain_google_vertexai:Enable the use of langchain's built-in tools in Gemini's function calling (#16341 ) - Issue: This is a PR about #16340 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: yuhei.tsunoda <yuhei.tsunoda@brainpad.co.jp>	2024-01-22 11:16:36 -07:00
Tom Jorquera	1445ac95e8	community[patch]: Enable streaming for GPT4all (#16392 ) `streaming` param was never passed to model	2024-01-22 09:54:18 -08:00
Bagatur	af9f1738ca	langchain[patch]: Release 0.1.2 (#16388 )	2024-01-22 09:32:24 -08:00
Bagatur	8779013847	community[patch]: Release 0.0.14 (#16384 )	2024-01-22 08:50:19 -08:00
Bagatur	9cf0f5eb78	core[patch]: Release 0.1.14 (#16382 )	2024-01-22 08:28:03 -08:00
Bagatur	1dc6c1ce06	core[patch], community[patch], langchain[patch], docs: Update SQL chains/agents/docs (#16168 ) Revamp SQL use cases docs. In the process update SQL chains and agents.	2024-01-22 08:19:08 -08:00
Jatin Chawda	05162928c0	Docs: Fixed Urls of AsyncHtmlLoader, AsyncChromiumLoader and HTML2Text links in Web scraping Docs (#16365 ) Fixing links in documentation.	2024-01-22 11:03:03 -05:00
Bob Lin	acc14802d1	Fix `conn` field definition in SQLiteEntityStore (#15440 )	2024-01-22 07:53:49 -08:00
James Braza	e1c59779ad	core[patch]: Remove `print` statement on missing `grandalf` dependency in favor of more explicit ImportError (#16326 ) After this PR an ImportError will be raised without a print if grandalf is missing when using grandalf related code for printing runnable graphs.	2024-01-22 10:48:54 -05:00
Nuno Campos	971a68d04f	Docs: Update README.md in core (#16329 ) Docs: Update README.md in core	2024-01-22 10:42:31 -05:00
Christophe Bornet	f9be877ed7	Docs: Add self-querying retriever and store to AstraDB provider doc (#16362 ) Add self-querying retriever and store to AstraDB provider doc	2024-01-22 10:24:28 -05:00
Mateusz Szewczyk	076dbb1a8f	docs: IBM watsonx.ai Use `invoke` instead of `__call__` (#16371 ) - Description: Updating documentation of IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM with using `invoke` instead of `__call__` - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅ The following warning information show when i use `run` and `__call__` method: ``` LangChainDeprecationWarning: The function `__call__` was deprecated in LangChain 0.1.7 and will be removed in 0.2.0. Use invoke instead. warn_deprecated( ``` We need to update documentation for using `invoke` method	2024-01-22 10:22:03 -05:00
Bob Lin	c6bd7778b0	Use `invoke` instead of `__call__` (#16369 ) The following warning information will be displayed when i use `llm(PROMPT)`: ```python /Users/169/llama.cpp/venv/lib/python3.11/site-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The function `__call__` was deprecated in LangChain 0.1.7 and will be removed in 0.2.0. Use invoke instead. warn_deprecated( ``` So I changed to standard usage.	2024-01-22 10:18:43 -05:00
Eugene Yurtsev	89372fca22	core[patch]: Update sys info information (#16297 ) Update information collected in sys info. python -m langchain_core.sys_info System Information ------------------ > OS: Linux > OS Version: #14~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Mon Nov 20 18:15:30 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.1.10 > langchain: 0.1.0 > langchain_community: 0.0.11 > langchain_cli: 0.0.20 > langchain_experimental: 0.0.36 > langchain_openai: 0.0.2 > langchainhub: 0.1.14 > langserve: 0.0.19 Packages not installed (Not Necessarily a Problem) -------------------------------------------------- The following packages were not found: > langgraph	2024-01-22 10:18:04 -05:00
Luke	5396604ef4	community: Handling missing key in Google Trends API response. (#15864 ) - Description: Handing response where _interest_over_time_ is missing. - Issue: #15859 - Dependencies: None	2024-01-21 18:11:45 -08:00
Virat Singh	c2a614eddc	community: Add PolygonLastQuote Tool and Toolkit (#15990 ) Description: In this PR, I am adding a `PolygonLastQuote` Tool, which can be used to get the latest price quote for a given ticker / stock. Additionally, I've added a Polygon Toolkit, which we can use to encapsulate future tools that we build for Polygon. Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-21 15:08:55 -08:00
Nuno Campos	ef75bb63ce	core[patch] Fix tracer output of streamed runs with non-addable output (#16324 ) - Used to be None, now is just the last chunk <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-20 18:52:26 -08:00
Ryan French	3d23a5eb36	langchain[patch]: Allow OpenSearch Query Translator to correctly work with Date types (#16022 ) Description: Fixes an issue where the Date type in an OpenSearch Self Querying Retriever would fail to generate a valid query Issue: https://github.com/langchain-ai/langchain/issues/14225	2024-01-19 17:57:18 -08:00
Ofer Mendelevitch	ffae98d371	template: Update Vectara templates (#15363 ) fixed multi-query template for Vectara added self-query template for Vectara Also added prompt_name parameter to summarization CC @efriis Twitter handle: @ofermend	2024-01-19 17:32:33 -08:00
Bagatur	1e29b676d5	core[patch]: simple fallback streaming (#16055 )	2024-01-19 16:31:54 -08:00
Eugene Yurtsev	4ef0ed4ddc	astream_events: Add version parameter while method is in beta (#16290 ) Add a version parameter while the method is in beta phase. The idea is to make it possible to minimize making breaking changes for users while we're iterating on schema. Once the API is stable we can assign a default version requirement.	2024-01-19 13:20:02 -05:00
Bagatur	91230ef5d1	openai[patch]: Release 0.0.3 (#16289 )	2024-01-19 10:15:08 -08:00
Hamza Kyamanywa	39b3c6d94c	langchain[patch]: Add konlpy based text splitting for Korean (#16003 ) - Description: Adds a text splitter based on [Konlpy](https://konlpy.org/en/latest/#start) which is a Python package for natural language processing (NLP) of the Korean language. (It is like Spacy or NLTK for Korean) - Dependencies: Konlpy would have to be installed before this splitter is used, - Twitter handle: @untilhamza	2024-01-19 09:44:56 -08:00
Hongyu Lin	9b0a531aa2	doc: Fix small typo in quickstart (#16164 ) - Description: fix small typo in quickstart --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-19 09:44:22 -08:00
Sagar B Manjunath	63e2acc964	docs: Fix minor issues in NVIDIA RAG canonical template (#16189 ) - Description: Fixes a few issues in NVIDIAcanonical RAG template's README, and adds a notebook for the template - Dependencies: Adds the pypdf dependency which is needed for ingestion, and updates the lock file --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-19 09:44:08 -08:00
Lance Martin	881d1c3ec5	Update MultiON toolkit docs (#16286 )	2024-01-19 09:37:20 -08:00
Bagatur	e3828bee43	core[patch]: Release 0.1.13 (#16287 )	2024-01-19 09:28:31 -08:00
Bagatur	2454fefc53	docs: agent prompt docs (#16105 )	2024-01-19 09:19:22 -08:00
Bagatur	84bf5787a7	core[patch], openai[patch]: Chat openai stream logprobs (#16218 )	2024-01-19 09:16:09 -08:00
Bagatur	6f7a414955	docs: fix links (#16284 )	2024-01-19 08:51:12 -08:00
Eugene Yurtsev	cc2e30fa13	CI: update the description used for privileged issue template (#16277 ) Update description	2024-01-19 10:13:33 -05:00
Eugene Yurtsev	3b649f4331	CI: Add privileged version for issue creation (#16276 ) Add privileged version for issue creation. This adds a version of issue creation which is unstructured by design to make it easier for maintainers to create issues. Maintainers are expected to write / describe issues clearly.	2024-01-19 09:53:51 -05:00
Eugene Yurtsev	c0d453d8ac	CI: Disable blank issues, add links to QA discussions & show and tell (#16275 ) Update the issue template	2024-01-19 09:34:23 -05:00
Carey	021b0484a8	community[patch]: add skipped test for inner product normalization (#14989 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 23:03:15 -08:00
Lance Martin	f63906a9c2	Test and update MultiON agent toolkit docs (#16235 )	2024-01-18 20:24:35 -08:00
Christophe Bornet	3ccbe11363	community[minor]: Add Cassandra document loader (#16215 ) - Description: document loader for Apache Cassandra - Twitter handle: cbornet_	2024-01-18 18:49:02 -08:00
Tomaz Bratanic	fc84083ce5	docs: Add neo4j semantic blog post link to templates (#16225 )	2024-01-18 18:45:22 -08:00
mikeFore4	9d32af72ce	community[patch]: huggingface hub character removal bug fix (#16233 ) - Description: Some text-generation models on huggingface repeat the prompt in their generated response, but not all do! The tests use "gpt2" which DOES repeat the prompt and as such, the HuggingFaceHub class is hardcoded to remove the first few characters of the response (to match the len(prompt)). However, if you are using a model (such as the very popular "meta-llama/Llama-2-7b-chat-hf") that DOES NOT repeat the prompt in it's generated text, then the beginning of the generated text will be cut off. This code change fixes that bug by first checking whether the prompt is repeated in the generated response and removing it conditionally. - Issue: #16232 - Dependencies: N/A - Twitter handle: N/A	2024-01-18 18:44:10 -08:00
Andreas Motl	3613d8a2ad	community[patch]: Use SQLAlchemy's `bulk_save_objects` method to improve insert performance (#16244 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to save embeddings in batches, to improve its performance. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 Hi again from the CrateDB team, following up on GH-16243, this is another minor patch to the pgvector vector store adapter. Inserting embeddings in batches, using [SQLAlchemy's `bulk_save_objects`](https://docs.sqlalchemy.org/en/20/orm/session_api.html#sqlalchemy.orm.Session.bulk_save_objects) method, can deliver substantial performance gains. With kind regards, Andreas. NB: As I am seeing just now that this method is a legacy feature of SA 2.0, it will need to be reworked on a future iteration. However, it is not deprecated yet, and I haven't been able to come up with a different implementation, yet.	2024-01-18 18:35:39 -08:00
Ashley Xu	0f99646ca6	docs: add the enrollment form for`BigQueryVectorSearch` (#16240 ) This PR adds the enrollment form for BigQueryVectorSearch.	2024-01-18 18:34:06 -08:00
Eugene Yurtsev	177af65dc4	core[minor]: RFC Add astream_events to Runnables (#16172 ) This PR adds `astream_events` method to Runnables to make it easier to stream data from arbitrary chains. * Streaming only works properly in async right now * One should use `astream()` with if mixing in imperative code as might be done with tool implementations * Astream_log has been modified with minimal additive changes, so no breaking changes are expected * Underlying callback code / tracing code should be refactored at some point to handle things more consistently (OK for now) - ~~[ ] verify event for on_retry~~ does not work until we implement streaming for retry - ~~[ ] Any rrenaming? Should we rename "event" to "hook"?~~ - [ ] Any other feedback from community? - [x] throw NotImplementedError for `RunnableEach` for now ## Example See this [Example Notebook](`dbbc7fa0d6/docs/docs/modules/agents/how_to/streaming_events.ipynb`) for an example with streaming in the context of an Agent ## Event Hooks Reference Here is a reference table that shows some events that might be emitted by the various Runnable objects. Definitions for some of the Runnable are included after the table. \| event \| name \| chunk \| input \| output \| \|----------------------\|------------------\|---------------------------------\|-----------------------------------------------\|-------------------------------------------------\| \| on_chat_model_start \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| \| \| on_chat_model_stream \| [model name] \| AIMessageChunk(content="hello") \| \| \| \| on_chat_model_end \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| {"generations": [...], "llm_output": None, ...} \| \| on_llm_start \| [model name] \| \| {'input': 'hello'} \| \| \| on_llm_stream \| [model name] \| 'Hello' \| \| \| \| on_llm_end \| [model name] \| \| 'Hello human!' \| \| on_chain_start \| format_docs \| \| \| \| \| on_chain_stream \| format_docs \| "hello world!, goodbye world!" \| \| \| \| on_chain_end \| format_docs \| \| [Document(...)] \| "hello world!, goodbye world!" \| \| on_tool_start \| some_tool \| \| {"x": 1, "y": "2"} \| \| \| on_tool_stream \| some_tool \| {"x": 1, "y": "2"} \| \| \| \| on_tool_end \| some_tool \| \| \| {"x": 1, "y": "2"} \| \| on_retriever_start \| [retriever name] \| \| {"query": "hello"} \| \| \| on_retriever_chunk \| [retriever name] \| {documents: [...]} \| \| \| \| on_retriever_end \| [retriever name] \| \| {"query": "hello"} \| {documents: [...]} \| \| on_prompt_start \| [template_name] \| \| {"question": "hello"} \| \| \| on_prompt_end \| [template_name] \| \| {"question": "hello"} \| ChatPromptValue(messages: [SystemMessage, ...]) \| Here are declarations associated with the events shown above: `format_docs`: ```python def format_docs(docs: List[Document]) -> str: '''Format the docs.''' return ", ".join([doc.page_content for doc in docs]) format_docs = RunnableLambda(format_docs) ``` `some_tool`: ```python @tool def some_tool(x: int, y: str) -> dict: '''Some_tool.''' return {"x": x, "y": y} ``` `prompt`: ```python template = ChatPromptTemplate.from_messages( [("system", "You are Cat Agent 007"), ("human", "{question}")] ).with_config({"run_name": "my_template", "tags": ["my_template"]}) ```	2024-01-18 21:27:01 -05:00
SN	f175bf7d7b	Use env for revision id if not passed in as param; use `git describe` as backup (#16227 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-01-18 16:15:26 -08:00
Erick Friis	e5878c467a	infra: scheduled testing env (#16239 )	2024-01-18 14:28:01 -08:00
Erick Friis	2f348c695a	infra: add nvidia api secret to integration testing (#15972 )	2024-01-18 14:20:02 -08:00
Erick Friis	50959abf0c	infra: google cse id integration test (#16238 )	2024-01-18 14:12:00 -08:00
Erick Friis	b9495da92d	langchain[patch]: fix stuff documents chain api docs render (#16159 )	2024-01-18 14:07:44 -08:00
Erick Friis	eec3347939	docs: together cookbook import (#16236 )	2024-01-18 14:07:19 -08:00
Erick Friis	92bc80483a	infra: google search api key (#16237 )	2024-01-18 14:06:38 -08:00
Erick Friis	0e76d84137	google-vertexai[patch]: more integration test fixes (#16234 )	2024-01-18 13:59:23 -08:00

1 2 3 4 5 ...

7024 Commits