langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
William FH	246dc4f9cc	langchain[patch]: Pass kwargs to chat fireworks (#14183 ) Otherwise `.bind()` isn't really any good	2023-12-03 15:12:02 -08:00
Kaiboon Ee	e961c57fd2	langchain[patch]: Mask API key for Arcee LLM (#14193 ) - Description: Mask API key for Arcee LLM and its associated unit tests - Issue: https://github.com/langchain-ai/langchain/issues/12165 - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: `eekaiboon` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-03 15:11:43 -08:00
Daniyar Supiyev	092f302c0f	langchain[patch]: Asynchronous human-in-the-loop callback (#14195 ) Description: Adding a possibility to use asynchronous callback handler in human-in-the-loop validation tool. Very useful, for example, if you want to implement a validation over Telegram bot. Issue: - Dependencies: - --------- Co-authored-by: Daniyar_Supiyev <daniyar_supiyev@epam.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-03 14:57:07 -08:00
Leonid Ganeline	c660b0cf79	docs[patch]: moved semadb.mdx file (#14204 ) SemaDB.mdx file was placed with additional sub-folder: `https://python.langchain.com/docs/integrations/providers/providers/semadb` - Moved file to the `https://python.langchain.com/docs/integrations/providers/semadb` - Added a redirect for the file URL	2023-12-03 14:36:47 -08:00
Mark Cusack	16c83f786c	Adds the Yellowbrick Data Warehouse as a supported vector store (#13820 ) - Description An integration to allow the Yellowbrick Data Warehouse to function as a vector store --------- Co-authored-by: markcusack <markcusack@markcusacksmac.lan> Co-authored-by: markcusack <markcusack@Mark-Cusack-sMac.local>	2023-12-03 13:35:53 -08:00
Hendrik Hogertz	e6862e6e7d	Fix Azure Openai function calling in streaming mode (#13768 ) - Description: This PR addresses an issue with the OpenAI API streaming response, where initially the key (arguments) is provided but the value is None. Subsequently, it updates with {"arguments": "{\n"}, leading to a type inconsistency that causes an exception. The specific error encountered is ValueError: additional_kwargs["arguments"] already exists in this message, but with a different type. This change aims to resolve this inconsistency and ensure smooth API interactions. - Issue: None. - Dependencies: None. - Tag maintainer: @eyurtsev This is an updated version of #13229 based on the refactored code. Credit goes to @superken01. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-03 12:07:15 -08:00
Nicolò Boschi	e204657b3c	AstraDB VectorStore: implement pre_delete_collection (#13780 ) - Description: some vector stores have a flag for try deleting the collection before creating it (such as ´vectorpg´). This is a useful flag when prototyping indexing pipelines and also for integration tests. Added the bool flag `pre_delete_collection ` to the constructor (default False) - Tag maintainer: @hemidactylus - Twitter handle: nicoloboschi --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-03 12:06:20 -08:00
Chelsea E. Manning	2780d2d4dd	Extend OpenAIEmbeddings class to support non-`tiktoken` based embeddings (#13884 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: This extends `OpenAIEmbeddings` to add support for non-`tiktoken` based embeddings, specifically for use with the new `text-generation-webui` API (`--extensions openai`) which does not support `tiktoken` encodings, but rather strings - Issue: Not found, - Dependencies: HuggingFace `transformers.AutoTokenizer` is new dependency for running the model without `tiktoken` - Tag maintainer: @baskaryan based on last commit for `langchain-core` refactor - Twitter handle: @xychelsea Modified the tokenization process to be model-agnostic, allowing for both OpenAI and non-OpenAI model tokenizations, by setting the new default `bool` flag `tiktoken_enabled` to `False`. This requeires HuggingFace’s AutoTokenizer and handling tokenization for models requiring different preprocessing steps to generate a chunked string request rather than a list of integers. Updated the embeddings generation process to accommodate non-OpenAI models. This includes converting tokenized text into embeddings using OpenAI’s and Hugging Face’s model architectures. -->	2023-12-03 12:04:17 -08:00
Changgeng Zhao	9b59bde93d	Update Hologres vector store: use hologres-vector (#13767 ) Hi, I made some code changes on the Hologres vector store to improve the data insertion performance. Also, this version of the code uses `hologres-vector` library. This library is more convenient for us to update, and more efficient in performance. The code has passed the format/lint/spell check. I have run the unit test for Hologres connecting to my own database. Please check this PR again and tell me if anything needs to change. Best, Changgeng, Developer @ Alibaba Cloud Co-authored-by: Changgeng Zhao <zhaochanggeng.zcg@alibaba-inc.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-03 11:50:45 -08:00
Nicolò Boschi	0de7cf898d	Ensure AstraDB integration tests clean up the environment (#13774 ) - Description: currently astra_db integration tests might leave orphan collections - Tag maintainer: @hemidactylus - Twitter handle: nicoloboschi	2023-12-03 11:14:42 -08:00
Harrison Chase	7bc4c12477	delete stray test (#14200 ) was added to an old path also im not sure this is even really a test file? which is why i didnt move it	2023-12-03 11:06:57 -08:00
Leonid Ganeline	283c2994de	docs: `Hugging Face` platform page (#13831 ) `Hugging Face` is definitely a platform. It includes many integrations for many modules (LLM, Embedding, DocumentLoader, Tool) So, a doc page was added that defines Hugging Face as a platform.	2023-12-03 11:06:43 -08:00
Chad Norvell	8a0951d934	Fix Mathpix PDF loader integration (#13949 ) - Description: Fixes the Mathpix PDF loader API integration. Specifically, ensures that Mathpix auth headers are provided for every request, and ensures that we recognize all errors that can occur during a request. Also, the option to provide API keys as kwargs never actually worked before, but now that's fixed too. - Issue: #11249 - Dependencies: None	2023-12-03 10:36:49 -08:00
gzyJoy	32d4bb4590	Added Slacktoolkit (#14012 ) - Description: This PR introduces the Slack toolkit to LangChain, which allows users to read and write to Slack using the Slack API. Specifically, we've added the following tools. 1. get_channel: Provides a summary of all the channels in a workspace. 2. get_message: Gets the message history of a channel. 3. send_message: Sends a message to a channel. 4. schedule_message: Sends a message to a channel at a specific time and date. - Issue: This pull request addresses [Add Slack Toolkit #11747](https://github.com/langchain-ai/langchain/issues/11747) - Dependencies: package`slack_sdk` Note: For this toolkit to function you will need to add a Slack app to your workspace. Additional info can be found [here](https://slack.com/help/articles/202035138-Add-apps-to-your-Slack-workspace). --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ArianneLavada <ariannelavada@gmail.com> Co-authored-by: ArianneLavada <84357335+ArianneLavada@users.noreply.github.com> Co-authored-by: ariannelavada@gmail.com <you@example.com>	2023-12-03 10:25:38 -08:00
Richie	99e5ee6a84	fix(vectorstores): incorrect import for mongodb atlas DriverInfo (#14060 ) - Description: fix `import` issue for `mongodb atlas` vectore store integration - Issue: none - Dependencies: none while trying to follow official `langchain`'s [mongodb integration guide](https://python.langchain.com/docs/integrations/vectorstores/mongodb_atlas), an import error will happen. It's caused by incorrect import location: - `from pymongo import DriverInfo` should be `from pymongo.driver_info import DriverInfo` - reference: [pymongo's DriverInfo class](https://pymongo.readthedocs.io/en/stable/api/pymongo/driver_info.html#pymongo.driver_info.DriverInfo) Thanks!	2023-12-03 10:22:13 -08:00
ggeutzzang	03d6b94c29	Fix: (issue #14066 ) DOC: Summarization output broken (#14078 ) - Description: : As described in the issue below, https://python.langchain.com/docs/use_cases/summarization I've modified the Python code in the above notebook to perform well. I also modified the OpenAI LLM model to the latest version as shown below. `gpt-3.5-turbo-16k --> gpt-3.5-turbo-1106` This is because it seems to be a bit more responsive. - Issue: : #14066	2023-12-03 10:13:57 -08:00
James Braza	3833882ab7	Removing extra `StdOutCallbackHandler` overridden methods (#14136 ) Unnecessarily overridden methods: - Give the idea the subclass is doing something special (when it isn't) - Block CTRL-click to the actual method This PR removes some unnecessarily overridden methods in `StdOutCallbackHandler` Supercedes https://github.com/langchain-ai/langchain/pull/12858	2023-12-03 09:38:49 -08:00
Bob Lin	ac449f186b	Update docs to use new usage in openai>1.0.0 (#14163 ) ### Description Use new [APIs](https://github.com/openai/openai-python/blob/main/api.md#finetuning) ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-03 09:37:35 -08:00
James Braza	052e23be3e	Added Python `logging` tracer (#14190 ) This PR creates a logging handler and adds a simple unit test of it Supercedes https://github.com/langchain-ai/langchain/pull/12862 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-03 09:36:30 -08:00
Bob Lin	1ea48a31da	Update fallback cases (#14164 ) ### Description The `RateLimitError` initialization method has changed after openai v1, and the usage of `patch` needs to be changed. ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-03 08:56:07 -08:00
Bob Lin	62505043be	Closed #14069 (#14166 ) ### Description Fix #14069 ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-03 08:55:25 -08:00
Yong woo Song	9938086df0	Fix Html2TextTransformer for shallow copy (#14197 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Hi, There is some unintended behavior in Html2TextTransformer. The current code is directly modifying the original documents that are passed as arguments to the function. Therefore, not only the return of the function but also the input variables are being modified simultaneously. To resolve this, I added unit test code as well. reference link: [Shallow vs Deep Copying of Python Objects](https://realpython.com/copying-python-objects/) Thanks! ☺️	2023-12-03 08:45:35 -08:00
h3l	818252b1f8	Fix: (issue #14127 ) Volc Engine MaaS import error (#14194 ) - Description: fix Volc Engine MaaS import error - Issue: [the issue # it fixes (if applicable),](https://github.com/langchain-ai/langchain/issues/14127) - Dependencies: None - Tag maintainer: @baskaryan - Twitter handle: Co-authored-by: lvzhong <lvzhong@bytedance.com>	2023-12-03 08:43:23 -08:00
Leonid Ganeline	6ae0194dc7	docs: `integrations/toolkits/office365` notebook update (#14188 ) Added more descriptions and authentication details.	2023-12-03 08:43:00 -08:00
Bagatur	0bdb434383	langchain[patch]: Release langchain 0.0.345 (#14184 )	2023-12-02 15:53:49 -08:00
Bagatur	15c04a5670	core[patch]: Release 0.0.9 (#14182 )	2023-12-02 14:40:56 -08:00
James Braza	bdb6ae2ed3	core[patch]: `BaseTracer` helper method for `Run` lookup (#14139 ) I observed the same run ID extraction logic is repeated many times in `BaseTracer`. This PR creates a helper method for DRY code.	2023-12-02 14:05:50 -08:00
Harutaka Kawamura	41ee3be95f	langchain[patch]: Support passing parameters to `llms.Databricks` and `llms.Mlflow` (#14100 ) Before, we need to use `params` to pass extra parameters: ```python from langchain.llms import Databricks Databricks(..., params={"temperature": 0.0}) ``` Now, we can directly specify extra params: ```python from langchain.llms import Databricks Databricks(..., temperature=0.0) ```	2023-12-01 19:27:18 -08:00
Abdul	82102c99b3	langchain[patch]: Running SQLDatabaseChain adds prefix "SQLQuery:\n" (#14058 ) - Issue: https://github.com/langchain-ai/langchain/issues/12077 --------- Co-authored-by: Abdul Kader Maliyakkal <maliyakk@amazon.com>	2023-12-01 19:26:16 -08:00
Samuel Kemp	fd781c89cc	langchain[minor]: add azure ai data document loader (#13404 ) This PR adds an "Azure AI data" document loader, which allows Azure AI users to load their registered data assets as a document object in langchain. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 19:25:55 -08:00
James Braza	24385a00de	core[minor], langchain[patch], experimental[patch]: Added missing `py.typed` to `langchain_core` (#14143 ) See PR title. From what I can see, `poetry` will auto-include this. Please let me know if I am missing something here. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 19:15:23 -08:00
quantum00549	f7c257553d	langchain[patch]: fixed a bug that was causing the streaming transfer to not work… (#10827 ) … properly Fixed a bug that was causing the streaming transfer to not work properly. - Description: 1、The on_llm_new_token method in the streaming callback can now be called properly in streaming transfer mode. 2、In streaming transfer mode, LLM can now correctly output the complete response instead of just the first token. - Tag maintainer: @wangxuqi - **Twitter handle: @kGX7XJjuYxzX9Km --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 18:57:50 -08:00
Eugene Yurtsev	6d0209e0aa	Improve file system blob loader and generic loader (#14004 ) * Add support for passing a specific file to the file system blob loader * Allow specifying a class parameter for the parser for the generic loader ```python class AudioLoader(GenericLoader): @staticmethod def get_parser(kwargs): return MyAudioParser(kwargs): ``` The intent of the GenericLoader is to provide on-ramps from different sources (e.g., web, s3, file system). An alternative is to use pipelining syntax or creating a Pipeline ``` FileSystemBlobLoader(...) \| MyAudioParser ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 21:23:40 -05:00
Erick Friis	700428593a	fix broken api docs links (#14154 )	2023-12-01 17:17:52 -08:00
Bagatur	340b42d8ee	docs[minor]: lcel why page (#14089 )	2023-12-01 16:13:31 -08:00
Lance Martin	cbe4753e1a	Update Open CLIP embd (#14155 ) Prior default model required a large amt of RAM and often crashed Jupyter ntbk kernel.	2023-12-01 15:13:20 -08:00
Erick Friis	b01d9d27d9	docs[patch]: docs local build (#14152 )	2023-12-01 14:03:36 -08:00
Alex Kira	0caef3cde7	Change RunnableMap to RunnableParallel for consistency (#14142 ) - Description: Change instances of RunnableMap to RunnableParallel, as that should be the one used going forward. This makes it consistent across the codebase.	2023-12-01 13:36:40 -08:00
Erick Friis	96f6b90349	templates[patch]: relock templates (#14149 )	2023-12-01 13:35:54 -08:00
Martin Jul	e3a7c96a8e	docs[patch]: Fix minor typos (casing) in quickstart (#14138 ) Fix casing of API and LangChain in the description text for the LangServe example server.	2023-12-01 13:29:53 -08:00
Erick Friis	8cf4cb9e48	docs[patch]: Fix templates/index (#14146 )	2023-12-01 13:09:36 -08:00
Amyh102	b6d26d3f9f	infra[patch]: Add unit tests for Huggingface dataset loader (#14053 ) - Description: Add unit tests for huggingface dataset loader and sample huggingface dataset for future tests. Updates dependencies for `datasets` module. - Adds coverage for [previous pull request](https://github.com/langchain-ai/langchain/pull/13864) - Tag maintainer: @hwchase17 --------- Co-authored-by: Amy Han <amyhan@Amys-Air.lan> Co-authored-by: Amy Han <amyhan@Amys-MacBook-Air.local> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 12:42:31 -08:00
Alex Kira	6eb40db353	docs[patch]: Add getting started section to LCEL doc (#14045 ) ### Description: Doc addition for LCEL introduction. Adds a more basic starter guide for using LCEL. --------- Co-authored-by: Alex Kira <akira@Alexs-MBP.local.tld> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 12:23:43 -08:00
Govinda Totla	62a3473ac0	docs[patch]: add text_splitter.py test (#14025 ) Description: Add HTMLHeaderTextSplitter unit test Dependencies: none	2023-12-01 11:57:50 -08:00
Bagatur	7d5341dbd3	docs[patch]: add contribs to readme (#14137 )	2023-12-01 11:34:28 -08:00
axiangcoding	1b36ddf16c	docs[patch]: add deprecated note for ErnieChatBot (#14061 ) - Description: just a little change of ErnieChatBot class description, sugguesting user to use more suitable class - Issue: none, - Dependencies: none, - Tag maintainer: @baskaryan , - Twitter handle: none	2023-12-01 11:16:31 -08:00
Alex Kira	1757258b2a	docs[patch]: Add mermaid JS theme dependency to docusaurus (#14051 ) - Description: Add mermaid JS dependency and configs to documentation. Allows inline doc diagrams in markdown. - Dependencies: NPM package @docusaurus/theme-mermaid	2023-12-01 11:06:29 -08:00
Devin Dahoon Kim	32da0a4d71	langchain[patch]: use async_embed_with_retry in _aget_len_safe_embeddings (#14110 ) Description `embed_with_retry` is for sync operations and not for async operations. Use `async_embed_with_retry` for appropriate async operations. I'm using `OpenAIEmbedding(http_client=httpx.AsyncClient())` with only async operations. However, I got an error when I use `embedding.aembed_documents` because `embed_with_retry` uses sync OpenAI client with async http client.	2023-12-01 10:47:07 -08:00
lijie	371bcb7580	langchain[patch]: set maxsplit when parse python function docstring (#14121 ) Description when the desc of arg in python docstring contains ":", the `_parse_python_function_docstring` will raise ValueError: too many values to unpack (expected 2). A sample desc would be: """ Args: error_arg: this is an arg with an additional ":" symbol """ So, set `maxsplit` parameter to fix it.	2023-12-01 10:46:53 -08:00
Harrison Chase	ae646701c4	Harrison/ibm (#14133 ) Co-authored-by: Mateusz Szewczyk <139469471+MateuszOssGit@users.noreply.github.com>	2023-12-01 12:44:11 -05:00

1 2 3 4 5 ...

6197 Commits