langchain

Commit Graph

Author	SHA1	Message	Date
Jon Watte	e042e5df35	fix: call _on_llm_error() (#13581 ) Description: There's a copy-paste typo where on_llm_error() calls _on_chain_error() instead of _on_llm_error(). Issue: #13580 Dependencies: None Tag maintainer: @hwchase17 Twitter handle: @jwatte "Run `make format`, `make lint` and `make test` to check this locally." The test scripts don't work in a plain Ubuntu LTS 20.04 system. It looks like the dev container pulling is stuck. Or maybe the internet is just ornery today. --------- Co-authored-by: jwatte <jwatte@observeinc.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago
Hamza Ahmed	fcc8e5e839	Update geodataframe.py (#13573 ) here it is validating shapely.geometry.point.Point: if not isinstance(data_frame[page_content_column].iloc[0], gpd.GeoSeries): raise ValueError( f"Expected data_frame[{page_content_column}] to be a GeoSeries" you need it to validate the geoSeries and not the shapely.geometry.point.Point if not isinstance(data_frame[page_content_column], gpd.GeoSeries): raise ValueError( f"Expected data_frame[{page_content_column}] to be a GeoSeries" <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	8 months ago
Harrison Chase	2213fc9711	Harrison/bookend ai (#14258 ) Co-authored-by: stvhu-bookend <142813359+stvhu-bookend@users.noreply.github.com>	8 months ago
cxumol	0d47d15a9f	add(feat): Text Embeddings by Cloudflare Workers AI (#14220 ) Add [Text Embeddings by Cloudflare Workers AI](https://developers.cloudflare.com/workers-ai/models/text-embeddings/). It's a new integration. Trying to align it with its langchain-js version counterpart [here](https://api.js.langchain.com/classes/embeddings_cloudflare_workersai.CloudflareWorkersAIEmbeddings.html). - Dependencies: N/A - Done `make format` `make lint` `make spell_check` `make integration_tests` and all my changes was passed	8 months ago
Harrison Chase	c51001f01e	fix comet tracer (#14259 )	8 months ago
Erick Friis	4351b99d2b	docs[patch]: search experiment (#14254 ) - npm - search config - custom	8 months ago
Harrison Chase	4fb72ff76f	fake consistent embeddings cleanup (#14256 ) delete code that could never be reached	8 months ago
Michael Landis	e26906c1dc	feat: implement max marginal relevance for momento vector index (#13619 ) Description Implements `max_marginal_relevance_search` and `max_marginal_relevance_search_by_vector` for the Momento Vector Index vectorstore. Additionally bumps the `momento` dependency in the lock file and adds logging to the implementation. Dependencies ✅ updates `momento` dependency in lock file Tag maintainer @baskaryan Twitter handle Please tag @momentohq for Momento Vector Index and @mloml for the contribution 🙇 <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	8 months ago
deedy5	ee9abb6722	Bugfix duckduckgo_search news search (#13670 ) - Description: Bugfix duckduckgo_search news search - Issue: https://github.com/langchain-ai/langchain/issues/13648 - Dependencies: None - Tag maintainer: @baskaryan --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago
Aliaksandr Kuzmik	676a077c4e	Add CometTracer (#13661 ) Hi! I'm Alex, Python SDK Team Lead from [Comet](https://www.comet.com/site/). This PR contains our new integration between langchain and Comet - `CometTracer` class which uses new `comet_llm` python package for submitting data to Comet. No additional dependencies for the langchain package are required directly, but if the user wants to use `CometTracer`, `comet-llm>=2.0.0` should be installed. Otherwise an exception will be raised from `CometTracer.__init__`. A test for the feature is included. There is also an already existing callback (and .ipynb file with example) which ideally should be deprecated in favor of a new tracer. I wasn't sure how exactly you'd prefer to do it. For example we could open a separate PR for that. I'm open to your ideas :)	8 months ago
Harrison Chase	921c4b5597	Harrison/searchapi (#14252 ) Co-authored-by: SebastjanPrachovskij <86522260+SebastjanPrachovskij@users.noreply.github.com>	8 months ago
Ravidhu	224aa5151d	Fix Sagemaker Endpoint documentation (#13660 ) - Description: fixed the transform_input method in the example., - Issue: example didn't work, - Dependencies: None, - Tag maintainer: @baskaryan, - Twitter handle: @Ravidhu87	8 months ago
Colin Ulin	9f9cb71d26	Embaas - added backoff retries for network requests (#13679 ) Running a large number of requests to Embaas' servers (or any server) can result in intermittent network failures (both from local and external network/service issues). This PR implements exponential backoff retries to help mitigate this issue.	8 months ago
Erick Friis	f26d88ca60	docs[patch]: fix columns (#14251 )	8 months ago
Kastan Day	65faba91ad	langchain[patch]: Adding new Github functions for reading pull requests (#9027 ) The Github utilities are fantastic, so I'm adding support for deeper interaction with pull requests. Agents should read "regular" comments and review comments, and the content of PR files (with summarization or `ctags` abbreviations). Progress: - [x] Add functions to read pull requests and the full content of modified files. - [x] Function to use Github's built in code / issues search. Out of scope: - Smarter summarization of file contents of large pull requests (`tree` output, or ctags). - Smarter functions to checkout PRs and edit the files incrementally before bulk committing all changes. - Docs example for creating two agents: - One watches issues: For every new issue, open a PR with your best attempt at fixing it. - The other watches PRs: For every new PR && every new comment on a PR, check the status and try to finish the job. <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! Please make sure you're PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @baskaryan - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @baskaryan - Memory: @hwchase17 - Agents / Tools / Toolkits: @hinthornw - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --> --------- Co-authored-by: Erick Friis <erick@langchain.dev>	8 months ago
Hynek Kydlíček	aa8ae31e5b	core[patch]: add response kwarg to on_llm_error # Dependencies None # Twitter handle @HKydlicek --------- Co-authored-by: Erick Friis <erick@langchain.dev>	8 months ago
Leonid Ganeline	1750cc464d	docs[patch]: moved `vectorstore` notebook file (#14181 ) The `/docs/integrations/toolkits/vectorstore` page is not the Integration page. The best place is in `/docs/modules/agents/how_to/` - Moved the file - Rerouted the page URL	8 months ago
Jacob Lee	a26c4a0930	Allow base_store to be used directly with MultiVectorRetriever (#14202 ) Allow users to pass a generic `BaseStore[str, bytes]` to MultiVectorRetriever, removing the need to use the `create_kv_docstore` method. This encoding will now happen internally. @rlancemartin @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	8 months ago
Vincent Brouwers	67662564f3	langchain[patch]: Fix `config` arg detection for wrapped lambdarunnable (#14230 ) Description: When a RunnableLambda only receives a synchronous callback, this callback is wrapped into an async one since #13408. However, this wrapping with `(args, *kwargs)` causes the `accepts_config` check at [/libs/core/langchain_core/runnables/config.py#L342](`ee94ef55ee/libs/core/langchain_core/runnables/config.py (L342)`) to fail, as this checks for the presence of a "config" argument in the method signature. Adding a `functools.wraps` around it, resolves it.	8 months ago
Jacob Lee	de86b84a70	Prefer byte store interface for Upstash BaseStore to match other Redis (#14201 ) If we are not going to make the existing Docstore class also implement `BaseStore[str, Document]`, IMO all base store implementations should always be `[str, bytes]` so that they are more interchangeable. CC @rlancemartin @eyurtsev	8 months ago
Harrison Chase	411aa9a41e	Harrison/nasa tool (#14245 ) Co-authored-by: Jacob Matias <88005863+matiasjacob25@users.noreply.github.com> Co-authored-by: Karam Daid <karam.daid@mail.utoronto.ca> Co-authored-by: Jumana <jumana.fanous@mail.utoronto.ca> Co-authored-by: KaramDaid <38271127+KaramDaid@users.noreply.github.com> Co-authored-by: Anna Chester <74325334+CodeMakesMeSmile@users.noreply.github.com> Co-authored-by: Jumana <144748640+jfanous@users.noreply.github.com>	8 months ago
nceccarelli	5fea63327b	Support Azure gov cloud in Azure Cognitive Search retriever (#13695 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: The existing version hardcoded search.windows.net in the base url. This is not compatible with the gov cloud. I am allowing the user to override the default for gov cloud support., - Issue: N/A, did not write up in an issue, - Dependencies: None Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Nicholas Ceccarelli <nceccarelli2@moog.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago
ealt	e09b876863	Fixes error loading Obsidian templates (#13888 ) - Description: Obsidian templates can include [variables](https://help.obsidian.md/Plugins/Templates#Template+variables) using double curly braces. `ObsidianLoader` uses PyYaml to parse the frontmatter of documents. This parsing throws an error when encountering variables' curly braces. This is avoided by temporarily substituting safe strings before parsing. - Issue: #13887 - Tag maintainer: @hwchase17	8 months ago
Erick Friis	f6d68d78f3	nbdoc -> quarto (#14156 ) Switches to a more maintained solution for building ipynb -> md files (`quarto`) Also bumps us down to python3.8 because it's significantly faster in the vercel build step. Uses default openssl version instead of upgrading as well.	8 months ago
Nithish Raghunandanan	eecfa3f9e5	Add Couchbase document loader (#13979 ) Description: Adds the document loader for [Couchbase](http://couchbase.com/), a distributed NoSQL database. Dependencies: Added the Couchbase SDK as an optional dependency. Twitter handle: nithishr --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
Bob Lin	805e9bfc24	Add doc for the development of core and experimental sections (#13966 ) ### Description Hi, I just started learning the source code of `langchain` and hope to contribute code. However, according to the instructions in the [CONTRIBUTING.md](https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md) document, I could not run the test command `make test` to run normally. I found that many modules did not exist after [splitting `langchain_core`](https://github.com/langchain-ai/langchain/discussions/13823), so I updated the document. ### Twitter handle lin_bob57617	8 months ago
Muntaqa Mahmood	25f72944a0	Add: Steam API tool (#14008 ) - Description: Our PR is an integration of a Steam API Tool that makes recommendations on steam games based on user's Steam profile and provides information on games based on user provided queries. - Issue: the issue # our PR implements: https://github.com/langchain-ai/langchain/issues/12120 - Dependencies: python-steam-api library, steamspypi library and decouple library - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: N/A Hello langchain Maintainers, We are a team of 4 University of Toronto students contributing to langchain as part of our course [CSCD01 (link to course page)](https://cscd01.com/work/open-source-project). We hope our changes help the community. We have run make format, make lint and make test locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Our PR integrates the python-steam-api, steamspypi and decouple packages. We have added integration tests to test our python API integration into langchain and an example notebook is also provided. Our amazing team that contributed to this PR: @JohnY2002, @shenceyang, @andrewqian2001 and @muntaqamahmood Thank you in advance to all the maintainers for reviewing our PR! --------- Co-authored-by: Shence <ysc1412799032@163.com> Co-authored-by: JohnY2002 <johnyuan0526@gmail.com> Co-authored-by: Andrew Qian <andrewqian2001@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: JohnY <94477598+JohnY2002@users.noreply.github.com>	8 months ago
Bob Lin	cd2028288e	Add openai v2 adapter (#14063 ) ### Description Starting from [openai version 1.0.0](`17ac677995 (module-level-client)`), the camel case form of `openai.ChatCompletion` is no longer supported and has been changed to lowercase `openai.chat.completions`. In addition, the returned object only accepts attribute access instead of index access: ```python import openai # optional; defaults to `os.environ['OPENAI_API_KEY']` openai.api_key = '...' # all client options can be configured just like the `OpenAI` instantiation counterpart openai.base_url = "https://..." openai.default_headers = {"x-foo": "true"} completion = openai.chat.completions.create( model="gpt-4", messages=[ { "role": "user", "content": "How do I output all files in a directory using Python?", }, ], ) print(completion.choices[0].message.content) ``` So I implemented a compatible adapter that supports both attribute access and index access: ```python In [1]: from langchain.adapters import openai as lc_openai ...: messages = [{"role": "user", "content": "hi"}] In [2]: result = lc_openai.chat.completions.create( ...: messages=messages, model="gpt-3.5-turbo", temperature=0 ...: ) In [3]: result.choices[0].message Out[3]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [4]: result["choices"][0]["message"] Out[4]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [5]: result = await lc_openai.chat.completions.acreate( ...: messages=messages, model="gpt-3.5-turbo", temperature=0 ...: ) In [6]: result.choices[0].message Out[6]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [7]: result["choices"][0]["message"] Out[7]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [8]: for rs in lc_openai.chat.completions.create( ...: messages=messages, model="gpt-3.5-turbo", temperature=0, stream=True ...: ): ...: print(rs.choices[0].delta) ...: print(rs["choices"][0]["delta"]) ...: {'role': 'assistant', 'content': ''} {'role': 'assistant', 'content': ''} {'content': 'Hello'} {'content': 'Hello'} {'content': '!'} {'content': '!'} In [20]: async for rs in await lc_openai.chat.completions.acreate( ...: messages=messages, model="gpt-3.5-turbo", temperature=0, stream=True ...: ): ...: print(rs.choices[0].delta) ...: print(rs["choices"][0]["delta"]) ...: {'role': 'assistant', 'content': ''} {'role': 'assistant', 'content': ''} {'content': 'Hello'} {'content': 'Hello'} {'content': '!'} {'content': '!'} ... ``` ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	8 months ago
billytrend-cohere	0f02081392	Add input_type override (#14068 ) Add option to override input_type for cohere's v3 embeddings models --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
Dmitrii Rashchenko	aaabc1574f	Support of custom hugging face inference endpoints url (#14125 ) - Description: to support not only publicly available Hugging Face endpoints, but also protected ones (created with "Inference Endpoints" Hugging Face feature), I have added ability to specify custom api_url. But if not specified, default behaviour won't change - Issue: #9181, - Dependencies: no extra dependencies	8 months ago
Bob Lin	702a6d7044	Closed #14159 (#14165 ) ### Description Fix: #14159 Use `from pydantic.v1 import BaseModel, Field` instead of `from pydantic import BaseModel, Field` ### [lin_bob57617](https://twitter.com/lin_bob57617)	8 months ago
Perry Lee	641e401ba8	Shorten wget commands (#14211 ) - Description: The commands can be more efficient if the output name is set to the destined filename instead of renaming in the second command.	8 months ago
Harrison Chase	e32185193e	Harrison/embass (#14242 ) Co-authored-by: Julius Lipp <lipp.julius@gmail.com>	8 months ago
umair mehmood	8504ec56e4	fixed: ModuleNotFoundError: No module named 'clarifai.auth' (#14215 ) Updated the clarifai imports fixed: #14175 @efriis @baskaryan	8 months ago
Hieu Lam	ca8a022cd9	Fixed OpenAIFunctionsAgent not returning when receiving AgentFinish (#14236 ) Description: The way the condition is checked in the `return_stopped_response` function of `OpenAIAgent` may not be correct, when the value returned is `AgentFinish` from the tools it does not work properly. Thanks for review, @baskaryan, @eyurtsev, @hwchase17.	8 months ago
Unai Garay Maestre	6826feea14	Adds `llm_chain_kwargs` to `BaseRetrievalQA.from_llm` (#14224 ) - Description: Adds `llm_chain_kwargs` to `BaseRetrievalQA.from_llm` so these can be passed to the LLM at runtime, - Issue: https://github.com/langchain-ai/langchain/issues/14216, --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com>	8 months ago
James Braza	6ce5dab38c	Clarifying descriptions in `GuardrailsOutputParser` (#14228 ) Upstreaming knowledge from https://github.com/guardrails-ai/guardrails/discussions/473 to LangChain	8 months ago
geret1	50aee687c6	langchain[patch]: Cerebrium model_api_request deprecation (#12704 ) - Description: As part of my conversation with Cerebrium team, `model_api_request` will be no longer available in cerebrium lib so it needs to be replaced. - Issue: #12705 12705, - Dependencies: Cerebrium team (agreed) - Tag maintainer: @eyurtsev - Twitter handle: No official Twitter account sorry :D --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
Harutaka Kawamura	ee94ef55ee	docs[patch]: Update MLflow and Databricks docs (#14011 ) Depends on #13699. Updates the existing mlflow and databricks examples. --------- Co-authored-by: Ben Wilson <39283302+BenWilson2@users.noreply.github.com>	8 months ago
Leonid Ganeline	94bf733dae	docs[patch]: `AWS` platform page update (#14160 ) The `AWS` platform page has many missed integrations. - added missed integration references to the `AWS` platform page - added/updated descriptions and links in the referenced notebooks - renamed two notebook files. They have file names != page Title, which generate unordered ToC. - reroute the URLs for renamed files - fixed `amazon_textract` notebook: removed failed cell outputs	8 months ago
Leonid Ganeline	74d4154bcc	docs[patch]: added `Templates Hub` menu item (#14148 ) This link was missing in Docs. Added it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
William FH	246dc4f9cc	langchain[patch]: Pass kwargs to chat fireworks (#14183 ) Otherwise `.bind()` isn't really any good	8 months ago
Kaiboon Ee	e961c57fd2	langchain[patch]: Mask API key for Arcee LLM (#14193 ) - Description: Mask API key for Arcee LLM and its associated unit tests - Issue: https://github.com/langchain-ai/langchain/issues/12165 - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: `eekaiboon` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
Daniyar Supiyev	092f302c0f	langchain[patch]: Asynchronous human-in-the-loop callback (#14195 ) Description: Adding a possibility to use asynchronous callback handler in human-in-the-loop validation tool. Very useful, for example, if you want to implement a validation over Telegram bot. Issue: - Dependencies: - --------- Co-authored-by: Daniyar_Supiyev <daniyar_supiyev@epam.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	8 months ago
Leonid Ganeline	c660b0cf79	docs[patch]: moved semadb.mdx file (#14204 ) SemaDB.mdx file was placed with additional sub-folder: `https://python.langchain.com/docs/integrations/providers/providers/semadb` - Moved file to the `https://python.langchain.com/docs/integrations/providers/semadb` - Added a redirect for the file URL	8 months ago
Mark Cusack	16c83f786c	Adds the Yellowbrick Data Warehouse as a supported vector store (#13820 ) - Description An integration to allow the Yellowbrick Data Warehouse to function as a vector store --------- Co-authored-by: markcusack <markcusack@markcusacksmac.lan> Co-authored-by: markcusack <markcusack@Mark-Cusack-sMac.local>	8 months ago
Hendrik Hogertz	e6862e6e7d	Fix Azure Openai function calling in streaming mode (#13768 ) - Description: This PR addresses an issue with the OpenAI API streaming response, where initially the key (arguments) is provided but the value is None. Subsequently, it updates with {"arguments": "{\n"}, leading to a type inconsistency that causes an exception. The specific error encountered is ValueError: additional_kwargs["arguments"] already exists in this message, but with a different type. This change aims to resolve this inconsistency and ensure smooth API interactions. - Issue: None. - Dependencies: None. - Tag maintainer: @eyurtsev This is an updated version of #13229 based on the refactored code. Credit goes to @superken01. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago
Nicolò Boschi	e204657b3c	AstraDB VectorStore: implement pre_delete_collection (#13780 ) - Description: some vector stores have a flag for try deleting the collection before creating it (such as ´vectorpg´). This is a useful flag when prototyping indexing pipelines and also for integration tests. Added the bool flag `pre_delete_collection ` to the constructor (default False) - Tag maintainer: @hemidactylus - Twitter handle: nicoloboschi --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago
Chelsea E. Manning	2780d2d4dd	Extend OpenAIEmbeddings class to support non-`tiktoken` based embeddings (#13884 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: This extends `OpenAIEmbeddings` to add support for non-`tiktoken` based embeddings, specifically for use with the new `text-generation-webui` API (`--extensions openai`) which does not support `tiktoken` encodings, but rather strings - Issue: Not found, - Dependencies: HuggingFace `transformers.AutoTokenizer` is new dependency for running the model without `tiktoken` - Tag maintainer: @baskaryan based on last commit for `langchain-core` refactor - Twitter handle: @xychelsea Modified the tokenization process to be model-agnostic, allowing for both OpenAI and non-OpenAI model tokenizations, by setting the new default `bool` flag `tiktoken_enabled` to `False`. This requeires HuggingFace’s AutoTokenizer and handling tokenization for models requiring different preprocessing steps to generate a chunked string request rather than a list of integers. Updated the embeddings generation process to accommodate non-OpenAI models. This includes converting tokenized text into embeddings using OpenAI’s and Hugging Face’s model architectures. -->	8 months ago
Changgeng Zhao	9b59bde93d	Update Hologres vector store: use hologres-vector (#13767 ) Hi, I made some code changes on the Hologres vector store to improve the data insertion performance. Also, this version of the code uses `hologres-vector` library. This library is more convenient for us to update, and more efficient in performance. The code has passed the format/lint/spell check. I have run the unit test for Hologres connecting to my own database. Please check this PR again and tell me if anything needs to change. Best, Changgeng, Developer @ Alibaba Cloud Co-authored-by: Changgeng Zhao <zhaochanggeng.zcg@alibaba-inc.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	8 months ago

1 2 3 4 5 ...

6338 Commits (0a9d933bb298bccf087e47558acf2653b1e4a6c9) All Branches Search

6338 Commits (0a9d933bb298bccf087e47558acf2653b1e4a6c9)

All Branches