langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-06 03:20:49 +00:00

Author	SHA1	Message	Date
Bagatur	340b42d8ee	docs[minor]: lcel why page (#14089 )	2023-12-01 16:13:31 -08:00
Erick Friis	b01d9d27d9	docs[patch]: docs local build (#14152 )	2023-12-01 14:03:36 -08:00
Alex Kira	0caef3cde7	Change RunnableMap to RunnableParallel for consistency (#14142 ) - Description: Change instances of RunnableMap to RunnableParallel, as that should be the one used going forward. This makes it consistent across the codebase.	2023-12-01 13:36:40 -08:00
Martin Jul	e3a7c96a8e	docs[patch]: Fix minor typos (casing) in quickstart (#14138 ) Fix casing of API and LangChain in the description text for the LangServe example server.	2023-12-01 13:29:53 -08:00
Erick Friis	8cf4cb9e48	docs[patch]: Fix templates/index (#14146 )	2023-12-01 13:09:36 -08:00
Alex Kira	6eb40db353	docs[patch]: Add getting started section to LCEL doc (#14045 ) ### Description: Doc addition for LCEL introduction. Adds a more basic starter guide for using LCEL. --------- Co-authored-by: Alex Kira <akira@Alexs-MBP.local.tld> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 12:23:43 -08:00
axiangcoding	1b36ddf16c	docs[patch]: add deprecated note for ErnieChatBot (#14061 ) - Description: just a little change of ErnieChatBot class description, sugguesting user to use more suitable class - Issue: none, - Dependencies: none, - Tag maintainer: @baskaryan , - Twitter handle: none	2023-12-01 11:16:31 -08:00
Alex Kira	1757258b2a	docs[patch]: Add mermaid JS theme dependency to docusaurus (#14051 ) - Description: Add mermaid JS dependency and configs to documentation. Allows inline doc diagrams in markdown. - Dependencies: NPM package @docusaurus/theme-mermaid	2023-12-01 11:06:29 -08:00
Harrison Chase	ae646701c4	Harrison/ibm (#14133 ) Co-authored-by: Mateusz Szewczyk <139469471+MateuszOssGit@users.noreply.github.com>	2023-12-01 12:44:11 -05:00
Bob Lin	f15859bd86	docs[patch]: Update discord.ipynb (#14099 ) ### Description Now if `example` in Message is False, it will not be displayed. Update the output in this document. ```python In [22]: m = HumanMessage(content="Text") In [23]: m Out[23]: HumanMessage(content='Text') In [24]: m = HumanMessage(content="Text", example=True) In [25]: m Out[25]: HumanMessage(content='Text', example=True) ``` ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-01 08:54:31 -08:00
Bob Lin	75312c3694	docs[patch]: Update facebook.ipynb (#14102 ) ### Description Openai version 1.0.0 and later no longer supports the usage of camel case, So [the APIs](https://github.com/openai/openai-python/blob/main/api.md#finetuning) needs to be modified. ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-01 08:49:56 -08:00
Ean Yang	ac1c8634a8	docs[patch] Update invalid guides link (#14106 )	2023-12-01 08:47:38 -08:00
Erick Friis	b161f302ff	docs[patch]: local docs build <5s (#14096 )	2023-11-30 17:39:30 -08:00
Hubert Yuan	80ed588733	docs[patch]: Update metaphor_search.ipynb (#14093 ) - Description: Touch up of the documentation page for Metaphor Search Tool integration. Removes documentation for old built-in tool wrapper. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-30 16:34:05 -08:00
Jacob Lee	3328507f11	langchain[patch], experimental[minor]: Adds OllamaFunctions wrapper (#13330 ) CC @baskaryan @hwchase17 @jmorganca Having a bit of trouble importing `langchain_experimental` from a notebook, will figure it out tomorrow ~Ah and also is blocked by #13226~ --------- Co-authored-by: Lance Martin <lance@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-30 16:13:57 -08:00
Rohan Dey	41a4c06a94	Added support for a Pandas DataFrame OutputParser (#13257 ) Description: Added support for a Pandas DataFrame OutputParser with format instructions, along with unit tests and a demo notebook. Namely, we've added the ability to request data from a DataFrame, have the LLM parse the request, and then use that request to retrieve a well-formatted response. Within LangChain, it seamlessly integrates with language models like OpenAI's `text-davinci-003`, facilitating streamlined interaction using the format instructions (just like the other output parsers). This parser structures its requests as `<operation/column/row>[<optional_array_params>]`. The instructions detail permissible operations, valid columns, and array formats, ensuring clarity and adherence to the required format. For example: - When the LLM receives the input: "Retrieve the mean of `num_legs` from rows 1 to 3." - The provided format instructions guide the LLM to structure the request as: "mean:num_legs[1..3]". The parser processes this formatted request, leveraging the LLM's understanding to extract the mean of `num_legs` from rows 1 to 3 within the Pandas DataFrame. This integration allows users to communicate requests naturally, with the LLM transforming these instructions into structured commands understood by the `PandasDataFrameOutputParser`. The format instructions act as a bridge between natural language queries and precise DataFrame operations, optimizing communication and data retrieval. Issue: - https://github.com/langchain-ai/langchain/issues/11532 Dependencies: No additional dependencies :) Tag maintainer: @baskaryan Twitter handle: No need. :) --------- Co-authored-by: Wasee Alam <waseealam@protonmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 22:08:50 -05:00
Masanori Taniguchi	235bdb9fa7	Support Vald secure connection (#13269 ) Description: When using Vald, only insecure grpc connection was supported, so secure connection is now supported. In addition, grpc metadata can be added to Vald requests to enable authentication with a token. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-11-29 22:07:29 -05:00
Nico Puhlmann	54355b651a	Update index.mdx (#13285 ) grammar correction <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 22:06:33 -05:00
AthulVincent	67c55cb5b0	Implemented MongoDB Atlas Self-Query Retriever (#13321 ) # Description This PR implements Self-Query Retriever for MongoDB Atlas vector store. I've implemented the comparators and operators that are supported by MongoDB Atlas vector store according to the section titled "Atlas Vector Search Pre-Filter" from https://www.mongodb.com/docs/atlas/atlas-vector-search/vector-search-stage/. Namely: ``` allowed_comparators = [ Comparator.EQ, Comparator.NE, Comparator.GT, Comparator.GTE, Comparator.LT, Comparator.LTE, Comparator.IN, Comparator.NIN, ] """Subset of allowed logical operators.""" allowed_operators = [ Operator.AND, Operator.OR ] ``` Translations from comparators/operators to MongoDB Atlas filter operators(you can find the syntax in the "Atlas Vector Search Pre-Filter" section from the previous link) are done using the following dictionary: ``` map_dict = { Operator.AND: "$and", Operator.OR: "$or", Comparator.EQ: "$eq", Comparator.NE: "$ne", Comparator.GTE: "$gte", Comparator.LTE: "$lte", Comparator.LT: "$lt", Comparator.GT: "$gt", Comparator.IN: "$in", Comparator.NIN: "$nin", } ``` In visit_structured_query() the filters are passed as "pre_filter" and not "filter" as in the MongoDB link above since langchain's implementation of MongoDB atlas vector store(libs\langchain\langchain\vectorstores\mongodb_atlas.py) in _similarity_search_with_score() sets the "filter" key to have the value of the "pre_filter" argument. ``` params["filter"] = pre_filter ``` Test cases and documentation have also been added. # Issue #11616 # Dependencies No new dependencies have been added. # Documentation I have created the notebook mongodb_atlas_self_query.ipynb outlining the steps to get the self-query mechanism working. I worked closely with [@Farhan-Faisal](https://github.com/Farhan-Faisal) on this PR. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 22:05:06 -05:00
Mohammad Mohtashim	f3dd4a10cf	DROP BOX Loader Documentation Update (#14047 ) - Description: Update the document for drop box loader + made the messages more verbose when loading pdf file since people were getting confused - Issue: #13952 - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17, --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-29 17:25:35 -08:00
Cheng (William) Huang	a00db4b28f	Add multi-input Reddit search tool (#13893 ) - Description: Added a tool called RedditSearchRun and an accompanying API wrapper, which searches Reddit for posts with support for time filtering, post sorting, query string and subreddit filtering. - Issue: #13891 - Dependencies: `praw` module is used to search Reddit - Tag maintainer: @baskaryan , and any of the other maintainers if needed - Twitter handle: None. Hello, This is our first PR and we hope that our changes will be helpful to the community. We have run `make format`, `make lint` and `make test` locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Our PR integrates the `praw` package which is already used by RedditPostsLoader in LangChain. Nonetheless, we have added integration tests and edited unit tests to test our changes. An example notebook is also provided. These changes were put together by me, @Anika2000, @CharlesXu123, and @Jeremy-Cheng-stack Thank you in advance to the maintainers for their time. --------- Co-authored-by: What-Is-A-Username <49571870+What-Is-A-Username@users.noreply.github.com> Co-authored-by: Anika2000 <anika.sultana@mail.utoronto.ca> Co-authored-by: Jeremy Cheng <81793294+Jeremy-Cheng-stack@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-11-29 20:16:40 -05:00
Jawad Arshad	00a6e8962c	langchain[minor]: Add serpapi tools (#13934 ) - Description: Added some of the more endpoints supported by serpapi that are not suported on langchain at the moment, like google trends, google finance, google jobs, and google lens - Issue: [Add support for many of the querying endpoints with serpapi #11811](https://github.com/langchain-ai/langchain/issues/11811) --------- Co-authored-by: zushenglu <58179949+zushenglu@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Ian Xu <ian.xu@mail.utoronto.ca> Co-authored-by: zushenglu <zushenglu1809@gmail.com> Co-authored-by: KevinT928 <96837880+KevinT928@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 14:02:57 -08:00
h3l	dbaeb163aa	langchain[minor]: add volcengine endpoint as LLM (#13942 ) - Description: Volc Engine MaaS serves as an enterprise-grade, large-model service platform designed for developers. You can visit its homepage at https://www.volcengine.com/docs/82379/1099455 for details. This change will facilitate developers to integrate quickly with the platform. - Issue: None - Dependencies: volcengine - Tag maintainer: @baskaryan - Twitter handle: @he1v3tica --------- Co-authored-by: lvzhong <lvzhong@bytedance.com>	2023-11-29 13:16:42 -08:00
Tomaz Bratanic	3eb391561b	langchain[minor]: Reduce the number of tokens required to describe a Cypher/Neo4j schema (#13851 ) Instead of using JSON-like syntax to describe node and relationship properties we changed to a shorter and more concise schema description Old: ``` Node properties are the following: [{'properties': [{'property': 'name', 'type': 'STRING'}], 'labels': 'Movie'}, {'properties': [{'property': 'name', 'type': 'STRING'}], 'labels': 'Actor'}] Relationship properties are the following: [] The relationships are the following: ['(:Actor)-[:ACTED_IN]->(:Movie)'] ``` New: ``` Node properties are the following: Movie {name: STRING},Actor {name: STRING} Relationship properties are the following: The relationships are the following: (:Actor)-[:ACTED_IN]->(:Movie) ```	2023-11-29 11:13:12 -08:00
Sauhaard	7ec4dbeb80	langchain[minor]: Add StackExchange API integration (#14002 ) Implements [#12115](https://github.com/langchain-ai/langchain/issues/12115) Who can review? @baskaryan , @eyurtsev , @hwchase17 Integrated Stack Exchange API into Langchain, enabling access to diverse communities within the platform. This addition enhances Langchain's capabilities by allowing users to query Stack Exchange for specialized information and engage in discussions. The integration provides seamless interaction with Stack Exchange content, offering content from varied knowledge repositories. A notebook example and test cases were included to demonstrate the functionality and reliability of this integration. - Add StackExchange as a tool. - Add unit test for the StackExchange wrapper and tool. - Add documentation for the StackExchange wrapper and tool. If you have time, could you please review the code and provide any feedback as necessary! My team is welcome to any suggestions. --------- Co-authored-by: Yuval Kamani <yuvalkamani@gmail.com> Co-authored-by: Aryan Thakur <aryanthakur@Aryans-MacBook-Pro.local> Co-authored-by: Manas1818 <79381912+manas1818@users.noreply.github.com> Co-authored-by: aryan-thakur <61063777+aryan-thakur@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 10:32:07 -08:00
Anton Romanov	4964278ce4	docs[patch]: Update typo in map.ipynb (#14030 ) fix the typo in docs, using "with" instead of "when"	2023-11-29 09:14:29 -08:00
Richie	1cd9d5f332	docs[patch]: fix typo langchain version for mongodb integration (#14006 ) - Description: update minimal supported langchain version for [mongodb atlast integration webpage](https://python.langchain.com/docs/integrations/vectorstores/mongodb_atlas) - Issue: none - Dependencies: none ----- Just fixing a typo. In [mongodb atlas vectorstore integration page](https://python.langchain.com/docs/integrations/vectorstores/mongodb_atlas), `langchain` support for `$vectorSearch MQL stage` should be `0.0.305` rather than `0.0.35`	2023-11-28 21:20:30 -08:00
Piotr Ząbek	d0b818b634	DOCS: added missing imports (#13736 ) (#13737 ) - Description: Fixed missing imports in docs - Issue: [#13736](https://github.com/langchain-ai/langchain/issues/13736) - Dependencies: N/A	2023-11-28 22:42:43 -05:00
Yusuf Khan	0bc7c1b5b4	Add Outline provider doc (#13938 ) - Description: Added a provider doc to `docs/integrations/providers` for the new Outline integration in #13889 - Tag maintainer: @baskaryan	2023-11-28 22:29:30 -05:00
colton	643d28847d	[docs] fix reduce prompt in summarization example (#13726 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Small fix to _summarization_ example, `reduce_template` should use `{docs}` variable. Bug likely introduced as following code suggests using `hub.pull("rlm/map-prompt")` instead of defined prompt.	2023-11-28 22:22:42 -05:00
Leonid Ganeline	5c28bb63dd	docs `microsoft` page updates (#14000 ) The Excel, PowerPoint and SharePoint document loaders were missed in the `Microsoft` platform page. - added these references	2023-11-28 22:20:21 -05:00
Leonid Ganeline	15b32cfcd4	docs `OpenAI` platform page update (#14001 ) Missed the OpenAI adapter reference in the OpenAI platform page - Added this reference	2023-11-28 22:08:21 -05:00
WaseemH	a47f1da884	docs[patch]: RAG Cookbook example fix (#13914 ) ### Description: Hey 👋🏽 this is a small docs example fix. Hoping it helps future developers who are working with Langchain. ### Problem: Take a look at the original example code. You were not able to get the `dialogue_turn[0]` while it was a tuple. Original code: ```python def _format_chat_history(chat_history: List[Tuple]) -> str: buffer = "" for dialogue_turn in chat_history: human = "Human: " + dialogue_turn[0] ai = "Assistant: " + dialogue_turn[1] buffer += "\n" + "\n".join([human, ai]) return buffer ``` In the original code you were getting this error: ```bash human = "Human: " + dialogue_turn[0].content ~~~~~~~~~~~~~^^^ TypeError: 'HumanMessage' object is not subscriptable ``` ### Solution: The fix is to just for loop over the chat history and look to see if its a human or ai message and add it to the buffer.	2023-11-28 17:37:03 -08:00
Bagatur	14799b139a	infra[patch]: add base deps and fix docs lint (#13998 )	2023-11-28 17:27:37 -08:00
Leonid Ganeline	52eee458bb	renamed `google_vertex_ai_vector_search` notebook (#13484 ) The `integrations/vectorstores/matchingengine.ipynb` example has the "Google Vertex AI Vector Search" title. This place this Title in the wrong order in the ToC (it is sorted by the file name). - Renamed `integrations/vectorstores/matchingengine.ipynb` into `integrations/vectorstores/google_vertex_ai_vector_search.ipynb`. - Updated a correspondent comment in docstring - Rerouted old URL to a new URL --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-28 16:58:29 -08:00
Leonid Ganeline	f5326cfb4e	docs[patch]: link to LangSmith docs (#13740 ) It happens that there is no link to the LangSmith Docs from the LangChain Docs. Added this link	2023-11-28 16:44:45 -08:00
Leonid Ganeline	1ab8a14742	docs[patch]: top menu (#13748 ) Addressed this issue with the top menu: It allocates too much space. If the screen is small, then the top menu items are split into two lines and look unreadable. Another issue is with several top menu items: "Chat our docs" and "Also by LangChain". They are compound of several words which also hurts readability. The top menu items should be 1-word size. Updates: - "Chat our docs" -> "Chat" (the meaning is clean after clicking/opening the item) - "Also by LangChain" -> "🦜️🔗" - "🦜️🔗" moved before "Chat" item. This new item is partially copied from the first left item, the "🦜️🔗 LangChain". This design (with two 🦜️🔗 elements, visually splits the top menu into two parts. The first item in each part holds the 🦜️🔗 symbols and, when we click the second 🦜️🔗 item, it opens the drop-down menu. So, we've got two visually similar parts, which visually split the top menu on the right side: the LangChain Docs (and Doc-related items) and the lift side: other LangChain.ai (company) products/docs.	2023-11-28 16:35:38 -08:00
Taqi Jaffri	144710ad9a	langchain[minor]: Updated DocugamiLoader, includes breaking changes (#13265 ) There are the following main changes in this PR: 1. Rewrite of the DocugamiLoader to not do any XML parsing of the DGML format internally, and instead use the `dgml-utils` library we are separately working on. This is a very lightweight dependency. 2. Added MMR search type as an option to multi-vector retriever, similar to other retrievers. MMR is especially useful when using Docugami for RAG since we deal with large sets of documents within which a few might be duplicates and straight similarity based search doesn't give great results in many cases. We are @docugami on twitter, and I am @tjaffri --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	2023-11-28 15:56:22 -08:00
Bagatur	95a472a85f	docs[patch]: install local core (#13990 )	2023-11-28 14:36:22 -08:00
Bagatur	61ec71064a	docs[patch]: update stack diagram (#13902 )	2023-11-28 14:19:13 -08:00
david qiu	9fb6805be4	langchain[minor]: Add retriever for Knowledge Bases for Amazon Bedrock (#13980 ) - Description: Adds a retriever implementation for [Knowledge Bases for Amazon Bedrock](https://aws.amazon.com/bedrock/knowledge-bases/), a new service announced at AWS re:Invent, shortly before this PR was opened. This depends on the `bedrock-agent-runtime` service, which will be included in a future version of `boto3` and of `botocore`. We will open a follow-up PR documenting the minimum required versions of `boto3` and `botocore` after that information is available. - Issue: N/A - Dependencies: `boto3>=1.33.2, botocore>=1.33.2` - Tag maintainer: @baskaryan - Twitter handles: `@pjain7` `@dead_letter_q` This PR includes a documentation notebook under `docs/docs/integrations/retrievers`, which I (@dlqqq) have verified independently. EDIT: `bedrock-agent-runtime` service is now included in `boto3>=1.33.2`: `5cf793f493` --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-28 14:10:23 -08:00
Varun	14cc907d35	Update the stable docs link (#13798 ) - Description: Point to the stable version of documentation, - Twitter handle: varunzxzx	2023-11-28 21:11:16 +00:00
Amélie	d2cad53ec0	Fix broken link on Meilisearch vector-store documentation (#13604 ) - Description: dead link replacement - Issue: no open issue Note: Hi langchain team, Sorry to open a PR for this concern but we realized that one of the links present in the documentation booklet was broken 😄	2023-11-28 15:49:32 -05:00
Rihards Gravis	9e017ff6ba	docs[patch]: Reduce largest static image file size (#13508 ) - Description: Reduce image asset file size used in documentation by running them via lossless image optimization ([tinypng](https://www.npmjs.com/package/tinypng-cli) was used in this case). Images wider than 1916px (the maximum width of an image displayed in documentation) where downsized. - Issue: No issue is created for this, but the large image file assets caused slow documentation load times - Dependencies: No dependencies affected	2023-11-28 13:00:53 -05:00
Michael Feil	686162670e	langchain[minor]: Adding `infinity` embedding integration. (#13928 ) This adds integation to https://github.com/michaelfeil/infinity. Users requested it in https://github.com/michaelfeil/infinity/issues/36 @saatvikshah Follows my implementation of gradient.ai. Feedback 1: Well done - I love your CI / repo / poetry setup - I adapted a lot in https://github.com/michaelfeil/infinity. Feedback 2: Not so good: The openai integration contains to much reverse engineering - in general projects such as michaelfeil/infinity and huggingface/text-embeddings-inference are compatible to the `pip install openai` package. Reverse engineering like this one is really hindering the use for me: `8e88ba16a8/libs/langchain/langchain/embeddings/openai.py (L347)` `8e88ba16a8/libs/langchain/langchain/embeddings/openai.py (L351)` - it is about preventing 3rd party providers to use the same url + uses interfaces of openai, that are not publically documented.	2023-11-27 16:43:47 -08:00
Oleksandr Yaremchuk	c0277d06e8	experimental[patch] Update prompt injection model (#13930 ) - Description: Existing model used for Prompt Injection is quite outdated but we fine-tuned and open-source a new model based on the same model deberta-v3-base from Microsoft - [laiyer/deberta-v3-base-prompt-injection](https://huggingface.co/laiyer/deberta-v3-base-prompt-injection). It supports more up-to-date injections and less prone to false-positives. - Dependencies: No - Tag maintainer: - - Twitter handle: @alex_yaremchuk --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-27 17:56:53 -05:00
Leonid Ganeline	e47b9c5285	DOCS: move `adapters` to integrations (#13862 ) Current docs for adapters are in the `Guides/Adapters which is not a good place. - moved Adapters into `Integratons/Components/Adapters/ - simplified the OpenAI adapter notebook - rerouted the old OpenAI adapter page URL to a new one.	2023-11-27 13:05:43 -08:00
Manuel Riezebosch	92b07ecaf3	DOCS: fix link to question answering (#13806 ) first link in [overview](https://python.langchain.com/docs/use_cases/question_answering/code_understanding#overview)	2023-11-27 12:56:15 -08:00
Chengzu Ou	4b8e053fe8	FEATURE: Add Databricks Vector Search as a new vector store (#13621 ) Description: This PR adds Databricks Vector Search as a new vector store in LangChain. - [x] Add `DatabricksVectorSearch` in `langchain/vectorstores/` - [x] Unit tests - [x] Add [`databricks-vectorsearch`](https://pypi.org/project/databricks-vectorsearch/) as a new optional dependency We ran the following checks: - `make format` passed ✅ - `make lint` failed but the failures were caused by other files + Files touched by this PR passed the linter ✅ - `make test` passed ✅ - `make coverage` failed but the failures were caused by other files. Tests added by or related to this PR all passed + langchain/vectorstores/databricks_vector_search.py test coverage 94% ✅ - `make spell_check` passed ✅ The example notebook and updates to the [provider's documentation page](https://github.com/langchain-ai/langchain/blob/master/docs/docs/integrations/providers/databricks.md) will be added later in a separate PR. Dependencies: Optional dependency: [`databricks-vectorsearch`](https://pypi.org/project/databricks-vectorsearch/) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-27 11:07:26 -08:00
Dylan Williams	1983a39894	FEATURE: Add OneNote document loader (#13841 ) - Description: Added OneNote document loader - Issue: #12125 - Dependencies: msal Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-26 23:59:52 -08:00
Ikko Eltociear Ashimine	ff7d4d9c0b	Update llamacpp.ipynb (#13840 ) specifed -> specified	2023-11-26 23:47:19 -08:00
Sᴜᴘᴇʀ Lᴇᴇ	e42e95cc11	docs: fix link to `local_retrieval_qa` (#13872 ) \The original link in [this section](https://python.langchain.com/docs/use_cases/question_answering/#:~:text=locally%2Drunning%20models-,here,-.): https://python.langchain.com/docs/modules/use_cases/question_answering/local_retrieval_qa After fix: https://python.langchain.com/docs/use_cases/question_answering/local_retrieval_qa	2023-11-26 19:16:46 -08:00
Yusuf Khan	935f78c944	FEATURE: Add retriever for Outline (#13889 ) - Description: Added a retriever for the Outline API to ask questions on knowledge base - Issue: resolves #11814 - Dependencies: None - Tag maintainer: @baskaryan	2023-11-26 18:56:12 -08:00
ggeutzzang	f2af82058f	DOCS: Fix Sample Code for Compatibility with Pydantic 2.0 (#13890 ) - Description: I encountered an issue while running the existing sample code on the page https://python.langchain.com/docs/modules/agents/how_to/agent_iter in an environment with Pydantic 2.0 installed. The following error was triggered: ```python ValidationError Traceback (most recent call last) <ipython-input-12-2ffff2c87e76> in <cell line: 43>() 41 42 tools = [ ---> 43 Tool( 44 name="GetPrime", 45 func=get_prime, 2 frames /usr/local/lib/python3.10/dist-packages/pydantic/v1/main.py in __init__(__pydantic_self__, **data) 339 values, fields_set, validation_error = validate_model(__pydantic_self__.__class__, data) 340 if validation_error: --> 341 raise validation_error 342 try: 343 object_setattr(__pydantic_self__, '__dict__', values) ValidationError: 1 validation error for Tool args_schema subclass of BaseModel expected (type=type_error.subclass; expected_class=BaseModel) ``` I have made modifications to the example code to ensure it functions correctly in environments with Pydantic 2.0.	2023-11-26 18:21:13 -08:00
Stefano Lottini	19c68c7652	FEATURE: Astra DB, LLM cache classes (exact-match and semantic cache) (#13834 ) This PR provides idiomatic implementations for the exact-match and the semantic LLM caches using Astra DB as backend through the database's HTTP JSON API. These caches require the `astrapy` library as dependency. Comes with integration tests and example usage in the `llm_cache.ipynb` in the docs. @baskaryan this is the Astra DB counterpart for the Cassandra classes you merged some time ago, tagging you for your familiarity with the topic. Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-24 18:53:37 -08:00
Stefano Lottini	272df9dcae	Astra DB, chat message history (#13836 ) This PR adds a chat message history component that uses Astra DB for persistence through the JSON API. The `astrapy` package is required for this class to work. I have added tests and a small notebook, and updated the relevant references in the other docs pages. (@rlancemartin this is the counterpart of the Cassandra equivalent class you so helpfully reviewed back at the end of June) Thank you!	2023-11-24 18:12:29 -08:00
Bagatur	23566cbea9	DOCS: core editable dep api refs (#13747 )	2023-11-22 14:33:30 -08:00
Bagatur	3d28c1a9e0	DOCS: fix core api ref build (#13744 )	2023-11-22 15:42:35 -05:00
Harrison Chase	d82cbf5e76	Separate out langchain_core package (#13577 ) Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-20 13:09:30 -08:00
Bagatur	4eec47b191	DOCS: update rag use case images (#13615 )	2023-11-20 10:14:52 -08:00
Sijun He	674bd90a47	DOCS: Fix typo in MongoDB memory docs (#13588 ) - Description: Fix typo in MongoDB memory docs - Tag maintainer: @eyurtsev <!-- Thank you for contributing to LangChain! - Description: Fix typo in MongoDB memory docs - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: @baskaryan - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2023-11-19 19:13:35 -08:00
jwbeck97	a93616e972	FEAT: Add azure cognitive health tool (#13448 ) - Description: This change adds an agent to the Azure Cognitive Services toolkit for identifying healthcare entities - Dependencies: azure-ai-textanalytics (Optional) --------- Co-authored-by: James Beck <James.Beck@sa.gov.au> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-19 18:44:01 -08:00
John Mai	16f7912e1b	BUG: fix hunyuan appid type (#13496 ) - Description: fix hunyuan appid type - Issue: https://github.com/langchain-ai/langchain/pull/12022#issuecomment-1815627855	2023-11-19 18:23:45 -08:00
Leonid Ganeline	43972be632	docs updating `AzureML` notebooks (#13492 ) - Added/updated descriptions and links --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-19 18:07:12 -08:00
Taranjeet Singh	47451764a7	Add embedchain retriever (#13553 ) Description: This commit adds embedchain retriever along with tests and docs. Embedchain is a RAG framework to create data pipelines. Twitter handle: - [Taranjeet's twitter](https://twitter.com/taranjeetio) and [Embedchain's twitter](https://twitter.com/embedchain) Reviewer @hwchase17 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-19 17:35:03 -08:00
rafly lesmana	420a17542d	fix: Make YoutubeLoader support on demand language translation (#13583 ) Description: Enhance the functionality of YoutubeLoader to enable the translation of available transcripts by refining the existing logic. Issue: Encountering a problem with YoutubeLoader (#13523) where the translation feature is not functioning as expected. Tag maintainers/contributors who might be interested: @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-19 17:34:48 -08:00
Leonid Ganeline	cc50e023d1	DOCS `langchain decorators` update (#13535 ) added disclaimer --------- Co-authored-by: Erick Friis <erickfriis@gmail.com>	2023-11-19 17:30:05 -08:00
Brace Sproul	02a13030c0	DOCS: updated langchain stack img to be svg (#13540 )	2023-11-19 16:26:53 -08:00
Martin Krasser	79ed66f870	EXPERIMENTAL Generic LLM wrapper to support chat model interface with configurable chat prompt format (#8295 ) ## Update 2023-09-08 This PR now supports further models in addition to Lllama-2 chat models. See [this comment](#issuecomment-1668988543) for further details. The title of this PR has been updated accordingly. ## Original PR description This PR adds a generic `Llama2Chat` model, a wrapper for LLMs able to serve Llama-2 chat models (like `LlamaCPP`, `HuggingFaceTextGenInference`, ...). It implements `BaseChatModel`, converts a list of chat messages into the [required Llama-2 chat prompt format](https://huggingface.co/blog/llama2#how-to-prompt-llama-2) and forwards the formatted prompt as `str` to the wrapped `LLM`. Usage example: ```python # uses a locally hosted Llama2 chat model llm = HuggingFaceTextGenInference( inference_server_url="http://127.0.0.1:8080/", max_new_tokens=512, top_k=50, temperature=0.1, repetition_penalty=1.03, ) # Wrap llm to support Llama2 chat prompt format. # Resulting model is a chat model model = Llama2Chat(llm=llm) messages = [ SystemMessage(content="You are a helpful assistant."), MessagesPlaceholder(variable_name="chat_history"), HumanMessagePromptTemplate.from_template("{text}"), ] prompt = ChatPromptTemplate.from_messages(messages) memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True) chain = LLMChain(llm=model, prompt=prompt, memory=memory) # use chat model in a conversation # ... ``` Also part of this PR are tests and a demo notebook. - Tag maintainer: @hwchase17 - Twitter handle: `@mrt1nz` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-17 16:32:13 -08:00
Bagatur	2e2114d2d0	FEATURE: Runnable with message history (#13418 ) Add RunnableWithMessageHistory class that can wrap certain runnables and manages chat history for them.	2023-11-17 12:00:01 -08:00
Bagatur	0fc3af8932	IMPROVEMENT: update assistants output and doc (#13480 )	2023-11-17 11:58:54 -08:00
Leonid Ganeline	21552628c8	DOCS updated `data_connection` index page (#13426 ) - the `Index` section was missed. Created it. - text simplification --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-11-16 18:16:50 -08:00
Leonid Ganeline	e3a5cd7969	docs `integrations/vectorstores/` cleanup (#13487 ) - updated titles to consistent format - added/updated descriptions and links - format heading	2023-11-16 17:51:49 -08:00
Leonid Ganeline	1d2981114f	DOCS updated `async-faiss` example (#13434 ) The original notebook has the `faiss` title which is duplicated in the`faiss.jpynb`. As a result, we have two `faiss` items in the vectorstore ToC. And the first item breaks the searching order (it is placed between `A...` items). - I updated title to `Asynchronous Faiss`.	2023-11-16 17:41:26 -08:00
Leonid Ganeline	9ff8f69e75	DOCS updated `memory` Titles (#13435 ) - Fixed titles for two notebooks. They were inconsistent with other titles and clogged ToC. - Added `Upstash` description and link - Moved the authentication text up in the `Elasticsearch` nb, right after package installation. It was on the end of the page which was a wrong place.	2023-11-16 13:24:05 -08:00
Stefano Lottini	b029d9f4e6	Astra DB: minor improvements to docstrings and demo notebook (#13449 ) This PR brings a few minor improvements to the docs, namely class/method docstrings and the demo notebook. - A note on how to control concurrency levels to tune performance in bulk inserts, both in the class docstring and the demo notebook; - Slightly increased concurrency defaults after careful experimentation (still on the conservative side even for clients running on less-than-typical network/hardware specs) - renamed the DB token variable to the standardized `ASTRA_DB_APPLICATION_TOKEN` name (used elsewhere, e.g. in the Astra DB docs) - added a note and a reference (add_text docstring, demo notebook) on allowed metadata field names. Thank you!	2023-11-16 12:48:32 -08:00
Leonid Ganeline	283ef1f66d	DOCS fix for `integratons/document_loaders` sidebar (#13471 ) The current `integrations/document_loaders/` sidebar has the `example_data` item, which is a menu with a single item: "Notebook". It is happening because the `integrations/document_loaders/` folder has the `example_data/notebook.md` file that is used to autogenerate the above menu item. - removed an example_data/notebook.md file. Docusaurus doesn't have simple ways to fix this problem (to exclude folders/files from an autogenerated sidebar). Removing this file didn't break any existing examples, so this fix is safe.	2023-11-16 12:02:30 -08:00
Leonid Ganeline	b1fcf5b481	DOCS: `integrations/text_embeddings/` cleanup (#13476 ) Updated several notebooks: - fixed titles which are inconsistent or break the ToC sorting order. - added missed soruce descriptions and links - fixed formatting	2023-11-16 11:56:53 -08:00
Bagatur	10fddac4b5	Bagatur/chain of note template(#13470 )	2023-11-16 10:34:04 -08:00
Leonid Ganeline	d5b1a21ae4	DOCS updated `semadb` example (#13431 ) - the `SemaDB` notebook was placed in additional subfolder which breaks the vectorstore ToC. I moved file up, removed this unnecessary subfolder; updated the `vercel.json` with rerouting for the new URL - Added SemaDB description and link - improved text consistency	2023-11-16 09:57:22 -08:00
Leonid Ganeline	17c2007e0c	DOCS updated `Activeloop DeepMemory` notebook (#13428 ) - Fixed the title of the notebook. It created an ugly ToC element as `Activeloop DeepLake's DeepMemory + LangChain + ragas or how to get +27% on RAG recall.` - Added Activeloop description - improved consistency in text - fixed ToC (it was using HTML tagas that break left-side in-page ToC). Now in-page ToC works	2023-11-16 09:56:28 -08:00
Bagatur	9e6748e198	DOCS: rag nit (#13436 )	2023-11-15 18:06:52 -08:00
Leonid Ganeline	8a52c1456b	updated `clickup` example (#13424 ) - Fixed headers (was more then 1 Titles) - Removed security token value. It was OK to have it, because it is temporary token, but the automatic security swippers raise warnings on that. - Added `ClickUp` service description and link.	2023-11-15 15:11:24 -08:00
Brace Sproul	79fa9a81f4	Fix a link in docs (#13423 )	2023-11-15 15:02:26 -08:00
Bagatur	f0bb839506	DOCS: langchain stack img update (#13421 )	2023-11-15 14:10:02 -08:00
Bagatur	76c317ed78	DOCS: update rag use case (#13319 )	2023-11-15 10:54:15 -08:00
Bagatur	a0b39a4325	DOCS: install nit (#13380 )	2023-11-15 10:27:00 -08:00
Bagatur	9f543634e2	Agent window management how to (#13033 )	2023-11-15 09:38:02 -08:00
Leonid Ganeline	c9b9359647	FEAT docs integration cards site (#13379 ) The `Integrations` site is hidden now. I've added it into the `More` menu. The name is `Integration Cards` otherwise, it is confused with the `Integrations` menu. --------- Co-authored-by: Erick Friis <erickfriis@gmail.com>	2023-11-14 19:49:17 -08:00
Erick Friis	0f25ea9671	api doc newlines (#13378 ) cc @leo-gan Deploying at https://api.python.langchain.com/en/erick-api-doc-newlines-/api_reference.html (will take a bit)	2023-11-14 19:16:31 -08:00
Leonid Ganeline	342ed5c77a	`Yi` model from `01.ai` , example (#13375 ) Added an example with new soa `Yi` model to `HuggingFace-hub` notebook	2023-11-14 17:10:53 -08:00
Bagatur	3596be5210	DOCS: format notebooks (#13371 )	2023-11-14 14:17:44 -08:00
Predrag Gruevski	2ebd167dba	Lint Python notebooks with ruff. (#12677 ) The new ruff version fixed the blocking bugs, and I was able to fairly easily us to a passing state: ruff fixed some issues on its own, I fixed a handful by hand, and I added a list of narrowly-targeted exclusions for files that are currently failing ruff rules that we probably should look into eventually. I went pretty lenient on the docs / cookbooks rules, allowing dead code and such things. Perhaps in the future we may want to tighten the rules further, but this is already a good set of checks that found real issues and will prevent them going forward.	2023-11-14 15:58:22 -05:00
Leonid Ganeline	f5bf3bdf14	added `Cookbooks` link (#13078 ) It is a temporary solution before major documents refactoring. Related to #13070 (not solving it)	2023-11-14 10:52:47 -08:00
Bagatur	1c67db4c18	Move OAI assistants to langchain and add callbacks (#13236 )	2023-11-13 17:42:07 -08:00
Bagatur	8006919e52	DOCS: cleanup docs directory (#13301 )	2023-11-13 17:38:45 -08:00
mertkayhan	9b4974871d	IMPROVEMENT Increase flexibility of ElasticVectorSearch (#6863 ) Hey @rlancemartin, @eyurtsev , I did some minimal changes to the `ElasticVectorSearch` client so that it plays better with existing ES indices. Main changes are as follows: 1. You can pass the dense vector field name into `_default_script_query` 2. You can pass a custom script query implementation and the respective parameters to `similarity_search_with_score` 3. You can pass functions for building page content and metadata for the resulting `Document` <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 4. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @dev2049 - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @dev2049 - Memory: @hwchase17 - Agents / Tools / Toolkits: @vowelparrot - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md -->	2023-11-13 14:36:03 -08:00
Leonie	32c493e3df	Refine Weaviate docs and add RAG example (#13057 ) - Description: Refine Weaviate tutorial and add an example for Retrieval-Augmented Generation (RAG) - Issue: (not applicable), - Dependencies: none - Tag maintainer: @baskaryan <!-- If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Twitter handle: @helloiamleonie Co-authored-by: Leonie <leonie@Leonies-MBP-2.fritz.box>	2023-11-13 10:59:19 -08:00
Junlin Zhou	4da2faba41	docs: align custom_tool document headers (#13252 ) On the [Defining Custom Tools](https://python.langchain.com/docs/modules/agents/tools/custom_tools) page, there's a 'Subclassing the BaseTool class' paragraph under the 'Completely New Tools - String Input and Output' header. Also there's another 'Subclassing the BaseTool' paragraph under no header, which I think may belong to the 'Custom Structured Tools' header. Another thing is, there's a 'Using the tool decorator' and a 'Using the decorator' paragraph, I think should belong to 'Completely New Tools - String Input and Output' and 'Custom Structured Tools' separately. This PR moves those paragraphs to corresponding headers.	2023-11-13 09:03:56 -08:00
Ikko Eltociear Ashimine	700293cae9	Fix typo in timescalevector.ipynb (#13239 ) enviornment -> environment	2023-11-13 09:03:07 -08:00

1 2 3 4 5 ...

2579 Commits