langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-18 09:25:54 +00:00

Author	SHA1	Message	Date
Bob Lin	5a23608c41	Add custom async generator example (#14299 ) <img width="1172" alt="Screenshot 2023-12-05 at 11 19 16 PM" src="https://github.com/langchain-ai/langchain/assets/10000925/6b0fbd70-9f6b-4f91-b494-9e88676b4786">	2023-12-05 16:08:19 -08:00
Bob Lin	63fdc6e818	Update docs (#14294 ) ### Description Fixed 3 doc issues: 1. `ConfigurableField ` needs to be imported in `docs/docs/expression_language/how_to/configure.ipynb` 2. use `error` instead of `RateLimitError()` in `docs/docs/expression_language/how_to/fallbacks.ipynb` 3. I think it might be better to output the fixed json data(when I looked at this example, I didn't understand its purpose at first, but then I suddenly realized): <img width="1219" alt="Screenshot 2023-12-05 at 10 34 13 PM" src="https://github.com/langchain-ai/langchain/assets/10000925/7623ba13-7b56-4964-8c98-b7430fabc6de">	2023-12-05 16:08:03 -08:00
Lance Martin	29e993a5f2	Update OpenCLIP docs (#14319 )	2023-12-05 15:31:10 -08:00
Leonid Ganeline	0f02e94565	docs: `integrations/providers/` update (#14315 ) - added missed provider files (from `integrations/Callbacks` - updated notebooks: added links; updated into consistent formats	2023-12-05 13:05:29 -08:00
Joan Fontanals	dcccf8fa66	adapt Jina Embeddings to new Jina AI Embedding API (#13658 ) - Description: Adapt JinaEmbeddings to run with the new Jina AI Embedding platform - Twitter handle: https://twitter.com/JinaAI_ --------- Co-authored-by: Joan Fontanals Martinez <joan.fontanals.martinez@jina.ai> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-04 20:40:33 -08:00
Philippe PRADOS	e0c03d6c44	Pprados/lite google drive (#13175 ) - Fix bug in the document - Add clarification on the use of langchain-google drive.	2023-12-04 20:31:21 -08:00
guillaumedelande	ea0afd07ca	Update azuresearch.py following recent change from azure-search-documents library (#13472 ) - Description: Reference library azure-search-documents has been adapted in version 11.4.0: 1. Notebook explaining Azure AI Search updated with most recent info 2. HnswVectorSearchAlgorithmConfiguration --> HnswAlgorithmConfiguration 3. PrioritizedFields(prioritized_content_fields) --> SemanticPrioritizedFields(content_fields) 4. SemanticSettings --> SemanticSearch 5. VectorSearch(algorithm_configurations) --> VectorSearch(configurations) --> Changes now reflected on Langchain: default vector search config from langchain is now compatible with officially released library from Azure. - Issue: Issue creating a new index (due to wrong class used for default vector search configuration) if using latest version of azure-search-documents with current langchain version - Dependencies: azure-search-documents>=11.4.0, - Tag maintainer: , --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-04 20:29:20 -08:00
Erick Friis	956d55de2b	docs[patch]: chat model page names (#14264 )	2023-12-04 20:08:41 -08:00
Harrison Chase	2213fc9711	Harrison/bookend ai (#14258 ) Co-authored-by: stvhu-bookend <142813359+stvhu-bookend@users.noreply.github.com>	2023-12-04 19:42:15 -08:00
cxumol	0d47d15a9f	add(feat): Text Embeddings by Cloudflare Workers AI (#14220 ) Add [Text Embeddings by Cloudflare Workers AI](https://developers.cloudflare.com/workers-ai/models/text-embeddings/). It's a new integration. Trying to align it with its langchain-js version counterpart [here](https://api.js.langchain.com/classes/embeddings_cloudflare_workersai.CloudflareWorkersAIEmbeddings.html). - Dependencies: N/A - Done `make format` `make lint` `make spell_check` `make integration_tests` and all my changes was passed	2023-12-04 19:25:05 -08:00
Erick Friis	4351b99d2b	docs[patch]: search experiment (#14254 ) - npm - search config - custom	2023-12-04 16:58:26 -08:00
deedy5	ee9abb6722	Bugfix duckduckgo_search news search (#13670 ) - Description: Bugfix duckduckgo_search news search - Issue: https://github.com/langchain-ai/langchain/issues/13648 - Dependencies: None - Tag maintainer: @baskaryan --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-04 16:48:20 -08:00
Harrison Chase	921c4b5597	Harrison/searchapi (#14252 ) Co-authored-by: SebastjanPrachovskij <86522260+SebastjanPrachovskij@users.noreply.github.com>	2023-12-04 16:34:15 -08:00
Ravidhu	224aa5151d	Fix Sagemaker Endpoint documentation (#13660 ) - Description: fixed the transform_input method in the example., - Issue: example didn't work, - Dependencies: None, - Tag maintainer: @baskaryan, - Twitter handle: @Ravidhu87	2023-12-04 16:28:29 -08:00
Erick Friis	f26d88ca60	docs[patch]: fix columns (#14251 )	2023-12-04 16:03:09 -08:00
Kastan Day	65faba91ad	langchain[patch]: Adding new Github functions for reading pull requests (#9027 ) The Github utilities are fantastic, so I'm adding support for deeper interaction with pull requests. Agents should read "regular" comments and review comments, and the content of PR files (with summarization or `ctags` abbreviations). Progress: - [x] Add functions to read pull requests and the full content of modified files. - [x] Function to use Github's built in code / issues search. Out of scope: - Smarter summarization of file contents of large pull requests (`tree` output, or ctags). - Smarter functions to checkout PRs and edit the files incrementally before bulk committing all changes. - Docs example for creating two agents: - One watches issues: For every new issue, open a PR with your best attempt at fixing it. - The other watches PRs: For every new PR && every new comment on a PR, check the status and try to finish the job. <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! Please make sure you're PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @baskaryan - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @baskaryan - Memory: @hwchase17 - Agents / Tools / Toolkits: @hinthornw - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --> --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-04 15:53:36 -08:00
Leonid Ganeline	1750cc464d	docs[patch]: moved `vectorstore` notebook file (#14181 ) The `/docs/integrations/toolkits/vectorstore` page is not the Integration page. The best place is in `/docs/modules/agents/how_to/` - Moved the file - Rerouted the page URL	2023-12-04 14:44:06 -08:00
Jacob Lee	a26c4a0930	Allow base_store to be used directly with MultiVectorRetriever (#14202 ) Allow users to pass a generic `BaseStore[str, bytes]` to MultiVectorRetriever, removing the need to use the `create_kv_docstore` method. This encoding will now happen internally. @rlancemartin @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-12-04 14:43:32 -08:00
Harrison Chase	411aa9a41e	Harrison/nasa tool (#14245 ) Co-authored-by: Jacob Matias <88005863+matiasjacob25@users.noreply.github.com> Co-authored-by: Karam Daid <karam.daid@mail.utoronto.ca> Co-authored-by: Jumana <jumana.fanous@mail.utoronto.ca> Co-authored-by: KaramDaid <38271127+KaramDaid@users.noreply.github.com> Co-authored-by: Anna Chester <74325334+CodeMakesMeSmile@users.noreply.github.com> Co-authored-by: Jumana <144748640+jfanous@users.noreply.github.com>	2023-12-04 13:43:11 -08:00
Erick Friis	f6d68d78f3	nbdoc -> quarto (#14156 ) Switches to a more maintained solution for building ipynb -> md files (`quarto`) Also bumps us down to python3.8 because it's significantly faster in the vercel build step. Uses default openssl version instead of upgrading as well.	2023-12-04 12:50:56 -08:00
Nithish Raghunandanan	eecfa3f9e5	Add Couchbase document loader (#13979 ) Description: Adds the document loader for [Couchbase](http://couchbase.com/), a distributed NoSQL database. Dependencies: Added the Couchbase SDK as an optional dependency. Twitter handle: nithishr --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-04 12:28:12 -08:00
Muntaqa Mahmood	25f72944a0	Add: Steam API tool (#14008 ) - Description: Our PR is an integration of a Steam API Tool that makes recommendations on steam games based on user's Steam profile and provides information on games based on user provided queries. - Issue: the issue # our PR implements: https://github.com/langchain-ai/langchain/issues/12120 - Dependencies: python-steam-api library, steamspypi library and decouple library - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: N/A Hello langchain Maintainers, We are a team of 4 University of Toronto students contributing to langchain as part of our course [CSCD01 (link to course page)](https://cscd01.com/work/open-source-project). We hope our changes help the community. We have run make format, make lint and make test locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Our PR integrates the python-steam-api, steamspypi and decouple packages. We have added integration tests to test our python API integration into langchain and an example notebook is also provided. Our amazing team that contributed to this PR: @JohnY2002, @shenceyang, @andrewqian2001 and @muntaqamahmood Thank you in advance to all the maintainers for reviewing our PR! --------- Co-authored-by: Shence <ysc1412799032@163.com> Co-authored-by: JohnY2002 <johnyuan0526@gmail.com> Co-authored-by: Andrew Qian <andrewqian2001@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: JohnY <94477598+JohnY2002@users.noreply.github.com>	2023-12-04 12:27:38 -08:00
Bob Lin	cd2028288e	Add openai v2 adapter (#14063 ) ### Description Starting from [openai version 1.0.0](`17ac677995 (module-level-client)`), the camel case form of `openai.ChatCompletion` is no longer supported and has been changed to lowercase `openai.chat.completions`. In addition, the returned object only accepts attribute access instead of index access: ```python import openai # optional; defaults to `os.environ['OPENAI_API_KEY']` openai.api_key = '...' # all client options can be configured just like the `OpenAI` instantiation counterpart openai.base_url = "https://..." openai.default_headers = {"x-foo": "true"} completion = openai.chat.completions.create( model="gpt-4", messages=[ { "role": "user", "content": "How do I output all files in a directory using Python?", }, ], ) print(completion.choices[0].message.content) ``` So I implemented a compatible adapter that supports both attribute access and index access: ```python In [1]: from langchain.adapters import openai as lc_openai ...: messages = [{"role": "user", "content": "hi"}] In [2]: result = lc_openai.chat.completions.create( ...: messages=messages, model="gpt-3.5-turbo", temperature=0 ...: ) In [3]: result.choices[0].message Out[3]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [4]: result["choices"][0]["message"] Out[4]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [5]: result = await lc_openai.chat.completions.acreate( ...: messages=messages, model="gpt-3.5-turbo", temperature=0 ...: ) In [6]: result.choices[0].message Out[6]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [7]: result["choices"][0]["message"] Out[7]: {'role': 'assistant', 'content': 'Hello! How can I assist you today?'} In [8]: for rs in lc_openai.chat.completions.create( ...: messages=messages, model="gpt-3.5-turbo", temperature=0, stream=True ...: ): ...: print(rs.choices[0].delta) ...: print(rs["choices"][0]["delta"]) ...: {'role': 'assistant', 'content': ''} {'role': 'assistant', 'content': ''} {'content': 'Hello'} {'content': 'Hello'} {'content': '!'} {'content': '!'} In [20]: async for rs in await lc_openai.chat.completions.acreate( ...: messages=messages, model="gpt-3.5-turbo", temperature=0, stream=True ...: ): ...: print(rs.choices[0].delta) ...: print(rs["choices"][0]["delta"]) ...: {'role': 'assistant', 'content': ''} {'role': 'assistant', 'content': ''} {'content': 'Hello'} {'content': 'Hello'} {'content': '!'} {'content': '!'} ... ``` ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-04 12:12:30 -08:00
billytrend-cohere	0f02081392	Add input_type override (#14068 ) Add option to override input_type for cohere's v3 embeddings models --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-04 12:10:24 -08:00
Bob Lin	702a6d7044	Closed #14159 (#14165 ) ### Description Fix: #14159 Use `from pydantic.v1 import BaseModel, Field` instead of `from pydantic import BaseModel, Field` ### [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-04 12:06:04 -08:00
Perry Lee	641e401ba8	Shorten wget commands (#14211 ) - Description: The commands can be more efficient if the output name is set to the destined filename instead of renaming in the second command.	2023-12-04 12:03:47 -08:00
Harrison Chase	e32185193e	Harrison/embass (#14242 ) Co-authored-by: Julius Lipp <lipp.julius@gmail.com>	2023-12-04 11:58:52 -08:00
Harutaka Kawamura	ee94ef55ee	docs[patch]: Update MLflow and Databricks docs (#14011 ) Depends on #13699. Updates the existing mlflow and databricks examples. --------- Co-authored-by: Ben Wilson <39283302+BenWilson2@users.noreply.github.com>	2023-12-03 16:07:09 -08:00
Leonid Ganeline	94bf733dae	docs[patch]: `AWS` platform page update (#14160 ) The `AWS` platform page has many missed integrations. - added missed integration references to the `AWS` platform page - added/updated descriptions and links in the referenced notebooks - renamed two notebook files. They have file names != page Title, which generate unordered ToC. - reroute the URLs for renamed files - fixed `amazon_textract` notebook: removed failed cell outputs	2023-12-03 15:42:52 -08:00
Leonid Ganeline	74d4154bcc	docs[patch]: added `Templates Hub` menu item (#14148 ) This link was missing in Docs. Added it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-03 15:36:35 -08:00
Leonid Ganeline	c660b0cf79	docs[patch]: moved semadb.mdx file (#14204 ) SemaDB.mdx file was placed with additional sub-folder: `https://python.langchain.com/docs/integrations/providers/providers/semadb` - Moved file to the `https://python.langchain.com/docs/integrations/providers/semadb` - Added a redirect for the file URL	2023-12-03 14:36:47 -08:00
Changgeng Zhao	9b59bde93d	Update Hologres vector store: use hologres-vector (#13767 ) Hi, I made some code changes on the Hologres vector store to improve the data insertion performance. Also, this version of the code uses `hologres-vector` library. This library is more convenient for us to update, and more efficient in performance. The code has passed the format/lint/spell check. I have run the unit test for Hologres connecting to my own database. Please check this PR again and tell me if anything needs to change. Best, Changgeng, Developer @ Alibaba Cloud Co-authored-by: Changgeng Zhao <zhaochanggeng.zcg@alibaba-inc.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-03 11:50:45 -08:00
Leonid Ganeline	283c2994de	docs: `Hugging Face` platform page (#13831 ) `Hugging Face` is definitely a platform. It includes many integrations for many modules (LLM, Embedding, DocumentLoader, Tool) So, a doc page was added that defines Hugging Face as a platform.	2023-12-03 11:06:43 -08:00
gzyJoy	32d4bb4590	Added Slacktoolkit (#14012 ) - Description: This PR introduces the Slack toolkit to LangChain, which allows users to read and write to Slack using the Slack API. Specifically, we've added the following tools. 1. get_channel: Provides a summary of all the channels in a workspace. 2. get_message: Gets the message history of a channel. 3. send_message: Sends a message to a channel. 4. schedule_message: Sends a message to a channel at a specific time and date. - Issue: This pull request addresses [Add Slack Toolkit #11747](https://github.com/langchain-ai/langchain/issues/11747) - Dependencies: package`slack_sdk` Note: For this toolkit to function you will need to add a Slack app to your workspace. Additional info can be found [here](https://slack.com/help/articles/202035138-Add-apps-to-your-Slack-workspace). --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ArianneLavada <ariannelavada@gmail.com> Co-authored-by: ArianneLavada <84357335+ArianneLavada@users.noreply.github.com> Co-authored-by: ariannelavada@gmail.com <you@example.com>	2023-12-03 10:25:38 -08:00
ggeutzzang	03d6b94c29	Fix: (issue #14066 ) DOC: Summarization output broken (#14078 ) - Description: : As described in the issue below, https://python.langchain.com/docs/use_cases/summarization I've modified the Python code in the above notebook to perform well. I also modified the OpenAI LLM model to the latest version as shown below. `gpt-3.5-turbo-16k --> gpt-3.5-turbo-1106` This is because it seems to be a bit more responsive. - Issue: : #14066	2023-12-03 10:13:57 -08:00
Bob Lin	ac449f186b	Update docs to use new usage in openai>1.0.0 (#14163 ) ### Description Use new [APIs](https://github.com/openai/openai-python/blob/main/api.md#finetuning) ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-03 09:37:35 -08:00
Bob Lin	1ea48a31da	Update fallback cases (#14164 ) ### Description The `RateLimitError` initialization method has changed after openai v1, and the usage of `patch` needs to be changed. ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-03 08:56:07 -08:00
Leonid Ganeline	6ae0194dc7	docs: `integrations/toolkits/office365` notebook update (#14188 ) Added more descriptions and authentication details.	2023-12-03 08:43:00 -08:00
Samuel Kemp	fd781c89cc	langchain[minor]: add azure ai data document loader (#13404 ) This PR adds an "Azure AI data" document loader, which allows Azure AI users to load their registered data assets as a document object in langchain. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 19:25:55 -08:00
Erick Friis	700428593a	fix broken api docs links (#14154 )	2023-12-01 17:17:52 -08:00
Bagatur	340b42d8ee	docs[minor]: lcel why page (#14089 )	2023-12-01 16:13:31 -08:00
Erick Friis	b01d9d27d9	docs[patch]: docs local build (#14152 )	2023-12-01 14:03:36 -08:00
Alex Kira	0caef3cde7	Change RunnableMap to RunnableParallel for consistency (#14142 ) - Description: Change instances of RunnableMap to RunnableParallel, as that should be the one used going forward. This makes it consistent across the codebase.	2023-12-01 13:36:40 -08:00
Martin Jul	e3a7c96a8e	docs[patch]: Fix minor typos (casing) in quickstart (#14138 ) Fix casing of API and LangChain in the description text for the LangServe example server.	2023-12-01 13:29:53 -08:00
Erick Friis	8cf4cb9e48	docs[patch]: Fix templates/index (#14146 )	2023-12-01 13:09:36 -08:00
Alex Kira	6eb40db353	docs[patch]: Add getting started section to LCEL doc (#14045 ) ### Description: Doc addition for LCEL introduction. Adds a more basic starter guide for using LCEL. --------- Co-authored-by: Alex Kira <akira@Alexs-MBP.local.tld> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-01 12:23:43 -08:00
axiangcoding	1b36ddf16c	docs[patch]: add deprecated note for ErnieChatBot (#14061 ) - Description: just a little change of ErnieChatBot class description, sugguesting user to use more suitable class - Issue: none, - Dependencies: none, - Tag maintainer: @baskaryan , - Twitter handle: none	2023-12-01 11:16:31 -08:00
Alex Kira	1757258b2a	docs[patch]: Add mermaid JS theme dependency to docusaurus (#14051 ) - Description: Add mermaid JS dependency and configs to documentation. Allows inline doc diagrams in markdown. - Dependencies: NPM package @docusaurus/theme-mermaid	2023-12-01 11:06:29 -08:00
Harrison Chase	ae646701c4	Harrison/ibm (#14133 ) Co-authored-by: Mateusz Szewczyk <139469471+MateuszOssGit@users.noreply.github.com>	2023-12-01 12:44:11 -05:00
Bob Lin	f15859bd86	docs[patch]: Update discord.ipynb (#14099 ) ### Description Now if `example` in Message is False, it will not be displayed. Update the output in this document. ```python In [22]: m = HumanMessage(content="Text") In [23]: m Out[23]: HumanMessage(content='Text') In [24]: m = HumanMessage(content="Text", example=True) In [25]: m Out[25]: HumanMessage(content='Text', example=True) ``` ### Twitter handle [lin_bob57617](https://twitter.com/lin_bob57617)	2023-12-01 08:54:31 -08:00

1 2 3 4 5 ...

2569 Commits