langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
Gengliang Wang	a87a2524c7	Remove autoreload in examples (#4994 ) # Remove autoreload in examples Remove the `autoreload` in examples since it is not necessary for most users: ``` %load_ext autoreload, %autoreload 2 ```	2023-05-19 17:35:58 +00:00
Eugene Yurtsev	06e524416c	power bi api wrapper integration tests & bug fix (#4983 ) # Powerbi API wrapper bug fix + integration tests - Bug fix by removing `TYPE_CHECKING` in in utilities/powerbi.py - Added integration test for power bi api in utilities/test_powerbi_api.py - Added integration test for power bi agent in agent/test_powerbi_agent.py - Edited .env.examples to help set up power bi related environment variables - Updated demo notebook with working code in docs../examples/powerbi.ipynb - AzureOpenAI -> ChatOpenAI Notes: Chat models (gpt3.5, gpt4) are much more capable than davinci at writing DAX queries, so that is important to getting the agent to work properly. Interestingly, gpt3.5-turbo needed the examples=DEFAULT_FEWSHOT_EXAMPLES to write consistent DAX queries, so gpt4 seems necessary as the smart llm. Fixes #4325 ## Before submitting Azure-core and Azure-identity are necessary dependencies check integration tests with the following: `pytest tests/integration_tests/utilities/test_powerbi_api.py` `pytest tests/integration_tests/agent/test_powerbi_agent.py` You will need a power bi account with a dataset id + table name in order to test. See .env.examples for details. ## Who can review? @hwchase17 @vowelparrot --------- Co-authored-by: aditya-pethe <adityapethe1@gmail.com>	2023-05-19 11:25:52 -04:00
Edrick Da Corte Henriquez	e80585bab0	Update tutorials.md (#4960 ) # Added a YouTube Tutorial Added a LangChain tutorial playlist aimed at onboarding newcomers to LangChain and its use cases. I've shared the video in the #tutorials channel and it seemed to be well received. I think this could be useful to the greater community. ## Who can review? @dev2049	2023-05-19 10:40:14 -04:00
Rahul Rao	13c376345e	Fixed assumptions misspelling (#4961 ) Fixed assumptions misspelling in the link mentioned below:- https://python.langchain.com/en/latest/modules/chains/examples/llm_summarization_checker.html ![image](https://github.com/hwchase17/langchain/assets/16189966/94cf2be0-b3d0-495b-98ad-e1f44331727e) Fix for Issue:- #4959 @hwchase17	2023-05-19 10:40:04 -04:00
Gengliang Wang	bf5a3c6dec	Support Databricks in SQLDatabase (#4702 ) This PR adds support for Databricks runtime and Databricks SQL by using [Databricks SQL Connector for Python](https://docs.databricks.com/dev-tools/python-sql-connector.html). As a cloud data platform, accessing Databricks requires a URL as follows `databricks://token:{api_token}@{hostname}?http_path={http_path}&catalog={catalog}&schema={schema}`. The URL is complicated and it may take users a while to figure it out. Since the fields `api_token`/`hostname`/`http_path` fields are known in the Databricks notebook, I am proposing a new method `from_databricks` to simplify the connection to Databricks. ## In Databricks Notebook After changes, Databricks users only need to specify the `catalog` and `schema` field when using langchain. <img width="881" alt="image" src="https://github.com/hwchase17/langchain/assets/1097932/984b4c57-4c2d-489d-b060-5f4918ef2f37"> ## In Jupyter Notebook The method can be used on the local setup as well: <img width="678" alt="image" src="https://github.com/hwchase17/langchain/assets/1097932/142e8805-a6ef-4919-b28e-9796ca31ef19">	2023-05-19 00:42:06 -07:00
Harrison Chase	88a3a56c1a	Add Spark SQL support (#4602 ) (#4956 ) # Add Spark SQL support * Add Spark SQL support. It can connect to Spark via building a local/remote SparkSession. * Include a notebook example I tried some complicated queries (window function, table joins), and the tool works well. Compared to the [Spark Dataframe agent](https://python.langchain.com/en/latest/modules/agents/toolkits/examples/spark.html), this tool is able to generate queries across multiple tables. --------- # Your PR Title (What it does) <!-- Thank you for contributing to LangChain! Your PR will appear in our next release under the title you set. Please make sure it highlights your valuable contribution. Replace this with a description of the change, the issue it fixes (if applicable), and relevant context. List any dependencies required for this change. After you're done, someone will review your PR. They may suggest improvements. If no one reviews your PR within a few days, feel free to @-mention the same people again, as notifications can get lost. --> <!-- Remove if not applicable --> Fixes # (issue) ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: <!-- For a quicker response, figure out the right person to tag with @ @hwchase17 - project lead Tracing / Callbacks - @agola11 Async - @agola11 DataLoaders - @eyurtsev Models - @hwchase17 - @agola11 Agents / Tools / Toolkits - @vowelparrot VectorStores / Retrievers / Memory - @dev2049 --> --------- Co-authored-by: Gengliang Wang <gengliang@apache.org> Co-authored-by: Mike W <62768671+skcoirz@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: UmerHA <40663591+UmerHA@users.noreply.github.com> Co-authored-by: 张城铭 <z@hyperf.io> Co-authored-by: assert <zhangchengming@kkguan.com> Co-authored-by: blob42 <spike@w530> Co-authored-by: Yuekai Zhang <zhangyuekai@foxmail.com> Co-authored-by: Richard He <he.yucheng@outlook.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com> Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com> Co-authored-by: Alexey Nominas <60900649+Chae4ek@users.noreply.github.com> Co-authored-by: elBarkey <elbarkey@gmail.com> Co-authored-by: Davis Chase <130488702+dev2049@users.noreply.github.com> Co-authored-by: Jeffrey D <1289344+verygoodsoftwarenotvirus@users.noreply.github.com> Co-authored-by: so2liu <yangliu35@outlook.com> Co-authored-by: Viswanadh Rayavarapu <44315599+vishwa-rn@users.noreply.github.com> Co-authored-by: Chakib Ben Ziane <contact@blob42.xyz> Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com> Co-authored-by: Daniel Chalef <daniel.chalef@private.org> Co-authored-by: Jari Bakken <jari.bakken@gmail.com> Co-authored-by: escafati <scafatieugenio@gmail.com>	2023-05-18 20:53:08 -07:00
Harrison Chase	5feb60f426	Harrison/spell executor (#4914 ) Co-authored-by: Jan Minar <rdancer@rdancer.org>	2023-05-18 20:43:33 -07:00
Mike Wang	db6f7ed0ba	[nit] Simplify Spark Creation Validation Check A Little Bit (#4761 ) - simplify the validation check a little bit. - re-tested in jupyter notebook. Reviewer: @hwchase17	2023-05-18 18:57:54 -07:00
Daniel Chalef	c8c2276ccb	Zep Retriever - Vector Search Over Chat History (#4533 ) # Zep Retriever - Vector Search Over Chat History with the Zep Long-term Memory Service More on Zep: https://github.com/getzep/zep Note: This PR is related to and relies on https://github.com/hwchase17/langchain/pull/4834. I did not want to modify the `pyproject.toml` file to add the `zep-python` dependency a second time. Co-authored-by: Daniel Chalef <daniel.chalef@private.org>	2023-05-18 16:27:18 -07:00
Leonid Ganeline	a9bb3147d7	docs: vectorstores, different updates and fixes (#4939 ) # docs: vectorstores, different updates and fixes Multiple updates: - added/improved descriptions - fixed header levels - added headers - fixed headers	2023-05-18 15:35:47 -07:00
Leonid Ganeline	8f8593aac5	docs: added `ecosystem/dependents` page (#4941 ) # docs: added `ecosystem/dependents` page Added `ecosystem/dependents` page. Can we propose a better page name?	2023-05-18 13:11:08 -07:00
Viswanadh Rayavarapu	c9f963e295	Update custom_multi_action_agent.ipynb (#4931 ) Updated the docs from "An agent consists of three parts:" to "An agent consists of two parts:" since there are only two parts in the documentation	2023-05-18 11:53:12 -07:00
so2liu	3002c1d508	fix: error in gptcache example nb (#4930 )	2023-05-18 11:49:45 -07:00
Jeffrey D	7e8e21c914	Correct typo in APIChain example notebook (Farenheit -> Fahrenheit) (#4938 ) Correct typo in APIChain example notebook (Farenheit -> Fahrenheit)	2023-05-18 11:48:02 -07:00
Leonid Ganeline	c75c0775e1	docs supabase update (#4935 ) # docs: updated `Supabase` notebook - the title of the notebook was inconsistent (included redundant "Vectorstore"). Removed this "Vectorstore" - added `Postgress` to the title. It is important. The `Postgres` name is much more popular than `Supabase`. - added description for the `Postrgress` - added more info to the `Supabase` description	2023-05-18 10:42:08 -07:00
Alexey Nominas	c9e2a01875	Update GPT4ALL integration (#4567 ) # Update GPT4ALL integration GPT4ALL have completely changed their bindings. They use a bit odd implementation that doesn't fit well into base.py and it will probably be changed again, so it's a temporary solution. Fixes #3839, #4628	2023-05-18 09:38:54 -07:00
Leonid Ganeline	e2d7677526	docs: compound ecosystem and integrations (#4870 ) # Docs: compound ecosystem and integrations Problem statement: We have a big overlap between the References/Integrations and Ecosystem/LongChain Ecosystem pages. It confuses users. It creates a situation when new integration is added only on one of these pages, which creates even more confusion. - removed References/Integrations page (but move all its information into the individual integration pages - in the next PR). - renamed Ecosystem/LongChain Ecosystem into Integrations/Integrations. I like the Ecosystem term. It is more generic and semantically richer than the Integration term. But it mentally overloads users. The `integration` term is more concrete. UPDATE: after discussion, the Ecosystem is the term. Ecosystem/Integrations is the page (in place of Ecosystem/LongChain Ecosystem). As a result, a user gets a single place to start with the individual integration.	2023-05-18 09:29:57 -07:00
Yuekai Zhang	1ed4228822	Fix bilibili (#4860 ) # Fix bilibili api import error bilibili-api package is depracated and there is no sync module. <!-- Thank you for contributing to LangChain! Your PR will appear in our next release under the title you set. Please make sure it highlights your valuable contribution. Replace this with a description of the change, the issue it fixes (if applicable), and relevant context. List any dependencies required for this change. After you're done, someone will review your PR. They may suggest improvements. If no one reviews your PR within a few days, feel free to @-mention the same people again, as notifications can get lost. --> <!-- Remove if not applicable --> Fixes #2673 #2724 ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @vowelparrot @liaokongVFX <!-- For a quicker response, figure out the right person to tag with @ @hwchase17 - project lead Tracing / Callbacks - @agola11 Async - @agola11 DataLoaders - @eyurtsev Models - @hwchase17 - @agola11 Agents / Tools / Toolkits - @vowelparrot VectorStores / Retrievers / Memory - @dev2049 -->	2023-05-18 09:56:51 -04:00
Eugene Yurtsev	e46202829f	feat #4479 : TextLoader auto detect encoding and improved exceptions (#4927 ) # TextLoader auto detect encoding and enhanced exception handling - Add an option to enable encoding detection on `TextLoader`. - The detection is done using `chardet` - The loading is done by trying all detected encodings by order of confidence or raise an exception otherwise. ### New Dependencies: - `chardet` Fixes #4479 ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: - @eyurtsev --------- Co-authored-by: blob42 <spike@w530>	2023-05-18 09:55:14 -04:00
Eugene Yurtsev	c06a47a691	Load specific file types from Google Drive (issue #4878 ) (#4926 ) # Load specific file types from Google Drive (issue #4878) Add the possibility to define what file types you want to load from Google Drive. ``` loader = GoogleDriveLoader( folder_id="1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5", file_types=["document", "pdf"] recursive=False ) ``` Fixes ##4878 ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: DataLoaders - @eyurtsev Twitter: [@UmerHAdil](https://twitter.com/@UmerHAdil) \| Discord: RicChilligerDude#7589 --------- Co-authored-by: UmerHA <40663591+UmerHA@users.noreply.github.com>	2023-05-18 09:27:53 -04:00
Harrison Chase	dfbf45f028	bump version to 173 (#4910 )	2023-05-17 23:36:45 -07:00
Harrison Chase	b8d48939a2	Harrison/unified objectives (#4905 ) Co-authored-by: Matthias Samwald <samwald@gmx.at>	2023-05-17 23:03:57 -07:00
Harrison Chase	9165267f8a	Harrison/improved retry tool (#4842 )	2023-05-17 21:41:01 -07:00
Leonid Ganeline	c998569c8f	docs: text splitters improvements (#4490 ) #docs: text splitters improvements Changes are only in the Jupyter notebooks. - added links to the source packages and a short description of these packages - removed " Text Splitters" suffixes from the TOC elements (they made the list of the text splitters messy) - moved text splitters, based on the length function into a separate list. They can be mixed with any classes from the "Text Splitters", so it is a different classification. ## Who can review? @hwchase17 - project lead @eyurtsev @vowelparrot NOTE: please, check out the results of the `Python code` text splitter example (text_splitters/examples/python.ipynb). It looks suboptimal.	2023-05-17 21:33:34 -07:00
Steve Kim	613bf9b514	Update getting_started.md (#4482 ) # Added another helpful way for developers who want to set OpenAI API Key dynamically Previous methods like exporting environment variables are good for project-wide settings. But many use cases need to assign API keys dynamically, recently. ```python from langchain.llms import OpenAI llm = OpenAI(openai_api_key="OPENAI_API_KEY") ``` ## Before submitting ```bash export OPENAI_API_KEY="..." ``` Or, ```python import os os.environ["OPENAI_API_KEY"] = "..." ``` <hr> Thank you. Cheers, Bongsang	2023-05-17 21:32:25 -07:00
Ismael G Serrano	41e2394c9c	Fix AzureOpenAI embeddings documentation example. model -> deployment (#4389 ) # Documentation for Azure OpenAI embeddings model - OPENAI_API_VERSION environment variable is needed for the endpoint - The constructor does not work with model, it works with deployment. I fixed it in the notebook. (This is my first contribution) ## Who can review? @hwchase17 @agola Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-05-17 21:05:53 -07:00
Davis Chase	a4ac006658	Update gallery (#4873 )	2023-05-17 20:59:41 -07:00
Davis Chase	8966f61ca5	Zep memory (#4898 ) Co-authored-by: Daniel Chalef <daniel.chalef@private.org> Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com>	2023-05-17 20:01:01 -07:00
Davis Chase	df0c33a005	Faiss no avx2 (#4895 ) Co-authored-by: Ali Mirlou <alimirlou@gmail.com>	2023-05-17 19:18:57 -07:00
Alexander Miasoiedov (Myasoedov)	4c3ab55e94	feat(Add FastAPI + Vercel deployment option): (#4520 ) # Update deployments doc with langcorn API server API server example ```python from fastapi import FastAPI from langcorn import create_service app: FastAPI = create_service( "examples.ex1:chain", "examples.ex2:chain", "examples.ex3:chain", "examples.ex4:sequential_chain", "examples.ex5:conversation", "examples.ex6:conversation_with_summary", ) ``` More examples: https://github.com/msoedov/langcorn/tree/main/examples Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-17 15:50:25 -07:00
Taqi Jaffri	ef8b5f64bc	Tiny code review and docs fix for Docugami DataLoader (#4877 ) # Docs and code review fixes for Docugami DataLoader 1. I noticed a couple of hyperlinks that are not loading in the langchain docs (I guess need explicit anchor tags). Added those. 2. In code review @eyurtsev had a [suggestion](https://github.com/hwchase17/langchain/pull/4727#discussion_r1194069347) to allow string paths. Turns out just updating the type works (I tested locally with string paths). # Pre-submission checks I ran `make lint` and `make tests` successfully. --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	2023-05-17 15:31:43 -07:00
C.J. Jameson	d6e0b9a43d	fix homepage typo (#4883 ) # Fix Homepage Typo ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested... not sure	2023-05-17 15:30:23 -07:00
Leonid Ganeline	b96ab4b763	docs `retriever` improvements (#4430 ) # Docs: improvements in the `retrievers/examples/` notebooks Its primary purpose is to make the Jupyter notebook examples consistent and more suitable for first-time viewers. - add links to the integration source (if applicable) with a short description of this source; - removed `_retriever` suffix from the file names (where it existed) for consistency; - removed ` retriever` from the notebook title (where it existed) for consistency; - added code to install necessary Python package(s); - added code to set up the necessary API Key. - very small fixes in notebooks from other folders (for consistency): - docs/modules/indexes/vectorstores/examples/elasticsearch.ipynb - docs/modules/indexes/vectorstores/examples/pinecone.ipynb - docs/modules/models/llms/integrations/cohere.ipynb - fixed misspelling in langchain/retrievers/time_weighted_retriever.py comment (sorry, about this change in a .py file ) ## Who can review @dev2049	2023-05-17 15:29:22 -07:00
Justin Levi Winter	0147f845f1	Update getting_started.ipynb (#4850 ) minor grammer issue	2023-05-17 13:19:14 -07:00
UmerHA	e257380deb	Typos (#4851 ) # Fixed typos (issues #4818 & #4668 & more typos) - At some places, it said `model = ChatOpenAI(model='gpt-3.5-turbo')` but should be `model = ChatOpenAI(model_name='gpt-3.5-turbo')` - Fixes some other typos Fixes #4818, #4668 ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: Models - @hwchase17 - @agola11 Agents / Tools / Toolkits - @vowelparrot	2023-05-17 11:52:22 -04:00
Harrison Chase	720ac49f42	2markdown loader (#4796 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-05-16 23:42:53 -07:00
Ankush Gola	aa73a888fa	Some notebook and client fixes (add retries, clean up docs, etc) (#4820 ) # Your PR Title (What it does) <!-- Thank you for contributing to LangChain! Your PR will appear in our next release under the title you set. Please make sure it highlights your valuable contribution. Replace this with a description of the change, the issue it fixes (if applicable), and relevant context. List any dependencies required for this change. After you're done, someone will review your PR. They may suggest improvements. If no one reviews your PR within a few days, feel free to @-mention the same people again, as notifications can get lost. --> <!-- Remove if not applicable --> Fixes # (issue) ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: <!-- For a quicker response, figure out the right person to tag with @ @hwchase17 - project lead Tracing / Callbacks - @agola11 Async - @agola11 DataLoaders - @eyurtsev Models - @hwchase17 - @agola11 Agents / Tools / Toolkits - @vowelparrot VectorStores / Retrievers / Memory - @dev2049 -->	2023-05-16 20:23:00 -07:00
David Peterson	d5d4c0a172	Update summarize.ipynb (#4529 ) # Update order in which tasks are stated (logically correct) Fixes the order in which steps are placed under titles. @vowelparrot	2023-05-16 18:14:00 -07:00
Brendan Mannix	4e56d3119c	update qdrant docs to reflect the proper way to initialize Qdrant() constructor (#4596 ) # update qdrant docs to reflect the proper way to initialize Qdrant() constructor The [Qdrant docs](https://python.langchain.com/en/latest/modules/indexes/vectorstores/examples/qdrant.html) still contain an old reference for passing an `embedding_function` into the constructor. This is no longer supported. This PR updates the docs to reflect the proper way to initialize `Qdrant()` Old: ![Screenshot 2023-05-12 at 3 06 33 PM](https://github.com/hwchase17/langchain/assets/1552962/dd4063d2-2a07-4340-91bb-e305f7215ddd) New: ![Screenshot 2023-05-12 at 3 21 09 PM](https://github.com/hwchase17/langchain/assets/1552962/aebc3f63-1a8b-4ca3-93c0-a2ce30dcd282)	2023-05-16 17:30:38 -07:00
Sean Morgan	5372a06a8c	DOC: Fix SageMaker example (#4598 ) # Fix SageMaker example typing Since https://github.com/hwchase17/langchain/pull/3249 a new type `LLMContentHandler` is enforced for SageMaker Endpoints Fixes #4168	2023-05-16 17:28:16 -07:00
Anam Hira	3af448d72e	Update huggingface_tools.ipynb (#4700 )	2023-05-16 16:28:27 -07:00
Chandan Routray	e8d46bdd9b	Replaced `SQLDatabaseChain` deprecated direct initialisation with `from_llm` method (#4778 ) # Removed usage of deprecated methods Replaced `SQLDatabaseChain` deprecated direct initialisation with `from_llm` method ## Who can review? @hwchase17 @agola11 --------- Co-authored-by: imeckr <chandanroutray2012@gmail.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 15:59:06 -07:00
Mark Pors	8fd4d5d117	Added dependencies to make example executable (#4790 ) - Installation of non-colab packages - Get API keys # Added dependencies to make notebook executable on hosted notebooks ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @vowelparrot	2023-05-16 15:46:09 -07:00
Mark Pors	5bc7082e82	Cleanup and added dependencies to make example executable (#4795 ) - Installation of non-colab packages - Get API keys - Get rid of warnings # Cleanup and added dependencies to make notebook executable on hosted notebooks @hwchase17 @vowelparrot	2023-05-16 15:29:01 -07:00
keenangraham	bcce9a3a92	Fix age inconsistency in plan and execute Jupyter notebook example (#4814 ) The current example in https://python.langchain.com/en/latest/modules/agents/plan_and_execute.html has inconsistent reasoning step (observing 28 years and thinking it's 26 years): ``` Observation: 28 years Thought:Based on my search, Gigi Hadid's current age is 26 years old. Action: { "action": "Final Answer", "action_input": "Gigi Hadid's current age is 26 years old." } ``` Guessing this is model noise. Rerunning seems to give correct answer of 28 years.	2023-05-16 15:27:27 -07:00
Prateek K. Keshari	61f9c52fc7	Update twitter-the-algorithm-analysis-deeplake.ipynb (#4812 ) Changed model to model_name	2023-05-16 15:27:15 -07:00
Raduan Al-Shedivat	00c6ec8a2d	fix(document_loaders/telegram): fix pandas calls + add tests (#4806 ) # Fix Telegram API loader + add tests. I was testing this integration and it was broken with next error: ```python message_threads = loader._get_message_threads(df) KeyError: False ``` Also, this particular loader didn't have any tests / related group in poetry, so I added those as well. @hwchase17 / @eyurtsev please take a look on this fix PR. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 14:35:25 -07:00
了空	f7e3d97b19	Remove unnecessary spaces from document object’s page_content of BiliBiliLoader (#4619 ) - Remove unnecessary spaces from document object’s page_content of BiliBiliLoader - Fix BiliBiliLoader document and test file	2023-05-16 13:13:57 -04:00
Eugene Yurtsev	f47ec5b4b6	Docugami docs: First cell should be a title cell (#4735 ) # Make first cell a title in docugami docs This makes the first cell a title cell in docugami notebook	2023-05-16 13:12:14 -04:00
Harrison Chase	a7af32c274	Cassandra support for chat history (#4378 ) (#4764 ) # Cassandra support for chat history ### Description - Store chat messages in cassandra ### Dependency - cassandra-driver - Python Module ## Before submitting - Added Integration Test ## Who can review? @hwchase17 @agola11 # Your PR Title (What it does) <!-- Thank you for contributing to LangChain! Your PR will appear in our next release under the title you set. Please make sure it highlights your valuable contribution. Replace this with a description of the change, the issue it fixes (if applicable), and relevant context. List any dependencies required for this change. After you're done, someone will review your PR. They may suggest improvements. If no one reviews your PR within a few days, feel free to @-mention the same people again, as notifications can get lost. --> <!-- Remove if not applicable --> Fixes # (issue) ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: <!-- For a quicker response, figure out the right person to tag with @ @hwchase17 - project lead Tracing / Callbacks - @agola11 Async - @agola11 DataLoaders - @eyurtsev Models - @hwchase17 - @agola11 Agents / Tools / Toolkits - @vowelparrot VectorStores / Retrievers / Memory - @dev2049 --> Co-authored-by: Jinto Jose <129657162+jj701@users.noreply.github.com>	2023-05-15 23:43:09 -07:00
Leonid Ganeline	a6f3ec94bc	docs: added `additional_resources` folder (#4748 ) # docs: added `additional_resources` folder The additional resource files were inside the doc top-level folder, which polluted the top-level folder. - added the `additional_resources` folder and moved correspondent files to this folder; - fixed a broken link to the "Model comparison" page (model_laboratory notebook) - fixed a broken link to one of the YouTube videos (sorry, it is not directly related to this PR) ## Who can review? @dev2049	2023-05-15 17:12:47 -07:00
Zander Chase	580861e7f2	Revert "Make serpapi base url configurable via env (#4402 )" (#4750 ) This reverts commit `5111bec540`. This PR introduced a bug in the async API (the `url` param isn't bound); it also didn't update the synchronous API correctly, which makes it error-prone (the behavior of the async and sync endpoints would be different)	2023-05-15 16:17:16 -07:00
shiyu22	21b9397342	Update the milvus example (#4706 ) # Fix issue when running example - add the query content - update the `user` parameter with Zilliz Signed-off-by: shiyu22 <shiyu.chen@zilliz.com>	2023-05-15 16:16:57 -07:00
Davis Chase	36c9fd1af7	Dev2049/docs edit0 (#4699 )	2023-05-15 15:20:37 -07:00
Jinto Jose	1e467d9fc4	Jupyter Notebook Example for using Mongodb to store Chat Message History (#4436 ) # Jupyter Notebook Example for using Mongodb Chat Message History @dev2049	2023-05-15 14:33:42 -07:00
Leonid Ganeline	6060505a9d	Add new links to `Tutorials` and `YouTube` pages (#4746 ) - added an official LangChain YouTube channel :) - added new tutorials and videos (only videos with enough subscriber or view numbers) - added a "New video" icon ## Who can review? @dev2049	2023-05-15 14:32:48 -07:00
vinoyang	5111bec540	Make serpapi base url configurable via env (#4402 ) Fixes #4328 Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 14:25:25 -07:00
Roma	cb802edf75	[Feature] Add GraphQL Query Tool (#4409 ) # Add GraphQL Query Support This PR introduces a GraphQL API Wrapper tool that allows LLM agents to query GraphQL databases. The tool utilizes the httpx and gql Python packages to interact with GraphQL APIs and provides a simple interface for running queries with LLM agents. @vowelparrot --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 14:06:12 -07:00
Leonid Ganeline	70fd7cda14	docs: `Concepts` (#4734 ) # glossary.md renamed as concepts.md and moved under the Getting Started small PR. `Concepts` looks right to the point. It is moved under Getting Started (typical place). Previously it was lost in the Additional Resources section. ## Who can review? @hwchase17	2023-05-15 11:09:25 -07:00
Harrison Chase	dd95f0892d	Harrison/add top k (#4707 ) Co-authored-by: blc16 <benlc@umich.edu>	2023-05-15 09:09:22 -07:00
Eugene Yurtsev	3c490b5ba3	Docugami DataLoader (#4727 ) ### Adds a document loader for Docugami Specifically: 1. Adds a data loader that talks to the [Docugami](http://docugami.com) API to download processed documents as semantic XML 2. Parses the semantic XML into chunks, with additional metadata capturing chunk semantics 3. Adds a detailed notebook showing how you can use additional metadata returned by Docugami for techniques like the [self-querying retriever](https://python.langchain.com/en/latest/modules/indexes/retrievers/examples/self_query_retriever.html) 4. Adds an integration test, and related documentation Here is an example of a result that is not possible without the capabilities added by Docugami (from the notebook): <img width="1585" alt="image" src="https://github.com/hwchase17/langchain/assets/749277/bb6c1ce3-13dc-4349-a53b-de16681fdd5b"> --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com> Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>	2023-05-15 10:53:00 -04:00
Lester Yang	cd3f9865f3	Feature: pdfplumber PDF loader with BaseBlobParser (#4552 ) # Feature: pdfplumber PDF loader with BaseBlobParser * Adds pdfplumber as a PDF loader * Adds pdfplumber as a blob parser.	2023-05-15 09:47:02 -04:00
Harrison Chase	b6e3ac17c4	Harrison/sitemap local (#4704 ) Co-authored-by: Lukas Bauer <lukas.bauer@mayflower.de>	2023-05-14 22:04:38 -07:00
Harrison Chase	12b4ee1fc7	Harrison/telegram chat loader (#4698 ) Co-authored-by: Akinwande Komolafe <47945512+Sensei-akin@users.noreply.github.com> Co-authored-by: Akinwande Komolafe <akhinoz@gmail.com>	2023-05-14 22:04:27 -07:00
Leonid Ganeline	2b181e5a6c	docs: tutorials are moved on the top-level of docs (#4464 ) # Added Tutorials section on the top-level of documentation Problem Statement: the Tutorials section in the documentation is top-priority. Not every project has resources to make tutorials. We have such a privilege. Community experts created several tutorials on YouTube. But the tutorial links are now hidden on the YouTube page and not easily discovered by first-time visitors. PR: I've created the `Tutorials` page (from the `Additional Resources/YouTube` page) and moved it to the top level of documentation in the `Getting Started` section. ## Who can review? @dev2049 NOTE: PR checks are randomly failing `3aefaafcdb` `258819eadf` `514d81b5b3`	2023-05-14 21:22:25 -07:00
Ashish Talati	372a5113ff	Update gallery.rst with chatpdf opensource (#4342 )	2023-05-14 19:43:16 -07:00
Samuli Rauatmaa	66828ad231	add the existing OpenWeatherMap tool to the public api (#4292 ) [OpenWeatherMapAPIWrapper](`f70e18a5b3/docs/modules/agents/tools/examples/openweathermap.ipynb`) works wonderfully, but the _tool_ itself can't be used in master branch. - added OpenWeatherMap tool to the public api, to be loadable with `load_tools` by using "openweathermap-api" tool name (that name is used in the existing [docs](`aff33d52c5/docs/modules/agents/tools/getting_started.md`), at the bottom of the page) - updated OpenWeatherMap tool's description to make the input format match what the API expects (e.g. `London,GB` instead of `'London,GB'`) - added [ecosystem documentation page for OpenWeatherMap](`f9c41594fe/docs/ecosystem/openweathermap.md`) - added tool usage example to [OpenWeatherMap's notebook](`f9c41594fe/docs/modules/agents/tools/examples/openweathermap.ipynb`) Let me know if there's something I missed or something needs to be updated! Or feel free to make edits yourself if that makes it easier for you 🙂	2023-05-14 18:50:45 -07:00
Harrison Chase	6f47ab17a4	Harrison/param notion db (#4689 ) Co-authored-by: Edward Park <ed.sh.park@gmail.com>	2023-05-14 18:26:25 -07:00
Harrison Chase	5d63fc65e1	add warning for combined memory (#4688 )	2023-05-14 18:26:16 -07:00
Harrison Chase	a48810fb21	dont have openai_api_version by default (#4687 ) an alternative to https://github.com/hwchase17/langchain/pull/4234/files	2023-05-14 18:26:08 -07:00
Harrison Chase	c48f1301ee	oops remove api key, dont worried i cycled it	2023-05-14 17:40:31 -07:00
Harrison Chase	57b2f3ffe6	add rebuff (#4637 )	2023-05-14 17:38:43 -07:00
Zander Chase	d85b04be7f	Add RELLM and JSONFormer experimental LLM decoding (#4185 ) [RELLM](https://github.com/r2d4/rellm) is a library that wraps local HuggingFace pipeline models for structured decoding. RELLM works by generating tokens one at a time. At each step, it masks tokens that don't conform to the provided partial regular expression. [JSONFormer](https://github.com/1rgs/jsonformer) is a bit different, where it sequentially adds the keys then decodes each value directly	2023-05-14 22:40:03 +00:00
Harrison Chase	243886be93	Harrison/virtual time (#4658 ) Co-authored-by: ifsheldon <39153080+ifsheldon@users.noreply.github.com> Co-authored-by: maple.liang <maple.liang@gempoll.com>	2023-05-14 10:29:17 -07:00
Harrison Chase	ef49c659f6	add embedding router (#4644 )	2023-05-13 21:47:01 -07:00
Harrison Chase	c09bb00959	Harrison/summary memory history (#4649 ) Co-authored-by: engkheng <60956360+outday29@users.noreply.github.com>	2023-05-13 21:46:11 -07:00
Harrison Chase	44ae673388	Harrison/multithreading directory loader (#4650 ) Co-authored-by: PawelFaron <42373772+PawelFaron@users.noreply.github.com> Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-13 21:46:02 -07:00
Harrison Chase	873b0c7eb6	Harrison/structured chat mem (#4652 ) Co-authored-by: d 3 n 7 <29033313+d3n7@users.noreply.github.com>	2023-05-13 21:45:42 -07:00
Harrison Chase	279605b4d3	Harrison/metaphor search (#4657 ) Co-authored-by: Jeffrey Wang <jeffreyzhiyuanwang@gmail.com>	2023-05-13 21:45:05 -07:00
Harrison Chase	9aa9fe7021	Harrison/spark connect example (#4659 ) Co-authored-by: Mike Wang <62768671+skcoirz@users.noreply.github.com>	2023-05-13 21:44:54 -07:00
Leonid Ganeline	3ce78ef6c4	docs: document_loaders classification (#4069 ) Problem statement: the [document_loaders](https://python.langchain.com/en/latest/modules/indexes/document_loaders.html#) section is too long and hard to comprehend. Proposal: group document_loaders by 3 classes: (see `Files changed` tab) UPDATE: I've completely reworked the document_loader classification. Now this PR changes only one file! FYI @eyurtsev @hwchase17	2023-05-13 19:17:32 -07:00
Harrison Chase	1e322ffc1c	change heading	2023-05-13 09:52:23 -07:00
Davis Chase	9ab7101182	WIP: FLARE-inspired chain (#4612 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-05-13 09:28:28 -07:00
Harrison Chase	daa3e6dedb	Harrison/prompt constructor methods (#4616 )	2023-05-13 09:23:51 -07:00
Harrison Chase	6265cbfb11	Harrison/standard llm interface (#4615 )	2023-05-13 09:05:31 -07:00
Harrison Chase	7d425cbf38	improve sql prompt (#4611 ) Co-authored-by: Taqi Jaffri <tjaffri@docugami.com> Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>	2023-05-12 21:55:03 -07:00
Tim Asp	ed0d557ede	docs: fix pdf docs hierarchy and formatting (#4593 ) # Fix pdf loader docs page ![image](https://github.com/hwchase17/langchain/assets/707699/4a11f379-00ed-4f7a-9870-71f74e0cadc6) Using h1's messes with hierarchy, this fixes that, and moves the PyPDFium2 loader out of the middle of PDFMiner docs	2023-05-12 15:03:01 -04:00
Zander Chase	d96f6a106b	Add Steamship Image Generation Tool (#4580 ) Co-authored-by: Enias Cailliau <enias@steamship.com>	2023-05-12 10:35:01 -07:00
Davis Chase	a4a9d1f403	Improve vespa interface (#4546 ) ![Screenshot 2023-05-11 at 7 50 31 PM](https://github.com/hwchase17/langchain/assets/130488702/bc8ab4bb-8006-44fc-ba07-df54e84ee2c1)	2023-05-12 10:11:26 -07:00
Neil Ruaro	3a2855945b	added documentation on retrieving a PG vectorstore (#4578 ) This PR adds in documentation on querying an existing vectorstore in PG Fixes 3191 (issue)	2023-05-12 13:04:06 -04:00
Harrison Chase	5ad151ed44	Add constitutional principles from paper (#4554 ) Add constitutional principles from https://arxiv.org/pdf/2212.08073.pdf --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-12 07:34:03 -07:00
Sai Vinay G	cf4c1394a2	feat: Added class to support huggingface text generation inference server (#4447 ) [Text Generation Inference](https://github.com/huggingface/text-generation-inference) is a Rust, Python and gRPC server for generating text using LLMs. This pull request add support for self hosted Text Generation Inference servers. feature: #4280 --------- Co-authored-by: Your Name <you@example.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-12 07:32:37 -07:00
Leonid Ganeline	e17d0319d5	Add `arxiv` retriever (#4538 )	2023-05-11 22:48:38 -07:00
SimFG	7bcf238a1a	Optimize the initialization method of GPTCache (#4522 ) Optimize the initialization method of GPTCache, so that users can use GPTCache more quickly.	2023-05-11 16:15:23 -07:00
kYLe	446b60d803	Fix a typo in langchain/docs/modules/models/llms/integrations/anyscale.ipynb (#4526 )	2023-05-11 09:03:04 -07:00
Akshaya Annavajhala	b21d7c138c	Callback Handler for MLflow (#4150 ) Rebased Mahmedk's PR with the callback refactor and added the example requested by hwchase plus a couple minor fixes --------- Co-authored-by: Ahmed K <77802633+mahmedk@users.noreply.github.com> Co-authored-by: Ahmed K <mda3k27@gmail.com> Co-authored-by: Davis Chase <130488702+dev2049@users.noreply.github.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 01:10:40 -07:00
kYLe	0d51a1f12b	Add LLMs support for Anyscale Service (#4350 ) Add Anyscale service integration under LLM Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 00:39:59 -07:00
Kristóf Dombi	99b2400048	[Docs]: Add Kinsta to the list of deployment providers (#4445 ) We're fans of the LangChain framework thus we wanted to make sure we provide an easy way for our customers to be able to utilize this framework for their LLM-powered applications at our platform.	2023-05-11 00:29:48 -07:00
Zander Chase	d969f43ed8	Load HuggingFace Tool (#4475 ) # Add option to `load_huggingface_tool` Expose a method to load a huggingface Tool from the HF hub --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 00:07:36 -07:00
Harrison Chase	3ce29cb4a6	Harrison/new search (#4359 ) Co-authored-by: Jiaping(JP) Zhang <vincentzhangv@gmail.com>	2023-05-10 17:09:16 -07:00

1 2 3 4 5 ...

1003 Commits