langchain/README.md

# 🦜️🔗 LangChain

⚡ Building applications with LLMs through composability ⚡

[![lint](https://github.com/hwchase17/langchain/actions/workflows/lint.yml/badge.svg)](https://github.com/hwchase17/langchain/actions/workflows/lint.yml) [![test](https://github.com/hwchase17/langchain/actions/workflows/test.yml/badge.svg)](https://github.com/hwchase17/langchain/actions/workflows/test.yml) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT) [![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai) [![](https://dcbadge.vercel.app/api/server/6adMQxSpJS?compact=true&style=flat)](https://discord.gg/6adMQxSpJS)

## Quick Install

`pip install langchain`

## 🤔 What is this?

Large language models (LLMs) are emerging as a transformative technology, enabling
developers to build applications that they previously could not.
But using these LLMs in isolation is often not enough to
create a truly powerful app - the real power comes when you are able to
combine them with other sources of computation or knowledge.

This library is aimed at assisting in the development of those types of applications.

## 📖 Documentation

Please see [here](https://langchain.readthedocs.io/en/latest/?) for full documentation on:
- Getting started (installation, setting up environment, simple examples)
- How-To examples (demos, integrations, helper functions)
- Reference (full API docs)
- Resources (high level explanation of core concepts)

## 🚀 What can this help with?

There are three main areas (with a forth coming soon) that LangChain is designed to help with.
These are, in increasing order of complexity:
1. LLM and Prompts
2. Chains
3. Agents
4. Memory

Let's go through these categories and for each one identify key concepts (to clarify terminology) as well as the problems in this area LangChain helps solve.

### LLMs and Prompts
Calling out to an LLM once is pretty easy, with most of them being behind well documented APIs.
However, there are still some challenges going from that to an application running in production that LangChain attempts to address.

**Key Concepts**
- LLM: A large language model, in particular a text-to-text model.
- Prompt: The input to a language model. Typically this is not simply a hardcoded string but rather a combination of a template, some examples, and user input.
- Prompt Template: An object responsible for constructing the final prompt to pass to a LLM.
- Examples: Datapoints that can be included in the prompt in order to give the model more context what to do.
- Few Shot Prompt Template: A subclass of the PromptTemplate class that uses examples.
- Example Selector: A class responsible to selecting examples to use dynamically (depending on user input) in a few shot prompt.

**Problems Solved**
- Switching costs: by exposing a standard interface for all the top LLM providers, LangChain makes it easy to switch from one provider to another, whether it be for production use cases or just for testing stuff out.
- Prompt management: managing your prompts is easy when you only have one simple one, but can get tricky when you have a bunch or when they start to get more complex. LangChain provides a standard way for storing, constructing, and referencing prompts.
- Prompt optimization: despite the underlying models getting better and better, there is still currently a need for carefully constructing prompts. 

### Chains
Using an LLM in isolation is fine for some simple applications, but many more complex ones require chaining LLMs - either with eachother or with other experts.
LangChain provides several parts to help with that.

**Key Concepts**
- Tools: APIs designed for assisting with a particular use case (search, databases, Python REPL, etc). Prompt templates, LLMs, and chains can also be considered tools.
- Chains: A combination of multiple tools in a deterministic manner.

**Problems Solved**
- Standard interface for working with Chains
- Easy way to construct chains of LLMs
- Lots of integrations with other tools that you may want to use in conjunction with LLMs 
- End-to-end chains for common workflows (database question/answer, recursive summarization, etc)

### Agents
Some applications will require not just a predetermined chain of calls to LLMs/other tools, but potentially an unknown chain that depends on the user input.
In these types of chains, there is a “agent” which has access to a suite of tools.
Depending on the user input, the agent can then decide which, if any, of these tools to call.

**Key Concepts**
- Tools: same as above.
- Agent: An LLM-powered class responsible for determining which tools to use and in what order.


**Problems Solved**
- Standard agent interfaces
- A selection of powerful agents to choose from
- Common chains that can be used as tools

### Memory
By default, Chains and Agents are stateless, meaning that they treat each incoming query independently.
In some applications (chatbots being a GREAT example) it is highly important to remember previous interactions,
both at a short term but also at a long term level. The concept of "Memory" exists to do exactly that.

**Key Concepts**
- Memory: A class that can be added to an Agent or Chain to (1) pull in memory variables before calling that chain/agent, and (2) create new memories after the chain/agent finishes.
- Memory Variables: Variables returned from a Memory class, to be passed into the chain/agent along with the user input.

**Problems Solved**
- Standard memory interfaces
- A collection of common memory implementations to choose from
- Common chains/agents that use memory (e.g. chatbots)

## 🤖 Developer Guide

To begin developing on this project, first clone the repo locally.

### Quick Start

This project uses [Poetry](https://python-poetry.org/) as a dependency manager. Check out Poetry's own [documentation on how to install it](https://python-poetry.org/docs/#installation) on your system before proceeding.

To install requirements:

```bash
poetry install -E all
```

This will install all requirements for running the package, examples, linting, formatting, and tests. Note the `-E all` flag will install all optional dependencies necessary for integration testing.

Now, you should be able to run the common tasks in the following section.

### Common Tasks

#### Code Formatting

Formatting for this project is a combination of [Black](https://black.readthedocs.io/en/stable/) and [isort](https://pycqa.github.io/isort/).

To run formatting for this project:

```bash
make format
```

#### Linting

Linting for this project is a combination of [Black](https://black.readthedocs.io/en/stable/), [isort](https://pycqa.github.io/isort/), [flake8](https://flake8.pycqa.org/en/latest/), and [mypy](http://mypy-lang.org/).

To run linting for this project:

```bash
make lint
```

We recognize linting can be annoying - if you do not want to do it, please contact a project maintainer and they can help you with it. We do not want this to be a blocker for good code getting contributed.

#### Testing

Unit tests cover modular logic that does not require calls to outside apis.

To run unit tests:

```bash
make tests
```

If you add new logic, please add a unit test.

Integration tests cover logic that requires making calls to outside APIs (often integration with other services).

To run integration tests:

```bash
make integration_tests
```

If you add support for a new external API, please add a new integration test.

#### Adding a Jupyter Notebook

If you are adding a Jupyter notebook example, you'll want to install the optional `dev` dependencies.

To install dev dependencies:

```bash
poetry install --with dev
```

Launch a notebook:

```bash
poetry run jupyter notebook
```

When you run `poetry install`, the `langchain` package is installed as editable in the virtualenv, so your new logic can be imported into the notebook.

#### Contribute Documentation

Docs are largely autogenerated by [sphinx](https://www.sphinx-doc.org/en/master/) from the code.

For that reason, we ask that you add good documentation to all classes and methods.

Similar to linting, we recognize documentation can be annoying - if you do not want to do it, please contact a project maintainer and they can help you with it. We do not want this to be a blocker for good code getting contributed.
initial commit 2022-10-24 21:51:15 +00:00			`# 🦜️🔗 LangChain`

			`⚡ Building applications with LLMs through composability ⚡`

Improve credential handing to allow passing in constructors (#79) Addresses the issue in #76 by either using the relevant environment variable if set or using a string passed in the constructor. Prefers the constructor string over the environment variable, which seemed like the natural choice to me. 2022-11-07 21:34:45 +00:00			[![lint](https://github.com/hwchase17/langchain/actions/workflows/lint.yml/badge.svg)](https://github.com/hwchase17/langchain/actions/workflows/lint.yml) [![test](https://github.com/hwchase17/langchain/actions/workflows/test.yml/badge.svg)](https://github.com/hwchase17/langchain/actions/workflows/test.yml) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT) [![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai) [![](https://dcbadge.vercel.app/api/server/6adMQxSpJS?compact=true&style=flat)](https://discord.gg/6adMQxSpJS)
initial commit 2022-10-24 21:51:15 +00:00
			`## Quick Install`

			`pip install langchain`

			`## 🤔 What is this?`

			`Large language models (LLMs) are emerging as a transformative technology, enabling`
			`developers to build applications that they previously could not.`
			`But using these LLMs in isolation is often not enough to`
			`create a truly powerful app - the real power comes when you are able to`
			`combine them with other sources of computation or knowledge.`

			`This library is aimed at assisting in the development of those types of applications.`

Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`## 📖 Documentation`

			`Please see [here](https://langchain.readthedocs.io/en/latest/?) for full documentation on:`
			`- Getting started (installation, setting up environment, simple examples)`
			`- How-To examples (demos, integrations, helper functions)`
			`- Reference (full API docs)`
			`- Resources (high level explanation of core concepts)`
Implements NLTK and Spacy-based TextSplitters (#103) This PR is for Issue #88 - [x] `make format` - [x] `make lint` - [x] `make tests` 2022-11-10 04:45:30 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`## 🚀 What can this help with?`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`There are three main areas (with a forth coming soon) that LangChain is designed to help with.`
			`These are, in increasing order of complexity:`
			`1. LLM and Prompts`
			`2. Chains`
			`3. Agents`
Update README.md memory now added as a feature (#208) 2022-11-27 04:21:42 +00:00			`4. Memory`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`Let's go through these categories and for each one identify key concepts (to clarify terminology) as well as the problems in this area LangChain helps solve.`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`### LLMs and Prompts`
			`Calling out to an LLM once is pretty easy, with most of them being behind well documented APIs.`
			`However, there are still some challenges going from that to an application running in production that LangChain attempts to address.`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`Key Concepts`
			`- LLM: A large language model, in particular a text-to-text model.`
			`- Prompt: The input to a language model. Typically this is not simply a hardcoded string but rather a combination of a template, some examples, and user input.`
			`- Prompt Template: An object responsible for constructing the final prompt to pass to a LLM.`
add custom prompt notebooks (#198) 2022-11-26 14:07:02 +00:00			`- Examples: Datapoints that can be included in the prompt in order to give the model more context what to do.`
			`- Few Shot Prompt Template: A subclass of the PromptTemplate class that uses examples.`
			`- Example Selector: A class responsible to selecting examples to use dynamically (depending on user input) in a few shot prompt.`
initial commit 2022-10-24 21:51:15 +00:00
Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`- Switching costs: by exposing a standard interface for all the top LLM providers, LangChain makes it easy to switch from one provider to another, whether it be for production use cases or just for testing stuff out.`
			`- Prompt management: managing your prompts is easy when you only have one simple one, but can get tricky when you have a bunch or when they start to get more complex. LangChain provides a standard way for storing, constructing, and referencing prompts.`
			`- Prompt optimization: despite the underlying models getting better and better, there is still currently a need for carefully constructing prompts.`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`### Chains`
			`Using an LLM in isolation is fine for some simple applications, but many more complex ones require chaining LLMs - either with eachother or with other experts.`
			`LangChain provides several parts to help with that.`
initial commit 2022-10-24 21:51:15 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`Key Concepts`
			`- Tools: APIs designed for assisting with a particular use case (search, databases, Python REPL, etc). Prompt templates, LLMs, and chains can also be considered tools.`
			`- Chains: A combination of multiple tools in a deterministic manner.`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00
Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`- Standard interface for working with Chains`
			`- Easy way to construct chains of LLMs`
			`- Lots of integrations with other tools that you may want to use in conjunction with LLMs`
			`- End-to-end chains for common workflows (database question/answer, recursive summarization, etc)`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`### Agents`
			`Some applications will require not just a predetermined chain of calls to LLMs/other tools, but potentially an unknown chain that depends on the user input.`
			`In these types of chains, there is a “agent” which has access to a suite of tools.`
			`Depending on the user input, the agent can then decide which, if any, of these tools to call.`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`Key Concepts`
			`- Tools: same as above.`
			`- Agent: An LLM-powered class responsible for determining which tools to use and in what order.`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00

Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`- Standard agent interfaces`
			`- A selection of powerful agents to choose from`
			`- Common chains that can be used as tools`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`### Memory`
Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`By default, Chains and Agents are stateless, meaning that they treat each incoming query independently.`
			`In some applications (chatbots being a GREAT example) it is highly important to remember previous interactions,`
			`both at a short term but also at a long term level. The concept of "Memory" exists to do exactly that.`

			`Key Concepts`
			`- Memory: A class that can be added to an Agent or Chain to (1) pull in memory variables before calling that chain/agent, and (2) create new memories after the chain/agent finishes.`
			`- Memory Variables: Variables returned from a Memory class, to be passed into the chain/agent along with the user input.`

			`Problems Solved`
			`- Standard memory interfaces`
			`- A collection of common memory implementations to choose from`
			`- Common chains/agents that use memory (e.g. chatbots)`
ElasticVectorSearch: Add in vector search backed by Elastic (#67) ![image](https://user-images.githubusercontent.com/6690839/200147455-33a68e20-c3c0-4045-9bff-598b38ae8fb2.png) woo! Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> 2022-11-08 15:01:42 +00:00
add developer guide (#44) 2022-10-31 05:48:52 +00:00			`## 🤖 Developer Guide`

chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00			`To begin developing on this project, first clone the repo locally.`

			`### Quick Start`

			`This project uses [Poetry](https://python-poetry.org/) as a dependency manager. Check out Poetry's own [documentation on how to install it](https://python-poetry.org/docs/#installation) on your system before proceeding.`

			`To install requirements:`

			```bash
			`poetry install -E all`
			```

			This will install all requirements for running the package, examples, linting, formatting, and tests. Note the `-E all` flag will install all optional dependencies necessary for integration testing.

			`Now, you should be able to run the common tasks in the following section.`

			`### Common Tasks`

			`#### Code Formatting`
add developer guide (#44) 2022-10-31 05:48:52 +00:00
			`Formatting for this project is a combination of [Black](https://black.readthedocs.io/en/stable/) and [isort](https://pycqa.github.io/isort/).`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
			`To run formatting for this project:`

			```bash
			`make format`
			```

			`#### Linting`
add developer guide (#44) 2022-10-31 05:48:52 +00:00
			`Linting for this project is a combination of [Black](https://black.readthedocs.io/en/stable/), [isort](https://pycqa.github.io/isort/), [flake8](https://flake8.pycqa.org/en/latest/), and [mypy](http://mypy-lang.org/).`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
			`To run linting for this project:`

			```bash
			`make lint`
			```

add developer guide (#44) 2022-10-31 05:48:52 +00:00			`We recognize linting can be annoying - if you do not want to do it, please contact a project maintainer and they can help you with it. We do not want this to be a blocker for good code getting contributed.`

chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00			`#### Testing`

add developer guide (#44) 2022-10-31 05:48:52 +00:00			`Unit tests cover modular logic that does not require calls to outside apis.`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
			`To run unit tests:`

			```bash
			`make tests`
			```

add developer guide (#44) 2022-10-31 05:48:52 +00:00			`If you add new logic, please add a unit test.`

			`Integration tests cover logic that requires making calls to outside APIs (often integration with other services).`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
			`To run integration tests:`

			```bash
			`make integration_tests`
			```

add developer guide (#44) 2022-10-31 05:48:52 +00:00			`If you add support for a new external API, please add a new integration test.`

chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00			`#### Adding a Jupyter Notebook`

			If you are adding a Jupyter notebook example, you'll want to install the optional `dev` dependencies.

			`To install dev dependencies:`

			```bash
			`poetry install --with dev`
			```

			`Launch a notebook:`

			```bash
			`poetry run jupyter notebook`
			```

			When you run `poetry install`, the `langchain` package is installed as editable in the virtualenv, so your new logic can be imported into the notebook.

			`#### Contribute Documentation`
Refactor prompts into module, add example generation utils (#64) 2022-11-06 23:40:33 +00:00
add developer guide (#44) 2022-10-31 05:48:52 +00:00			`Docs are largely autogenerated by [sphinx](https://www.sphinx-doc.org/en/master/) from the code.`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
add developer guide (#44) 2022-10-31 05:48:52 +00:00			`For that reason, we ask that you add good documentation to all classes and methods.`
chore: use poetry as dependency manager (#242) * Adopts [Poetry](https://python-poetry.org/) as a dependency manager * Introduces dependency version requirements * Deprecates Python 3.7 support TODO - [x] Update developer guide - [x] Add back `playwright`, `manifest-ml`, and `jupyter` to dependency group Not Doing => Fast Follow - Investigate single source for version, perhaps relying on GitHub tags and [tackling this issue](https://github.com/hwchase17/langchain/issues/26) 2022-12-04 00:42:59 +00:00
add developer guide (#44) 2022-10-31 05:48:52 +00:00			`Similar to linting, we recognize documentation can be annoying - if you do not want to do it, please contact a project maintainer and they can help you with it. We do not want this to be a blocker for good code getting contributed.`