langchain/docs/index.rst

Welcome to LangChain
==========================

Large language models (LLMs) are emerging as a transformative technology, enabling
developers to build applications that they previously could not.
But using these LLMs in isolation is often not enough to
create a truly powerful app - the real power comes when you are able to
combine them with other sources of computation or knowledge.

This library is aimed at assisting in the development of those types of applications.

There are five main areas that LangChain is designed to help with.
These are, in increasing order of complexity:

1. LLM and Prompts
2. Chains
3. Data Augmented Generation
4. Agents
5. Memory

Let's go through these categories and for each one identify key concepts (to clarify terminology) as well as the problems in this area LangChain helps solve.

**🦜 LLMs and Prompts**

Calling out to an LLM once is pretty easy, with most of them being behind well documented APIs.
However, there are still some challenges going from that to an application running in production that LangChain attempts to address.

*Key Concepts*

- LLM: A large language model, in particular a text-to-text model.
- Prompt: The input to a language model. Typically this is not simply a hardcoded string but rather a combination of a template, some examples, and user input.
- Prompt Template: An object responsible for constructing the final prompt to pass to a LLM.

*Problems Solved*

- Switching costs: by exposing a standard interface for all the top LLM providers, LangChain makes it easy to switch from one provider to another, whether it be for production use cases or just for testing stuff out.
- Prompt management: managing your prompts is easy when you only have one simple one, but can get tricky when you have a bunch or when they start to get more complex. LangChain provides a standard way for storing, constructing, and referencing prompts.
- Prompt optimization: despite the underlying models getting better and better, there is still currently a need for carefully constructing prompts.

**🔗️ Chains**

Using an LLM in isolation is fine for some simple applications, but many more complex ones require chaining LLMs - either with eachother or with other experts.
LangChain provides several parts to help with that.

*Key Concepts*

- Tools: APIs designed for assisting with a particular use case (search, databases, Python REPL, etc). Prompt templates, LLMs, and chains can also be considered tools.
- Chains: A combination of multiple tools in a deterministic manner.

*Problems Solved*

- Standard interface for working with Chains
- Easy way to construct chains of LLMs
- Lots of integrations with other tools that you may want to use in conjunction with LLMs
- End-to-end chains for common workflows (database question/answer, api calling, etc)

**📚 Data Augmented Generation**

LLMs have access to all the data they were trained on, but there are still large chunks of data they were not trained on.
Data Augmented Generation covers how to use LLMs to generate text conditioning on data outside of what the LLM was trained on.

*Key Concepts*

- Documents: A document is a piece of text, along with some associated metadata, that can be inserted into the context of a query to condition generation on that text.
- Embeddings: A vector representation of text (or other unstructured data). Useful for being able to numerically compare pieces of text.
- Vectorstore: A database which stores embeddings and can be searched over.

*Problems Solved*

- Standard interface for working with Documents, Embeddings, and Vectorstores
- Lots of integrations with common embedding providers and vectorstores
- End-to-end chains for common workflows (recursive summarization, question answering over documents, etc)


**🤖 Agents**

Some applications will require not just a predetermined chain of calls to LLMs/other tools, but potentially an unknown chain that depends on the user input.
In these types of chains, there is a “agent” which has access to a suite of tools.
Depending on the user input, the agent can then decide which, if any, of these tools to call.

*Key Concepts*

- Tools: same as above.
- Agent: An LLM-powered class responsible for determining which tools to use and in what order.


*Problems Solved*

- Standard agent interfaces
- A selection of powerful agents to choose from
- Common chains that can be used as tools

**🧠 Memory**

By default, Chains and Agents are stateless, meaning that they treat each incoming query independently.
In some applications (chatbots being a GREAT example) it is highly important to remember previous interactions,
both at a short term but also at a long term level. The concept of "Memory" exists to do exactly that.

*Key Concepts*

- Memory: A class that can be added to an Agent or Chain to (1) pull in memory variables before calling that chain/agent, and (2) create new memories after the chain/agent finishes.
- Memory Variables: Variables returned from a Memory class, to be passed into the chain/agent along with the user input.

*Problems Solved*

- Standard memory interfaces
- A collection of common memory implementations to choose from
- Common chains/agents that use memory (e.g. chatbots)

Documentation Structure
=======================
The documentation is structured into the following sections:


.. toctree::
   :maxdepth: 1
   :caption: Getting Started
   :name: getting_started

   getting_started/installation.md
   getting_started/environment.md
   getting_started/llm.md
   getting_started/llm_chain.md
   getting_started/sequential_chains.ipynb
   getting_started/data_augmented_generation.ipynb
   getting_started/agents.ipynb
   getting_started/memory.ipynb

Goes over a simple walk through and tutorial for getting started setting up a simple chain that generates a company name based on what the company makes.
Covers installation, environment set up, calling LLMs, and using prompts.
Start here if you haven't used LangChain before.


.. toctree::
   :maxdepth: 1
   :caption: How-To Examples
   :name: examples

   examples/prompts.rst
   examples/chains.rst
   examples/data_augmented_generation.rst
   examples/agents.rst
   examples/memory.rst
   examples/model_laboratory.ipynb

More elaborate examples and walk-throughs of particular
integrations and use cases. This is the place to look if you have questions
about how to integrate certain pieces, or if you want to find examples of
common tasks or cool demos.


.. toctree::
   :maxdepth: 1
   :caption: Reference
   :name: reference

   reference/installation.md
   reference/integrations.md
   reference/prompts.rst
   reference/chains.rst
   reference/data_augmented_generation.rst
   reference/modules/agents


Full API documentation. This is the place to look if you want to
see detailed information about the various classes, methods, and APIs.


.. toctree::
   :maxdepth: 1
   :caption: Resources
   :name: resources

   explanation/core_concepts.md
   explanation/combine_docs.md
   explanation/agents.md
   explanation/tools.md
   explanation/glossary.md
   explanation/cool_demos.md
   Discord <https://discord.gg/6adMQxSpJS>

Higher level, conceptual explanations of the LangChain components.
This is the place to go if you want to increase your high level understanding
of the problems LangChain is solving, and how we decided to go about do so.
initial commit 2022-10-24 21:51:15 +00:00			`Welcome to LangChain`
			`==========================`

Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`Large language models (LLMs) are emerging as a transformative technology, enabling`
			`developers to build applications that they previously could not.`
			`But using these LLMs in isolation is often not enough to`
			`create a truly powerful app - the real power comes when you are able to`
			`combine them with other sources of computation or knowledge.`

			`This library is aimed at assisting in the development of those types of applications.`

Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`There are five main areas that LangChain is designed to help with.`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`These are, in increasing order of complexity:`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`1. LLM and Prompts`
			`2. Chains`
Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`3. Data Augmented Generation`
			`4. Agents`
			`5. Memory`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`Let's go through these categories and for each one identify key concepts (to clarify terminology) as well as the problems in this area LangChain helps solve.`

change to agent (#173) 2022-11-23 02:02:20 +00:00			`🦜 LLMs and Prompts`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`Calling out to an LLM once is pretty easy, with most of them being behind well documented APIs.`
			`However, there are still some challenges going from that to an application running in production that LangChain attempts to address.`

			`Key Concepts`

			`- LLM: A large language model, in particular a text-to-text model.`
			`- Prompt: The input to a language model. Typically this is not simply a hardcoded string but rather a combination of a template, some examples, and user input.`
			`- Prompt Template: An object responsible for constructing the final prompt to pass to a LLM.`

Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`- Switching costs: by exposing a standard interface for all the top LLM providers, LangChain makes it easy to switch from one provider to another, whether it be for production use cases or just for testing stuff out.`
			`- Prompt management: managing your prompts is easy when you only have one simple one, but can get tricky when you have a bunch or when they start to get more complex. LangChain provides a standard way for storing, constructing, and referencing prompts.`
			`- Prompt optimization: despite the underlying models getting better and better, there is still currently a need for carefully constructing prompts.`

change to agent (#173) 2022-11-23 02:02:20 +00:00			`🔗️ Chains`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`Using an LLM in isolation is fine for some simple applications, but many more complex ones require chaining LLMs - either with eachother or with other experts.`
			`LangChain provides several parts to help with that.`

			`Key Concepts`

			`- Tools: APIs designed for assisting with a particular use case (search, databases, Python REPL, etc). Prompt templates, LLMs, and chains can also be considered tools.`
			`- Chains: A combination of multiple tools in a deterministic manner.`

Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`- Standard interface for working with Chains`
			`- Easy way to construct chains of LLMs`
			`- Lots of integrations with other tools that you may want to use in conjunction with LLMs`
Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`- End-to-end chains for common workflows (database question/answer, api calling, etc)`

			`📚 Data Augmented Generation`

			`LLMs have access to all the data they were trained on, but there are still large chunks of data they were not trained on.`
			`Data Augmented Generation covers how to use LLMs to generate text conditioning on data outside of what the LLM was trained on.`

			`Key Concepts`

			`- Documents: A document is a piece of text, along with some associated metadata, that can be inserted into the context of a query to condition generation on that text.`
			`- Embeddings: A vector representation of text (or other unstructured data). Useful for being able to numerically compare pieces of text.`
			`- Vectorstore: A database which stores embeddings and can be searched over.`

			`Problems Solved`

			`- Standard interface for working with Documents, Embeddings, and Vectorstores`
			`- Lots of integrations with common embedding providers and vectorstores`
			`- End-to-end chains for common workflows (recursive summarization, question answering over documents, etc)`

(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
change to agent (#173) 2022-11-23 02:02:20 +00:00			`🤖 Agents`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`Some applications will require not just a predetermined chain of calls to LLMs/other tools, but potentially an unknown chain that depends on the user input.`
			`In these types of chains, there is a “agent” which has access to a suite of tools.`
			`Depending on the user input, the agent can then decide which, if any, of these tools to call.`

			`Key Concepts`

			`- Tools: same as above.`
			`- Agent: An LLM-powered class responsible for determining which tools to use and in what order.`


Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`Problems Solved`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`- Standard agent interfaces`
			`- A selection of powerful agents to choose from`
			`- Common chains that can be used as tools`

change to agent (#173) 2022-11-23 02:02:20 +00:00			`🧠 Memory`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
Harrison/memory docs (#195) update memory docs and change variables 2022-11-26 13:58:54 +00:00			`By default, Chains and Agents are stateless, meaning that they treat each incoming query independently.`
			`In some applications (chatbots being a GREAT example) it is highly important to remember previous interactions,`
			`both at a short term but also at a long term level. The concept of "Memory" exists to do exactly that.`

			`Key Concepts`

			`- Memory: A class that can be added to an Agent or Chain to (1) pull in memory variables before calling that chain/agent, and (2) create new memories after the chain/agent finishes.`
			`- Memory Variables: Variables returned from a Memory class, to be passed into the chain/agent along with the user input.`

			`Problems Solved`

			`- Standard memory interfaces`
			`- A collection of common memory implementations to choose from`
			`- Common chains/agents that use memory (e.g. chatbots)`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00
			`Documentation Structure`
			`=======================`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`The documentation is structured into the following sections:`


			`.. toctree::`
			`:maxdepth: 1`
			`:caption: Getting Started`
			`:name: getting_started`

			`getting_started/installation.md`
			`getting_started/environment.md`
			`getting_started/llm.md`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`getting_started/llm_chain.md`
Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`getting_started/sequential_chains.ipynb`
			`getting_started/data_augmented_generation.ipynb`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`getting_started/agents.ipynb`
documentation (#191) 2022-11-25 17:41:27 +00:00			`getting_started/memory.ipynb`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00
			`Goes over a simple walk through and tutorial for getting started setting up a simple chain that generates a company name based on what the company makes.`
			`Covers installation, environment set up, calling LLMs, and using prompts.`
			`Start here if you haven't used LangChain before.`


initial commit 2022-10-24 21:51:15 +00:00			`.. toctree::`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`:maxdepth: 1`
			`:caption: How-To Examples`
			`:name: examples`
initial commit 2022-10-24 21:51:15 +00:00
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`examples/prompts.rst`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`examples/chains.rst`
Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`examples/data_augmented_generation.rst`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`examples/agents.rst`
Harrison/update docs mem (#201) 2022-11-26 14:38:49 +00:00			`examples/memory.rst`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`examples/model_laboratory.ipynb`

			`More elaborate examples and walk-throughs of particular`
			`integrations and use cases. This is the place to look if you have questions`
			`about how to integrate certain pieces, or if you want to find examples of`
			`common tasks or cool demos.`


			`.. toctree::`
			`:maxdepth: 1`
			`:caption: Reference`
			`:name: reference`

Harrison/improve data augmented generation docs (#390) Co-authored-by: cameronccohen <cameron.c.cohen@gmail.com> Co-authored-by: Cameron Cohen <cameron.cohen@quantco.com> 2022-12-21 03:24:08 +00:00			`reference/installation.md`
			`reference/integrations.md`
			`reference/prompts.rst`
			`reference/chains.rst`
			`reference/data_augmented_generation.rst`
			`reference/modules/agents`
Harrison/initial glossary (#61) 2022-11-04 15:02:21 +00:00
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00
			`Full API documentation. This is the place to look if you want to`
			`see detailed information about the various classes, methods, and APIs.`


Harrison/initial glossary (#61) 2022-11-04 15:02:21 +00:00			`.. toctree::`
			`:maxdepth: 1`
			`:caption: Resources`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00			`:name: resources`
Harrison/initial glossary (#61) 2022-11-04 15:02:21 +00:00
add few shot example (#148) 2022-11-20 04:32:45 +00:00			`explanation/core_concepts.md`
Harrison/base combine doc chain (#264) 2022-12-08 06:56:26 +00:00			`explanation/combine_docs.md`
(WIP) agents (#171) 2022-11-22 14:16:26 +00:00			`explanation/agents.md`
Harrison/tools exp (#372) 2022-12-19 02:51:23 +00:00			`explanation/tools.md`
add few shot example (#148) 2022-11-20 04:32:45 +00:00			`explanation/glossary.md`
Harrison/list of examples (#218) 2022-11-30 04:08:00 +00:00			`explanation/cool_demos.md`
Harrison/initial glossary (#61) 2022-11-04 15:02:21 +00:00			`Discord <https://discord.gg/6adMQxSpJS>`
Harrison/redo docs (#130) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> 2022-11-14 04:13:23 +00:00
			`Higher level, conceptual explanations of the LangChain components.`
			`This is the place to go if you want to increase your high level understanding`
			`of the problems LangChain is solving, and how we decided to go about do so.`