langchain

Author	SHA1	Message	Date
Harrison Chase	2ffc643086	add listen api docs (#1855 )	2023-03-21 09:29:34 -07:00
Tomoko Uchida	b706966ebc	Add setup instruction in Getting Started for Indexing (#1847 ) `VectorstoreIndexCreator` [uses Chroma as the vectorstore by default](`1c22657256/langchain/indexes/vectorstore.py (L49)`). It may be helpful to add a short note for the setup. You can see how the notebook looks here. https://github.com/mocobeta/langchain/blob/feat/add-setup-instruction-to-index-getting-started/docs/modules/indexes/getting_started.ipynb	2023-03-21 09:06:35 -07:00
Harrison Chase	1c22657256	Harrison/faiss merge (#1843 ) Co-authored-by: Ting Su <ting.su.1995@outlook.com>	2023-03-20 22:54:08 -07:00
Simon Zhou	3674074eb0	Add Qdrant to ecosystem page (#1830 ) Add [Qdrant](https://qdrant.tech/) to [LangChain ecosystem](https://langchain.readthedocs.io/en/latest/ecosystem.html) page.	2023-03-20 22:06:40 -07:00
Wenbin Fang	a7e09d46c5	Add podcast api tool to use NLP to search all podcasts or episodes. (#1833 ) Use the following code to test: ```python import os from langchain.llms import OpenAI from langchain.chains.api import podcast_docs from langchain.chains import APIChain # Get api key here: https://openai.com/pricing os.environ["OPENAI_API_KEY"] = "sk-xxxxx" # Get api key here: https://www.listennotes.com/api/pricing/ listen_api_key = 'xxx' llm = OpenAI(temperature=0) headers = {"X-ListenAPI-Key": listen_api_key} chain = APIChain.from_llm_and_api_docs(llm, podcast_docs.PODCAST_DOCS, headers=headers, verbose=True) chain.run("Search for 'silicon valley bank' podcast episodes, audio length is more than 30 minutes, return only 1 results") ``` Known issues: the api response data might be too big, and we'll get such error: `openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens, however you requested 6733 tokens (6477 in your prompt; 256 for the completion). Please reduce your prompt; or completion length.`	2023-03-20 22:04:17 -07:00
Ikko Eltociear Ashimine	9555bbd5bb	Fix typo in sqlite.ipynb (#1828 ) overriden -> overridden	2023-03-20 16:47:19 -07:00
Harrison Chase	d5b4393bb2	Harrison/llm math (#1808 ) Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2023-03-20 07:53:26 -07:00
Bryan Helmig	7b6ff7fe00	Follow up to #1803 to remove dynamic docs route. (#1818 ) The base docs are going to be more stable and familiar for folks. Dynamic route is currently in flux.	2023-03-20 07:52:41 -07:00
Harrison Chase	76c7b1f677	Harrison/wandb (#1764 ) Co-authored-by: Anish Shah <93145909+ash0ts@users.noreply.github.com>	2023-03-20 07:52:27 -07:00
Harrison Chase	d5d50c39e6	Harrison/azure embeddings (#1787 ) Co-authored-by: Hemant <4627288+ghaccount@users.noreply.github.com>	2023-03-19 10:42:33 -07:00
Harrison Chase	1f18698b2a	Harrison/token buffer memory (#1786 ) Co-authored-by: Aratako <127325395+Aratako@users.noreply.github.com>	2023-03-19 10:42:24 -07:00
Harrison Chase	ef4945af6b	Harrison/chat token usage (#1785 )	2023-03-19 10:32:31 -07:00
Harrison Chase	7de2ada3ea	Harrison/add source column (#1784 ) Co-authored-by: Brian Graham <46691715+briangrahamww@users.noreply.github.com> Co-authored-by: briangrahamww <brian.graham@ww.com>	2023-03-19 10:32:13 -07:00
hitoshi44	3cf493b089	Fix Document & Expose StringPromptTemplate as a custom-prompt-template. (#1753 ) Regarding [this issue](https://github.com/hwchase17/langchain/issues/1754), the code in the document [Creating a custom prompt template](https://langchain.readthedocs.io/en/latest/modules/prompts/examples/custom_prompt_template.html) is no longer functional and outdated. To address this, I have made the following changes: 1. Updated the guide in the document to use `StringPromptTemplate` instead of `BasePromptTemplate`. 2. Exposed `StringPromptTemplate` in `prompts/__init__.py` for easier importing.	2023-03-19 09:47:56 -07:00
hung_ng__	3d6fcb85dc	Add load json prompt example (#1776 ) Hi, I just want to add a PR on the prompt serialization examples of loading from JSON so that it can contain the same as loading from YAML.	2023-03-19 09:28:56 -07:00
Piyush Jain	1a8790d808	Corrects copyright year (#1762 ) Corrected copyright year.	2023-03-18 19:55:05 -07:00
Harrison Chase	8685d53adc	querying tabular data (#1758 )	2023-03-18 11:12:18 -07:00
Harrison Chase	dd90fd02d5	Harrison/move docs (#1741 )	2023-03-17 08:49:10 -07:00
Harrison Chase	07766a69f3	move docs (#1740 )	2023-03-17 08:42:28 -07:00
Harrison Chase	96ebe98dc2	Harrison/latex splitter (#1738 ) Co-authored-by: Aidan Holland <thehappydinoa@gmail.com> Co-authored-by: Jan de Boer <44832123+Janldeboer@users.noreply.github.com>	2023-03-17 08:10:27 -07:00
Harrison Chase	45f05fc939	Harrison/blackboard loader (#1737 ) Co-authored-by: Aidan Holland <thehappydinoa@gmail.com>	2023-03-17 08:02:44 -07:00
Vincent Liao	cf9c3f54f7	docs: add docs link to agent toolkits (#1735 ) New to Langchain, was a bit confused where I should find the toolkits section when I'm at `agent/key_concepts` docs. I added a short link that points to the how to section.	2023-03-17 07:59:49 -07:00
Piyush Jain	cdff6c8181	Sagemaker Endpoint LLM (#1686 ) Updates #965 --------- Co-authored-by: Nimisha Mehta <116048415+nimimeht@users.noreply.github.com> Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net>	2023-03-16 21:58:06 -07:00
libra	8a95fdaee1	Fix all the bug in init Tool in docs (#1725 ) Fix all the example in the docs when init `Tool` Test by render with jupyter	2023-03-16 21:55:44 -07:00
jerwelborn	55efbb8a7e	pydantic/json parsing (#1722 ) ``` class Joke(BaseModel): setup: str = Field(description="question to set up a joke") punchline: str = Field(description="answer to resolve the joke") joke_query = "Tell me a joke." # Or, an example with compound type fields. #class FloatArray(BaseModel): # values: List[float] = Field(description="list of floats") # #float_array_query = "Write out a few terms of fiboacci." model = OpenAI(model_name='text-davinci-003', temperature=0.0) parser = PydanticOutputParser(pydantic_object=Joke) prompt = PromptTemplate( template="Answer the user query.\n{format_instructions}\n{query}\n", input_variables=["query"], partial_variables={"format_instructions": parser.get_format_instructions()} ) _input = prompt.format_prompt(query=joke_query) print("Prompt:\n", _input.to_string()) output = model(_input.to_string()) print("Completion:\n", output) parsed_output = parser.parse(output) print("Parsed completion:\n", parsed_output) ``` ``` Prompt: Answer the user query. The output should be formatted as a JSON instance that conforms to the JSON schema below. For example, the object {"foo": ["bar", "baz"]} conforms to the schema {"foo": {"description": "a list of strings field", "type": "string"}}. Here is the output schema: --- {"setup": {"description": "question to set up a joke", "type": "string"}, "punchline": {"description": "answer to resolve the joke", "type": "string"}} --- Tell me a joke. Completion: {"setup": "Why don't scientists trust atoms?", "punchline": "Because they make up everything!"} Parsed completion: setup="Why don't scientists trust atoms?" punchline='Because they make up everything!' ``` Ofc, works only with LMs of sufficient capacity. DaVinci is reliable but not always. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-03-16 21:43:11 -07:00
Jonathan Pedoeem	606605925d	Adding ability to `return_pl_id` to all PromptLayer Models in LangChain (#1699 ) PromptLayer now has support for [several different tracking features.](https://magniv.notion.site/Track-4deee1b1f7a34c1680d085f82567dab9) In order to use any of these features you need to have a request id associated with the request. In this PR we add a boolean argument called `return_pl_id` which will add `pl_request_id` to the `generation_info` dictionary associated with a generation. We also updated the relevant documentation.	2023-03-16 17:05:23 -07:00
Harrison Chase	3ea6d9c4d2	add docs for save/load messages (#1697 )	2023-03-15 13:13:08 -07:00
Piyush Jain	1279c8de39	Fixed typo, clarified language (#1682 )	2023-03-15 08:00:11 -07:00
at-b612	c7779c800a	Added Mynd URL to gallery (#1684 )	2023-03-15 07:59:59 -07:00
Jithin James	6f4f771897	docs: add path to state_of_the_union.txt in indexes/getting_started page (#1691 ) add the state_of_the_union.txt file so that its easier to follow through with the example. --------- Co-authored-by: Jithin James <jjmachan@pop-os.localdomain>	2023-03-15 07:59:47 -07:00
Ankush Gola	d4edd3c312	Zapier Integration (#1654 ) * Zapier Wrapper and Tools (implemented by Zapier Team) * Zapier Toolkit, examples with mrkl agent --------- Co-authored-by: Mike Knoop <mikeknoop@gmail.com> Co-authored-by: Robert Lewis <robert.lewis@zapier.com>	2023-03-14 23:06:17 -07:00
Harrison Chase	0b29e68c17	Harrison/pgvector (#1679 ) Co-authored-by: Aman Kumar <krsingh.aman@gmail.com>	2023-03-14 21:13:58 -07:00
Harrison Chase	4d7fdb8957	Harrison/gml save (#1676 ) Co-authored-by: Satoru Sakamoto <51464932+satoru814@users.noreply.github.com>	2023-03-14 20:00:22 -07:00
Harrison Chase	656efe6ef3	Harrison/fix nb (#1678 )	2023-03-14 19:34:23 -07:00
Matt Robinson	63aa28e2a6	feat: allow the unstructured kwargs to be passed in to Unstructured document loaders (#1667 ) ### Summary Allows users to pass in `**unstructured_kwargs` to Unstructured document loaders. Implemented with the `strategy` kwargs in mind, but will pass in other kwargs like `include_page_breaks` as well. The two currently supported strategies are `"hi_res"`, which is more accurate but takes longer, and `"fast"`, which processes faster but with lower accuracy. The `"hi_res"` strategy is the default. For PDFs, if `detectron2` is not available and the user selects `"hi_res"`, the loader will fallback to using the `"fast"` strategy. ### Testing #### Make sure the `strategy` kwarg works Run the following in iPython to verify that the `"fast"` strategy is indeed faster. ```python from langchain.document_loaders import UnstructuredFileLoader loader = UnstructuredFileLoader("layout-parser-paper-fast.pdf", strategy="fast", mode="elements") %timeit loader.load() loader = UnstructuredFileLoader("layout-parser-paper-fast.pdf", mode="elements") %timeit loader.load() ``` On my system I get: ```python In [3]: from langchain.document_loaders import UnstructuredFileLoader In [4]: loader = UnstructuredFileLoader("layout-parser-paper-fast.pdf", strategy="fast", mode="elements") In [5]: %timeit loader.load() 247 ms ± 369 µs per loop (mean ± std. dev. of 7 runs, 1 loop each) In [6]: loader = UnstructuredFileLoader("layout-parser-paper-fast.pdf", mode="elements") In [7]: %timeit loader.load() 2.45 s ± 31 ms per loop (mean ± std. dev. of 7 runs, 1 loop each) ``` #### Make sure older versions of `unstructured` still work Run `pip install unstructured==0.5.3` and then verify the following runs without error: ```python from langchain.document_loaders import UnstructuredFileLoader loader = UnstructuredFileLoader("layout-parser-paper-fast.pdf", mode="elements") loader.load() ```	2023-03-14 18:15:28 -07:00
Matthias Kern	c3dfbdf0da	Remove outdated code from Chat VectorDB QA example (#1670 )	2023-03-14 18:13:51 -07:00
Bilel MEDIMEGH	a2280f321f	Docs: Fix typo in memory/key_concepts.md (#1671 ) dialouge -> dialogue	2023-03-14 18:12:01 -07:00
Xin Qiu	4e13cef05a	feat: add redisearch vectorstore (#1307 ) # Description Add `RediSearch` vectorstore for LangChain RediSearch: [RediSearch quick start](https://redis.io/docs/stack/search/quick_start/) # How to use ``` from langchain.vectorstores.redisearch import RediSearch rds = RediSearch.from_documents(docs, embeddings,redisearch_url="redis://localhost:6379") ```	2023-03-14 18:06:03 -07:00
Harrison Chase	2d098e8869	Harrison/agent eval (#1620 ) Co-authored-by: jerwelborn <jeremy.welborn@gmail.com>	2023-03-14 12:37:48 -07:00
Harrison Chase	7cf46b3fee	Harrison/convo agent (#1642 )	2023-03-14 09:42:24 -07:00
Jon Luo	0a1b1806e9	sql: do not hard code the LIMIT clause in the table_info section (#1563 ) Seeing a lot of issues in Discord in which the LLM is not using the correct LIMIT clause for different SQL dialects. ie, it's using `LIMIT` for mssql instead of `TOP`, or instead of `ROWNUM` for Oracle, etc. I think this could be due to us specifying the LIMIT statement in the example rows portion of `table_info`. So the LLM is seeing the `LIMIT` statement used in the prompt. Since we can't specify each dialect's method here, I think it's fine to just replace the `SELECT... LIMIT 3;` statement with `3 rows from table_name table:`, and wrap everything in a block comment directly following the `CREATE` statement. The Rajkumar et al paper wrapped the example rows and `SELECT` statement in a block comment as well anyway. Thoughts @fpingham?	2023-03-13 23:08:27 -07:00
Tim Asp	b3234bf3b0	cleanup: unify 3 different pdf loaders, rename PagedPDFSplitter (#1615 ) `OnlinePDFLoader` and `PagedPDFSplitter` lived separate from the rest of the pdf loaders. Because they're all similar, I propose moving all to `pdy.py` and the same docs/examples page. Additionally, `PagedPDFSplitter` naming doesn't match the pattern the rest of the loaders follow, so I renamed to `PyPDFLoader` and had it inherit from `BasePDFLoader` so it can now load from remote file sources.	2023-03-13 23:06:50 -07:00
Harrison Chase	56aff797c0	docs req (#1647 )	2023-03-13 16:03:32 -07:00
Harrison Chase	d53ff270e0	bump version to 109 (#1646 )	2023-03-13 15:52:35 -07:00
Harrison Chase	df6c33d4b3	Harrison/new output parser (#1617 )	2023-03-13 15:08:39 -07:00
Eugene Yurtsev	bd4a2a670b	Add copy button to sphinx notebooks (#1622 ) This adds a copy button at the top right corner of all notebook cells in sphinx notebooks.	2023-03-12 21:15:07 -07:00
Ikko Eltociear Ashimine	6e98ab01e1	Fix typo in vectorstore.ipynb (#1614 ) Initalize -> Initialize	2023-03-12 14:12:47 -07:00
yakigac	acd86d33bc	Add read only shared memory (#1491 ) Provide shared memory capability for the Agent. Inspired by #1293 . ## Problem If both Agent and Tools (i.e., LLMChain) use the same memory, both of them will save the context. It can be annoying in some cases. ## Solution Create a memory wrapper that ignores the save and clear, thereby preventing updates from Agent or Tools.	2023-03-12 09:34:36 -07:00
Harrison Chase	c9b5a30b37	move output parsing (#1605 )	2023-03-11 16:41:03 -08:00
Harrison Chase	15de3e8137	Harrison/docs footer (#1600 ) Co-authored-by: Albert Avetisian <albert.avetisian@gmail.com>	2023-03-11 09:18:35 -08:00

1 2 3 4 5 ...

408 Commits