langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
Davis Chase	9ab7101182	WIP: FLARE-inspired chain (#4612 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-05-13 09:28:28 -07:00
Harrison Chase	daa3e6dedb	Harrison/prompt constructor methods (#4616 )	2023-05-13 09:23:51 -07:00
Harrison Chase	6265cbfb11	Harrison/standard llm interface (#4615 )	2023-05-13 09:05:31 -07:00
Harrison Chase	7d425cbf38	improve sql prompt (#4611 ) Co-authored-by: Taqi Jaffri <tjaffri@docugami.com> Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>	2023-05-12 21:55:03 -07:00
Tim Asp	ed0d557ede	docs: fix pdf docs hierarchy and formatting (#4593 ) # Fix pdf loader docs page ![image](https://github.com/hwchase17/langchain/assets/707699/4a11f379-00ed-4f7a-9870-71f74e0cadc6) Using h1's messes with hierarchy, this fixes that, and moves the PyPDFium2 loader out of the middle of PDFMiner docs	2023-05-12 15:03:01 -04:00
Zander Chase	d96f6a106b	Add Steamship Image Generation Tool (#4580 ) Co-authored-by: Enias Cailliau <enias@steamship.com>	2023-05-12 10:35:01 -07:00
Davis Chase	a4a9d1f403	Improve vespa interface (#4546 ) ![Screenshot 2023-05-11 at 7 50 31 PM](https://github.com/hwchase17/langchain/assets/130488702/bc8ab4bb-8006-44fc-ba07-df54e84ee2c1)	2023-05-12 10:11:26 -07:00
Neil Ruaro	3a2855945b	added documentation on retrieving a PG vectorstore (#4578 ) This PR adds in documentation on querying an existing vectorstore in PG Fixes 3191 (issue)	2023-05-12 13:04:06 -04:00
Harrison Chase	5ad151ed44	Add constitutional principles from paper (#4554 ) Add constitutional principles from https://arxiv.org/pdf/2212.08073.pdf --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-12 07:34:03 -07:00
Sai Vinay G	cf4c1394a2	feat: Added class to support huggingface text generation inference server (#4447 ) [Text Generation Inference](https://github.com/huggingface/text-generation-inference) is a Rust, Python and gRPC server for generating text using LLMs. This pull request add support for self hosted Text Generation Inference servers. feature: #4280 --------- Co-authored-by: Your Name <you@example.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-12 07:32:37 -07:00
Leonid Ganeline	e17d0319d5	Add `arxiv` retriever (#4538 )	2023-05-11 22:48:38 -07:00
SimFG	7bcf238a1a	Optimize the initialization method of GPTCache (#4522 ) Optimize the initialization method of GPTCache, so that users can use GPTCache more quickly.	2023-05-11 16:15:23 -07:00
kYLe	446b60d803	Fix a typo in langchain/docs/modules/models/llms/integrations/anyscale.ipynb (#4526 )	2023-05-11 09:03:04 -07:00
Akshaya Annavajhala	b21d7c138c	Callback Handler for MLflow (#4150 ) Rebased Mahmedk's PR with the callback refactor and added the example requested by hwchase plus a couple minor fixes --------- Co-authored-by: Ahmed K <77802633+mahmedk@users.noreply.github.com> Co-authored-by: Ahmed K <mda3k27@gmail.com> Co-authored-by: Davis Chase <130488702+dev2049@users.noreply.github.com> Co-authored-by: Corey Zumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 01:10:40 -07:00
kYLe	0d51a1f12b	Add LLMs support for Anyscale Service (#4350 ) Add Anyscale service integration under LLM Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 00:39:59 -07:00
Kristóf Dombi	99b2400048	[Docs]: Add Kinsta to the list of deployment providers (#4445 ) We're fans of the LangChain framework thus we wanted to make sure we provide an easy way for our customers to be able to utilize this framework for their LLM-powered applications at our platform.	2023-05-11 00:29:48 -07:00
Zander Chase	d969f43ed8	Load HuggingFace Tool (#4475 ) # Add option to `load_huggingface_tool` Expose a method to load a huggingface Tool from the HF hub --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-11 00:07:36 -07:00
Harrison Chase	3ce29cb4a6	Harrison/new search (#4359 ) Co-authored-by: Jiaping(JP) Zhang <vincentzhangv@gmail.com>	2023-05-10 17:09:16 -07:00
Davis Chase	9ec60ad832	Add azure cognitive search retriever (#4467 ) All credit to @UmerHA, made a couple small changes --------- Co-authored-by: UmerHA <40663591+UmerHA@users.noreply.github.com>	2023-05-10 15:27:27 -07:00
Davis Chase	46b100ea63	Add DocArray vector stores (#4483 ) Thanks to @anna-charlotte and @jupyterjazz for the contribution! Made few small changes to get it across the finish line --------- Signed-off-by: anna-charlotte <charlotte.gerhaher@jina.ai> Signed-off-by: jupyterjazz <saba.sturua@jina.ai> Co-authored-by: anna-charlotte <charlotte.gerhaher@jina.ai> Co-authored-by: jupyterjazz <saba.sturua@jina.ai> Co-authored-by: Saba Sturua <45267439+jupyterjazz@users.noreply.github.com>	2023-05-10 15:22:16 -07:00
Davis Chase	04475bea7d	Mv plan and execute to experimental (#4459 )	2023-05-10 08:31:53 -07:00
Matt Robinson	3637d6da6e	feat: add loader for open office odt files (#4405 ) # ODF File Loader Adds a data loader for handling Open Office ODT files. Requires `unstructured>=0.6.3`. ### Testing The following should work using the `fake.odt` example doc from the [`unstructured` repo](https://github.com/Unstructured-IO/unstructured). ```python from langchain.document_loaders import UnstructuredODTLoader loader = UnstructuredODTLoader(file_path="fake.odt", mode="elements") loader.load() loader = UnstructuredODTLoader(file_path="fake.odt", mode="single") loader.load() ```	2023-05-10 01:37:17 -07:00
Harrison Chase	f0cfed636f	change nb name	2023-05-09 21:22:35 -07:00
Harrison Chase	6b8d144ccc	Harrison/plan and solve (#4422 )	2023-05-09 21:07:56 -07:00
mbchang	9fafe7b2b9	fix: remove unnecessary line of code (#4408 ) Removes unnecessary line of code in https://python.langchain.com/en/latest/use_cases/agent_simulations/two_agent_debate_tools.html	2023-05-09 10:35:09 -07:00
Leonid Ganeline	ce15ffae6a	added `Wikipedia` retriever (#4302 ) - added `Wikipedia` retriever. It is effectively a wrapper for `WikipediaAPIWrapper`. It wrapps load() into get_relevant_documents() - sorted `__all__` in the `retrievers/__init__` - added integration tests for the WikipediaRetriever - added an example (as Jupyter notebook) for the WikipediaRetriever	2023-05-09 10:08:39 -07:00
Prayson Wilfred Daniel	2b4ba203f7	query correction from when to what (#4383 ) # Minor Wording Documentation Change ```python agent_chain.run("When's my friend Eric's surname?") # Answer with 'Zhu' ``` is change to ```python agent_chain.run("What's my friend Eric's surname?") # Answer with 'Zhu' ``` I think when is a residual of the old query that was "When’s my friends Eric`s birthday?".	2023-05-09 07:42:47 -07:00
BioErrorLog	04f765b838	Fix grammar in Text Splitters docs (#4373 ) # Fix grammar in Text Splitters docs Just a small fix of grammar in the documentation: "That means there two different axes" -> "That means there are two different axes"	2023-05-08 22:38:40 -04:00
mbchang	f1401a6dff	new example: two agent debate with tools (#4024 )	2023-05-08 17:10:44 -07:00
Ankush Gola	b3ecce0545	fix json saving, update docs to reference anthropic chat model (#4364 ) Fixes # (issue) https://github.com/hwchase17/langchain/issues/4085	2023-05-08 15:30:52 -07:00
Simba Khadder	d84df25466	Add example on how to use Featureform with langchain (#4337 ) Added an example on how to use Featureform to connecting_to_a_feature_store.ipynb .	2023-05-08 10:32:17 -07:00
Zander Chase	8b284f9ad0	Pass parsed inputs through to tool _run (#4309 )	2023-05-08 09:13:05 -07:00
Harrison Chase	c8b0b6e6c1	add youtube tools (#4320 )	2023-05-08 08:29:30 -07:00
PawelFaron	04b74d0446	Adjusted GPT4All llm to streaming API and added support for GPT4All_J (#4131 ) Fix for these issues: https://github.com/hwchase17/langchain/issues/4126 https://github.com/hwchase17/langchain/issues/3839#issuecomment-1534258559 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-06 15:14:09 -07:00
Harrison Chase	64940e9d0f	docs for azure (#4238 )	2023-05-06 10:16:00 -07:00
Myeongseop Kim	747b5f87c2	Add HumanInputLLM (#4160 ) Related: #4028, I opened a new PR because (1) I was unable to unstage mistakenly committed files (I'm not familiar with git enough to resolve this issue), (2) I felt closing the original PR and opening a new PR would be more appropriate if I changed the class name. This PR creates HumanInputLLM(HumanLLM in #4028), a simple LLM wrapper class that returns user input as the response. I also added a simple Jupyter notebook regarding how and why to use this LLM wrapper. In the notebook, I went over how to use this LLM wrapper and showed example of testing `WikipediaQueryRun` using HumanInputLLM. I believe this LLM wrapper will be useful especially for debugging, educational or testing purpose.	2023-05-06 09:48:40 -07:00
Davis Chase	6cd51ef3d0	Simplify router chain constructor signatures (#4146 )	2023-05-06 09:38:17 -07:00
Leonid Ganeline	9544b30821	added `Wikipedia` document loader (#4141 ) - Added the `Wikipedia` document loader. It is based on the existing `unilities/WikipediaAPIWrapper` - Added a respective ut-s and example notebook - Sorted list of classes in __init__	2023-05-06 09:32:45 -07:00
Davis Chase	5ca13cc1f0	Dev2049/pypdfium2 (#4209 ) thanks @jerrytigerxu for the addition! --------- Co-authored-by: Jere Xu <jtxu2008@gmail.com> Co-authored-by: jerrytigerxu <jere.tiger.xu@gmailc.om>	2023-05-05 17:55:31 -07:00
Leonid Ganeline	59204a5033	docs: `document_loaders` improvements (#4200 ) - made notebooks consistent: titles, service/format descriptions. - corrected short names to full names, for example, `Word` -> `Microsoft Word` - added missed descriptions - renamed notebook files to make ToC correctly sorted	2023-05-05 17:44:54 -07:00
Aivin V. Solatorio	6567b73e1a	JSON loader (#4067 ) This implements a loader of text passages in JSON format. The `jq` syntax is used to define a schema for accessing the relevant contents from the JSON file. This requires dependency on the `jq` package: https://pypi.org/project/jq/. --------- Signed-off-by: Aivin V. Solatorio <avsolatorio@gmail.com>	2023-05-05 14:48:13 -07:00
PawelFaron	bb6d97c18c	Fixed the example code (#4117 ) Fixed the issue mentioned here: https://github.com/hwchase17/langchain/issues/3799#issuecomment-1534785861 Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-05 14:22:10 -07:00
Nicolas	a57259ec83	docs: Mendable Fixes and Improvements (#4184 ) Overall fixes and improvements.	2023-05-05 13:04:24 -07:00
Harrison Chase	26534457f5	simplify csv args (#4182 )	2023-05-05 09:22:08 -07:00
Davis Chase	d84bb02881	Add Chroma self query (#4149 ) Add internal query language -> chroma metadata filter translator	2023-05-05 08:43:08 -07:00
Vinoo Ganesh	905a2114d7	Fix: Typo in Docs (#4179 ) Fixing small typo in docs	2023-05-05 08:35:49 -07:00
Harrison Chase	a9c2450330	Harrison/toml loader (#4090 ) Co-authored-by: Mika Ayenson <Mikaayenson@users.noreply.github.com>	2023-05-03 23:14:39 -07:00
Harrison Chase	fba6921b50	Harrison/one drive loader (#4081 ) Co-authored-by: José Ferraz Neto <netoferraz@gmail.com>	2023-05-03 22:55:34 -07:00
AndreLCanada	bf726f9d8a	Update python_repl docs (#4012 ) In the example for creating a Python REPL tool under the Agent module, the ".run" was omitted in the example. I believe this is required when defining a Tool.	2023-05-03 22:45:32 -07:00
Mike Wang	67db495fcf	[agent] Add Spark Agent (#4020 ) - added support for spark through pyspark library. - added jupyter notebook as example.	2023-05-03 22:45:23 -07:00
Gengliang Wang	8af25867cb	Simplify HumanMessages in the quick start guide (#4026 ) In the section `Get Message Completions from a Chat Model` of the quick start guide, the HumanMessage doesn't need to include `Translate this sentence from English to French.` when there is a system message. Simplify HumanMessages in these examples can further demonstrate the power of LLM.	2023-05-03 22:45:03 -07:00
Harrison Chase	087a4bd2b8	improve agent documentation (#4062 )	2023-05-03 22:44:01 -07:00
rogerserper	b1446bea5f	google-serper: async + full json results + support for Google Images, Places and News (#4078 ) * implemented arun, results, and aresults. Reuses aiosession if available. * helper tools GoogleSerperRun and GoogleSerperResults * support for Google Images, Places and News (examples given) and filtering based on time (e.g. past hour) * updated docs	2023-05-03 22:35:48 -07:00
mbchang	cdea47491d	refactor: refactor dialogue examples (DialogueAgent, DialogueSimulator) (#4074 ) refactor dialogue examples to have same DialogueAgent and DialogueSimulator definitions	2023-05-03 22:32:26 -07:00
Davis Chase	7f8727bbcd	Router chains (#4019 ) Unpolished router examples to help flesh out abstractions and use cases ![Screenshot 2023-05-02 at 7 02 58 PM](https://user-images.githubusercontent.com/130488702/235820394-389e5584-db0b-415e-a260-2824b5555167.png) --------- Co-authored-by: Shreya Rajpal <shreya.rajpal@gmail.com>	2023-05-03 22:02:55 -07:00
Leonid Ganeline	6caba8e759	docs: added a link to the `Google Scholar` articles (#4007 ) Google Scholar outputs a nice list of scientific and research articles that use LangChain. I added a link to the Google Scholar page to the `gallery` doc page	2023-05-03 21:54:44 -07:00
Harrison Chase	5f30cc8713	Harrison/knn retriever (#4083 ) Co-authored-by: Yuichi Tateno (secon) <hotchpotch@users.noreply.github.com>	2023-05-03 21:21:58 -07:00
Harrison Chase	5a269d3175	Harrison/media wiki xml (#4072 ) Co-authored-by: Géraud de Drouas <gdedrouas@users.noreply.github.com>	2023-05-03 20:45:33 -07:00
Nikolas Garske	1608f5dcae	Remove pip stdout and fix typo (#4050 )	2023-05-03 18:06:39 -07:00
Ivo Stranic	3b556eae44	Update deeplake example (#4055 )	2023-05-03 18:03:51 -07:00
Steve Kim	9b830f437c	Deleted importing Document from document_loaders.base because Documen… (#4068 ) Hi, - Modification: https://python.langchain.com/en/latest/modules/indexes/document_loaders/examples/arxiv.html - Reason: In this example, the first line is unnecessary because the Document class does not exist in the base. - Resolves: Issue #4052 -------- P.S: This pull-request is my first time, so please let me know if I need to correct or write more explanation.	2023-05-03 17:54:30 -07:00
Akash Sharma	525db1b6cb	Fixed typo leading to broken link (#4034 )	2023-05-03 14:45:54 -07:00
Zander Chase	7e967aa4d5	Update Notebooks (#4051 )	2023-05-03 09:31:02 -07:00
mbchang	f291fd7eed	docs: remove stdout from pip install (for gymnasium) (#3993 )	2023-05-02 21:51:40 -07:00
Davis Chase	df3bc707fc	Dev2049/callback example fix (#4010 ) Closes #3997 --------- Co-authored-by: Akshaj Jain <akshaj.jain@gmail.com>	2023-05-02 16:20:16 -07:00
Zander Chase	aa38355999	Vwp/docs improved document loaders (#4006 ) Huge thanks to @leo-gan for improving the document loaders notebooks --------- Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com>	2023-05-02 15:24:53 -07:00
MichaelMDowling	36ee60c96c	Update \docs\modules\models\text_embedding\examples\openai.ipynb (#3976 ) Single edit to: models/text_embedding/examples/openai.ipynb - Line 88: changed from: "embeddings = OpenAIEmbeddings(model_name=\"ada\")" to "embeddings = OpenAIEmbeddings()" as model_name is no longer part of the OpenAIEmbeddings class.	2023-05-02 14:41:31 -07:00
Jinto Jose	013208cce6	Fix Documentation - Nomic - Atlas Jupyter Notebook (#3987 ) Correction to Numic-Atlas Jupyter Notebook Docs	2023-05-02 14:20:01 -07:00
Chop Tr	71a337dac6	Update output_fixing_parser.ipynb (#3978 )	2023-05-02 09:33:46 -07:00
mbchang	3993166b5e	docs: remove stdout from pip install (#3945 )	2023-05-01 22:05:22 -07:00
liviuasnash1	6396a4ad8d	Fix documentation typos (#3870 ) Co-authored-by: Liviu Asnash <liviua@maximallearning.com>	2023-05-01 20:58:38 -07:00
Samuel Dion-Girardeau	c5c33786a7	Fix bad spellings for 'convenience' (#3936 ) Found in the docs for chat prompt templates: https://python.langchain.com/en/latest/getting_started/getting_started.html#chat-prompt-templates and fixed similar issues in neighboring notebooks.	2023-05-01 20:57:06 -07:00
Harrison Chase	f04faf8496	Harrison/spreedly (#3937 ) Co-authored-by: Esmit Pérez <esmitperez@users.noreply.github.com>	2023-05-01 20:56:56 -07:00
Zander Chase	c4cb55a0c5	[Breaking] Migrate GPT4All to use PyGPT4All (#3934 ) Seems the pyllamacpp package is no longer the supported bindings from gpt4all. Tested that this works locally. Given that the older models weren't very performant, I think it's better to migrate now without trying to include a lot of try / except blocks --------- Co-authored-by: Nissan Pow <npow@users.noreply.github.com> Co-authored-by: Nissan Pow <pownissa@amazon.com>	2023-05-01 20:42:45 -07:00
leo-gan	f0a4bbb8e2	updated `YouTube` links (#3916 ) Added several links to fresh videos Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-05-01 20:39:59 -07:00
Matt Robinson	c51dec5101	feat: add Unstructured API loaders (#3906 ) ### Summary Adds `UnstructuredAPIFileLoaders` and `UnstructuredAPIFIleIOLoaders` that partition documents through the Unstructured API. Defaults to the URL for hosted Unstructured API, but can switch to a self hosted or locally running API using the `url` kwarg. Currently, the Unstructured API is open and does not require an API, but it will soon. A note was added about that to the Unstructured ecosystem page. ### Testing ```python from langchain.document_loaders import UnstructuredAPIFileIOLoader filename = "fake-email.eml" with open(filename, "rb") as f: loader = UnstructuredAPIFileIOLoader(file=f, file_filename=filename) docs = loader.load() docs[0] ``` ```python from langchain.document_loaders import UnstructuredAPIFileLoader filename = "fake-email.eml" loader = UnstructuredAPIFileLoader(file_path=filename, mode="elements") docs = loader.load() docs[0] ```	2023-05-01 20:37:35 -07:00
Zander Chase	c582f2e9e3	Add Structure Chat Agent (#3912 ) Create a new chat agent that is compatible with the Multi-input tools	2023-05-01 20:34:50 -07:00
Davis Chase	e7e29f9937	Dev2049/add modern treasury (#3924 ) Modified Modern Treasury and Strip slightly so credentials don't have to be passed in explicitly. Thanks @mattgmarcus for adding Modern Treasury! --------- Co-authored-by: Matt Marcus <matt.g.marcus@gmail.com>	2023-05-01 20:28:02 -07:00
mbchang	ffc87233a1	refactor GymnasiumAgent (#3927 ) refactor GymnasiumAgent (for single-agent environments) to be extensible to PettingZooAgent (multi-agent environments)	2023-05-01 20:25:03 -07:00
mbchang	81601d886c	new example: multi-agent simulations with environment (#3928 )	2023-05-01 20:24:15 -07:00
Harrison Chase	f7a828685d	Harrison/constitutional chain (#3931 ) Co-authored-by: Sam Ching <samuel@duolingo.com>	2023-05-01 20:23:16 -07:00
Venelin Valkov	bc7e4d5cd4	Add links to YouTube videos by Venelin Valkov (#3820 ) Hi, I've added links to my YouTube videos on LangChain. Thank you for making/maintaining LangChain! Venelin	2023-05-01 20:20:30 -07:00
Johan Stenberg (MSFT)	6bd367916c	Update adding_memory_chain_multiple_inputs.ipynb (#3895 ) Fix misleading docs in memory chain example (used the term "outputs" instead of "inputs")	2023-05-01 19:57:27 -07:00
Zander Chase	9b9b231e10	Update some Tools Docs (#3913 ) Haven't gotten to all of them, but this: - Updates some of the tools notebooks to actually instantiate a tool (many just show a 'utility' rather than a tool. More changes to come in separate PR) - Move the `Tool` and decorator definitions to `langchain/tools/base.py` (but still export from `langchain.agents`) - Add scene explain to the load_tools() function - Add unit tests for public apis for the langchain.tools and langchain.agents modules	2023-05-01 19:07:26 -07:00
engkheng	21335d43b2	Minor `LLMChain` docs correction (#3791 ) `LLMChain` run method can take multiple input variables.	2023-05-01 15:50:57 -07:00
Younis Shah	22a1896c30	[docs]: updates connecting_to_a_feature_store.ipynb (#3776 ) * fixes `FeastPromptTemplate.format` example to use `driver_id`	2023-05-01 15:45:59 -07:00
Harrison Chase	e28c6403aa	Harrison/cohere reranker (#3904 )	2023-05-01 15:40:16 -07:00
mbchang	3e1cb31f63	fix: add import for gymnasium (#3899 )	2023-05-01 10:37:25 -07:00
Nikolas Garske	c4d3d74148	Fix typos in arxiv.ipynb (#3887 ) Several minor typos in the doc for the arxiv document loaders were fixed.	2023-05-01 09:17:37 -07:00
Ankush Gola	e87f81b3ec	add more color to callbacks docs (#3856 )	2023-04-30 19:13:01 -07:00
Zander Chase	19912d755e	Vwp/arxiv (#3855 ) Co-authored-by: Mike Wang <62768671+skcoirz@users.noreply.github.com>	2023-04-30 18:59:22 -07:00
Zander Chase	e17858470c	Vwp/multi line input (#3854 ) Co-authored-by: Paolo Rechia <paolorechia@gmail.com>	2023-04-30 18:59:11 -07:00
Zander Chase	fbbdf161cd	Lambda Tool (#3842 ) Co-authored-by: Jason Holtkamp <holtkam2@gmail.com>	2023-04-30 15:15:09 -07:00
Ankush Gola	d3ec00b566	Callbacks Refactor [base] (#3256 ) Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Davis Chase <130488702+dev2049@users.noreply.github.com> Co-authored-by: Zander Chase <130414180+vowelparrot@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-30 11:14:09 -07:00
Zander Chase	18ec22fe56	Remove multi-input tool section (#3810 ) Moving to new notebook. Will re-intro w/ new agent	2023-04-29 15:29:08 -07:00
mbchang	adcad98bee	fix: fix filepath error in agent simulations docs (#3795 )	2023-04-29 11:21:27 -07:00
Harrison Chase	20aad0bed1	stripe docs	2023-04-29 08:16:37 -07:00
Sheldon	399065e858	update zilliz example (#3578 ) 1. Now the Zilliz example can't connect to Zilliz Cloud, fixed Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-28 22:10:13 -07:00
Harrison Chase	c494ca3ad2	Harrison/doc2txt (#3772 ) Co-authored-by: rishni ratnam <rishniratnam@gmail.com>	2023-04-28 21:54:16 -07:00
Harrison Chase	0c0f14407c	Harrison/tair (#3770 ) Co-authored-by: Seth Huang <848849+seth-hg@users.noreply.github.com>	2023-04-28 21:25:33 -07:00
Harrison Chase	b7ae9f715d	Langchain with reddit (#3661 ) (#3768 ) I have added a reddit document loader which fetches the text from the Posts of Subreddits or Reddit users, using the `praw` Python package. I have also added an example notebook reddit.ipynb in order to guide users to use this dataloader. This code was made in format similar to twiiter document loader. I have run code formating, linting and also checked the code myself for different scenarios. This is my first contribution to an open source project and I am really excited about this. If you want to suggest some improvements in my code, I will be happy to do it. :) Co-authored-by: Taaha Bajwa <taaha.s.bajwa@gmail.com>	2023-04-28 20:59:56 -07:00
Harrison Chase	be7a8e0824	Harrison/redis cache (#3766 ) Co-authored-by: Tyler Hutcherson <tyler.hutcherson@redis.com>	2023-04-28 20:47:18 -07:00
engkheng	f37a932b24	Improve chat prompt template docs (#3719 ) Add a few more explanations and examples.	2023-04-28 20:16:22 -07:00
Jon Saginaw	f8d69e4e52	Enhancement: Blockchain Document Loader with better Metadata support (#3710 ) This PR includes some minor alignment updates, including: - metadata object extended to support contractAddress, blockchainType, and tokenId - notebook doc better aligned to standard langchain format - startToken changed from int to str to support multiple hex value types on the Alchemy API The updated metadata will look like the below. It's possible for a single contractAddress to exist across multiple blockchains (e.g. Ethereum, Polygon, etc.) so it's important to include the blockchainType. ``` metadata = {"source": self.contract_address, "blockchain": self.blockchainType, "tokenId": tokenId} ```	2023-04-28 20:13:05 -07:00
Davis Chase	220a7076ac	Add Mathpix pdf loader (#3727 ) Inspo https://twitter.com/danielgross/status/1651695062307274754?s=46&t=1zHLap5WG4I_kQPPjfW9fA Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-28 20:11:22 -07:00
Harrison Chase	40f6e60e68	Harrison/stripe (#3762 ) Co-authored-by: Ismail Pelaseyed <homanp@gmail.com>	2023-04-28 20:03:21 -07:00
Harrison Chase	7a129ac043	Harrison/pypdf loader (#3764 ) Co-authored-by: Felipe Meres <felipe@felipemeres.com>	2023-04-28 19:56:21 -07:00
mbchang	4eefea0fe8	new example: single agent, simulated environment (openai gym) (#3758 ) For many applications of LLM agents, the environment is real (internet, database, REPL, etc). However, we can also define agents to interact in simulated environments like text-based games. This is an example of how to create a simple agent-environment interaction loop with [Gymnasium](https://github.com/Farama-Foundation/Gymnasium) (formerly [OpenAI Gym](https://github.com/openai/gym)).	2023-04-28 19:52:05 -07:00
0xDTE	6ce34bb4fe	Fixing broken document links (#3756 ) simple document url fixes. nothing fancy.	2023-04-28 19:51:23 -07:00
Harrison Chase	c55ba43093	Harrison/vespa (#3761 ) Co-authored-by: Lester Solbakken <lesters@users.noreply.github.com>	2023-04-28 19:48:43 -07:00
mbchang	ee20b3e0d0	bug fix: initialize the arxivAPIWrapper object (#3733 )	2023-04-28 19:35:01 -07:00
leo-gan	e510732ad2	docs: improved `vectorstore` notebooks (#3724 ) - Added links to the vectorstore providers - Added installation code (it is not clear that we have to go to the `LangChan Ecosystem` page to get installation instructions.)	2023-04-28 19:26:50 -07:00
BioErrorLog	ad4eae7ef0	Fix linting on the Quickstart Guide sample codes (#3701 ) When copying and pasting the sample code from the Quickstart Guide, lint errors ("missing whitespace around operator") occur."	2023-04-28 17:29:05 -07:00
Zander Chase	a46f1d830e	Synchronous Browser (#3745 ) Split out sync methods in playwright	2023-04-28 17:09:00 -07:00
Zander Chase	6c2b16e465	Add SceneXplain Tool (#3752 )	2023-04-28 17:01:54 -07:00
erwanlc	72c5c15f7f	Fix: Updated links for in depth explanation of chain types in the Question Answering notebooks (#3714 ) In the notebook question_answering.ipynb ([link](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/question_answering.ipynb)), and the notebook qa_with_sources.ipynb ([link](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/qa_with_sources.ipynb)), the first paragraph contains a dead link: > This notebook walks through how to use LangChain for question answering over a list of documents. It covers four different types of chains: stuff, map_reduce, refine, map_rerank. For a more in depth explanation of what these chain types are, see [here](`32793f94fd/docs/modules/chains/combine_docs.md`). The file combine_docs.md doesn't exist anymore and thus provide 404 - Page not found. I updated the links so it redirect to https://docs.langchain.com/docs/components/chains/index_related_chains as in the summarize notebook ([link](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/summarize.ipynb)) present in the same folder.	2023-04-28 15:06:46 -07:00
Alan Cha	e3b7a20454	Fix typo (#3728 )	2023-04-28 13:01:09 -07:00
Zander Chase	5042bd40d3	Add Shell Tool (#3335 ) Create an official bash shell tool to replace the dynamically generated one	2023-04-28 11:10:43 -07:00
Zander Chase	334c162f16	Add Other File Utilities (#3209 ) Add other File Utilities, include - List Directory - Search for file - Move - Copy - Remove file Bundle as toolkit Add a notebook that connects to the Chat Agent, which somewhat supports multi-arg input tools Update original read/write files to return the original dir paths and better handle unsupported file paths. Add unit tests	2023-04-28 10:53:37 -07:00
Zander Chase	491c27f861	PlayWright Web Browser Toolkit (#3262 ) Adds a PlayWright web browser toolkit with the following tools: - NavigateTool (navigate_browser) - navigate to a URL - NavigateBackTool (previous_page) - wait for an element to appear - ClickTool (click_element) - click on an element (specified by selector) - ExtractTextTool (extract_text) - use beautiful soup to extract text from the current web page - ExtractHyperlinksTool (extract_hyperlinks) - use beautiful soup to extract hyperlinks from the current web page - GetElementsTool (get_elements) - select elements by CSS selector - CurrentPageTool (current_page) - get the current page URL	2023-04-28 10:42:44 -07:00
mbchang	1da3ee1386	Multiagent authoritarian (#3686 ) This notebook showcases how to implement a multi-agent simulation where a privileged agent decides who to speak. This follows the polar opposite selection scheme as [multi-agent decentralized speaker selection](https://python.langchain.com/en/latest/use_cases/agent_simulations/multiagent_bidding.html). We show an example of this approach in the context of a fictitious simulation of a news network. This example will showcase how we can implement agents that - think before speaking - terminate the conversation	2023-04-27 23:33:29 -07:00
Hasan Patel	03c05b15f6	Fixed some typos on deployment.md (#3652 ) Fixed typos and added better formatting for easier readability	2023-04-27 13:01:24 -07:00
Davis Chase	3b609642ae	Self-query with generic query constructor (#3607 ) Alternate implementation of #3452 that relies on a generic query constructor chain and language and then has vector store-specific translation layer. Still refactoring and updating examples but general structure is there and seems to work s well as #3452 on exampels --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-27 08:36:00 -07:00
plutopulp	6d6fd1b9e1	Add PipelineAI LLM integration (#3644 ) Add PipelineAI LLM integration	2023-04-27 08:22:26 -07:00
Harrison Chase	a35bbbfa9e	Harrison/lancedb (#3634 ) Co-authored-by: Minh Le <minhle@canva.com>	2023-04-27 08:14:36 -07:00
Ehsan M. Kermani	4a246e2fd6	Allow clearing cache and fix gptcache (#3493 ) This PR * Adds `clear` method for `BaseCache` and implements it for various caches * Adds the default `init_func=None` and fixes gptcache integtest * Since right now integtest is not running in CI, I've verified the changes by running `docs/modules/models/llms/examples/llm_caching.ipynb` (until proper e2e integtest is done in CI)	2023-04-26 22:03:50 -07:00
Shukri	fac4f36a87	Update models used for embeddings in the weaviate example (#3594 ) Use text-embedding-ada-002 because it [outperforms all other models](https://openai.com/blog/new-and-improved-embedding-model).	2023-04-26 21:48:08 -07:00
brian-tecton-ai	615812581e	Add Tecton example to the "Connecting to a Feature Store" example notebook (#3626 ) This PR adds a similar example to the Feast example, using the [Tecton Feature Platform](https://www.tecton.ai/) and features from the [Tecton Fundamentals Tutorial](https://docs.tecton.ai/docs/tutorials/tecton-fundamentals).	2023-04-26 21:38:50 -07:00
mbchang	3b7d27d39e	new example: multiagent dialogue with decentralized speaker selection (#3629 ) This notebook showcases how to implement a multi-agent simulation without a fixed schedule for who speaks when. Instead the agents decide for themselves who speaks. We can implement this by having each agent bid to speak. Whichever agent's bid is the highest gets to speak. We will show how to do this in the example below that showcases a fictitious presidential debate.	2023-04-26 21:37:36 -07:00
leo-gan	36c59e0c25	`Arxiv` document loader (#3627 ) It makes sense to use `arxiv` as another source of the documents for downloading. - Added the `arxiv` document_loader, based on the `utilities/arxiv.py:ArxivAPIWrapper` - added tests - added an example notebook - sorted `__all__` in `__init__.py` (otherwise it is hard to find a class in the very long list)	2023-04-26 21:04:56 -07:00
Zander Chase	443a893ffd	Align names of search tools (#3620 ) Tools for Bing, DDG and Google weren't consistent even though the underlying implementations were. All three services now have the same tools and implementations to easily switch and experiment when building chains.	2023-04-26 16:21:34 -07:00
James O'Dwyer	860fa59cd3	add metal to ecosystem (#3613 )	2023-04-26 15:57:48 -07:00
Zander Chase	ee670c448e	Persistent Bash Shell (#3580 ) Clean up linting and make more idiomatic by using an output parser --------- Co-authored-by: FergusFettes <fergusfettes@gmail.com>	2023-04-26 15:20:28 -07:00
Kátia Nakamura	e1a4fc55e6	Add docs for Fly.io deployment (#3584 ) A minimal example of how to deploy LangChain to Fly.io using Flask.	2023-04-26 14:41:08 -07:00
Chirag Bhatia	08478deec5	Fixed typo for HuggingFaceHub (#3612 ) The current text has a typo. This PR contains the corrected spelling for HuggingFaceHub	2023-04-26 14:33:31 -07:00
Charlie Holtz	246710def9	Fix Replicate llm response to handle iterator / multiple outputs (#3614 ) One of our users noticed a bug when calling streaming models. This is because those models return an iterator. So, I've updated the Replicate `_call` code to join together the output. The other advantage of this fix is that if you requested multiple outputs you would get them all – previously I was just returning output[0]. I also adjusted the demo docs to use dolly, because we're featuring that model right now and it's always hot, so people won't have to wait for the model to boot up. The error that this fixes: ``` > llm = Replicate(model=“replicate/flan-t5-xl:eec2f71c986dfa3b7a5d842d22e1130550f015720966bec48beaae059b19ef4c”) > llm(“hello”) > Traceback (most recent call last): File "/Users/charlieholtz/workspace/dev/python/main.py", line 15, in <module> print(llm(prompt)) File "/opt/homebrew/lib/python3.10/site-packages/langchain/llms/base.py", line 246, in __call__ return self.generate([prompt], stop=stop).generations[0][0].text File "/opt/homebrew/lib/python3.10/site-packages/langchain/llms/base.py", line 140, in generate raise e File "/opt/homebrew/lib/python3.10/site-packages/langchain/llms/base.py", line 137, in generate output = self._generate(prompts, stop=stop) File "/opt/homebrew/lib/python3.10/site-packages/langchain/llms/base.py", line 324, in _generate text = self._call(prompt, stop=stop) File "/opt/homebrew/lib/python3.10/site-packages/langchain/llms/replicate.py", line 108, in _call return outputs[0] TypeError: 'generator' object is not subscriptable ```	2023-04-26 14:26:33 -07:00
Chirag Bhatia	f174aa7712	Fix broken Cerebrium link in documentation (#3554 ) The current hyperlink has a typo. This PR contains the corrected hyperlink to Cerebrium docs	2023-04-26 08:11:58 -07:00
Harrison Chase	d880775e5d	Harrison/plugnplai (#3573 ) Co-authored-by: Eduardo Reis <edu.pontes@gmail.com>	2023-04-26 08:09:34 -07:00
Zander Chase	d6d697a41b	Sentence Transformers Aliasing (#3541 ) The sentence transformers was a dup of the HF one. This is a breaking change (model_name vs. model) for anyone using `SentenceTransformerEmbeddings(model="some/nondefault/model")`, but since it was landed only this week it seems better to do this now rather than doing a wrapper.	2023-04-25 23:29:20 -07:00
Eric Peter	603ea75bcd	Fix docs error for google drive loader (#3574 )	2023-04-25 22:52:59 -07:00
CG80499	cfd34e268e	Add ReAct eval chain (#3161 ) - Adds GPT-4 eval chain for arbitrary agents using any set of tools - Adds notebook --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-25 21:22:25 -07:00
mbchang	4bc209c6f7	example: multi player dnd (#3560 ) This notebook shows how the DialogueAgent and DialogueSimulator class make it easy to extend the [Two-Player Dungeons & Dragons example](https://python.langchain.com/en/latest/use_cases/agent_simulations/two_player_dnd.html) to multiple players. The main difference between simulating two players and multiple players is in revising the schedule for when each agent speaks To this end, we augment DialogueSimulator to take in a custom function that determines the schedule of which agent speaks. In the example below, each character speaks in round-robin fashion, with the storyteller interleaved between each player.	2023-04-25 21:20:39 -07:00
Harrison Chase	f4829025fe	add feast nb (#3565 )	2023-04-25 17:46:06 -07:00
Filip Michalsky	49593a3e41	Notebook example: Context-Aware AI Sales Agent (#3547 ) I would like to contribute with a jupyter notebook example implementation of an AI Sales Agent using `langchain`. The bot understands the conversation stage (you can define your own stages fitting your needs) using two chains: 1. StageAnalyzerChain - takes context and LLM decides what part of sales conversation is one in 2. SalesConversationChain - generate next message Schema: https://images-genai.s3.us-east-1.amazonaws.com/architecture2.png my original repo: https://github.com/filip-michalsky/SalesGPT This example creates a sales person named Ted Lasso who is trying to sell you mattresses. Happy to update based on your feedback. Thanks, Filip https://twitter.com/FilipMichalsky	2023-04-25 16:14:33 -07:00
Harrison Chase	52d95ec47d	anthropic docs: deprecated LLM, add chat model (#3549 )	2023-04-25 16:11:14 -07:00
mbchang	628e93a9a0	docs: simplification of two agent d&d simulation (#3550 ) Simplifies the [Two Agent D&D](https://python.langchain.com/en/latest/use_cases/agent_simulations/two_player_dnd.html) example with a cleaner, simpler interface that is extensible for multiple agents. `DialogueAgent`: - `send()`: applies the chatmodel to the message history and returns the message string - `receive(name, message)`: adds the `message` spoken by `name` to message history The `DialogueSimulator` class takes a list of agents. At each step, it performs the following: 1. Select the next speaker 2. Calls the next speaker to send a message 3. Broadcasts the message to all other agents 4. Update the step counter. The selection of the next speaker can be implemented as any function, but in this case we simply loop through the agents.	2023-04-25 16:10:32 -07:00
apurvsibal	af7906f100	Update Alchemy Key URL (#3559 ) Update Alchemy Key URL in Blockchain Document Loader. I want to say thank you for the incredible work the LangChain library creators have done. I am amazed at how seamlessly the Loader integrates with Ethereum Mainnet, Ethereum Testnet, Polygon Mainnet, and Polygon Testnet, and I am excited to see how this technology can be extended in the future. @hwchase17 - Please let me know if I can improve or if I have missed any community guidelines in making the edit? Thank you again for your hard work and dedication to the open source community.	2023-04-25 16:08:42 -07:00
Tiago De Gaspari	4d53cefbe9	Fix agents' notebooks outputs (#3517 ) Fix agents' notebooks to make the answer reflect what is being asked by the user.	2023-04-25 16:06:47 -07:00
engkheng	5680fb6894	Fix typo in Prompts Templates Getting Started page (#3514 ) `from_templates` -> `from_template`	2023-04-25 16:05:13 -07:00
Zander Chase	b49ee372f1	Change Chain Docs (#3537 ) Co-authored-by: engkheng <60956360+outday29@users.noreply.github.com>	2023-04-25 10:51:09 -07:00
Ikko Eltociear Ashimine	cf71b5d396	fix typo in comet_tracking.ipynb (#3505 ) intializing -> initializing	2023-04-25 10:50:58 -07:00
mbchang	a08e9a3109	Docs: fix naming typo (#3532 )	2023-04-25 09:58:25 -07:00
mbchang	831ca61481	docs: two_player_dnd docs (#3528 )	2023-04-25 08:24:53 -07:00
leo-gan	6b28cbe058	improved arxiv (#3495 ) Improved `arxiv/tool.py` by adding more specific information to the `description`. It would help with selecting `arxiv` tool between other tools. Improved `arxiv.ipynb` with more useful descriptions.	2023-04-25 08:09:17 -07:00
mbchang	29f321046e	doc: add two player D&D game (#3476 ) In this notebook, we show how we can use concepts from [CAMEL](https://www.camel-ai.org/) to simulate a role-playing game with a protagonist and a dungeon master. To simulate this game, we create a `TwoAgentSimulator` class that coordinates the dialogue between the two agents.	2023-04-25 08:07:18 -07:00
Harrison Chase	0fc0aa62f2	Harrison/blockchain docloader (#3491 ) Co-authored-by: Jon Saginaw <saginawj@users.noreply.github.com>	2023-04-25 08:07:06 -07:00
Harrison Chase	bee59b4689	Updated missing refactor in docs "return_map_steps" (#2956 ) (#3469 ) Minor rename in the documentation that was overlooked when refactoring. --------- Co-authored-by: Ehmad Zubair <ehmad@cogentlabs.co>	2023-04-24 22:28:47 -07:00
Harrison Chase	707741de58	Harrison/prediction guard (#3490 ) Co-authored-by: Daniel Whitenack <whitenack.daniel@gmail.com>	2023-04-24 22:27:22 -07:00
Maxwell Mullin	696f840426	GuessedAtParserWarning from RTD document loader documentation example (#3397 ) Addresses #3396 by adding `features='html.parser'` in example	2023-04-24 21:54:39 -07:00
engkheng	06f6c49e61	Improve `llm_chain.ipynb` and `getting_started.ipynb` for chains docs (#3380 ) My attempt at improving the `Chain`'s `Getting Started` docs and `LLMChain` docs. Might need some proof-reading as English is not my first language. In LLM examples, I replaced the example use case when a simpler one (shorter LLM output) to reduce cognitive load.	2023-04-24 21:49:55 -07:00
tkarper	6b49be9951	Add Databutton to list of Deployment options (#3364 )	2023-04-24 21:45:38 -07:00
jrhe	980cc41709	Adds progress bar using tqdm to directory_loader (#3349 ) Approach copied from `WebBaseLoader`. Assumes the user doesn't have `tqdm` installed.	2023-04-24 21:42:42 -07:00
engkheng	7c2c73af5f	Update `Getting Started` page of `Prompt Templates` (#3298 ) Updated `Getting Started` page of `Prompt Templates` to showcase more features provided by the class. Might need some proof reading because apparently English is not my first language.	2023-04-24 21:10:22 -07:00
Zander Chase	416f3bdf11	Vwp/alpaca streaming (#3468 ) Co-authored-by: Luke Stanley <306671+lukestanley@users.noreply.github.com>	2023-04-24 16:27:51 -07:00
Harrison Chase	675d86aa11	show how to use memory in convo chain (#3463 )	2023-04-24 13:29:51 -07:00
leo-gan	d5086d4760	added integration links to the ecosystem.rst (#3453 ) Now it is hard to search for the integration points between data_loaders, retrievers, tools, etc. I've placed links to all groups of providers and integrations on the `ecosystem` page. So, it is easy to navigate between all integrations from a single location.	2023-04-24 12:17:44 -07:00
Harrison Chase	bdb5f2f9fb	update notebook	2023-04-24 11:30:06 -07:00
mbchang	82845e3821	add meta-prompt to autonomous agents use cases (#3254 ) An implementation of [meta-prompt](https://noahgoodman.substack.com/p/meta-prompt-a-simple-self-improving), where the agent modifies its own instructions across episodes with a user. ![figure](https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F468217b9-96d9-47c0-a08b-dbf6b21b9f49_492x384.png)	2023-04-24 10:48:38 -07:00
Eduard van Valkenburg	46c9636012	small constructor change and updated notebook (#3426 ) small change in the pydantic definitions, same api. updated notebook with right constructure and added few shot example	2023-04-24 10:42:38 -07:00
Davit Buniatyan	2c0023393b	Deep Lake mini upgrades (#3375 ) Improvements * set default num_workers for ingestion to 0 * upgraded notebooks for avoiding dataset creation ambiguity * added `force_delete_dataset_by_path` * bumped deeplake to 3.3.0 * creds arg passing to deeplake object that would allow custom S3 Notes * please double check if poetry is not messed up (thanks!) Asks * Would be great to create a shared slack channel for quick questions --------- Co-authored-by: Davit Buniatyan <d@activeloop.ai>	2023-04-23 21:23:54 -07:00
Haste171	93d53e417a	Update unstructured_file.ipynb (#3377 ) Fix typo in docs	2023-04-23 21:22:38 -07:00
Zander Chase	738ee56b86	Move Generative Agent definition to Experimental (#3245 ) Extending @BeautyyuYanli 's #3220 to move from the notebook --------- Co-authored-by: BeautyyuYanli <beautyyuyanli@gmail.com>	2023-04-23 18:32:37 -07:00
Zander Chase	20f530e9c5	Add Sentence Transformers Embeddings (#3409 ) Add embeddings based on the sentence transformers library. Add a notebook and integration tests. Co-authored-by: khimaros <me@khimaros.com>	2023-04-23 18:25:20 -07:00
Zander Chase	73bc70b4fa	Update marathon notebook (#3408 ) Fixes #3404	2023-04-23 18:14:11 -07:00
Harrison Chase	e5ffbee5eb	Harrison/hf document loader (#3394 ) Co-authored-by: Azam Iftikhar <azamiftikhar1000@gmail.com>	2023-04-23 10:17:43 -07:00
Hadi Curtay	acfd11c8e4	Updated incorrect link to Weaviate notebook (#3362 ) The detailed walkthrough of the Weaviate wrapper was pointing to the getting-started notebook. Fixed it to point to the Weaviable notebook in the examples folder.	2023-04-22 20:47:41 -07:00
Ismail Pelaseyed	b21fe0a18f	Add example on deploying LangChain to `Cloud Run` (#3366 ) ## Summary Adds a link to a minimal example of running LangChain on Google Cloud Run.	2023-04-22 20:09:00 -07:00
Harrison Chase	a6664be79c	Harrison/myscale (#3352 ) Co-authored-by: Fangrui Liu <fangruil@moqi.ai> Co-authored-by: 刘方瑞 <fangrui.liu@outlook.com> Co-authored-by: Fangrui.Liu <fangrui.liu@ubc.ca>	2023-04-22 09:17:38 -07:00
Honkware	a5ad1c270f	Add ChatGPT Data Loader (#3336 ) This pull request adds a ChatGPT document loader to the document loaders module in `langchain/document_loaders/chatgpt.py`. Additionally, it includes an example Jupyter notebook in `docs/modules/indexes/document_loaders/examples/chatgpt_loader.ipynb` which uses fake sample data based on the original structure of the `conversations.json` file. The following files were added/modified: - `langchain/document_loaders/__init__.py` - `langchain/document_loaders/chatgpt.py` - `docs/modules/indexes/document_loaders/examples/chatgpt_loader.ipynb` - `docs/modules/indexes/document_loaders/examples/example_data/fake_conversations.json` This pull request was made in response to the recent release of ChatGPT data exports by email: https://help.openai.com/en/articles/7260999-how-do-i-export-my-chatgpt-history	2023-04-22 09:06:24 -07:00
Zander Chase	61d40ba042	Fix Sagemaker Batch Endpoints (#3249 ) Add different typing for @evandiewald 's heplful PR --------- Co-authored-by: Evan Diewald <evandiewald@gmail.com>	2023-04-22 08:49:51 -07:00
Harrison Chase	8191c6b81a	Harrison/voice assistant (#3347 ) Co-authored-by: Jaden <jaden.lorenc@gmail.com>	2023-04-22 08:25:50 -07:00
Richy Wang	88a8f59aa7	Add a full PostgresSQL syntax database 'AnalyticDB' as vector store. (#3135 ) Hi there！ I'm excited to open this PR to add support for using a fully Postgres syntax compatible database 'AnalyticDB' as a vector. As AnalyticDB has been proved can be used with AutoGPT, ChatGPT-Retrieve-Plugin, and LLama-Index, I think it is also good for you. AnalyticDB is a distributed Alibaba Cloud-Native vector database. It works better when data comes to large scale. The PR includes: - [x] A new memory: AnalyticDBVector - [x] A suite of integration tests verifies the AnalyticDB integration I have read your [contributing guidelines](`72b7d76d79/.github/CONTRIBUTING.md`). And I have passed the tests below - [x] make format - [x] make lint - [x] make coverage - [x] make test	2023-04-22 08:25:41 -07:00
Harrison Chase	cc6fe18152	Harrison/power bi (#3205 ) Co-authored-by: Eduard van Valkenburg <eavanvalkenburg@users.noreply.github.com>	2023-04-22 08:24:48 -07:00
Daniel Chalef	61e09229c8	args_schema type hint on subclassing (#3323 ) per https://github.com/hwchase17/langchain/issues/3297 Co-authored-by: Daniel Chalef <daniel.chalef@private.org>	2023-04-21 15:51:13 -07:00
Davis Chase	e933be9605	Update docs api references (#3315 )	2023-04-21 12:21:33 -07:00
Paul Garner	aa9d5707e0	Add PythonLoader which auto-detects encoding of Python files (#3311 ) This PR contributes a `PythonLoader`, which inherits from `TextLoader` but detects and sets the encoding automatically.	2023-04-21 10:47:57 -07:00
Daniel Chalef	1ecbeec24e	Fix example match_documents fn table name, grammar (#3294 ) ref https://github.com/hwchase17/langchain/pull/3100#issuecomment-1517086472 Co-authored-by: Daniel Chalef <daniel.chalef@private.org>	2023-04-21 10:21:23 -07:00
leo-gan	3bc703b0d6	added links to the important YouTube videos (#3244 ) Added links to the important YouTube videos	2023-04-21 01:31:42 -07:00
Harrison Chase	87544d2378	gradio tools (#3255 )	2023-04-20 22:09:15 -07:00
Davis Chase	46542dc774	Contextual compression retriever (#2915 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-20 17:01:14 -07:00
Harrison Chase	5ef2d1e2a1	add to docs	2023-04-20 15:43:57 -07:00
Harrison Chase	4aedbeaffb	Merge branch 'master' of github.com:hwchase17/langchain	2023-04-20 15:43:04 -07:00
Harrison Chase	2dbb5261b5	wikibase agent	2023-04-20 15:37:56 -07:00
Albert Castellana	0684aa081a	Ecosystem/Yeager.ai (#3239 ) Added yeagerai.md to ecosystem	2023-04-20 15:20:21 -07:00
Harrison Chase	8f22949dc4	update nnotebook title	2023-04-20 11:53:23 -07:00
leo-gan	130e4b9fcb	fixed a link to the youtube page (#3232 ) A link to the `YouTube` page was missing on the `index` page.	2023-04-20 10:47:16 -07:00
Harrison Chase	b7f2061736	Harrison/google places (#3207 ) Co-authored-by: Cao Hoang <65607230+cnhhoang850@users.noreply.github.com> Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-20 07:57:07 -07:00
Harrison Chase	d2520a5f1e	Harrison/ddg (#3206 ) Co-authored-by: itai <itai.marks@gmail.com> Co-authored-by: Itai Marks <itaim@users.noreply.github.com> Co-authored-by: Tianyi Pan <60060750+tipani86@users.noreply.github.com> Co-authored-by: Tianyi Pan <tianyi.pan@clobotics.com> Co-authored-by: Adilzhan Ismailov <13088690+aismlv@users.noreply.github.com> Co-authored-by: Justin Flick <Justinjayflick@gmail.com> Co-authored-by: Justin Flick <jflick@homesite.com>	2023-04-19 21:32:26 -07:00
Harrison Chase	36c10f8a52	nits (#3203 )	2023-04-19 21:14:46 -07:00
Daniel Chalef	27cdf8d675	supabase vectorstore - first cut (#3100 ) First cut of a supabase vectorstore loosely patterned on the langchainjs equivalent. Doesn't support async operations which is a limitation of the supabase python client. --------- Co-authored-by: Daniel Chalef <daniel.chalef@private.org>	2023-04-19 21:06:44 -07:00
Harrison Chase	96809b5794	Harrison/discord loader (#3200 ) Co-authored-by: Rajtilak Bhattacharjee <rajtilak.blog@gmail.com>	2023-04-19 21:04:12 -07:00
Zander Chase	c757c3cde4	Add HuggingFace Examples (#3187 ) Add a Pipeline example and add other models in th ehub notebook To close issue [#3077](https://github.com/hwchase17/langchain/issues/3099)	2023-04-19 17:08:10 -07:00
Donald "Max" Ziff	6adf2d1c39	first draft (#2690 ) There is a long way to go on this! --------- Co-authored-by: Max Ziff <max.ziff@concur.com>	2023-04-19 17:06:55 -07:00
Harrison Chase	68cd37175e	Harrison/arxiv tool (#3186 ) Co-authored-by: leo-gan <leo.gan.57@gmail.com>	2023-04-19 16:53:34 -07:00
Pranabendra Prasad Chandra	7b1f0656b8	Fix typo in ElasticSearch sample notebook (#3171 ) Added missing parenthesis in example notebook [elasticsearch.ipynb](https://github.com/hwchase17/langchain/blob/master/docs/modules/indexes/vectorstores/examples/elasticsearch.ipynb)	2023-04-19 16:06:31 -07:00
Zander Chase	74342ab209	Update the marathon notebook (#3183 ) There were some steps that didn't make sense. Update now. This time it produced a nice markdown formatted table too	2023-04-19 16:03:21 -07:00
leo-gan	a78f55b851	Additional resources - `YouTube` (#3180 ) Added links to the YouTube tutorials and videos in the `youtube.md`. Added link to the ^ in `index.rst`.	2023-04-19 15:16:29 -07:00
det-sys	26c8cd1ea2	Update gallery.rst (#3176 ) Add https://anysummary.app to the gallery	2023-04-19 15:06:59 -07:00
Happydog	5e66d05928	Fix: typo in custom_mrkl_agents.ipynb document (#3159 ) I have noticed a typo error in the `custom_mrkl_agents.ipynb` document while trying the example from the documentation page. As a result, I have opened a pull request (PR) to address this minor issue, even though it may seem insignificant 😂.	2023-04-19 14:57:33 -07:00
Harrison Chase	99b1983461	add example	2023-04-19 14:35:24 -07:00
Zander Chase	89c63cf8a6	Add Marathon Notebook (#3163 ) Add an example using autogpt to get the boston marathon winning times Add a web browser + summarization tool in the notebook	2023-04-19 11:23:08 -07:00
Quentin Pleplé	126d7f11dd	Fix notebook example (#3142 ) The following calls were throwing an exception: `575b717d10/docs/use_cases/evaluation/agent_vectordb_sota_pg.ipynb (L192)` `575b717d10/docs/use_cases/evaluation/agent_vectordb_sota_pg.ipynb (L239)` Exception: ``` --------------------------------------------------------------------------- ValidationError Traceback (most recent call last) Cell In[14], line 1 ----> 1 chain_sota = RetrievalQA.from_chain_type(llm=OpenAI(temperature=0), chain_type="stuff", retriever=vectorstore_sota, input_key="question") File ~/github/langchain/venv/lib/python3.9/site-packages/langchain/chains/retrieval_qa/base.py:89, in BaseRetrievalQA.from_chain_type(cls, llm, chain_type, chain_type_kwargs, kwargs) 85 _chain_type_kwargs = chain_type_kwargs or {} 86 combine_documents_chain = load_qa_chain( 87 llm, chain_type=chain_type, _chain_type_kwargs 88 ) ---> 89 return cls(combine_documents_chain=combine_documents_chain, *kwargs) File ~/github/langchain/venv/lib/python3.9/site-packages/pydantic/main.py:341, in pydantic.main.BaseModel.__init__() ValidationError: 1 validation error for RetrievalQA retriever instance of BaseRetriever expected (type=type_error.arbitrary_type; expected_arbitrary_type=BaseRetriever) ``` The vectorstores had to be converted to retrievers: `vectorstore_sota.as_retriever()` and `vectorstore_pg.as_retriever()`. The PR also: - adds the file `paul_graham_essay.txt` referenced by this notebook - adds to gitignore .pkl and *.bin files that are generated by this notebook Interestingly enough, the performance of the prediction greatly increased (new version of langchain or ne version of OpenAI models since the last run of the notebook): from 19/33 correct to 28/33 correct!	2023-04-19 08:55:06 -07:00
Jakub Kukul	599e17cea8	Working example for Anthropic (#3151 ) would be great if the provided example worked out of the box 😄	2023-04-19 08:52:33 -07:00
Harrison Chase	b7dc04c086	fix links	2023-04-18 22:44:53 -07:00
Zander Chase	8a050ba4bf	Notebook Nit (#3125 ) The required arg is `question` not `query`	2023-04-18 22:43:52 -07:00
Harrison Chase	364257d967	agent docs fixes (#3128 )	2023-04-18 21:54:30 -07:00
Zander Chase	f329196cf4	Agents 4 18 (#3122 ) Creating an experimental agents folder, containing BabyAGI, AutoGPT, and later, other examples --------- Co-authored-by: Rahul Behal <rahulbehal01@hotmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-18 21:41:03 -07:00
Zander Chase	90ef705ced	Update Tool Input (#3103 ) - Remove dynamic model creation in the `args()` property. _Only infer for the decorator (and add an argument to NOT infer if someone wishes to only pass as a string)_ - Update the validation example to make it less likely to be misinterpreted as a "safe" way to run a repl There is one example of "Multi-argument tools" in the custom_tools.ipynb from yesterday, but we could add more. The output parsing for the base MRKL agent hasn't been adapted to handle structured args at this point in time --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-18 18:18:33 -07:00
Harrison Chase	aad0a498ac	Harrison/output error (#3094 ) Co-authored-by: yummydum <sumita@nowcast.co.jp>	2023-04-18 08:59:56 -07:00
Harrison Chase	1c1b77bbfe	Harrison/discord (#3092 ) Co-authored-by: Rajtilak Bhattacharjee <rajtilak.blog@gmail.com>	2023-04-18 08:19:23 -07:00
engkheng	fe68051d34	Fix typo in `docs/reference.rst` (#3081 ) fix typo	2023-04-18 07:31:00 -07:00
TysBradford	7dae39b57d	slightly clearer docs (#3088 ) Took me a second to realise the examples required to manually print the output of the conversation predict. This might make it clearer for others	2023-04-18 07:28:29 -07:00
James O'Dwyer	0257829776	Bump Metal to use index_id (#3089 ) ## Use `index_id` over `app_id` We made a major update to index + retrieve based on Metal Indexes (instead of apps). With this change, we accept an index instead of an app in each of our respective core apis. [More details here](https://docs.getmetal.io/api-reference/core/indexing).	2023-04-18 07:28:13 -07:00
Hamza Kyamanywa	064a1db2b2	[Documentation] Show how to initiate pinecone from an existing index (#3070 ) ## What is this PR for: * This PR adds a commented line of code in the documentation that shows how someone can use the Pinecone client with an already existing Pinecone index * The documentation currently only shows how to create a pinecone index from langchain documents but not how to load one that already exists	2023-04-18 07:27:46 -07:00
Harrison Chase	894c272a56	tool validation logic	2023-04-17 21:59:32 -07:00
Harrison Chase	1920536d99	Harrison/obsidian (#3060 ) Co-authored-by: Ben Hofferber <hofferber.ben@gmail.com>	2023-04-17 21:57:32 -07:00
Zander Chase	93c0514105	Add Twitter Tweet Loader (#3050 ) Reformatted version of #3022 --------- Co-authored-by: LiaoKong <568250549@qq.com>	2023-04-17 21:44:54 -07:00
Harrison Chase	db968284f8	tools refactor (#2961 ) Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-17 21:35:29 -07:00
Sebastian	7a8c935b90	Edited for better readability (#3059 ) It looks like some dropdown functionality was intended, but it caused the markdown code to glitch which hurt readability.	2023-04-17 21:34:57 -07:00
Harrison Chase	b140d366e3	Harrison/jira (#3055 ) Co-authored-by: William Li <32046231+zywilliamli@users.noreply.github.com> Co-authored-by: William Li <twelvehertz@Williams-MacBook-Air.local>	2023-04-17 21:14:40 -07:00
leo-gan	c33883a40e	fixed the Cohere example title (#3053 ) - fixed the Cohere example title (bug in #3041, sorry for it) - fixed the runhouse.ipynb file name inconsistency	2023-04-17 21:02:52 -07:00
Harrison Chase	5107fac656	Harrison/rec gd (#3054 ) Co-authored-by: Benjamin Scholtz <BenSchZA@users.noreply.github.com>	2023-04-17 21:02:35 -07:00
Harrison Chase	eee2f23a79	Harrison/qa eg (#3052 ) Co-authored-by: Sukhpal Saini <bdcorps@users.noreply.github.com>	2023-04-17 20:56:42 -07:00
Harrison Chase	db7106cb79	Harrison/image caption loader (#3051 ) Co-authored-by: Sean Saito <saitosean@ymail.com>	2023-04-17 20:49:10 -07:00
leo-gan	5420a0e404	updated langchain/docs/modules/models/llms/integrations/ notebooks (#3041 ) - Updated `langchain/docs/modules/models/llms/integrations/` notebooks: added links to the original sites, the install information, etc. - Added the `nlpcloud` notebook. - Removed "Example" from Titles of some notebooks, so all notebook titles are consistent.	2023-04-17 20:25:32 -07:00
Azam Iftikhar	471ef84835	Examples fixed (#3042 ) ### https://github.com/hwchase17/langchain/issues/2997 Replaced `conversation.memory.store` to `conversation.memory.entity_store.store` As conversation.memory.store doesn't exist and re-ran the whole file.	2023-04-17 20:25:01 -07:00
Harrison Chase	afd3e70ae5	Harrison/confluent loader (#2994 ) Co-authored-by: Justin Flick <Justinjayflick@gmail.com>	2023-04-17 20:23:45 -07:00
vowelparrot	2356447323	Update Characters notebook (#3019 ) - Most important - fixes the relevance_fn name in the notebook to align with the docs - Updates comments for the summary: <img width="787" alt="image" src="https://user-images.githubusercontent.com/130414180/232520616-2a99e8c3-a821-40c2-a0d5-3f3ea196c9bb.png"> - The new conversation is a bit better, still unfortunate they try to schedule a followup. - Rm the max dialogue turns argument to the conversation function	2023-04-17 07:48:48 -07:00
Harrison Chase	f1d15b4a75	update nb	2023-04-16 22:09:31 -07:00
Harrison Chase	e54f1b69ca	add notebook	2023-04-16 21:54:15 -07:00
vowelparrot	99c0382209	Generative Characters (#2859 ) Add a time-weighted memory retriever and a notebook that approximates a Generative Agent from https://arxiv.org/pdf/2304.03442.pdf The "daily plan" components are removed for now since they are less useful without a virtual world, but the memory is an interesting component to build off. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-04-16 21:41:00 -07:00
Jan Backes	a9310a3e8b	Add Annoy as VectorStore (#2939 ) Adds Annoy (https://github.com/spotify/annoy) as vector Store. RESOLVES hwchase17/langchain#2842 discord ref: https://discord.com/channels/1038097195422978059/1051632794427723827/1096089994168377354 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-04-16 13:44:04 -07:00
Harrison Chase	e12e00df12	use output parsers in agents (#2987 )	2023-04-16 13:15:21 -07:00
Mauricio Scheffer	7302787a7b	Fix docs for parse_with_prompt (#2986 )	2023-04-16 12:57:04 -07:00
Azam Iftikhar	1e655d5ffd	Fixed Regular expression (#2933 ) ### https://github.com/hwchase17/langchain/issues/2898 Instead of `"Action" and "Action Input"` keywords, we are getting `"Action 1" and "Action 1 Input" or "Action Input 1" ` from gpt-3.5-turbo Updated the Regular expression to handle all these cases Attaching the screenshot of the result from the updated Regular expression. <img width="1036" alt="Screenshot 2023-04-16 at 1 39 00 AM" src="https://user-images.githubusercontent.com/55012400/232251184-23ca6cc2-7229-411a-b6e1-53b2f5ec18a5.png">	2023-04-16 09:16:50 -07:00
Harrison Chase	88d3ce12b8	Harrison/diffbot (#2984 ) Co-authored-by: Manuel Saelices <msaelices@gmail.com>	2023-04-16 09:11:24 -07:00
vowelparrot	5ca7ce77cd	Remove pythonrepl from LLM-MathChain (#2943 ) Use numexpr evaluate instead of the python REPL to avoid malicious code injection. Tested against the (limited) math dataset and got the same score as before. For more permissive tools (like the REPL tool itself), other approaches ought to be provided (some combination of Sanitizer + Restricted python + unprivileged-docker + ...), but for a calculator tool, only mathematical expressions should be permitted. See https://github.com/hwchase17/langchain/issues/814	2023-04-16 08:50:32 -07:00
Chetanya Rastogi	aead062a70	Add an example tutorial for using PDFMinerPDFasHTMLLoader (#2960 ) Last week I added the `PDFMinerPDFasHTMLLoader`. I am adding some example code in the notebook to serve as a tutorial for how that loader can be used to create snippets of a pdf that are structured within sections. All the other loaders only provide the `Document` objects segmented by pages but that's pretty loose given the amount of other metadata that can be extracted. With the new loader, one can leverage font-size of the text to decide when a new sections starts and can segment the text more semantically as shown in the tutorial notebook. The cell shows that we are able to find the content of entire section under Related Work for the example pdf which is spread across 2 pages and hence is stored as two separate documents by other loaders	2023-04-16 08:34:39 -07:00
Nahin Khan	9a03f00e6c	Fix typos (#2977 )	2023-04-16 08:28:36 -07:00
Harrison Chase	274b25c010	SVM retriever (#2947 ) (#2949 ) Add SVM retriever class, based on https://github.com/karpathy/randomfun/blob/master/knn_vs_svm.ipynb. Testing still WIP, but the logic is correct (I have a local implementation outside of Langchain working). --------- Co-authored-by: Lance Martin <122662504+PineappleExpress808@users.noreply.github.com> Co-authored-by: rlm <31treehaus@31s-MacBook-Pro.local>	2023-04-15 12:49:59 -07:00

... 3 4 5 6 7 ...

1071 Commits