langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
Harrison Chase	b35260ed47	Harrison/memory base (#2122 ) @3coins + @zoltan-fedor.... heres the pr + some minor changes i made. thoguhts? can try to get it into tmrws release --------- Co-authored-by: Zoltan Fedor <zoltan.0.fedor@gmail.com> Co-authored-by: Piyush Jain <piyushjain@duck.com>	2023-03-29 10:10:09 -07:00
Patrick Storm	7bea3b302c	Add ability for GoogleDrive loader to load google sheets (#2135 ) Currently only google documents and pdfs can be loaded from google drive. This PR implements the latest recommended method for getting google sheets including all tabs. It currently parses the google sheet data the exact same way as the csv loader - the only difference is that the gdrive sheets loader is not using the `csv` library since the data is already in a list.	2023-03-29 07:56:04 -07:00
Chase Adams	b5449a866d	docs: tiny fix on docs verbiage (#2124 ) Changed `RecursiveCharaterTextSplitter` => `RecursiveCharacterTextSplitter`. GH's diff doesn't handle the long string well.	2023-03-28 22:56:29 -07:00
Jonathan Page	8441cbfc03	Add successful request count to OpenAI callback (#2128 ) I've found it useful to track the number of successful requests to OpenAI. This gives me a better sense of the efficiency of my prompts and helps compare map_reduce/refine on a cheaper model vs. stuffing on a more expensive model with higher capacity.	2023-03-28 22:56:17 -07:00
Sebastien Kerbrat	4ab66c4f52	Strip sitemap entries (#2132 ) Loading this sitemap didn't work for me https://www.alzallies.com/sitemap.xml Changing this fixed it and it seems like a good idea to do it in general. Integration tests pass	2023-03-28 22:56:07 -07:00
Harrison Chase	27f80784d0	fix link (#2123 )	2023-03-28 22:51:36 -07:00
blob42	031e32f331	searx: implement async + helper tool providing json results (#2129 ) - implemented `arun` and `aresults`. Reuses aiosession if available. - helper tools `SearxSearchRun` and `SearxSearchResults` - update doc Co-authored-by: blob42 <spike@w530>	2023-03-28 22:49:02 -07:00
Ankush Gola	ccee1aedd2	add async support for anthropic (#2114 ) should not be merged in before https://github.com/anthropics/anthropic-sdk-python/pull/11 gets released	2023-03-28 22:49:14 -04:00
Harrison Chase	e2c26909f2	Harrison/memory check (#2119 ) Co-authored-by: JIAQIA <jqq1716@gmail.com>	2023-03-28 15:40:36 -07:00
Harrison Chase	3e879b47c1	Harrison/gitbook (#2044 ) Co-authored-by: Irene López <45119610+ireneisdoomed@users.noreply.github.com>	2023-03-28 15:28:33 -07:00
Walter Beller-Morales	859502b16c	Fix issue#1712: Update `BaseQAWithSourcesChain` to handle space & newline after `SOURCES:` (#2118 ) Fix the issue outlined in #1712 to ensure the `BaseQAWithSourcesChain` can properly separate the sources from an agent response even when they are delineated by a newline. This will ensure the `BaseQAWithSourcesChain` can reliably handle both of these agent outputs: * `"This Agreement is governed by English law.\nSOURCES: 28-pl"` -> `"This Agreement is governed by English law.\n`, `"28-pl"` * `"This Agreement is governed by English law.\nSOURCES:\n28-pl"` -> `"This Agreement is governed by English law.\n`, `"28-pl"` I couldn't find any unit tests for this but please let me know if you'd like me to add any test coverage.	2023-03-28 15:28:20 -07:00
Saurabh Misra	c33e055f17	Improve ConversationKGMemory and its function load_memory_variables (#1999 ) 1. Removed the `summaries` dictionary in favor of directly appending to the summary_strings list, which avoids the unnecessary double-loop. 2. Simplified the logic for populating the `context` variable. Co-created with GPT-4 @agihouse	2023-03-28 15:19:48 -07:00
Harrison Chase	a5bf8c9b9d	Harrison/aleph alpha embeddings (#2117 ) Co-authored-by: Piotr Mazurek <piotr635@gmail.com> Co-authored-by: PiotrMazurek <piotr.mazurek@aleph-alpha.com>	2023-03-28 15:18:03 -07:00
Nick	0874872dee	add token reduction to ConversationalRetrievalChain (#2075 ) This worked for me, but I'm not sure if its the right way to approach something like this, so I'm open to suggestions. Adds class properties `reduce_k_below_max_tokens: bool` and `max_tokens_limit: int` to the `ConversationalRetrievalChain`. The code is basically copied from [`RetreivalQAWithSourcesChain`](`46d141c6cb/langchain/chains/qa_with_sources/retrieval.py (L24)`)	2023-03-28 15:07:31 -07:00
Alex Telon	ef25904ecb	Fixed 1 missing line in getting_started.md (#2107 ) Seems like a copy paste error. The very next example does have this line. Please tell me if I missed something in the process and should have created an issue or something first!	2023-03-28 15:03:28 -07:00
Francis Felici	9d6f649ba5	fix typo in docs (#2115 ) simple typo	2023-03-28 15:03:17 -07:00
Harrison Chase	c58932e8fd	Harrison/better async (#2112 ) Co-authored-by: Ammar Husain <ammo700@gmail.com>	2023-03-28 13:28:04 -07:00
Harrison Chase	6e85cbcce3	Harrison/unstructured validation (#2111 ) Co-authored-by: kravetsmic <79907559+kravetsmic@users.noreply.github.com>	2023-03-28 13:27:52 -07:00
Tim Asp	b25dbcb5b3	add missing `source` field to pymupdf output (#2110 ) To be consistent with other loaders for use with the `Sources` vector workflows.	2023-03-28 13:22:05 -07:00
Harrison Chase	a554e94a1a	v125 (#2109 ) for hackathon tonight!	2023-03-28 13:12:41 -07:00
Michael Gokhman	5f34dffedc	fix(llms): update default AI21 model to j2, as j1 being deprecated (#2108 ) the j1-* models are marked as [Legacy] in the docs and are expected to be deprecated in 2023-06-01 according to https://docs.ai21.com/docs/jurassic-1-models-legacy ensured `tests/integration_tests/llms/test_ai21.py` pass. empirically observed that `j2-jumbo-instruct` works better the `j2-jumbo` in various simple agent chains, as also expected given the prompt templates are mostly zero shot. Co-authored-by: Michael Gokhman <michaelg@ai21.com>	2023-03-28 13:07:05 -07:00
Honkware	aff33d52c5	Add OpenWeatherMap API Tool (#2083 ) Added tool for OpenWeatherMap API	2023-03-28 12:02:14 -07:00
Charlie Holtz	f16c1fb6df	Add replicate take 2 (#2077 ) This PR adds a replicate integration to langchain. It's an updated version of https://github.com/hwchase17/langchain/pull/1993, but with updates to match latest replicate-python code. https://github.com/replicate/replicate-python. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Zeke Sikelianos <zeke@sikelianos.com>	2023-03-28 11:56:57 -07:00
Harrison Chase	a9e1043673	bump version 124 (#2101 )	2023-03-28 08:58:52 -07:00
Harrison Chase	f281033362	rm pandas dependency (#2102 )	2023-03-28 08:38:19 -07:00
Harrison Chase	410bf37fb8	Harrison/big query (#2100 ) Co-authored-by: lu-cashmoney <lucas.corley@gmail.com>	2023-03-28 08:17:22 -07:00
Harrison Chase	eff5eed719	Harrison/jina (#2043 ) Co-authored-by: numb3r3 <wangfelix87@gmail.com> Co-authored-by: felix-wang <35718120+numb3r3@users.noreply.github.com>	2023-03-28 08:16:17 -07:00
Klein Tahiraj	d0a56f47ee	add ConversationalChatAgent to agent.__init__ (fix #2093 ) (#2098 ) As pointed out in #2093, ConversationalChatAgent was missing from agent.__init__. This PR fixes that.	2023-03-28 08:14:21 -07:00
Harrison Chase	9e74df2404	Fix issue#1645: Parse llm_output even there's newline (#2092 ) (#2099 ) Fix issue#1645: Parse either whitespace or newline after 'Action Input:' in llm_output in mrkl agent. Unittests added accordingly. Co-authored-by: ₿ingnan.ΞTH <brillliantz@outlook.com>	2023-03-28 08:14:09 -07:00
Stéphane Busso	0bee219cb3	feat: Add Notion database document loader (#2056 ) This PR adds Notion DB loader for langchain. It reads content from pages within a Notion Database. It uses the Notion API to query the database and read the pages. It also reads the metadata from the pages and stores it in the Document object.	2023-03-28 08:07:09 -07:00
Harrison Chase	923a7dde5a	Harrison/llama index loader (#2097 ) Co-authored-by: Jerry Liu <jerryjliu98@gmail.com>	2023-03-28 08:06:27 -07:00
Harrison Chase	4cd5cf2e95	notebook for tokens (#2086 )	2023-03-28 07:59:40 -07:00
blob42	33ebb05251	include the tool name for on_tool_end callback (#2000 ) This is useful if you rely on the `on_tool_end` callback to detect which tool has finished in a multi agents scenario. For example, I'm working on a project where I consume the `on_tool_end` event where the event could be emitted by many agents or tools. Right now the only way to know which tool has finished would be set a marker on the `on_tool_start` and catch it on `on_tool_end`. I didn't want to break the signature of the function, but what would have been cleaner would be to pass the same details as in `on_tool_start` Co-authored-by: blob42 <spike@w530>	2023-03-28 10:23:04 -04:00
Clark	e0331b55bb	fix(sql_database): related to #2020 (#2021 ) Fixed https://github.com/hwchase17/langchain/issues/2020 Co-authored-by: qianjun.wqj <qianjun.wqj@alibaba-inc.com>	2023-03-27 23:45:50 -07:00
Harrison Chase	d5825bd3e8	Harrison/whatsapp loader (#2085 ) Co-authored-by: Moshe <hello@moshemalka.me>	2023-03-27 23:43:45 -07:00
iocuydi	e8d9cbca3f	Add prompt and completion token tracking (#2080 ) Tracking the breakdown of token usage is useful when using GPT-4, where prompt and completion tokens are priced differently.	2023-03-27 23:41:25 -07:00
Michael Gokhman	b5020c7d9c	docs: fix promptlayer link typo (#2005 ) tiny typo, just stumbled upon it when reading the docs Co-authored-by: Michael Gokhman <michaelg@ai21.com>	2023-03-27 23:35:54 -07:00
Deepankar Mahapatro	5bea731fb4	docs(deployment): add langchain-serve (#2006 ) Adds documentation to deploy Langchain Chains & Agents using Jina. Repo: https://github.com/jina-ai/langchain-serve	2023-03-27 23:32:04 -07:00
Harrison Chase	0e3b0c827e	Harrison/ai plugin (#2084 ) Co-authored-by: Xupeng (Tony) Tong <tongxupeng.cpu@gmail.com>	2023-03-27 23:31:53 -07:00
Harrison Chase	365669a7fd	Harrison/fix save context (#2082 ) Co-authored-by: Saurabh Misra <misra.saurabh1@gmail.com>	2023-03-27 23:10:46 -07:00
blob42	b7f392fdd6	[agent_executor] convenience func: lookup tool by name (#2001 ) A quick convenience function to lookup a tool by name Co-authored-by: blob42 <spike@w530>	2023-03-27 23:10:34 -07:00
Ace Eldeib	4be2f9d75a	fix: numerous broken documentation links (#2070 ) seems linkchecker isn't catching them because it runs on generated html. at that point the links are already missing. the generation process seems to strip invalid references when they can't be re-written from md to html. I used https://github.com/tcort/markdown-link-check to check the doc source directly. There are a few false positives on localhost for development.	2023-03-27 23:07:03 -07:00
Harrison Chase	f74a1bebf5	Harrison/duckdb (#2064 ) Co-authored-by: Trent Hauck <trent@trenthauck.com>	2023-03-27 19:51:34 -07:00
Harrison Chase	76ecca4d53	redis retriever (#2060 )	2023-03-27 19:51:23 -07:00
Ankush Gola	b7ebb8fe30	enable streaming in anthropic llm wrapper (#2065 )	2023-03-27 20:25:00 -04:00
Francisco Ingham	41c8a42e22	Improve chat tool prompt (#1989 ) I have found that when the user has not asked an explicit question the agent might have trouble answering the latest comment and might instead try to answer a question that came before in the conversation which would not be what is desired. I also found that the agent might get confused with the current prompt and talk about the tools themselves instead of the results obtained from them. I added two changes to the tool prompt so that the agent answers only the last comment/question and only returns information from tool results.	2023-03-27 16:34:01 -07:00
Francisco Ingham	1cc9e90041	Solve small bug in the kg prompt (#1988 ) I think that the 'Person' line should be under 'Last line of conversation' as is the case in the other examples in the kg prompt	2023-03-27 16:33:26 -07:00
Harrison Chase	30e3b31b04	Harrison/document cleanup (#2062 ) Co-authored-by: Delip Rao <delip@users.noreply.github.com>	2023-03-27 16:32:55 -07:00
Harrison Chase	a0cd6672aa	Harrison/site map (#2061 ) Co-authored-by: Tim Asp <707699+timothyasp@users.noreply.github.com>	2023-03-27 16:28:08 -07:00
Arttii	8b5a43d720	Correctly pass filter down to the similarity_search_with_score function for chroma filtering logic (#1934 ) Should slightly fix the work in #1869	2023-03-27 15:50:46 -07:00

... 25 26 27 28 29 ...

2305 Commits