langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
Harrison Chase	2eeaccf01c	Harrison/apify (#2215 ) Co-authored-by: Jiří Moravčík <jiri.moravcik@gmail.com>	2023-03-30 20:58:14 -07:00
Alex Stachowiak	e6a9ee64b3	Update vectorstore-retriever.ipynb (#2210 )	2023-03-30 20:51:46 -07:00
Arttii	4e9ee566ef	Add MMR methods to chroma (#2148 ) Hi, I added MMR similar to faiss and milvus to chroma. Please let me know what you think.	2023-03-30 20:51:16 -07:00
Harrison Chase	fc009f61c8	sitemap more flexible (#2214 )	2023-03-30 20:46:36 -07:00
Matt Robinson	3dfe1cf60e	feat: document loader for epublications (#2202 ) ### Summary Adds a new document loader for processing e-publications. Works with `unstructured>=0.5.4`. You need to have [`pandoc`](https://pandoc.org/installing.html) installed for this loader to work. ### Testing ```python from langchain.document_loaders import UnstructuredEPubLoader loader = UnstructuredEPubLoader("winter-sports.epub", mode="elements") data = loader.load() data[0] ```	2023-03-30 20:45:31 -07:00
Ikko Eltociear Ashimine	a4a1ee6b5d	Update huggingface_length_function.ipynb (#2203 ) HuggingFace -> Hugging Face	2023-03-30 20:43:58 -07:00
Harrison Chase	2d3918c152	make requests more general (#2209 )	2023-03-30 20:41:56 -07:00
Harrison Chase	1c03205cc2	embedding docs (#2200 )	2023-03-30 08:34:14 -07:00
Harrison Chase	feec4c61f4	Harrison/docs reqs (#2199 )	2023-03-30 08:20:30 -07:00
Harrison Chase	097684e5f2	bump version to 127 (#2197 )	2023-03-30 08:11:04 -07:00
Ben Heckmann	fd1fcb5a7d	fix typing for LLMMathChain (#2183 ) Fix typing in LLMMathChain to allow chat models (#1834). Might have been forgotten in related PR #1807.	2023-03-30 07:52:58 -07:00
Cory Zue	3207a74829	fix typo in chat_prompt_template docs (#2193 )	2023-03-30 07:52:40 -07:00
Alan deLevie	597378d1f6	Small typo in custom_agent.ipynb (#2194 ) determin -> determine	2023-03-30 07:52:29 -07:00
Jeru2023	64b9843b5b	Update text.py (#2195 ) Add encoding parameter when open txt file to support unicode files.	2023-03-30 07:52:17 -07:00
Rui Ferreira	5d86a6acf1	Fix wikipedia summaries (#2187 ) The upstream wikipedia page loading still seems to have issues. This is a compromise solution that does an exact-match search rather than a search for the completion. See previous PR: https://github.com/hwchase17/langchain/pull/2169	2023-03-30 07:34:13 -07:00
Kei Kamikawa	35a3218e84	supported async retriever (#2149 )	2023-03-30 10:14:05 -04:00
Harrison Chase	65c0c73597	Harrison/arize (#2180 ) Co-authored-by: Hakan Tekgul <tekgul2@illinois.edu>	2023-03-29 22:55:21 -07:00
Harrison Chase	33a001933a	Harrison/clear ml (#2179 ) Co-authored-by: Victor Sonck <victor.sonck@gmail.com>	2023-03-29 22:45:34 -07:00
Harrison Chase	fe804d2a01	Harrison/aim integration (#2178 ) Co-authored-by: Hovhannes Tamoyan <hovhannes.tamoyan@gmail.com> Co-authored-by: Gor Arakelyan <arakelyangor10@gmail.com>	2023-03-29 22:37:56 -07:00
Gene Ruebsamen	68f039704c	missing word 'not' in constitutional prompts (#2176 ) arson should not be condoned. not was missing in the critique	2023-03-29 22:29:48 -07:00
Harrison Chase	bcfd071784	Harrison/engine args (#2177 ) Co-authored-by: Alvaro Sevilla <alvarosevilla95@gmail.com>	2023-03-29 22:29:38 -07:00
Tim Asp	7d90691adb	Add kwargs to from_* in PromptTemplate (#2161 ) This will let us use output parsers, etc., while using the `from_*` helper functions	2023-03-29 22:13:27 -07:00
Rui Ferreira	f83c36d8fd	Fix incorrect wikipage summaries (#2169 ) Creating a page using the title causes a wikipedia search with autocomplete set to true. This frequently causes the summaries to be unrelated to the actual page found. See: `1554943e8a/wikipedia/wikipedia.py (L254-L280)`	2023-03-29 22:13:03 -07:00
Tim Asp	6be67279fb	Add apredict_and_parse to LLM (#2164 ) `predict_and_parse` exists, and it's a nice abstraction to allow for applying output parsers to LLM generations. And async is very useful. As an aside, the difference between `call/acall`, `predict/apredict` and `generate/agenerate` isn't entirely clear to me other than they all call into the LLM in slightly different ways. Is there some documentation or a good way to think about these differences? One thought: output parsers should just work magically for all those LLM calls. If the `output_parser` arg is set on the prompt, the LLM has access, so it seems like extra work on the user's end to have to call `output_parser.parse` If this sounds reasonable, happy to throw something together. @hwchase17	2023-03-29 22:12:50 -07:00
Max Caldwell	3dc49a04a3	[Documents] Updated Figma docs and added example (#2172 ) - Current docs are pointing to the wrong module, fixed - Added some explanation on how to find the necessary parameters - Added chat-based codegen example w/ retrievers Picture of the new page: ![Screenshot 2023-03-29 at 20-11-29 Figma — 🦜🔗 LangChain 0 0 126](https://user-images.githubusercontent.com/2172753/228719338-c7ec5b11-01c2-4378-952e-38bc809f217b.png) Please let me know if you'd like any tweaks! I wasn't sure if the example was too heavy for the page or not but decided "hey, I probably would want to see it" and so included it. Co-authored-by: maxtheman <max@maxs-mbp.lan>	2023-03-29 22:11:45 -07:00
Harrison Chase	5c907d9998	Harrison/base agent without docs (#2166 )	2023-03-29 22:11:25 -07:00
Zoltan Fedor	1b7cfd7222	Bugfix: Redis `lrange()` retrieves records in opposite order of insertion (#2167 ) The new Redis backend for chat message history ([see](https://github.com/hwchase17/langchain/pull/2122)) uses a Redis list object to store messages and then uses `lrange()` to retrieve the list of messages ([see](https://github.com/hwchase17/langchain/blob/master/langchain/memory/chat_message_histories/redis.py#L50)). Unfortunately, this retrieves the messages sorted in the opposite order of how they were inserted, meaning the last inserted message comes first in the retrieved list, which is not what we want. This PR fixes that by changing the order to match the order of insertion.	2023-03-29 22:09:01 -07:00
blob42	7859245fc5	doc: more details on BaseOutputParser docstrings (#2171 ) Co-authored-by: blob42 <spike@w530>	2023-03-29 22:07:05 -07:00
Ankush Gola	529a1f39b9	make tool verbosity override agent verbosity (#2173 ) Currently, if a tool is set to verbose, an agent can override it by passing in its own verbose flag. This is not ideal if we want to stream back responses from agents, as we want the llm and tools to be sending back events but nothing else. This also makes the behavior consistent with ts.	2023-03-29 22:05:58 -07:00
Harrison Chase	f5a4bf0ce4	remove prep (#2136 ) agents should be stateless or async stuff may not work	2023-03-29 14:38:21 -07:00
sergerdn	a0453ebcf5	docs: update docstrings in ElasticVectorSearch class (#2141 ) This merge includes updated comments in the ElasticVectorSearch class to provide information on how to connect to `Elasticsearch` instances that require login credentials, including Elastic Cloud, without any functional changes. The `ElasticVectorSearch` class now inherits from the `ABC` abstract base class, which does not break or change any functionality. This allows for easy subclassing and creation of custom implementations in the future or for any users, especially for me 😄 I confirm that before pushing these changes, I ran: ```bash make format && make lint ``` To ensure that the new documentation is rendered correctly I ran ```bash make docs_build ``` To ensure that the new documentation has no broken links, I ran a check ```bash make docs_linkcheck ``` ![Capture](https://user-images.githubusercontent.com/64213648/228541688-38f17c7b-b012-4678-86b9-4dd607469062.JPG) Also take a look at https://github.com/hwchase17/langchain/issues/1865 P.S. Sorry for spamming you with force-pushes. In the future, I will be smarter.	2023-03-29 16:20:29 -04:00
Ankush Gola	ffb7de34ca	Fix docstring (#2147 ) (#2160 ) Somehow docstring was doubled. A minor fix for this --------- Co-authored-by: Piotr Mazurek <piotr635@gmail.com>	2023-03-29 16:17:54 -04:00
Shota Terashita	09085c32e3	Add `temperature` to ChatOpenAI (#2152 ) Just add `temperature` parameter to ChatOpenAI class. https://python.langchain.com/en/latest/getting_started/getting_started.html#building-a-language-model-application-chat-models There are descriptions like `chat = ChatOpenAI(temperature=0)` in the documents, but it is confusing because it is not supported as an explicit parameter.	2023-03-29 16:04:44 -04:00
Harrison Chase	8b91a21e37	fix memory docs (#2157 )	2023-03-29 11:39:06 -07:00
Harrison Chase	55b52bad21	bump version to 126 (#2155 )	2023-03-29 11:36:52 -07:00
Harrison Chase	b35260ed47	Harrison/memory base (#2122 ) @3coins + @zoltan-fedor.... here's the PR plus some minor changes I made. Thoughts? I can try to get it into tomorrow's release --------- Co-authored-by: Zoltan Fedor <zoltan.0.fedor@gmail.com> Co-authored-by: Piyush Jain <piyushjain@duck.com>	2023-03-29 10:10:09 -07:00
Patrick Storm	7bea3b302c	Add ability for GoogleDrive loader to load google sheets (#2135 ) Currently only google documents and pdfs can be loaded from google drive. This PR implements the latest recommended method for getting google sheets including all tabs. It currently parses the google sheet data the exact same way as the csv loader - the only difference is that the gdrive sheets loader is not using the `csv` library since the data is already in a list.	2023-03-29 07:56:04 -07:00
Chase Adams	b5449a866d	docs: tiny fix on docs verbiage (#2124 ) Changed `RecursiveCharaterTextSplitter` => `RecursiveCharacterTextSplitter`. GH's diff doesn't handle the long string well.	2023-03-28 22:56:29 -07:00
Jonathan Page	8441cbfc03	Add successful request count to OpenAI callback (#2128 ) I've found it useful to track the number of successful requests to OpenAI. This gives me a better sense of the efficiency of my prompts and helps compare map_reduce/refine on a cheaper model vs. stuffing on a more expensive model with higher capacity.	2023-03-28 22:56:17 -07:00
Sebastien Kerbrat	4ab66c4f52	Strip sitemap entries (#2132 ) Loading this sitemap didn't work for me https://www.alzallies.com/sitemap.xml Changing this fixed it and it seems like a good idea to do it in general. Integration tests pass	2023-03-28 22:56:07 -07:00
Harrison Chase	27f80784d0	fix link (#2123 )	2023-03-28 22:51:36 -07:00
blob42	031e32f331	searx: implement async + helper tool providing json results (#2129 ) - implemented `arun` and `aresults`. Reuses aiosession if available. - helper tools `SearxSearchRun` and `SearxSearchResults` - update doc Co-authored-by: blob42 <spike@w530>	2023-03-28 22:49:02 -07:00
Ankush Gola	ccee1aedd2	add async support for anthropic (#2114 ) should not be merged in before https://github.com/anthropics/anthropic-sdk-python/pull/11 gets released	2023-03-28 22:49:14 -04:00
Harrison Chase	e2c26909f2	Harrison/memory check (#2119 ) Co-authored-by: JIAQIA <jqq1716@gmail.com>	2023-03-28 15:40:36 -07:00
Harrison Chase	3e879b47c1	Harrison/gitbook (#2044 ) Co-authored-by: Irene López <45119610+ireneisdoomed@users.noreply.github.com>	2023-03-28 15:28:33 -07:00
Walter Beller-Morales	859502b16c	Fix issue#1712: Update `BaseQAWithSourcesChain` to handle space & newline after `SOURCES:` (#2118 ) Fix the issue outlined in #1712 to ensure the `BaseQAWithSourcesChain` can properly separate the sources from an agent response even when they are delineated by a newline. This will ensure the `BaseQAWithSourcesChain` can reliably handle both of these agent outputs: * `"This Agreement is governed by English law.\nSOURCES: 28-pl"` -> `"This Agreement is governed by English law.\n"`, `"28-pl"` * `"This Agreement is governed by English law.\nSOURCES:\n28-pl"` -> `"This Agreement is governed by English law.\n"`, `"28-pl"` I couldn't find any unit tests for this, but please let me know if you'd like me to add any test coverage.	2023-03-28 15:28:20 -07:00
Saurabh Misra	c33e055f17	Improve ConversationKGMemory and its function load_memory_variables (#1999 ) 1. Removed the `summaries` dictionary in favor of directly appending to the summary_strings list, which avoids the unnecessary double-loop. 2. Simplified the logic for populating the `context` variable. Co-created with GPT-4 @agihouse	2023-03-28 15:19:48 -07:00
Harrison Chase	a5bf8c9b9d	Harrison/aleph alpha embeddings (#2117 ) Co-authored-by: Piotr Mazurek <piotr635@gmail.com> Co-authored-by: PiotrMazurek <piotr.mazurek@aleph-alpha.com>	2023-03-28 15:18:03 -07:00
Nick	0874872dee	add token reduction to ConversationalRetrievalChain (#2075 ) This worked for me, but I'm not sure if it's the right way to approach something like this, so I'm open to suggestions. Adds class properties `reduce_k_below_max_tokens: bool` and `max_tokens_limit: int` to the `ConversationalRetrievalChain`. The code is basically copied from [`RetrievalQAWithSourcesChain`](`46d141c6cb/langchain/chains/qa_with_sources/retrieval.py (L24)`)	2023-03-28 15:07:31 -07:00
Alex Telon	ef25904ecb	Fixed 1 missing line in getting_started.md (#2107 ) Seems like a copy paste error. The very next example does have this line. Please tell me if I missed something in the process and should have created an issue or something first!	2023-03-28 15:03:28 -07:00
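
The `SOURCES:` handling described in the #2118 entry above can be illustrated with a small sketch. This is not the code from that commit; it is a minimal example, assuming a regular expression that accepts any whitespace (space or newline) after the marker. The function name `split_answer_and_sources` is hypothetical and does not exist in langchain.

```python
import re
from typing import Tuple


def split_answer_and_sources(text: str) -> Tuple[str, str]:
    """Split an agent response into (answer, sources).

    Hypothetical helper for illustration only. It splits on the first
    "SOURCES:" marker and swallows any whitespace right after it, so a
    space and a newline after the marker are treated the same way.
    """
    parts = re.split(r"SOURCES:\s*", text, maxsplit=1)
    if len(parts) == 2:
        answer, sources = parts
        return answer, sources.strip()
    # No marker found: the whole text is the answer, with no sources.
    return text, ""


if __name__ == "__main__":
    # The two agent outputs quoted in the commit message.
    examples = [
        "This Agreement is governed by English law.\nSOURCES: 28-pl",
        "This Agreement is governed by English law.\nSOURCES:\n28-pl",
    ]
    for example in examples:
        print(split_answer_and_sources(example))
```

Both inputs yield the same answer text and the same source string, which is the behavior the commit message asks for.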


1140 Commits