langchain

Author	SHA1	Message	Date
Leonid Ganeline	ce15ffae6a	added `Wikipedia` retriever (#4302 ) - added `Wikipedia` retriever. It is effectively a wrapper for `WikipediaAPIWrapper`. It wraps load() into get_relevant_documents() - sorted `__all__` in the `retrievers/__init__` - added integration tests for the WikipediaRetriever - added an example (as Jupyter notebook) for the WikipediaRetriever	2023-05-09 10:08:39 -07:00
Davis Chase	ea83eed9ba	Bump to version 0.0.163 (#4382 )	2023-05-09 07:51:51 -07:00
Prayson Wilfred Daniel	2b4ba203f7	query correction from when to what (#4383 ) # Minor Wording Documentation Change ```python agent_chain.run("When's my friend Eric's surname?") # Answer with 'Zhu' ``` is changed to ```python agent_chain.run("What's my friend Eric's surname?") # Answer with 'Zhu' ``` I think "when" is a residue of the old query, which was "When’s my friends Eric`s birthday?".	2023-05-09 07:42:47 -07:00
Eugene Yurtsev	2ceb807da2	Add PDF parser implementations (#4356 ) # Add PDF parser implementations This PR separates the data loading from the parsing for a number of existing PDF loaders. Parser tests have been designed to help encourage developers to create a consistent interface for parsing PDFs. This interface can be made more consistent in the future by adding information into the initializer on desired behavior with respect to splitting by page etc. This code is expected to be backwards compatible -- with the exception of a bug fix with pymupdf parser which was returning `bytes` in the page content rather than strings. Also changing the lazy parser method of document loader to return an Iterator rather than Iterable over documents.	2023-05-09 10:24:17 -04:00
Eugene Yurtsev	ae0c3382dd	Add MimeType based parser (#4376 ) # Add MimeType Based Parser This PR adds a MimeType Based Parser. The parser inspects the mime-type of the blob it is parsing and based on the mime-type can delegate to the sub parser. ## Before submitting Waiting on adding notebooks until more implementations are landed. ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @vowelparrot	2023-05-09 10:22:56 -04:00
Leonid Ganeline	c485e7ab59	added GitHub star number (#4214 ) added GitHub star number with a link to the `GitHub star history chart` This is an interesting chart https://star-history.com/#hwchase17/langchain :)	2023-05-09 09:39:53 -04:00
Heath	0d568daacb	Update writer integration (#4363 ) # Update Writer LLM integration Changes the parameters and base URL to be in line with Writer's current API. Based on the documentation on this page: https://dev.writer.com/reference/completions-1	2023-05-08 21:59:46 -07:00
BioErrorLog	04f765b838	Fix grammar in Text Splitters docs (#4373 ) # Fix grammar in Text Splitters docs Just a small fix of grammar in the documentation: "That means there two different axes" -> "That means there are two different axes"	2023-05-08 22:38:40 -04:00
Zander Chase	c73cec5ac1	Add Example Notebook for LCP Client (#4207 ) Add a notebook in the `experimental/` directory detailing: - How to capture traces with the v2 endpoint - How to create datasets - How to run traces over the dataset	2023-05-08 18:33:19 -07:00
mbchang	f1401a6dff	new example: two agent debate with tools (#4024 )	2023-05-08 17:10:44 -07:00
玄猫	deffc65693	fix: vectorstore pgvector ensure compatibility #3884 (#4248 ) Ensure compatibility with both SQLAlchemy v1 and v2. Fixes the issue when using SQLAlchemy v1 (reported at #3884): ` langchain/vectorstores/pgvector.py", line 168, in create_tables_if_not_exists self._conn.commit() AttributeError: 'Connection' object has no attribute 'commit' ` Ref doc: https://docs.sqlalchemy.org/en/14/changelog/migration_20.html#migration-20-autocommit	2023-05-08 16:43:50 -07:00
Davis Chase	ba0057c077	Check OpenAI model kwargs (#4366 ) Handle duplicate and incorrectly specified OpenAI params Thanks @PawelFaron for the fix! Made small update Closes #4331 --------- Co-authored-by: PawelFaron <42373772+PawelFaron@users.noreply.github.com> Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-08 16:37:34 -07:00
Davis Chase	02ebb15c4a	Fix TextSplitter.from_tiktoken (#4361 ) Thanks to @danb27 for the fix! Minor update Fixes https://github.com/hwchase17/langchain/issues/4357 --------- Co-authored-by: Dan Bianchini <42096328+danb27@users.noreply.github.com>	2023-05-08 16:36:38 -07:00
Naveen Tatikonda	782df1db10	OpenSearch: Add Similarity Search with Score (#4089 ) ### Description Add `similarity_search_with_score` method for OpenSearch to return scores along with documents in the search results Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	2023-05-08 16:35:21 -07:00
Ankush Gola	b3ecce0545	fix json saving, update docs to reference anthropic chat model (#4364 ) Fixes # (issue) https://github.com/hwchase17/langchain/issues/4085	2023-05-08 15:30:52 -07:00
ImmortalZ	b04d84f6b3	fix: solve the infinite loop caused by 'add_memory' function when run… (#4318 ) fix: solve the infinite loop caused by 'add_memory' function when run 'pause_to_reflect' function run steps: 'add_memory' -> 'pause_to_reflect' -> 'add_memory': infinite loop	2023-05-08 15:13:23 -07:00
Eugene Yurtsev	aa11f7c89b	Add progress bar to filesystemblob loader, update pytest config for unit tests (#4212 ) This PR adds: * Option to show a tqdm progress bar when using the file system blob loader * Update pytest run configuration to be stricter * Adding a new marker that checks that required pkgs exist	2023-05-08 16:15:09 -04:00
Eduard van Valkenburg	f4c8502e61	fix for cosmos not loading old messages (#4094 ) I noticed cosmos was not loading old messages properly, fixed now.	2023-05-08 12:48:15 -07:00
Simba Khadder	d84df25466	Add example on how to use Featureform with langchain (#4337 ) Added an example on how to use Featureform to connecting_to_a_feature_store.ipynb.	2023-05-08 10:32:17 -07:00
Harrison Chase	42df78d396	bump ver 162 (#4346 )	2023-05-08 09:28:41 -07:00
Zander Chase	8b284f9ad0	Pass parsed inputs through to tool _run (#4309 )	2023-05-08 09:13:05 -07:00
Zander Chase	35c9e6ab40	Pass Callbacks through load_tools (#4298 ) - Update the load_tools method to properly accept `callbacks` arguments. - Add a deprecation warning when `callback_manager` is passed - Add two unit tests to check the deprecation warning is raised and to confirm the callback is passed through. Closes issue #4096	2023-05-08 08:44:26 -07:00
Zander Chase	0870a45a69	Add Pull Request Template (#4247 )	2023-05-08 08:34:37 -07:00
Jinto Jose	8a338412fa	mongodb support for chat history (#4266 )	2023-05-08 08:34:05 -07:00
Harrison Chase	f510940bde	add check for lower bound of lark (#4287 )	2023-05-08 08:31:05 -07:00
Harrison Chase	c8b0b6e6c1	add youtube tools (#4320 )	2023-05-08 08:29:30 -07:00
PawelFaron	1d1166ded6	Fixed huggingfacehub_api_token handling in HuggingFaceEndpoint (#4335 ) Reported here: https://github.com/hwchase17/langchain/issues/4334 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-08 08:29:17 -07:00
Arjun Aravindan	637c61cffb	Add support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers (#4305 ) This commit adds support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers. This allows users to specify the Browser binary location which is required when deploying to services such as Heroku This change also includes updated documentation and type hints to reflect the new binary_location parameter and its usage. fixes #4304	2023-05-08 11:05:55 -04:00
Lior Neudorfer	65c95f9fb2	Better error when running chain without any args (#4294 ) Today, when running a chain without any arguments, the raised ValueError incorrectly specifies that user provided "both positional arguments and keyword arguments". This PR adds a more accurate error in that case.	2023-05-07 21:11:51 -07:00
Harrison Chase	edcd171535	bring back ref (#4308 )	2023-05-07 17:32:28 -07:00
Wuxian Zhang	6f386628c2	Permit unicode outputs when dumping json in GetElementsTool (#4276 ) Adds ensure_ascii=False when dumping json in the GetElementsTool Fixes issue https://github.com/hwchase17/langchain/issues/4265	2023-05-07 14:43:03 -07:00
Eugene Brodsky	a1001b29eb	Incorrect docstring for PythonCodeTextSplitter (#4296 ) Fixes a copy-paste error in the docstring	2023-05-07 14:04:54 -07:00
Ikko Eltociear Ashimine	f70e18a5b3	Fix typo in huggingface.py (#4277 ) enviroment -> environment	2023-05-07 11:37:06 -04:00
Eugene Yurtsev	0c646bb703	Minor clean up in BlobParser (#4210 ) Minor clean up to use `abstractmethod` and `ABC` instead of `abc.abstractmethod` and `abc.ABC`.	2023-05-07 11:32:53 -04:00
PawelFaron	04b74d0446	Adjusted GPT4All llm to streaming API and added support for GPT4All_J (#4131 ) Fix for these issues: https://github.com/hwchase17/langchain/issues/4126 https://github.com/hwchase17/langchain/issues/3839#issuecomment-1534258559 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-06 15:14:09 -07:00
Harrison Chase	075d9631f5	bump ver to 161 (#4239 )	2023-05-06 10:20:36 -07:00
Harrison Chase	64940e9d0f	docs for azure (#4238 )	2023-05-06 10:16:00 -07:00
Myeongseop Kim	747b5f87c2	Add HumanInputLLM (#4160 ) Related: #4028, I opened a new PR because (1) I was unable to unstage mistakenly committed files (I'm not familiar enough with git to resolve this issue), (2) I felt closing the original PR and opening a new PR would be more appropriate if I changed the class name. This PR creates HumanInputLLM (HumanLLM in #4028), a simple LLM wrapper class that returns user input as the response. I also added a simple Jupyter notebook on how and why to use this LLM wrapper. In the notebook, I went over how to use this LLM wrapper and showed an example of testing `WikipediaQueryRun` using HumanInputLLM. I believe this LLM wrapper will be especially useful for debugging, educational, or testing purposes.	2023-05-06 09:48:40 -07:00
Davis Chase	6cd51ef3d0	Simplify router chain constructor signatures (#4146 )	2023-05-06 09:38:17 -07:00
玄猫	43a7a89e93	opt: document_loader notiondb to extract url (#4222 )	2023-05-06 09:34:33 -07:00
Leonid Ganeline	9544b30821	added `Wikipedia` document loader (#4141 ) - Added the `Wikipedia` document loader. It is based on the existing `utilities/WikipediaAPIWrapper` - Added the respective unit tests and example notebook - Sorted list of classes in __init__	2023-05-06 09:32:45 -07:00
Eugene Yurtsev	423f497168	Add BlobParser abstraction (#3979 ) This PR adds the BlobParser abstraction. It follows the proposal described here: https://github.com/hwchase17/langchain/pull/2833#issuecomment-1509097756	2023-05-05 21:43:38 -04:00
Davis Chase	5ca13cc1f0	Dev2049/pypdfium2 (#4209 ) thanks @jerrytigerxu for the addition! --------- Co-authored-by: Jere Xu <jtxu2008@gmail.com> Co-authored-by: jerrytigerxu <jere.tiger.xu@gmailc.om>	2023-05-05 17:55:31 -07:00
Leonid Ganeline	59204a5033	docs: `document_loaders` improvements (#4200 ) - made notebooks consistent: titles, service/format descriptions. - corrected short names to full names, for example, `Word` -> `Microsoft Word` - added missing descriptions - renamed notebook files to make ToC correctly sorted	2023-05-05 17:44:54 -07:00
Harrison Chase	eeb7c96e0c	bump version to 160 (#4205 )	2023-05-05 17:02:39 -07:00
Davis Chase	f1fc4dfebc	Dev2049/obsidian patch (#4204 ) thanks @shkarlsson for the fix! (just updated formatting) --------- Co-authored-by: shkarlsson <sven.henrik.karlsson@gmail.com>	2023-05-05 16:49:19 -07:00
George	2324f19c85	Update qdrant interface (#3971 ) Hello 1) Passing `embedding_function` as a callable seems to be outdated and the common interface is to pass `Embeddings` instance 2) At the moment `Qdrant.add_texts` is designed to be used with `embeddings.embed_query`, which is 1) slow 2) causes ambiguity due to 1. It should be used with `embeddings.embed_documents` This PR solves both problems and also provides some new tests	2023-05-05 16:46:40 -07:00
Harrison Chase	76ed41f48a	update docs (#4194 )	2023-05-05 16:45:26 -07:00
Zander Chase	1017e5cee2	Add LCP Client (#4198 ) Adding a client to fetch datasets, examples, and runs from a LCP instance and run objects over them.	2023-05-05 16:28:56 -07:00
Zander Chase	a30f42da4e	Update V2 Tracer (#4193 ) - Update the RunCreate object to work with recent changes - Add optional Example ID to the tracer - Adjust default persist_session behavior to attempt to load the session if it exists - Raise more useful HTTP errors for logging - Add unit testing - Fix the default ID to be a UUID for v2 tracer sessions Broken out from the big draft here: https://github.com/hwchase17/langchain/pull/4061	2023-05-05 14:55:01 -07:00
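
Commit 6f386628c2 above says it adds `ensure_ascii=False` when dumping JSON in the GetElementsTool. The following is a minimal sketch of what that flag changes, using only the standard library. The `elements` payload is a made-up example and not the tool's real output shape.

```python
import json

# Hypothetical element data; the real tool's payload is not shown in the log.
elements = [{"innerText": "こんにちは"}]

# Default behaviour: non-ASCII characters are escaped as \uXXXX sequences.
escaped = json.dumps(elements)

# With ensure_ascii=False the characters are written out unchanged.
readable = json.dumps(elements, ensure_ascii=False)

print(escaped)
print(readable)
```

Both strings decode to the same data with `json.loads`; only the textual form differs, which matters when the output is read by a person or passed to a model as text.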

1822 Commits
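
Commit ae0c3382dd in the log describes a parser that inspects the mime-type of a blob and delegates to a sub-parser. The sketch below illustrates that dispatch idea only. `MimeTypeDispatchParser`, `parse_text`, and the `handlers` mapping are hypothetical names, not langchain's API, and the sketch assumes the mime-type is guessed from the file name with the standard `mimetypes` module rather than read from a blob object.

```python
import mimetypes
from typing import Callable, Dict, List

# Hypothetical: a "parser" here is any callable that turns raw bytes
# into a list of text chunks.
Parser = Callable[[bytes], List[str]]


class MimeTypeDispatchParser:
    """Pick a sub-parser based on the guessed mime-type of the input."""

    def __init__(self, handlers: Dict[str, Parser]) -> None:
        self.handlers = handlers

    def parse(self, path: str, data: bytes) -> List[str]:
        # Assumption: the mime-type is inferred from the file extension.
        mime_type, _ = mimetypes.guess_type(path)
        if mime_type not in self.handlers:
            raise ValueError(f"Unsupported mime-type: {mime_type}")
        # Delegate to the sub-parser registered for this mime-type.
        return self.handlers[mime_type](data)


def parse_text(data: bytes) -> List[str]:
    """Trivial sub-parser for plain text."""
    return [data.decode("utf-8")]


parser = MimeTypeDispatchParser({"text/plain": parse_text})
docs = parser.parse("notes.txt", b"hello")
print(docs)
```

Keeping the mapping as a plain dictionary means a new format can be supported by registering one more callable, without touching the dispatch logic.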