langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-06 03:20:49 +00:00

Author	SHA1	Message	Date
Davis Chase	f6c97e6af4	Fix Lark import error (#4421 ) Any import that touches langchain.retrievers currently requires Lark. Here's one attempt to fix. Not very pretty, very open to other ideas. Alternatives I thought of are 1) make Lark requirement, 2) put everything in parser.py in the try/except. Neither sounds much better Related to #4316, #4275	2023-05-10 01:07:34 -07:00
Harrison Chase	f0cfed636f	change nb name	2023-05-09 21:22:35 -07:00
Harrison Chase	6b8d144ccc	Harrison/plan and solve (#4422 )	2023-05-09 21:07:56 -07:00
StephaneBereux	d383c0cb43	fixed the filtering error in chromadb (#1621 ) Fixed two small bugs (as reported in issue #1619 ) in the filtering by metadata for `chroma` databases : - ```langchain.vectorstores.chroma.similarity_search``` takes a ```filter``` input parameter but do not forward it to ```langchain.vectorstores.chroma.similarity_search_with_score``` - ```langchain.vectorstores.chroma.similarity_search_by_vector``` doesn't take this parameter in input, although it could be very useful, without any additional complexity - and it would thus be coherent with the syntax of the two other functions. Co-authored-by: Davis Chase <130488702+dev2049@users.noreply.github.com>	2023-05-09 16:43:00 -07:00
jrhe	28091c2101	Use passed LLM for default chain in MultiPromptChain (#4418 ) Currently, MultiPromptChain instantiates a ChatOpenAI LLM instance for the default chain to use if none of the prompts passed match. This seems like an error as it means that you can't use your choice of LLM, or configure how to instantiate the default LLM (e.g. passing in an API key that isn't in the usual env variable).	2023-05-09 16:15:25 -07:00
Davis Chase	5c8e12558d	Dev2049/pinecone try except (#4424 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bernie G <bernie.gandin2@gmail.com>	2023-05-09 16:03:19 -07:00
Rukmani	2b14036126	Update WhatsAppChatLoader to include the character ~ in the sender name (#4420 ) Fixes #4153 If the sender of a message in a group chat isn't in your contact list, they will appear with a ~ prefix in the exported chat. This PR adds support for parsing such lines.	2023-05-09 15:00:04 -07:00
Zander Chase	f2150285a4	Fix nested runs example ID (#4413 ) #### Only reference example ID on the parent run Previously, I was assigning the example ID to every child run. Adds a test.	2023-05-09 12:21:53 -07:00
Davis Chase	e4ca511ec8	Delete comment (#4412 )	2023-05-09 10:38:44 -07:00
mbchang	9fafe7b2b9	fix: remove unnecessary line of code (#4408 ) Removes unnecessary line of code in https://python.langchain.com/en/latest/use_cases/agent_simulations/two_agent_debate_tools.html	2023-05-09 10:35:09 -07:00
Aivin V. Solatorio	6335cb5b3a	Add support for Qdrant nested filter (#4354 ) # Add support for Qdrant nested filter This extends the filter functionality for the Qdrant vectorstore. The current filter implementation is limited to a single-level metadata structure; however, Qdrant supports nested metadata filtering. This extends the functionality for users to maximize the filter functionality when using Qdrant as the vectorstore. Reference: https://qdrant.tech/documentation/filtering/#nested-key --------- Signed-off-by: Aivin V. Solatorio <avsolatorio@gmail.com>	2023-05-09 10:34:11 -07:00
Martin Holzhauer	872605a5c5	Add an option to extract more metadata from crawled websites (#4347 ) This pr makes it possible to extract more metadata from websites for later use. my usecase: parsing ld+json or microdata from sites and store it as structured data in the metadata field	2023-05-09 10:18:33 -07:00
Leonid Ganeline	ce15ffae6a	added `Wikipedia` retriever (#4302 ) - added `Wikipedia` retriever. It is effectively a wrapper for `WikipediaAPIWrapper`. It wrapps load() into get_relevant_documents() - sorted `__all__` in the `retrievers/__init__` - added integration tests for the WikipediaRetriever - added an example (as Jupyter notebook) for the WikipediaRetriever	2023-05-09 10:08:39 -07:00
Davis Chase	ea83eed9ba	Bump to version 0.0.163 (#4382 )	2023-05-09 07:51:51 -07:00
Prayson Wilfred Daniel	2b4ba203f7	query correction from when to what (#4383 ) # Minor Wording Documentation Change ```python agent_chain.run("When's my friend Eric's surname?") # Answer with 'Zhu' ``` is change to ```python agent_chain.run("What's my friend Eric's surname?") # Answer with 'Zhu' ``` I think when is a residual of the old query that was "When’s my friends Eric`s birthday?".	2023-05-09 07:42:47 -07:00
Eugene Yurtsev	2ceb807da2	Add PDF parser implementations (#4356 ) # Add PDF parser implementations This PR separates the data loading from the parsing for a number of existing PDF loaders. Parser tests have been designed to help encourage developers to create a consistent interface for parsing PDFs. This interface can be made more consistent in the future by adding information into the initializer on desired behavior with respect to splitting by page etc. This code is expected to be backwards compatible -- with the exception of a bug fix with pymupdf parser which was returning `bytes` in the page content rather than strings. Also changing the lazy parser method of document loader to return an Iterator rather than Iterable over documents. ## Before submitting <!-- If you're adding a new integration, include an integration test and an example notebook showing its use! --> ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @ <!-- For a quicker response, figure out the right person to tag with @ @hwchase17 - project lead Tracing / Callbacks - @agola11 Async - @agola11 DataLoader Abstractions - @eyurtsev LLM/Chat Wrappers - @hwchase17 - @agola11 Tools / Toolkits - @vowelparrot -->	2023-05-09 10:24:17 -04:00
Eugene Yurtsev	ae0c3382dd	Add MimeType based parser (#4376 ) # Add MimeType Based Parser This PR adds a MimeType Based Parser. The parser inspects the mime-type of the blob it is parsing and based on the mime-type can delegate to the sub parser. ## Before submitting Waiting on adding notebooks until more implementations are landed. ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @vowelparrot	2023-05-09 10:22:56 -04:00
Leonid Ganeline	c485e7ab59	added GitHub star number (#4214 ) added GitHub star number with a link to the `GitHub star history chart` This is an interesting chart https://star-history.com/#hwchase17/langchain :)	2023-05-09 09:39:53 -04:00
Heath	0d568daacb	Update writer integration (#4363 ) # Update Writer LLM integration Changes the parameters and base URL to be in line with Writer's current API. Based on the documentation on this page: https://dev.writer.com/reference/completions-1	2023-05-08 21:59:46 -07:00
BioErrorLog	04f765b838	Fix grammar in Text Splitters docs (#4373 ) # Fix grammar in Text Splitters docs Just a small fix of grammar in the documentation: "That means there two different axes" -> "That means there are two different axes"	2023-05-08 22:38:40 -04:00
Zander Chase	c73cec5ac1	Add Example Notebook for LCP Client (#4207 ) Add a notebook in the `experimental/` directory detailing: - How to capture traces with the v2 endpoint - How to create datasets - How to run traces over the dataset	2023-05-08 18:33:19 -07:00
mbchang	f1401a6dff	new example: two agent debate with tools (#4024 )	2023-05-08 17:10:44 -07:00
玄猫	deffc65693	fix: vectorstore pgvector ensure compatibility #3884 (#4248 ) Ensure compatibility with both SQLAlchemy v1/v2 fix the issue when using SQLAlchemy v1 (reported at #3884) ` langchain/vectorstores/pgvector.py", line 168, in create_tables_if_not_exists self._conn.commit() AttributeError: 'Connection' object has no attribute 'commit' ` Ref Doc : https://docs.sqlalchemy.org/en/14/changelog/migration_20.html#migration-20-autocommit	2023-05-08 16:43:50 -07:00
Davis Chase	ba0057c077	Check OpenAI model kwargs (#4366 ) Handle duplicate and incorrectly specified OpenAI params Thanks @PawelFaron for the fix! Made small update Closes #4331 --------- Co-authored-by: PawelFaron <42373772+PawelFaron@users.noreply.github.com> Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-08 16:37:34 -07:00
Davis Chase	02ebb15c4a	Fix TextSplitter.from_tiktoken(#4361 ) Thanks to @danb27 for the fix! Minor update Fixes https://github.com/hwchase17/langchain/issues/4357 --------- Co-authored-by: Dan Bianchini <42096328+danb27@users.noreply.github.com>	2023-05-08 16:36:38 -07:00
Naveen Tatikonda	782df1db10	OpenSearch: Add Similarity Search with Score (#4089 ) ### Description Add `similarity_search_with_score` method for OpenSearch to return scores along with documents in the search results Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	2023-05-08 16:35:21 -07:00
Ankush Gola	b3ecce0545	fix json saving, update docs to reference anthropic chat model (#4364 ) Fixes # (issue) https://github.com/hwchase17/langchain/issues/4085	2023-05-08 15:30:52 -07:00
ImmortalZ	b04d84f6b3	fix: solve the infinite loop caused by 'add_memory' function when run… (#4318 ) fix: solve the infinite loop caused by 'add_memory' function when run 'pause_to_reflect' function run steps: 'add_memory' -> 'pause_to_reflect' -> 'add_memory': infinite loop	2023-05-08 15:13:23 -07:00
Eugene Yurtsev	aa11f7c89b	Add progress bar to filesystemblob loader, update pytest config for unit tests (#4212 ) This PR adds: * Option to show a tqdm progress bar when using the file system blob loader * Update pytest run configuration to be stricter * Adding a new marker that checks that required pkgs exist	2023-05-08 16:15:09 -04:00
Eduard van Valkenburg	f4c8502e61	fix for cosmos not loading old messages (#4094 ) I noticed cosmos was not loading old messages properly, fixed now.	2023-05-08 12:48:15 -07:00
Simba Khadder	d84df25466	Add example on how to use Featureform with langchain (#4337 ) Added an example on how to use Featureform to connecting_to_a_feature_store.ipynb .	2023-05-08 10:32:17 -07:00
Harrison Chase	42df78d396	bump ver 162 (#4346 )	2023-05-08 09:28:41 -07:00
Zander Chase	8b284f9ad0	Pass parsed inputs through to tool _run (#4309 )	2023-05-08 09:13:05 -07:00
Zander Chase	35c9e6ab40	Pass Callbacks through load_tools (#4298 ) - Update the load_tools method to properly accept `callbacks` arguments. - Add a deprecation warning when `callback_manager` is passed - Add two unit tests to check the deprecation warning is raised and to confirm the callback is passed through. Closes issue #4096	2023-05-08 08:44:26 -07:00
Zander Chase	0870a45a69	Add Pull Request Template (#4247 )	2023-05-08 08:34:37 -07:00
Jinto Jose	8a338412fa	mongodb support for chat history (#4266 )	2023-05-08 08:34:05 -07:00
Harrison Chase	f510940bde	add check for lower bound of lark (#4287 )	2023-05-08 08:31:05 -07:00
Harrison Chase	c8b0b6e6c1	add youtube tools (#4320 )	2023-05-08 08:29:30 -07:00
PawelFaron	1d1166ded6	Fixed huggingfacehub_api_token hadning in HuggingFaceEndpoint (#4335 ) Reported here: https://github.com/hwchase17/langchain/issues/4334 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-08 08:29:17 -07:00
Arjun Aravindan	637c61cffb	Add support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers (#4305 ) This commit adds support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers. This allows users to specify the Browser binary location which is required when deploying to services such as Heroku This change also includes updated documentation and type hints to reflect the new binary_location parameter and its usage. fixes #4304	2023-05-08 11:05:55 -04:00
Lior Neudorfer	65c95f9fb2	Better error when running chain without any args (#4294 ) Today, when running a chain without any arguments, the raised ValueError incorrectly specifies that user provided "both positional arguments and keyword arguments". This PR adds a more accurate error in that case.	2023-05-07 21:11:51 -07:00
Harrison Chase	edcd171535	bring back ref (#4308 )	2023-05-07 17:32:28 -07:00
Wuxian Zhang	6f386628c2	Permit unicode outputs when dumping json in GetElementsTool (#4276 ) Adds ensure_ascii=False when dumping json in the GetElementsTool Fixes issue https://github.com/hwchase17/langchain/issues/4265	2023-05-07 14:43:03 -07:00
Eugene Brodsky	a1001b29eb	Incorrect docstring for PythonCodeTextSplitter (#4296 ) Fixes a copy-paste error in the doctring	2023-05-07 14:04:54 -07:00
Ikko Eltociear Ashimine	f70e18a5b3	Fix typo in huggingface.py (#4277 ) enviroment -> environment	2023-05-07 11:37:06 -04:00
Eugene Yurtsev	0c646bb703	Minor clean up in BlobParser (#4210 ) Minor clean up to use `abstractmethod` and `ABC` instead of `abc.abstractmethod` and `abc.ABC`.	2023-05-07 11:32:53 -04:00
PawelFaron	04b74d0446	Adjusted GPT4All llm to streaming API and added support for GPT4All_J (#4131 ) Fix for these issues: https://github.com/hwchase17/langchain/issues/4126 https://github.com/hwchase17/langchain/issues/3839#issuecomment-1534258559 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-06 15:14:09 -07:00
Harrison Chase	075d9631f5	bump ver to 161 (#4239 )	2023-05-06 10:20:36 -07:00
Harrison Chase	64940e9d0f	docs for azure (#4238 )	2023-05-06 10:16:00 -07:00
Myeongseop Kim	747b5f87c2	Add HumanInputLLM (#4160 ) Related: #4028, I opened a new PR because (1) I was unable to unstage mistakenly committed files (I'm not familiar with git enough to resolve this issue), (2) I felt closing the original PR and opening a new PR would be more appropriate if I changed the class name. This PR creates HumanInputLLM(HumanLLM in #4028), a simple LLM wrapper class that returns user input as the response. I also added a simple Jupyter notebook regarding how and why to use this LLM wrapper. In the notebook, I went over how to use this LLM wrapper and showed example of testing `WikipediaQueryRun` using HumanInputLLM. I believe this LLM wrapper will be useful especially for debugging, educational or testing purpose.	2023-05-06 09:48:40 -07:00

1 2 3 4 5 ...

1834 Commits