langchain

Author	SHA1	Message	Date
Zander Chase	0870a45a69	Add Pull Request Template (#4247 )	2023-05-08 08:34:37 -07:00
Jinto Jose	8a338412fa	mongodb support for chat history (#4266 )	2023-05-08 08:34:05 -07:00
Harrison Chase	f510940bde	add check for lower bound of lark (#4287 )	2023-05-08 08:31:05 -07:00
Harrison Chase	c8b0b6e6c1	add youtube tools (#4320 )	2023-05-08 08:29:30 -07:00
PawelFaron	1d1166ded6	Fixed huggingfacehub_api_token hadning in HuggingFaceEndpoint (#4335 ) Reported here: https://github.com/hwchase17/langchain/issues/4334 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-08 08:29:17 -07:00
Arjun Aravindan	637c61cffb	Add support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers (#4305 ) This commit adds support for passing binary_location to the SeleniumURLLoader when creating Chrome or Firefox web drivers. This allows users to specify the Browser binary location which is required when deploying to services such as Heroku This change also includes updated documentation and type hints to reflect the new binary_location parameter and its usage. fixes #4304	2023-05-08 11:05:55 -04:00
Lior Neudorfer	65c95f9fb2	Better error when running chain without any args (#4294 ) Today, when running a chain without any arguments, the raised ValueError incorrectly specifies that user provided "both positional arguments and keyword arguments". This PR adds a more accurate error in that case.	2023-05-07 21:11:51 -07:00
Harrison Chase	edcd171535	bring back ref (#4308 )	2023-05-07 17:32:28 -07:00
Wuxian Zhang	6f386628c2	Permit unicode outputs when dumping json in GetElementsTool (#4276 ) Adds ensure_ascii=False when dumping json in the GetElementsTool Fixes issue https://github.com/hwchase17/langchain/issues/4265	2023-05-07 14:43:03 -07:00
Eugene Brodsky	a1001b29eb	Incorrect docstring for PythonCodeTextSplitter (#4296 ) Fixes a copy-paste error in the doctring	2023-05-07 14:04:54 -07:00
Ikko Eltociear Ashimine	f70e18a5b3	Fix typo in huggingface.py (#4277 ) enviroment -> environment	2023-05-07 11:37:06 -04:00
Eugene Yurtsev	0c646bb703	Minor clean up in BlobParser (#4210 ) Minor clean up to use `abstractmethod` and `ABC` instead of `abc.abstractmethod` and `abc.ABC`.	2023-05-07 11:32:53 -04:00
PawelFaron	04b74d0446	Adjusted GPT4All llm to streaming API and added support for GPT4All_J (#4131 ) Fix for these issues: https://github.com/hwchase17/langchain/issues/4126 https://github.com/hwchase17/langchain/issues/3839#issuecomment-1534258559 --------- Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-06 15:14:09 -07:00
Harrison Chase	075d9631f5	bump ver to 161 (#4239 )	2023-05-06 10:20:36 -07:00
Harrison Chase	64940e9d0f	docs for azure (#4238 )	2023-05-06 10:16:00 -07:00
Myeongseop Kim	747b5f87c2	Add HumanInputLLM (#4160 ) Related: #4028, I opened a new PR because (1) I was unable to unstage mistakenly committed files (I'm not familiar with git enough to resolve this issue), (2) I felt closing the original PR and opening a new PR would be more appropriate if I changed the class name. This PR creates HumanInputLLM(HumanLLM in #4028), a simple LLM wrapper class that returns user input as the response. I also added a simple Jupyter notebook regarding how and why to use this LLM wrapper. In the notebook, I went over how to use this LLM wrapper and showed example of testing `WikipediaQueryRun` using HumanInputLLM. I believe this LLM wrapper will be useful especially for debugging, educational or testing purpose.	2023-05-06 09:48:40 -07:00
Davis Chase	6cd51ef3d0	Simplify router chain constructor signatures (#4146 )	2023-05-06 09:38:17 -07:00
玄猫	43a7a89e93	opt: document_loader notiondb to extract url (#4222 )	2023-05-06 09:34:33 -07:00
Leonid Ganeline	9544b30821	added `Wikipedia` document loader (#4141 ) - Added the `Wikipedia` document loader. It is based on the existing `unilities/WikipediaAPIWrapper` - Added a respective ut-s and example notebook - Sorted list of classes in __init__	2023-05-06 09:32:45 -07:00
Eugene Yurtsev	423f497168	Add BlobParser abstraction (#3979 ) This PR adds the BlobParser abstraction. It follows the proposal described here: https://github.com/hwchase17/langchain/pull/2833#issuecomment-1509097756	2023-05-05 21:43:38 -04:00
Davis Chase	5ca13cc1f0	Dev2049/pypdfium2 (#4209 ) thanks @jerrytigerxu for the addition! --------- Co-authored-by: Jere Xu <jtxu2008@gmail.com> Co-authored-by: jerrytigerxu <jere.tiger.xu@gmailc.om>	2023-05-05 17:55:31 -07:00
Leonid Ganeline	59204a5033	docs: `document_loaders` improvements (#4200 ) - made notebooks consistent: titles, service/format descriptions. - corrected short names to full names, for example, `Word` -> `Microsoft Word` - added missed descriptions - renamed notebook files to make ToC correctly sorted	2023-05-05 17:44:54 -07:00
Harrison Chase	eeb7c96e0c	bump version to 160 (#4205 )	2023-05-05 17:02:39 -07:00
Davis Chase	f1fc4dfebc	Dev2049/obsidian patch (#4204 ) thanks @shkarlsson for the fix! (just updated formatting) --------- Co-authored-by: shkarlsson <sven.henrik.karlsson@gmail.com>	2023-05-05 16:49:19 -07:00
George	2324f19c85	Update qdrant interface (#3971 ) Hello 1) Passing `embedding_function` as a callable seems to be outdated and the common interface is to pass `Embeddings` instance 2) At the moment `Qdrant.add_texts` is designed to be used with `embeddings.embed_query`, which is 1) slow 2) causes ambiguity due to 1. It should be used with `embeddings.embed_documents` This PR solves both problems and also provides some new tests	2023-05-05 16:46:40 -07:00
Harrison Chase	76ed41f48a	update docs (#4194 )	2023-05-05 16:45:26 -07:00
Zander Chase	1017e5cee2	Add LCP Client (#4198 ) Adding a client to fetch datasets, examples, and runs from a LCP instance and run objects over them.	2023-05-05 16:28:56 -07:00
Zander Chase	a30f42da4e	Update V2 Tracer (#4193 ) - Update the RunCreate object to work with recent changes - Add optional Example ID to the tracer - Adjust default persist_session behavior to attempt to load the session if it exists - Raise more useful HTTP errors for logging - Add unit testing - Fix the default ID to be a UUID for v2 tracer sessions Broken out from the big draft here: https://github.com/hwchase17/langchain/pull/4061	2023-05-05 14:55:01 -07:00
Mike Wang	c3044b1bf0	[test] Add integration_test for PandasAgent (#4056 ) - confirm creation - confirm functionality with a simple dimension check. The test now is calling OpenAI API directly, but learning from @vowelparrot that we’re caching the requests, so that it’s not that expensive. I also found we’re calling OpenAI api in other integration tests. Please lmk if there is any concern of real external API calls. I can alternatively make a fake LLM for this test. Thanks	2023-05-05 14:49:02 -07:00
Aivin V. Solatorio	6567b73e1a	JSON loader (#4067 ) This implements a loader of text passages in JSON format. The `jq` syntax is used to define a schema for accessing the relevant contents from the JSON file. This requires dependency on the `jq` package: https://pypi.org/project/jq/. --------- Signed-off-by: Aivin V. Solatorio <avsolatorio@gmail.com>	2023-05-05 14:48:13 -07:00
PawelFaron	bb6d97c18c	Fixed the example code (#4117 ) Fixed the issue mentioned here: https://github.com/hwchase17/langchain/issues/3799#issuecomment-1534785861 Co-authored-by: Pawel Faron <ext-pawel.faron@vaisala.com>	2023-05-05 14:22:10 -07:00
Anurag	19e28d8784	feat: Allow users to pass additional arguments to the WebDriver (#4121 ) This commit adds support for passing additional arguments to the `SeleniumURLLoader ` when creating Chrome or Firefox web drivers. Previously, only a few arguments such as `headless` could be passed in. With this change, users can pass any additional arguments they need as a list of strings using the `arguments` parameter. The `arguments` parameter allows users to configure the driver with any options that are available for that particular browser. For example, users can now pass custom `user_agent` strings or `proxy` settings using this parameter. This change also includes updated documentation and type hints to reflect the new `arguments` parameter and its usage. fixes #4120	2023-05-05 13:24:42 -07:00
hp0404	2a3c5f8353	Update WhatsAppChatLoader regex to handle multiple date-time formats (#4186 ) This PR updates the `message_line_regex` used by `WhatsAppChatLoader` to support different date-time formats used in WhatsApp chat exports; resolves #4153. The new regex handles the following input formats: ```terminal [05.05.23, 15:48:11] James: Hi here [11/8/21, 9:41:32 AM] User name: Message 123 1/23/23, 3:19 AM - User 2: Bye! 1/23/23, 3:22_AM - User 1: And let me know if anything changes ``` Tests have been added to verify that the loader works correctly with all formats.	2023-05-05 13:13:05 -07:00
Nicolas	a57259ec83	docs: Mendable Fixes and Improvements (#4184 ) Overall fixes and improvements.	2023-05-05 13:04:24 -07:00
Harrison Chase	7dcc698ebf	bump version to 159 (#4183 )	2023-05-05 09:31:08 -07:00
Harrison Chase	26534457f5	simplify csv args (#4182 )	2023-05-05 09:22:08 -07:00
Eduard van Valkenburg	3095546851	PowerBI fix for table names with spaces (#4170 ) small fix to make sure a table name with spaces is passed correctly to the API for the schema lookup.	2023-05-05 09:15:47 -07:00
obbiondo	b1e2e29222	fix: remove expand parameter from ConfluenceLoader by label (#4181 ) expand is not an allowed parameter for the method confluence.get_all_pages_by_label, since it doesn't return the body of the text but just metadata of documents Co-authored-by: Andrea Biondo <a.biondo@reply.it>	2023-05-05 09:15:21 -07:00
Zander Chase	84cfa76e00	Update Cohere Reranker (#4180 ) The forward ref annotations don't get updated if we only iimport with type checking --------- Co-authored-by: Abhinav Verma <abhinav_win12@yahoo.co.in>	2023-05-05 09:11:37 -07:00
Davis Chase	d84bb02881	Add Chroma self query (#4149 ) Add internal query language -> chroma metadata filter translator	2023-05-05 08:43:08 -07:00
Vinoo Ganesh	905a2114d7	Fix: Typo in Docs (#4179 ) Fixing small typo in docs	2023-05-05 08:35:49 -07:00
Ankush Gola	8de1b4c4c2	Revert "fix: #4128 missing run_manager parameter" (#4159 ) Reverts hwchase17/langchain#4130	2023-05-05 00:52:16 -07:00
Chakib Ben Ziane	878d0c8155	fix: #4128 missing run_manager parameter (#4130 ) `run_manager` was not being passed downstream. Not sure if this was a deliberate choice but it seems like it broke many agent callbacks like `agent_action` and `agent_finish`. This fix needs a proper review. Co-authored-by: blob42 <spike@w530>	2023-05-04 23:59:55 -07:00
Zander Chase	6032a051e9	Add Tenant ID to V2 Tracer (#4135 ) Update the V2 tracer to - use UUIDs instead of int's - load a tenant ID and use that when saving sessions	2023-05-04 21:35:20 -07:00
Zander Chase	fea639c1fc	Vwp/sqlalchemy (#4145 ) Bump threshold to 1.4 from 1.3. Change import to be compatible Resolves #4142 and #4129 --------- Co-authored-by: ndaugreal <ndaugreal@gmail.com> Co-authored-by: Jeremy Lopez <lopez86@users.noreply.github.com>	2023-05-04 20:46:38 -07:00
Zander Chase	2f087d63af	Fix Python RePL Tool (#4137 ) Filter out kwargs from inferred schema when determining if a tool is single input. Add a couple unit tests. Move tool unit tests to the tools dir	2023-05-04 20:31:16 -07:00
Zander Chase	cc068f1b77	Add Issue Templates (#4021 ) Add issue templates for - bug reports - feature suggestions - documentation and a link to the discord for general discussion. Open to other suggestions here. Could also add another "Other" template with just a raw text box if we think this is too restrictive <img width="1464" alt="image" src="https://user-images.githubusercontent.com/130414180/236115358-e603bcbe-282c-40c7-82eb-905eb93ccec0.png">	2023-05-04 16:33:52 -07:00
Zander Chase	ac0a9d02bd	Visual Studio Code/Github Codespaces Dev Containers (#4035 ) (#4122 ) Having dev containers makes its easier, faster and secure to setup the dev environment for the repository. The pull request consists of: - .devcontainer folder with: - devcontainer.json : (minimal necessary vscode extensions and settings) - docker-compose.yaml : (could be modified to run necessary services as per need. Ex vectordbs, databases) - Dockerfile:(non root with dev tools) - Changes to README - added the Open in Github Codespaces Badge - added the Open in dev container Badge Co-authored-by: Jinto Jose <129657162+jj701@users.noreply.github.com>	2023-05-04 11:37:00 -07:00
Harrison Chase	d86ed15d88	bump version to 158 (#4091 )	2023-05-04 09:14:47 -07:00
OlajideOgun	624554a43a	DeepLake: Pass in rest of args to self._search_helper (#4080 ) As of right now when trying to use functions like `max_marginal_relevance_search()` or `max_marginal_relevance_search_by_vector()` the rest of the kwargs are not propagated to `self._search_helper()`. For example a user cannot explicitly state the distance_metric they want to use when calling `max_marginal_relevance_search`	2023-05-04 02:14:22 -07:00

1 2 3 4 5 ...

1800 Commits