langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-08 07:10:35 +00:00

Author	SHA1	Message	Date
Nguyen Trung Duc (john)	49e4aaf673	Fix subclassing OpenAIEmbeddings (#4500 ) # Fix subclassing OpenAIEmbeddings Fixes #4498 ## Before submitting - Problem: Caused by the annotated type `Tuple[()]`. - Fix: Change the annotated type to "Iterable[str]". Although tiktoken uses the [Collection[str]](`095924e02c/tiktoken/core.py (L80)`) type annotation, pydantic doesn't support the Collection type, and [Iterable](https://docs.pydantic.dev/latest/usage/types/#typing-iterables) is the closest to Collection.	2023-05-16 18:35:19 -07:00
Harrison Chase	08df80bed6	console callback verbose (#4696 ) add verbose callback Co-authored-by: vowelparrot <130414180+vowelparrot@users.noreply.github.com>	2023-05-17 01:28:43 +00:00
David Peterson	d5d4c0a172	Update summarize.ipynb (#4529 ) # Update order in which tasks are stated (logically correct) Fixes the order in which steps are placed under titles. @vowelparrot	2023-05-16 18:14:00 -07:00
Django	bcffc704c1	fix: agenerate miss run_manager args in llm.py (#4566 ) # fix: agenerate miss run_manager args in llm.py	2023-05-16 17:37:56 -07:00
Brendan Mannix	4e56d3119c	update qdrant docs to reflect the proper way to initialize Qdrant() constructor (#4596 ) # update qdrant docs to reflect the proper way to initialize Qdrant() constructor The [Qdrant docs](https://python.langchain.com/en/latest/modules/indexes/vectorstores/examples/qdrant.html) still contain an old reference for passing an `embedding_function` into the constructor. This is no longer supported. This PR updates the docs to reflect the proper way to initialize `Qdrant()` Old: ![Screenshot 2023-05-12 at 3 06 33 PM](https://github.com/hwchase17/langchain/assets/1552962/dd4063d2-2a07-4340-91bb-e305f7215ddd) New: ![Screenshot 2023-05-12 at 3 21 09 PM](https://github.com/hwchase17/langchain/assets/1552962/aebc3f63-1a8b-4ca3-93c0-a2ce30dcd282)	2023-05-16 17:30:38 -07:00
Sean Morgan	5372a06a8c	DOC: Fix SageMaker example (#4598 ) # Fix SageMaker example typing Since https://github.com/hwchase17/langchain/pull/3249 a new type `LLMContentHandler` is enforced for SageMaker Endpoints Fixes #4168	2023-05-16 17:28:16 -07:00
Steve Kim	e90654f39b	Added cleaning up the downloaded PDF files (#4601 ) ArxivAPIWrapper searches and downloads PDFs to get related information. But I found that it doesn't delete the downloaded file. The reason why this is a problem is that a lot of PDF files remain on the server. For example, one size is about 28M. So, I added a delete line because it's too big to maintain on the server. # Clean up downloaded PDF files - Changes: Added new line to delete downloaded file - Background: To get the information on arXiv's paper, ArxivAPIWrapper class downloads a PDF. It's a natural approach, but the wrapper retains a lot of PDF files on the server. - Problem: One size of PDFs is about 28M. It's too big to maintain on a small server like AWS. - Dependency: import os Thank you. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 17:26:56 -07:00
Quinn	6fbd5e837f	Update planner_prompt.py, change usery to user (#4623 ) # Fix misspell in planner_prompt.py before ``` Usery query: I want to buy a couch ``` after ``` User query: I want to buy a couch ```	2023-05-16 17:24:27 -07:00
Tony Zhang	432421ffa5	[Fix][GenerativeAgent] Get the memory importance score from regex matched group (#4636 ) # Get the memory importance score from regex matched group In `GenerativeAgentMemory`, the `_score_memory_importance()` will make a prompt to get a rating score. The prompt is: ``` prompt = PromptTemplate.from_template( "On the scale of 1 to 10, where 1 is purely mundane" + " (e.g., brushing teeth, making bed) and 10 is" + " extremely poignant (e.g., a break up, college" + " acceptance), rate the likely poignancy of the" + " following piece of memory. Respond with a single integer." + "\nMemory: {memory_content}" + "\nRating: " ) ``` For some LLM, it will respond with, for example, `Rating: 8`. Thus we might want to get the score from the matched regex group.	2023-05-16 16:59:50 -07:00
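The idea in this commit, reading the score from a regex match instead of parsing the whole response as a number, can be sketched as follows. This is a minimal illustration and not the code from the PR: the function name `parse_importance_score`, the pattern, and the scaling to the 0-1 range are assumptions.

```python
import re
from typing import Optional


def parse_importance_score(llm_response: str) -> Optional[float]:
    """Return the first integer in the response scaled to 0-1, or None.

    Hypothetical helper: the name, the pattern, and the division by 10
    are illustrative assumptions, not the library's implementation.
    """
    # Accept both a bare "8" and a prefixed "Rating: 8".
    match = re.search(r"(\d+)", llm_response)
    if match is None:
        return None
    return int(match.group(1)) / 10


if __name__ == "__main__":
    print(parse_importance_score("Rating: 8"))
    print(parse_importance_score("8"))
```

Returning `None` when nothing matches leaves the fallback decision to the caller.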
Daniel Maturana	be405ac139	Query_constructor.base.py function _get_prompt() not including passed examples. (#4680 ) The function _get_prompt() was returning the DEFAULT_EXAMPLES even if some custom examples were given. The returned FewShotPromptTemplate was using DEFAULT_EXAMPLES and not examples	2023-05-16 16:31:10 -07:00
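The bug described here is a common pattern: a default value is used unconditionally, so the caller's argument is ignored. A minimal sketch of the pattern and its fix, assuming hypothetical names (`select_examples_buggy`, `select_examples_fixed`) and a placeholder `DEFAULT_EXAMPLES` value that are not taken from the library:

```python
from typing import Dict, List, Optional

# Placeholder default; the real DEFAULT_EXAMPLES content is not shown in the commit message.
DEFAULT_EXAMPLES: List[Dict[str, str]] = [{"query": "default", "answer": "default"}]


def select_examples_buggy(
    examples: Optional[List[Dict[str, str]]] = None,
) -> List[Dict[str, str]]:
    """Mimics the reported bug: the passed examples are never used."""
    return DEFAULT_EXAMPLES


def select_examples_fixed(
    examples: Optional[List[Dict[str, str]]] = None,
) -> List[Dict[str, str]]:
    """Use the caller's examples when given, otherwise fall back to the defaults."""
    return examples if examples is not None else DEFAULT_EXAMPLES


if __name__ == "__main__":
    custom = [{"query": "custom", "answer": "custom"}]
    print(select_examples_buggy(custom))
    print(select_examples_fixed(custom))
```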
Anam Hira	3af448d72e	Update huggingface_tools.ipynb (#4700 )	2023-05-16 16:28:27 -07:00
rajib	e28f4a5f39	changed cohere.py to update the default model of embedding (#4709 ) # The Cohere embedding models no longer use the names large and small; these are deprecated. Changed the module's default model Fixes #4694 Co-authored-by: rajib76 <rajib76@yahoo.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 16:27:23 -07:00
charosen	75fe9d3555	Add from_file method to message prompt template (#4713 ) Feature: This PR adds `from_template_file` class method to BaseStringMessagePromptTemplate. This is useful to help user to create message prompt templates directly from template files, including `ChatMessagePromptTemplate`, `HumanMessagePromptTemplate`, `AIMessagePromptTemplate` & `SystemMessagePromptTemplate`. Tests: Unit tests have been added in this PR. Co-authored-by: charosen <charosen@bupt.cn> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 16:25:17 -07:00
Chandan Routray	e8d46bdd9b	Replaced `SQLDatabaseChain` deprecated direct initialisation with `from_llm` method (#4778 ) # Removed usage of deprecated methods Replaced `SQLDatabaseChain` deprecated direct initialisation with `from_llm` method ## Who can review? @hwchase17 @agola11 --------- Co-authored-by: imeckr <chandanroutray2012@gmail.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 15:59:06 -07:00
Chandan Routray	11341fcecb	Fixed query checker for SQLDatabaseChain (#4780 ) # Fixed query checker for SQLDatabaseChain When `SQLDatabaseChain`'s llm attribute was deprecated, the query checker stopped working if `SQLDatabaseChain` is initialised via `from_llm` method. With this fix, `SQLDatabaseChain`'s query checker would use the same `llm` as used in the `llm_chain` ## Who can review? @hwchase17 - project lead Co-authored-by: imeckr <chandanroutray2012@gmail.com>	2023-05-16 15:58:58 -07:00
Yeong0228	08876ad066	Fix SelfQueryRetriever, passing new query to vector store (#4774 ) # Fix SelfQueryRetriever, passing new query to vector store	2023-05-16 15:46:22 -07:00
Mark Pors	8fd4d5d117	Added dependencies to make example executable (#4790 ) - Installation of non-colab packages - Get API keys # Added dependencies to make notebook executable on hosted notebooks ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @vowelparrot	2023-05-16 15:46:09 -07:00
Mark Pors	5bc7082e82	Cleanup and added dependencies to make example executable (#4795 ) - Installation of non-colab packages - Get API keys - Get rid of warnings # Cleanup and added dependencies to make notebook executable on hosted notebooks @hwchase17 @vowelparrot	2023-05-16 15:29:01 -07:00
keenangraham	bcce9a3a92	Fix age inconsistency in plan and execute Jupyter notebook example (#4814 ) The current example in https://python.langchain.com/en/latest/modules/agents/plan_and_execute.html has inconsistent reasoning step (observing 28 years and thinking it's 26 years): ``` Observation: 28 years Thought:Based on my search, Gigi Hadid's current age is 26 years old. Action: { "action": "Final Answer", "action_input": "Gigi Hadid's current age is 26 years old." } ``` Guessing this is model noise. Rerunning seems to give correct answer of 28 years.	2023-05-16 15:27:27 -07:00
Prateek K. Keshari	61f9c52fc7	Update twitter-the-algorithm-analysis-deeplake.ipynb (#4812 ) Changed model to model_name	2023-05-16 15:27:15 -07:00
yujiosaka	6561efebb7	Accept uuids kwargs for weaviate (#4800 ) # Accept uuids kwargs for weaviate Fixes #4791	2023-05-16 15:26:46 -07:00
Adam Quigley	e78c9be312	Add Confluence Loader unit tests (#3333 ) Adds some basic unit tests for the ConfluenceLoader that can be extended later. Ports this [PR from llama-hub](https://github.com/emptycrown/llama-hub/pull/208) and adapts it to `langchain`. @Jflick58 and @zywilliamli adding you here as potential reviewers --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 15:17:07 -07:00
Magnus Friberg	d126276693	Specify which data to return from chromadb (#4393 ) # Improve the Chroma get() method by adding the optional "include" parameter. The Chroma get() method excludes embeddings by default. You can customize the response by specifying the "include" parameter to selectively retrieve the desired data from the collection. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 14:43:09 -07:00
Raduan Al-Shedivat	00c6ec8a2d	fix(document_loaders/telegram): fix pandas calls + add tests (#4806 ) # Fix Telegram API loader + add tests. I was testing this integration and it was broken with next error: ```python message_threads = loader._get_message_threads(df) KeyError: False ``` Also, this particular loader didn't have any tests / related group in poetry, so I added those as well. @hwchase17 / @eyurtsev please take a look on this fix PR. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-16 14:35:25 -07:00
Zander Chase	206c87d525	Change server start name (#4811 ) to `langchain plus start/stop`	2023-05-16 20:04:09 +00:00
Eugene Yurtsev	255690d78e	Catch changes to test group (#4802 ) # Catch changes to test group Add test to catch changes to test group.	2023-05-16 14:48:56 -04:00
Eugene Yurtsev	c3b6129beb	Block sockets for unit-tests (#4803 ) # Block usage of sockets during unit tests Catch any tests that attempt to use the network.	2023-05-16 14:41:24 -04:00
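One way to catch tests that touch the network is to replace `socket.socket` with a function that raises while the tests run. The sketch below uses only the standard library and is a stand-in for illustration; the commit message does not say how the project blocks sockets, and the names `block_sockets` and `SocketBlockedError` are hypothetical.

```python
import socket
from contextlib import contextmanager
from typing import Iterator


class SocketBlockedError(RuntimeError):
    """Raised when code tries to open a socket while sockets are blocked."""


@contextmanager
def block_sockets() -> Iterator[None]:
    """Temporarily replace socket.socket so that any attempt to create one fails."""
    original = socket.socket

    def guarded(*args: object, **kwargs: object) -> None:
        raise SocketBlockedError("network access is disabled in unit tests")

    socket.socket = guarded  # type: ignore[assignment,misc]
    try:
        yield
    finally:
        # Always restore the real class, even if the test body raised.
        socket.socket = original  # type: ignore[misc]


if __name__ == "__main__":
    with block_sockets():
        try:
            socket.socket()
        except SocketBlockedError as exc:
            print(f"blocked: {exc}")
```

In a pytest suite the same idea would usually live in an autouse fixture, so every unit test runs with sockets blocked.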
了空	f7e3d97b19	Remove unnecessary spaces from document object’s page_content of BiliBiliLoader (#4619 ) - Remove unnecessary spaces from document object’s page_content of BiliBiliLoader - Fix BiliBiliLoader document and test file	2023-05-16 13:13:57 -04:00
Eugene Yurtsev	f47ec5b4b6	Docugami docs: First cell should be a title cell (#4735 ) # Make first cell a title in docugami docs This makes the first cell a title cell in docugami notebook	2023-05-16 13:12:14 -04:00
Eugene Yurtsev	d403f659ea	Update google protobuf dep (#4798 ) # Update google protobuf dep Resolve: https://github.com/hwchase17/langchain/security/dependabot/11	2023-05-16 12:25:07 -04:00
Eugene Yurtsev	3ecd7c9641	Add check to verify poetry.toml (#4794 ) # Add poetry check to github action Check poetry toml file during tests for errors	2023-05-16 11:53:06 -04:00
Ikko Eltociear Ashimine	f5a476fdd4	Fix typo in dataframe.py (#4786 ) # Fix typo in dataframe.py (#4786) Fixed typo. ``` yeild -> yield ```	2023-05-16 11:49:04 -04:00
Eugene Yurtsev	14bedf1cc5	Github Action: Fix poetry lock file checking (#4789 ) Fix how poetry lock file is checked to avoid skipping caches silently.	2023-05-16 11:40:28 -04:00
Davis Chase	7ce43372c3	Version 171 (#4788 )	2023-05-16 08:24:45 -07:00
Zander Chase	bee136efa4	Update Tracing Walkthrough (#4760 ) Add client methods to read / list runs and sessions. Update walkthrough to: - Let the user create a dataset from the runs without going to the UI - Use the new CLI command to start the server Improve the error message when `docker` isn't found	2023-05-16 13:26:43 +00:00
Zander Chase	fc0a3c8500	Persist Volume After Stop (#4763 ) Previously, the data would be removed after shutting down the server. This mounts a db volume that isn't erased between calls	2023-05-16 13:10:13 +00:00
Harrison Chase	a7af32c274	Cassandra support for chat history (#4378 ) (#4764 ) # Cassandra support for chat history ### Description - Store chat messages in cassandra ### Dependency - cassandra-driver - Python Module ## Before submitting - Added Integration Test ## Who can review? @hwchase17 @agola11 Co-authored-by: Jinto Jose <129657162+jj701@users.noreply.github.com>	2023-05-15 23:43:09 -07:00
Harrison Chase	c4c7936caa	Harrison/wiki loader (#4765 ) Co-authored-by: Guillermo Segovia <T1b4lt@users.noreply.github.com>	2023-05-15 23:42:57 -07:00
Filip Haltmayer	c632f7fc4e	Add Milvus and Zilliz Retrievals (#4416 ) Adds the basic retrievers for Milvus and Zilliz. Hybrid search support will be added in the future. Signed-off-by: Filip Haltmayer <filip.haltmayer@zilliz.com>	2023-05-15 21:22:54 -07:00
Bradley James	2e43954bc3	fixed on_llm issue (#4717 ) Fixes #4714	2023-05-16 01:36:21 +00:00
Zander Chase	bf0904b676	Add Server Command (#4695 ) Add Support for `langchain server {start|stop}` commands, with support for using ngrok to tunnel to a remote notebook	2023-05-16 00:44:30 +00:00
Anirudh Suresh	03ac39368f	Fixing DeepLake Overwrite Flag (#4683 ) # Fix DeepLake Overwrite Flag Issue Fixes Issue #4682: essentially, setting overwrite to False in the DeepLake constructor still triggers an overwrite, because the logic is just checking for the presence of "overwrite" in kwargs. The fix is simple--just add some checks to inspect if "overwrite" in kwargs AND kwargs["overwrite"]==True. Added a new test in tests/integration_tests/vectorstores/test_deeplake.py to reflect the desired behavior. Co-authored-by: Anirudh Suresh <ani@Anirudhs-MBP.cable.rcn.com> Co-authored-by: Anirudh Suresh <ani@Anirudhs-MacBook-Pro.local> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 17:39:16 -07:00
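The difference between checking for the presence of a keyword and checking its value can be shown in a few lines. This is an illustrative sketch, not the DeepLake code; the function names are hypothetical.

```python
from typing import Any


def should_overwrite_buggy(**kwargs: Any) -> bool:
    """Mimics the reported bug: only the presence of the key is checked."""
    return "overwrite" in kwargs


def should_overwrite_fixed(**kwargs: Any) -> bool:
    """Overwrite only when the flag is present and truthy."""
    return bool(kwargs.get("overwrite", False))


if __name__ == "__main__":
    print(should_overwrite_buggy(overwrite=False))  # True: the flag value is ignored
    print(should_overwrite_fixed(overwrite=False))  # False
```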
d 3 n 7	8bb32d77d0	Update utils.py to make headless an optional argument (#4745 ) Making headless an optional argument for create_async_playwright_browser() and create_sync_playwright_browser() By default no functionality is changed. This allows for disabled people to use a web browser intelligently with their voice, for example, while still seeing the content on the screen. As well as many other use cases --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 17:29:06 -07:00
Mose Tronci	a9dbe90447	Exponential back-off support for Google PaLM api (#4001 ) This PR adds exponential back-off to the Google PaLM api to gracefully handle rate limiting errors. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 17:21:11 -07:00
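Exponential back-off retries a failing call with a delay that doubles after each attempt, up to a cap. The sketch below shows the general technique with the standard library only; it is not the code from the PR, and `retry_with_backoff` and all of its parameters are assumptions made for illustration.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    func: Callable[[], T],
    *,
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    retry_on: Tuple[Type[BaseException], ...] = (Exception,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func, retrying on the given exceptions with doubling delays."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts - 1:
                raise  # Out of attempts: surface the last error.
            # Delay grows as base_delay * 2**attempt, capped at max_delay.
            sleep(min(max_delay, base_delay * 2**attempt))
    raise RuntimeError("unreachable")  # Keeps type checkers satisfied.


if __name__ == "__main__":
    calls = {"count": 0}

    def flaky() -> str:
        # Fails twice, then succeeds, to stand in for a rate-limited API.
        calls["count"] += 1
        if calls["count"] < 3:
            raise ConnectionError("rate limited")
        return "ok"

    print(retry_with_backoff(flaky, base_delay=0.01))
```

Passing `sleep` as a parameter makes the delays observable in tests without waiting. Production code often also adds random jitter to the delay so that many clients do not retry at the same moment.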
Leonid Ganeline	a6f3ec94bc	docs: added `additional_resources` folder (#4748 ) # docs: added `additional_resources` folder The additional resource files were inside the doc top-level folder, which polluted the top-level folder. - added the `additional_resources` folder and moved correspondent files to this folder; - fixed a broken link to the "Model comparison" page (model_laboratory notebook) - fixed a broken link to one of the YouTube videos (sorry, it is not directly related to this PR) ## Who can review? @dev2049	2023-05-15 17:12:47 -07:00
Zander Chase	a128d95aeb	Fix Async Shared Resource Bug (#4751 ) Use an async queue to distribute tracers rather than inappropriately sharing a single one	2023-05-16 00:04:01 +00:00
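The approach named in this commit, handing out tracers through an async queue so that no two concurrent tasks hold the same one, can be sketched with `asyncio.Queue`. This is a generic illustration under stated assumptions: `FakeTracer`, `run_with_tracer`, and `main` are hypothetical names and do not reflect the library's tracer classes.

```python
import asyncio
from typing import List, Tuple


class FakeTracer:
    """Hypothetical stand-in for a tracer that must not be used concurrently."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.in_use = False


async def run_with_tracer(
    queue: "asyncio.Queue[FakeTracer]", job_id: int, log: List[Tuple[int, str]]
) -> None:
    # Wait until a tracer is free; this task then holds it exclusively.
    tracer = await queue.get()
    try:
        if tracer.in_use:
            raise RuntimeError("tracer shared between concurrent tasks")
        tracer.in_use = True
        await asyncio.sleep(0)  # Yield control to simulate async work.
        log.append((job_id, tracer.name))
    finally:
        tracer.in_use = False
        queue.put_nowait(tracer)  # Return the tracer for the next task.


async def main() -> Tuple[List[Tuple[int, str]], int]:
    queue: "asyncio.Queue[FakeTracer]" = asyncio.Queue()
    for i in range(2):
        queue.put_nowait(FakeTracer(f"tracer-{i}"))
    log: List[Tuple[int, str]] = []
    # Five jobs compete for two tracers without ever sharing one.
    await asyncio.gather(*(run_with_tracer(queue, j, log) for j in range(5)))
    return log, queue.qsize()


if __name__ == "__main__":
    print(asyncio.run(main()))
```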
whuwxl	3f0357f94a	Add summarization task type for HuggingFace APIs (#4721 ) # Add summarization task type for HuggingFace APIs Add summarization task type for HuggingFace APIs. This task type is described by [HuggingFace inference API](https://huggingface.co/docs/api-inference/detailed_parameters#summarization-task) My project utilizes LangChain to connect multiple LLMs, including various HuggingFace models that support the summarization task. Integrating this task type is highly convenient and beneficial. Fixes #4720	2023-05-15 16:26:17 -07:00
Zander Chase	580861e7f2	Revert "Make serpapi base url configurable via env (#4402 )" (#4750 ) This reverts commit `5111bec540`. This PR introduced a bug in the async API (the `url` param isn't bound); it also didn't update the synchronous API correctly, which makes it error-prone (the behavior of the async and sync endpoints would be different)	2023-05-15 16:17:16 -07:00
shiyu22	21b9397342	Update the milvus example (#4706 ) # Fix issue when running example - add the query content - update the `user` parameter with Zilliz Signed-off-by: shiyu22 <shiyu.chen@zilliz.com>	2023-05-15 16:16:57 -07:00
hilarious-viking	7d15669b41	llama-cpp: add gpu layers parameter (#4739 ) Adds gpu layers parameter to llama.cpp wrapper Co-authored-by: andrew.khvalenski <andrew.khvalenski@behavox.com> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>	2023-05-15 16:01:48 -07:00


2402 Commits