langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	72f9150a50	Update 2 more pydantic imports (#9203 ) Update two more pydantic imports to use v1 explicitly	2023-08-14 10:11:30 -04:00
Eugene Yurtsev	c172f972ea	Create pydantic v1 namespace, add partial compatibility for pydantic v2 (#9123 ) First of a few PRs to add full compatibility to both pydantic v1 and v2. This PR creates pydantic v1 namespace and adds it to sys.modules. Upcoming changes: 1. Handle `openapi-schema-pydantic = "^1.2"` and dependent chains/tools 2. bump dependencies to versions that are cross compatible for pydantic or remove them (see below) 3. Add tests to github workflows to test with pydantic v1 and v2 Dependencies From a quick look (could be wrong since was done manually) dependencies pinning pydantic below 2 (some of these can be bumped to newer versions are provide cross-compatible code) anthropic bentoml confection fastapi langsmith octoai-sdk openapi-schema-pydantic qdrant-client spacy steamship thinc zep-python Unpinned marqo () nomic () xinference(*)	2023-08-14 09:37:32 -04:00
Evan Schultz	8189dea0d8	Fixes typing issues in BaseOpenAI (#9183 ) ## Description: Sets default values for `client` and `model` attributes in the BaseOpenAI class to fix Pylance Typing issue. - Issue: #9182. - Twitter handle: @evanmschultz	2023-08-13 23:03:28 -07:00
Massimiliano Pronesti	d95eeaedbe	feat(llms): support vLLM's OpenAI-compatible server (#9179 ) This PR aims at supporting [vLLM's OpenAI-compatible server feature](https://vllm.readthedocs.io/en/latest/getting_started/quickstart.html#openai-compatible-server), i.e. allowing to call vLLM's LLMs like if they were OpenAI's. I've also udpated the related notebook providing an example usage. At the moment, vLLM only supports the `Completion` API.	2023-08-13 23:03:05 -07:00
Michael Goin	621da3c164	Adds DeepSparse as an LLM (#9184 ) Adds [DeepSparse](https://github.com/neuralmagic/deepsparse) as an LLM backend. DeepSparse supports running various open-source sparsified models hosted on [SparseZoo](https://sparsezoo.neuralmagic.com/) for performance gains on CPUs. Twitter handles: @mgoin_ @neuralmagic --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-13 22:35:58 -07:00
Bagatur	0fa69d8988	Bagatur/zep python 1.0 (#9186 ) Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com>	2023-08-13 21:52:53 -07:00
Eugene Yurtsev	9b24f0b067	Enhance deprecation decorator to modify docs with sphinx directives (#9069 ) Enhance deprecation decorator	2023-08-13 15:35:01 -04:00
Harrison Chase	8d69dacdf3	multiple retreival in parralel (#9174 )	2023-08-13 10:03:54 -07:00
Bagatur	cdfe2c96c5	bump 263 (#9156 )	2023-08-12 12:36:44 -07:00
Leonid Ganeline	19f504790e	docstrings: document_loaders consitency 2 (#9148 ) This is Part 2. See #9139 (Part 1).	2023-08-11 16:25:40 -07:00
Harrison Chase	1b58460fe3	update keys for chain (#5164 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 16:25:13 -07:00
Eugene Yurtsev	aca8cb5fba	API Reference: Do not document private modules (#9042 ) This PR prevents documentation of private modules in the API reference	2023-08-11 15:58:14 -07:00
胡亮	7edf4ca396	Support multi gpu inference for HuggingFaceEmbeddings (#4732 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 15:55:44 -07:00
UmerHA	8aab39e3ce	Added SmartGPT workflow (issue #4463 ) (#4816 ) # Added SmartGPT workflow by providing SmartLLM wrapper around LLMs Edit: As @hwchase17 suggested, this should be a chain, not an LLM. I have adapted the PR. It is used like this: ``` from langchain.prompts import PromptTemplate from langchain.chains import SmartLLMChain from langchain.chat_models import ChatOpenAI hard_question = "I have a 12 liter jug and a 6 liter jug. I want to measure 6 liters. How do I do it?" hard_question_prompt = PromptTemplate.from_template(hard_question) llm = ChatOpenAI(model_name="gpt-4") prompt = PromptTemplate.from_template(hard_question) chain = SmartLLMChain(llm=llm, prompt=prompt, verbose=True) chain.run({}) ``` Original text: Added SmartLLM wrapper around LLMs to allow for SmartGPT workflow (as in https://youtu.be/wVzuvf9D9BU). SmartLLM can be used wherever LLM can be used. E.g: ``` smart_llm = SmartLLM(llm=OpenAI()) smart_llm("What would be a good company name for a company that makes colorful socks?") ``` or ``` smart_llm = SmartLLM(llm=OpenAI()) prompt = PromptTemplate( input_variables=["product"], template="What is a good name for a company that makes {product}?", ) chain = LLMChain(llm=smart_llm, prompt=prompt) chain.run("colorful socks") ``` SmartGPT consists of 3 steps: 1. Ideate - generate n possible solutions ("ideas") to user prompt 2. Critique - find flaws in every idea & select best one 3. Resolve - improve upon best idea & return it Fixes #4463 ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: - @hwchase17 - @agola11 Twitter: [@UmerHAdil](https://twitter.com/@UmerHAdil) \| Discord: RicChilligerDude#7589 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 15:44:27 -07:00
Lucas Pickup	1d3735a84c	Ensure deployment_id is set to provided deployment, required for Azure OpenAI. (#5002 ) # Ensure deployment_id is set to provided deployment, required for Azure OpenAI. --------- Co-authored-by: Lucas Pickup <lupickup@microsoft.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 15:43:01 -07:00
Bagatur	45741bcc1b	Bagatur/vectara nit (#9140 ) Co-authored-by: Ofer Mendelevitch <ofer@vectara.com>	2023-08-11 15:32:03 -07:00
Dominick DEV	9b64932e55	Add LangChain utility for real-time crypto exchange prices (#4501 ) This commit adds the LangChain utility which allows for the real-time retrieval of cryptocurrency exchange prices. With LangChain, users can easily access up-to-date pricing information by running the command ".run(from_currency, to_currency)". This new feature provides a convenient way to stay informed on the latest exchange rates and make informed decisions when trading crypto. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 14:45:06 -07:00
Joshua Sundance Bailey	eaa505fb09	Create ArcGISLoader & example notebook (#8873 ) - Description: Adds the ArcGISLoader class to `langchain.document_loaders` - Allows users to load data from ArcGIS Online, Portal, and similar - Users can authenticate with `arcgis.gis.GIS` or retrieve public data anonymously - Uses the `arcgis.features.FeatureLayer` class to retrieve the data - Defines the most relevant keywords arguments and accepts `**kwargs` - Dependencies: Using this class requires `arcgis` and, optionally, `bs4.BeautifulSoup`. Tagging maintainers: - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 14:33:40 -07:00
Bagatur	e21152358a	fix (#9145 )	2023-08-11 13:58:23 -07:00
Leonid Ganeline	edb585228d	docstrings: document_loaders consitency (#9139 ) Formatted docstrings from different formats to consistent format, lile: >Loads processed docs from Docugami. "Load from `Docugami`." >Loader that uses Unstructured to load HTML files. "Load `HTML` files using `Unstructured`." >Load documents from a directory. "Load from a directory." - `Load` - no `Loads` - DocumentLoader always loads Documents, so no more "documents/docs/texts/ etc" - integrated systems and APIs enclosed in backticks,	2023-08-11 13:09:31 -07:00
Aashish Saini	0aabded97f	Updating interactive walkthrough link in index.md to resolve 404 error (#9063 ) Updated interactive walkthrough link in index.md to resolve 404 error. Also, expressing deep gratitude to LangChain library developers for their exceptional efforts 🥇 . --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 13:08:56 -07:00
Markus Schiffer	00bf472265	Fix for SVM retriever discarding document metadata (#9141 ) As stated in the title the SVM retriever discarded the metadata of passed in docs. This code fixes that. I also added one unit test that should test that. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 13:08:17 -07:00
Bagatur	bace17e0aa	rm integration deps (#9142 )	2023-08-11 12:43:08 -07:00
Eugene Yurtsev	44bc89b7bf	Support a few list like operations on ChatPromptTemplate (#9077 ) Make it easier to work with chat prompt template	2023-08-11 14:49:51 -04:00
Hai The Dude	e4418d1b7e	Added new use case docs for Web Scraping, Chromium loader, BS4 transformer (#8732 ) - Description: Added a new use case category called "Web Scraping", and a tutorial to scrape websites using OpenAI Functions Extraction chain to the docs. - Tag maintainer:@baskaryan @hwchase17 , - Twitter handle: https://www.linkedin.com/in/haiphunghiem/ (I'm on LinkedIn mostly) --------- Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 11:46:59 -07:00
sseide	6cb763507c	add basic support for redis cluster server (#9128 ) This change updates the central utility class to recognize a Redis cluster server after connection and returns an new cluster aware Redis client. The "normal" Redis client would not be able to talk to a cluster node because keys might be stored on other shards of the Redis cluster and therefor not readable or writable. With this patch clients do not need to know what Redis server it is, they just connect though the same API calls for standalone and cluster server. There are no dependencies added due to this MR. Remark - with current redis-py client library (4.6.0) a cluster cannot be used as VectorStore. It can be used for other use-cases. There is a bug / missing feature(?) in the Redis client breaking the VectorStore implementation. I opened an issue at the client library too (redis/redis-py#2888) to fix this. As soon as this is fixed in `redis-py` library it should be usable there too. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-11 11:37:44 -07:00
David Duong	6d03f8b5d8	Add serialisable support for Replicate (#8525 )	2023-08-11 11:35:21 -07:00
niklub	16af5f8690	Add LabelStudio integration (#8880 ) This PR introduces [Label Studio](https://labelstud.io/) integration with LangChain via `LabelStudioCallbackHandler`: - sending data to the Label Studio instance - labeling dataset for supervised LLM finetuning - rating model responses - tracking and displaying chat history - support for custom data labeling workflow ### Example ``` chat_llm = ChatOpenAI(callbacks=[LabelStudioCallbackHandler(mode="chat")]) chat_llm([ SystemMessage(content="Always use emojis in your responses."), HumanMessage(content="Hey AI, how's your day going?"), AIMessage(content="🤖 I don't have feelings, but I'm running smoothly! How can I help you today?"), HumanMessage(content="I'm feeling a bit down. Any advice?"), AIMessage(content="🤗 I'm sorry to hear that. Remember, it's okay to seek help or talk to someone if you need to. 💬"), HumanMessage(content="Can you tell me a joke to lighten the mood?"), AIMessage(content="Of course! 🎭 Why did the scarecrow win an award? Because he was outstanding in his field! 🌾"), HumanMessage(content="Haha, that was a good one! Thanks for cheering me up."), AIMessage(content="Always here to help! 😊 If you need anything else, just let me know."), HumanMessage(content="Will do! By the way, can you recommend a good movie?"), ]) ``` <img width="906" alt="image" src="https://github.com/langchain-ai/langchain/assets/6087484/0a1cf559-0bd3-4250-ad96-6e71dbb1d2f3"> ### Dependencies - [label-studio](https://pypi.org/project/label-studio/) - [label-studio-sdk](https://pypi.org/project/label-studio-sdk/) https://twitter.com/labelstudiohq --------- Co-authored-by: nik <nik@heartex.net>	2023-08-11 11:24:10 -07:00
Bagatur	8cb2594562	Bagatur/dingo (#9079 ) Co-authored-by: gary <1625721671@qq.com>	2023-08-11 10:54:45 -07:00
Jacques Arnoux	926c64da60	Fix web research retriever for unknown links in results (#9115 ) Fixes an issue with web research retriever for unknown links in results. This is currently making the retrieve crash sometimes. @rlancemartin	2023-08-11 10:50:37 -07:00
Manuel Soria	31cfc00845	Code understanding use case (#8801 ) Code understanding docs --------- Co-authored-by: Manuel Soria <manuel.soria@greyscaleai.com> Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 10:16:05 -07:00
Alvaro Bartolome	f7ae183f40	`ArgillaCallbackHandler` to properly use default values for `api_url` and `api_key` (#9113 ) As of the recent PR at #9043, after some testing we've realised that the default values were not being used for `api_key` and `api_url`. Besides that, the default for `api_key` was set to `argilla.apikey`, but since the default values are intended for people using the Argilla Quickstart (easy to run and setup), the defaults should be instead `owner.apikey` if using Argilla 1.11.0 or higher, or `admin.apikey` if using a lower version of Argilla. Additionally, we've removed the f-string replacements from the docstrings. --------- Co-authored-by: Gabriel Martin <gabriel@argilla.io>	2023-08-11 09:37:06 -07:00
Bagatur	0e5d09d0da	dalle nb fix (#9125 )	2023-08-11 08:21:48 -07:00
Francisco Ingham	9249d305af	tagging docs refactor (#8722 ) refactor of tagging use case according to new format --------- Co-authored-by: Lance Martin <lance@langchain.dev>	2023-08-11 08:06:07 -07:00
Bagatur	01ef786e7e	bump 262 (#9108 )	2023-08-11 01:29:07 -07:00
Bagatur	3b754b5461	Bagatur/filter metadata (#9015 ) Co-authored-by: Matt Robinson <mrobinson@unstructuredai.io>	2023-08-11 01:10:00 -07:00
Aayush Shah	a429145420	Minor grammatical error (#9102 ) Have corrected a grammatical error in: https://python.langchain.com/docs/modules/model_io/models/llms/ document 😄	2023-08-11 01:01:40 -07:00
Kim Minjong	7f0e847c13	Update pydantic format instruction prompt (#9095 ) - remove unopened bracket	2023-08-11 00:22:13 -07:00
Ashutosh Sanzgiri	991b448dfc	minor edits (#9093 ) Description: Minor edit to PR#845 Thanks!	2023-08-10 23:40:36 -07:00
Bagatur	3ab4e21579	fix json tool (#9096 )	2023-08-10 23:39:25 -07:00
Sam Groenjes	2184e3a400	Fix IndexError when input_list is Empty in prep_prompts (#5769 ) This MR corrects the IndexError arising in prep_prompts method when no documents are returned from a similarity search. Fixes #1733 Co-authored-by: Sam Groenjes <sam.groenjes@darkwolfsolutions.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 22:50:39 -07:00
Chenyu Zhao	c0acbdca1b	Update Fireworks model names (#9085 )	2023-08-10 19:23:42 -07:00
Charles Lanahan	a2588d6c57	Update openai embeddings notebook with correct embedding model in section 2 (#5831 ) In second section it looks like a copy/paste from the first section and doesn't include the specific embedding model mentioned in the example so I added it for clarity. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 19:02:10 -07:00
Bagatur	b80e3825a6	Bagatur/pinecone by vector (#9087 ) Co-authored-by: joseph <joe@outverse.com>	2023-08-10 18:28:55 -07:00
Nikhil Kumar	6abb2c2c08	Buffer method of ConversationTokenBufferMemory should be able to return messages as string (#7057 ) ### Description: `ConversationBufferTokenMemory` should have a simple way of returning the conversation messages as a string. Previously to complete this, you would only have the option to return memory as an array through the buffer method and call `get_buffer_string` by importing it from `langchain.schema`, or use the `load_memory_variables` method and key into `self.memory_key`. ### Maintainer @hwchase17 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 18:17:22 -07:00
William FH	57dd4daa9a	Add string example mapper (#9086 ) Now that we accept any runnable or arbitrary function to evaluate, we don't always look up the input keys. If an evaluator requires references, we should try to infer if there's one key present. We only have delayed validation here but it's better than nothing	2023-08-10 17:07:02 -07:00
Josh Phillips	5fc07fa524	change id column type to uuid to match function (#7456 ) The table creation process in these examples commands do not match what the recently updated functions in these example commands is looking for. This change updates the type in the table creation command. Issue Number for my report of the doc problem #7446 @rlancemartin and @eyurtsev I believe this is your area Twitter: @j1philli Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 16:57:19 -07:00
Bidhan Roy	02430e25b6	BagelDB (bageldb.ai), VectorStore integration. (#8971 ) - Description: [BagelDB](bageldb.ai) a collaborative vector database. Integrated the bageldb PyPi package with langchain with related tests and code. - Issue: Not applicable. - Dependencies: `betabageldb` PyPi package. - Tag maintainer: @rlancemartin, @eyurtsev, @baskaryan - Twitter handle: bageldb_ai (https://twitter.com/BagelDB_ai) We ran `make format`, `make lint` and `make test` locally. Followed the contribution guideline thoroughly https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --------- Co-authored-by: Towhid1 <nurulaktertowhid@gmail.com>	2023-08-10 16:48:36 -07:00
DJ Atha	ee52482db8	Fix issue 7445 (#7635 ) Description: updated BabyAGI examples and experimental to append the iteration to the result id to fix error storing data to vectorstore. Issue: 7445 Dependencies: no Tag maintainer: @eyurtsev This fix worked for me locally. Happy to take some feedback and iterate on a better solution. I was considering appending a uuid instead but didn't want to over complicate the example. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-10 16:29:31 -07:00
Harrison Chase	bb6fbf4c71	openai adapters (#8988 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Nuno Campos <nuno@boringbits.io>	2023-08-10 16:08:50 -07:00

... 5 6 7 8 9 ...

4069 Commits