langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-04 06:00:26 +00:00

Author	SHA1	Message	Date
Harrison Chase	9a10b2b047	fix plate chain (#12673 )	2023-10-31 13:45:09 -07:00
Margaret Qian	acfc485808	Update MosaicML Embedding Input Key (#12657 ) This input key was missed in the last update PR: https://github.com/langchain-ai/langchain/pull/7391 The input/output formats are intended to be like this: ``` {"inputs": [<prompt>]} {"outputs": [<output_text>]} ```	2023-10-31 14:43:30 -04:00
Erika Cardenas	d26ac5f999	Update README for Hybrid Search Weaviate (#12661 ) - Description: Updated the README for Hybrid Search Weaviate	2023-10-31 11:02:34 -07:00
Predrag Gruevski	c871cc5055	Remove `print()` statements which seemed leftover from debugging. (#12648 ) Added in #12159 presumably during debugging. Right now they cause a bit of visual noise.	2023-10-31 13:45:48 -04:00
Erick Friis	2a7e0a27cb	update lc version (#12655 ) also updated py version in `csv-agent` and `rag-codellama-fireworks` because they have stricter python requirements	2023-10-31 10:19:15 -07:00
Predrag Gruevski	360cff81a3	Overwrite existing distributions when uploading to test PyPI. (#12658 )	2023-10-31 10:02:50 -07:00
Lance Martin	da94c750c5	Add RAG template for Timescale Vector (#12651 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Matvey Arye <mat@timescale.com>	2023-10-31 09:56:29 -07:00
Noam Gat	14e8c74736	LM Format Enforcer Integration + Sample Notebook (#12625 ) ## Description This PR adds support for [lm-format-enforcer](https://github.com/noamgat/lm-format-enforcer) to LangChain. ![image](https://raw.githubusercontent.com/noamgat/lm-format-enforcer/main/docs/Intro.webp) The library is similar to jsonformer / RELLM which are supported in Langchain, but has several advantages such as - Batching and Beam search support - More complete JSON Schema support - LLM has control over whitespace, improving quality - Better runtime performance due to only calling the LLM's generate() function once per generate() call. The integration is loosely based on the jsonformer integration in terms of project structure. ## Dependencies No compile-time dependency was added, but if `lm-format-enforcer` is not installed, a runtime error will occur if it is trying to be used. ## Tests Due to the integration modifying the internal parameters of the underlying huggingface transformer LLM, it is not possible to test without building a real LM, which requires internet access. So, similar to the jsonformer and RELLM integrations, the testing is via the notebook. ## Twitter Handle [@noamgat](https://twitter.com/noamgat) Looking forward to hearing feedback! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-31 09:49:01 -07:00
Stefano Lottini	a4e4b5a86f	Relax python version and remove need for explicit setup step (#12637 ) This PR addresses what seems like a unnecessary Python version restriction in the pyroject.toml specs within both Cassandra (/Astra DB) templates. With "^3.11" I got some version incompatibilities with the latest "langchain add [...]" commands, so these are now relaxed in line with the other templates I could inspect. Incidentally, in the "entomology" template, the need for an explicit "setup" step for the user to carry on has been removed, replaced by a check-and-execute-if-necessary instruction on app startup. Thank you for your attention!	2023-10-31 09:42:27 -07:00
Predrag Gruevski	5308b836c7	Upgrade to `actions/checkout@v4` in the docs lint job. (#12581 )	2023-10-31 12:41:18 -04:00
Predrag Gruevski	94f018f1ba	Support release-testing packages with dashes in their names. (#12654 )	2023-10-31 12:40:34 -04:00
Erick Friis	912ace18e9	fix template py verisons (#12650 )	2023-10-31 09:20:29 -07:00
Brian McBrayer	b74468f399	Fix small typo on Founcational -> Router notebook (#12634 ) - Description: Fix small typo on Founcational -> Router notebook	2023-10-31 09:16:29 -07:00
Predrag Gruevski	72fa5a463d	Show ruff output inline in GitHub PRs. (#12647 )	2023-10-31 12:16:01 -04:00
William FH	17c2e3b87e	Rename Template (#12649 ) To chatbot feedback. Update import	2023-10-31 09:15:30 -07:00
Erick Friis	7f6e751a3d	template updates (#12646 )	2023-10-31 09:13:58 -07:00
Leonid Kuligin	a53cac4508	added template to use Vertex Vector Search for q&a (#12622 ) added template to use Vertex Vector Search for q&a	2023-10-31 08:49:24 -07:00
Lance Martin	944cb552bb	Minor updates to READMEs (#12642 )	2023-10-31 08:34:46 -07:00
William FH	88f0f1e73b	Conversational Feedback (#12590 ) Context in the README. Show how score chat responses based on a followup from the user and then log that as feedback in LangSmith	2023-10-31 08:34:17 -07:00
Predrag Gruevski	f94e24dfd7	Install and use `ruff format` instead of black for code formatting. (#12585 ) Best to review one commit at a time, since two of the commits are 100% autogenerated changes from running `ruff format`: - Install and use `ruff format` instead of black for code formatting. - Output of `ruff format .` in the `langchain` package. - Use `ruff format` in experimental package. - Format changes in experimental package by `ruff format`. - Manual formatting fixes to make `ruff .` pass.	2023-10-31 10:53:12 -04:00
William FH	bfd719f9d8	bind_functions convenience method (#12518 ) I always take 20-30 seconds to re-discover where the `convert_to_openai_function` wrapper lives in our codebase. Chat langchain [has no clue](https://smith.langchain.com/public/3989d687-18c7-4108-958e-96e88803da86/r) what to do either. There's the older `create_openai_fn_chain` , but we haven't been recommending it in LCEL. The example we show in the [cookbook](https://python.langchain.com/docs/expression_language/how_to/binding#attaching-openai-functions) is really verbose. General function calling should be as simple as possible to do, so this seems a bit more ergonomic to me (feel free to disagree). Another option would be to directly coerce directly in the class's init (or when calling invoke), if provided. I'm not 100% set against that. That approach may be too easy but not simple. This PR feels like a decent compromise between simple and easy. ``` from enum import Enum from typing import Optional from pydantic import BaseModel, Field class Category(str, Enum): """The category of the issue.""" bug = "bug" nit = "nit" improvement = "improvement" other = "other" class IssueClassification(BaseModel): """Classify an issue.""" category: Category other_description: Optional[str] = Field( description="If classified as 'other', the suggested other category" ) from langchain.chat_models import ChatOpenAI llm = ChatOpenAI().bind_functions([IssueClassification]) llm.invoke("This PR adds a convenience wrapper to the bind argument") # AIMessage(content='', additional_kwargs={'function_call': {'name': 'IssueClassification', 'arguments': '{\n "category": "improvement"\n}'}}) ```	2023-10-31 07:15:37 -07:00
Nuno Campos	3143324984	Improve Runnable type inference for input_schemas (#12630 ) - Prefer lambda type annotations over inferred dict schema - For sequences that start with RunnableAssign infer seq input type as "input type of 2nd item in sequence - output type of runnable assign"	2023-10-31 13:22:54 +00:00
Nuno Campos	2f563cee20	Add Runnable.with_listeners() (#12549 ) - This binds start/end/error listeners to a runnable, which will be called with the Run object	2023-10-31 11:04:51 +00:00
Bagatur	bcc62d63be	bump 327 (#12623 )	2023-10-31 02:18:08 -07:00
Erick Friis	a1fae1fddd	Readme rewrite (#12615 ) Co-authored-by: Lance Martin <lance@langchain.dev> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-31 00:06:02 -07:00
Ankur Singh	00766c9f31	Improves the description of the installation command (#12354 ) - Description: Before: ` To install modules needed for the common LLM providers, run: ` After: ` To install modules needed for the common LLM providers, run the following command. Please bear in mind that this command is exclusively compatible with the `bash` shell: ` > This is required for the user so that the user will know if this command is compatible with `zsh` or not. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 18:56:48 -07:00
Yujie Qian	1dbb77d7db	VoyageEmbeddings (#12608 ) - Description: Integrate VoyageEmbeddings into LangChain, with tests and docs - Issue: N/A - Dependencies: N/A - Tag maintainer: N/A - Twitter handle: @Voyage_AI_ --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 18:37:43 -07:00
chocolate4	92bf40a921	Add a new vector store hippo for langchain #11763 (#12412 ) #11763 --------- Co-authored-by: TranswarpHippo <hippo.0.assistant@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 18:35:23 -07:00
Karthik Raja A	342d6c7ab6	Multi on client toolkit (#12392 ) Replace this entire comment with: -Add MultiOn close function and update key value and add async functionality - solved the key value TabId not found.. (updated to use latest key value) @hwchase17	2023-10-30 18:34:56 -07:00
Prabin Nepal	b109cb031b	SecretStr for fireworks api (#12475 ) - Description: This pull request removes secrets present in raw format, - Issue: Fireworks api key was exposed when printing out the langchain object [#12165](https://github.com/langchain-ai/langchain/issues/12165) - Maintainer: @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 18:17:53 -07:00
Harrison Chase	f35a65124a	improve agent templates (#12528 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-10-30 18:15:13 -07:00
Harrison Chase	75bb28afd8	Harrison/pii chatbot (#12523 ) the pii detection in the template is pretty basic, will need to be customized per use case the chain it "protects" can be swapped out for any chain	2023-10-30 18:13:12 -07:00
Harrison Chase	a32c236c64	bump cli to 009 (#12611 )	2023-10-30 18:12:08 -07:00
Erika Cardenas	b97b9eda21	Hybrid Search Weaviate Template (#12606 ) - Description: This template covers hybrid search in Weaviate - Dependencies: No - Twitter handle: @ecardenas300 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-30 18:10:48 -07:00
Martin Schade	0c7f1d8b21	Textract linearizer (#12446 ) Description: Textract PDF Loader generating linearized output, meaning it will replicate the structure of the source document as close as possible based on the features passed into the call (e. g. LAYOUT, FORMS, TABLES). With LAYOUT reading order for multi-column documents or identification of lists and figures is supported and with TABLES it will generate the table structure as well. FORMS will indicate "key: value" with columms. - Issue: the issue fixes #12068 - Dependencies: amazon-textract-textractor is added, which provides the linearization - Tag maintainer: @3coins --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 18:02:10 -07:00
Harrison Chase	a7d5e0ce8a	add guardrails profanity (#12609 )	2023-10-30 17:01:23 -07:00
Erick Friis	e933212a3d	run poetry build in working dir (#12610 ) Was failing because was trying to build from root: https://github.com/langchain-ai/langchain/actions/runs/6700033981/job/18205251365	2023-10-30 16:58:34 -07:00
Erick Friis	f39246bd7e	cli should pull instead of delete+clone (#12607 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-10-30 16:44:09 -07:00
Harrison Chase	8b5e879171	add a template for the package readme (#12499 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2023-10-30 16:39:39 -07:00
Bagatur	9bedda50f2	Bagatur/lakefs loader2 (#12524 ) Co-authored-by: Jonathan Rosenberg <96974219+Jonathan-Rosenberg@users.noreply.github.com>	2023-10-30 16:30:27 -07:00
Brian McBrayer	3243dcc83e	Fix very small typo (#12603 ) - Description: this is the world's smallest typo change of a typo I saw while reading the docs	2023-10-30 16:30:18 -07:00
Ackermann Yuriy	99b69fe607	Fixed missing optional tags. Added default key value for Ollama (#12599 ) Added missing Optional typings. Added default values for Ollama optional keys. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 16:30:10 -07:00
Lance Martin	f6f3ca12e7	Codebase RAG fireworks (#12597 )	2023-10-30 16:21:56 -07:00
Harrison Chase	481bf6fae6	hosting note (#12589 )	2023-10-30 15:31:31 -07:00
David Duong	b5c17ff188	Force List[Tuple[str,str]] to chat history widget (#12530 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 15:19:32 -07:00
David Duong	d39b4b61b6	Batch apply `poetry lock --no-update` for all templates (#12531 ) Ran the following bash script for all templates ```bash #!/bin/bash set -e current_dir="$(pwd)" for directory in */; do if [ -d "$directory" ]; then (cd "$directory" && poetry lock --no-update) fi done cd "$current_dir" ``` Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 15:18:53 -07:00
Kenzie Mihardja	e914283cf9	add docs to min_chunk_size (#12537 ) Minor addition to documentation to elaborate on min_chunk_size. Co-authored-by: Kenzie Mihardja <kenzie@docugami.com>	2023-10-30 15:13:52 -07:00
Bagatur	016813d189	factor out to_secret (#12593 )	2023-10-30 15:10:25 -07:00
hsuyuming	630ae24b28	implement get_num_tokens to use google's count_tokens function (#10565 ) can get the correct token count instead of using gpt-2 model Description: Implement get_num_tokens within VertexLLM to use google's count_tokens function. (https://cloud.google.com/vertex-ai/docs/generative-ai/get-token-count). So we don't need to download gpt-2 model from huggingface, also when we do the mapreduce chain we can get correct token count. Tag maintainer: @lkuligin Twitter handle: My twitter: @abehsu1992626 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-30 15:10:05 -07:00
Pham Vu Thai Minh	33e77a1007	Async support for FAISS (#11333 ) Following this tutoral about using OpenAI Embeddings with FAISS https://python.langchain.com/docs/integrations/vectorstores/faiss ```python from langchain.embeddings.openai import OpenAIEmbeddings from langchain.text_splitter import CharacterTextSplitter from langchain.vectorstores import FAISS from langchain.document_loaders import TextLoader from langchain.document_loaders import TextLoader loader = TextLoader("../../../extras/modules/state_of_the_union.txt") documents = loader.load() text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0) docs = text_splitter.split_documents(documents) embeddings = OpenAIEmbeddings() ``` This works fine ```python db = FAISS.from_documents(docs, embeddings) query = "What did the president say about Ketanji Brown Jackson" docs = db.similarity_search(query) ``` But the async version is not ```python db = await FAISS.afrom_documents(docs, embeddings) # NotImplementedError query = "What did the president say about Ketanji Brown Jackson" docs = await db.asimilarity_search(query) # this will use await asyncio.get_event_loop().run_in_executor under the hood and will not call OpenAIEmbeddings.aembed_query but call OpenAIEmbeddings.embed_query ``` So this PR add async/await supports for FAISS --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-10-30 15:08:53 -07:00

... 4 5 6 7 8 ...

5868 Commits