langchain

Commit Graph

Author	SHA1	Message	Date
Leonid Ganeline	1ab8a14742	docs[patch]: top menu (#13748 ) Addressed this issue with the top menu: It allocates too much space. If the screen is small, then the top menu items are split into two lines and look unreadable. Another issue is with several top menu items: "Chat our docs" and "Also by LangChain". They are composed of several words, which also hurts readability. The top menu items should be 1-word size. Updates: - "Chat our docs" -> "Chat" (the meaning is clear after clicking/opening the item) - "Also by LangChain" -> "🦜️🔗" - "🦜️🔗" moved before "Chat" item. This new item is partially copied from the first left item, the "🦜️🔗 LangChain". This design (with two 🦜️🔗 elements) visually splits the top menu into two parts. The first item in each part holds the 🦜️🔗 symbols and, when we click the second 🦜️🔗 item, it opens the drop-down menu. So, we've got two visually similar parts, which visually split the top menu on the right side: the LangChain Docs (and Doc-related items) and the left side: other LangChain.ai (company) products/docs.	10 months ago
Taqi Jaffri	144710ad9a	langchain[minor]: Updated DocugamiLoader, includes breaking changes (#13265 ) There are the following main changes in this PR: 1. Rewrite of the DocugamiLoader to not do any XML parsing of the DGML format internally, and instead use the `dgml-utils` library we are separately working on. This is a very lightweight dependency. 2. Added MMR search type as an option to multi-vector retriever, similar to other retrievers. MMR is especially useful when using Docugami for RAG since we deal with large sets of documents within which a few might be duplicates and straight similarity based search doesn't give great results in many cases. We are @docugami on twitter, and I am @tjaffri --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	10 months ago
Bagatur	95a472a85f	docs[patch]: install local core (#13990 )	10 months ago
Bagatur	61ec71064a	docs[patch]: update stack diagram (#13902 )	10 months ago
david qiu	9fb6805be4	langchain[minor]: Add retriever for Knowledge Bases for Amazon Bedrock (#13980 ) - Description: Adds a retriever implementation for [Knowledge Bases for Amazon Bedrock](https://aws.amazon.com/bedrock/knowledge-bases/), a new service announced at AWS re:Invent, shortly before this PR was opened. This depends on the `bedrock-agent-runtime` service, which will be included in a future version of `boto3` and of `botocore`. We will open a follow-up PR documenting the minimum required versions of `boto3` and `botocore` after that information is available. - Issue: N/A - Dependencies: `boto3>=1.33.2, botocore>=1.33.2` - Tag maintainer: @baskaryan - Twitter handles: `@pjain7` `@dead_letter_q` This PR includes a documentation notebook under `docs/docs/integrations/retrievers`, which I (@dlqqq) have verified independently. EDIT: `bedrock-agent-runtime` service is now included in `boto3>=1.33.2`: `5cf793f493` --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Varun	14cc907d35	Update the stable docs link (#13798 ) - Description: Point to the stable version of documentation, - Twitter handle: varunzxzx	10 months ago
Amélie	d2cad53ec0	Fix broken link on Meilisearch vector-store documentation (#13604 ) - Description: dead link replacement - Issue: no open issue Note: Hi langchain team, Sorry to open a PR for this concern but we realized that one of the links present in the documentation booklet was broken 😄	10 months ago
Rihards Gravis	9e017ff6ba	docs[patch]: Reduce largest static image file size (#13508 ) - Description: Reduce image asset file size used in documentation by running them via lossless image optimization ([tinypng](https://www.npmjs.com/package/tinypng-cli) was used in this case). Images wider than 1916px (the maximum width of an image displayed in documentation) were downsized. - Issue: No issue is created for this, but the large image file assets caused slow documentation load times - Dependencies: No dependencies affected	10 months ago
Michael Feil	686162670e	langchain[minor]: Adding `infinity` embedding integration. (#13928 ) This adds integration to https://github.com/michaelfeil/infinity. Users requested it in https://github.com/michaelfeil/infinity/issues/36 @saatvikshah Follows my implementation of gradient.ai. Feedback 1: Well done - I love your CI / repo / poetry setup - I adapted a lot in https://github.com/michaelfeil/infinity. Feedback 2: Not so good: The openai integration contains too much reverse engineering - in general projects such as michaelfeil/infinity and huggingface/text-embeddings-inference are compatible with the `pip install openai` package. Reverse engineering like this one is really hindering the use for me: `8e88ba16a8/libs/langchain/langchain/embeddings/openai.py (L347)` `8e88ba16a8/libs/langchain/langchain/embeddings/openai.py (L351)` - it is about preventing 3rd party providers from using the same url + uses interfaces of openai that are not publicly documented.	10 months ago
Oleksandr Yaremchuk	c0277d06e8	experimental[patch] Update prompt injection model (#13930 ) - Description: Existing model used for Prompt Injection is quite outdated but we fine-tuned and open-sourced a new model based on the same model deberta-v3-base from Microsoft - [laiyer/deberta-v3-base-prompt-injection](https://huggingface.co/laiyer/deberta-v3-base-prompt-injection). It supports more up-to-date injections and is less prone to false-positives. - Dependencies: No - Tag maintainer: - - Twitter handle: @alex_yaremchuk --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Leonid Ganeline	e47b9c5285	DOCS: move `adapters` to integrations (#13862 ) Current docs for adapters are in `Guides/Adapters`, which is not a good place. - moved Adapters into `Integrations/Components/Adapters/` - simplified the OpenAI adapter notebook - rerouted the old OpenAI adapter page URL to a new one.	10 months ago
Manuel Riezebosch	92b07ecaf3	DOCS: fix link to question answering (#13806 ) first link in [overview](https://python.langchain.com/docs/use_cases/question_answering/code_understanding#overview)	10 months ago
Chengzu Ou	4b8e053fe8	FEATURE: Add Databricks Vector Search as a new vector store (#13621 ) Description: This PR adds Databricks Vector Search as a new vector store in LangChain. - [x] Add `DatabricksVectorSearch` in `langchain/vectorstores/` - [x] Unit tests - [x] Add [`databricks-vectorsearch`](https://pypi.org/project/databricks-vectorsearch/) as a new optional dependency We ran the following checks: - `make format` passed ✅ - `make lint` failed but the failures were caused by other files + Files touched by this PR passed the linter ✅ - `make test` passed ✅ - `make coverage` failed but the failures were caused by other files. Tests added by or related to this PR all passed + langchain/vectorstores/databricks_vector_search.py test coverage 94% ✅ - `make spell_check` passed ✅ The example notebook and updates to the [provider's documentation page](https://github.com/langchain-ai/langchain/blob/master/docs/docs/integrations/providers/databricks.md) will be added later in a separate PR. Dependencies: Optional dependency: [`databricks-vectorsearch`](https://pypi.org/project/databricks-vectorsearch/) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Dylan Williams	1983a39894	FEATURE: Add OneNote document loader (#13841 ) - Description: Added OneNote document loader - Issue: #12125 - Dependencies: msal Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Ikko Eltociear Ashimine	ff7d4d9c0b	Update llamacpp.ipynb (#13840 ) specifed -> specified	10 months ago
Sᴜᴘᴇʀ Lᴇᴇ	e42e95cc11	docs: fix link to `local_retrieval_qa` (#13872 ) The original link in [this section](https://python.langchain.com/docs/use_cases/question_answering/#:~:text=locally%2Drunning%20models-,here,-.): https://python.langchain.com/docs/modules/use_cases/question_answering/local_retrieval_qa After fix: https://python.langchain.com/docs/use_cases/question_answering/local_retrieval_qa	10 months ago
Yusuf Khan	935f78c944	FEATURE: Add retriever for Outline (#13889 ) - Description: Added a retriever for the Outline API to ask questions on knowledge base - Issue: resolves #11814 - Dependencies: None - Tag maintainer: @baskaryan	10 months ago
ggeutzzang	f2af82058f	DOCS: Fix Sample Code for Compatibility with Pydantic 2.0 (#13890 ) - Description: I encountered an issue while running the existing sample code on the page https://python.langchain.com/docs/modules/agents/how_to/agent_iter in an environment with Pydantic 2.0 installed. The following error was triggered: ```python ValidationError Traceback (most recent call last) <ipython-input-12-2ffff2c87e76> in <cell line: 43>() 41 42 tools = [ ---> 43 Tool( 44 name="GetPrime", 45 func=get_prime, 2 frames /usr/local/lib/python3.10/dist-packages/pydantic/v1/main.py in __init__(__pydantic_self__, **data) 339 values, fields_set, validation_error = validate_model(__pydantic_self__.__class__, data) 340 if validation_error: --> 341 raise validation_error 342 try: 343 object_setattr(__pydantic_self__, '__dict__', values) ValidationError: 1 validation error for Tool args_schema subclass of BaseModel expected (type=type_error.subclass; expected_class=BaseModel) ``` I have made modifications to the example code to ensure it functions correctly in environments with Pydantic 2.0.	10 months ago
Stefano Lottini	19c68c7652	FEATURE: Astra DB, LLM cache classes (exact-match and semantic cache) (#13834 ) This PR provides idiomatic implementations for the exact-match and the semantic LLM caches using Astra DB as backend through the database's HTTP JSON API. These caches require the `astrapy` library as dependency. Comes with integration tests and example usage in the `llm_cache.ipynb` in the docs. @baskaryan this is the Astra DB counterpart for the Cassandra classes you merged some time ago, tagging you for your familiarity with the topic. Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Stefano Lottini	272df9dcae	Astra DB, chat message history (#13836 ) This PR adds a chat message history component that uses Astra DB for persistence through the JSON API. The `astrapy` package is required for this class to work. I have added tests and a small notebook, and updated the relevant references in the other docs pages. (@rlancemartin this is the counterpart of the Cassandra equivalent class you so helpfully reviewed back at the end of June) Thank you!	10 months ago
Bagatur	23566cbea9	DOCS: core editable dep api refs (#13747 )	10 months ago
Bagatur	3d28c1a9e0	DOCS: fix core api ref build (#13744 )	10 months ago
Harrison Chase	d82cbf5e76	Separate out langchain_core package (#13577 ) Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	10 months ago
Bagatur	4eec47b191	DOCS: update rag use case images (#13615 )	10 months ago
Sijun He	674bd90a47	DOCS: Fix typo in MongoDB memory docs (#13588 ) - Description: Fix typo in MongoDB memory docs - Tag maintainer: @eyurtsev	10 months ago
jwbeck97	a93616e972	FEAT: Add azure cognitive health tool (#13448 ) - Description: This change adds an agent to the Azure Cognitive Services toolkit for identifying healthcare entities - Dependencies: azure-ai-textanalytics (Optional) --------- Co-authored-by: James Beck <James.Beck@sa.gov.au> Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
John Mai	16f7912e1b	BUG: fix hunyuan appid type (#13496 ) - Description: fix hunyuan appid type - Issue: https://github.com/langchain-ai/langchain/pull/12022#issuecomment-1815627855	10 months ago
Leonid Ganeline	43972be632	docs updating `AzureML` notebooks (#13492 ) - Added/updated descriptions and links --------- Co-authored-by: Erick Friis <erick@langchain.dev>	10 months ago
Taranjeet Singh	47451764a7	Add embedchain retriever (#13553 ) Description: This commit adds embedchain retriever along with tests and docs. Embedchain is a RAG framework to create data pipelines. Twitter handle: - [Taranjeet's twitter](https://twitter.com/taranjeetio) and [Embedchain's twitter](https://twitter.com/embedchain) Reviewer @hwchase17 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
rafly lesmana	420a17542d	fix: Make YoutubeLoader support on demand language translation (#13583 ) Description: Enhance the functionality of YoutubeLoader to enable the translation of available transcripts by refining the existing logic. Issue: Encountering a problem with YoutubeLoader (#13523) where the translation feature is not functioning as expected. Tag maintainers/contributors who might be interested: @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	10 months ago
Leonid Ganeline	cc50e023d1	DOCS `langchain decorators` update (#13535 ) added disclaimer --------- Co-authored-by: Erick Friis <erickfriis@gmail.com>	10 months ago
Brace Sproul	02a13030c0	DOCS: updated langchain stack img to be svg (#13540 )	10 months ago
Martin Krasser	79ed66f870	EXPERIMENTAL Generic LLM wrapper to support chat model interface with configurable chat prompt format (#8295 ) ## Update 2023-09-08 This PR now supports further models in addition to Llama-2 chat models. See [this comment](#issuecomment-1668988543) for further details. The title of this PR has been updated accordingly. ## Original PR description This PR adds a generic `Llama2Chat` model, a wrapper for LLMs able to serve Llama-2 chat models (like `LlamaCPP`, `HuggingFaceTextGenInference`, ...). It implements `BaseChatModel`, converts a list of chat messages into the [required Llama-2 chat prompt format](https://huggingface.co/blog/llama2#how-to-prompt-llama-2) and forwards the formatted prompt as `str` to the wrapped `LLM`. Usage example: ```python # uses a locally hosted Llama2 chat model llm = HuggingFaceTextGenInference( inference_server_url="http://127.0.0.1:8080/", max_new_tokens=512, top_k=50, temperature=0.1, repetition_penalty=1.03, ) # Wrap llm to support Llama2 chat prompt format. # Resulting model is a chat model model = Llama2Chat(llm=llm) messages = [ SystemMessage(content="You are a helpful assistant."), MessagesPlaceholder(variable_name="chat_history"), HumanMessagePromptTemplate.from_template("{text}"), ] prompt = ChatPromptTemplate.from_messages(messages) memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True) chain = LLMChain(llm=model, prompt=prompt, memory=memory) # use chat model in a conversation # ... ``` Also part of this PR are tests and a demo notebook. - Tag maintainer: @hwchase17 - Twitter handle: `@mrt1nz` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	10 months ago
Bagatur	2e2114d2d0	FEATURE: Runnable with message history (#13418 ) Add RunnableWithMessageHistory class that can wrap certain runnables and manages chat history for them.	10 months ago
Bagatur	0fc3af8932	IMPROVEMENT: update assistants output and doc (#13480 )	10 months ago
Leonid Ganeline	21552628c8	DOCS updated `data_connection` index page (#13426 ) - the `Index` section was missing. Created it. - text simplification --------- Co-authored-by: Erick Friis <erick@langchain.dev>	10 months ago
Leonid Ganeline	e3a5cd7969	docs `integrations/vectorstores/` cleanup (#13487 ) - updated titles to consistent format - added/updated descriptions and links - format heading	10 months ago
Leonid Ganeline	1d2981114f	DOCS updated `async-faiss` example (#13434 ) The original notebook has the `faiss` title, which duplicates the title in `faiss.ipynb`. As a result, we have two `faiss` items in the vectorstore ToC. And the first item breaks the sorting order (it is placed between `A...` items). - I updated title to `Asynchronous Faiss`.	10 months ago
Leonid Ganeline	9ff8f69e75	DOCS updated `memory` Titles (#13435 ) - Fixed titles for two notebooks. They were inconsistent with other titles and clogged ToC. - Added `Upstash` description and link - Moved the authentication text up in the `Elasticsearch` nb, right after package installation. It was at the end of the page, which was the wrong place.	10 months ago
Stefano Lottini	b029d9f4e6	Astra DB: minor improvements to docstrings and demo notebook (#13449 ) This PR brings a few minor improvements to the docs, namely class/method docstrings and the demo notebook. - A note on how to control concurrency levels to tune performance in bulk inserts, both in the class docstring and the demo notebook; - Slightly increased concurrency defaults after careful experimentation (still on the conservative side even for clients running on less-than-typical network/hardware specs) - renamed the DB token variable to the standardized `ASTRA_DB_APPLICATION_TOKEN` name (used elsewhere, e.g. in the Astra DB docs) - added a note and a reference (add_text docstring, demo notebook) on allowed metadata field names. Thank you!	10 months ago
Leonid Ganeline	283ef1f66d	DOCS fix for `integrations/document_loaders` sidebar (#13471 ) The current `integrations/document_loaders/` sidebar has the `example_data` item, which is a menu with a single item: "Notebook". It is happening because the `integrations/document_loaders/` folder has the `example_data/notebook.md` file that is used to autogenerate the above menu item. - removed the `example_data/notebook.md` file. Docusaurus doesn't have simple ways to fix this problem (to exclude folders/files from an autogenerated sidebar). Removing this file didn't break any existing examples, so this fix is safe.	10 months ago
Leonid Ganeline	b1fcf5b481	DOCS: `integrations/text_embeddings/` cleanup (#13476 ) Updated several notebooks: - fixed titles which are inconsistent or break the ToC sorting order. - added missing source descriptions and links - fixed formatting	10 months ago
Bagatur	10fddac4b5	Bagatur/chain of note template(#13470 )	10 months ago
Leonid Ganeline	d5b1a21ae4	DOCS updated `semadb` example (#13431 ) - the `SemaDB` notebook was placed in an additional subfolder, which breaks the vectorstore ToC. I moved the file up and removed this unnecessary subfolder; updated the `vercel.json` with rerouting for the new URL - Added SemaDB description and link - improved text consistency	10 months ago
Leonid Ganeline	17c2007e0c	DOCS updated `Activeloop DeepMemory` notebook (#13428 ) - Fixed the title of the notebook. It created an ugly ToC element as `Activeloop DeepLake's DeepMemory + LangChain + ragas or how to get +27% on RAG recall.` - Added Activeloop description - improved consistency in text - fixed ToC (it was using HTML tags that break the left-side in-page ToC). Now in-page ToC works	10 months ago
Bagatur	9e6748e198	DOCS: rag nit (#13436 )	10 months ago
Leonid Ganeline	8a52c1456b	updated `clickup` example (#13424 ) - Fixed headers (there was more than one title) - Removed security token value. It was OK to have it, because it is a temporary token, but the automatic security sweepers raise warnings on that. - Added `ClickUp` service description and link.	10 months ago
Brace Sproul	79fa9a81f4	Fix a link in docs (#13423 )	10 months ago
Bagatur	f0bb839506	DOCS: langchain stack img update (#13421 )	10 months ago
Bagatur	76c317ed78	DOCS: update rag use case (#13319 )	10 months ago


2493 Commits (bf5787f58bbf847d7567720bed1bcb7a92ccbb2b)