langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
Bagatur	af284518bc	openai[patch]: Release 0.1.7, bump tiktoken 0.7.0 (#21723 )	2024-05-15 12:19:29 -07:00
William FH	ca768c8353	[Core] Check is async callable (#21714 ) To permit proper coercion of objects like the following: ```python class MyAsyncCallable: async def __call__(self, foo): return await ... class MyAsyncGenerator: async def __call__(self, foo): await ... yield ```	2024-05-15 10:49:49 -07:00
Eugene Yurtsev	5c2cfabec6	core[minor]: Add v2 implementation of astream events (#21638 ) This PR introduces a v2 implementation of astream events that removes intermediate abstractions and fixes some issues with v1 implementation. The v2 implementation significantly reduces relevant code that's associated with the astream events implementation together with overhead. After this PR, the astream events implementation: - Uses an async callback handler - No longer relies on BaseTracer - No longer relies on json patch As a result of this re-write, a number of issues were discovered with the existing implementation. ## Changes in V2 vs. V1 ### on_chat_model_end `output` The outputs associated with `on_chat_model_end` changed depending on whether it was within a chain or not. As a root level runnable the output was: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` As part of a chain the output was: ``` "data": { "output": { "generations": [ [ { "generation_info": None, "message": AIMessageChunk( content="hello world!", id=AnyStr() ), "text": "hello world!", "type": "ChatGenerationChunk", } ] ], "llm_output": None, } }, ``` After this PR, we will always use the simpler representation: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` NOTE Non chat models (i.e., regular LLMs) are still associated with the more verbose format. ### Remove some `_stream` events `on_retriever_stream` and `on_tool_stream` events were removed -- these were not real events, but created as an artifact of implementing on top of astream_log. The same information is already available in the `x_on_end` events. ### Propagating Names Names of runnables have been updated to be more consistent ```python model = GenericFakeChatModel(messages=infinite_cycle).configurable_fields( messages=ConfigurableField( id="messages", name="Messages", description="Messages return by the LLM", ) ) ``` Before: ```python "name": "RunnableConfigurableFields", ``` After: ```python "name": "GenericFakeChatModel", ``` ### on_retriever_end on_retriever_end will always return `output` which is a list of documents (rather than a dict containing a key called "documents") ### Retry events Removed the `on_retry` callback handler. It was incorrectly showing that the failed function being retried has invoked `on_chain_end` https://github.com/langchain-ai/langchain/pull/21638/files#diff-e512e3f84daf23029ebcceb11460f1c82056314653673e450a5831147d8cb84dL1394	2024-05-15 11:48:47 -04:00
Rajendra Kadam	54e003268e	langchain[minor]: Add PebbloRetrievalQA chain with Identity & Semantic Enforcement support (#20641 ) - Description: PebbloRetrievalQA chain introduces identity enforcement using vector-db metadata filtering - Dependencies: None - Issue: None - Documentation: Adding documentation for PebbloRetrievalQA chain in a separate PR(https://github.com/langchain-ai/langchain/pull/20746) - Unit tests: New unit-tests added --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-05-15 13:14:52 +00:00
Bagatur	241a6e43a5	docs: update structured how to (#21679 )	2024-05-14 22:19:51 +00:00
Jib	f369495fa0	mongodb: [performance] Increase DEFAULT_INSERT_BATCH_SIZE to 100,000 and introduce sizing constraints (#19608 )	2024-05-14 22:11:26 +00:00
Eugene Yurtsev	e69a9bedf8	core[patch]: Update mypy config (#21684 ) Update mypy config to ignore checking deps from numpy and pytest (which are optional in langsmith sdk)	2024-05-14 17:29:07 -04:00
Erick Friis	9973547aef	mongodb: release 0.1.4 (#21678 )	2024-05-14 11:54:23 -07:00
Jib	a97473c846	mongodb[patch]: Make ObjectId JSON-serializable on generation (#21394 )	2024-05-14 11:52:29 -07:00
Eugene Yurtsev	5c64c004cc	core[patch]: Add unit tests with some streaming scenarios (#21668 ) Add unit tests that show differences between sync / async versions when streaming. The inner on_chain_chunk event is missing if mixing sync and async functionality. Likely due to missing tap_output_iter implementation on the sync variant of `_transform_stream_with_config`	2024-05-14 15:30:57 +00:00
Eugene Yurtsev	2ac4d2960c	core[patch]: Add unit test to catch ordering (#21669 ) Add unit test to catch ordering issues	2024-05-14 15:25:33 +00:00
Zhao Blake	972d2071c6	core[patch]: Fix typo in VectorStoreExampleSelector doc-string (#21574 )	2024-05-14 13:31:37 +00:00
Erick Friis	2a984e8e3f	docs: huggingface package (#21645 )	2024-05-14 03:17:40 +00:00
Erick Friis	c77d2f2b06	multiple: core 0.2 nonbreaking dep, check_diff community->langchain dep (#21646 ) 0.2 is not a breaking release for core (but it is for langchain and community) To keep the core+langchain+community packages in sync at 0.2, we will relax deps throughout the ecosystem to tolerate `langchain-core` 0.2	2024-05-13 19:50:36 -07:00
Anush	edd68e4ad4	qdrant: init package (#21146 ) ## Description This PR introduces the new `langchain-qdrant` partner package, intending to deprecate the community package. ## Changes - Moved the Qdrant vector store implementation `/libs/partners/qdrant` with integration tests. - The conditional imports of the client library are now regular with minor implementation improvements. - Added a deprecation warning to `langchain_community.vectorstores.qdrant.Qdrant`. - Replaced references/imports from `langchain_community` with either `langchain_core` or by moving the definitions to the `langchain_qdrant` package itself. - Updated the Qdrant vector store documentation to reflect the changes. ## Testing - `QDRANT_URL` and [`QDRANT_API_KEY`](`583e36bf6b`) env values need to be set to [run integration tests](`d608c93d1f`) in the [cloud](https://cloud.qdrant.tech). - If a Qdrant instance is running at `http://localhost:6333`, the integration tests will use it too. - By default, tests use an [`in-memory`](https://github.com/qdrant/qdrant-client?tab=readme-ov-file#local-mode) instance(Not comprehensive). --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-13 18:20:03 -07:00
Prashanth Rao	63c3a0e56c	[community][graph]: Update KuzuQAChain and docs (#21218 ) This PR makes some small updates for `KuzuQAChain` for graph QA. - Updated Cypher generation prompt (we now support `WHERE EXISTS`) and generalize it more - Support different LLMs for Cypher generation and QA - Update docs and examples	2024-05-13 17:17:14 -07:00
Tomaz Bratanic	89ff6a3d3b	Add sentiment and confidence levels to diffbotgraphtransformer (#21590 ) Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-13 23:00:52 +00:00
Erick Friis	9b51ca08bc	huggingface: fix community dep checking (#21628 )	2024-05-13 21:52:18 +00:00
Erick Friis	91a2ea5cd6	chroma, mongodb: fix docstrings (#21629 )	2024-05-13 21:27:43 +00:00
Jofthomas	afd85b60fc	huggingface: init package (#21097 ) First Pr for the langchain_huggingface partner Package - Moved some of the hugging face related class from `community` to the new `partner package` Still needed : - Documentation - Tests - Support for the new apply_chat_template in `ChatHuggingFace` - Confirm choice of class to support for embeddings witht he sentence-transformer team. cc : @efriis --------- Co-authored-by: Cyril Kondratenko <kkn1993@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-13 20:53:15 +00:00
Tomaz Bratanic	9fce03e7db	community[patch]: Fix neo4j enhanced schema (#21582 )	2024-05-13 15:26:06 -04:00
Christophe Bornet	66a4da8ad0	community[patch]: Improve Cassandra VectorStore docsctrings (#21620 )	2024-05-13 15:24:26 -04:00
adreo00	40aff1eacc	core[major]: AsyncCallbackManagerForToolRun no longer casts return object to string (#20374 ) - Description: Stops `AsyncCallbackManagerForToolRun` from converting the output to str - Issue: #20372 - Dependencies: None	2024-05-13 15:09:12 -04:00
Eugene Yurtsev	25fbe356b4	community[patch]: upgrade to recent version of mypy (#21616 ) This PR upgrades community to a recent version of mypy. It inserts type: ignore on all existing failures.	2024-05-13 14:55:07 -04:00
Eugene Yurtsev	b923951062	langchain[patch]: CI add lint rule for community imports (#21618 ) Add a rule to check for imports from community in global scope	2024-05-13 14:51:25 -04:00
Jorge Piedrahita Ortiz	4378fbbef0	community[patch]: Fix typos in Sambanova integration doc-strings (#21617 ) - Description: Sambanova integration docstrings updated, bad formated --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-05-13 18:35:16 +00:00
Christophe Bornet	bcf53f93e1	[community]: Add missing docstring param to CassandraLoader (#21611 )	2024-05-13 16:03:18 +00:00
Christophe Bornet	e6fa4547b1	community[minor]: Add alazy_load to AsyncHtmlLoader (#21536 ) Also fixes a bug that `_scrape` was called and was doing a second HTTP request synchronously. Twitter handle: cbornet_	2024-05-13 12:01:03 -04:00
Wang Guan	b53548dcda	langchain[minor]: allow CacheBackedEmbeddings to cache queries (#20073 ) Add optional caching of queries to cache backed embeddings	2024-05-13 15:18:04 +00:00
Guangdong Liu	a156aace2b	core[patch]:Fix Incorrect listeners parameters for Runnable.with_listeners() and .map() (#20661 ) - Issue: fix #20509 - @baskaryan, @eyurtsev ![image](https://github.com/langchain-ai/langchain/assets/48236177/f799a976-b983-4d8b-b373-64392e1fd6c6)	2024-05-13 11:16:17 -04:00
junkeon	480c02bf55	upstage[minor]: add merge_and_split function for document loader (#21603 ) - Introduce the `merge_and_split` function in the `UpstageLayoutAnalysisLoader`. - The `merge_and_split` function takes a list of documents and a splitter as inputs. - This function merges all documents and then divides them using the `split_documents` method, which is a proprietary function of the splitter. - If the provided splitter is `None` (which is the default setting), the function will simply merge the documents without splitting them.	2024-05-13 10:55:19 -04:00
Leonid Ganeline	500569da48	community[patch]: `vectorstores` import update (#21169 ) Issue: we have several helper functions to import third-party libraries like lancedb.import_lancedb in [community.vectorstores](https://api.python.langchain.com/en/latest/vectorstores/langchain_community.vectorstores.lancedb.import_lancedb.html#langchain_community.vectorstores.lancedb.import_lancedb). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-13 10:45:31 -04:00
ccurme	3003363605	langchain, community: remove cap on sqlalchemy and bump duckdb (#21509 )	2024-05-13 10:16:09 -04:00
ccurme	01a3228d8e	standard tests: add test for few-shot examples (#21019 )	2024-05-13 10:06:12 -04:00
Chuyuan Qu	af875cff57	prompty: adding Microsoft langchain_prompty package (#21346 ) Co-authored-by: Micky Liu <wayliu@microsoft.com> Co-authored-by: wayliums <wayliums@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-11 04:03:44 +00:00
Matt Florence	d3ca2cc8c3	langchain: Fix broken `OpenAIModerationChain` and implement async (#18537 ) Thank you for contributing to LangChain! ## PR title lancghain[patch]: fix `OpenAIModerationChain` and implement async ## PR message Description: fix `OpenAIModerationChain` and implement async Issues: - https://github.com/langchain-ai/langchain/issues/18533 - https://github.com/langchain-ai/langchain/issues/13685 Dependencies: none Twitter handle: mattflo ## Add tests and docs Existing documentation is broken: https://python.langchain.com/docs/guides/safety/moderation - [ x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Emilia Katari <emilia@outpace.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-10 19:04:13 +00:00
ccurme	4170e72a42	openai: fix loads unit test (#21542 ) following changes to tests in core here: https://github.com/langchain-ai/langchain/pull/21342/files	2024-05-10 18:46:34 +00:00
Erick Friis	3db85cbb5b	community: deps (#21508 )	2024-05-09 15:12:34 -07:00
Erick Friis	8580e350be	cli: release 0.0.22 (#21507 )	2024-05-09 21:45:20 +00:00
Anthony Chu	c735849e76	azure-dynamic-sessions: add Python REPL tool (#21264 ) Adds a Python REPL that executes code in a code interpreter session using Azure Container Apps dynamic sessions. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-09 21:39:04 +00:00
Erick Friis	02701c277f	langchain: core min version (#21506 )	2024-05-09 13:45:44 -07:00
Erick Friis	13b01104c9	langchain: drop sqlalchemy max, release 0.2.0rc2 (#21504 )	2024-05-09 13:12:38 -07:00
ccurme	375f447e58	community: fix builds with min dependencies (#21495 )	2024-05-09 13:01:44 -07:00
Trayan Azarov	ba7d53689c	community: Chroma Adding create_collection_if_not_exists flag to Chroma constructor (#21420 ) - Description: Adds the ability to either `get_or_create` or simply `get_collection`. This is useful when dealing with read-only Chroma instances where users are constraint to using `get_collection`. Targeted at Http/CloudClients mostly. - Issue: chroma-core/chroma#2163 - Dependencies: N/A - Twitter handle: `@t_azarov` \| Collection Exists \| create_collection_if_not_exists \| Outcome \| test \| \|-------------------\|---------------------------------\|----------------------------------------------------------------\|----------------------------------------------------------\| \| True \| False \| No errors, collection state unchanged \| `test_create_collection_if_not_exist_false_existing` \| \| True \| True \| No errors, collection state unchanged \| `test_create_collection_if_not_exist_true_existing` \| \| False \| False \| Error, `get_collection()` fails \| `test_create_collection_if_not_exist_false_non_existing` \| \| False \| True \| No errors, `get_or_create_collection()` creates the collection \| `test_create_collection_if_not_exist_true_non_existing` \|	2024-05-09 11:45:10 -04:00
ccurme	3bb9bec314	bedrock: add unit test for retriever (#21485 ) This was implemented in https://github.com/langchain-ai/langchain/pull/21349 but dropped before merge.	2024-05-09 11:37:03 -04:00
Renu Rozera	4035a1d234	Add source metadata to bedrock retriever response (#21349 ) Thank you for contributing to LangChain! - [X] PR title: "community: Add source metadata to bedrock retriever response" - [X] PR message: - Description: Bedrock retrieve API returns extra metadata in the response which is currently not returned in the retriever response - Issue: The change adds the metadata from bedrock retrieve API response to the bedrock retriever in a backward compatible way. Renamed metadata to sourceMetadata as metadata term is being used in the Document already. This is in sync with what we are doing in llama-index as well. - Dependencies: No - [X] Add tests and docs: 1. Added unit tests 2. Notebook already exists and does not need any change 3. Response from end to end testing, just to ensure backward compatibility: `[Document(page_content='Exoplanets.', metadata={'location': {'s3Location': {'uri': 's3://bucket/file_name.txt'}, 'type': 'S3'}, 'score': 0.46886647, 'source_metadata': {'x-amz-bedrock-kb-source-uri': 's3://bucket/file_name.txt', 'tag': 'space', 'team': 'Nasa', 'year': 1946.0}})]` - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-05-09 11:06:22 -04:00
Erick Friis	f178c67ad0	community: release 0.2.0rc1, bump deps (#21470 )	2024-05-08 23:32:44 -07:00
William FH	b28be5d407	Pass through Run ID Explicitly (#21469 )	2024-05-08 22:20:51 -07:00
Erick Friis	83eecd54fe	experimental: 0.2 relax (#21468 )	2024-05-08 21:39:42 -07:00
roiperlman	9992beaff9	community: Add arguments to whisper parser (#20378 ) Description: Added a few additional arguments to the whisper parser, which can be consumed by the underlying API. The prompt is especially important to fine-tune transcriptions. --------- Co-authored-by: Roi Perlman <roi@fivesigmalabs.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-08 17:53:13 -07:00
Yash	cb31c3611f	Ndb enterprise (#21233 ) Description: Adds NeuralDBClientVectorStore to the langchain, which is our enterprise client. --------- Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com>	2024-05-08 16:30:58 -07:00
Oguz Vuruskaner	5b35f077f9	[community][fix](DeepInfraEmbeddings): Implement chunking for large batches (#21189 ) Description: This PR introduces chunking logic to the `DeepInfraEmbeddings` class to handle large batch sizes without exceeding maximum batch size of the backend. This enhancement ensures that embedding generation processes large batches by breaking them down into smaller, manageable chunks, each conforming to the maximum batch size limit. Issue: Fixes #21189 Dependencies: No new dependencies introduced.	2024-05-08 14:45:42 -07:00
Sokolov Fedor	f4ddf64faa	community: Add MarkdownifyTransformer to langchain_community.document_transformers (#21247 ) - Added new document_transformer: MarkdonifyTransformer, that uses `markdonify` package with customizable options to convert HTML to Markdown. It's similar to Html2TextTransformer, but has more flexible options and also I've noticed that sometimes MarkdownifyTransformer performs better than html2text one, so that's why I use markdownify on my project. - Added docs and tests - Usage: ```python from langchain_community.document_transformers import MarkdownifyTransformer markdownify = MarkdownifyTransformer() docs_transform = markdownify.transform_documents(docs) ``` - Example of better performance on simple task, that I've noticed: ``` <html> <head><title>Reports on product movement</title></head> <body> <p data-block-key="2wst7">The reports on product movement will be useful for forming supplier orders and controlling outcomes.</p> </body> ``` Html2TextTransformer: ```python [Document(page_content='The reports on product movement will be useful for forming supplier orders and\ncontrolling outcomes.\n\n')] # Here we can see 'and\ncontrolling', which has extra '\n' in it ``` MarkdownifyTranformer: ```python [Document(page_content='Reports on product movement\n\nThe reports on product movement will be useful for forming supplier orders and controlling outcomes.')] ``` --------- Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.bbrouter> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.local> Co-authored-by: Sokolov Fedor <f.sokolov@192.168.1.6>	2024-05-08 14:45:13 -07:00
Alex JW	d3ce6aad2e	community: Instantiate GPT4AllEmbeddings with parameters (#21238 ) ### GPT4AllEmbeddings parameters --- Description: As of right now the Embed4All class inside _GPT4AllEmbeddings_ is instantiated as it's default which leaves no room to customize the chosen model and it's behavior. Thus: - GPT4AllEmbeddings can now be instantiated with custom parameters like a different model that shall be used. --------- Co-authored-by: AlexJauchWalser <alexander.jauch-walser@knime.com>	2024-05-08 14:44:47 -07:00
Philippe PRADOS	7be68228da	community[patch]: Make sql record manager fully compatible with async (#20735 ) The `_amake_session()` method does not allow modifying the `self.session_factory` with anything other than `async_sessionmaker`. This prohibits advanced uses of `index()`. In a RAG architecture, it is necessary to import document chunks. To keep track of the links between chunks and documents, we can use the `index()` API. This API proposes to use an SQL-type record manager. In a classic use case, using `SQLRecordManager` and a vector database, it is impossible to guarantee the consistency of the import. Indeed, if a crash occurs during the import (problem with the network, ...) there is an inconsistency between the SQL database and the vector database. With the [PR](https://github.com/langchain-ai/langchain-postgres/pull/32) we are proposing for `langchain-postgres`, it is now possible to guarantee the consistency of the import of chunks into a vector database. It's possible only if the outer session is built with the connection. ```python def main(): db_url = "postgresql+psycopg://postgres:password_postgres@localhost:5432/" engine = create_engine(db_url, echo=True) embeddings = FakeEmbeddings() pgvector:VectorStore = PGVector( embeddings=embeddings, connection=engine, ) record_manager = SQLRecordManager( namespace="namespace", engine=engine, ) record_manager.create_schema() with engine.connect() as connection: session_maker = scoped_session(sessionmaker(bind=connection)) # NOTE: Update session_factories record_manager.session_factory = session_maker pgvector.session_maker = session_maker with connection.begin(): loader = CSVLoader( "data/faq/faq.csv", source_column="source", autodetect_encoding=True, ) result = index( source_id_key="source", docs_source=loader.load()[:1], cleanup="incremental", vector_store=pgvector, record_manager=record_manager, ) print(result) ``` The same thing is possible asynchronously, but a bug in `sql_record_manager.py` in `_amake_session()` must first be fixed. ```python async def _amake_session(self) -> AsyncGenerator[AsyncSession, None]: """Create a session and close it after use.""" # FIXME: REMOVE if not isinstance(self.session_factory, async_sessionmaker):~~ if not isinstance(self.engine, AsyncEngine): raise AssertionError("This method is not supported for sync engines.") async with self.session_factory() as session: yield session ``` Then, it is possible to do the same thing asynchronously: ```python async def main(): db_url = "postgresql+psycopg://postgres:password_postgres@localhost:5432/" engine = create_async_engine(db_url, echo=True) embeddings = FakeEmbeddings() pgvector:VectorStore = PGVector( embeddings=embeddings, connection=engine, ) record_manager = SQLRecordManager( namespace="namespace", engine=engine, async_mode=True, ) await record_manager.acreate_schema() async with engine.connect() as connection: session_maker = async_scoped_session( async_sessionmaker(bind=connection), scopefunc=current_task) record_manager.session_factory = session_maker pgvector.session_maker = session_maker async with connection.begin(): loader = CSVLoader( "data/faq/faq.csv", source_column="source", autodetect_encoding=True, ) result = await aindex( source_id_key="source", docs_source=loader.load()[:1], cleanup="incremental", vector_store=pgvector, record_manager=record_manager, ) print(result) asyncio.run(main()) ``` --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Sean <sean@upstage.ai> Co-authored-by: JuHyung-Son <sonju0427@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: YISH <mokeyish@hotmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Jason_Chen <820542443@qq.com> Co-authored-by: Joan Fontanals <joan.fontanals.martinez@jina.ai> Co-authored-by: Pavlo Paliychuk <pavlo.paliychuk.ca@gmail.com> Co-authored-by: fzowl <160063452+fzowl@users.noreply.github.com> Co-authored-by: samanhappy <samanhappy@gmail.com> Co-authored-by: Lei Zhang <zhanglei@apache.org> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: merdan <48309329+merdan-9@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Andres Algaba <andresalgaba@gmail.com> Co-authored-by: davidefantiniIntel <115252273+davidefantiniIntel@users.noreply.github.com> Co-authored-by: Jingpan Xiong <71321890+klaus-xiong@users.noreply.github.com> Co-authored-by: kaka <kaka@zbyte-inc.cloud> Co-authored-by: jingsi <jingsi@leadincloud.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Rahul Triptahi <rahul.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Shengsheng Huang <shannie.huang@gmail.com> Co-authored-by: Michael Schock <mjschock@users.noreply.github.com> Co-authored-by: Anish Chakraborty <anish749@users.noreply.github.com> Co-authored-by: am-kinetica <85610855+am-kinetica@users.noreply.github.com> Co-authored-by: Dristy Srivastava <58721149+dristysrivastava@users.noreply.github.com> Co-authored-by: Matt <matthew.gotteiner@microsoft.com> Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2024-05-08 17:31:11 -04:00
Andreas Motl	17e42bbd18	community[patch]: pgvector: Slight refactoring to make code a bit more reusable (#16243 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to make it reusable by adapters deriving from that. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 - Addressed to: @eyurtsev, @cbornet Hi from the CrateDB team, first of all, thanks a stack for conceiving and maintaining LangChain. We are currently [preparing a patch](https://github.com/crate-workbench/langchain/pull/1) for adding [CrateDB](https://github.com/crate/crate) to the list of community adapters. Because CrateDB aims to be compatible with PostgreSQL to some degree, the vector store subsystem in LangChain derives functionality from the corresponding implementation for pgvector. Therefore, in order to make the implementation more reusable, we needed to rename the private methods `__from` and `__query_collection` to the less private counterparts `_from` and `_query_collection`, so they can be overwritten, in order to unlock other adapters deriving from [pgvector](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py). With kind regards, Andreas.	2024-05-08 17:21:30 -04:00
Mehrdad Shokri	f103927b88	bugfix(community): fix Playwright import paths. (#21395 ) - Description: Fix import class name exporeted from 'playwright.async_api' and 'playwright.sync_api' to match the correct name in playwright tool. Change import from inline guard_import to helper function that calls guard_import to make code more readable in gmail tool. Upgrade playwright version to 1.43.0 - Issue: #21354 - Dependencies: upgrade playwright version(this is not required for the bugfix itself, just trying to keep dependencies fresh. I can remove the playwright version upgrade if you want.)	2024-05-08 14:20:25 -07:00
Shailendra Mishra	aa966b6161	Replaced bind variable in SQL with formatted string for compatibility with sql syntax. (#21439 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-08 13:51:30 -07:00
Eugene Yurtsev	f92006de3c	multiple: langchain 0.2 in master (#21191 ) 0.2rc migrations - [x] Move memory - [x] Move remaining retrievers - [x] graph_qa chains - [x] some dependency from evaluation code potentially on math utils - [x] Move openapi chain from `langchain.chains.api.openapi` to `langchain_community.chains.openapi` - [x] Migrate `langchain.chains.ernie_functions` to `langchain_community.chains.ernie_functions` - [x] migrate `langchain/chains/llm_requests.py` to `langchain_community.chains.llm_requests` - [x] Moving `langchain_community.cross_enoders.base:BaseCrossEncoder` -> `langchain_community.retrievers.document_compressors.cross_encoder:BaseCrossEncoder` (namespace not ideal, but it needs to be moved to `langchain` to avoid circular deps) - [x] unit tests langchain -- add pytest.mark.community to some unit tests that will stay in langchain - [x] unit tests community -- move unit tests that depend on community to community - [x] mv integration tests that depend on community to community - [x] mypy checks Other todo - [x] Make deprecation warnings not noisy (need to use warn deprecated and check that things are implemented properly) - [x] Update deprecation messages with timeline for code removal (likely we actually won't be removing things until 0.4 release) -- will give people more time to transition their code. - [ ] Add information to deprecation warning to show users how to migrate their code base using langchain-cli - [ ] Remove any unnecessary requirements in langchain (e.g., is SQLALchemy required?) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 16:46:52 -04:00
ccurme	6b392d6d12	robocorp: release 0.0.6 (#21441 )	2024-05-08 16:16:24 -04:00
Tommi Holmgren	ee35b9ba56	langchain-robocorp: remove toolkit return content max length (#21436 ) Robocorp (action server) toolkit had a limitation that the content length returned by the tool was always cut to max 5000 chars. This was from the time when context windows were much more limited. This PR removes the limitation. Whatever the underlying tool provides gets sent back to the agent. As the robocorp toolkit no longer restricts the content, the implication is that either the Action (tool) developer or the agent developer needs to be aware of potentially oversized tool responses. Our point of view is this should be the agent developer's responsibility, them being in control of the use case and aware of the context window the LLM has.	2024-05-08 15:05:55 -04:00
JuHyung Son	710e57d779	upstage: deprecate UPSTAGE_DOCUMENT_AI_API_KEY (#21363 ) Description: We are merging UPSTAGE_DOCUMENT_AI_API_KEY and UPSTAGE_API_KEY into one, and only UPSTAGE_API_KEY will be used going forward. And we changed the base class of ChatUpstage to BaseChatOpenAI. --------- Co-authored-by: Sean <chosh0615@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 18:02:26 +00:00
Erick Friis	6a295d1ec0	upstage: release 0.1.4 (#21432 )	2024-05-08 17:57:40 +00:00
Mateusz Szewczyk	7926cc1929	ibm: Fix llm and embeddings "verify" attribute default value (#21429 ) Thank you for contributing to LangChain! - [x] PR title: "langchain-ibm: Fix llm and embeddings 'verify' attribute default value" - [x] PR message: - Description: fix default value of "verify" attribute - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 17:23:14 +00:00
Dobiichi-Origami	5b00885b49	community: add `bind_tools` and `with_structured_output` support to `QianfanChatEndpoint` (#21412 ) …Endpoint` Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: add `bind_tools` and `with_structured_output` support to `QianfanChatEndpoint` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-05-08 11:35:10 -04:00
Silas Xu	aafaf3e193	The predict_and_parse is deprecated, instead pass an output parser directly to LLMChain. (#20130 ) The `predict_and_parse` method is deprecated, instead pass an output parser directly to LLMChain. - [x] PR title: "langchain: update chain_extract.py" ![image](https://github.com/langchain-ai/langchain/assets/40889019/e950d79f-5a0f-4086-86e9-89f627990fe5) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-08 09:32:17 -04:00
ccurme	3c31bd0ed0	langchain: update use of predict_and_parse in LLMChainFilter (#21389 ) Following https://github.com/langchain-ai/langchain/pull/20130 Removes deprecation warnings in docs here: https://python.langchain.com/docs/modules/data_connection/retrievers/contextual_compression/ Tested using the same docs notebook + existing integration test.	2024-05-08 09:31:33 -04:00
Erick Friis	bbdf0f8801	experimental[patch]: core and langchain dep (#21402 )	2024-05-07 21:39:34 -07:00
Erick Friis	e4aca0d052	experimental[patch]: release 0.0.58 (#21397 )	2024-05-08 03:52:44 +00:00
Leonid Ganeline	791d59a2c8	community: `callbacks` guard_imports (#21173 ) Issue: we have several helper functions to import third-party libraries like import_uptrain in [community.callbacks](https://api.python.langchain.com/en/latest/callbacks/langchain_community.callbacks.uptrain_callback.import_uptrain.html#langchain_community.callbacks.uptrain_callback.import_uptrain). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-07 15:04:54 -07:00
Rahul Triptahi	7994cba18d	[Community][Minor]: Fetch loader_source of GoogleDriveLoader in PebbloSafeLoader. (#21314 ) Description: This PR includes fix for loader_source to be fetched from metadata in case of GdriveLoaders. Documentation: NA Unit Test: NA Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-05-07 14:45:58 -07:00
Nuno Campos	ad0f3c14c2	core: allow mermaid node labels to have any characters (#21385 ) - it's only node ids that are limited Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-07 12:16:49 -07:00
Eugene Yurtsev	6a1d61dbf1	community[patch]: Fix in memory vectorstore to take into account ids when adding docs (#21384 ) Should respect `ids` if passed	2024-05-07 15:05:16 -04:00
Miroslav	04e2611fea	Added additional headers for HuggingFaceInferenceAPIEmbeddings endpoint. (#21282 ) Thank you for contributing to LangChain! - [ ] HuggingFaceInferenceAPIEmbeddings: "Additional Headers" - Where: langchain, community, embeddings. huggingface.py. - Community: add additional headers when needed by custom HuggingFace TEI embedding endpoints. HuggingFaceInferenceAPIEmbeddings" - [ ] PR message: *Delete this entire checklist* and replace with - Description: Adding the `additional_headers` to be passed to requests library if needed - Dependencies: none - [ ] Add tests and docs: If you're adding a new integration, please include 1. Tested with locally available TEI endpoints with and without `additional_headers` 2. Example Usage ```python embeddings=HuggingFaceInferenceAPIEmbeddings( api_key=MY_CUSTOM_API_KEY, api_url=MY_CUSTOM_TEI_URL, additional_headers={ "Content-Type": "application/json" } ) ``` Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Massimiliano Pronesti <massimiliano.pronesti@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-05-07 14:17:53 -04:00
Guangdong Liu	1fe66f5d39	community(patch) fix MoonshotChat moonshot_api_key is invaild for api key (#21361 ) Description: close https://github.com/langchain-ai/langchain/issues/21237 @baskaryan, @eyurtsev	2024-05-07 08:44:30 -07:00
Tomaz Bratanic	0bf7596839	Add simple node properties to llm graph transformer (#21369 ) Add support for simple node properties in llm graph transformer. Linter and dynamic pydantic classes aren't friends, hence I added two ignores	2024-05-07 08:41:09 -07:00
ccurme	080af0ec53	langchain: sync -> async methods in OpenAI assistants (#21378 )	2024-05-07 10:25:55 -04:00
Tomaz Bratanic	ad3fd44a7f	experimental: Fix llm graph transformer bug (#21362 )	2024-05-06 23:59:55 -07:00
Erick Friis	bb81ae5c8c	together: fix chat model and embedding classes (#21353 )	2024-05-06 18:26:03 -07:00
Hassan El Mghari	d6ef5fe86a	together: add chat models, use openai base (#21337 ) Description: Adding chat completions to the Together AI package, which is our most popular API. Also staying backwards compatible with the old API so folks can continue to use the completions API as well. Also moved the embedding API to use the OpenAI library to standardize it further. Twitter handle: @nutlope - [x] Add tests and docs: If you're adding a new integration, please include - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 17:47:06 -07:00
Jacob Lee	a2d31307bb	Adds confirmation logs after creating a new project (#12618 ) @efriis @hwchase17 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 23:28:12 +00:00
Erick Friis	0fb93cd740	core: release 0.1.52 (#21350 )	2024-05-06 22:20:35 +00:00
Wu Enze	32c61b3ece	community[patch]: chat message history mypy fixes #17048 (#20114 ) Relates [#17048] Description : Applied fix to redis and neo4j file. Error was : `Cannot override writeable attribute with read-only property` fix with the same solution of [[langchain/libs/community/langchain_community/chat_message_histories/elasticsearch.py](`d5c412b0a9/libs/community/langchain_community/chat_message_histories/elasticsearch.py (L170-L175)`)] --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 22:17:45 +00:00
nrpd25	95cc8e3fc3	premai[patch]:Standardized model init args (#21308 ) [Standardized model init args #20085](https://github.com/langchain-ai/langchain/issues/20085) - Enable premai chat model to be initialized with `model_name` as an alias for `model`, `api_key` as an alias for `premai_api_key`. - Add initialization test `test_premai_initialization` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 18:12:29 -04:00
Nuno Campos	6f17158606	fix: core: Include in json output also fields set outside the constructor (#21342 )	2024-05-06 14:37:36 -07:00
Tomaz Bratanic	ac14f171ac	Add indexed properties to neo4j enhanced schema (#21335 )	2024-05-06 14:28:34 -07:00
scaserini	a6cdf6572f	community: add Kendra DocumentRelevanceOverrideConfigurations request parameter (#20695 ) - Description: add DocumentRelevanceOverrideConfigurations request parameter to Kendra retriever Co-authored-by: Simone Caserini <simone.caserini@klarna.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-06 14:26:36 -07:00
Nuno Campos	0345bcf4ef	Fix failing test for serialization (#21344 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-06 21:19:54 +00:00
Trayan Azarov	93226b1945	community: Updated Chroma version range to include 0.5.0 release (#21224 ) - Updated Chroma version range to allow releases in 0.5.x. - Bumped mypy version as linting was failing	2024-05-06 13:31:40 -07:00
Jorge Piedrahita Ortiz	e65652c3e8	community: add SambaNova embeddings integration (#21227 ) - Description: SambaNova hosted embeddings integration	2024-05-06 13:29:59 -07:00
Jorge Piedrahita Ortiz	df1c10260c	community: minor changes sambanova integration (#21231 ) - Description: fix: variable names in root validator not allowing pass credentials as named parameters in llm instancing, also added sambanova's sambaverse and sambastudio llms to __init__.py for module import	2024-05-06 13:28:35 -07:00
Jan Soubusta	d9a61c0fa9	fix: respect table_name argument when calling from_texts (#21252 ) valid for from_documents() as well fixes #21251	2024-05-06 20:28:22 +00:00
Pedro Lima	bebf46c4a2	community: added args_schema to YahooFinanceNewsTool (#21232 ) Description: this change adds args_schema (pydantic BaseModel) to YahooFinanceNewsTool for correct schema formatting on LLM function calls Issue: currently using YahooFinanceNewsTool with OpenAI function calling returns the following error "TypeError("YahooFinanceNewsTool._run() got an unexpected keyword argument '__arg1'")". This happens because the schema sent to the LLM is "input: "{'__arg1': 'MSFT'}"" while the method should be called with the "query" parameter. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-06 13:27:54 -07:00
Mark Cusack	060987d755	community[minor]: Add indexing via locality sensitive hashing to the Yellowbrick vector store (#20856 ) - Description: Add LSH-based indexing to the Yellowbrick vector store module - Twitter handle: @markcusack --------- Co-authored-by: markcusack <markcusack@markcusacksmac.lan> Co-authored-by: markcusack <markcusack@Mark-Cusack-sMac.local> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-06 20:18:02 +00:00
Rashmi Pawar	a2fdabdad2	mark NemoEmbeddings as deprecated (#21239 ) The NemoEmbeddings is deprecated, instead use langchain-nvidia-ai-endpoints NVIDIAEmbeddings interface. cc: @mattf --------- Co-authored-by: Daniel Glogowski <167348611+dglogo@users.noreply.github.com> Co-authored-by: andyjessen <62343929+andyjessen@users.noreply.github.com> Co-authored-by: Chris Germann <88305668+TAAGECH9@users.noreply.github.com> Co-authored-by: gere <gere@kapo.zh.ch> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 19:44:58 +00:00
Erick Friis	9e4b24a2d6	langchain: release 0.1.18 (#21338 )	2024-05-06 19:39:46 +00:00
Erick Friis	5c000f8d79	community: release 0.0.37 (#21332 )	2024-05-06 12:17:42 -07:00
Leonid Ganeline	8c13e8a79b	langchain: `qa_chain` fix (#21279 ) Issue: `load_qa_chain` is placed in the __init__.py file. As a result, it is not listed in the API Reference docs. BTW `load_qa_chain` is heavily presented in the doc examples, but is missed in API Ref. Change: moved code from init.py into a new file. Related: #21266	2024-05-06 14:45:51 -04:00
Erick Friis	7ecf9996f1	community: Revert "community: langkit dependency" (#21333 ) Reverts langchain-ai/langchain#21174 Hey team - going to revert this because it doesn't seem necessary for testing. We should only be adding optional + extended_testing dependencies for deps that have extended tests. otherwise it just increases probability of dependency conflicts in the community lockfile.	2024-05-06 18:44:41 +00:00
Param Singh	fee91d43b7	baichuan[patch]:standardize chat init args (#21298 ) Thank you for contributing to LangChain! community:baichuan[patch]: standardize init args updated `baichuan_api_key` so that aliased to `api_key`. Added test that it continues to set the same underlying attribute. Test checks for `SecretStr` updated `temperature` with Pydantic Field, added unit test. Related to https://github.com/langchain-ai/langchain/issues/20085	2024-05-06 18:33:57 +00:00
Christophe Bornet	484a009012	community[minor]: Relax constraints on Cassandra VectorStore constructors (#21209 ) If Session and/or keyspace are not provided, they are resolved from cassio's context. So they are not required. This change is fully backward compatible.	2024-05-06 14:32:32 -04:00
Leonid Ganeline	6feddfae88	community: langkit dependency (#21174 ) Issue: the `langkit` package is not presented in the `pyproject.toml` but it is a requirement for the `WhyLabsCallbackHandler` Change: added `langkit` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 18:09:31 +00:00
Erick Friis	811e9cee8b	core: release 0.1.51 (#21328 )	2024-05-06 10:40:19 -07:00
Mateusz Szewczyk	682d21c3de	ibm: Add support for ibm-watsonx-ai new major version (#21313 ) Thank you for contributing to LangChain! - [x] PR title: "langchain-ibm: Add support for ibm-watsonx-ai new major version" - [x] PR message: - Description: Add support for ibm-watsonx-ai new major version - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 16:48:26 +00:00
Chris Papademetrious	ee6c922c91	langchain[minor]: enhance `LocalFileStore` to offer `update_atime` parameter that updates access times on read (#20951 ) Description: The `LocalFileStore` class can be used to create an on-disk `CacheBackedEmbeddings` cache. The number of files in these embeddings caches can grow to be quite large over time (hundreds of thousands) as embeddings are computed for new versions of content, but the embeddings for old/deprecated content are not removed. A least-recently-used (LRU) cache policy could be applied to the `LocalFileStore` directory to delete cache entries that have not been referenced for some time: ```bash # delete files that have not been accessed in the last 90 days find embeddings_cache_dir/ -atime 90 -print0 \| xargs -0 rm ``` However, most filesystems in enterprise environments disable access time modification on read to improve performance. As a result, the access times of these cache entry files are not updated when their values are read. To resolve this, this pull request updates the `LocalFileStore` constructor to offer an `update_atime` parameter that causes access times to be updated when a cache entry is read. For example, ```python file_store = LocalFileStore(temp_dir, update_atime=True) ``` The default is `False`, which retains the original behavior. Testing: I updated the LocalFileStore unit tests to test the access time update.	2024-05-06 11:52:29 -04:00
Tomaz Bratanic	5b6d1a907d	Add the extract types to diffbot graph transformer (#21315 ) Before you could only extract triples (diffbot calls it facts) from diffbot to avoid isolated nodes. However, sometimes isolated nodes can still be useful like for prefiltering, so we want to allow users to extract them if they want. Default behaviour is unchanged.	2024-05-06 09:19:52 -04:00
aditya thomas	b868c78a12	partners[anthropic]: update unit test for key passed in from the environment (#21290 ) Description: Update unit test for ChatAnthropic Issue: Test for key passed in from the environment should not have the key initialized in the constructor Dependencies: None	2024-05-05 16:19:10 -04:00
Rohan Aggarwal	8021d2a2ab	community[minor]: Oraclevs integration (#21123 ) Thank you for contributing to LangChain! - Oracle AI Vector Search Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. - Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. This Pull Requests Adds the following functionalities Oracle AI Vector Search : Vector Store Oracle AI Vector Search : Document Loader Oracle AI Vector Search : Document Splitter Oracle AI Vector Search : Summary Oracle AI Vector Search : Oracle Embeddings - We have added unit tests and have our own local unit test suite which verifies all the code is correct. We have made sure to add guides for each of the components and one end to end guide that shows how the entire thing runs. - We have made sure that make format and make lint run clean. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: skmishraoracle <shailendra.mishra@oracle.com> Co-authored-by: hroyofc <harichandan.roy@oracle.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-04 03:15:35 +00:00
ccurme	c9e9470c5a	langchain: fix deprecation decorators on extraction chains (#21276 ) Calling any of these raises ``` ValueError: A pending deprecation cannot have a scheduled removal ```	2024-05-03 18:29:40 -04:00
Wickes Wong	ee1adaacaa	langchain[patch]: Fix summary buffer memory with return message flag (#21115 ) ## Description Memory return could be set as `str` or `message` by `return_messages` flag as mentioned in https://python.langchain.com/docs/modules/memory/#whether-memory-is-a-string-or-a-list-of-messages, where `langchain.chains.conversation.memory.ConversationSummaryBufferMemory` did not implement that. This commit added `buffer_as_str` and `buffer_as_messages` function, and `buffer` now affected by `return_messages` flag. ## Example Test Code and Output ```python # Fix: ConversationSummaryBufferMemory with return_messages flag function # Test code from langchain.chains.conversation.memory import ConversationSummaryBufferMemory from langchain_community.llms.ollama import Ollama llm = Ollama() # Create an instance of ConversationSummaryBufferMemory with return_messages set to True memory = ConversationSummaryBufferMemory(return_messages=True, llm=llm) # Add user and AI messages to the chat memory memory.chat_memory.add_user_message("hi!") memory.chat_memory.add_ai_message("what's up?") # Print the buffer print("Buffer:") print(map(type, memory.buffer), sep="\n") print(memory.buffer, "\n") # Print the buffer as a string print("Buffer as String:") print(type(memory.buffer_as_str)) print(memory.buffer_as_str, "\n") # Print the buffer as messages print("Buffer as Messages:") print(map(type, memory.buffer_as_messages), sep="\n") print(memory.buffer_as_messages, "\n") # Print the buffer after setting return_messages to False memory.return_messages = False print("Buffer after setting return_messages to False:") print(type(memory.buffer)) print(memory.buffer, "\n") ``` ```plaintext Buffer: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer as String: <class 'str'> Human: hi! AI: what's up? Buffer as Messages: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer after setting return_messages to False: <class 'str'> Human: hi! AI: what's up? ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-03 17:25:09 -04:00
Leonid Ganeline	9639457222	community[patch]: `tools` imports (#21156 ) Issue: we have several helper functions to import third-party libraries like tools.gmail.utils.import_google in [community.tools](https://api.python.langchain.com/en/latest/community_api_reference.html#id37). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-03 17:22:45 -04:00
Leonid Ganeline	3ef8b24277	core[patch]: `utils.guard_import` fix (#21133 ) Issues (nit): 1. `utils.guard_import` prints wrong error message when there is an import `error.` It prints the whole `module_name` but should be only the first part as the pip package name. E.i. `langchain_core.utils` -> print not `langchain-core` but `langchain_core.utils`. Also replace '_' with '-' in the pip package name. 2. it does not handle the `ModuleNotFoundError` which raised if `guard_import("wrong_module")` Fixed issues; added ut-s. Controversial: I've reraised `ModuleNotFoundError` as `ImportError`, since in case of the error, the proposed action is the same - we need to install a missed package.	2024-05-03 17:21:36 -04:00
Erick Friis	36c2ca3c8b	mistralai: relax tokenizers dep (#21277 )	2024-05-03 14:16:22 -07:00
Nuno Campos	6e1e0c7d5c	fix: core: draw_mermaid() would create subgroup for edges with same src and tgt (#21275 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-03 13:51:08 -07:00
Eugene Yurtsev	26a37dce0a	langchain[patch]: Remove jsonpatch from poetry file (#21272 ) jsonpatch is only used in langchain-core not in langchain	2024-05-03 15:46:05 -04:00
Eugene Yurtsev	335bd01e45	langchain[patch]: Update deprecation warning (#21268 ) Update deprecation warning	2024-05-03 15:31:29 -04:00
Leonid Ganeline	23a05c3986	langchain: `summarize` chain fix (#21266 ) Issue: `load_summarize_chain` is placed in the __init__.py file. As a result, it doesn't listed in the API Reference docs. Change: moved code from __init__.py into a new file.	2024-05-03 14:44:39 -04:00
ccurme	6da3d92b42	(all): update removal in deprecation warnings from 0.2 to 0.3 (#21265 ) We are pushing out the removal of these to 0.3. `find . -type f -name "*.py" -exec sed -i '' 's/removal="0\.2/removal="0.3/g' {} +`	2024-05-03 14:29:36 -04:00
Eugene Yurtsev	d6e34f9ee5	langchain[patch]: Improve deprecation warnings (#21262 ) * Remove spurious derprecation warning * Make deprecation warnings consistent with 0.1 namespaces that were announced as deprecated	2024-05-03 13:40:16 -04:00
Eugene Yurtsev	487aff7e46	langchain[patch]: Revert 20794 until 0.2 release (#21257 ) PR of 2079 was already released as part of 0.1.17rc. Issue for 0.2 release: https://github.com/langchain-ai/langchain/issues/21080	2024-05-03 17:02:48 +00:00
Eugene Yurtsev	ba4a309d98	langchain[patch]: Revert breaking change until 0.2 release (#21256 ) Reverts a minor breaking change until 0.2 release	2024-05-03 09:42:27 -07:00
Eugene Yurtsev	66a1e3f083	langchain[patch]: Fix flaky unit test (#21258 ) Should sort the results of the import test since it depends on import order	2024-05-03 15:55:46 +00:00
Eugene Yurtsev	0989c48028	langchain[minor]: Re-add deleted ainetwork tool (#21254 ) * Adding __init__.py to turn it into a package in community * Adding proxy imports that assume that langchain_community is optional	2024-05-03 11:39:40 -04:00
Christophe Bornet	2fbe82f5e6	community[minor]: Relax constraints on CassandraChatMessageHistory constructor (#21241 )	2024-05-03 10:20:39 -04:00
Chris Germann	3a8d1d8838	Hotfix RetrievalQA Docs: docs: Fix formatting (#21183 ) # Newline Characters breaking formatting Description: As you can see in the image below, the formatting in the documentation is broken. As far as I can see the two added `\n` characters are breaking the documentation. Therefore I would propose to remove those ![image](https://github.com/langchain-ai/langchain/assets/88305668/23b6e726-71b2-4812-91ea-3e8600683733) Dependencies: None Twitter Handle - epu9byj --------- Co-authored-by: gere <gere@kapo.zh.ch> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-03 12:46:29 +00:00
Bagatur	67a5cc34c6	openai[patch]: Release 0.1.6 (#21236 )	2024-05-03 04:10:39 +00:00
Erick Friis	c1eb95b967	core: release 0.1.50 (#21230 )	2024-05-02 22:44:18 +00:00
Nuno Campos	47ce8d5a57	core: tracer: remove numeric execution order (#21220 ) - this hasn't been used in a long time and requires some additional bookkeeping i'm going to streamline in the next pr	2024-05-02 15:38:55 -07:00
Bagatur	6ac6158a07	openai[patch]: support tool_choice="required" (#21216 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-02 18:33:25 -04:00
xindoo	c1aa237bc2	langchain: fix syntax error in code comment for create_tool_calling_agent (#21205 ) PR message: - Description: Corrected a syntax error in the code comments within the `create_tool_calling_agent` function in the langchain package. - Issue: N/A - Dependencies: No additional dependencies required. - Twitter handle: N/A	2024-05-02 19:17:23 +00:00
ccurme	eb0a2fd53a	mistral: release 0.1.6 (#21214 )	2024-05-02 13:59:19 -04:00
ccurme	2d77e5e3a1	(standard tests): add test for basic conversation sequence (#21213 )	2024-05-02 13:47:10 -04:00
Maxime Perrin	1ebb5a70ad	partners(mistralai): Removing unused variable in completion request (using tool_calls or content) (#21201 ) This PR fixes #21196. The error was occurring when calling chat completion API with a chat history. Indeed, the Mistral API does not accept both `content` and `tool_calls` in the same body. This PR removes one of theses variables depending on the necessity. --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-02 13:20:14 -04:00
Christophe Bornet	683fb45c6b	community[patch]: Refactor CassandraDatabase wrapper (#21075 ) * Introduce individual `fetch_` methods for easier typing. * Rework some docstrings to google style * Move some logic to the tool * Merge the 2 cassandra utility files	2024-05-02 13:13:08 -04:00
Raghav Dixit	7d451d0041	community[patch]: Update lancedb.py (#21192 ) very minor update in LanceDB integration, 'metric' argument was missing.	2024-05-02 17:06:39 +00:00
Bagatur	d297d90ad9	core[patch]: Release 0.1.49 (#21211 )	2024-05-02 17:06:27 +00:00
Nuno Campos	663747b730	core[patch]: Fixes for convert_messages (#21207 ) - support two-tuples of any sequence type (eg. json.loads never produces tuples) - support type alias for role key - if id is passed in in dict form use it - if tool_calls passed in in dict form use them --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-02 16:55:42 +00:00
Eugene Yurtsev	df49404794	langchain[patch]: Make more memory code handle community dependency as optional (#21199 )	2024-05-02 11:05:26 -04:00
ccurme	bd5d2c2674	langchain: import InMemoryChatMessageHistory from core (#21198 )	2024-05-02 14:53:07 +00:00
Eugene Yurtsev	3cd7fced5f	langchain[patch],community[minor]: Migrate memory implementations to community (#20845 ) Migrates memory implementations to community	2024-05-02 10:46:50 -04:00
Eugene Yurtsev	b5c3a04e4b	langchain[patch]: chat histories to handle optional community dependence (#21194 )	2024-05-02 10:36:08 -04:00
Eugene Yurtsev	c9119b0e75	langchain[patch],community[minor]: Move some unit tests from langchain to community, use core for fake models (#21190 )	2024-05-02 09:57:52 -04:00
Eugene Yurtsev	c306364b06	langchain[patch]: Update more code to use langchain community as an optional dependency (#21170 ) More code to use langchain community as an optional dependency	2024-05-02 09:05:48 -04:00
Bagatur	6fa8626e2f	openai[patch]: fix azure open lc serialization, release 0.1.5 (#21159 )	2024-05-01 18:03:29 -04:00
Eugene Yurtsev	94a838740e	langchain[patch]: Migrate more code in utils to use optional langchain import (#21166 ) Moving is interactive util to avoid circular deps	2024-05-01 17:18:42 -04:00
Eugene Yurtsev	23fdd320bc	langchain[patch]: Migrate more code to use optional community in agents namespace (#21167 )	2024-05-01 16:25:44 -04:00
Tomaz Bratanic	9e53fa7d2e	Some more fixes to neo4j enhanced schema (#21139 )	2024-05-01 13:12:43 -07:00
Erick Friis	0694538c39	ai21: fix core version (#21168 )	2024-05-01 13:10:22 -07:00
Eugene Yurtsev	44602bdc20	langchain[patch],community[minor]: Move load_tools to community (#21158 ) Move load tools to community	2024-05-01 16:05:41 -04:00
Eugene Yurtsev	9932f49b3e	langchain[patch]: Migrate llms to use optional community imports (#21101 )	2024-05-01 16:04:45 -04:00
Eugene Yurtsev	57e8e70daa	langchain[patch]: Migrate chat models to optional community imports (#21090 ) Migrate chat models to optional community imports	2024-05-01 16:04:12 -04:00
Eugene Yurtsev	2914abd747	langchain[patch]: Fix how the serializable test identifies serializable objects (#21165 ) dir() will not work if we're using optional imports. The only way to do this is by using contents of __all__	2024-05-01 15:56:11 -04:00
Eugene Yurtsev	23c5d87311	langchain[patch]: Migrate utils to use optional langchain_community (#21163 ) Migrate utils to use optional imports from langchain community	2024-05-01 15:24:02 -04:00
Eugene Yurtsev	bec3eee3fa	langchain[patch]: Migrate retrievers to use optional langchain community imports (#21155 )	2024-05-01 14:44:44 -04:00
Eugene Yurtsev	43110daea5	langchain[patch]: Update some agent tool kits to handle community import as optional (#21157 ) A few things that were not caught by the migration script	2024-05-01 14:22:54 -04:00
Eugene Yurtsev	59f10ab3e0	langchain[patch]: Migrate embeddings to optional imports (#21099 )	2024-05-01 13:47:37 -04:00
Eugene Yurtsev	2f709d94d7	langchain[patch]: Migrate vectorstores to use optional langchain community imports (#21150 )	2024-05-01 13:33:37 -04:00
Eugene Yurtsev	7230e430db	langchain[patch]: Migrate top level files to use optional langchain community (#21152 ) Migrate a few top level files to treat langchain community as an optional dependency	2024-05-01 13:23:03 -04:00
Erick Friis	daab9789a8	ai21: release 0.1.4 (#21151 )	2024-05-01 17:16:27 +00:00
Asaf Joseph Gardin	642975dd9f	partners: AI21 Labs Jamba Support (#20815 ) Description: Added support for AI21 new model - Jamba Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-01 10:12:44 -07:00
Eugene Yurtsev	7a39fe60da	langchain[patch]: Migrate utilities to handle langchain community as optional (#21149 )	2024-05-01 13:09:34 -04:00
Eugene Yurtsev	b879184595	langchain[patch]: embedddings distance move import of openai embeddings into local scope (#21148 )	2024-05-01 12:51:51 -04:00
Eugene Yurtsev	0e5bf16d00	langchain[patch]: Migrate document loaders to use optional langchain community imports (#21095 )	2024-05-01 11:26:25 -04:00
Harrison Chase	4d1c21d97d	community[patch]: Fix alternative name in deprecation notice for sql_database (#21144 )	2024-05-01 10:59:42 -04:00
East Agile	2a6f78a53f	community[minor]: Rememberizer retriever (#20052 ) Description: This pull request introduces a new feature for LangChain: the integration with the Rememberizer API through a custom retriever. This enables LangChain applications to allow users to load and sync their data from Dropbox, Google Drive, Slack, their hard drive into a vector database that LangChain can query. Queries involve sending text chunks generated within LangChain and retrieving a collection of semantically relevant user data for inclusion in LLM prompts. User knowledge dramatically improved AI applications. The Rememberizer integration will also allow users to access general purpose vectorized data such as Reddit channel discussions and US patents. Issue: N/A Dependencies: N/A Twitter handle: https://twitter.com/Rememberizer	2024-05-01 10:41:44 -04:00
Eugene Yurtsev	1ce1a10f2b	langchain[patch],community[minor]: Move graph index creator (#20795 ) Move graph index creator to community	2024-05-01 10:04:30 -04:00
Eugene Yurtsev	aa0bc7467c	langchain[patch]: Migrate agents module into optional imports for community (#21088 )	2024-05-01 09:36:03 -04:00
Eugene Yurtsev	86ff8a3fb4	langchain[patch]: Update docstore module to use optional imports from community (#21091 )	2024-05-01 09:35:05 -04:00
Eugene Yurtsev	d640605694	langchain[patch]: Migrate chat loaders to optional community imports (#21089 ) Migrate chat loaders to optional community imports	2024-05-01 09:34:44 -04:00
Eugene Yurtsev	2fcab9acd9	langchain[patch]: Upgrade storage to treat langchain community as optional (#21105 )	2024-05-01 09:33:31 -04:00
William FH	ab55f6996d	[Core] Tracing: update parent run_tree's child_runs (#21049 )	2024-05-01 06:33:08 -07:00
aditya thomas	12b1caf295	openai[patch]: add tests for secret_str for keys (#20982 ) Description: Add tests to check API keys and Active Directory tokens are masked Issue: Resolves #12165 for OpenAI and Azure OpenAI models Dependencies: None Also resolves #12473 which may be closed. Additional contributors @alex4321 (#12473) and @onesolpark (#12542)	2024-05-01 01:26:20 -04:00
Noah	45ddf4d26f	community[patch]: Update comments for lazy_load method (#21063 ) - [ ] PR message: - Description: Refactored the lazy_load method to use asynchronous execution for improved performance. The method now initiates scraping of all URLs simultaneously using asyncio.gather, enhancing data fetching efficiency. Each Document object is yielded immediately once its content becomes available, streamlining the entire process. - Issue: N/A - Dependencies: Requires the asyncio library for handling asynchronous tasks, which should already be part of standard Python libraries in Python 3.7 and above. - Email: [r73327118@gmail.com](mailto:r73327118@gmail.com) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 01:20:57 -04:00
Liu Xiaodong	3b473d10f2	experimental: clean python repl input（experimental：Added code for PythonREPL） (#20930 ) Update python.py（experimental：Added code for PythonREPL） Added code for PythonREPL, defining a static method 'sanitize_input' that takes the string 'query' as input and returns a sanitizing string. The purpose of this method is to remove unwanted characters from the input string, Specifically: 1. Delete the whitespace at the beginning and end of the string (' \s'). 2. Remove the quotation marks (`` ` ``) at the beginning and end of the string. 3. Remove the keyword "python" at the beginning of the string (case insensitive) because the user may have typed it. This method uses regular expressions (regex) to implement sanitizing. It all started with this code： from langchain.agents import Tool from langchain_experimental.utilities import PythonREPL python_repl = PythonREPL() repl_tool = Tool( name="python_repl", description="Remove redundant formatting marks at the beginning and end of source code from input.Use a Python shell to execute python commands. If you want to see the output of a value, you should print it out with `print(...)`.", func=python_repl.run, ) When I call the agent to write a piece of code for me and execute it with the defined code, I must get an error: SyntaxError('invalid syntax', ('<string>', 1, 1,'In', 1, 2)) After checking, I found that pythonREPL has less formatting of input code than the soon-to-be deprecated pythonREPL tool, so I added this step to it, so that no matter what code I ask the agent to write for me, it can be executed smoothly and get the output result. I have tried modifying the prompt words to solve this problem before, but it did not work, and by adding a simple format check, the problem is well resolved. <img width="1271" alt="image" src="https://github.com/langchain-ai/langchain/assets/164149097/c49a685f-d246-4b11-b655-fd952fc2f04c"> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:19:09 +00:00
Ismail Hossain Polas	1fdf63fa6c	community[patch]: update package name to bagelML (#19948 ) Description This pull request updates the Bagel Network package name from "betabageldb" to "bagelML" to align with the latest changes made by the Bagel Network team. The following modifications have been made: - Updated all references to the old package name ("betabageldb") with the new package name ("bagelML") throughout the codebase. - Modified the documentation, and any relevant scripts to reflect the package name change. - Tested the changes to ensure that the functionality remains intact and no breaking changes were introduced. By merging this pull request, our project will stay up to date with the latest Bagel Network package naming convention, ensuring compatibility and smooth integration with their updated library. Please review the changes and provide any feedback or suggestions. Thank you!	2024-05-01 01:17:33 -04:00
Tomaz Bratanic	7860e4c649	experimental[patch]: Add support for non-function calling LLMs in llm graph transformers (#21014 )	2024-05-01 01:16:07 -04:00
tianzedavid	5a8909440b	docs: remove repetitive words (#21058 ) remove repetitive words Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-01 01:10:42 +00:00
Tomaz Bratanic	c9e96bb5e2	community[patch]: Fix neo4j enhanced schema bugs (#21072 )	2024-04-30 20:16:26 -04:00
junkeon	8d2909ee25	upstage[minor]: Update few codes and add upstage loader in pdf section (#21085 ) Description: Update UpstageLayoutAnalysisParser and Loader and add upstage loader example in pdf section Dependencies: langchain_community Twitter handle: [@upstageai](https://twitter.com/upstageai) - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-30 20:15:49 -04:00
Bagatur	bef50ded63	openai[patch]: fix special token default behavior (#21131 ) By default handle special sequences as regular text	2024-04-30 20:08:24 -04:00
MacanPN	0f7f448603	community[patch]: add delete() method to AzureSearch vector store (#21127 ) Issue: Currently `AzureSearch` vector store does not implement `delete` method. This PR implements it. This also makes it compatible with LangChain indexer. Dependencies: None Twitter handle: @martintriska1 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:46:18 +00:00
Erick Friis	14422a4220	langchain: fix core dep (#21128 )	2024-04-30 14:55:12 -07:00
Erick Friis	6c938da302	langchain: release 0.1.17 (#21125 )	2024-04-30 14:43:59 -07:00
Eugene Yurtsev	bf95414758	langchain[minor]: enhance unit test to test imports recursively (#21122 )	2024-04-30 17:05:53 -04:00
Eugene Yurtsev	e4f51f59a2	langchain[patch]: Migrate tools to treat community imports as optional (#21117 ) Migrate tools to treat community imports as optional	2024-04-30 16:26:18 -04:00
Eugene Yurtsev	9e788f09c6	langchain[patch]: Migrate output parsers to support optional community imports (#21103 ) Migrate output parsers	2024-04-30 16:24:29 -04:00
Eugene Yurtsev	3853fe9f64	langchain[patch]: Migrate graphs to use optional community imports (#21100 ) Migrate graphs to use optional community imports.	2024-04-30 16:24:06 -04:00
Eugene Yurtsev	8658d52587	langchain[patch]: Upgrade prompts to optional imports (#21078 ) Upgrades prompts module to use optional imports. This code was generated with a migration script, but had to be adjusted manually a bit. Testing in preparation for applying this code modification across the rest of the modules in langchain package to reverse the dependency between langchain community and langchain.	2024-04-30 16:23:39 -04:00
Eugene Yurtsev	9b6d04a187	langchain[patch]: Migrate document transformers (#21098 ) Migrate document transformers	2024-04-30 16:20:02 -04:00
Eugene Yurtsev	aec13a6123	langchain[patch]: Migrate callbacks module to use optional imports for community (#21086 )	2024-04-30 16:19:13 -04:00
Erick Friis	8a62fb0570	community: release 0.0.36 (#21118 )	2024-04-30 13:18:44 -07:00
Erick Friis	2407c353be	core: release 0.1.48 (#21113 )	2024-04-30 19:52:36 +00:00
Charlie Marsh	fd94aa8366	partner[patch]: Upgrade to Ruff v0.4.2 (#21108 ) ## Summary No new diagnostics (given that the set of enabled rules hasn't changed), but gains access to our new parser (much faster) and reduced false positives all around.	2024-04-30 15:06:42 -04:00
Jamsheed Mistri	3e749369ef	community[minor]: bump version of LayerupSecurity, add support for untrusted_input parameter (#19985 ) Description: update version of LayerupSecurity package for the Layerup Security integration. Add untrusted_input parameter.	2024-04-30 14:55:26 -04:00
fubuki8087	f1c3687aa5	community[patch]: Using the right encoding to parse the web page in RecursiveUrlLoader (#20632 ) As shown in #13749 , `RecursiveUrlLoader` has encoding issue. This PR is to solve this. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 18:41:36 +00:00
Jakub Pawłowski	b0b1a67771	community[patch]: Skip unexpected 404 HTTP Error in Arxiv download (#21042 ) ### Description: When attempting to download PDF files from arXiv, an unexpected 404 error frequently occurs. This error halts the operation, regardless of whether there are additional documents to process. As a solution, I suggest implementing a mechanism to ignore and communicate this error and continue processing the next document from the list. Proposed Solution: To address the issue of unexpected 404 errors during PDF downloads from arXiv, I propose implementing the following solution: - Error Handling: Implement error handling mechanisms to catch and handle 404 errors gracefully. - Communication: Inform the user or logging system about the occurrence of the 404 error. - Continued Processing: After encountering a 404 error, continue processing the remaining documents from the list without interruption. This solution ensures that the application can handle unexpected errors without terminating the entire operation. It promotes resilience and robustness in the face of intermittent issues encountered during PDF downloads from arXiv. ### Issue: #20909 ### Dependencies: none --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-30 18:29:22 +00:00
Erick Friis	b9c53e95b7	community: release 0.0.35 (#21104 )	2024-04-30 17:48:56 +00:00
Eugene Yurtsev	3c064a757f	core[minor],langchain[patch],community[patch]: Move storage interfaces to core (#20750 ) * Move storage interface to core * Move in memory and file system implementation to core	2024-04-30 13:14:26 -04:00
Charlie Marsh	8f38b7a725	multiple: Remove unnecessary Ruff suppression comments (#21050 ) ## Summary I ran `ruff check --extend-select RUF100 -n` to identify `# noqa` comments that weren't having any effect in Ruff, and then `ruff check --extend-select RUF100 -n --fix` on select files to remove all of the unnecessary `# noqa: F401` violations. It's possible that these were needed at some point in the past, but they're not necessary in Ruff v0.1.15 (used by LangChain) or in the latest release. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-30 17:13:48 +00:00
Erick Friis	748f2ba9ea	core: release 0.1.47 (#21094 )	2024-04-30 09:22:05 -07:00
Eugene Yurtsev	c8f18a2524	langchain[patch]: Update import handling in `adapters` (#21079 )	2024-04-30 10:55:29 -04:00
William FH	5c63ac3dd7	[Patch] Dedent docstring (#20959 ) Technically a slight prompt breaking change, but I think positive EV in that it saves tokens and results in more sane / in-distribution prompts	2024-04-30 07:40:57 -07:00
Eugene Yurtsev	845d8e0025	langchain[patch]: Update handling of deprecation warnings (#21083 ) Chains should not be emitting deprecation warnings.	2024-04-30 10:30:23 -04:00
Christophe Bornet	5c77f45b06	community[minor]: Add async methods to CassandraCache and CassandraSemanticCache (#20654 )	2024-04-30 10:27:44 -04:00
William FH	db14d4326d	[Core] Feat Pretty Print Tool calls (#20997 ) Right now, `tool_calls` are not included in the `pretty_print()` output. Would be nice to show! ![image](https://github.com/langchain-ai/langchain/assets/13333726/6a0ffca3-d02f-4e18-bc76-513eeca2e964)	2024-04-30 07:14:43 -07:00
Kuro Denjiro	fa4124b821	community[minor]: add mintbase loader to langchain (#20089 ) - [x] Add Near NFT loader: "community: Load NFT near block chain using mintbase graph API" - [x] PR message: - Description: a description of the change - Twitter handle:Kurodenjiro --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 04:11:56 +00:00
Alexander Dicke	d7e12750df	community[patch]: allows using `text-generation-inference` /generate route with `HuggingFaceEndpoint` (#20100 ) - Description: allows to use the /generate route of `text-generation-inference` with the `HuggingFaceEndpoint`	2024-04-29 23:09:55 -04:00
davidkgp	28b0b0d863	community[patch]: Fix for github issue #17690 (#20117 ) …/17690 Thank you for contributing to LangChain! - [x] Fix Google Lens knowledge graph issue: "langchain: community" - Fix for [No "knowledge_graph" property in Google Lens API call from SerpAPI](https://github.com/langchain-ai/langchain/issues/17690) - [x] PR message: *Delete this entire checklist* and replace with - Description: handled the existence of keys in the json response of Google Lens - Issue: [No "knowledge_graph" property in Google Lens API call from SerpAPI](https://github.com/langchain-ai/langchain/issues/17690) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-30 00:10:08 +00:00
高远	a7a4630bf4	community[patch]: Modify the text field type and add new exception handling (#20116 ) Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-04-29 20:06:00 -04:00
Rahul Triptahi	c172611647	community[patch]: Add classifier_url argument in PebbloSafeLoader and documentation update. (#21030 ) Description: Add classifier_url argument in PebbloSafeLoader. Documentation: Updated PebbloSafeLoader documentation with above change and new links for pebblo github pages. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-29 17:41:09 -04:00
Leonid Ganeline	08d08d7c83	docs: langchain docstrings updates (#21032 ) Added missed docstings. Formatted docstrings into a consistent format.	2024-04-29 17:40:44 -04:00
Leonid Ganeline	85094cbb3a	docs: community docstring updates (#21040 ) Added missed docstrings. Updated docstrings to consistent format.	2024-04-29 17:40:23 -04:00
Rodrigo Nogueira	90f19028e5	community[patch]: Add maritalk streaming (sync and async) (#19203 ) Co-authored-by: RosevalJr <rdmalajr@gmail.com> Co-authored-by: Roseval Donisete Malaquias Junior <roseval@maritaca.ai>	2024-04-29 21:31:14 +00:00
Cahid Arda Öz	cc6191cb90	community[minor]: Add support for Upstash Vector (#20824 ) ## Description Adding `UpstashVectorStore` to utilize [Upstash Vector](https://upstash.com/docs/vector/overall/getstarted)! #17012 was opened to add Upstash Vector to langchain but was closed to wait for filtering. Now filtering is added to Upstash vector and we open a new PR. Additionally, [embedding feature](https://upstash.com/docs/vector/features/embeddingmodels) was added and we add this to our vectorstore aswell. ## Dependencies [upstash-vector](https://pypi.org/project/upstash-vector/) should be installed to use `UpstashVectorStore`. Didn't update dependencies because of [this comment in the previous PR](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1876522450). ## Tests Tests are added and they pass. Tests are naturally network bound since Upstash Vector is offered through an API. There was [a discussion in the previous PR about mocking the unittests](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1891820567). We didn't make changes to this end yet. We can update the tests if you can explain how the tests should be mocked. --------- Co-authored-by: ytkimirti <yusuftaha9@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 17:25:01 -04:00
Leonid Ganeline	1a2ff56cd8	core[patch[: docstring update (#21036 ) Added missed docstrings. Updated docstrings to consistent format.	2024-04-29 15:35:34 -04:00
Eugene Yurtsev	f479a337cc	langchain[patch]: replace deprecated imports with imports from langchain_core (#21033 ) * Output of running the migration script. * Ran only against langchain code itself and not the unit tests.	2024-04-29 15:34:31 -04:00
Eugene Yurtsev	82d4afcac0	langchain[minor]: Code to handle dynamic imports (#20893 ) Proposing to centralize code for handling dynamic imports. This allows treating langchain-community as an optional dependency. --- The proposal is to scan the code base and to replace all existing imports with dynamic imports using this functionality.	2024-04-29 15:34:03 -04:00
Erick Friis	854ae3e1de	mistralai: release 0.1.5, allow client passing in (#21034 )	2024-04-29 17:14:26 +00:00
chyroc	3e241956d3	community[minor]: add coze chat model (#20770 ) add coze chat model, to call coze.com apis	2024-04-29 12:26:16 -04:00
Eugene Yurtsev	29493bb598	cli[minor]: improve confirmation message with more details (#21027 ) Improve confirmation message with more details	2024-04-29 12:20:42 -04:00
Eugene Yurtsev	aab78a37f3	cli[patch]: Ignore imports that change the name of the class (#21026 ) Not currently handeled by migration script	2024-04-29 12:20:30 -04:00
Massimiliano Pronesti	ce89b34fc0	community[patch]: support hybrid search with threshold in Azure AI Search Retriever (#20907 ) Support hybrid search with a score threshold -- similar to what we do for similarity search.	2024-04-29 12:11:44 -04:00
Andrei Panferov	b3efa38cc0	community[patch]: GigaChat model selection fix (#20988 ) Fixed the error that the model name is never actually put into GigaChat request payload, always defaulting to `GigaChat-Lite`. With this fix, model selection through ```python import os from langchain.chat_models.gigachat import GigaChat chat = GigaChat( name="GigaChat-Pro", # <- HERE!!!!! ... ) ``` should actually work, as intended in [here](`804390ba4b/libs/community/langchain_community/llms/gigachat.py (L36)`). --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-29 16:08:26 +00:00
Patrick McFadin	3331865f6b	community[minor]: add Cassandra Database Toolkit (#20246 ) Description: ToolKit and Tools for accessing data in a Cassandra Database primarily for Agent integration. Initially, this includes the following tools: - `cassandra_db_schema` Gathers all schema information for the connected database or a specific schema. Critical for the agent when determining actions. - `cassandra_db_select_table_data` Selects data from a specific keyspace and table. The agent can pass paramaters for a predicate and limits on the number of returned records. - `cassandra_db_query` Expiriemental alternative to `cassandra_db_select_table_data` which takes a query string completely formed by the agent instead of parameters. May be removed in future versions. Includes unit test and two notebooks to demonstrate usage. Dependencies: cassio Twitter handle: @PatrickMcFadin --------- Co-authored-by: Phil Miesle <phil.miesle@datastax.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 15:51:43 +00:00
Igor Brai	b3e74f2b98	community[minor]: add mojeek search util (#20922 ) Description: This pull request introduces a new feature to community tools, enhancing its search capabilities by integrating the Mojeek search engine Dependencies: None --------- Co-authored-by: Igor Brai <igor@mojeek.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-29 15:49:53 +00:00
hmn falahi	4822beb298	Ignore self/cls from required args of class functions in convert_to_openai_tool (#20691 ) Removed redundant self/cls from required args of class functions in _get_python_function_required_args: ```python class MemberTool: def search_member( self, keyword: str, args, *kwargs, ): """Search on members with any keyword like first_name, last_name, email Args: keyword: Any keyword of member """ headers = dict(authorization=kwargs['token']) members = [] try: members = request_( method='SEARCH', url=f'{service_url}/apiv1/members', headers=headers, json=dict(query=keyword), ) except Exception as e: logger.info(e.__doc__) return members convert_to_openai_tool(MemberTool.search_member) ``` expected result: ``` {'type': 'function', 'function': {'name': 'search_member', 'description': 'Search on members with any keyword like first_name, last_name, username, email', 'parameters': {'type': 'object', 'properties': {'keyword': {'type': 'string', 'description': 'Any keyword of member'}}, 'required': ['keyword']}}} ``` #20685 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 11:46:26 -04:00
Eugene Yurtsev	4f4ee8e2cf	cli[patch]: Update migrations file manually (#21021 ) We need to replace occurrences in the code of RunnableMap not just the import, so for now, we don't replace RunnableMap.	2024-04-29 10:53:31 -04:00
Tomaz Bratanic	67428c4052	community[patch]: Neo4j enhanced schema (#20983 ) Scan the database for example values and provide them to an LLM for better inference of Text2cypher	2024-04-29 10:45:55 -04:00
aditya thomas	8b59bddc03	anthropic[patch]: add tests for secret_str for api key (#20986 ) Description: Add tests to check API keys are masked Issue: Resolves https://github.com/langchain-ai/langchain/issues/12165 for Anthropic models Dependencies: None	2024-04-29 10:39:14 -04:00
Pengcheng Liu	1fad39be1c	community[minor]: Add LarkSuite wiki document loader. (#21016 ) Description: Add LarkSuite wiki document loader. Refer to [LarkSuite api document ](https://open.feishu.cn/document/server-docs/docs/wiki-v2/space-node/list)for details. Issue: None Dependencies: None Twitter handle: None	2024-04-29 10:37:50 -04:00
Leonid Ganeline	dc7c06bc07	community[minor]: import fix (#20995 ) Issue: When the third-party package is not installed, whenever we need to `pip install <package>` the ImportError is raised. But sometimes, the `ValueError` or `ModuleNotFoundError` is raised. It is bad for consistency. Change: replaced the `ValueError` or `ModuleNotFoundError` with `ImportError` when we raise an error with the `pip install <package>` message. Note: Ideally, we replace all `try: import... except... raise ... `with helper functions like `import_aim` or just use the existing [langchain_core.utils.utils.guard_import](https://api.python.langchain.com/en/latest/utils/langchain_core.utils.utils.guard_import.html#langchain_core.utils.utils.guard_import) But it would be much bigger refactoring. @baskaryan Please, advice on this.	2024-04-29 10:32:50 -04:00
Karim Lalani	2ddac9a7c3	experimental[minor]: Add bind_tools and with_structured_output functions to OllamaFunctions (#20881 ) Implemented bind_tools for OllamaFunctions. Made OllamaFunctions sub class of ChatOllama. Implemented with_structured_output for OllamaFunctions. integration unit test has been updated. notebook has been updated. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 14:13:33 +00:00
Eugene Yurtsev	d781560722	cli[minor]: Add ipynb support, add text_splitters (#20963 )	2024-04-29 10:11:21 -04:00
WilliamEspegren	804390ba4b	community: Spider integration (#20937 ) Added the [Spider.cloud](https://spider.cloud) document loader. [Spider](https://github.com/spider-rs/spider) is the [fastest](https://github.com/spider-rs/spider/blob/main/benches/BENCHMARKS.md) and cheapest crawler that returns LLM-ready data. ``` - Description: Adds Spider data loader - Dependencies: spider-client - Twitter handle: @WilliamEspegren ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: = <=> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-27 21:45:03 +00:00
ccurme	9ec7151317	fireworks: fix integration tests (#20973 )	2024-04-27 19:49:46 +00:00
William FH	9fa9f05e5d	Catch System Error in ast parse (#20961 ) I can't seem to reproduce, but i got this: ``` SystemError: AST constructor recursion depth mismatch (before=102, after=37) ``` And the operation isn't critical for the actual forward pass so seems preferable to expand our caught exceptions	2024-04-26 19:31:55 -07:00
YH	2aca7fcdcf	core[patch]: Enhance link extraction with query parameters (#20259 ) Description: This update enhances the `extract_sub_links` function within the `langchain_core/utils/html.py` module to include query parameters in the extracted URLs. Issue: N/A Dependencies: No additional dependencies required for this change. Twitter handle: N/A Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 02:22:36 +00:00
Chip Davis	e818c75f8a	infra: test directory loader multithreaded (#20281 ) This is a unit test for #20230 which was a fix for using multithreaded mode with directory loader @eyurtsev	2024-04-26 19:16:47 -07:00
Guilherme Zanotelli	f931a9ce60	community[patch]: Pass kwargs to SPARQLStore from RdfGraph (#20385 ) This introduces `store_kwargs` which behaves similarly to `graph_kwargs` on the `RdfGraph` object, which will enable users to pass `headers` and other arguments to the underlying `SPARQLStore` object. I have also made a [PR in `rdflib` to support passing `default_graph`](https://github.com/RDFLib/rdflib/pull/2761). Example usage: ```python from langchain_community.graphs import RdfGraph graph = RdfGraph( query_endpoint="http://localhost/sparql", standard="rdf", store_kwargs=dict( default_graph="http://example.com/mygraph" ) ) ``` <!--If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.--> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:38:29 +00:00
Jorge Piedrahita Ortiz	40b2e2916b	community[minor]: Sambanova llm integration (#20955 ) - Description: Added [Sambanova systems](https://sambanova.ai/) integration, including sambaverse and sambastudio LLMs - Dependencies: sseclient-py (optional) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:05:13 +00:00
Rahul Triptahi	955cf186d2	community[patch]: Ingest source, owner and full_path if present in Document's metadata. (#20949 ) Description: The PebbloSafeLoader should first check for owner, full_path and size in metadata before implementing its own logic. Dependencies: None Documentation: NA. Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-26 17:50:57 -07:00
Amine Djeghri	790ea75cf7	community[minor]: add exllamav2 library for GPTQ & EXL2 models (#17817 ) Added 3 files : - Library : ExLlamaV2 - Test integration - Notebook --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 00:44:43 +00:00
Naveen Tatikonda	8bbdb4f6a0	community[patch]: Add OpenSearch as semantic cache (#20254 ) ### Description Use OpenSearch vector store as Semantic Cache. ### Twitter Handle @OpenSearchProj --------- Signed-off-by: Naveen Tatikonda <navtat@amazon.com> Co-authored-by: Harish Tatikonda <harishtatikonda@Harishs-MacBook-Air.local> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-31-155.ec2.internal> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 00:20:24 +00:00
Mayank Solanki	8c085fc697	community[patch]: Added a function `from_existing_collection` in `Qdrant` vector database. (#20779 ) Issue: #20514 The current implementation of `construct_instance` expects a `texts: List[str]` that will call the embedding function. This might not be needed when we already have a client with collection and `path, you don't want to add any text. This PR adds a class method that returns a qdrant instance with an existing client. Here everytime `cb6e5e56c2/libs/community/langchain_community/vectorstores/qdrant.py (L1592)` `construct_instance` is called, this line sends some text for embedding generation. --------- Co-authored-by: Anush <anushshetty90@gmail.com>	2024-04-26 15:34:09 -07:00
Leonid Kuligin	893a924b90	core[minor], community[patch], langchain[patch]: move BaseChatLoader to core (#19607 ) Thank you for contributing to LangChain! - [ ] PR title: "core: move BaseChatLoader and BaseToolkit from community" - [ ] PR message: move BaseChatLoader and BaseToolkit --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-26 21:45:51 +00:00
Erick Friis	d4befd0cfb	core: fix batch ordering test (#20952 )	2024-04-26 21:17:26 +00:00
Eugene Yurtsev	8ed150b2fe	cli[minor]: Fix bug to account for name changes (#20948 ) * Fix bug to account for name changes / aliases * Generate migration list from langchain to langchain_core	2024-04-26 15:45:11 -04:00
Eugene Yurtsev	2fa0ff1a2d	cli[minor]: update code to generate migrations from langchain to community (#20946 ) Updates code that generates migrations from langchain to community	2024-04-26 15:11:32 -04:00
ccurme	bf16cefd18	langchain: deprecate create_structured_output_runnable (#20933 )	2024-04-26 14:00:40 -04:00
Erick Friis	38eccab3ae	upstage: release 0.1.3 (#20941 )	2024-04-26 10:36:11 -07:00
Sean	e1c2e2fdfa	upstage: Upstage Groundedness Check parameter update (#20914 ) * Groundedness Check takes `str` or `list[Document]` as input. * Deprecate `GroundednessCheck` due to its naming. * Added `UpstageGroundednessCheck`. * Hotfix for Groundedness Check parameter. The name `query` was misleading and it should be `answer` instead. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-26 17:34:05 +00:00
ccurme	84b8e67c9c	mistral: release 0.1.4 (#20940 )	2024-04-26 13:06:02 -04:00
ccurme	465fbaa30b	openai: release 0.1.4 (#20939 )	2024-04-26 09:56:49 -07:00
Eugene Yurtsev	12c906f6ce	cli[minor]: Improve partner migrations (#20938 ) This auto generates partner migrations. At the moment the migration is from community -> partner. So one would need to run the migration script twice to go from langchain to partner.	2024-04-26 12:30:15 -04:00
Eugene Yurtsev	5653f36adc	cli[minor]: Add script to generate migrations for partner packages (#20932 ) Add script to help generate migrations. This works well for partner packages. Migrations are generated based on run time rather than static analysis (much simpler to get the correct migrations implemented). The script for generating migrations from langchain to community still needs work.	2024-04-26 11:17:20 -04:00
ccurme	fe1304afc4	openai: add unit test (#20931 ) Test a helper function that was added earlier.	2024-04-26 15:02:19 +00:00
Eugene Yurtsev	6598757037	cli[minor]: Add first version of migrate (#20902 ) Adds a first version of the migrate script.	2024-04-26 10:50:21 -04:00
Lei Zhang	9281841cfe	community[patch]: fix integrated test case test_recursive_url_loader.py assertions (issue-20919) (#20920 ) Description: Fix integrated test case test_recursive_url_loader.py Local testing successful ```shell (venv) lei@LeideMacBook-Pro community % poetry run pytest tests/integration_tests/document_loaders/test_recursive_url_loader.py ================================================================================ test session starts ================================================================================ platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.4.0 -- /Users/zhanglei/Work/github/langchain/venv/bin/python cachedir: .pytest_cache rootdir: /Users/zhanglei/Work/github/langchain/libs/community configfile: pyproject.toml plugins: syrupy-4.6.1, asyncio-0.20.3, cov-4.1.0, vcr-1.0.2, mock-3.12.0, anyio-3.7.1, dotenv-0.5.2, requests-mock-1.11.0, socket-0.6.0 asyncio: mode=Mode.AUTO collected 6 items tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader PASSED [ 16%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic PASSED [ 33%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader FAILED [ 50%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent PASSED [ 66%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_loading_invalid_url PASSED [ 83%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties PASSED [100%] ===================================================================================== FAILURES ====================================================================================== __________________________________________________________________________ test_sync_recursive_url_loader ___________________________________________________________________________ def test_sync_recursive_url_loader() -> None: url = "https://docs.python.org/3.9/" loader = RecursiveUrlLoader( url, extractor=lambda _: "placeholder", use_async=False, max_depth=2 ) docs = loader.load() > assert len(docs) == 23 E AssertionError: assert 24 == 23 E + where 24 = len([Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/', 'content_type': 'text/html', 'title': '3.9.18 Documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/py-modindex.html', 'content_type': 'text/html', 'title': 'Python Module Index — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/download.html', 'content_type': 'text/html', 'title': 'Download — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/howto/index.html', 'content_type': 'text/html', 'title': 'Python HOWTOs — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/whatsnew/index.html', 'content_type': 'text/html', 'title': 'Whatâ\x80\x99s New in Python — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/c-api/index.html', 'content_type': 'text/html', 'title': 'Python/C API Reference Manual — Python 3.9.18 documentation', 'language': None}), ...]) tests/integration_tests/document_loaders/test_recursive_url_loader.py:38: AssertionError ================================================================================= warnings summary ================================================================================== tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties /Users/zhanglei/.pyenv/versions/3.11.4/lib/python3.11/html/parser.py:170: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument `features="xml"` into the BeautifulSoup constructor. k = self.parse_starttag(i) -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html ================================================================================ slowest 5 durations ================================================================================ 56.75s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic 38.99s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader 31.20s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties 30.37s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent 15.44s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader ============================================================================== short test summary info ============================================================================== FAILED tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader - AssertionError: assert 24 == 23 ================================================================ 1 failed, 5 passed, 5 warnings in 172.97s (0:02:52) ================================================================ (venv) zhanglei@LeideMacBook-Pro community % poetry run pytest tests/integration_tests/document_loaders/test_recursive_url_loader.py ================================================================================ test session starts ================================================================================ platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.4.0 -- /Users/zhanglei/Work/github/langchain/venv/bin/python cachedir: .pytest_cache rootdir: /Users/zhanglei/Work/github/langchain/libs/community configfile: pyproject.toml plugins: syrupy-4.6.1, asyncio-0.20.3, cov-4.1.0, vcr-1.0.2, mock-3.12.0, anyio-3.7.1, dotenv-0.5.2, requests-mock-1.11.0, socket-0.6.0 asyncio: mode=Mode.AUTO collected 6 items tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader PASSED [ 16%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic PASSED [ 33%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader PASSED [ 50%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent PASSED [ 66%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_loading_invalid_url PASSED [ 83%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties PASSED [100%] ================================================================================= warnings summary ================================================================================== tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties /Users/zhanglei/.pyenv/versions/3.11.4/lib/python3.11/html/parser.py:170: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument `features="xml"` into the BeautifulSoup constructor. k = self.parse_starttag(i) -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html ================================================================================ slowest 5 durations ================================================================================ 46.99s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic 32.43s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader 31.23s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent 30.75s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties 15.89s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader ===================================================================== 6 passed, 5 warnings in 157.42s (0:02:37) ===================================================================== (venv) lei@LeideMacBook-Pro community % ``` Issue: https://github.com/langchain-ai/langchain/issues/20919 Twitter handle: @coolbeevip	2024-04-26 10:00:08 -04:00
ccurme	7d8d0229fa	remove placeholder error message (#20340 )	2024-04-26 13:48:48 +00:00
William FH	4c437ebb9c	Use lstv2 (#20747 )	2024-04-25 16:51:42 -07:00
ccurme	891ae37437	langchain: support PineconeVectorStore in self query retriever (#20905 ) `langchain_pinecone.Pinecone` is deprecated in favor of `PineconeVectorStore`, and is currently a subclass of `PineconeVectorStore`. ```python @deprecated(since="0.0.3", removal="0.2.0", alternative="PineconeVectorStore") class Pinecone(PineconeVectorStore): """Deprecated. Use PineconeVectorStore instead.""" pass ```	2024-04-25 20:54:58 +00:00
Matt	28df4750ef	community[patch]: Add initial tests for AzureSearch vector store (#17663 ) Description: AzureSearch vector store has no tests. This PR adds initial tests to validate the code can be imported and used. Issue: N/A Dependencies: azure-search-documents and azure-identity are added as optional dependencies for testing --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 20:42:01 +00:00
Dristy Srivastava	5f1d1666e3	community[patch]: Add support for pebblo server and client version (#20269 ) Description: _PebbloSafeLoader_: Add support for pebblo server and client version Documentation: NA Unit test: NA Issue: NA Dependencies: None --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 20:39:17 +00:00
am-kinetica	b54b19ba1c	community[minor]: Implemented Kinetica Document Loader and added notebooks (#20002 ) - [ ] Kinetica Document Loader: "community: a class to load Documents from Kinetica" - [ ] Kinetica Document Loader: - Description: implemented KineticaLoader in `kinetica_loader.py` - Dependencies: install the Kinetica API using `pip install gpudb==7.2.0.1 `	2024-04-25 13:39:00 -07:00
Michael Schock	5e60d65917	experimental[patch]: return from HuggingGPT task executor task.run() exception (#20219 ) Description: Fixes a bug in the HuggingGPT task execution logic here: except Exception as e: self.status = "failed" self.message = str(e) self.status = "completed" self.save_product() where a caught exception effectively just sets `self.message` and can then throw an exception if, e.g., `self.product` is not defined. Issue: None that I'm aware of. Dependencies: None Twitter handle: https://twitter.com/michaeljschock Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 20:16:39 +00:00
Anish Chakraborty	898362de81	core[patch]: improve comma separated list output parser to handle non-space separated list (#20434 ) - Description: Changes `lanchain_core.output_parsers.CommaSeparatedListOutputParser` to handle `,` as a delimiter alongside the previous implementation which used `, ` as delimiter. - Issue: Started noticing that some results returned by LLMs were not getting parsed correctly when the output contained `,` instead of `, `. - Dependencies: No - Twitter handle: not active on twitter. <!--- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. -->	2024-04-25 20:10:56 +00:00
Michael Schock	63a07f52df	experimental[patch]: remove \n from AutoGPT feedback_tool exit check (#20132 )	2024-04-25 20:10:33 +00:00
Shengsheng Huang	fd1061e7bf	community[patch]: add more data types support to ipex-llm llm integration (#20833 ) - Description: - add support for more data types: by default `IpexLLM` will load the model in int4 format. This PR adds more data types support such as `sym_in5`, `sym_int8`, etc. Data formats like NF3, NF4, FP4 and FP8 are only supported on GPU and will be added in future PR. - Fix a small issue in saving/loading, update api docs - Dependencies: `ipex-llm` library - Document: In `docs/docs/integrations/llms/ipex_llm.ipynb`, added instructions for saving/loading low-bit model. - Tests: added new test cases to `libs/community/tests/integration_tests/llms/test_ipex_llm.py`, added config params. - Contribution maintainer: @shane-huang	2024-04-25 12:58:18 -07:00
Rahul Triptahi	dc921f0823	community[patch]: Add semantic info to metadata, classified by pebblo-server. (#20468 ) Description: Add support for Semantic topics and entities. Classification done by pebblo-server is not used to enhance metadata of Documents loaded by document loaders. Dependencies: None Documentation: Updated. Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-25 12:55:33 -07:00
Eugene Yurtsev	a5028b6356	cli[minor]: Add __version__ (#20903 ) Add __version__ to cli	2024-04-25 15:51:33 -04:00
Jingpan Xiong	1202017c56	community[minor]: Add relyt vector database (#20316 ) Co-authored-by: kaka <kaka@zbyte-inc.cloud> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: jingsi <jingsi@leadincloud.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 19:49:29 +00:00
davidefantiniIntel	f386f71bb3	community: fix tqdm import (#20263 ) Description: Fix tqdm import in QuantizedBiEncoderEmbeddings	2024-04-25 19:44:53 +00:00
Andres Algaba	05ae8ca7d4	community[patch]: deprecate persist method in Chroma (#20855 ) Thank you for contributing to LangChain! - [x] PR title - [x] PR message: - Description: Deprecate persist method in Chroma no longer exists in Chroma 0.4.x - Issue: #20851 - Dependencies: None - Twitter handle: AndresAlgaba1 - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 19:42:03 +00:00
ccurme	fdabd3cdf5	mistral, openai: support custom tokenizers in chat models (#20901 )	2024-04-25 15:23:29 -04:00
ccurme	b8db73233c	core, community: deprecate tool.__call__ (#20900 ) Does not update docs.	2024-04-25 14:50:39 -04:00
Tomaz Bratanic	520972fd0f	community[patch]: Support passing graph object to Neo4j integrations (#20876 ) For driver connection reusage, we introduce passing the graph object to neo4j integrations	2024-04-25 11:30:22 -07:00
Lei Zhang	748a6ae609	community[patch]: add HTTP response headers Content-Type to metadata of RecursiveUrlLoader document (#20875 ) Description: The RecursiveUrlLoader loader offers a link_regex parameter that can filter out URLs. However, this filtering capability is limited, and if the internal links of the website change, unexpected resources may be loaded. These resources, such as font files, can cause problems in subsequent embedding processing. > https://blog.langchain.dev/assets/fonts/source-sans-pro-v21-latin-ext_latin-regular.woff2?v=0312715cbf We can add the Content-Type in the HTTP response headers to the document metadata so developers can choose which resources to use. This allows developers to make their own choices. For example, the following may be a good choice for text knowledge. - text/plain - simple text file - text/html - HTML web page - text/xml - XML format file - text/json - JSON format data - application/pdf - PDF file - application/msword - Word document and ignore the following - text/css - CSS stylesheet - text/javascript - JavaScript script - application/octet-stream - binary data - image/jpeg - JPEG image - image/png - PNG image - image/gif - GIF image - image/svg+xml - SVG image - audio/mpeg - MPEG audio files - video/mp4 - MP4 video file - application/font-woff - WOFF font file - application/font-ttf - TTF font file - application/zip - ZIP compressed file - application/octet-stream - binary data Twitter handle: @coolbeevip --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 11:29:41 -07:00
Erick Friis	eca3640af7	upstage: release 0.1.2 (#20898 )	2024-04-25 10:41:19 -07:00
Joan Fontanals	baefbfb14e	community[mionr]: add Jina Reranker in retrievers module (#19406 ) - Description: Adapt JinaEmbeddings to run with the new Jina AI Rerank API - Twitter handle: https://twitter.com/JinaAI_ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 10:27:10 -07:00
Erick Friis	92969d49cb	multiple: remove external repo mds (#20896 ) api docs build doesn't tolerate them	2024-04-25 17:18:29 +00:00
Jason_Chen	53bb7dbd29	community[patch]: add BeautifulSoupTransformer remove_unwanted_classnames method (#20467 ) Add the remove_unwanted_classnames method to the BeautifulSoupTransformer class, which can filter more effectively. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 17:04:04 +00:00
YISH	ed26149a29	openai[patch]: Allow disablling safe_len_embeddings(OpenAIEmbeddings) (#19743 ) OpenAI API compatible server may not support `safe_len_embedding`， use `disable_safe_len_embeddings=True` to disable it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 09:45:52 -07:00
Bagatur	5b83130855	core[minor], langchain[patch], community[patch]: mv StructuredQuery (#20849 ) mv StructuredQuery to core	2024-04-25 09:40:26 -07:00
Sean	540f384197	partner: Upstage quick documentation update (#20869 ) * Updating the provider docs page. The RAG example was meant to be moved to cookbook, but was merged by mistake. * Fix bug in Groundedness Check --------- Co-authored-by: JuHyung-Son <sonju0427@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-25 16:36:54 +00:00
Bagatur	ffad3985a1	core[patch]: Release 0.1.46 (#20891 )	2024-04-25 15:40:17 +00:00
Mish Ushakov	6ccecf2363	community[minor]: added Browserbase loader (#20478 )	2024-04-25 01:11:03 +00:00
Erick Friis	5da9dd1195	mistral: comment batching param (#20868 ) Addresses #20523	2024-04-25 00:38:21 +00:00
Ivaylo Bratoev	7c5063ef60	infra: fix how Poetry is installed in the dev container (#20521 ) Currently, when a new dev container is created, poetry does not work in it with the error "No module named 'rapidfuzz'". Install Poetry outside the project venv so that poetry and project dependencies do not get mixed. Use pipx to install poetry securely in its own isolated environment. Issue: #12237 Twitter handle: https://twitter.com/ibratoev Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 17:33:25 -07:00
GustavoSept	c2d09a5186	experimental[patch]: Makes regex customizable in text_splitter.py (SemanticChunker class) (#20485 ) - Description: Currently, the regex is static (`r"(?<=[.?!])\s+"`), which is only useful for certain use cases. The current change only moves this to be a parameter of split_text(). Which adds flexibility without making it more complex (as the default regex is still the same). - Issue: Not applicable (I searched, no one seems to have created this issue yet). - Dependencies: None. _If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17._ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 00:32:40 +00:00
William FH	a936f696a6	[Core] Feat: update config CVar in tool.invoke (#20808 )	2024-04-24 17:17:21 -07:00
Lei Zhang	2cd907ad7e	text-splitters[patch]: fix MarkdownHeaderTextSplitter fails to parse headers with non-printable characters (#20645 ) Description: MarkdownHeaderTextSplitter Fails to Parse Headers with non-printable characters. more #20643 The following is the official test case. Just replacing `# Foo\n\n` with `\ufeff# Foo\n\n` will cause the test case to fail. chunk metadata is empty ```python def test_md_header_text_splitter_1() -> None: """Test markdown splitter by header: Case 1.""" markdown_document = ( "\ufeff# Foo\n\n" " ## Bar\n\n" "Hi this is Jim\n\n" "Hi this is Joe\n\n" " ## Baz\n\n" " Hi this is Molly" ) headers_to_split_on = [ ("#", "Header 1"), ("##", "Header 2"), ] markdown_splitter = MarkdownHeaderTextSplitter( headers_to_split_on=headers_to_split_on, ) output = markdown_splitter.split_text(markdown_document) expected_output = [ Document( page_content="Hi this is Jim \nHi this is Joe", metadata={"Header 1": "Foo", "Header 2": "Bar"}, ), Document( page_content="Hi this is Molly", metadata={"Header 1": "Foo", "Header 2": "Baz"}, ), ] assert output == expected_output ``` twitter: @coolbeevip Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 00:07:42 +00:00
ccurme	481d3855dc	patch: remove usage of llm, chat model __call__ (#20788 ) - `llm(prompt)` -> `llm.invoke(prompt)` - `llm(prompt=prompt` -> `llm.invoke(prompt)` (same with `messages=`) - `llm(prompt, callbacks=callbacks)` -> `llm.invoke(prompt, config={"callbacks": callbacks})` - `llm(prompt, kwargs)` -> `llm.invoke(prompt, kwargs)`	2024-04-24 19:39:23 -04:00
Raghav Dixit	9b7fb381a4	community[patch]: LanceDB integration patch update (#20686 ) Description : - added functionalities - delete, index creation, using existing connection object etc. - updated usage - Added LaceDB cloud OSS support make lint_diff , make test checks done	2024-04-24 16:27:43 -07:00
Nikita Pokidyshev	9e983c9500	langchain[patch]: fix agent_token_buffer_memory not working with openai tools (#20708 ) - Description: fix a bug in the agent_token_buffer_memory - Issue: agent_token_buffer_memory was not working with openai tools - Dependencies: None - Twitter handle: @pokidyshef	2024-04-24 15:51:58 -07:00
Erick Friis	1aef8116de	upstage: release 0.1.1 (#20864 )	2024-04-24 15:18:30 -07:00
junkeon	c8fd51e8c8	upstage: Add Upstage partner package LA and GC (#20651 ) --------- Co-authored-by: Sean <chosh0615@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Sean Cho <sean@upstage.ai>	2024-04-24 15:17:20 -07:00
Alex Lee	243ba71b28	langchain[patch]: add `aprep_output` method to `langchain/chains/base.py` (#20748 ) ## Description Add `aprep_output` method to `langchain/chains/base.py`. Some downstream `ChatMessageHistory` objects that use async connections require an async way to append to the context. It turned out that `ainvoke()` was calling `prep_output` which is synchronous. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 22:16:25 +00:00
Harrison Chase	43c041cda5	support messages in messages out (#20862 )	2024-04-24 14:58:58 -07:00
back2nix	a1614b88ac	groq[patch]: groq proxy support (#20758 ) # Proxy Fix for Groq Class 🐛 🚀 ## Description This PR fixes a bug related to proxy settings in the `Groq` class, allowing users to connect to LangChain services via a proxy. ## Changes Made - ✅ FIX support for specifying proxy settings in the `Groq` class. - ✅ Resolved the bug causing issues with proxy settings. - ❌ Did not include unit tests and documentation updates. - ❌ Did not run make format, make lint, and make test to ensure code quality and functionality because I couldn't get it to run, so I don't program in Python and couldn't run `ruff`. - ❔ Ensured that the changes are backwards compatible. - ✅ No additional dependencies were added to `pyproject.toml`. ### Error Before Fix ```python Traceback (most recent call last): File "/home/bg/Documents/code/github.com/back2nix/test/groq/main.py", line 9, in <module> chat = ChatGroq( ^^^^^^^^^ File "/home/bg/Documents/code/github.com/back2nix/test/groq/venv310/lib/python3.11/site-packages/langchain_core/load/serializable.py", line 120, in __init__ super().__init__(**kwargs) File "/home/bg/Documents/code/github.com/back2nix/test/groq/venv310/lib/python3.11/site-packages/pydantic/v1/main.py", line 341, in __init__ raise validation_error pydantic.v1.error_wrappers.ValidationError: 1 validation error for ChatGroq __root__ Invalid `http_client` argument; Expected an instance of `httpx.AsyncClient` but got <class 'httpx.Client'> (type=type_error) ``` ### Example usage after fix ```python3 import os import httpx from langchain_core.prompts import ChatPromptTemplate from langchain_groq import ChatGroq chat = ChatGroq( temperature=0, groq_api_key=os.environ.get("GROQ_API_KEY"), model_name="mixtral-8x7b-32768", http_client=httpx.Client( proxies="socks5://127.0.0.1:1080", transport=httpx.HTTPTransport(local_address="0.0.0.0"), ), http_async_client=httpx.AsyncClient( proxies="socks5://127.0.0.1:1080", transport=httpx.HTTPTransport(local_address="0.0.0.0"), ), ) system = "You are a helpful assistant." human = "{text}" prompt = ChatPromptTemplate.from_messages([("system", system), ("human", human)]) chain = prompt \| chat out = chain.invoke({"text": "Explain the importance of low latency LLMs"}) print(out) ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:58:03 +00:00
volodymyr-memsql	493afe4d8d	community[patch]: add hybrid search to singlestoredb vectorstore (#20793 ) Implemented the ability to enable full-text search within the SingleStore vector store, offering users a versatile range of search strategies. This enhancement allows users to seamlessly combine full-text search with vector search, enabling the following search strategies: * Search solely by vector similarity. * Conduct searches exclusively based on text similarity, utilizing Lucene internally. * Filter search results by text similarity score, with the option to specify a threshold, followed by a search based on vector similarity. * Filter results by vector similarity score before conducting a search based on text similarity. * Perform searches using a weighted sum of vector and text similarity scores. Additionally, integration tests have been added to comprehensively cover all scenarios. Updated notebook with examples. CC: @baskaryan, @hwchase17 --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:34:50 +00:00
Tomaz Bratanic	9efab3ed66	community[patch]: Add driver config param for neo4j graph (#20772 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:14:41 +00:00
Leonid Ganeline	13751c3297	community: `tigergraph` fixes (#20034 ) - added guard on the `pyTigerGraph` import - added a missed example page in the `docs/integrations/graphs/` - formatted the `docs/integrations/providers/` page to the consistent format. Added links.	2024-04-24 16:49:21 -04:00
Martin Kolb	0186e4e633	community[patch]: Advanced filtering for HANA Cloud Vector Engine (#20821 ) - Description: This PR adds support for advanced filtering to the integration of HANA Vector Engine. The newly supported filtering operators are: $eq, $ne, $gt, $gte, $lt, $lte, $between, $in, $nin, $like, $and, $or - Issue: N/A - Dependencies: no new dependencies added Added integration tests to: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Description of the new capabilities in notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb`	2024-04-24 13:47:27 -07:00
Alex Sherstinsky	12e5ec6de3	community: Support both Predibase SDK-v1 and SDK-v2 in Predibase-LangChain integration (#20859 )	2024-04-24 13:31:01 -07:00
Erick Friis	8c95ac3145	docs, multiple: de-beta with_structured_output (#20850 )	2024-04-24 19:34:57 +00:00
Nuno Campos	477eb1745c	Better support for subgraphs in graph viz (#20840 )	2024-04-24 12:32:52 -07:00
JeffKatzy	5ab3f9a995	community[patch]: standardize chat init args (#20844 ) Thank you for contributing to LangChain! community:perplexity[patch]: standardize init args updated pplx_api_key and request_timeout so that aliased to api_key, and timeout respectively. Added test that both continue to set the same underlying attributes. Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 12:26:05 -07:00
Massimiliano Pronesti	8d1167b32f	community[patch]: add support for similarity_score_threshold search in… (#20852 ) See https://github.com/langchain-ai/langchain/issues/20600#issuecomment-2075569338 for details. @chrislrobert	2024-04-24 19:14:33 +00:00
Eugene Yurtsev	d8aa72f51d	core[minor],langchain[patch]: Move base indexing interface and logic to core (#20667 ) This PR moves the interface and the logic to core. The following changes to namespaces: `indexes` -> `indexing` `indexes._api` -> `indexing.api` Testing code is intentionally duplicated for now since it's testing different implementations of the record manager (in-memory vs. SQL). Common logic will need to be pulled out into the test client. A follow up PR will move the SQL based implementation outside of LangChain.	2024-04-24 13:18:42 -04:00
ccurme	3bcfbcc871	groq: handle null queue_time (#20839 )	2024-04-24 09:50:09 -07:00
Eugene Yurtsev	30e48c9878	core[patch],community[patch]: Move file chat history back to community (#20834 ) Marking as patch since we haven't had releases in between. This just reverting part of a PR from yesterday.	2024-04-24 12:47:25 -04:00
ccurme	6debadaa70	groq: bump core (#20838 )	2024-04-24 11:51:46 -04:00
Erick Friis	7984206c95	groq: release 0.1.3 (#20836 ) Fixes #20811	2024-04-24 08:06:06 -07:00
Nestor Qin	9111d3a636	community[patch]: Fix message formatting for Anthropic models on Amazon Bedrock (#20801 ) Description: This PR fixes an issue in message formatting function for Anthropic models on Amazon Bedrock. Currently, LangChain BedrockChat model will crash if it uses Anthropic models and the model return a message in the following type: - `AIMessageChunk` Moreover, when use BedrockChat with for building Agent, the following message types will trigger the same issue too: - `HumanMessageChunk` - `FunctionMessage` Issue: https://github.com/langchain-ai/langchain/issues/18831 Dependencies: No. Testing: Manually tested. The following code was failing before the patch and works after. ``` @tool def square_root(x: str): "Useful when you need to calculate the square root of a number" return math.sqrt(int(x)) llm = ChatBedrock( model_id="anthropic.claude-3-sonnet-20240229-v1:0", model_kwargs={ "temperature": 0.0 }, ) prompt = ChatPromptTemplate.from_messages( [ ("system", FUNCTION_CALL_PROMPT), ("human", "Question: {user_input}"), MessagesPlaceholder(variable_name="agent_scratchpad"), ] ) tools = [square_root] tools_string = format_tool_to_anthropic_function(square_root) agent = ( RunnablePassthrough.assign( user_input=lambda x: x['user_input'], agent_scratchpad=lambda x: format_to_openai_function_messages( x["intermediate_steps"] ) ) \| prompt \| llm \| AnthropicFunctionsAgentOutputParser() ) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True, return_intermediate_steps=True) output = agent_executor.invoke({ "user_input": "What is the square root of 2?", "tools_string": tools_string, }) ``` List of messages returned from Bedrock: ``` <SystemMessage> content='You are a helpful assistant.' <HumanMessage> content='Question: What is the square root of 2?' <AIMessageChunk> content="Okay, let's calculate the square root of 2.<scratchpad>\nTo calculate the square root of a number, I can use the square_root tool:\n\n<function_calls>\n <invoke>\n <tool_name>square_root</tool_name>\n <parameters>\n <__arg1>2</__arg1>\n </parameters>\n </invoke>\n</function_calls>\n</scratchpad>\n\n<function_results>\n<search_result>\nThe square root of 2 is approximately 1.414213562373095\n</search_result>\n</function_results>\n\n<answer>\nThe square root of 2 is approximately 1.414213562373095\n</answer>" id='run-92363df7-eff6-4849-bbba-fa16a1b2988c'" <FunctionMessage> content='1.4142135623730951' name='square_root' ```	2024-04-23 22:40:39 +00:00
ccurme	06b04b80b8	groq: fix warning filter for integration test (#20806 )	2024-04-23 18:11:41 -04:00
ccurme	5a3c65a756	standard tests: add xfails (#20659 )	2024-04-23 17:14:16 -04:00
Erick Friis	ddc2274aea	standard-tests: split tool calling test (#20803 ) just making it a bit easier to grok	2024-04-23 20:59:45 +00:00
ccurme	6622829c67	mistral: catch GatedRepoError, release 0.1.3 (#20802 ) https://github.com/langchain-ai/langchain/issues/20618 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-23 20:56:42 +00:00
Eugene Yurtsev	a7c347ab35	langchain[patch]: Update evaluation logic that instantiates a default LLM (#20760 ) Favor langchain_openai over langchain_community for evaluation logic. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-23 16:09:32 -04:00
Eugene Yurtsev	72f720fa38	langchain[major]: Remove default instantations of LLMs from VectorstoreToolkit (#20794 ) Remove default instantiation from vectorstore toolkit.	2024-04-23 16:09:14 -04:00
ccurme	42de5168b1	langchain: deprecate LLMChain, RetrievalQA, and ConversationalRetrievalChain (#20751 )	2024-04-23 15:55:34 -04:00
Erick Friis	30c7951505	core: use qualname in beta message (#20361 )	2024-04-23 11:20:13 -07:00
Aliaksandr Kuzmik	5560cc448c	community[patch]: fix CometTracer bug (#20796 ) Hi! My name is Alex, I'm an SDK engineer from [Comet](https://www.comet.com/site/) This PR updates the `CometTracer` class. Fixed an issue when `CometTracer` failed while logging the data to Comet because this data is not JSON-encodable. The problem was in some of the `Run` attributes that could contain non-default types inside, now these attributes are taken not from the run instance, but from the `run.dict()` return value.	2024-04-23 13:24:41 -04:00
Eugene Yurtsev	1c89e45c14	langchain[major]: breaks some chains to remove hidden defaults (#20759 ) Breaks some chains in langchain to remove hidden chat model / llm instantiation.	2024-04-23 11:11:40 -04:00
Eugene Yurtsev	ad6b5f84e5	community[patch],core[minor]: Move in memory cache implementation to core (#20753 ) This PR moves the InMemoryCache implementation from community to core.	2024-04-23 11:10:11 -04:00
Eugene Yurtsev	a2cc9b55ba	core[patch]: Remove autoupgrade to addable dict in Runnable/RunnableLambda/RunnablePassthrough transform (#20677 ) Causes an issue for this code ```python from langchain.chat_models.openai import ChatOpenAI from langchain.output_parsers.openai_tools import JsonOutputToolsParser from langchain.schema import SystemMessage prompt = SystemMessage(content="You are a nice assistant.") + "{question}" llm = ChatOpenAI( model_kwargs={ "tools": [ { "type": "function", "function": { "name": "web_search", "description": "Searches the web for the answer to the question.", "parameters": { "type": "object", "properties": { "query": { "type": "string", "description": "The question to search for.", }, }, }, }, } ], }, streaming=True, ) parser = JsonOutputToolsParser(first_tool_only=True) llm_chain = prompt \| llm \| parser \| (lambda x: x) for chunk in llm_chain.stream({"question": "tell me more about turtles"}): print(chunk) # message = llm_chain.invoke({"question": "tell me more about turtles"}) # print(message) ``` Instead by definition, we'll assume that RunnableLambdas consume the entire stream and that if the stream isn't addable then it's the last message of the stream that's in the usable format. --- If users want to use addable dicts, they can wrap the dict in an AddableDict class. --- Likely, need to follow up with the same change for other places in the code that do the upgrade	2024-04-23 10:35:06 -04:00
Oleksandr Yaremchuk	9428923bab	experimental[minor]: upgrade the prompt injection model (#20783 ) - Description: In January, Laiyer.ai became part of ProtectAI, which means the model became owned by ProtectAI. In addition to that, yesterday, we released a new version of the model addressing issues the Langchain's community and others mentioned to us about false-positives. The new model has a better accuracy compared to the previous version, and we thought the Langchain community would benefit from using the [latest version of the model](https://huggingface.co/protectai/deberta-v3-base-prompt-injection-v2). - Issue: N/A - Dependencies: N/A - Twitter handle: @alex_yaremchuk	2024-04-23 10:23:39 -04:00
Eugene Yurtsev	645b1e142e	core[minor],langchain[patch],community[patch]: Move InMemory and File implementations of Chat History to core (#20752 ) This PR moves the implementations for chat history to core. So it's easier to determine which dependencies need to be broken / add deprecation warnings	2024-04-23 10:22:11 -04:00
ccurme	7a922f3e48	core, openai: support custom token encoders (#20762 )	2024-04-23 13:57:05 +00:00
Christophe Bornet	0ae5027d98	community[patch]: Remove usage of deprecated StoredBlobHistory in CassandraChatMessageHistory (#20666 )	2024-04-22 17:11:05 -04:00
Bagatur	eb18f4e155	infra: rm sep repo partner dirs (#20756 ) so you can `poetry run pip install -e libs/partners/*/` to your hearts content	2024-04-22 14:05:39 -07:00
Eugene Yurtsev	936c6cc74a	langchain[patch]: Add missing deprecation for openai adapters (#20668 ) Add missing deprecation for openai adapters	2024-04-22 14:05:55 -04:00
Eugene Yurtsev	38adbfdf34	community[patch],core[minor]: Move BaseToolKit to core.tools (#20669 )	2024-04-22 14:04:30 -04:00
Mark Needham	ce23f8293a	Community patch clickhouse make it possible to not specify index (#20460 ) Vector indexes in ClickHouse are experimental at the moment and can sometimes break/change behaviour. So this PR makes it possible to say that you don't want to specify an index type. Any queries against the embedding column will be brute force/linear scan, but that gives reasonable performance for small-medium dataset sizes. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-22 10:46:37 -07:00
ccurme	c010ec8b71	patch: deprecate (a)get_relevant_documents (#20477 ) - `.get_relevant_documents(query)` -> `.invoke(query)` - `.get_relevant_documents(query=query)` -> `.invoke(query)` - `.get_relevant_documents(query, callbacks=callbacks)` -> `.invoke(query, config={"callbacks": callbacks})` - `.get_relevant_documents(query, kwargs)` -> `.invoke(query, kwargs)` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-22 11:14:53 -04:00
Matheus Henrique Raymundo	bb69819267	community: Fix the stop sequence key name for Mistral in Bedrock (#20709 ) Fixing the wrong stop sequence key name that causes an error on AWS Bedrock. You can check the MistralAI bedrock parameters [here](https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-mistral.html) This change fixes this [issue](https://github.com/langchain-ai/langchain/issues/20095)	2024-04-21 20:06:06 -04:00
Bagatur	1c7b3c75a7	community[patch], experimental[patch]: support tool-calling sql and p… (#20639 ) d agents	2024-04-21 15:43:09 -07:00
Bagatur	d0cee65cdc	langchain[patch]: langchain-pinecone self query support (#20702 )	2024-04-21 15:42:39 -07:00
shumway743	cb6e5e56c2	community[minor]: add graph store implementation for apache age (#20582 ) Description: implemented GraphStore class for Apache Age graph db Dependencies: depends on psycopg2 Unit and integration tests included. Formatting and linting have been run. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-20 14:31:04 -07:00
Christophe Bornet	c909ae0152	community[minor]: Add async methods to CassandraVectorStore (#20602 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-20 02:09:58 +00:00
Leonid Ganeline	06d18c106d	langchain[patch]: `example_selector` import fix (#20676 ) Cleaned up updated imports	2024-04-19 21:42:18 -04:00
Leonid Ganeline	d6470aab60	langchain: `dosctore` import fix (#20678 ) Cleaned up imports	2024-04-19 21:41:36 -04:00
Dmitry Tyumentsev	f111efeb6e	community[patch]: YandexGPT API add ability to disable request logging (#20670 ) Closes (#20622) Added the ability to [disable logging of requests to YandexGPT](https://yandex.cloud/en/docs/foundation-models/operations/yandexgpt/disable-logging).	2024-04-19 21:40:37 -04:00
Mateusz Szewczyk	75ffe51bbe	ibm: Add support for Embedding Models (#20647 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 20:56:24 +00:00
Erick Friis	73809817ff	community: release 0.0.34 (#20672 )	2024-04-19 12:44:41 -07:00
Tomaz Bratanic	8c08cf4619	community: Add support for relationship indexes in neo4j vector (#20657 ) Neo4j has added relationship vector indexes. We can't populate them, but we can use existing indexes for retrieval	2024-04-19 11:22:42 -07:00
Erick Friis	940242c1ec	core: release 0.1.45 (#20664 )	2024-04-19 09:55:02 -07:00
Souls-R	36084e7500	docs: fix variable name typo in example code (#20658 ) This pull request corrects a mistake in the variable name within the example code. The variable doc_schema has been changed to dog_schema to fix the error.	2024-04-19 14:08:25 +00:00
Sivaudha	baedc3ec0a	langchain[minor]: Databricks vector search self query integration (#20627 ) - Enable self querying feature for databricks vector search --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-19 03:44:38 +00:00
ccurme	6d530481c1	openai: fix allowed block types (#20636 )	2024-04-18 22:12:57 -04:00
Erick Friis	5c216ad08f	upstage[patch]: un-xfail tool calling test, release 0.1.0 (#20635 )	2024-04-19 02:02:21 +00:00
Nuno Campos	48307e46a3	core[patch]: Fix runnable map ser/de (#20631 )	2024-04-18 18:52:33 -07:00
Charlie Holtz	1cbab0ebda	community: update Replicate to work with official models (#20633 ) Description: you don't need to pass a version for Replicate official models. That was broken on LangChain until now! You can now run: ``` llm = Replicate( model="meta/meta-llama-3-8b-instruct", model_kwargs={"temperature": 0.75, "max_length": 500, "top_p": 1}, ) prompt = """ User: Answer the following yes/no question by reasoning step by step. Can a dog drive a car? Assistant: """ llm(prompt) ``` I've updated the replicate.ipynb to reflect that. twitter: @charliebholtz --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 01:43:40 +00:00
Congyu	dd5139e304	community[patch]: truncate zhipuai `temperature` and `top_p` parameters to [0.01, 0.99] (#20261 ) ZhipuAI API only accepts `temperature` parameter between `(0, 1)` open interval, and if `0` is passed, it responds with status code `400`. However, 0 and 1 is often accepted by other APIs, for example, OpenAI allows `[0, 2]` for temperature closed range. This PR truncates temperature parameter passed to `[0.01, 0.99]` to improve the compatibility between langchain's ecosystem's and ZhipuAI (e.g., ragas `evaluate` often generates temperature 0, which results in a lot of 400 invalid responses). The PR also truncates `top_p` parameter since it has the same restriction. Reference: [glm-4 doc](https://open.bigmodel.cn/dev/api#glm-4) (which unfortunately is in Chinese though). --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-19 01:31:30 +00:00
Lance Martin	d5c22b80a5	community[patch]: Fix Ollama for LLaMA3 (#20624 ) We see verbose generations w/ LLaMA3 and Ollama - https://smith.langchain.com/public/88c4cd21-3d57-4229-96fe-53443398ca99/r --- Fix here implies that when stop was being set to an empty list, the stream had no conditions under which to stop, which could lead to excessive or unintended output. Test LLaMA2 - https://smith.langchain.com/public/57dfc64a-591b-46fa-a1cd-8783acaefea2/r Test LLaMA3 - https://smith.langchain.com/public/76ff5f47-ac89-4772-a7d2-5caa907d3fd6/r https://smith.langchain.com/public/a31d2fad-9094-4c93-949a-964b27630ccb/r Test Mistral - https://smith.langchain.com/public/a4fe7114-c308-4317-b9fd-6c86d31f1c5b/r --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 00:20:32 +00:00
Erick Friis	3425988de7	core: deprecation default to qualname (#20578 )	2024-04-18 15:35:17 -07:00
hulitaitai	7d0a008744	community[minor]: Add audio-parser "faster-whisper" in audio.py (#20012 ) faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is up to 4 times faster than enai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU. It can automatically detect the following 14 languages and transcribe the text into their respective languages: en, zh, fr, de, ja, ko, ru, es, th, it, pt, vi, ar, tr. The gitbub repository for faster-whisper is : https://github.com/SYSTRAN/faster-whisper --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-18 20:50:59 +00:00
Guangdong Liu	e3c2431c5b	comminuty[patch]:Fix Error in apache doris insert (#19989 ) - Issue: #19886	2024-04-18 16:34:32 -04:00
Tomaz Bratanic	27370b679e	community[patch]: Ignore null and invalid embedding values for neo4j metadata filtering (#20558 )	2024-04-18 16:15:45 -04:00
Eugene Yurtsev	718c9cbe3a	mistral[patch]: Support both model and model_name (#20557 )	2024-04-18 16:12:33 -04:00
Leonid Ganeline	95dc90609e	experimental[patch]: `prompts` import fix (#20534 ) Replaced `from langchain.prompts` with `from langchain_core.prompts` where it is appropriate. Most of the changes go to `langchain_experimental` Similar to #20348	2024-04-18 16:09:11 -04:00
Massimiliano Pronesti	2542a09abc	community[patch]: AzureSearch incorrectly converted to retriever (#20601 ) Closes #20600. Please see the issue for more details.	2024-04-18 16:06:47 -04:00
Christophe Bornet	8f0b5687a3	community[minor]: Add hybrid search to Cassandra VectorStore (#20286 ) Only supported by Astra DB at the moment. Twitter handle: cbornet_	2024-04-18 15:58:43 -04:00
Christophe Bornet	d2d01370bc	community[minor]: Add async methods to CassandraLoader (#20609 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-18 19:45:20 +00:00
Eugene Yurtsev	8c29b7bf35	mistralai[patch]: Use public attribute for eventsource.response (#20580 ) Minor change, use the public attribute instead of the protected one.	2024-04-18 14:12:12 -04:00
Erick Friis	66fb0b1f35	core: fix fireworks mapping (#20613 )	2024-04-18 18:08:40 +00:00
balloonio	e786da7774	community[patch]: Invoke callback prior to yielding token fix [HuggingFaceTextGenInference] (#20426 ) …gFaceTextGenInference) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for [HuggingFaceTextGenInference] - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in [HuggingFaceTextGenInference] - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-18 14:25:20 +00:00
Ethan Yang	2d6d796040	community: Add save_model function for openvino reranker and embedding (#19896 )	2024-04-18 10:20:33 -04:00
zR	9c1d7f2405	update zhipuai notebook (#20595 ) fix timeout issue fix zhipuai usecase notebookbook Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 10:12:12 -04:00
ccurme	c897264b9b	community: (milvus) check for num_shards (#20603 ) @rgupta2508 I believe this change is necessary following https://github.com/langchain-ai/langchain/pull/20318 because of how Milvus handles defaults: `59bf5e811a/pymilvus/client/prepare.py (L82-L85)` ```python num_shards = kwargs[next(iter(same_key))] if not isinstance(num_shards, int): msg = f"invalid num_shards type, got {type(num_shards)}, expected int" raise ParamError(message=msg) req.shards_num = num_shards ``` this way lets Milvus control the default value (instead of maintaining a separate default in Langchain). Let me know if I've got this wrong or you feel it's unnecessary. Thanks.	2024-04-18 09:44:56 -04:00
Rohit Gupta	25c4c24e89	Support to create shards_num in milvus vectorstores (#20318 ) To support number of the shards for the collection to create in milvus vvectorstores. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 08:58:00 -04:00
aditya thomas	cea379e7c7	community, core[callbacks]: move FileCallbackHandler from community to core (#20495 ) Description: Move `FileCallbackHandler` from community to core Issue: #20493 Dependencies: None (imo) `FileCallbackHandler` is a built-in LangChain callback handler like `StdOutCallbackHandler` and should properly be in in core.	2024-04-17 22:29:30 -04:00
Erick Friis	e7e94b37f1	upstage: fix core dep (#20576 )	2024-04-17 16:33:09 -07:00
Erick Friis	e395115807	docs: aws docs updates (#20571 )	2024-04-17 23:32:00 +00:00
Erick Friis	f09bd0b75b	upstage: init package (#20574 ) Co-authored-by: Sean Cho <sean@upstage.ai> Co-authored-by: JuHyung-Son <sonju0427@gmail.com>	2024-04-17 23:25:36 +00:00
Marco Perini	11c9ed3362	community[patch]: exposing headless flag parameter to AsyncChromiumLoader class (#20424 ) - Description: added the headless parameter as optional argument to the langchain_community.document_loaders AsyncChromiumLoader class - Dependencies: None - Twitter handle: @perinim_98 If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-17 16:00:28 -07:00
Bagatur	54e9271504	anthropic[patch]: fix msg mutation (#20572 )	2024-04-17 15:47:19 -07:00
Nuno Campos	719da8746e	core: fix attributeerror in runnablelambda.deps (#20569 ) - would happen when user's code tries to access attritbute that doesnt exist, we prefer to let this crash in the user's code, rather than here - also catch more cases where a runnable is invoked/streamed inside a lambda. before we weren't seeing these as deps	2024-04-17 15:38:39 -07:00
Christophe Bornet	a22da4315b	community[patch]: Replace function in CassandraVectorStore with simpler lambda (#20323 )	2024-04-17 17:13:13 -04:00
Christophe Bornet	75733c5cc1	community[minor]: Improve CassandraVectorStore from_texts (#20284 )	2024-04-17 17:12:28 -04:00
Tomer Cagan	463160c3f6	community: fix `DirectoryLoader` progress bar (#19821 ) Description: currently, the `DirectoryLoader` progress-bar maximum value is based on an incorrect number of files to process In langchain_community/document_loaders/directory.py:127: ```python paths = p.rglob(self.glob) if self.recursive else p.glob(self.glob) items = [ path for path in paths if not (self.exclude and any(path.match(glob) for glob in self.exclude)) ] ``` `paths` returns both files and directories. `items` is later used to determine the maximum value of the progress-bar which gives an incorrect progress indication.	2024-04-17 21:12:16 +00:00
Bagatur	984e7e36c2	anthropic[patch]: Release 0.1.10 (#20568 )	2024-04-17 14:05:42 -07:00
Pengcheng Liu	ecd19a9e58	community[patch]: Add function call support in Tongyi chat model. (#20119 ) - [ ] PR message: - Description: This pr adds function calling support in Tongyi chat model. - Issue: None - Dependencies: None - Twitter handle: None Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 20:42:23 +00:00
kaijietti	80679ab906	zep[patch]: implement add_messages and aadd_messages (#20099 ) This PR implement `add_messages` and `aadd_messages` to avoid unnecessary round-trips.	2024-04-17 13:40:24 -07:00
ccurme	2238490069	mistral, openai: allow anthropic-style messages in message histories (#20565 )	2024-04-17 15:55:45 -04:00
Eugene Yurtsev	7a7851aa06	anthropic[patch]: Handle empty text block (#20566 ) Handle empty text block	2024-04-17 15:37:04 -04:00
Bagatur	7917e2c418	core[patch]: Release 0.1.44 (#20564 )	2024-04-17 18:34:44 +00:00
ccurme	4a17951900	mistral: read tool calls from AIMessage (#20554 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-17 13:38:24 -04:00
Eugene Yurtsev	f257909699	mistralai[patch]: Surface http errors (#20555 ) Do not swallow errors when streaming with httpx. Update affected code if this PR gets merged to httpx: https://github.com/florimondmanca/httpx-sse/pull/25/files	2024-04-17 10:47:56 -04:00
Sevin F. Varoglu	3f156e0ece	community[minor]: add ChatOctoAI (#20059 ) This PR adds ChatOctoAI, a chat model integration for OctoAI.	2024-04-17 03:20:56 -07:00
Eun Hye Kim	b34f1086fe	community[patch]: Add streaming logic in ChatHuggingFace (#18784 ) - Add functions (_stream, _astream) - Connect to _generate and _agenerate Thank you for contributing to LangChain! - [x] PR title: "community: Add streaming logic in ChatHuggingFace" - [x] PR message: *Delete this entire checklist* and replace with - Description: Addition functions (_stream, _astream) and connection to _generate and _agenerate - Issue: #18782 - Dependencies: none - Twitter handle: @lunara_x	2024-04-16 19:17:03 -07:00
pjb157	479be3cc91	community[minor]: Unify Titan Takeoff Integrations and Adding Embedding Support (#18775 ) Community: Unify Titan Takeoff Integrations and Adding Embedding Support Description: Titan Takeoff no longer reflects this either of the integrations in the community folder. The two integrations (TitanTakeoffPro and TitanTakeoff) where causing confusion with clients, so have moved code into one place and created an alias for backwards compatibility. Added Takeoff Client python package to do the bulk of the work with the requests, this is because this package is actively updated with new versions of Takeoff. So this integration will be far more robust and will not degrade as badly over time. Issue: Fixes bugs in the old Titan integrations and unified the code with added unit test converge to avoid future problems. Dependencies: Added optional dependency takeoff-client, all imports still work without dependency including the Titan Takeoff classes but just will fail on initialisation if not pip installed takeoff-client Twitter @MeryemArik9 Thanks all :) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 01:43:35 +00:00
Rahul Triptahi	2cbfc94bcb	community[patch]: Add support for authorized identities in PebbloSafeLoader. (#20055 ) Description: Add support for authorized identities in PebbloSafeLoader. Now with this change, PebbloSafeLoader will extract authorized_identities from metadata and send it to pebblo server Dependencies: None Documentation: None Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-16 18:34:06 -07:00
Guangdong Liu	b78ede2f96	community[patch]: standardize init args (#20166 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-16 18:30:26 -07:00
Guangdong Liu	3729bec1a2	community[patch]: standardize init args (#20210 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-16 18:29:57 -07:00
sdan	a7c5e41443	community[minor]: Added VLite as VectorStore (#20245 ) Support [VLite](https://github.com/sdan/vlite) as a new VectorStore type. Description: vlite is a simple and blazing fast vector database(vdb) made with numpy. It abstracts a lot of the functionality around using a vdb in the retrieval augmented generation(RAG) pipeline such as embeddings generation, chunking, and file processing while still giving developers the functionality to change how they're made/stored. Before submitting: Added tests [here](`c09c2ebd5c/libs/community/tests/integration_tests/vectorstores/test_vlite.py`) Added ipython notebook [here](`c09c2ebd5c/docs/docs/integrations/vectorstores/vlite.ipynb`) Added simple docs on how to use [here](`c09c2ebd5c/docs/docs/integrations/providers/vlite.mdx`) Profiles Maintainers: @sdan Twitter handles: [@sdand](https://x.com/sdand) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-17 01:24:38 +00:00
Hyeongchan Kim	7824291252	community[patch]: Fix not to cast to str type when `file_path` is None (#20057 ) From `langchain_community 0.0.30`, there's a bug that cannot send a file-like object via `file` parameter instead of `file path` due to casting the `file_path` to str type even if `file_path` is None. which means that when I call the `partition_via_api()`, exactly one of `filename` and `file` must be specified by the following error message. however, from `langchain_community 0.0.30`, `file_path` is casted into `str` type even `file_path` is None in `get_elements_from_api()` and got an error at `exactly_one(filename=filename, file=file)`. here's an error message ``` ---> 51 exactly_one(filename=filename, file=file) 53 if metadata_filename and file_filename: 54 raise ValueError( 55 "Only one of metadata_filename and file_filename is specified. " 56 "metadata_filename is preferred. file_filename is marked for deprecation.", 57 ) File /opt/homebrew/lib/python3.11/site-packages/unstructured/partition/common.py:441, in exactly_one(**kwargs) 439 else: 440 message = f"{names[0]} must be specified." --> 441 raise ValueError(message) ValueError: Exactly one of filename and file must be specified. ``` So, I simply made a change that casting to str type when `file_path` is not None. I use `UnstructuredAPIFileLoader` like below. ``` from langchain_community.document_loaders.unstructured import UnstructuredAPIFileLoader documents: list = UnstructuredAPIFileLoader( file_path=None, file=file, # file-like object, io.BytesIO type mode='elements', url='http://127.0.0.1:8000/general/v0/general', content_type='application/pdf', metadata_filename='asdf.pdf', ).load_and_split() ```	2024-04-16 18:06:21 -07:00
Prashanth Rao	295b9b704b	community[patch]: Improve Kuzu Cypher generation prompt (#20481 ) - [x] PR title: "community: improve kuzu cypher generation prompt" - [x] PR message: *Delete this entire checklist* and replace with - Description: Improves the Kùzu Cypher generation prompt to be more robust to open source LLM outputs - Issue: N/A - Dependencies: N/A - Twitter handle: @kuzudb - [x] Add tests and docs: If you're adding a new integration, please include No new tests (non-breaking. change) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-04-16 18:01:36 -07:00
MacanPN	bce69ae43d	community[patch]: Changes to base_o365 and sharepoint document loaders (#20373 ) ## Description: The PR introduces 3 changes: 1. added `recursive` property to `O365BaseLoader`. (To keep the behavior unchanged, by default is set to `False`). When `recursive=True`, `_load_from_folder()` also recursively loads all nested folders. 2. added `folder_id` to SharePointLoader.(similar to (this PR)[https://github.com/langchain-ai/langchain/pull/10780] ) This provides an alternative to `folder_path` that doesn't seem to reliably work. 3. when none of `document_ids`, `folder_id`, `folder_path` is provided, the loader fetches documets from root folder. Combined with `recursive=True` this provides an easy way of loading all compatible documents from SharePoint. The PR contains the same logic as [this stale PR](https://github.com/langchain-ai/langchain/pull/10780) by @WaleedAlfaris. I'd like to ask his blessing for moving forward with this one. ## Issue: - As described in https://github.com/langchain-ai/langchain/issues/19938 and https://github.com/langchain-ai/langchain/pull/10780 the sharepoint loader often does not seem to work with folder_path. - Recursive loading of subfolders is a missing functionality ## Dependecies: None Twitter handle: @martintriska1 @WRhetoric This is my first PR here, please be gentle :-) Please review @baskaryan --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 00:36:15 +00:00
Sevin F. Varoglu	54d388d898	community[patch]: update OctoAI endpoint to subclass BaseOpenAI (#19757 ) This PR updates OctoAIEndpoint LLM to subclass BaseOpenAI as OctoAI is an OpenAI-compatible service. The documentation and tests have also been updated.	2024-04-16 17:32:20 -07:00
Benito Geordie	57b226532d	community[minor]: Added integrations for ThirdAI's NeuralDB as a Retriever (#17334 ) Description: Adds ThirdAI NeuralDB retriever integration. NeuralDB is a CPU-friendly and fine-tunable text retrieval engine. We previously added a vector store integration but we think that it will be easier for our customers if they can also find us under under langchain-community/retrievers. --------- Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com>	2024-04-16 16:36:55 -07:00
WeichenXu	e9fc87aab1	community[patch]: Make ChatDatabricks model supports streaming response (#19912 ) Description: Make ChatDatabricks model supports stream Issue: N/A Dependencies: MLflow nightly build version (we will release next MLflow version soon) Twitter handle: N/A Manually test: (Before testing, please install `pip install git+https://github.com/mlflow/mlflow.git`) ```python # Test Databricks Foundation LLM model from langchain.chat_models import ChatDatabricks chat_model = ChatDatabricks( endpoint="databricks-llama-2-70b-chat", max_tokens=500 ) from langchain_core.messages import AIMessageChunk for chunk in chat_model.stream("What is mlflow?"): print(chunk.content, end="\|") ``` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Signed-off-by: Weichen Xu <weichen.xu@databricks.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-16 23:34:49 +00:00
ccurme	a892f985d3	standardized-tests[patch]: test tool call messages (#20519 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-16 23:25:50 +00:00
Erick Friis	e7fe5f7d3f	anthropic[patch]: serialization in partner package (#18828 )	2024-04-16 16:05:58 -07:00
Bagatur	f74d5d642e	anthropic[patch]: bump to core 0.1.43 (#20537 )	2024-04-16 22:47:07 +00:00
Bagatur	96d8769eae	anthropic[patch]: release 0.1.9, use tool calls if content is empty (#20535 )	2024-04-16 15:27:29 -07:00
Erick Friis	6adca37eb7	core: default chat/llm _identifying_params to lc_attributes (#20232 )	2024-04-16 14:55:47 -07:00
ccurme	22da9f5f3f	update scheduled tests (#20526 ) repurpose scheduled tests to test over provider packages	2024-04-16 16:49:46 -04:00
Nuno Campos	806a54908c	Runnable graph viz improvements (#20529 ) - Add conditional: bool property to json representation of the graphs - Add option to generate mermaid graph stripped of styles (useful as a text representation of graph)	2024-04-16 20:17:47 +00:00
Nuno Campos	f3aa26d6bf	Fix getattr in runnable binding for cases where config is passed in as arg too (#20528 ) …s arg too Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-16 13:10:29 -07:00
Dhruv Chawla	d6d559d50d	community[minor]: add UpTrainCallbackHandler (#19956 ) - Description: This PR adds a callback handler for UpTrain. It performs evaluations in the RAG pipeline to check the quality of retrieved documents, generated queries and responses. - Dependencies: - The UpTrainCallbackHandler requires the uptrain package --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-04-16 19:32:03 +00:00
Leonid Ganeline	45d045b2c5	core[minor], langchain[patch]: `tools` dependencies refactoring (#18759 ) The `langchain.tools` [namespace](https://api.python.langchain.com/en/latest/langchain_api_reference.html#module-langchain.tools) can be completely eliminated by moving one class and 3 functions into `core`. It makes sense since the class and functions are very core.	2024-04-16 14:15:09 -04:00
Erick Friis	77eba10f47	standard-tests: fix default fixtures (#20520 )	2024-04-16 16:12:36 +00:00
Ravindu Somawansa	5acc7ba622	community[minor]: Add glue catalog loader (#20220 ) Add Glue Catalog loader	2024-04-16 11:39:23 -04:00
Dawson Bauer	aab075345e	core[patch]: Fix imports defined in messages sub-package (#20500 ) core[patch]: Fix imports defined in messages sub-package (#20500)	2024-04-16 14:19:51 +00:00
Fayfox	9fd36efdb5	anthropic[patch]: env ANTHROPIC_API_URL not work (#20507 ) enviroment variable ANTHROPIC_API_URL will not work if anthropic_api_url has default value --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-04-16 10:16:51 -04:00
Martín Gotelli Ferenaz	b48add4353	community[patch]: Fix pgvector deprecated filter clause usage with OR and AND conditions (#20446 ) Description: Support filter by OR and AND for deprecated PGVector version Issue: #20445 Dependencies: N/A Twitter handle: @martinferenaz	2024-04-16 14:08:07 +00:00
Eugene Yurtsev	c50099161b	community[patch]: Use uuid4 not uuid1 (#20487 ) Using UUID1 is incorrect since it's time dependent, which makes it easy to generate the exact same uuid	2024-04-16 09:40:44 -04:00
Bagatur	f7667c614b	docs: update tool use case (#20404 )	2024-04-16 04:27:27 +00:00
Erick Friis	86cf1d3ee1	community: release 0.0.33 (#20490 )	2024-04-16 00:30:05 +00:00
Erick Friis	90184255f8	core: release 0.1.43 (#20489 )	2024-04-15 22:48:34 +00:00
Erick Friis	7997f3b7f8	core: forward config params to default (#20402 ) nuno's fault not mine --------- Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Nuno Campos <nuno@langchain.dev>	2024-04-15 15:42:39 -07:00
Nuno Campos	97b2191e99	core: Add concept of conditional edge to graph rendering (#20480 ) - implement for mermaid, graphviz and ascii - this is to be used in langgraph	2024-04-15 13:49:06 -07:00
Ángel Igareta	60c7a17781	Remove logic to exclude intermediate nodes from rendering time (#20459 ) Description: For simplicity, migrate the logic of excluding intermediate nodes in the .get_graph() of langgraph package (https://github.com/langchain-ai/langgraph/pull/310) at graph creation time instead of graph rendering time. Note: #20381 needs to be approved first --------- Co-authored-by: Angel Igareta <angel.igareta@klarna.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Nuno Campos <nuno@boringbits.io>	2024-04-15 16:40:51 +00:00
Ángel Igareta	d55a365c6c	Fix CDN URL in mermaid graph renderer (#20381 ) Description of features on mermaid graph renderer: - Fixing CDN to use official Mermaid JS CDN: https://www.jsdelivr.com/package/npm/mermaid?tab=files - Add device_scale_factor to allow increasing quality of resulting PNG.	2024-04-15 08:01:35 -07:00
Leonid Kuligin	676c68d318	community[patch]: deprecating remaining google_community integrations (#20471 ) Deprecating remaining google community integrations	2024-04-15 09:57:12 -04:00
balloonio	b66a4f48fa	community[patch]: Invoke callback prior to yielding token fix [DeepInfra] (#20427 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for [DeepInfra] - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in [DeepInfra] - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-14 14:32:52 -04:00
Juan Carlos José Camacho	450c458f8f	community[minor]: Add Datahareld tool (#19680 ) Description: Integrate [dataherald](https://www.dataherald.com) tool, It is a natural language-to-SQL tool. Dependencies: Install dataherald sdk to use it, ``` pip install dataherald ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com>	2024-04-13 23:27:16 +00:00
Alexander Smirnov	ece008f117	docs: Refine RunnablePassthrough docstring (#19812 ) Description: This update refines the documentation for `RunnablePassthrough` by removing an unnecessary import and correcting a minor syntactical error in the example provided. This change enhances the clarity and correctness of the documentation, ensuring that users have a more accurate guide to follow. Issue: N/A Dependencies: None This PR focuses solely on documentation improvements, specifically targeting the `RunnablePassthrough` class within the `langchain_core` module. By clarifying the example provided in the docstring, users are offered a more straightforward and error-free guide to utilizing the `RunnablePassthrough` class effectively. As this is a documentation update, it does not include changes that require new integrations, tests, or modifications to dependencies. It adheres to the guidelines of minimal package interference and backward compatibility, ensuring that the overall integrity and functionality of the LangChain package remain unaffected. Thank you for considering this documentation refinement for inclusion in the LangChain project.	2024-04-13 16:23:32 -07:00
Egor Krasheninnikov	c8391d4ff1	community[patch]: Fix YandexGPT embeddings (#19720 ) Fix of YandexGPT embeddings. The current version uses a single `model_name` for queries and documents, essentially making the `embed_documents` and `embed_query` methods the same. Yandex has a different endpoint (`model_uri`) for encoding documents, see [this](https://yandex.cloud/en/docs/yandexgpt/concepts/embeddings). The bug may impact retrievers built with `YandexGPTEmbeddings` (for instance FAISS database as retriever) since they use both `embed_documents` and `embed_query`. A simple snippet to test the behaviour: ```python from langchain_community.embeddings.yandex import YandexGPTEmbeddings embeddings = YandexGPTEmbeddings() q_emb = embeddings.embed_query('hello world') doc_emb = embeddings.embed_documents(['hello world', 'hello world']) q_emb == doc_emb[0] ``` The response is `True` with the current version and `False` with the changes I made. Twitter: @egor_krash --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-13 16:23:01 -07:00
Guangdong Liu	4be7ca7b4c	community[patch]:sparkllm standardize init args (#20194 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-13 16:03:19 -07:00
Yuki Oshima	0758da8940	community[patch]: Set default value for _ListSQLDatabaseToolInput tool_input (#20409 ) Description: `_ListSQLDatabaseToolInput` raise error if model returns `{}`. For example, gpt-4-turbo returns `{}` with SQL Agent initialized by `create_sql_agent`. So, I set default value `""` for `_ListSQLDatabaseToolInput` tool_input. This is actually a gpt-4-turbo issue, not a LangChain issue, but I thought it would be helpful to set a default value `""`. This problem is discussed in detail in the following Issue. Issue: https://github.com/langchain-ai/langchain/issues/20405 Dependencies: none Sorry, I did not add or change the test code, as tests for this components was not exist . However, I have tested the following code based on the [SQL Agent Document](https://python.langchain.com/docs/use_cases/sql/agents/), to make sure it works. ``` from langchain_community.agent_toolkits.sql.base import create_sql_agent from langchain_community.utilities.sql_database import SQLDatabase from langchain_openai import ChatOpenAI db = SQLDatabase.from_uri("sqlite:///Chinook.db") llm = ChatOpenAI(model="gpt-4-turbo", temperature=0) agent_executor = create_sql_agent(llm, db=db, agent_type="openai-tools", verbose=True) result = agent_executor.invoke("List the total sales per country. Which country's customers spent the most?") print(result["output"]) ```	2024-04-13 15:58:47 -07:00
saberuster	160bcaeb93	text-splitters[minor]: Add lua code splitting (#20421 ) - Description: Complete the support for Lua code in langchain.text_splitter module. - Dependencies: No - Twitter handle: @saberuster If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-13 22:42:51 +00:00
ccurme	4b6b0a87b6	groq[patch]: Make stream robust to ToolMessage (#20417 ) ```python from langchain.agents import AgentExecutor, create_tool_calling_agent, tool from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder from langchain_groq import ChatGroq prompt = ChatPromptTemplate.from_messages( [ ("system", "You are a helpful assistant"), ("human", "{input}"), MessagesPlaceholder("agent_scratchpad"), ] ) model = ChatGroq(model_name="mixtral-8x7b-32768", temperature=0) @tool def magic_function(input: int) -> int: """Applies a magic function to an input.""" return input + 2 tools = [magic_function] agent = create_tool_calling_agent(model, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what is the value of magic_function(3)?"}) ``` ``` > Entering new AgentExecutor chain... Invoking: `magic_function` with `{'input': 3}` 5The value of magic\_function(3) is 5. > Finished chain. {'input': 'what is the value of magic_function(3)?', 'output': 'The value of magic\\_function(3) is 5.'} ```	2024-04-13 15:40:55 -07:00
ccurme	38faa74c23	community[patch]: update use of deprecated llm methods (#20393 ) .predict and .predict_messages for BaseLanguageModel and BaseChatModel	2024-04-12 17:28:23 -04:00
Corey Zumar	3a068b26f3	community[patch]: Databricks - fix scope of dangerous deserialization error in Databricks LLM connector (#20368 ) fix scope of dangerous deserialization error in Databricks LLM connector --------- Signed-off-by: dbczumar <corey.zumar@databricks.com>	2024-04-12 17:27:26 -04:00
Bagatur	f1248f8d9a	core[patch]: configurable init params (#20070 ) Proposed fix for #20061. need to test --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 21:18:43 +00:00
aditya thomas	4f75b230ed	partner[ai21]: masking of the api key for ai21 models (#20257 ) Description: Masking of the API key for AI21 models Issue: Fixes #12165 for AI21 Dependencies: None Note: This fix came in originally through #12418 but was possibly missed in the refactor to the AI21 partner package --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 20:19:31 +00:00
Leonid Ganeline	e512d3c6a6	langchain: `callbacks` imports fix (#20348 ) Replaced all `from langchain.callbacks` into `from langchain_core.callbacks` . Changes in the `langchain` and `langchain_experimental` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 20:13:14 +00:00
Erick Friis	d83b720c40	templates: readme langsmith not private beta (#20173 )	2024-04-12 13:08:10 -07:00
balloonio	e7b1a44c5b	community[patch]: Invoke callback prior to yielding token fix for Llamafile (#20365 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for Llamafile - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in community llamafile.py - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-12 19:26:12 +00:00
balloonio	93caa568f9	community[patch]: Invoke callback prior to yielding token fix for HuggingFaceEndpoint (#20366 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for HuggingFaceEndpoint - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in community HuggingFaceEndpoint - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-12 19:16:34 +00:00
Nicolas	ad04585e30	community[minor]: Firecrawl.dev integration (#20364 ) Added the [FireCrawl](https://firecrawl.dev) document loader. Firecrawl crawls and convert any website into LLM-ready data. It crawls all accessible subpages and give you clean markdown for each. - Description: Adds FireCrawl data loader - Dependencies: firecrawl-py - Twitter handle: @mendableai ccing contributors: (@ericciarla @nickscamara) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-12 19:13:48 +00:00
Tomaz Bratanic	a1b105ac00	experimental[patch]: Skip pydantic validation for llm graph transformer and fix JSON response where possible (#19915 ) LLMs might sometimes return invalid response for LLM graph transformer. Instead of failing due to pydantic validation, we skip it and manually check and optionally fix error where we can, so that more information gets extracted	2024-04-12 11:29:25 -07:00
P. Taylor Goetz	9317df7f16	community[patch]: Add "model" attribute to the payload sent to Ollama in `ChatOllama` (#20354 ) Example Ollama API calls: Request without "model": ``` curl --location 'http://localhost:11434/api/chat' \ --header 'Content-Type: application/json' \ --data '{ "messages": [ { "role": "user", "content": "What is the capitol of PA?" } ], "stream": false }' ``` Response: ``` {"error":"model is required"} ``` Request with "model": ``` curl --location 'http://localhost:11434/api/chat' \ --header 'Content-Type: application/json' \ --data '{ "model": "openchat", "messages": [ { "role": "user", "content": "What is the capitol of PA?" } ], "stream": false }' ``` Response: ``` { "eval_duration" : 733248000, "created_at" : "2024-04-11T23:04:08.735766843Z", "model" : "openchat", "message" : { "content" : " The capital city of Pennsylvania is Harrisburg.", "role" : "assistant" }, "total_duration" : 3138731168, "prompt_eval_count" : 25, "load_duration" : 466562959, "done" : true, "prompt_eval_duration" : 1938495000, "eval_count" : 10 } ```	2024-04-12 13:32:53 -04:00
Alex Sherstinsky	fad0962643	community: for Predibase -- enable both Predibase-hosted and HuggingFace-hosted fine-tuned adapter repositories (#20370 )	2024-04-12 08:32:00 -07:00
Eugene Yurtsev	6470b30173	langchain[patch]: Add deprecation warning to extraction chains (#20224 ) Add deprecation warnings to extraction chains	2024-04-12 10:24:32 -04:00
Eugene Yurtsev	b65a1d4cfd	langchain[patch]: Add another unit test for indexing code (#20387 ) Add another unit test for indexing	2024-04-12 10:19:18 -04:00
Erick Friis	29282371db	core: bind_tools interface on basechatmodel (#20360 )	2024-04-12 01:32:19 +00:00
Erick Friis	e6806a08d4	multiple: standard chat model tests (#20359 )	2024-04-11 18:23:13 -07:00

... 7 8 9 10 11 ...

4593 Commits