langchain

Commit Graph

Author	SHA1	Message	Date
Bagatur	37aec1e050	bump 326 (#12569 )	11 months ago
Eugene Yurtsev	1b1a2d5740	Image Caption accepts bytes for images (#12561 ) Accept bytes for images in image caption --------- Co-authored-by: webcoderz <19884161+webcoderz@users.noreply.github.com>	11 months ago
Nuno Campos	7897483819	Allow astream_log to be used inside atrace_as_chain_group (#12558 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Holt Skinner	e05bb938de	Merge pull request #12433 * feat: Add Google Cloud Translation document transformer * Merge branch 'langchain-ai:master' into google-translate * Add documentation for Google Translate Document Transformer * Fix line length error * Merge branch 'master' into google-translate * Merge branch 'google-translate' of https://github.com/holtskinner/lan… * Addressed code review comments * Merge branch 'master' into google-translate * Merge branch 'google-translate' of https://github.com/holtskinner/lan… * Removed extra variable * Merge branch 'google-translate' of https://github.com/holtskinner/lan… * Merge branch 'master' into google-translate * Merge branch 'google-translate' of https://github.com/holtskinner/lan… * Removed extra import	11 months ago
Samad Koita	d1fdcd4fcb	Masking of API Key for GooseAI LLM (#12496 ) Description: Add masking of API Key for GooseAI LLM when printed. Issue: https://github.com/langchain-ai/langchain/issues/12165 Dependencies: None Tag maintainer: @eyurtsev --------- Co-authored-by: Samad Koita <>	11 months ago
Andrew Zhou	64c4a698a8	More comprehensive readthedocs document loader (#12382 ) ## Description: When building our own readthedocs.io scraper, we noticed a couple interesting things: 1. Text lines with a lot of nested <span> tags would give unclean text with a bunch of newlines. For example, for [Langchain's documentation](https://api.python.langchain.com/en/latest/document_loaders/langchain.document_loaders.readthedocs.ReadTheDocsLoader.html#langchain.document_loaders.readthedocs.ReadTheDocsLoader), a single line is represented in a complicated nested HTML structure, and the naive `soup.get_text()` call currently being made will create a newline for each nested HTML element. Therefore, the document loader would give a messy, newline-separated blob of text. This would be true in a lot of cases. <img width="945" alt="Screenshot 2023-10-26 at 6 15 39 PM" src="https://github.com/langchain-ai/langchain/assets/44193474/eca85d1f-d2bf-4487-a18a-e1e732fadf19"> <img width="1031" alt="Screenshot 2023-10-26 at 6 16 00 PM" src="https://github.com/langchain-ai/langchain/assets/44193474/035938a0-9892-4f6a-83cd-0d7b409b00a3"> Additionally, content from iframes, code from scripts, css from styles, etc. will be gotten if it's a subclass of the selector (which happens more often than you'd think). For example, [this page](https://pydeck.gl/gallery/contour_layer.html#) will scrape 1.5 million characters of content that looks like this: <img width="1372" alt="Screenshot 2023-10-26 at 6 32 55 PM" src="https://github.com/langchain-ai/langchain/assets/44193474/dbd89e39-9478-4a18-9e84-f0eb91954eac"> Therefore, I wrote a recursive _get_clean_text(soup) class function that 1. skips all irrelevant elements, and 2. only adds newlines when necessary. 2. Index pages (like [this one](https://api.python.langchain.com/en/latest/api_reference.html)) would be loaded, chunked, and eventually embedded. This is really bad not just because the user will be embedding irrelevant information - but because index pages are very likely to show up in retrieved content, making retrieval less effective (in our tests). Therefore, I added a bool parameter `exclude_index_pages` defaulted to False (which is the current behavior — although I'd petition to default this to True) that will skip all pages where links take up 50%+ of the page. Through manual testing, this seems to be the best threshold. ## Other Information: - Issue: n/a - Dependencies: n/a - Tag maintainer: n/a - Twitter handle: @andrewthezhou --------- Co-authored-by: Andrew Zhou <andrew@heykona.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Peter Vandenabeele	3468c038ba	Add unit tests for document_transformers/beautiful_soup_transformer.py (#12520 ) - Description: * Add unit tests for document_transformers/beautiful_soup_transformer.py * Basic functionality is tested (extract tags, remove tags, drop lines) * add a FIXME comment about the order of tags that is not preserved (and a passing test, but with the expected tags now out-of-order) - Issue: None - Dependencies: None - Tag maintainer: @rlancemartin - Twitter handle: `peter_v` Please make sure your PR is passing linting and testing before submitting. => OK: I ran `make format`, `make test` (passing after install of beautifulsoup4) and `make lint`.	11 months ago
Anirudh Gautam	b257e6a4e8	Mask API key for AI21 LLM (#12418 ) - Description: Added masking of the API Key for AI21 LLM when printed and improved the docstring for AI21 LLM. - Updated the AI21 LLM to utilize SecretStr from pydantic to securely manage API key. - Made improvements in the docstring of AI21 LLM. It now mentions that the API key can also be passed as a named parameter to the constructor. - Added unit tests. - Issue: #12165 - Tag maintainer: @eyurtsev --------- Co-authored-by: Anirudh Gautam <anirudh@Anirudhs-Mac-mini.local>	11 months ago
silvhua	9dead1034c	`_dalle_image_url` returns list of urls if n>1 (#11800 ) - Description: Updated the `_dalle_image_url` method to return a list of URLs if self.n>1, - Issue: #10691, - Dependencies: unsure, - Tag maintainer: @eyurtsev, - Twitter handle: @silvhua --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Bagatur	1815ea2fdb	OpenAI runnable constructor (#12455 )	11 months ago
William FH	a830b809f3	Patch forward ref bug (#12508 ) Currently this gives a bug: ``` from langchain.schema.runnable import RunnableLambda bound = RunnableLambda(lambda x: x).with_config({"callbacks": []}) # ConfigError: field "callbacks" not yet prepared so type is still a ForwardRef, you might need to call RunnableConfig.update_forward_refs(). ``` Rather than deal with cyclic imports and extra load time, etc., I think it makes sense to just have a separate Callbacks definition here that is a relaxed typehint.	11 months ago
William FH	36204c2baf	Evaluation Callback Multi Response (#12505 ) 1. Allow run evaluators to return {"results": [list of evaluation results]} in the evaluator callback. 2. Allows run evaluators to pick the target run ID to provide feedback to (1) means you could do something like a function call that populates a full rubric in one go (not sure how reliable that is in general though) rather than splitting off into separate LLM calls - cheaper and less code to write (2) means you can provide feedback to runs on subsequent calls. Immediate use case is if you wanted to add an evaluator to a chat bot and assign to assign to previous conversation turns have a corresponding one in the SDK	11 months ago
Harrison Chase	9e0ae56287	various templates improvements (#12500 )	11 months ago
0xC9	79cf01366e	Update tool.py (#12472 ) In the GoogleSerperResults class, the name field is defined as 'google_serrper_results_json'. This looks like a typo, and perhaps should be 'google_serper_results_json'. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Harrison Chase	eb903e211c	bump to 36 (#12487 )	11 months ago
Tyler Hutcherson	4209457bdc	Redis langserve template (#12443 ) Add Redis langserve template! Eventually will add semantic caching to this too. But I was struggling to get that to work for some reason with the LCEL implementation here. - Description: Introduces the Redis LangServe template. A simple RAG based app built on top of Redis that allows you to chat with company's public financial data (Edgar 10k filings) - Issue: None - Dependencies: The template contains the poetry project requirements to run this template - Tag maintainer: @baskaryan @Spartee - Twitter handle: @tchutch94 Note: this requires the commit here that deletes the `_aget_relevant_documents()` method from the Redis retriever class that wasn't implemented. That was breaking the langserve app. --------- Co-authored-by: Sam Partee <sam.partee@redis.com>	11 months ago
Erick Friis	9adaa78c65	cli improvements (#12465 ) Features - add multiple repos by their branch/repo - generate `pip install` commands and `add_route()` code ![Screenshot 2023-10-27 at 4 49 52 PM](https://github.com/langchain-ai/langchain/assets/9557659/3aec4cbb-3f67-4f04-8370-5b54ea983b2a) Optimizations: - group installs by repo/branch to avoid duplicate cloning	11 months ago
Adam Law	df4960a6d8	add reranking to azuresearch (#12454 ) -Description Adds returning the reranking score when using semantic search -*Issue: #12317 --------- Co-authored-by: Adam Law <adamlaw@microsoft.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Eugene Yurtsev	60d009f75a	Add security note to API chain (#12452 ) Add security note	11 months ago
Matvey Arye	11505f95d3	Improve handling of empty queries for timescale vector (#12393 ) Description: Improve handling of empty queries in timescale-vector. For timescale-vector it is more efficient to get a None embedding when the embedding has no semantic meaning. It allows timescale-vector to perform more optimizations. Thus, when the query is empty, use a None embedding. Also pass down constructor arguments to the timescale vector client. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Erick Friis	38cee5fae0	cli updates 2 (#12447 ) - extras group - readme - another readme --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
William FH	5d40e36c75	Trace if run tree set (#12444 ) This code path is hit in the following case: - Start in langchain code and manually provide a tracer - Handoff to the traceable - Hand back to langchain code. Which happens for evaluating `@traceable` functions unfortunately	11 months ago
Bagatur	c2a0a6b6df	make doc utils public (#12394 )	11 months ago
Henter	d6888a90d0	Fix the missing temperature parameter for Baichuan-AI chat_model (#12420 ) Description: the missing `temperature` parameter for Baichuan-AI chat_model Baichuan-AI api doc: https://platform.baichuan-ai.com/docs/api	11 months ago
Erick Friis	6908634428	cli updates oct27 (#12436 )	11 months ago
HwangJohn	d38c8369b3	added rrf argument in ApproxRetrievalStrategy class __init__() (#11987 ) - Description: To handle the hybrid search with RRF(Reciprocal Rank Fusion) in the Elasticsearch, rrf argument was added for adjusting 'rank_constant' and 'window_size' to combine multiple result sets with different relevance indicators into a single result set. (ref: https://www.elastic.co/kr/blog/whats-new-elastic-enterprise-search-8-9-0), - Issue: the issue # it fixes (if applicable), - Dependencies: No dependencies changed, - Tag maintainer: @baskaryan, Nice to meet you, I'm a newbie for contributions and it's my first PR. I only changed the langchain/vectorstores/elasticsearch.py file. I did make format&lint I got this message, ```shell make lint_diff ./scripts/check_pydantic.sh . ./scripts/check_imports.sh poetry run ruff . [ "langchain/vectorstores/elasticsearch.py" = "" ] \|\| poetry run black langchain/vectorstores/elasticsearch.py --check All done! ✨ 🍰 ✨ 1 file would be left unchanged. [ "langchain/vectorstores/elasticsearch.py" = "" ] \|\| poetry run mypy langchain/vectorstores/elasticsearch.py langchain/__init__.py: error: Source file found twice under different module names: "mvp.nlp.langchain.libs.langchain.langchain" and "langchain" Found 1 error in 1 file (errors prevented further checking) make: * [lint_diff] Error 2 ``` Thank you --------- Co-authored-by: 황중원 <jwhwang@amorepacific.com>	11 months ago
Roman Vasilyev	2c58dca5f0	optional reusable connection (#12051 ) My postgres out of connections after continuous PGVector usage, and the reason because it constantly creates new connections, so adding a reusable pre established connection seems like solves an issue --------- Co-authored-by: Roman Vasilyev <rvasilyev@mozilla.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
Ennio Pastore	48fde2004f	Update long_context_reorder.py (#12422 ) The function comment was confusing and inaccurate	11 months ago
Bagatur	a8c68d4ffa	Type LLMChain.llm as runnable (#12385 )	11 months ago
Bagatur	d12b88557a	Bagatur/bump 325 (#12440 )	11 months ago
Eugene Yurtsev	cadfce295f	Deprecate PythonRepl tools and Pandas/Xorbits/Spark DataFrame/Python/CSV agents (#12427 ) See discussion here: https://github.com/langchain-ai/langchain/discussions/11680 The code is available for usage from langchain_experimental. The reason for the deprecation is that the agents are relying on a Python REPL. The code can only be run safely with appropriate sandboxing. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Harrison Chase	0ca539eb85	Clean up deprecated agents and update __init__ in experimental (#12231 ) Update init paths in experimental	11 months ago
Holt Skinner	134f085824	feat: Add Google Speech to Text API Document Loader (#12298 ) - Add Document Loader for Google Speech to Text - Similar Structure to [Assembly AI Document Loader][1] [1]: https://python.langchain.com/docs/integrations/document_loaders/assemblyai	11 months ago
David Duong	52c194ec3a	Fix templates typos (#12428 )	11 months ago
Massimiliano Pronesti	c8195769f2	fix(openai-callback): completion count logic (#12383 ) The changes introduced in #12267 and #12190 broke the cost computation of the `completion` tokens for fine-tuned models because of the early return. This PR aims at fixing this. @baskaryan.	11 months ago
Stefan Langenbach	b22da81af8	Mask API key for Aleph Alpha LLM (#12377 ) - Description: Add masking of API Key for Aleph Alpha LLM when printed. - Issue: #12165 - Dependencies: None - Tag maintainer: @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
William FH	4254028c52	Str Evaluator Mapper (#12401 )	11 months ago
William FH	fcad1d2965	Add space (#12395 )	11 months ago
William FH	922d7910ef	Wfh/json schema evaluation (#12389 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	11 months ago
Christian Kasim Loan	a35445c65f	johnsnowlabs embeddings support (#11271 ) - Description: Introducing the [JohnSnowLabsEmbeddings](https://www.johnsnowlabs.com/) - Dependencies: johnsnowlabs - Tag maintainer: @C-K-Loan - Twitter handle: https://twitter.com/JohnSnowLabs https://twitter.com/ChristianKasimL --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
SteveLiao	c08b622b2d	Add HTML Title and Page Language into metadata for AsyncHtmlLoader (#11326 ) Description: Revise `libs/langchain/langchain/document_loaders/async_html.py` to store the HTML Title and Page Language in the `metadata` of `AsyncHtmlLoader`.	11 months ago
Shorthills AI	25c98dbba9	Fixed some grammatical and Exception types issues (#12015 ) Fixed some grammatical issues and Exception types. @baskaryan , @eyurtsev --------- Co-authored-by: Sanskar Tanwar <142409040+SanskarTanwarShorthillsAI@users.noreply.github.com> Co-authored-by: UpneetShorthillsAI <144228282+UpneetShorthillsAI@users.noreply.github.com> Co-authored-by: HarshGuptaShorthillsAI <144897987+HarshGuptaShorthillsAI@users.noreply.github.com> Co-authored-by: AdityaKalraShorthillsAI <143726711+AdityaKalraShorthillsAI@users.noreply.github.com> Co-authored-by: SakshiShorthillsAI <144228183+SakshiShorthillsAI@users.noreply.github.com>	11 months ago
William FH	923696b664	Wfh/json edit dist (#12361 ) Compare predicted json to reference. First canonicalize (sort keys, rm whitespace separators), then return normalized string edit distance. Not a silver bullet but maybe an easy way to capture structure differences in a less flakey way --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	11 months ago
Erick Friis	4db8d82c55	CLI CI 2 (#12387 ) Will run all CI because of _test change, but future PRs against CLI will only trigger the new CLI one Has a bunch of file changes related to formatting/linting. No mypy yet - coming soon	11 months ago
Tyler Hutcherson	231d553824	Update broken redis tests (#12371 ) Update broken redis tests -- tiny PR :) - Description: Fixes Redis tests on master (look like it was broken by https://github.com/langchain-ai/langchain/pull/11257) - Issue: None, - Dependencies: No - Tag maintainer: @baskaryan @Spartee - Twitter handle: N/A Co-authored-by: Sam Partee <sam.partee@redis.com>	11 months ago
Erick Friis	03e79e62c2	cli fix (#12380 )	11 months ago
Bagatur	76230d2c08	fireworks scheduled integration tests (#12373 )	11 months ago
Josh Phillips	01c5cd365b	Fix SupbaseVectoreStore write operation timeout (#12318 ) Description This small change will make chunk_size a configurable parameter for loading documents into a Supabase database. Issue https://github.com/langchain-ai/langchain/issues/11422 Dependencies No chanages Twitter @ j1philli Reminder If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Greg Richardson <greg.nmr@gmail.com>	11 months ago
Bagatur	b10cefb160	lint fix: rm init (#12374 )	11 months ago
Harrison Chase	b43996e553	Harrison/improve cli (#12368 )	11 months ago
Harrison Chase	9ce38726a2	fix some stuff (#12292 ) Co-authored-by: Erick Friis <erick@langchain.dev>	11 months ago
Cynthia Yang	6ce276e099	Support Fireworks batching (#8 ) (#12052 ) Description * Add _generate and _agenerate to support Fireworks batching. * Add stop words test cases * Opt out retry mechanism Issue - Not applicable Dependencies - None Tag maintainer - @baskaryan	11 months ago
Tyler Hutcherson	2f0c9d8269	Fix redis vectorfield schema defaults (#12223 ) - Description: refactors the redis vector field schema to properly handle default values, includes a new unit test suite. - Issue: N/A - Dependencies: nothing new. - Tag maintainer: @baskaryan @Spartee - Twitter handle: this is a tiny fix/improvement :) This issue was causing some clients/cuatomers issues when building a vector index on Redis on smaller db instances (due to fault default values in index configuration). It would raise an error like: ```redis.exceptions.ResponseError: Vector index initial capacity 20000 exceeded server limit (852 with the given parameters)``` This PR will address this moving forward.	11 months ago
Jakub Novák	9544d64ad8	E2B tool - Improve description wuth uploaded files info (#12355 )	11 months ago
Bagatur	c6a733802b	bump 324 and 35 (#12352 )	11 months ago
Nuno Campos	683e97766d	Fix json key output parser in partial (streaming) mode (#12332 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nikhil Jha	dff24285ea	Comprehend Moderation 0.2 (#11730 ) This PR replaces the previous `Intent` check with the new `Prompt Safety` check. The logic and steps to enable chain moderation via the Amazon Comprehend service, allowing you to detect and redact PII, Toxic, and Prompt Safety information in the LLM prompt or answer remains unchanged. This implementation updates the code and configuration types with respect to `Prompt Safety`. ### Usage sample ```python from langchain_experimental.comprehend_moderation import (BaseModerationConfig, ModerationPromptSafetyConfig, ModerationPiiConfig, ModerationToxicityConfig ) pii_config = ModerationPiiConfig( labels=["SSN"], redact=True, mask_character="X" ) toxicity_config = ModerationToxicityConfig( threshold=0.5 ) prompt_safety_config = ModerationPromptSafetyConfig( threshold=0.5 ) moderation_config = BaseModerationConfig( filters=[pii_config, toxicity_config, prompt_safety_config] ) comp_moderation_with_config = AmazonComprehendModerationChain( moderation_config=moderation_config, #specify the configuration client=comprehend_client, #optionally pass the Boto3 Client verbose=True ) template = """Question: {question} Answer:""" prompt = PromptTemplate(template=template, input_variables=["question"]) responses = [ "Final Answer: A credit card number looks like 1289-2321-1123-2387. A fake SSN number looks like 323-22-9980. John Doe's phone number is (999)253-9876.", "Final Answer: This is a really shitty way of constructing a birdhouse. This is fucking insane to think that any birds would actually create their motherfucking nests here." ] llm = FakeListLLM(responses=responses) llm_chain = LLMChain(prompt=prompt, llm=llm) chain = ( prompt \| comp_moderation_with_config \| {llm_chain.input_keys[0]: lambda x: x['output'] } \| llm_chain \| { "input": lambda x: x['text'] } \| comp_moderation_with_config ) try: response = chain.invoke({"question": "A sample SSN number looks like this 123-456-7890. Can you give me some more samples?"}) except Exception as e: print(str(e)) else: print(response['output']) ``` ### Output ```python > Entering new AmazonComprehendModerationChain chain... Running AmazonComprehendModerationChain... Running pii Validation... Running toxicity Validation... Running prompt safety Validation... > Finished chain. > Entering new AmazonComprehendModerationChain chain... Running AmazonComprehendModerationChain... Running pii Validation... Running toxicity Validation... Running prompt safety Validation... > Finished chain. Final Answer: A credit card number looks like 1289-2321-1123-2387. A fake SSN number looks like XXXXXXXXXXXX John Doe's phone number is (999)253-9876. ``` --------- Co-authored-by: Jha <nikjha@amazon.com> Co-authored-by: Anjan Biswas <anjanavb@amazon.com> Co-authored-by: Anjan Biswas <84933469+anjanvb@users.noreply.github.com>	11 months ago
Blake (Yung Cher Ho)	b9410f2b6f	Takeoff pro support (#12070 ) Description: This PR adds support for the [Pro version of Titan Takeoff Server](https://docs.titanml.co/docs/category/pro-features). Users of the Pro version will have to import the TitanTakeoffPro model, which is different from TitanTakeoff. Issue: Also minor fixes to docs for Titan Takeoff (Community version) Dependencies: No additional dependencies Twitter handle: @becoming_blake @baskaryan @hwchase17	11 months ago
Leonid Kuligin	4e47fe1dce	fixed error message and a check for processor name (#12200 ) Replace this entire comment with: - Description: a small fix on error description / a check for processor name - Issue: the issue #11407	11 months ago
Nir Kopler	9298aff783	Finetuned openai azure models cost calculation (#12267 ) Description: Add cost calculation for fine tuned Azure with relevant unit tests. see https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/fine-tuning?tabs=turbo&pivots=programming-language-studio for more information. this PR is the result of this PR: https://github.com/langchain-ai/langchain/pull/12190 Twitter handle: @nirkopler	11 months ago
gnakw	20fe515f20	Fix the exception from langchain.utilities import ArceeWrapper (#12342 ) - Description: Fix the exception from langchain.utilities import ArceeWrapper	11 months ago
Qihui Xie	6720458c7d	add allowed_operators property in QdrantTranslator (#12328 ) - Description: This PR adds `allowd_operators` property to `QdrantTranslator` to fix the `TypeError: can only join an iterable` bug. This property is required in `get_query_constructor_prompt` in `query_constructor\base.py`: ``` allowed_operators=" \| ".join(allowed_operators), ``` - Issue: #12061 --------- Co-authored-by: XIE Qihui <qihui.xie@bopufund.com>	11 months ago
Bagatur	f5a57fc1ef	fix self query constructor (#12349 )	11 months ago
Vasek Mlejnsky	cdd75b687e	e2b tool - fix initialization and improve tool description (#12345 )	11 months ago
Harrison Chase	8ec7aade9f	add docs for templates (#12346 )	11 months ago
Erick Friis	ebf998acb6	Templates (#12294 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Lance Martin <lance@langchain.dev> Co-authored-by: Jacob Lee <jacoblee93@gmail.com>	11 months ago
Erick Friis	43257a295c	CLI Git Improvements (#12311 ) - delete repo sources like pip - git dep fixes - error messaging	11 months ago
William FH	1d568e1add	Better wrap traceable (#12303 ) If user function is wrapped as a traceable function, this will help hand off the trace between the two. Also update handling fields to reflect optional values	11 months ago
Eugene Yurtsev	5a71b81609	Relax type annotation for custom input/output types (#12300 ) This is needed to be able to do stuff like: ```python runnable.with_types(input_type=List[str]) ```	11 months ago
William FH	988f6d9912	Rm langchain server (#12305 )	11 months ago
wemysschen	3f16acc538	Add baidu cloud vector search in vectorstore and fix some unit test in vectorstores (#11605 ) Description: Add baidu cloud vector search in vectorstore --------- Co-authored-by: root <root@icoding-cwx.bcc-szzj.baidu.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
mrbean	b7e559c7e1	use snippet search optionally (#12236 ) Add an additional flag which allows for hitting our new endpoint.	11 months ago
felixocker	cce132d146	fix sparql queries for relations in schema description (#9136 ) - Description: Fix for the SPARQL QA chain: fixed SPARQL queries for retrieving information about relations in the graph to create a textual description of the schema for the language model. This should resolve #8907 - Issue: #8907 - Dependencies: None - Tag maintainer: @baskaryan, @hwchase17	11 months ago
Donato Azevedo	d9f1bcf366	Strips leading/trailing whitespace before parsing xml (#12297 ) Description: When llms output leading or trailing whitespace for xml (when using XMLOutputParser) the parser would raise a `ValueError: Could not parse output: ...`. However, leading or trailing whitespace are "ignorable" in the sense of XML standard. Issue: I did not find an issue related. Dependencies: None Tag maintainer: Twitter handle: donatoaz Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. Done, updated unit test and ran `make docker_test`.	11 months ago
Erick Friis	47070b8314	CLI (#12284 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	11 months ago
Shwu Ku	07c2649753	response parser for ArceeRetriever (#12270 ) - Description: Response parser for arcee retriever, - Issue: follow-up pr on #11578 and [discussion](https://github.com/arcee-ai/arcee-python/issues/15#issuecomment-1759874053), - Dependencies: NA This pr implements a parser for the response from ArceeRetreiver to convert to langchain `Document`. This closes the loop of generation and retrieval for Arcee DALMs in langchain. The reference for the response parser is [api-docs:retrieve](https://api.arcee.ai/docs#/v2/retrieve_model) Attaching screenshot of working implementation: <img width="1984" alt="Screenshot 2023-10-25 at 7 42 34 PM" src="https://github.com/langchain-ai/langchain/assets/65639964/026987b9-34b2-4e4b-b87d-69fcd0c6641a"> \*api key deleted --- Successful tests, lints, etc. ```shell Re-run pytest with --snapshot-update to delete unused snapshots. ==================================================================================================================== slowest 5 durations ===================================================================================================================== 1.56s call tests/unit_tests/schema/runnable/test_runnable.py::test_retrying 0.63s call tests/unit_tests/schema/runnable/test_runnable.py::test_map_astream 0.33s call tests/unit_tests/schema/runnable/test_runnable.py::test_map_stream_iterator_input 0.30s call tests/unit_tests/schema/runnable/test_runnable.py::test_map_astream_iterator_input 0.20s call tests/unit_tests/indexes/test_indexing.py::test_cleanup_with_different_batchsize ======================================================================================================= 1265 passed, 270 skipped, 32 warnings in 6.55s ======================================================================================================= [ "." = "" ] \|\| poetry run black . All done! ✨ 🍰 ✨ 1871 files left unchanged. [ "." = "" ] \|\| poetry run ruff --select I --fix . ./scripts/check_pydantic.sh . ./scripts/check_imports.sh poetry run ruff . [ "." = "" ] \|\| poetry run black . --check All done! ✨ 🍰 ✨ 1871 files would be left unchanged. [ "." = "" ] \|\| poetry run mypy . Success: no issues found in 1868 source files poetry run codespell --toml pyproject.toml poetry run codespell --toml pyproject.toml -w ``` Co-authored-by: Shubham Kushwaha <shwu@Shubhams-MacBook-Pro.local>	11 months ago
Johanna Appel	c26ec7789f	CohereEmbeddings: Add max_retries and request_timeout (#12275 ) Add max_retries and request_timeout to CohereEmbeddings, akin to how it works in OpenAIEmbeddings. Since the Cohere client already implements these parameters, we can simply pass them down. Uses parameters from these two cohere client objects: https://github.com/cohere-ai/cohere-python/blob/main/cohere/client.py https://github.com/cohere-ai/cohere-python/blob/main/cohere/client_async.py	11 months ago
Nuno Campos	7108084947	Remove CLI (#12283 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	b5b2d07681	Pop max concurrency when recursing (#12281 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Bagatur	69f4e402e4	bump 323 (#12278 )	11 months ago
David Duong	c25b174db5	Add serialisation props to Fireworks and ChatFireworks (#12255 )	11 months ago
Richard Adams	fd5f549a9e	demonstrate use of RetrievalQAWithSourcesChain.from_chain (#12235 ) Description: Documents further usage of RetrievalQAWithSourcesChain in an existing test. I'd not found much documented usage of RetrievalQAWithSourcesChain and how to get the sources out. This additional code will hopefully be useful to other potential users of this retriever. Issue: No raised issue Dependencies: No new dependencies needed to run the test (it already needs `open-ai`, `faiss-cpu` and `unstructured`). Note - `make lint` showed 8 linting errors in unrelated files --------- Co-authored-by: richarda23 <richard.c.adams@infinityworks.com>	11 months ago
James Braza	53f35c5f5c	Adding `STRUCTURED_FORMAT_SIMPLE_INSTRUCTIONS` missing backticks (#12238 ) This PR fixes the fact that `STRUCTURED_FORMAT_SIMPLE_INSTRUCTIONS` was missing backticks at the end	11 months ago
William FH	276c6ba115	Check for ls project in run tree context (#12242 ) If I go traceable -> runnable when the project is manually specified, the runnable wont be logged. This makes sure the session/project is threaded through appropriately.	11 months ago
Vasek Mlejnsky	1f8094938f	Integrate E2B's data analysis/code interpreter (#12011 ) This PR adds a data [E2B's](https://e2b.dev/) analysis/code interpreter sandbox as a tool --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Jakub Novak <jakub@e2b.dev>	11 months ago
Bagatur	286a29a49e	bump 322 and 34 (#12228 )	11 months ago
Eugene Yurtsev	583dc49477	Add type to Generation and sub-classes, handle root validator (#12220 ) * Add a type literal for the generation and sub-classes for serialization purposes. * Fix the root validator of ChatGeneration to return ValueError instead of KeyError or Attribute error if intialized improperly. * This change is done for langserve to make sure that llm related callbacks can be serialized/deserialized properly.	11 months ago
Eugene Yurtsev	81052ee18e	Fix code block in runnable doc (#12221 ) Fix code block syntax in runnable doc-string	11 months ago
Mikelarg	46e28b9613	Added GigaChat chat model support (#12201 ) - Description: Added integration with [GigaChat](https://developers.sber.ru/portal/products/gigachat) language model. - Twitter handle: @dvoshansky	11 months ago
Anurag Wagh	d5c2ce7c2e	[fix] create redis vector index before adding docs, add prefix to doc… (#11257 ) Fix Description: For Redis Vector integration in add_texts method, there were two issues that lead to this bug. 1. Vector index is not being created leading to no such_index error 2. `doc:index` prefix was also missing for Redis Keys. resolves #11197 Maintainer: @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Eugene Yurtsev	079d1f3b8e	Expose handle_event and ahandle_events as public API (#12181 ) Expose functionality to handle generic events.	11 months ago
William FH	67c4fd0ad0	Update deprecation (#12178 ) in runner_utils	11 months ago
Nir Kopler	d3744175bf	Finetuned OpenAI models cost calculation #11715 (#12190 ) Description: Add cost calculation for fine tuned models (new and legacy), this is required after OpenAI added new models for fine tuning and separated the costs of I/O for fine tuned models. Also I updated the relevant unit tests see https://platform.openai.com/docs/guides/fine-tuning for more information. issue: https://github.com/langchain-ai/langchain/issues/11715 - Issue: 11715 - Twitter handle: @nirkopler	11 months ago
Spyros	a2840a2b42	fix vertexai codey models (#12173 ) Description: This PR fixes issue #12156 by checking for Codey models appropriately before result parsing. Maintainer: @hwchase17 , @agola11	11 months ago
Hech	d76f026d72	Fix flexible dimension and doc for DingoDB (#12187 )	11 months ago
Erick Friis	95ae40ff90	Fix Anthropic Functions ainvoke (#12215 ) Removes custom `NotImplementedError` in experimental anthropic functions, allowing it to fallback on default `ainvoke` implementation.	11 months ago
Iskren Ivov Chernev	d5d7ba582a	Improvements to llm/deepinfra (#10846 ) - replace `requests` package with `langchain.requests` - add `_acall` support - add `_stream` and `_astream` - freshen up the documentation a bit - update vendor doc	11 months ago
sudranga	f09f82541b	Expose configuration options in GraphCypherQAChain (#12159 ) Allows for passing arguments into the LLM chains used by the GraphCypherQAChain. This is to address a request by a user to include memory in the Cypher creating chain. Will keep the prompt variables as-is to be backward compatible. But, would be a good idea to deprecate them and use the **kwargs variables. Added a test case. In general, I think it would be good for any chain to automatically pass in a readonlymemory(of its input) to its subchains whilist allowing for an override. But, this would be a different change.	11 months ago
Leonid Ganeline	11f13aed53	docstrings update (#12093 ) Added missed docstrings. Added missed Args:, Returns: Raises:	11 months ago
Johnny Oshika	ba20c14e28	Fix typo in stuff_prompt's system_template (#12063 ) - Description: Add missing apostrophe in `user's` in stuff_prompt's system_template. The first sentence in the system template went from: > Use the following pieces of context to answer the users question. to > Use the following pieces of context to answer the user's question. - Issue: - Dependencies: none - Tag maintainer: @baskaryan - Twitter handle: ojohnnyo	11 months ago
Holt Skinner	69d9eae5cd	feat: Add Client Info to available Google Cloud Clients (#12168 ) - This is used internally to gather aggregate usage metrics for the LangChain integrations - Note: This cannot be added to some of the Vertex AI integrations at this time because the SDK doesn't allow overriding the [`ClientInfo`](https://googleapis.dev/python/google-api-core/latest/client_info.html#module-google.api_core.client_info) - Added to: - BigQuery - Google Cloud Storage - Document AI - Vertex AI Model Garden - Document AI Warehouse - Vertex AI Search - Vertex AI Matching Engine (Cloud Storage Client) @baskaryan, @eyurtsev, @hwchase17 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
Lukas Wolf	69f5f82804	Update extraction.py (#12207 ) Description: Pass tags as argument to create_extraction_chain Issue: create_extraction_chain does not pass tags to chain yet @baskaryan	11 months ago
Nuno Campos	34ffb94770	Remove GetLocal, PutLocal (#12133 ) Do you agree?	11 months ago
Eric Hartford	8c150ad7f6	Add COBOL parser and splitter (#11674 ) - Description: Add COBOL parser and splitter - Issue: n/a - Dependencies: n/a - Tag maintainer: @baskaryan - Twitter handle: erhartford --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
John Mai	ebf749c40c	Baichuan & Hunyuan set default api_base (#12059 ) ### Description Baichuan & Hunyuan set default api_base env	11 months ago
Shilong Dai	99afc1b4f8	Fixed hardcoded "vector" and replaced with vector_query_field variable (#12126 ) - Description: In the max_marginal_relevance_search function of the ElasticsearchStore vector store, the name of the field corresponding to the vector embedding of the document is hard coded in the delete statement that drops the field from the document metadata. This results in an exception if the vector embedding field is customized. This PR changes the hard-coded "vector" into the vector_query_field variable. - Issue: None - Dependencies: None - Tag maintainer: @hwchase17 Co-authored-by: Shilong Dai <sdai@viperfish.net>	11 months ago
Vikram Shitole	0d44746430	10634: Added the capability to inject boto3 client in SagemakerEndpointEmbeddings (#12146 ) Description: Allow to inject boto3 client for Cross account access type of scenarios in using SagemakerEndpointEmbeddings and also updated the documentation for same in the sample notebook Issue:SagemakerEndpointEmbeddings cross account capability #10634 #10184 Dependencies: None Tag maintainer: Twitter handle:lethargicoder Co-authored-by: Vikram(VS) <vssht@amazon.com>	11 months ago
aubin_mzt	66f8cb015d	Add connection args for pgvector vector store (#11930 ) - Description: sqlalchemy create_engine() does not take into account connect_args which are mandatory for managed PGSQL instances on cloud providers (ssl_context for example). Also re-enabled create_vector_extension at post_init for using pgvector class seamlessly - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Sami Bargaoui <bargaoui.sam@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	11 months ago
NuODaniel	4d6243fa87	fix: doc string of default params in chat_models, llm qianfan (#12153 ) - Description: a fix of the doc string in Qianfan - Issue: no - Dependencies: no - Tag maintainer: @baskaryan - Twitter handle: no	11 months ago
Predrag Gruevski	f82bdf4613	Update deprecated `langchain` imports with suggested new paths. (#12164 ) Let's help our users find the proper import to use instead of the deprecated top-level ones.	11 months ago
Bagatur	963ff93476	bump 321 (#12161 )	11 months ago
Nuno Campos	d0505c0d47	Update default recursion_limit, update docs (#12134 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
William FH	4f23aa677a	Fix Pickle Error (#12141 ) If non-pickleable objects (like locks) get passed to the tracing callback, they'll fail in the deepcopy. Fallback to a shallow copy in these instances .	11 months ago
Predrag Gruevski	95a1b598fe	Update to `actions/checkout@v4`. (#11951 ) We don't use any of the new functionality at the moment. Just making sure we don't fall back on versions and fail to benefit from new patches. This is an easy upgrade and it's always harder to upgrade across multiple major versions at once.	11 months ago
William FH	7c4f340cc0	Include Parent Run ID (#12139 ) If you set local callbacks	11 months ago
omahs	f3cc9bba5b	Fix typos (#12128 ) Fix typos	11 months ago
Nuno Campos	1afdb40b48	Add optional config arg to RunnablePassthrough func arg (#12131 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	325fdde8b4	Fix bug where types were lost when calling with_cconfig or bind (#12137 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	02dce74b97	Fix type hint for older py versions (#12132 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	d0ce374731	Allow specifying custom input/output schemas for runnables with .with_types() (#12083 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Harrison Chase	ee69116761	move csv agent to langchain experimental (#12113 )	11 months ago
Harrison Chase	03bf6ef473	add missing init files (#12114 )	11 months ago
Bagatur	ef8b180d6d	bump 320 (#12108 )	11 months ago
Rotem Weiss	78d186fb44	Add Tavily Search API as a Tool (#12103 ) Adding Tavily Search API as a tool. I will be the maintainer and assaf_elovic is the twitter handler. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Bagatur	85302a9ec1	Add CI check that integration tests compile (#12090 )	11 months ago
verlocks	5dbe456aae	Bug fix tongyi.py to be compatible with DashScope API (#11956 ) Current ChatTongyi is not compatible with DashScope API, which will cause error when passing api key to chat model directly. - Description: Update tongyi.py to be compatible with DashScope API. Specifically, update parameter name "dashscope_api_key" to "api_key". - Issue: None. - Dependencies: Nothing new, Tongyi would require DashScope as before.	11 months ago
Tomaz Bratanic	82f4c0589c	Add neo4j graph environment variables (#12080 )	11 months ago
Mohammad Mohtashim	d5400f6502	Google Scholar Search Tool using serpapi (#11513 ) - Description: Implementing the Google Scholar Tool as requested in PR #11505. The tool will be using the [serpapi python package](https://serpapi.com/integrations/python#search-google-scholar). The main idea of the tool will be to return the results from a Google Scholar search given a query as an input to the tool. - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17	11 months ago
Holt Skinner	f5be2d525a	fix: Add `_serving_config` property to `GoogleVertexAISearchRetriever` (#12084 ) - Fixes error: ``` ValueError: "GoogleVertexAISearchRetriever" object has no field "_serving_config" ``` Introduced in #11736 @baskaryan, @eyurtsev, @hwchase17 if you could review and merge quickly, that would be appreciated :)	11 months ago
Nuno Campos	5fee61a207	Support runnable factories in .configurable_alts() (#12065 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Zhitao Xu	a4c3a44712	Fix documentation typo in Clickhouse Class (#12047 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: The return info in the documentation for similarity_search_by_vector and similarity_search_with_relevance_scores is wrong	11 months ago
William FH	25418b9b4d	Always add run ID (#12046 ) in eval callback handler. Useful if you're using a custom run evaluator and don't want to thread things through.	11 months ago
Eugene Yurtsev	44d7763580	Add zapier deprecation warning (#12045 ) Add zapier deprecation	11 months ago
John Mai	4188f046ec	Add Tencent Hunyuan chat model (#12022 ) ### Description: The Tencent Hunyuan model, developed by Tencent, is a large language model by robust Chinese text generation capabilities, adeptness in logical reasoning within complex contexts, and reliable task execution proficiency.For more information, see [https://cloud.tencent.com/document/product/1729](https://cloud.tencent.com/document/product/1729)	11 months ago
Eugene Yurtsev	68599d98c2	More security notes (#12040 ) Add more security notes	11 months ago
Bagatur	0006075b08	bump 319 (#12041 )	11 months ago
John Mai	8eb40b5fe2	`baichuan_secret_key` use pydantic.types.SecretStr & Add Baichuan tests (#12031 ) ### Description - `baichuan_secret_key` use pydantic.types.SecretStr - Add Baichuan tests	11 months ago
Nuno Campos	85bac75729	nc/runnable-dynamic-schemas-from-config (#12038 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	85eaa4ccee	Revert "nc/runnable-dynamic-schemas-from-config" (#12037 ) This reverts commit `a46eef64a7`. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	a46eef64a7	nc/runnable-dynamic-schemas-from-config	11 months ago
Nuno Campos	d392e030be	Add default value (#12032 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Kenneth Choe	62efe1ffb9	support add_embeddings for elasticsearch (#11002 ) - Description: Provide a way to use different text for embedding. - For example, if you are ingesting stack-overflow Q&As for RAG, you would want to embed the questions and return the answer(s) for the hits. With this change, the consumer of langchain can implement that easily. - I noticed the similar function is added on faiss.py with #1912 which was for performance reason, but I see the same function can be used to achieve what I thought. So instead of changing Document class to have embedding_content, I mimicked the implementation of faiss.py. - The test should provide some guidance on how to use it. It would be more intuitive if I just pass texts and embedding_texts as separate arguments, but I chose to use `zip`-ed object for the consistency with faiss.py implementation. - I plan to make similar pull request for OpenSearch. - Issue: N/A - Dependencies: None other than the existing ones. Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Bagatur	76d3afaef0	bump 318 (#12030 )	11 months ago
Dmitry Tyumentsev	5dd2161c4b	add _acall method to YandexGPT (#12029 ) - Description: Add async support for YandexGPT LLM model Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	11 months ago
Peter Krenesky	8425f33363	Pydantic v2 support for OpenAPI Specs (#11936 ) - Description: Adding Pydantic v2 support for OpenAPI Specs - Issue: - OpenAPI spec support was disabled because `openapi-schema-pydantic` doesn't support Pydantic v2: #9205 - Caused errors in `get_openapi_chain` - This may be the cause of #9520. - Tag maintainer: @eyurtsev - Twitter handle: kreneskyp The root cause was that `openapi-schema-pydantic` hasn't been updated in some time but [openapi-pydantic](https://github.com/mike-oakley/openapi-pydantic) forked and updated the project.	11 months ago
Joe McElroy	c9f1768cb9	Elasticsearch Query Retriever: Use match + fuzziness for LIKE (#12023 ) Updated the elasticsearch self query retriever to use the match clause for LIKE operator instead of the non-analyzed fuzzy search clause. Other small updates include: - fixing the stack inference integration test where the index's default pipeline didn't use the inference pipeline created - adding a user-agent to the old implementation to track usage - improved the documentation for ElasticsearchStore filters	11 months ago
Nuno Campos	7db6aabf65	Update chat model output type (#11833 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Simon Dai	ed62984cb2	update Weaviate to support multi tenancy (#11842 ) - Description: update Weaviate to support multi tenancy - Issue: 9956 - Dependencies: - Tag maintainer: hwchase17 - Twitter handle: dsx1986_	11 months ago
hiigao	f818ec49b8	Encapsulate alicloud pai-eas access method for chatmodels and llms (#11852 ) ### Description: To provide an eas llm service access methods in this pull request by impletementing `PaiEasEndpoint` and `PaiEasChatEndpoint` classes in `langchain.llms` and `langchain.chat_models` modules. Base on this pr, langchain users can build up a chain to call remote eas llm service and get the llm inference results. ### About EAS Service EAS is a Alicloud product on Alibaba Cloud Machine Learning Platform for AI which is short for AliCloud PAI. EAS provides model inference deployment services for the users. We build up a llm inference services on EAS with a general llm docker images. Therefore, end users can quickly setup their llm remote instances to load majority of the hugginface llm models, and serve as a backend for most of the llm apps. ### Dependencies This pr does't involve any new dependencies. --------- Co-authored-by: 子洪 <gaoyihong.gyh@alibaba-inc.com>	11 months ago
John Mai	a6b483dcbc	Supported RetryOutputParser & RetryWithErrorOutputParser max_retries (#11903 ) Description: Supported RetryOutputParser & RetryWithErrorOutputParser max_retries - max_retries: Maximum number of retries to parser. Issue: None Dependencies: None Tag maintainer: @baskaryan Twitter handle:	11 months ago
Hugues Chocart	008c7df80d	[LLMonitorCallbackHandler] Refactor + add llmonitor-py dependency (#11948 ) We now require uses to have the pip package `llmonitor` installed. It allows us to have cleaner code and avoid duplicates between our library and our code in Langchain.	11 months ago
Sian Cao	77fc2f7644	fix: impl missing embeddings method (#10823 ) FAISS does not implement embeddings method and use embed_query to embedding texts which is wrong for some embedding models. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Holt Skinner	2661dc94f3	feat: Google Vertex AI Search Retriever - Add support for Website Data Stores (#11736 ) - Only works for Data stores with Advanced Website Indexing - https://cloud.google.com/generative-ai-app-builder/docs/about-advanced-features - Minor restructuring - Follow up to #10513 - Remove outdated docs (readded in https://github.com/langchain-ai/langchain/pull/11620) - Move legacy class into new py file to clean up the directory - Shouldn't cause backwards compatibility issues as the import works the same way for users	11 months ago
Shorthills AI	4b6fdd7bf0	Update modal.py (#11588 ) feat: Raise KeyError when 'prompt' key is missing in JSON response This commit updates the error handling in the code to raise a KeyError when the 'prompt' key is not found in the JSON response. This change makes the code more explicit about the nature of the error, helping to improve clarity and debugging. @baskaryan, @eyurtsev.	11 months ago
William FH	dfb4baa3f9	Fix Fireworks Callbacks (#12003 ) I may be missing something but it seems like we inappropriately overrode the 'stream()' method, losing callbacks in the process. I don't think (?) it gave us anything in this case to customize it here? See new trace: https://smith.langchain.com/public/fbb82825-3a16-446b-8207-35622358db3b/r and confirmed it streams. Also fixes the stopwords issues from #12000	11 months ago
Wang Wei	e26559f512	Add ERNIE-Bot-4 model support for ErnieBotChat. (#11969 ) - Description: According to the document https://cloud.baidu.com/doc/WENXINWORKSHOP/s/clntwmv7t, add ERNIE-Bot-4 model support for ErnieBotChat. - Dependencies: Before using the ERNIE-Bot-4, you should have the model's access authority.	11 months ago
Eugene Yurtsev	f4bec9686d	Add more security notes (#11990 ) Add more security notes	11 months ago
Eugene Yurtsev	3d81c76160	Add security notes to agent toolkits (#11989 ) Add more security notes to agent toolkits.	11 months ago
Leonid Ganeline	b81a4c1d94	docstrings added (#11988 ) Added docstrings. Some docsctrings formatting.	11 months ago
Bagatur	35c7c1f050	bump 317 (#11986 )	11 months ago
Bagatur	122af2effe	fix chroma from_texts bug (#11984 )	11 months ago
Erick Friis	c149954cc5	Hub Runnable (#11946 ) Adds `langchain.runnables.hub.HubRunnable` for pulling configurable objects from the hub	11 months ago
Owen	9e24626e87	chore: remove duplicated export variables (#11962 ) - Description: remove duplicated `__all__` variables	11 months ago
Nuno Campos	6bd9c1d2b3	Make prompt validation opt-in (#11973 ) By default replace input_variables with the correct value <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	9bc7e1851a	Ensure dict() does not raise not implemented error, which should instead be raised in our custom method save() (#11970 ) .dict() is a Pydantic method that cannot raise exceptions, as it is used eg. in `__eq__` <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	653cf56e0e	Lint	11 months ago
Predrag Gruevski	debcf053eb	Fix `invalid escape sequence` warnings by using raw strings for regexes. (#11943 ) This code also generates warnings when our users' apps hit it, which is annoying and doesn't look great. Let's fix it.	11 months ago
Nuno Campos	e4ae690244	Sort order	11 months ago
Nuno Campos	b753bf3323	Make prompt validation opt-in By default replace input_variables with the correct value	11 months ago
Nuno Campos	202acce0c9	Ensure dict() does not raise not implemented error, which should instead be raised in our custom method save()	11 months ago
Predrag Gruevski	392df7b2e3	Type hints on varargs and kwargs that take anything should be `Any`. (#11950 ) Type hinting `args` as `List[Any]` means that each positional argument should be a list. Type hinting `*kwargs` as `Dict[str, Any]` means that each keyword argument should be a dict of strings. This is almost never what we actually wanted, and doesn't seem to be what we want in any of the cases I'm replacing here.	11 months ago
Eugene Yurtsev	908c7bf33e	Add documentation to tools (#11938 ) Add security notes to tools --------- Co-authored-by: Predrag Gruevski <2348618+obi1kenobi@users.noreply.github.com>	11 months ago
Eugene Yurtsev	43dc669332	Update playwright documentation (#11949 ) Add security note to playwright tool	11 months ago
Daniel Chalef	2beb767ae5	zep: Memory Retriever MMR Support & Docs Updates (#11954 ) - Update Zep Memory and Retriever docstrings - Zep Memory Retriever: Add support for native MMR - Add MMR example to existing ZepRetriever Notebook @baskaryan	11 months ago
William FH	a27fa9bf10	Use traceable context (#11896 ) Example ``` from langchain.schema.runnable import RunnableLambda from langsmith import traceable chain = RunnableLambda(lambda x: x) @traceable(run_type = "chain") def my_traceable(a): chain.invoke(a) my_traceable(5) ``` Would have a nested result. This would NOT work for interleaving chains and traceables. E.g., things like thiswould still not work well ``` from langchain.schema.runnable import RunnableLambda from langsmith import traceable @traceable() def other_traceable(a): return a def foo(x): return other_traceable(x) chain = RunnableLambda(foo) @traceable(run_type = "chain") def my_traceable(a): chain.invoke(a) my_traceable(5) ```	11 months ago
Predrag Gruevski	dcd0392423	Upgrade to newer black (23.10) and ruff (first 0.1.x!) versions. (#11944 ) Minor lint dependency version upgrade to pick up latest functionality. Ruff's new v0.1 version comes with lots of nice features, like fix-safety guarantees and a preview mode for not-yet-stable features: https://astral.sh/blog/ruff-v0.1.0	11 months ago
Trayan Azarov	1fd21ed21c	Chroma batching (#11203 ) - Description: Chroma >= 0.4.10 added support for batch sizes validation of add/upsert. This batch size is dependent on the SQLite limits of the target system and varies. In this change, for Chroma>=0.4.10 batch splitting was added as the aforementioned validation is starting to surface in the Chroma community (users using LC) - Issue: N/A - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: t_azarov	11 months ago
Guy Korland	9373b9c004	Add Graph interface (#11012 ) Replace this entire comment with: - Description: Add a Graph interface - Tag maintainer: @baskaryan @hwchase17 - Twitter handle: @g_korland	11 months ago
DanielZzz	b647505280	feat: support ChatModels Qianfan `QianfanChatEndpoint` function_call (#11107 ) - Description: * feature for `QianfanChatEndpoint` function_call ability, add integration_test for it * add `model`, `endpoint` supported in calling params * add raw response in ChatModel Message - Issue: * #10867 * #11105 * #10215 - Dependencies: no - Tag maintainer: @baskaryan - Twitter handle: no	11 months ago
M Bharat lal	67300567d3	GCSFileLoader retrieve blob custom metadata and append to document metadata (#11066 ) - Description: GCSFileLoader retrieve blob's custom metadata and append to document's metadata - Issue: #9975, - Tag maintainer: @baskaryan please review Co-authored-by: b0l00ib <bharat.lal@walmart.com>	11 months ago
billytrend-cohere	f4742dce50	Add Cohere retrieval augmented generation to retrievers (#11483 ) Add Cohere retrieval augmented generation to retrievers --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
刘方瑞	0a24ac7388	Revised notebook and add delete to MyScale vector store (#11848 ) - Description: - Add `.delete` to myscale vector store. - Revised vector store notebooks - Tag maintainer: @baskaryan - Twitter handle: @myscaledb @mpsk_liu	11 months ago
John Mai	3fb5e4d185	Add Baichuan chat model (#11923 ) Description: A large language models developed by Baichuan Intelligent Technology，https://www.baichuan-ai.com/home Issue: None Dependencies: None Tag maintainer: Twitter handle:	11 months ago
Eugene Yurtsev	9ecb7240a4	Add security note to recursive url loader (#11934 ) Add security note to recursive loader	11 months ago
maks-operlejn-ds	42dcc502c7	Anonymizer small fixes (#11915 )	11 months ago
Eugene Yurtsev	90e9ec6962	Sitemap specify default filter url (#11925 ) Specify default filter URL in sitemap loader and add a security note --------- Co-authored-by: Predrag Gruevski <2348618+obi1kenobi@users.noreply.github.com>	11 months ago
Bagatur	ba0d729961	bump 316 (#11928 )	11 months ago
Eugene Yurtsev	12d7eaa0c2	Add security notices to toolkits (#11900 ) This adds security notices to toolkits init, and to several toolkits. We'll need to continue documenting the rest of the toolkits. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Eugene Yurtsev	5f4a697ce3	Add deprecation warnings (#11899 ) Add deprecation warnings Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Nuno Campos	8b79cf9566	Add lock for using global config enum weak map (#11920 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	2a8ded6c8c	Export merge_configs function (#11916 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	11 months ago
Nuno Campos	778e7c526e	Add comment	11 months ago
Nuno Campos	19319e1746	Allow configs with None values	11 months ago
Nuno Campos	b0d5882fe1	Export merge_configs function	11 months ago
Nuno Campos	12596b9a9b	Add validation for configurable keys passed to .with_config() - Fix some typing issues found while doing that	11 months ago
Nuno Campos	754aca794f	remove print	11 months ago
Nuno Campos	cf448a6314	Ensure that configurable fields with enums support deduplication	11 months ago
Leonid Ganeline	31f264169d	evaluation criteria (#11681 ) the updated value was: ` Criteria.MISOGYNY: "Is the submission misogynistic? If so, respond Y." ` The " If so, respond Y." should not be here. This sub-string is not presented in any other criteria and should not be presented here. I also added a synonym to "misogynistic" as it done in many other criteria.	11 months ago
Dmitry Tyumentsev	e8c1850369	Add YandexGPT LLM and Chat model (#11703 ) Description: Introducing an ability to work with the [YandexGPT](https://cloud.yandex.com/en/services/yandexgpt) language model.	11 months ago
Bagatur	c15701eebf	Revert "Add baichuan model" (#11901 ) cc @cloudscool, apologies your PR wasn't actually passing CI	11 months ago
cloudscool	c1d811c4bc	Add baichuan model	11 months ago
John Mai	0169d45ba8	Supported OutputFixingParser max_retries (#11754 ) Description: Supported OutputFixingParser max_retries - max_retries: Maximum number of retries to parser. Issue: None Dependencies: None Tag maintainer: @baskaryan Twitter handle: @JohnMai95 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
volodymyr-memsql	ff8e6981ff	SingleStoreDBChatMessageHistory: Add singlestoredb support for ChatMessageHistory (#11705 ) Description - Added the `SingleStoreDBChatMessageHistory` class that inherits `BaseChatMessageHistory` and allows to use of a SingleStoreDB database as a storage for chat message history. - Added integration test to check that everything works (requires `singlestoredb` to be installed) - Added notebook with usage example - Removed custom retriever for SingleStoreDB vector store (as it is useless) --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com>	11 months ago
Mohammad Mohtashim	634ccb8ccd	test_stream_log_retriever Unit Test + Tool names fix (#11808 ) ## Description \| Tool \| Original Tool Name \| \|-----------------------------\|---------------------------\| \| open-meteo-api \| Open Meteo API \| \| news-api \| News API \| \| tmdb-api \| TMDB API \| \| podcast-api \| Podcast API \| \| golden_query \| Golden Query \| \| dall-e-image-generator \| Dall-E Image Generator \| \| twilio \| Text Message \| \| searx_search_results \| Searx Search Results \| \| dataforseo \| DataForSeo Results JSON \| When using these tools through `load_tools`, I encountered the following validation error: ```console openai.error.InvalidRequestError: 'TMDB API' does not match '^[a-zA-Z0-9_-]{1,64}$' - 'functions.0.name' ``` In order to avoid this error, I replaced spaces with hyphens in the tool names: \| Tool \| Corrected Tool Name \| \|-----------------------------\|---------------------------\| \| open-meteo-api \| Open-Meteo-API \| \| news-api \| News-API \| \| tmdb-api \| TMDB-API \| \| podcast-api \| Podcast-API \| \| golden_query \| Golden-Query \| \| dall-e-image-generator \| Dall-E-Image-Generator \| \| twilio \| Text-Message \| \| searx_search_results \| Searx-Search-Results \| \| dataforseo \| DataForSeo-Results-JSON \| This correction resolved the validation error. Additionally, a unit test, `tests/unit_tests/schema/runnable/test_runnable.py::test_stream_log_retriever`, was failing at random. Upon further investigation, I confirmed that the failure was not related to the above-mentioned changes. The `stream_log` variable was generating the order of logs in two ways at random The reason for this behavior is unclear, but in the assertion, I included both possible orders to account for this variability.	11 months ago
Predrag Gruevski	7c0f1bf23f	Upgrade experimental package dependencies and use Poetry 1.6.1. (#11339 ) Part of upgrading our CI to use Poetry 1.6.1.	11 months ago
Eugene Yurtsev	c2c0814a94	Add security notice to file management tool (#11878 ) Add security notice to file management tool --------- Co-authored-by: Predrag Gruevski <2348618+obi1kenobi@users.noreply.github.com>	11 months ago
zhaoshengbo	cb7e12f6ba	Adapt to the latest version of Alibaba Cloud OpenSearch vector store API (#11849 ) Hello Folks, Alibaba Cloud OpenSearch has released a new version of the vector storage engine, which has significantly improved performance compared to the previous version. At the same time, the sdk has also undergone changes, requiring adjustments alibaba opensearch vector store code to adapt. This PR includes: Adapt to the latest version of Alibaba Cloud OpenSearch API. More comprehensive unit testing. Improve documentation. I have read your contributing guidelines. And I have passed the tests below - [x] make format - [x] make lint - [x] make coverage - [x] make test --------- Co-authored-by: zhaoshengbo <shengbo.zsb@alibaba-inc.com>	11 months ago
Lee	e669f9d731	Fix: Sitemap Document Loader Tests and Documentation (#11866 ) Description: While working on the Docusaurus site loader #9138, I noticed some outdated docs and tests for the Sitemap Loader. Issue: This is tangentially related to #6691 in reference to doc links. I plan on digging in to a few of these issue when I find time next.	11 months ago
Jean-Louis Queguiner	8b697ff0ee	feat(llm): add together.xyz as an LLM provider (#11892 ) - Description: added together.xyz as an LLM provider, - Issues: fix some linting issues - twitter handle @jilijeanlouis --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Leonid Kuligin	d269dd2e2f	added a multiturn search based on Vertex AI Search (#11885 ) Replace this entire comment with: - Description: Added a retriever based on multi-turn Vertex AI Search - Twitter handle: lkuligin	11 months ago
Leonid Kuligin	38ed55245f	added Vertex examples as attributes (#11890 ) - Description: added examples to Vertex chat models as optional class attributes, so that a model with examples can be used inside a chain - Twitter handle: lkuligin	11 months ago
eryk-dsai	5019f59724	fix: more robust check whether the HF model is quantized (#11891 ) Removes the check of `model.is_quantized` and adds more robust way of checking for 4bit and 8bit quantization in the `huggingface_pipeline.py` script. I had to make the original change on the outdated version of `transformers`, because the models had this property before. Seems redundant now. Fixes: https://github.com/langchain-ai/langchain/issues/11809 and https://github.com/langchain-ai/langchain/issues/11759	11 months ago
Eugene Yurtsev	210a48cfb5	Add security considerations (#11869 ) Add security considerations to existing graph tools.	11 months ago
Bagatur	25b1d65305	bump 315 (#11850 )	11 months ago
Nuno Campos	4321d192ea	Use a less specific return type for \| on Runnables (#11762 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Harrison Chase	a506302772	bearly tool (#11812 )	11 months ago
Harrison Chase	4a2f0c51a1	use get_llm_cache and set_llm_cache (#11741 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Harrison Chase	f3ad22e64a	pipe default key (#11788 )	11 months ago
Eugene Yurtsev	0d37b4c27d	Add python,pandas,xorbits,spark agents to experimental (#11774 ) See for contex https://github.com/langchain-ai/langchain/discussions/11680	11 months ago
Michael Feil	233a904f2e	GradientLLM Docs update and model_id renaming. (#10963 ) Related to #10800 - Errors in the Docstring of GradientLLM / Gradient.ai LLM - Renamed the `model_id` to `model` and adapting this in all tests. Reason to so is to be in Sync with `GradientEmbeddings` and other LLM's. - inmproving tests so they check the headers in the sent request. - making the aiosession a private attribute in the docs, as in the future `pip install gradientai` will be replacing aiosession. - adding a example how to fine-tune on the Prompt Template as suggested in #10800	11 months ago
Bagatur	1559ba4bfc	fix upstash test import (#11781 )	11 months ago
Leonid Kuligin	9f0a718198	added candidate_count for Vertex models (#11729 ) - Description: added support for `candidate_count` parameter on Vertex	11 months ago
David	9d200e6cbe	Create ChatEverlyAI (#11357 ) - Description: Adds the ChatEverlyAI class with llama-2 7b on [EverlyAI Hosted Endpoints](https://everlyai.xyz/) - It inherits from ChatOpenAI and requires openai (probably unnecessary but it made for a quick and easy implementation) --------- Co-authored-by: everly-studio <127131037+everly-studio@users.noreply.github.com>	11 months ago
Hristo G	7fb25b4154	Add graceful fallback for ES vectorstore when content field is missing (#11726 ) - Description: - If the Elasticsearch field used for Langchain > Document.page_content is missing because the specific document is somehow malformed fail gracefully. - Tag maintainer: - @joemcelroy	11 months ago
Bagatur	f06fcde0d7	rm duplicate zilliz import (#11777 )	11 months ago
Bagatur	a3330c4258	bump 314 (#11773 )	11 months ago
Erick Friis	1861cc7100	General anthropic functions, steps towards experimental integration tests (#11727 ) To match change in js here https://github.com/langchain-ai/langchainjs/pull/2892 Some integration tests need a bit more work in experimental: ![Screenshot 2023-10-12 at 12 02 49 PM](https://github.com/langchain-ai/langchain/assets/9557659/262d7d22-c405-40e9-afef-669e8d585307) Pretty sure the sqldatabase ones are an actual regression or change in interface because it's returning a placeholder. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	11 months ago
Nuno Campos	17c69678ab	Revert "New add Baichuan Model" (#11761 ) Reverts langchain-ai/langchain#11714 This has linting and formatting issues, plus it's added to chat models folder but doesn't subclass Chat Model base class	11 months ago
cloudscool	56653c53aa	New add Baichuan Model (#11714 ) Motivation and Context At present, the Baichuan Large Language Model is relatively popular and efficient in performance. Due to widespread market recognition, this model has been added to enhance the scalability of Langchain's ability to access the big language model, so as to facilitate application access and usage for interested users. System Info langchain： 0.0.295 python：3.8.3 IDE：vs code Description Add the following files: 1. Add baichuan_baichuaninc_endpoint.py in the libs/langchain/langchain/chat_models 2. Modify the __init__.py file,which is located in the libs/langchain/langchain/chat_models/__init__.py： a. Add "from langchain.chat_models.baichuan_baichuaninc_endpoint import BaichuanChatEndpoint" b. Add "BaichuanChatEndpoint" In the file's __ All__ method Your contribution I am willing to help implement this feature and submit a PR, but I would appreciate guidance from the maintainers or community to ensure the changes are made correctly and in line with the project's standards and practices.	12 months ago
Yang, Bo	9e1e0f54d2	Add `TrainableLLM` (#11721 ) - Description: Add `TrainableLLM` for those LLM support fine-tuning - Tag maintainer: @hwchase17 This PR add training methods to `GradientLLM` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Burak Yılmaz	63e516c2b0	Upstash redis integration (#10871 ) - Description: Introduced Upstash provider with following wrappers: UpstashRedisCache, UpstashRedisEntityStore, UpstashRedisChatMessageHistory, UpstashRedisStore - Issue: -, - Dependencies: upstash-redis python package is needed, - Tag maintainer: @baskaryan - Twitter handle: @BurakY744 --------- Co-authored-by: Burak Yılmaz <burakyilmaz@Buraks-MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Bagatur	a9db2b0b92	fix tongyi import (#11745 )	12 months ago
Aaron Pham	6c61315067	fix(openllm): update with newer remote client implementation (#11740 ) cc @baskaryan --------- Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	12 months ago
Richy Wang	11cdfe44af	Implement Alibaba Tongyi chat model apis. (#10922 ) Hi there This PR is aim to implement chat model for Alibaba Tongyi LLM model. It contains work below: 1.Implement ChatTongyi chat model in langchain.chat_models.tongyi. Note this is different with tongyi llm model to another PR https://github.com/langchain-ai/langchain/pull/10878. For detail it implements _generate() and _stream() function in ChatTongyi. 2. Add some examples in chat/tongyi.ipynb. 3. Add integration test in chat_models/test_tongyi.py Note async completion for the Text API is not yet supported. Dependencies: dashscope. It will be installed manually cause it is not need by everyone.	12 months ago
Adam Demjen	008348ce71	Add ElasticsearchChatMessageHistory (#10932 ) Description This PR adds the `ElasticsearchChatMessageHistory` implementation that stores chat message history in the configured [Elasticsearch](https://www.elastic.co/elasticsearch/) deployment. ```python from langchain.memory.chat_message_histories import ElasticsearchChatMessageHistory history = ElasticsearchChatMessageHistory( es_url="https://my-elasticsearch-deployment-url:9200", index="chat-history-index", session_id="123" ) history.add_ai_message("This is me, the AI") history.add_user_message("This is me, the human") ``` Dependencies - [elasticsearch client](https://elasticsearch-py.readthedocs.io/) required Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Jonathan Soma	48cf978391	Allow placeholders in OpenAPI endpoints #2938 (#2940 ) Use regex matches when checking endpoints instead of exact matches. `{varname}` becomes `.*` Fixes #2938 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Predrag Gruevski	9e32120cbb	Deprecate direct access to globals like `debug` and `verbose`. (#11311 ) Instead of accessing `langchain.debug`, `langchain.verbose`, or `langchain.llm_cache`, please use the new getter/setter functions in `langchain.globals`: - `langchain.globals.set_debug()` and `langchain.globals.get_debug()` - `langchain.globals.set_verbose()` and `langchain.globals.get_verbose()` - `langchain.globals.set_llm_cache()` and `langchain.globals.get_llm_cache()` Using the old globals directly will now raise a warning. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	12 months ago
Richard Adams	35965df20d	Rspace doc loader (#11511 ) Description: Add a document loader for the RSpace Electronic Lab Notebook (www.researchspace.com), so that scientific documents and research notes can be easily pulled into Langchain pipelines. Issue This is an new contribution, rather than an issue fix. Dependencies: There are no new required dependencies. In order to use the loader, clients will need to install rspace_client SDK using `pip install rspace_client` --------- Co-authored-by: richarda23 <richard.c.adams@infinityworks.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Ryan Zotti	9d1867c77f	Update docs to specify Indexing-API-compatible vectorstores (#11581 ) Description: Update Indexing API docs to specify vectorstores that are compatible with the Indexing API. I add a unit test to remind developers to update the documentation whenever they add or change a vectorstore in a way that affects compatibility. For the unit test I repurposed existing code from [here](https://github.com/langchain-ai/langchain/blob/v0.0.311/libs/langchain/langchain/indexes/_api.py#L245-L257). This is my first PR to an open source project. This is a trivially simple PR whose main purpose is to make me more comfortable submitting Langchain PRs. If this PR goes through I plan to submit PRs with more substantive changes in the near future. Issue: Resolves [10482](https://github.com/langchain-ai/langchain/discussions/10482). Dependencies: No new dependencies. Twitter handle: None.	12 months ago
Richard Wang	6402c33299	Let Notion document loader support utf-8 and make it default. (#10613 ) Use utf-8 encoding by default	12 months ago
Bagatur	bd74eba152	add azure openai sched tests (#11723 )	12 months ago
Bagatur	9c0584be74	bump 313 (#11718 )	12 months ago
sudranga	361f8e1bc6	Add MMR functionality to elasticsearch retriever (#11633 ) Allows MMR functionality only for the case where we have access to the embedding function. Also allows for users to request for fields from elasticsearch store. These are added to the document metadata. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Dmitry Tyumentsev	ead9d5b55c	Add yandex stt parser (#11435 ) Description: Introducing an ability to load a transcription document of audio file using [Yandex SpeechKit](https://cloud.yandex.com/en-ru/services/speechkit) Issue: None Dependencies: yandex-speechkit Tag maintainer: @rlancemartin, @eyurtsev	12 months ago
Janos Tolgyesi	15687a28d5	Use correct tokenizer for Bedrock/Anthropic LLMs (#11561 ) Description This PR implements the usage of the correct tokenizer in Bedrock LLMs, if using anthropic models. Issue: #11560 Dependencies: optional dependency on `anthropic` python library. Twitter handle: jtolgyesi --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
kYLe	467b082c34	Modify Anyscale integration to work with Anyscale Endpoint (#11569 ) Description: Modify Anyscale integration to work with [Anyscale Endpoint](https://docs.endpoints.anyscale.com/) and it supports invoke, async invoke, stream and async invoke features --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
plpycoin	51193309ea	Update readthedocs.py (#11110 ) Only parse .html files .svg .png favicon.ico will crash processing phase --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Nuno Campos	ca9de26f2b	Add callback function to RunnablePassthrough (#11564 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	7f4734c0dd	Add deploy command to repos generated by cli template (#11711 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	1c0857b53e	Fix default impl of aparse_result (#11702 ) Should delegate to parse_result, not to aparse, as parse_result is a method that some output parsers override <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
nuric	44da27c07b	Add SemaDB VST wrapper (#11484 ) - Description: Adding vectorstore wrapper for [SemaDB](https://rapidapi.com/semafind-semadb/api/semadb). - Issue: None - Dependencies: None - Twitter handle: semafind Checks performed: - [x] `make format` - [x] `make lint` - [x] `make test` - [x] `make spell_check` - [x] `make docs_build` Documentation added: - SemaDB vectorstore wrapper tutorial	12 months ago
hsuyuming	0b743f005b	Feature/enhance huggingfacepipeline to handle different return type (#11394 ) Description: Avoid huggingfacepipeline to truncate the response if user setup return_full_text as False within huggingface pipeline. Dependencies: : None Tag maintainer: Maybe @sam-h-bean ? --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Leonid Kuligin	2aba9ab47e	Retriever based on GCP DocAI Warehouse (#11400 ) - Description: implements a retriever on top of DocAI Warehouse (to interact with existing enterprise documents) https://cloud.google.com/document-ai-warehouse?hl=en - Issue: new functionality @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Erick Friis	a477ddda45	Langsmith in readme update (#11497 )	12 months ago
Leonid Kuligin	9e81ab47be	Added a better error description if processor name is wrong. (#11488 ) Replace this entire comment with: - Description: added a better error description for this error - Issue: #11407 @baskaryan	12 months ago
Robert Yi	e75766b759	fix: incorrect arguments in clickhouse docstring (#11693 ) fix docstring for clickhouse	12 months ago
Eugene Yurtsev	17b5090c18	Add `type` to Agent actions (#11682 ) Add `type` to agent actions.	12 months ago
April	c14a8df2ee	wrap confluence attachment processing with a try-except block (#11503 ) Prevents document loading from erroring out when an attachment is not found at the url. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
eajechiloae	4ba2c8ba75	Fix ClearML callback (#11472 ) Handle different field names in dicts/dataframes, fixing the ClearML callback. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Lawrence Wu	93bb19f69a	Fix chains/loading.py error messages (#11688 ) - Description: make the error messages consistent in chains/loading.py - Dependencies: None	12 months ago
Harrison Chase	18ebce2032	fix tool async (#11689 )	12 months ago
sudranga	9beb03e771	11474 (#11519 ) No relevant documents may be found for a given question. In some use cases, we could directly respond with a fixed message instead of doing an LLM call with an empty context. This PR exposes this as an option: response_if_no_docs_found. --------- Co-authored-by: Sudharsan Rangarajan <sudranga@nile-global.com>	12 months ago
Joaquin Menendez	ef99b06362	feature: add metadata information into the embedding file before uplo… (#11553 ) Replace this entire comment with: - Description: In this modified version of the function, if the metadatas parameter is not None, the function includes the corresponding metadata in the JSON object for each text. This allows the metadata to be stored alongside the text's embedding in the vector store. - - Issue: #10924 - Dependencies: None - Tag maintainer: @hwchase17 @agola11 - Twitter handle: @MelliJoaco --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Marcin Wątroba	51a3a86022	#11655 Add SQLAlchemyMd5Cache implementation (#11660 ) - Description: Add SQLAlchemyMd5Cache implementation, - Issue: the issue # #11655, - Dependencies: no deps, - Tag maintainer: @markowanga --------- Co-authored-by: Marcin Wątroba <marcin.watroba@pwr.edu.pl> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Suresh Kumar Ponnusamy	70f7558db2	langchain-experimental: Add allow_list support in experimental/data_anonymizer (#11597 ) - Description: Add allow_list support in langchain experimental data-anonymizer package - Issue: no - Dependencies: no - Tag maintainer: @hwchase17 - Twitter handle:	12 months ago
wemysschen	2363c02cf3	Bos loader (#11525 ) Description: Add BaiduCloud BOS document loader. --------- Co-authored-by: chenweixu01 <chenweixu01@baidu.com> Co-authored-by: root <root@icoding-cwx.bcc-szzj.baidu.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Kwanghoon Choi	fbb82608cd	Fixed a bug in reporting Python code validation (#11522 ) - Description: fixed a bug in pal-chain when it reports Python code validation errors. When node.func does not have any ids, the original code tried to print node.func.id in raising ValueError. - Issue: n/a, - Dependencies: no dependencies, - Tag maintainer: @hazzel-cn, @eyurtsev - Twitter handle: @lazyswamp --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Harrison Chase	9f39c23a13	add input type for convo retrieval chain (#11679 )	12 months ago
zhaozhiming	d5e762d328	fix: Change the docs of JSONAgentOutputParser (#11594 ) I am merely making some minor adjustments to the function documentation. I hope to provide a small assistance to LangChain. - Description: Change the docs of JSONAgentOutputParser. It will be `JSON` better, - Issue: no, - Dependencies: no, - Tag maintainer: @hwchase17, - Twitter handle: Not worth mentioning.	12 months ago
Vinay Kakade	dd0cd98861	Add support for ChatOpenAI models in Infino callback handler (#11608 ) Description: This PR adds support for ChatOpenAI models in the Infino callback handler. In particular, this PR implements `on_chat_model_start` callback, so that ChatOpenAI models are supported. With this change, Infino callback handler can be used to track latency, errors, and prompt tokens for ChatOpenAI models too (in addition to the support for OpenAI and other non-chat models it has today). The existing example notebook is updated to show how to use this integration as well. cc/ @naman-modi @savannahar68 Issue: https://github.com/langchain-ai/langchain/issues/11607 Dependencies: None Tag maintainer: @hwchase17 Twitter handle: [@vkakade](https://twitter.com/vkakade)	12 months ago
Israel Ekpo	d0603c86b6	Add Support for Azure Cosmos DB MongoDB vCore Vector Store #11627 (#11632 ) This PR adds support for the Azure Cosmos DB MongoDB vCore Vector Store https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/ https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/vector-search Summary: - Description: added vector store integration for Azure Cosmos DB MongoDB vCore Vector Store, - Issue: the issue # it fixes #11627, - Dependencies: pymongo dependency, - Tag maintainer: @hwchase17, - Twitter handle: @izzyacademy --------- Co-authored-by: Israel Ekpo <israel.ekpo@gmail.com> Co-authored-by: Israel Ekpo <44282278+izzyacademy@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Erick Friis	28ee6a7c12	Track ChatFireworks time to first_token (#11672 )	12 months ago
Eugene Yurtsev	539941281d	Fix output types for BaseChatModel (#11670 ) * Should use non chunked messages for Invoke/Batch * After this PR, stream output type is not represented, do we want to use the union? --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
Eugene Yurtsev	99adcdb1c9	Add dedicated `type` attribute to be used solely for serialization purposes (#11585 ) Adds standard `type` field for all messages that will be serialized/validated by pydantic. * The presence of `type` makes it easier for developers consuming schemas to write client code to serialize/deserialize. * In LangServe `type` will be used for both validation and will appear in the generated openapi specs	12 months ago
eryk-dsai	06d5971be9	Fix issue #10985 - Skip model.to(device) if it is instantiated with bitsandbytes config (#11009 ) Preventing error caused by attempting to move the model that was already loaded on the GPU using the Accelerate module to the same or another device. It is not possible to load model with Accelerate/PEFT to CPU for now Addresses: [#10985](https://github.com/langchain-ai/langchain/issues/10985)	12 months ago
Nuno Campos	64969bc8ae	Add patch_config(configurable=) arg, make with_config(configurable=) merge it with existing (#11662 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Harrison Chase	ce0019b646	make utils conditional (#11646 )	12 months ago
Harrison Chase	8f06085b24	make tools conditional (#11647 )	12 months ago
Bassem Yacoube	5451b724fc	Adds support for llama2 and fixes MPT-7b url (#11465 ) - Description: This is an update to OctoAI LLM provider that adds support for llama2 endpoints hosted on OctoAI and updates MPT-7b url with the current one. @baskaryan Thanks! --------- Co-authored-by: ML Wiz <bassemgeorgi@gmail.com>	12 months ago
Todd Kerpelman	0bff399af1	Make metadata from the url_selenium loader match that of the web_base loader (#11617 ) Description: I noticed the metadata returned by the url_selenium loader was missing several values included by the web_base loader. (The former returned `{source: ...}`, the latter returned `{source: ..., title: ..., description: ..., language: ...}`.) This change fixes it so both loaders return all 4 key value pairs. Files have been properly formatted and all tests are passing. Note, however, that I am not much of a python expert, so that whole "Adding the imports inside the code so that tests pass" thing seems weird to me. Please LMK if I did anything wrong.	12 months ago
Tarun Thotakura	c9d4d53545	Fixed the assignment of custom_llm_provider argument (#11628 ) - Description: Assigning the custom_llm_provider to the default params function so that it will be passed to the litellm - Issue: Even though the custom_llm_provider argument is being defined it's not being assigned anywhere in the code and hence its not being passed to litellm, therefore any litellm call which uses the custom_llm_provider as required parameter is being failed. This parameter is mainly used by litellm when we are doing inference via Custom API server. https://docs.litellm.ai/docs/providers/custom_openai_proxy - Dependencies: No dependencies are required @krrishdholakia , @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Leonid Ganeline	db67ccb0bb	docstrings cleanup (#11640 ) Added missed docstrings. Some reformatting.	12 months ago
Yang, Bo	3a82bd7bdb	Use raise from statement so that users can find detailed error message (#11461 ) - Description: Use `raise from` statement so that users can find detailed error message - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17	12 months ago
Nuno Campos	9a0ed75a95	Add configurable fields with options (#11601 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Bagatur	7232e082de	bump 312 (#11621 )	12 months ago
Eugene Yurtsev	58220cda72	Remove LLM Bash and related bash utilities (#11619 ) Deprecate LLMBash and related bash utilities	12 months ago
Shubham Kushwaha	49de862076	Arcee.ai LLM & Retriever integration (#11579 ) - Description: This PR introduces a new LLM and Retriever API to https://arcee.ai for the python client - Issue: implements the integrations as requested in #11578 , - Dependencies: no dependencies are required, - Tag maintainer: @hwchase17 - Twitter handle: shwooobham ✅ `make format`, `make lint` and `make test` runs locally. ```shell =========== 1245 passed, 277 skipped, 20 warnings in 16.26s =========== ./scripts/check_pydantic.sh . ./scripts/check_imports.sh poetry run ruff . [ "." = "" ] \|\| poetry run black . --check All done! ✨ 🍰 ✨ 1818 files would be left unchanged. [ "." = "" ] \|\| poetry run mypy . Success: no issues found in 1815 source files [ "." = "" ] \|\| poetry run black . All done! ✨ 🍰 ✨ 1818 files left unchanged. [ "." = "" ] \|\| poetry run ruff --select I --fix . poetry run codespell --toml pyproject.toml poetry run codespell --toml pyproject.toml -w ``` Contributions 1. Arcee (langchain/llms), ArceeRetriever (langchain/retrievers), ArceeWrapper (langchain/utilities) 2. docs for Arcee (llms/arcee.py) and ArceeRetriever(retrievers/arcee.py) 3. cc: @jacobsolawetz @ben-epstein --------- Co-authored-by: Shubham <shubham@sORo.local>	12 months ago
Eugene Yurtsev	b56ca0c2a4	Deprecate LLMSymbolicMath from langchain core (#11615 ) Deprecate LLMSymbolicMath from langchain core package.	12 months ago
Eugene Yurtsev	c9bce5bbfb	Add version to langchain_experimental (#11613 ) Add version to langchain experimental	12 months ago
Predrag Gruevski	22abeb9f6c	Disable loading jinja2 `PromptTemplate` from file. (#10252 ) jinja2 templates are not sandboxed and are at risk for arbitrary code execution. To mitigate this risk: - We no longer support loading jinja2-formatted prompt template files. - `PromptTemplate` with jinja2 may still be constructed manually, but the class carries a security warning reminding the user to not pass untrusted input into it. Resolves #4394.	12 months ago
Nuno Campos	c7c03d4709	Fix mutation bugs in callback manager configure (#11603 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
cccs-eric	e2a9072b80	Fix CohereRerank configuration (#11583 ) Description: CohereRerank is missing `cohere_api_key` as a field and since extras are forbidden, it is not possible to pass-in the key. The only way is to use an env variable named `COHERE_API_KEY`. For example, if trying to create a compressor like this: ```python cohere_api_key = "......Cohere api key......" compressor = CohereRerank(cohere_api_key=cohere_api_key) ``` you will get the following error: ``` File "/langchain/.venv/lib/python3.10/site-packages/pydantic/v1/main.py", line 341, in __init__ raise validation_error pydantic.v1.error_wrappers.ValidationError: 1 validation error for CohereRerank cohere_api_key extra fields not permitted (type=value_error.extra) ```	12 months ago
Anar	55fef4b64b	implemented add files method in LLMRails (#11518 ) This PR provides add files method with LLMRails. Implemented here are: docs/extras/integrations/vectorstores/llm-rails.ipynb --------- Co-authored-by: Anar Aliyev <aaliyev@mgmt.cloudnet.services>	12 months ago
Stephen Hankinson	316dddc7cd	fix wording of query_sql_database_tool_description (#11530 ) - Description: Fixes minor typo for the query_sql_database_tool_description in the db toolkit - Issue: N/A - Dependencies: N/A - Tag maintainer: @nfcampos - Twitter handle: N/A	12 months ago
Ash Vardanian	1acfe86353	Accelerating Math Utils with SimSIMD (#11566 ) LangChain relies on NumPy to compute cosine distances, which becomes a bottleneck with the growing dimensionality and number of embeddings. To avoid this bottleneck, in our libraries at [Unum](https://github.com/unum-cloud), we have created a specialized package - [SimSIMD](https://github.com/ashvardanian/simsimd), that knows how to use newer hardware capabilities. Compared to SciPy and NumPy, it reaches 3x-200x performance for various data types. Since publication, several LangChain users have asked me if I can integrate it into LangChain to accelerate their workflows, so here I am 🤗 ## Benchmarking To conduct benchmarks locally, run this in your Jupyter: ```py import numpy as np import scipy as sp import simsimd as simd import timeit as tt def cosine_similarity_np(X: np.ndarray, Y: np.ndarray) -> np.ndarray: X_norm = np.linalg.norm(X, axis=1) Y_norm = np.linalg.norm(Y, axis=1) with np.errstate(divide="ignore", invalid="ignore"): similarity = np.dot(X, Y.T) / np.outer(X_norm, Y_norm) similarity[np.isnan(similarity) \| np.isinf(similarity)] = 0.0 return similarity def cosine_similarity_sp(X: np.ndarray, Y: np.ndarray) -> np.ndarray: return 1 - sp.spatial.distance.cdist(X, Y, metric='cosine') def cosine_similarity_simd(X: np.ndarray, Y: np.ndarray) -> np.ndarray: return 1 - simd.cdist(X, Y, metric='cosine') X = np.random.randn(1, 1536).astype(np.float32) Y = np.random.randn(1, 1536).astype(np.float32) repeat = 1000 print("NumPy: {:,.0f} ops/s, SciPy: {:,.0f} ops/s, SimSIMD: {:,.0f} ops/s".format( repeat / tt.timeit(lambda: cosine_similarity_np(X, Y), number=repeat), repeat / tt.timeit(lambda: cosine_similarity_sp(X, Y), number=repeat), repeat / tt.timeit(lambda: cosine_similarity_simd(X, Y), number=repeat), )) ``` ## Results I ran this on an M2 Pro Macbook for various data types and different number of rows in `X` and reformatted the results as a table for readability: \| Data Type \| NumPy \| SciPy \| SimSIMD \| \| :--- \| ---: \| ---: \| ---: \| \| `f32, 1` \| 59,114 ops/s \| 80,330 ops/s \| 475,351 ops/s \| \| `f16, 1` \| 32,880 ops/s \| 82,420 ops/s \| 650,177 ops/s \| \| `i8, 1` \| 47,916 ops/s \| 115,084 ops/s \| 866,958 ops/s \| \| `f32, 10` \| 40,135 ops/s \| 24,305 ops/s \| 185,373 ops/s \| \| `f16, 10` \| 7,041 ops/s \| 17,596 ops/s \| 192,058 ops/s \| \| `f16, 10` \| 21,989 ops/s \| 25,064 ops/s \| 619,131 ops/s \| \| `f32, 100` \| 3,536 ops/s \| 3,094 ops/s \| 24,206 ops/s \| \| `f16, 100` \| 900 ops/s \| 2,014 ops/s \| 23,364 ops/s \| \| `i8, 100` \| 5,510 ops/s \| 3,214 ops/s \| 143,922 ops/s \| It's important to note that SimSIMD will underperform if both matrices are huge. That, however, seems to be an uncommon usage pattern for LangChain users. You can find a much more detailed performance report for different hardware models here: - [Apple M2 Pro](https://ashvardanian.com/posts/simsimd-faster-scipy/#appendix-1-performance-on-apple-m2-pro). - [4th Gen Intel Xeon Platinum](https://ashvardanian.com/posts/simsimd-faster-scipy/#appendix-2-performance-on-4th-gen-intel-xeon-platinum-8480). - [AWS Graviton 3](https://ashvardanian.com/posts/simsimd-faster-scipy/#appendix-3-performance-on-aws-graviton-3). ## Additional Notes 1. Previous version used `X = np.array(X)`, to repackage lists of lists. It's an anti-pattern, as it will use double-precision floating-point numbers, which are slow on both CPUs and GPUs. I have replaced it with `X = np.array(X, dtype=np.float32)`, but a more selective approach should be discussed. 2. In numerical computations, it's recommended to explicitly define tolerance levels, which were previously avoided in `np.allclose(expected, actual)` calls. For now, I've set absolute tolerance to distance computation errors as 0.01: `np.allclose(expected, actual, atol=1e-2)`. --- - Dependencies: adds `simsimd` dependency - Tag maintainer: @hwchase17 - Twitter handle: @ashvardanian --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
benchello	5de64e6d60	Add option to specify metadata columns in CSV loader (#11576 ) #### Description This PR adds the option to specify additional metadata columns in the CSVLoader beyond just `Source`. The current CSV loader includes all columns in `page_content` and if we want to have columns specified for `page_content` and `metadata` we have to do something like the below.: ``` csv = pd.read_csv( "path_to_csv" ).to_dict("records") documents = [ Document( page_content=doc["content"], metadata={ "last_modified_by": doc["last_modified_by"], "point_of_contact": doc["point_of_contact"], } ) for doc in csv ] ``` #### Usage Example Usage: ``` csv_test = CSVLoader( file_path="path_to_csv", metadata_columns=["last_modified_by", "point_of_contact"] ) ``` Example CSV: ``` content, last_modified_by, point_of_contact "hello world", "Person A", "Person B" ``` Example Result: ``` Document { page_content: "hello world" metadata: { row: '0', source: 'path_to_csv', last_modified_by: 'Person A', point_of_contact: 'Person B', } ``` --------- Co-authored-by: Ben Chello <bchello@dropbox.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Stephen Hankinson	447a523662	fix comments in output format (#11536 ) - Description: Fixes the comments in the ConvoOutputParser. Because the \\\\ is escaping a single \\, they render something like: `"action_input": string \ The input to the action` in the prompt. Changing this to \\\\\\\\ lets it escape two slashes so that it renders a proper comment: `"action_input": string \\ The input to the action` - Issue: N/A - Dependencies: - Tag maintainer: @hwchase17 - Twitter handle:	12 months ago
Michael Landis	8e45f720a8	feat: add momento vector index as a vector store provider (#11567 ) Description: - Added Momento Vector Index (MVI) as a vector store provider. This includes an implementation with docstrings, integration tests, a notebook, and documentation on the docs pages. - Updated the Momento dependency in pyproject.toml and the lock file to enable access to MVI. - Refactored the Momento cache and chat history session store to prefer using "MOMENTO_API_KEY" over "MOMENTO_AUTH_TOKEN" for consistency with MVI. This change is backwards compatible with the previous "auth_token" variable usage. Updated the code and tests accordingly. Dependencies: - Updated Momento dependency in pyproject.toml. Testing: - Run the integration tests with a Momento API key. Get one at the [Momento Console](https://console.gomomento.com) for free. MVI is available in AWS us-west-2 with a superuser key. - `MOMENTO_API_KEY=<your key> poetry run pytest tests/integration_tests/vectorstores/test_momento_vector_index.py` Tag maintainer: @eyurtsev Twitter handle: Please mention @momentohq for this addition to langchain. With the integration of Momento Vector Index, Momento caching, and session store, Momento provides serverless support for the core langchain data needs. Also mention @mlonml for the integration.	12 months ago
Eugene Yurtsev	ca2eed36b7	LangChain cli fix a few bugs (#11573 ) Code was assuming that `git` and `poetry` exist. In addition, it was not ignoring pycache files that get generated during run time	12 months ago
Hugues Chocart	258ae1ba5f	[LLMonitor Callback Handler]: Add error handling (#11563 ) Wraps every callback handler method in error handlers to avoid breaking users' programs when an error occurs inside the handler. Thanks @valdo99 for the suggestion 🙂	12 months ago
Eugene Yurtsev	2aabfafe1e	Module documentation for langchain runnables (#11550 ) Add in code documentation for langchain runnables module.	12 months ago
Eugene Yurtsev	d8fa94e6fa	RunnablePassthrough: In code documentation (#11552 ) Add in code documentation for a runnable passthrough	12 months ago
Eugene Yurtsev	b42f218cfc	RunnableLambda: Add in code docs (#11521 ) Add in code docs for Runnable Lambda	12 months ago
maks-operlejn-ds	f64522fbaf	Reset deanonymizer mapping (#11559 ) @hwchase17 @baskaryan	12 months ago
maks-operlejn-ds	b14b65d62a	Support all presidio entities (#11558 ) https://microsoft.github.io/presidio/supported_entities/ @baskaryan @hwchase17	12 months ago
maks-operlejn-ds	4d62def9ff	Better deanonymizer matching strategy (#11557 ) @baskaryan, @hwchase17	12 months ago
Ash Vardanian	a992b9670d	Fix: Missing DuckDuckGo package version (#11535 ) [The `duckduckgo-search` v3.9.2 was removed from PyPi](https://pypi.org/project/duckduckgo-search/#history). That breaks the build. - Description: refreshes the Poetry dependency to v3.9.3 - Tag maintainer: @baskaryan - Twitter handle: @ashvardanian	12 months ago
Bagatur	8932ed3f07	bump 311 (#11555 )	12 months ago
Bagatur	e7a0def1bc	QoL improvements to query constructor (#11504 ) updating query constructor and self query retriever to - make it easier to pass in examples - validate attributes used in query - remove invalid parts of query - make it easier to get + edit prompt - make query constructor a runnable - make self query retriever use as runnable	12 months ago
Taikono-Himazin	eec53fa294	Added autodetect_encoding option to csvLoader (#11327 )	12 months ago
Holt Skinner	09c66fe04f	feat: Update Google Document AI Parser (#11413 ) - Description: Code Refactoring, Documentation Improvements for Google Document AI PDF Parser - Adds Online (synchronous) processing option. - Adds default field mask to limit payload size. - Skips Human review by default. - Issue: Fixes #10589 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
Nuno Campos	628cc4cce8	Rename RunnableMap to RunnableParallel (#11487 ) - keep alias for RunnableMap - update docs to use RunnableParallel and RunnablePassthrough.assign <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Eugene Yurtsev	6a10e8ef31	Add documentation to Runnable (#11516 )	12 months ago
William FH	eb572f41a6	Add LangSmith Run Chat Loader (#11458 )	12 months ago
David Duong	484947c492	Fetch up-to-date attributes for env-pulled kwargs during serialisation of OpenAI classes (#11499 )	12 months ago
Bagatur	5470e730d2	raise openapi import error (#11495 )	12 months ago
Erick Friis	29f5f70415	Rename some last hwchase17/langchain links (#11494 )	12 months ago
Fabrice Pont	872836c541	feat: add markdown list parser (#11411 ) Description: add `MarkdownListOutputParser` as a new `ListOutputParser` Issue: #11410	12 months ago
Erick Friis	8f50b616c5	Remove optional from vectara source (#11493 ) fyi @ofermend --------- Co-authored-by: Ofer Mendelevitch <ofer@vectara.com> Co-authored-by: Ofer Mendelevitch <ofermend@gmail.com>	12 months ago
Bagatur	53887242a1	bump 310 (#11486 )	12 months ago
Jesús Vélez Santiago	a1c7532298	Add async sql record manager and async indexing API (#10726 ) - Description: Add support for a SQLRecordManager in async environments. It includes the creation of `RecorManagerAsync` abstract class. - Issue: None - Dependencies: Optional `aiosqlite`. - Tag maintainer: @nfcampos - Twitter handle: @jvelezmagic --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Qihui Xie	57ade13b2b	fix llm_inputs duplication problem in intermediate_steps in SQLDatabaseChain (#10279 ) Use `.copy()` to fix the bug that the first `llm_inputs` element is overwritten by the second `llm_inputs` element in `intermediate_steps`. *Problem description:* In [line 127]( `c732d8fffd/libs/experimental/langchain_experimental/sql/base.py (L127C17-L127C17)`), the `llm_inputs` of the sql generation step is appended as the first element of `intermediate_steps`: ``` intermediate_steps.append(llm_inputs) # input: sql generation ``` However, `llm_inputs` is a mutable dict, it is updated in [line 179](https://github.com/langchain-ai/langchain/blob/master/libs/experimental/langchain_experimental/sql/base.py#L179) for the final answer step: ``` llm_inputs["input"] = input_text ``` Then, the updated `llm_inputs` is appended as another element of `intermediate_steps` in [line 180](`c732d8fffd/libs/experimental/langchain_experimental/sql/base.py (L180)`): ``` intermediate_steps.append(llm_inputs) # input: final answer ``` As a result, the final `intermediate_steps` returned in [line 189](`c732d8fffd/libs/experimental/langchain_experimental/sql/base.py (L189C43-L189C43)`) actually contains two same `llm_inputs` elements, i.e., the `llm_inputs` for the sql generation step overwritten by the one for final answer step by mistake. Users are not able to get the actual `llm_inputs` for the sql generation step from `intermediate_steps` Simply calling `.copy()` when appending `llm_inputs` to `intermediate_steps` can solve this problem.	12 months ago
Florian	d78f418c0d	Extract abstracts from Pubmed articles, even if they have no extra label (#10245 ) ### Description This pull request involves modifications to the extraction method for abstracts/summaries within the PubMed utility. A condition has been added to verify the presence of unlabeled abstracts. Now an abstract will be extracted even if it does not have a subtitle. In addition, the extraction of the abstract was extended to books. ### Issue The PubMed utility occasionally returns an empty result when extracting abstracts from articles, despite the presence of an abstract for the paper on PubMed. This issue arises due to the varying structure of articles; some articles follow a "subtitle/label: text" format, while others do not include subtitles in their abstracts. An example of the latter case can be found at: [https://pubmed.ncbi.nlm.nih.gov/37666905/](url) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Viktor Zhemchuzhnikov	fd9da60aea	Add async support to SelfQueryRetriever (#10175 ) ### Description SelfQueryRetriever is missing async support, so I am adding it. I also removed deprecated predict_and_parse method usage here, and added some tests. ### Issue N/A ### Tag maintainer Not yet ### Twitter handle N/A	12 months ago
Theron Tau	35297ca0d3	Add feature for extracting images from pdf and recognizing text from images. (#10653 ) Description It is for #10423 that it will be a useful feature if we can extract images from pdf and recognize text on them. I have implemented it with `PyPDFLoader`, `PyPDFium2Loader`, `PyPDFDirectoryLoader`, `PyMuPDFLoader`, `PDFMinerLoader`, and `PDFPlumberLoader`. [RapidOCR](https://github.com/RapidAI/RapidOCR.git) is used to recognize text on extracted images. It is time-consuming for ocr so a boolen parameter `extract_images` is set to control whether to extract and recognize. I have tested the time usage for each parser on my own laptop thinkbook 14+ with AMD R7-6800H by unit test and the result is: \| extract_images \| PyPDFParser \| PDFMinerParser \| PyMuPDFParser \| PyPDFium2Parser \| PDFPlumberParser \| \| ------------- \| ------------- \| ------------- \| ------------- \| ------------- \| ------------- \| \| False \| 0.27s \| 0.39s \| 0.06s \| 0.08s \| 1.01s \| \| True \| 17.01s \| 20.67s \| 20.32s \| 19,75s \| 20.55s \| Issue #10423 Dependencies rapidocr_onnxruntime in [RapidOCR](https://github.com/RapidAI/RapidOCR/tree/main) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Bagatur	8e3fbc97ca	Add vowpal_wabbit RL chain (#11462 )	12 months ago
Haris Wang	f1269830a0	Fix bug in MarkdownHeaderTextSplitter for codeblock (#10262 ) - Description: The previous version of the MarkdownHeaderTextSplitter did not take into account the possibility of '#' appearing within code blocks, which caused segmentation anomalies in these situations. This PR has fixed this issue. - Issue: - Dependencies: No - Tag maintainer: - Twitter handle: cc @baskaryan @eyurtsev @rlancemartin --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Eddie Cohen	656d2303f7	add in, nin for pinecone (#10303 ) Description: Adds the in and nin comparators for pinecone seen [here](https://docs.pinecone.io/docs/metadata-filtering#metadata-query-language) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Bagatur	a3a2ce623e	Revise vowpal_wabbit notebook	12 months ago
Bagatur	8fafa1af91	merge	12 months ago
olgavrou	3b07c0cf3d	RL Chain with VowpalWabbit (#10242 ) - Description: This PR adds a new chain `rl_chain.PickBest` for learned prompt variable injection, detailed description and usage can be found in the example notebook added. It essentially adds a [VowpalWabbit](https://github.com/VowpalWabbit/vowpal_wabbit) layer before the llm call in order to learn or personalize prompt variable selections. Most of the code is to make the API simple and provide lots of defaults and data wrangling that is needed to use Vowpal Wabbit, so that the user of the chain doesn't have to worry about it. - Dependencies: [vowpal-wabbit-next](https://pypi.org/project/vowpal-wabbit-next/), - sentence-transformers (already a dep) - numpy (already a dep) - tagging @ataymano who contributed to this chain - Tag maintainer: @baskaryan - Twitter handle: @olgavrou Added example notebook and unit tests	12 months ago
Manikanta5112	56048b909f	added ContentFormatter escape special characters for message content (#10319 ) --------- Co-authored-by: Manikanta5112 <42089393+mani5112@users.noreply.github.com>	12 months ago
Leonid Ganeline	d17416ec79	docstrings `callbacks` (#11456 ) Added missed docstrings to the `callbacks/` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
Ofer Mendelevitch	3c7653bf0f	"source" argument in constructor of Vectara (#11454 ) Replace this entire comment with: - Description: minor update to constructor to allow for specification of "source" - Tag maintainer: @baskaryan - Twitter handle: @ofermend	12 months ago
Eugene Yurtsev	d9018ae5f1	Improve CLI ux (#11452 ) Improve UX for cli	12 months ago
Jaikanth J	9f85f7c543	fix(cache): use dumps for RedisCache (#10408 ) # Description Attempts to fix RedisCache for ChatGenerations using `loads` and `dumps` used in SQLAlchemy cache by @hwchase17 . this is better than pickle dump, because this won't execute any arbitrary code during de-serialisation. # Issues #7722 & #8666 # Dependencies None, but removes the warning introduced in #8041 by @baskaryan Handle: @jaikanthjay46	12 months ago
rodrigo-clickup	5944c1851b	Add ClickUp Toolkit (#10662 ) - Description: Adds a toolkit to interact with the [ClickUp](https://clickup.com/) [Public API](https://clickup.com/api/) - Dependencies: None - Tag maintainer: @rodrigo-georgian, @rodrigo-clickup, @aiswaryasankarwork - Twitter handle: - Aiswarya (https://twitter.com/Aiswarya_Sankar, https://www.linkedin.com/in/sankaraiswarya/) - Rodrigo (https://www.linkedin.com/in/rodrigo-ceballos-lentini/) --------- Co-authored-by: Aiswarya Sankar <aiswaryasankar@Aiswaryas-MacBook-Pro.local> Co-authored-by: aiswaryasankarwork <143119412+aiswaryasankarwork@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
John Reynolds	68901e1e40	Update output_parser.py (#10430 ) - Description: Updated output parser for mrkl to remove any hallucination actions after the final answer; this was encountered when using Anthropic claude v2 for planning; reopening PR with updated unit tests - Issue: #10278 - Dependencies: N/A - Twitter handle: @johnreynolds	12 months ago
Joshua Sundance Bailey	790010703b	ArcGISLoader: Limit number of results in query (#10615 ) Description: this PR changes the `ArcGISLoader` to set `return_all_records` to `False` when `result_record_count` is provided as a keyword argument. Previously, `return_all_records` was `True` by default and this made the API ignore `result_record_count`. Issue: `ArcGISLoader` would ignore `result_record_count` unless user also passed `return_all_records=False`.	12 months ago
mrbean	9903a70379	Add youdotcom retriever (#11304 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
ashish-dahal	1655ff2ded	Fix PyMuPDFLoader kwargs (#11434 ) - Description: Fix the `PyMuPDFLoader` to accept `loader_kwargs` from the document loader's `loader_kwargs` option. This provides more flexibility in formatting the output from documents. - Issue: The `loader_kwargs` is not passed into the `load` method from the document loader, which limits configuration options. - Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Leonid Kuligin	e4a46747dc	integration test for DocAI parser (#11424 ) - Description: added an integration test - Issue: #11407 @baskaryan	12 months ago
Aashish Saini	2abbdc6ecb	Update bageldb.py (#11421 ) I have restructured the code to ensure uniform handling of ImportError. In place of previously used ValueError, I've adopted the standard practice of raising ImportError with explanatory messages. This modification enhances code readability and clarifies that any problems stem from module importation.	12 months ago
maks-operlejn-ds	2aae1102b0	Instance anonymization (#10501 ) ### Description Add instance anonymization - if `John Doe` will appear twice in the text, it will be treated as the same entity. The difference between `PresidioAnonymizer` and `PresidioReversibleAnonymizer` is that only the second one has a built-in memory, so it will remember anonymization mapping for multiple texts: ``` >>> anonymizer = PresidioAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Brett Russell. Hi Brett Russell!' ``` ``` >>> anonymizer = PresidioReversibleAnonymizer() >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' >>> anonymizer.anonymize("My name is John Doe. Hi John Doe!") 'My name is Noah Rhodes. Hi Noah Rhodes!' ``` ### Twitter handle @deepsense_ai / @MaksOpp ### Tag maintainer @baskaryan @hwchase17 @hinthornw --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Kyle Pancamo	203258b4d6	Update pdf.py comment for PyPDFLoader (#10495 ) PyPDF does not chunk at the character level to my understanding. Description: PyPDF does not chunk at the character level, but instead breaks up content by page. Fixup comment --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Juan Daza	4236ae3851	Added Streaming Capability to SageMaker LLMs (#10535 ) This PR adds the ability to declare a Streaming response in the SageMaker LLM by leveraging the `invoke_endpoint_with_response_stream` capability in `boto3`. It is heavily based on the AWS Blog Post announcement linked [here](https://aws.amazon.com/blogs/machine-learning/elevating-the-generative-ai-experience-introducing-streaming-support-in-amazon-sagemaker-hosting/). It does not add any additional dependencies since it uses the existing `boto3` version. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Laurentiu Piciu	d9670a5945	openai_functions_multi_agent: solved the case when the "arguments" is valid JSON but it does not contain `actions` key (#10543 ) Description: There are cases when the output from the LLM comes fine (i.e. function_call["arguments"] is a valid JSON object), but it does not contain the key "actions". So I split the validation in 2 steps: loading arguments as JSON and then checking for "actions" in it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Eugene Yurtsev	fcccde406d	Add SymbolicMathChain to experiment in preparation for deprecation (#11129 ) Move symbolic math chain to experimental	12 months ago
Holt Skinner	9f73fec057	fix: Update Google Cloud Enterprise Search to Vertex AI Search (#10513 ) - Description: Google Cloud Enterprise Search was renamed to Vertex AI Search - https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-search-and-conversation-is-now-generally-available - This PR updates the documentation and Retriever class to use the new terminology. - Changed retriever class from `GoogleCloudEnterpriseSearchRetriever` to `GoogleVertexAISearchRetriever` - Updated documentation to specify that `extractive_segments` requires the new [Enterprise edition](https://cloud.google.com/generative-ai-app-builder/docs/about-advanced-features#enterprise-features) to be enabled. - Fixed spelling errors in documentation. - Change parameter for Retriever from `search_engine_id` to `data_store_id` - When this retriever was originally implemented, there was no distinction between a data store and search engine, but now these have been split. - Fixed an issue blocking some users where the api_endpoint can't be set	12 months ago
Patrick Randell	1d678f805f	Additional Weaviate Filter Comparators (#10522 ) ### Description When using Weaviate Self-Retrievers, certain common filter comparators generated by user queries were unimplemented, resulting in errors. This PR implements some of them. All linting and format commands have been run and tests passed. ### Issue #10474 ### Dependencies timestamp module --------- Co-authored-by: Patrick Randell <prandell@deloitte.com.au>	12 months ago
Nuno Campos	79011f835f	Remove str() from RunnableConfigurableAlternatives (#11446 )	12 months ago
Harrison Chase	31d5bd84d7	make vectorstores optional (#11393 )	12 months ago
Eugene Yurtsev	8aa545901a	Update agent type docs (#11137 ) In code docs for agent types	12 months ago
Eugene Yurtsev	3e31d6e35f	Start deprecation of LLMBashChain (#11300 ) In preparation for migration LLMBashChain and related tools add a derprecation warning to the code.	12 months ago
Bagatur	8b6b8bf68c	bump 309 (#11443 )	12 months ago
billytrend-cohere	2ff91a46c0	Add cohere /chat integration (#11389 ) Add cohere /chat integration and an iPython notebook to demonstrate the addition. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
adrienohana	ca346011b7	added interactive login for azure cognitive search vector store (#11360 ) Description: Previously if the access to Azure Cognitive Search was not done via an API key, the default credential was called which doesn't allow to use an interactive login. I simply added the option to use "INTERACTIVE" as a key name, and this will launch a login window upon initialization of the AzureSearch object.	12 months ago
Eugene Yurtsev	5a1f614175	Add docker compose to CLI (#11406 ) Add docker compose to cli	12 months ago
Predrag Gruevski	e2d6c41177	Upgrade langchain dependencies. (#11420 ) I was hoping this would pick up numpy 1.26, which is required to support the new Python 3.12 release, but it didn't. It seems that some transitive dependency requirement on numpy is preventing that, and the highest we can currently go is 1.24.x. But to find this out required a 15min `poetry lock`, so I figured we might as well upgrade the dependencies we can and hopefully make the next dependency upgrade a bit smaller.	12 months ago
Jacob Lee	71fd6428c5	Remove overridden async not implemented method on embeddings filters and add default async implementation for document compressors (#11415 ) @nfcampos @eyurtsev @baskaryan --------- Co-authored-by: Nuno Campos <nuno@boringbits.io>	12 months ago
Nuno Campos	2f490be09b	Fix .dict() for agent/chain (#11436 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	1e59c44d36	Nc/5oct/runnable release (#11428 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Bagatur	58b7a3ba16	Rm bedrock anthropic error (#11403 )	12 months ago
Predrag Gruevski	c9986bc3a9	Tweak type hints to match dependency's behavior. (#11355 ) Needs #11353 to merge first, and a new `langchain` to be published with those changes.	12 months ago
William FH	940b9ae30a	Normalize Option in Scoring Chain (#11412 )	12 months ago
Eugene Yurtsev	70be04a816	CLI: Readme update (#11404 ) Consolidating to a single README for now, will be easier to maintain we can differentiate between poetry and pip later. Does not seem critical. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
Nuno Campos	fde19c8667	Add CLI command to create a new project (#7837 ) First version of CLI command to create a new langchain project template Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
mhwang-stripe	9cea796671	Make langchain compatible with SQLAlchemy<1.4.0 (#11390 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> ## Description Currently SQLAlchemy >=1.4.0 is a hard requirement. We are unable to run `from langchain.vectorstores import FAISS` with SQLAlchemy <1.4.0 due to top-level imports, even if we aren't even using parts of the library that use SQLAlchemy. See Testing section for repro. Let's make it so that langchain is still compatible with SQLAlchemy <1.4.0, especially if we aren't using parts of langchain that require it. The main conflict is that SQLAlchemy removed `declarative_base` from `sqlalchemy.ext.declarative` in 1.4.0 and moved it to `sqlalchemy.orm`. We can fix this by try-catching the import. This is the same fix as applied in https://github.com/langchain-ai/langchain/pull/883. (I see that there seems to be some refactoring going on about isolating dependencies, e.g. `c87e9fb2ce`, so if this issue will be eventually fixed by isolating imports in langchain.vectorstores that also works). ## Issue I can't find a matching issue. ## Dependencies No additional dependencies ## Maintainer @hwchase17 since you reviewed https://github.com/langchain-ai/langchain/pull/883 ## Testing I didn't add a test, but I manually tested this. 1. Current failure: ``` langchain==0.0.305 sqlalchemy==1.3.24 ``` ``` python python -i >>> from langchain.vectorstores import FAISS Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/pay/src/zoolander/vendor3/lib/python3.8/site-packages/langchain/vectorstores/__init__.py", line 58, in <module> from langchain.vectorstores.pgembedding import PGEmbedding File "/pay/src/zoolander/vendor3/lib/python3.8/site-packages/langchain/vectorstores/pgembedding.py", line 10, in <module> from sqlalchemy.orm import Session, declarative_base, relationship ImportError: cannot import name 'declarative_base' from 'sqlalchemy.orm' (/pay/src/zoolander/vendor3/lib/python3.8/site-packages/sqlalchemy/orm/__init__.py) ``` 2. This fix: ``` langchain==<this PR> sqlalchemy==1.3.24 ``` ``` python python -i >>> from langchain.vectorstores import FAISS <succeeds> ```	12 months ago
Nuno Campos	4d66756d93	Improve output of Runnable.astream_log() (#11391 ) - Make logs a dictionary keyed by run name (and counter for repeats) - Ensure no output shows up in lc_serializable format - Fix up repr for RunLog and RunLogPatch <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Lester Solbakken	a30f98f534	Add Vespa vector store (#11329 ) Addition of Vespa vector store integration including notebook showing its use. Maintainer: @lesters Twitter handle: LesterSolbakken	12 months ago
Nuno Campos	58a88f3911	Add optional input_types to prompt template (#11385 ) - default MessagesPlaceholder one to list of messages <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Tomaz Bratanic	71290315cf	Add optional Cypher validation tool (#11078 ) LLMs have trouble with consistently getting the relationship direction accurately. That's why I organized a competition how to best and most simple to fix it based on the existing schema as a post-processing step. https://github.com/tomasonjo/cypher-direction-competition I am adding the winner's code in this PR: https://github.com/sakusaku-rich/cypher-direction-competition	12 months ago
Bagatur	dd514c2781	bump 308 (#11383 )	12 months ago
Leonid Kuligin	4f4e0f38fc	a better error description when GCP project is not set (#11377 ) - Description: a little bit better error description - Issue: #10879	12 months ago
Nuno Campos	0d80226c64	Add _type to json functions output parser (#11381 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Bagatur	106608bc89	add default async (#11141 )	12 months ago
Nuno Campos	b0893c7c6a	Use an enum for configurable_alternatives to make the generated json schema nicer (#11350 )	12 months ago
Bagatur	b499de2926	Anthropic system message fix (#11301 ) Removes human prompt prefix before system message for anthropic models Bedrock anthropic api enforces that Human and Assistant messages must be interleaved (cannot have same type twice in a row). We currently treat System Messages as human messages when converting messages -> string prompt. Our validation when using Bedrock/BedrockChat raises an error when this happens. For ChatAnthropic we don't validate this so no error is raised, but perhaps the behavior is still suboptimal	12 months ago
Massimiliano Angelino	2f83350eac	Feat bedrock cohere support (#11230 ) Description: Added support for Cohere command model via Bedrock. With this change it is now possible to use the `cohere.command-text-v14` model via Bedrock API. About Streaming: Cohere model outputs 2 additional chunks at the end of the text being generated via streaming: a chunk containing the text `<EOS_TOKEN>`, and a chunk indicating the end of the stream. In this implementation I chose to ignore both chunks. An alternative solution could be to replace `<EOS_TOKEN>` with `\n` Tests: manually tested that the new model work with both `llm.generate()` and `llm.stream()`. Tested with `temperature`, `p` and `stop` parameters. Issue: #11181 Dependencies: No new dependencies Tag maintainer: @baskaryan Twitter handle: mangelino --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	12 months ago
Daniel Butler	939bceccb0	GitHubIssuesLoader Custom API URL Support (#11378 ) - Description: Adds support for custom API URL in the GitHubIssuesLoader. This allows it to be used with Github enterprise instances.	12 months ago
Bagatur	16a80779b9	bump 307 (#11380 )	12 months ago
mziru	9e3c1d4463	add HTMLHeaderTextSplitter (#11039 ) Description: Similar in concept to the `MarkdownHeaderTextSplitter`, the `HTMLHeaderTextSplitter` is a "structure-aware" chunker that splits text at the element level and adds metadata for each header "relevant" to any given chunk. It can return chunks element by element or combine elements with the same metadata, with the objectives of (a) keeping related text grouped (more or less) semantically and (b) preserving context-rich information encoded in document structures. It can be used with other text splitters as part of a chunking pipeline. Dependency: lxml python package Maintainer: @hwchase17 Twitter handle: @MartinZirulnik --------- Co-authored-by: PresidioVantage <github@presidiovantage.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Predrag Gruevski	289de601c8	Use parameterized queries to select SQL schemas. (#11356 )	12 months ago
Nuno Campos	b0097f8908	In ProgressBarCallback update the progress counter also when runs fin… (#11332 )	12 months ago
William FH	06f39be1c2	Wfh/eval max concurrency (#11368 )	12 months ago
Aashish Saini	4adb2b399d	Fixed exception type in py files (#11322 ) I've refactored the code to ensure that ImportError is consistently handled. Instead of using ValueError as before, I've now followed the standard practice of raising ImportError along with clear and informative error messages. This change enhances the code's clarity and explicitly signifies that any problems are associated with module imports.	12 months ago
니콜라스	c6d7124675	Add 'device' to GPT4All (#11216 ) Add device to GPT4All - Description: GPT4All now supports GPU. This commit adds the option to enable it. - Issue: It closes https://github.com/langchain-ai/langchain/issues/10486 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	12 months ago
Harrison Chase	6e848b879a	add default for async (#11367 )	12 months ago
Fynn Flügge	0a4baca291	chore: add kotlin code splitter (#11364 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: Adds Kotlin language to `TextSplitter` --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Ofer Mendelevitch	b93a08079e	Updates to Vectara Implementation (#11366 ) Replace this entire comment with: - Description: updates to documentation and API headers - Tag maintainer: @baskarya - Twitter handle: @ofermend	12 months ago
Erick Friis	745e3e29da	add getattr case for llms.type_to_cls_dict (#11362 ) For external libraries that depend on `type_to_cls_dict`, adds a workaround to continue using the old format. Recommend people use `get_type_to_cls_dict()` instead and only resolve the imports when they're used.	12 months ago
Vicente Reyes	f3e13e7e5a	Use term keyword according to the official python doc glossary (#11338 ) - Description: use term keyword according to the official python doc glossary, see https://docs.python.org/3/glossary.html - Issue: not applicable - Dependencies: not applicable - Tag maintainer: @hwchase17 - Twitter handle: vreyespue	12 months ago
Predrag Gruevski	5d6b83d9cf	Make a copy of external data instead of mutating another object's attributes. (#11349 ) Fix for a bug surfaced as part of #11339. `mypy` caught this since the types didn't match up.	12 months ago
Predrag Gruevski	42d979efdd	Improve type hints and interface for SQL execution functionality. (#11353 ) The previous API of the `_execute()` function had a few rough edges that this PR addresses: - The `fetch` argument was type-hinted as being able to take any string, but any string other than `"all"` or `"one"` would `raise ValueError`. The new type hints explicitly declare that only those values are supported. - The return type was type-hinted as `Sequence` but using `fetch = "one"` would actually return a single result item. This was incorrectly suppressed using `# type: ignore`. We now always return a list. - Using `fetch = "one"` would return a single item if data was found, or an empty list if no data was found. This was confusing, and we now always return a list to simplify. - The return type was `Sequence[Any]` which was a bit difficult to use since it wasn't clear what one could do with the returned rows. I'm making the new type `Dict[str, Any]` that corresponds to the column names and their values in the query. I've updated the use of this method elsewhere in the file to match the new behavior.	12 months ago
Mohammad Mohtashim	3bddd708f7	Add memory to sql chain (#8597 ) continuation of PR #8550 @hwchase17 please see and merge. And also close the PR #8550. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
Harrison Chase	feabf2e0d5	make llm imports optional (#11237 )	12 months ago
Harrison Chase	88bad37ec2	fix get_tool_return (#11346 )	12 months ago
Harrison Chase	bdf865d8e8	better error message on parsing errors (#11342 )	12 months ago
Eugene Yurtsev	2343302fc6	Remove langserve from langchain repo (#11288 ) LangServe has been moved to a separate repo	12 months ago
William FH	6950b44bfc	Consolidate run collector. Add link helper (#11269 ) Instead of: ``` client = Client() with collect_runs() as cb: chain.invoke() run = cb.traced_runs[0] client.get_run_url(run) ``` it's ``` with tracing_v2_enabled() as cb: chain.invoke() cb.get_run_url() ```	12 months ago
Nuno Campos	0aedbcf7b2	Pass kwargs in runnable retry (#11324 )	12 months ago
Jacob Lee	933655b4ac	Adds Tavily Search API retriever (#11314 ) @baskaryan @efriis	12 months ago
David Duong	3ec970cc11	Mark Vertex AI classes as serialisable (#10484 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. These live is docs/extras directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17, @rlancemartin. --> --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
David Duong	db36a0ee99	Make Google PaLM classes serialisable (#11121 ) Similarly to Vertex classes, PaLM classes weren't marked as serialisable. Should be working fine with LangSmith. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	12 months ago
CG80499	943e4f30d8	Add scoring chain (#11123 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Predrag Gruevski	cd2479dfae	Upgrade `langchain` dependency versions to resolve dependabot alerts. (#11307 )	12 months ago
Nuno Campos	4df3191092	Add .configurable_fields() and .configurable_alternatives() to expose fields of a Runnable to be configured at runtime (#11282 )	12 months ago
Eugene Yurtsev	5e2d5047af	add LLMBashChain to experimental (#11305 ) Add LLMBashChain to experimental	12 months ago
Bagatur	38d5b63a10	Bedrock scheduled tests (#11194 )	12 months ago
Eugene Yurtsev	f9b565fa8c	Bump min version of numexpr (#11302 ) Bump min version	12 months ago
William FH	64febf7751	Make numexpr optional (#11049 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Eugene Yurtsev	20b7bd497c	Add pending deprecation warning (#11133 ) This PR uses 2 dedicated LangChain warnings types for deprecations (mirroring python's built in deprecation and pending deprecation warnings). These deprecation types are unslienced during initialization in langchain achieving the same default behavior that we have with our current warnings approach. However, because these warnings have a dedicated type, users will be able to silence them selectively (I think this is strictly better than our current handling of warnings). The PR adds a deprecation warning to llm symbolic math. --------- Co-authored-by: Predrag Gruevski <2348618+obi1kenobi@users.noreply.github.com>	12 months ago
Nuno Campos	0638f7b83a	Create new RunnableSerializable base class in preparation for configurable runnables (#11279 ) - Also move RunnableBranch to its own file <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Bagatur	8eec43ed91	bump 306 (#11289 )	12 months ago
Nuno Campos	c6a720f256	Lint	12 months ago
Nuno Campos	1d46ddd16d	Lint	12 months ago
Nuno Campos	17708fc156	Lint	12 months ago
Nuno Campos	a3b82d1831	Move RunnableWithFallbacks to its own file	12 months ago
Nuno Campos	01dbfc2bc7	Lint	12 months ago
Nuno Campos	a6afd45c63	Lint	12 months ago
Nuno Campos	f7dd10b820	Lint	12 months ago
Nuno Campos	040bb2983d	Lint	12 months ago
Nuno Campos	52e5a8b43e	Create new RunnableSerializable class in preparation for configurable runnables - Also move RunnableBranch to its own file	12 months ago
Yeonji-Lim	61ab1b1266	Fix typo in docstring (#11256 ) Description : Remove meaningless 's' in docstring	12 months ago
Kazuki Maeda	a363ab5292	rename repo namespace to langchain-ai (#11259 ) ### Description renamed several repository links from `hwchase17` to `langchain-ai`. ### Why I discovered that the README file in the devcontainer contains an old repository name, so I took the opportunity to rename the old repository name in all files within the repository, excluding those that do not require changes. ### Dependencies none ### Tag maintainer @baskaryan ### Twitter handle [kzk_maeda](https://twitter.com/kzk_maeda)	12 months ago
Dayuan Jiang	17cdeb72ef	minor fix: remove redundant code from OpenAIFunctionsAgent (#11245 ) minor fix: remove redundant code from OpenAIFunctionsAgent (#11245)	12 months ago
Michael Goin	33eb5f8300	Update DeepSparse LLM (#11236 ) Description: Adds streaming and many more sampling parameters to the DeepSparse interface --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	12 months ago
Eugene Yurtsev	f91ce4eddf	Bump deps in langserve (#11234 ) Bump deps in langserve lockfile	12 months ago
Haozhe	4c97a10bd0	fix code injection vuln (#11233 ) - Description: Fix a code injection vuln by adding one more keyword into the filtering list - Issue: N/A - Dependencies: N/A - Tag maintainer: - Twitter handle: Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Eugene Yurtsev	aebdb1ad01	Ignore aadd (#11235 )	12 months ago
Eugene Yurtsev	8b4cb4eb60	Add type to message chunks (#11232 )	12 months ago
Nuno Campos	fb66b392c6	Implement RunnablePassthrough.assign(...) (#11222 ) Passes through dict input and assigns additional keys <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	1ddf9f74b2	Add a streaming json parser (#11193 ) <img width="1728" alt="Screenshot 2023-09-28 at 20 15 01" src="https://github.com/langchain-ai/langchain/assets/56902/ed0644c3-6db7-41b9-9543-e34fce46d3e5"> <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	ee56c616ff	Remove flawed test - It is not possible to access properties on classes, only on instances, therefore this test is not something we can implement	12 months ago
Nuno Campos	f3f3f71811	Lint	12 months ago
Nuno Campos	f6b0b065d3	Update json.py Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Nuno Campos	cbe18057b0	Update json.py Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Nuno Campos	aa8b4120a8	Keep exceptions when not in streaming mode	12 months ago
Nuno Campos	1f30e25681	Lint	12 months ago
Nuno Campos	c9d0f2b984	Combine with existing json output parsers	12 months ago
Eugene Yurtsev	b4354b7694	Make tests stricter, remove old code, fix up pydantic import when using v2 (#11231 ) Make tests stricter, remove old code, fix up pydantic import when using v2 (#11231)	12 months ago
Eugene Yurtsev	572968fee3	Using langchain input types (#11204 ) Using langchain input type	12 months ago
Bagatur	77c7c9ab97	bump 305 (#11224 )	12 months ago
Nuno Campos	4b8442896b	Make test deterministic	12 months ago
Attila Tőkés	ba9371854f	OpenAI gpt-3.5-turbo-instruct cost information (#11218 ) Added pricing info for `gpt-3.5-turbo-instruct` for OpenAI and Azure OpenAI. Co-authored-by: Attila Tőkés <atokes@rws.com>	12 months ago
Eugene Yurtsev	de69ea26e8	Suppress warnings in interactive env that stem from tab completion (#11190 ) Suppress warnings in interactive environments that can arise from users relying on tab completion (without even using deprecated modules). jupyter seems to filter warnings by default (at least for me), but ipython surfaces them all	12 months ago
Jon Saginaw	715ffda28b	mongodb doc loader init (#10645 ) - Description: A Document Loader for MongoDB - Issue: n/a - Dependencies: Motor, the async driver for MongoDB - Tag maintainer: n/a - Twitter handle: pigpenblue Note that an initial mongodb document loader was created 4 months ago, but the [PR ](https://github.com/langchain-ai/langchain/pull/4285)was never pulled in. @leo-gan had commented on that PR, but given it is extremely far behind the master branch and a ton has changed in Langchain since then (including repo name and structure), I rewrote the branch and issued a new PR with the expectation that the old one can be closed. Please reference that old PR for comments/context, but it can be closed in favor of this one. Thanks! --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	12 months ago
Nuno Campos	3d8aa88e26	Add async tests and comments	12 months ago
Nuno Campos	4ad0f3de2b	Add RunnableGenerator (#11214 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Guy Korland	748a757306	Clean warnings: replace type with isinstance and fix syntax (#11219 ) Clean warnings: replace type with `isinstance` and fix on notebook syntax syntax	12 months ago
Nuno Campos	091d8845d5	Backwards compat	12 months ago
Nuno Campos	4e28a7a513	Implement diff	12 months ago
Nuno Campos	5cbe2b7b6a	Implement diff	12 months ago
Nuno Campos	6c0a6b70e0	WIP Add tests§	12 months ago
Nuno Campos	63f2ef8d1c	Implement str one	12 months ago
Nuno Campos	f672b39cc9	Add a streaming json parser	12 months ago
Nuno Campos	2387647d30	Lint	12 months ago
Nuno Campos	0318cdd33c	Add tests	12 months ago
Nuno Campos	b67db8deaa	Add RunnableGenerator	12 months ago
Nuno Campos	e35ea565d1	Lint	12 months ago
Nuno Campos	7f589ebbc2	Lint	12 months ago
Nuno Campos	8be598f504	Fix invocation	12 months ago
Nuno Campos	6eb6c45c98	Enable creating Tools from any Runnable	12 months ago
Nuno Campos	61b5942adf	Implement better reprs for Runnables (#11175 ) ``` ChatPromptTemplate(messages=[SystemMessagePromptTemplate(prompt=PromptTemplate(input_variables=[], template='You are a nice assistant.')), HumanMessagePromptTemplate(prompt=PromptTemplate(input_variables=['question'], template='{question}'))]) \| RunnableLambda(lambda x: x) \| { chat: FakeListChatModel(responses=["i'm a chatbot"]), llm: FakeListLLM(responses=["i'm a textbot"]) } ``` <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Nuno Campos	e8e2b812c9	Even more	12 months ago
Nuno Campos	fc072100fa	skip more	12 months ago
Nuno Campos	7bfee012d5	Skip in py3.8	12 months ago
Nuno Campos	b8e3e1118d	Skip for py3.8	12 months ago
William FH	db05ea2b78	Add from_embeddings for opensearch (#10957 )	12 months ago
William FH	73693c18fc	Add support for project metadata in run_on_dataset (#11200 )	12 months ago
James Braza	b11f21c25f	Updated `LocalAIEmbeddings` docstring to better explain why `openai` (#10946 ) Fixes my misgivings in https://github.com/langchain-ai/langchain/issues/10912	12 months ago
Eugene Yurtsev	2c114fcb5e	Fix web-base loader (#11135 ) Fix initialization https://github.com/langchain-ai/langchain/issues/11095	12 months ago
jreinjr	3bc44b01c0	Typo fix to MathpixPDFLoader - changed processed_file_format default … (#10960 ) …from mmd to md. https://github.com/langchain-ai/langchain/issues/7282 <!-- - Description: minor fix to a breaking typo - MathPixPDFLoader processed_file_format is "mmd" by default, doesn't work, changing to "md" fixes the issue, - Issue: 7282 (https://github.com/langchain-ai/langchain/issues/7282), - Dependencies: none, - Tag maintainer: @hwchase17, - Twitter handle: none --> Co-authored-by: jare0530 <7915+jare0530@users.noreply.ghe.oculus-rep.com>	12 months ago
Dr. Fabien Tarrade	66415eed6e	Support new version of tiktoken that are working with langchain (tag "^0.3.2" => "">=0.3.2,<0.6.0" and python "^3.9" =>">=3.9") (#11006 ) - Description: be able to use langchain with other version than tiktoken 0.3.3 i.e 0.5.1 - Issue: cannot installed the conda-forge version since it applied all optional dependency: https://github.com/conda-forge/langchain-feedstock/pull/85 replace "^0.3.2" by "">=0.3.2,<0.6.0" and "^3.9" by python=">=3.9" Tested with python 3.10, langchain=0.0.288 and tiktoken==0.5.0 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Clément Sicard	1b48d6cb8c	`LlamaCppEmbeddings`: adds `verbose` parameter, similar to `llms.LlamaCpp` class (#11038 ) ## Description As of now, when instantiating and during inference, `LlamaCppEmbeddings` outputs (a lot of) verbose when controlled from Langchain binding - it is a bit annoying when computing the embeddings of long documents, for instance. This PR adds `verbose` for `LlamaCppEmbeddings` objects to be able not to print the verbose of the model to `stderr`. It is natively supported by `llama-cpp-python` and directly passed to the library – the PR is hence very small. The value of `verbose` is `True` by default, following the way it is defined in [`LlamaCpp` (`llamacpp.py` #L136-L137)](`c87e9fb2ce/libs/langchain/langchain/llms/llamacpp.py (L136-L137)`) ## Issue _No issue linked_ ## Dependencies _No additional dependency needed_ ## To see it in action ```python from langchain.embeddings import LlamaCppEmbeddings MODEL_PATH = "<path_to_gguf_file>" if __name__ == "__main__": llm_embeddings = LlamaCppEmbeddings( model_path=MODEL_PATH, n_gpu_layers=1, n_batch=512, n_ctx=2048, f16_kv=True, verbose=False, ) ``` Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Noah Czelusta	a00a73ef18	Add last_edited_time and created_time props to NotionDBLoader (#11020 ) # Description Adds logic for NotionDBLoader to correctly populate `last_edited_time` and `created_time` fields from [page properties](https://developers.notion.com/reference/page#property-value-object). There are no relevant tests for this code to be updated. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Eugene Yurtsev	e06e84b293	LangServe: Relax requirements (#11198 ) Relax requirements	12 months ago
PaperMoose	5d7c6d1bca	Synthetic Data generation (#9472 ) --------- Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Donatas Remeika	a4e0cf6300	SearchApi integration (#11023 ) Based on the customers' requests for native langchain integration, SearchApi is ready to invest in AI and LLM space, especially in open-source development. - This is our initial PR and later we want to improve it based on customers' and langchain users' feedback. Most likely changes will affect how the final results string is being built. - We are creating similar native integration in Python and JavaScript. - The next plan is to integrate into Java, Ruby, Go, and others. - Feel free to assign @SebastjanPrachovskij as a main reviewer for any SearchApi-related searches. We will be glad to help and support langchain development.	12 months ago
Bagatur	8cd18a48e4	fix trubrics lint issue (#11202 )	12 months ago
Fynn Flügge	b738ccd91e	chore: add support for TypeScript code splitting (#11160 ) - Description: Adds typescript language to `TextSplitter` --------- Co-authored-by: Jacob Lee <jacoblee93@gmail.com>	12 months ago
Kenneth Choe	17fcbed92c	Support add_embeddings for opensearch (#11050 ) - Description: - Make running integration test for opensearch easy - Provide a way to use different text for embedding: refer to #11002 for more of the use case and design decision. - Issue: N/A - Dependencies: None other than the existing ones.	12 months ago
Jeff Kayne	c586f6dc1b	Callback integration for Trubrics (#11059 ) After contributing to some examples in the [langsmith-cookbook](https://github.com/langchain-ai/langsmith-cookbook) with @hinthornw, here is a PR that adds a callback handler to use LangChain with [Trubrics](https://github.com/trubrics/trubrics-sdk).	12 months ago
Michael Landis	a8db594012	fix: short-circuit black and mypy calls when no changes made (#11051 ) Both black and mypy expect a list of files or directories as input. As-is the Makefile computes a list files changed relative to the last commit; these are passed to black and mypy in the `format_diff` and `lint_diff` targets. This is done by way of the Makefile variable `PYTHON_FILES`. This is to save time by skipping running mypy and black over the whole source tree. When no changes have been made, this variable is empty, so the call to black (and mypy) lacks input files. The call exits with error causing the Makefile target to error out with: ```bash $ make format_diff poetry run black Usage: black [OPTIONS] SRC ... One of 'SRC' or 'code' is required. make: *** [format_diff] Error 1 ``` This is unexpected and undesirable, as the naive caller (that's me! 😄 ) will think something else is wrong. This commit smooths over this by short circuiting when `PYTHON_FILES` is empty.	12 months ago
Michael Kim	fbcd8e02f2	Change type annotations from LLMChain to Chain in MultiPromptChain (#11082 ) - Description: The types of 'destination_chains' and 'default_chain' in 'MultiPromptChain' were changed from 'LLMChain' to 'Chain'. and removed variables declared overlapping with the parent class - Issue: When a class that inherits only Chain and not LLMChain, such as 'SequentialChain' or 'RetrievalQA', is entered in 'destination_chains' and 'default_chain', a pydantic validation error is raised. - - codes ``` retrieval_chain = ConversationalRetrievalChain( retriever=doc_retriever, combine_docs_chain=combine_docs_chain, question_generator=question_gen_chain, ) destination_chains = { 'retrieval': retrieval_chain, } main_chain = MultiPromptChain( router_chain=router_chain, destination_chains=destination_chains, default_chain=default_chain, verbose=True, ) ``` ✅ `make format`, `make lint` and `make test`	12 months ago
Piyush Jain	32d09bcd1e	Expanded version range for networkx, fixed sample notebook (#11094 ) ## Description Expanded the upper bound for `networkx` dependency to allow installation of latest stable version. Tested the included sample notebook with version 3.1, and all steps ran successfully. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Piotr Mardziel	b40ecee4b9	FIx eval prompt (#11087 ) Description: fixes a common typo in some of the eval criteria.	12 months ago
Guy Korland	5564833bd2	Add `add_graph_documents` support for FalkorDBGraph (#11122 ) Adding `add_graph_documents` support for FalkorDBGraph and extending the `Neo4JGraph` api so it can support `cypher.py`	12 months ago
Tomaz Bratanic	7d25a65b10	add from_existing_graph to neo4j vector (#11124 ) This PR adds the option to create a Neo4jvector instance from existing graph, which embeds existing text in the database and creates relevant indices.	12 months ago
Noah Stapp	2c952de21a	Add support for MongoDB Atlas $vectorSearch vector search (#11139 ) Adds support for the `$vectorSearch` operator for MongoDBAtlasVectorSearch, which was announced at .Local London (September 26th, 2023). This change maintains breaks compatibility support for the existing `$search` operator used by the original integration (https://github.com/langchain-ai/langchain/pull/5338) due to incompatibilities in the Atlas search implementations. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Hugues	b599f91e33	LLMonitor Callback handler: fix bug (#11128 ) Here is a small bug fix for the LLMonitor callback handler. I've also added user identification capabilities.	12 months ago
William FH	e9b51513e9	Shared Executor (#11028 )	12 months ago
Justin Plock	926e4b6bad	[Feat] Add optional client-side encryption to DynamoDB chat history memory (#11115 ) Description: Added optional client-side encryption to the Amazon DynamoDB chat history memory with an AWS KMS Key ID using the [AWS Database Encryption SDK for Python](https://docs.aws.amazon.com/database-encryption-sdk/latest/devguide/python.html) Issue: #7886 Dependencies: [dynamodb-encryption-sdk](https://pypi.org/project/dynamodb-encryption-sdk/) Tag maintainer: @hwchase17 Twitter handle: [@jplock](https://twitter.com/jplock/) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	12 months ago
Eugene Yurtsev	4947ac2965	Add langserve version (#11195 ) Add langserve version	12 months ago
Joseph McElroy	822fc590d9	[ElasticsearchStore] Improve migration text to ElasticsearchStore (#11158 ) We noticed that as we have been moving developers to the new `ElasticsearchStore` implementation, we want to keep the ElasticVectorSearch class still available as developers transition slowly to the new store. To speed up this process, I updated the blurb giving them a better recommendation of why they should use ElasticsearchStore.	12 months ago
Naveen Tatikonda	9b0029b9c2	[OpenSearch] Add Self Query Retriever Support to OpenSearch (#11184 ) ### Description Add Self Query Retriever Support to OpenSearch ### Maintainers @rlancemartin, @eyurtsev, @navneet1v ### Twitter Handle @OpenSearchProj Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	12 months ago
Arthur Telders	0da484be2c	Add source metadata to OutlookMessageLoader (#11183 ) Description: Add "source" metadata to OutlookMessageLoader This pull request adds the "source" metadata to the OutlookMessageLoader class in the load method. The "source" metadata is required when indexing with RecordManager in order to sync the index documents with a source. Issue: None Dependencies: None Twitter handle: @ATelders Co-authored-by: Arthur Telders <arthur.telders@roquette.com>	12 months ago
Bagatur	3508e582f1	add anthropic scheduled tests and unit tests (#11188 )	12 months ago
Eugene Yurtsev	fd96878c4b	Fix anthropic secret key when passed in via init (#11185 ) Fixes anthropic secret key when passed via init https://github.com/langchain-ai/langchain/issues/11182	12 months ago
Bagatur	f201d80d40	temporarily skip embedding empty string test (#11187 )	12 months ago
Eugene Yurtsev	b3cf9c8759	LangServe: Update langchain requirement for publishing (#11186 ) Update langchain requirement for publishing	12 months ago
mani2348	89ddc7cbb6	Update Bedrock service name to "bedrock-runtime" and model identifiers (#11161 ) - Description: Bedrock updated boto service name to "bedrock-runtime" for the InvokeModel and InvokeModelWithResponseStream APIs. This update also includes new model identifiers for Titan text, embedding and Anthropic. Co-authored-by: Mani Kumar Adari <maniadar@amazon.com>	12 months ago
Eugene Yurtsev	de3e25683e	Expose lc_id as a classmethod (#11176 ) * Expose LC id as a class method * User should not need to know that the last part of the id is the class name	12 months ago
Nuno Campos	5ca461160b	Lint	12 months ago
Nuno Campos	151f27d502	Lint	12 months ago
Eugene Yurtsev	4ba9c16f74	mypy	12 months ago
Eugene Yurtsev	44489e7029	LangServe: Clean up init files (#11174 ) Clean up init files	12 months ago
Akio Nishimura	785b9d47b7	Fix stop key of TextGen. (#11109 ) The key of stopping strings used in text-generation-webui api is [`stopping_strings`](https://github.com/oobabooga/text-generation-webui/blob/main/api-examples/api-example.py#L51), not `stop`. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Eugene Yurtsev	d1d7d0cb27	x	12 months ago
Eugene Yurtsev	c86b2b5e42	x	12 months ago
Eugene Yurtsev	fe4f3b8fdf	x	12 months ago
Eugene Yurtsev	a5b15e9d0f	x	12 months ago
Nuno Campos	5c1f462bb9	Implement better reprs for Runnables	12 months ago
Nan LI	53a9d6115e	Xata chat memory FIX (#11145 ) - Description: Changed data type from `text` to `json` in xata for improved performance. Also corrected the `additionalKwargs` key in the `messages()` function to `additional_kwargs` to adhere to `BaseMessage` requirements. - Issue: The Chathisroty.messages() will return {} of `additional_kwargs`, as the name is wrong for `additionalKwargs` . - Dependencies: N/A - Tag maintainer: N/A - Twitter handle: N/A My PR is passing linting and testing before submitting.	12 months ago
William FH	8ae9b71e41	Async support for OpenAIFunctionsAgentOutputParser (#11140 )	12 months ago
Bagatur	ce08f436db	Expose loads and dumps in load namespace	12 months ago
Nuno Campos	cfa2203c62	Add input/output schemas to runnables (#11063 ) This adds `input_schema` and `output_schema` properties to all runnables, which are Pydantic models for the input and output types respectively. These are inferred from the structure of the Runnable as much as possible, the only manual typing needed is - optionally add type hints to lambdas (which get translated to input/output schemas) - optionally add type hint to RunnablePassthrough These schemas can then be used to create JSON Schema descriptions of input and output types, see the tests - [x] Ensure no InputType and OutputType in our classes use abstract base classes (replace with union of subclasses) - [x] Implement in BaseChain and LLMChain - [x] Implement in RunnableBranch - [x] Implement in RunnableBinding, RunnableMap, RunnablePassthrough, RunnableEach, RunnableRouter - [x] Implement in LLM, Prompt, Chat Model, Output Parser, Retriever - [x] Implement in RunnableLambda from function signature - [x] Implement in Tool <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Eugene Yurtsev	b05bb9e136	LangServe (#11046 ) Adds LangServe package * Integrate Runnables with Fast API creating Server and a RemoteRunnable client * Support multiple runnables for a given server * Support sync/async/batch/abatch/stream/astream/astream_log on the client side (using async implementations on server) * Adds validation using annotations (relying on pydantic under the hood) -- this still has some rough edges -- e.g., open api docs do NOT generate correctly at the moment * Uses pydantic v1 namespace Known issues: type translation code doesn't handle a lot of types (e.g., TypedDicts) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	12 months ago
Nuno Campos	77ce9ed6f1	Support using async callback handlers with sync callback manager (#10945 ) The current behaviour just calls the handler without awaiting the coroutine, which results in exceptions/warnings, and obviously doesn't actually execute whatever the callback handler does <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	12 months ago
Bagatur	48a04aed75	bump 304 (#11147 )	1 year ago
Jonathan Evans	23065f54c0	Added prompt wrapping for Claude with Bedrock (#11090 ) - Description: Prompt wrapping requirements have been implemented on the service side of AWS Bedrock for the Anthropic Claude models to provide parity between Anthropic's offering and Bedrock's offering. This overnight change broke most existing implementations of Claude, Bedrock and Langchain. This PR just steals the the Anthropic LLM implementation to enforce alias/role wrapping and implements it in the existing mechanism for building the request body. This has also been tested to fix the chat_model implementation as well. Happy to answer any further questions or make changes where necessary to get things patched and up to PyPi ASAP, TY. - Issue: No issue opened at the moment, though will update when these roll in. - Dependencies: None --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	1 year ago
xiaoyu	b87cc8b31e	add 3 property types in metadata for notiondb loader (#8509 ) ### Description: NotionDB supports a number of common property types. I have found three common types that are not included in notiondb loader. When programs loaded them with notiondb, which will cause some metadata information not to be passed to langchain. Therefore, I added three common types: - date - created_time - last_edit_time. ### Issue: no ### Dependencies: No dependencies added :) ### Tag maintainer: @rlancemartin, @eyurtsev ### Twitter handle: @BJTUTC	1 year ago
Harrison Chase	258d67b0ac	Revert "improve the performance of base.py" (#11143 ) Reverts langchain-ai/langchain#8610 this is actually an oversight - this merges all dfs into one df. we DO NOT want to do this - the idea is we work and manipulate multiple dfs	1 year ago
Mohamad Zamini	9306394078	improve the performance of base.py (#8610 ) This removes the use of the intermediate df list and directly concatenates the dataframes if path is a list of strings. The pd.concat function combines the dataframes efficiently, making it faster and more memory-efficient compared to appending dataframes to a list. <!-- Thank you for contributing to LangChain! Replace this comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced and you'd like a mention, we'll gladly shout you out! Please make sure you're PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. Maintainer responsibilities: - General / Misc / if you don't know who to tag: @baskaryan - DataLoaders / VectorStores / Retrievers: @rlancemartin, @eyurtsev - Models / Prompts: @hwchase17, @baskaryan - Memory: @hwchase17 - Agents / Tools / Toolkits: @hinthornw - Tracing / Callbacks: @agola11 - Async: @agola11 If no one reviews your PR within a few days, feel free to @-mention the same people again. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	1 year ago
Mincoolee	05b75f3f13	feat: add support for arxiv identifier in ArxivAPIWrapper() (#9318 ) - Description: this PR adds the support for arxiv identifier of the ArxivAPIWrapper. I modified the `run()` and `load()` functions in `arxiv.py`, using regex to recognize if the query is in the form of arxiv identifier (see [https://info.arxiv.org/help/find/index.html](https://info.arxiv.org/help/find/index.html)). If so, it will directly search the paper corresponding to the arxiv identifier. I also modified and added tests in `test_arxiv.py`. - Issue: #9047 - Dependencies: N/A - Tag maintainer: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	1 year ago
William FH	d3c2ca5656	Enhanced pairwise error (#11131 )	1 year ago
Taqi Jaffri	b7e9db5e73	Stop sequences in fireworks, plus notebook updates (#11136 ) The new Fireworks and FireworksChat implementations are awesome! Added in this PR https://github.com/langchain-ai/langchain/pull/11117 thank you @ZixinYang However, I think stop words were not plumbed correctly. I've made some simple changes to do that, and also updated the notebook to be a bit clearer with what's needed to use both new models. --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	1 year ago
William FH	33da8bd711	Add Exact match and Regex Match Evaluators (#11132 )	1 year ago
Harrison Chase	e355606b11	add more import checks (#11033 )	1 year ago
Dan Bolser	efb7c459a2	Update base.py (#10843 ) Fixing a typo in the example code in the docstring... You have to start somewhere though right? Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	1 year ago
tanujtiwari-at	a79f595543	Support extra tools argument for pandas agent toolkit (#11040 ) Description We support adding new tools in some toolkits already like the [SQLAgent toolkit](https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/agents/agent_toolkits/sql/base.py#L27). Related [SO](https://stackoverflow.com/questions/76583163/are-langchain-toolkits-able-to-be-modified-can-we-add-tools-to-a-pandas-datafra) thread This replicates the same functionality here, so users can add custom bespoke tools.	1 year ago
Bagatur	410ac8129d	bump 303 (#11120 )	1 year ago
Bagatur	8e4dbae428	Add fireworks chat model (#11117 )	1 year ago
Bagatur	657581dbdf	Fix ChatFireworks typing	1 year ago
Bagatur	12aad659dd	add ChatFireworks to chat_models	1 year ago
Bagatur	872ebdaf90	remove FireworksChat from llms	1 year ago
Bagatur	9451240941	Fix fireworks chat linting issues	1 year ago
Tomáš Dvořák	865a21938c	speed up enforce_stop_tokens helper function (#10984 ) Description: As long as `enforce_stop_tokens` returns a first occurrence, we can speed up the execution by setting the optional `maxsplit` parameter to 1. Tag maintainer: @agola11 @hwchase17 <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	1 year ago
Austin Walker	bb41252dab	fix: bump min_unstructured_version for UnstructuredAPIFileLoader (#11025 ) Description: New metadata fields were added to `unstructured==0.10.15`, and our hosted api has been updated to reflect this. When users call `partition_via_api` with an older version of the library, they'll hit a parsing error related to the new fields.	1 year ago
William FH	75b3893daf	Fix runnable branch callbacks (#11091 ) We aren't calling on_chain_end here unless we use the default option	1 year ago
Bagatur	6c5251feb0	poetry	1 year ago
Bagatur	5310184f96	poetry	1 year ago
Cynthia Yang	6dd44ff1c0	Refactor Fireworks and add ChatFireworks (#3 ) (#10597 ) Description * Refactor Fireworks within Langchain LLMs. * Remove FireworksChat within Langchain LLMs. * Add ChatFireworks (which uses chat completion api) to Langchain chat models. * Users have to install `fireworks-ai` and register an api key to use the api. Issue - Not applicable Dependencies - None Tag maintainer - @rlancemartin @baskaryan	1 year ago
Bagatur	5514ebe859	Don't type chains in output_parsers (#11092 ) Can't use TYPE_CHECKING style imports for pydantic params because it will try to instantiate the typed object by default.	1 year ago
CG80499	64385c4eae	Make pairwise comparison chain more like LLM as a judge (#11013 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description:: Adds LLM as a judge as an eval chain - Tag maintainer: @hwchase17 Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	1 year ago
Joseph McElroy	175ef0a55d	[ElasticsearchStore] Enable custom Bulk Args (#11065 ) This enables bulk args like `chunk_size` to be passed down from the ingest methods (from_text, from_documents) to be passed down to the bulk API. This helps alleviate issues where bulk importing a large amount of documents into Elasticsearch was resulting in a timeout. Contribution Shoutout - @elastic - [x] Updated Integration tests --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	1 year ago
Eugene Yurtsev	d19fd0cfae	LogEntry/LogStream use str instead of uuid for id (#11080 ) Cast the UUID to a string	1 year ago
Bagatur	d85339b9f2	extract sublinks exclude by abs path (#11079 )	1 year ago
Bagatur	7ee8b2d1bf	exclude dirs in async recursive loading (#11077 )	1 year ago
Bagatur	12fb393a43	bump 302 (#11070 )	1 year ago

... 9 10 11 12 13 ...

2111 Commits (5d1deddbfb0c34df16d48697e3e1aaa7aacbbabc)