langchain

mirror of https://github.com/hwchase17/langchain synced 2024-11-10 01:10:59 +00:00

Author	SHA1	Message	Date
AlpinDale	b0588774f1	community[minor]: Add Aphrodite Engine support (#14759 ) This PR adds support for PygmalionAI's [Aphrodite Engine](https://github.com/PygmalionAI/aphrodite-engine), based on vLLM's attention mechanism. At the moment, this PR does not include support for the API servers, but they will be added in a later PR. The only dependency as of now is `aphrodite-engine==0.4.2`. We pin the version to prevent breakage due to changes in the aphrodite-engine library. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 01:16:57 -05:00
Dmitry Tyumentsev	d21f44b484	community[minor]: Add YandexGPT embeddings (#14767 ) - Description: Introducing an ability to work with the [YandexGPT](https://cloud.yandex.com/en/services/yandexgpt) embeddings models. --------- Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-20 01:11:07 -05:00
Nicolas Suzor	529144649e	community[patch]: add png support for vertexai._parse_chat_history_gemini() (#14788 ) - Description: Modify community chat model vertexai to handle png and other image types encoded in base64 - Dependencies: added `import re` but no new dependencies. This addresses a problem where the vertexai method _parse_chat_history_gemini() was only recognizing image uris in jpeg format. I made a simple change to cover other extension types.	2023-12-20 00:58:39 -05:00
Dr. Christoph Mittendorf	f348ad4ba8	docs: typo LLaMA2_sql_chat.ipynb (#14798 ) "language" (right) vs "langugae" (wrong)	2023-12-20 00:54:06 -05:00
Liu Jun	b0c48dc983	community[patch]: make ak and sk optional in qianfan endpoint (#14835 ) - Description: The Qianfan SDK offers multiple authentication methods, but the `QianfanEndpoint` in LangChain currently supports authentication only through AK and SK. To accommodate users who wish to use alternative authentication methods, this pull request makes AK and SK optional. This change should not impact existing users, while allowing users to configure other authentication methods as per the Qianfan SDK documentation. - Dependencies: No	2023-12-20 00:49:33 -05:00
Archan Ghosh	65678b3816	community[patch]: Update arxiv.py with Entry ID as a return value (#14915 ) Added Entry ID as a return value inside get_summaries_as_docs - Description: Added the Entry ID as a return value, so it's easier to track the IDs of the papers that are being returned. With the additional return of the entry ID in functions like ArxivRetriever, it will be easier to reference the ID of the paper itself.	2023-12-20 00:30:24 -05:00
thehunmonkgroup	dc20766513	docs: readme for langchain-mistralai (#14917 ) - Description: Add README doc for MistralAI partner package. - Tag maintainer: @baskaryan	2023-12-20 00:22:43 -05:00
Elena Mata Yandiola	b66659fc28	docs: Clarification google_cloud_storage_directory.ipynb (#14922 ) - Description: Just a minor addition to the documentation to clarify how to load all files from a folder. I assumed, and tried, specifying it in the bucket (BUCKET/FOLDER) instead of using the prefix.	2023-12-20 00:21:42 -05:00
Ari Roffe	8bcadfd446	docs: nit embedding_distance.ipynb (#14929 ) Description: Fix the docs about embedding distance evaluations guide.	2023-12-20 00:13:17 -05:00
Yacine	20eacd4b5e	docs: update notebook documentation for custom tool (#14942 ) - Description: Documentation update. The custom tool notebook documentation is updated to remove the warning caused by directly instantiating the LLMMathChain with an llm, which is deprecated. The from_llm class method is used instead. The LLM output results are updated as well. - Issue: not applicable - Dependencies: No dependencies - Tag maintainer: @baskaryan - Twitter handle: @ybouakkaz Co-authored-by: Yacine Bouakkaz <Yacine.Bouakkaz@evokegroup.com>	2023-12-20 00:08:58 -05:00
Bagatur	345acb26ac	community[patch]: Matching engine, return doc id (#14930 )	2023-12-20 00:03:11 -05:00
Erick Friis	8a3360edf6	anthropic: beta messages integration (#14928 )	2023-12-19 18:55:19 -08:00
Erick Friis	795cf2ddda	together: package and embedding model (#14936 )	2023-12-19 18:48:32 -08:00
Erick Friis	c21379438c	docs: remove unused contributor steps (#14938 )	2023-12-19 18:41:50 -08:00
William FH	758bcd4671	Add langsmith and benchmark repo links (#14931 ) Think we could link to these in more places	2023-12-19 17:44:31 -08:00
João Galego	d306d89a9b	template: Add Bedrock JCVD template (#14480 ) This PR adds a simple LangChain template that uses [Anthropic's Claude on Amazon Bedrock ⛰️](https://aws.amazon.com/bedrock/claude/) to behave like JCVD. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-19 15:55:58 -08:00
Erick Friis	8b29b31554	cli: test_integration group (#14924 )	2023-12-19 12:09:04 -08:00
Erick Friis	4d48aedea3	cli: 0.0.20 (#14920 )	2023-12-19 11:56:21 -08:00
Erick Friis	bbb20804bd	templates: fix sql-research-assistant (#14921 )	2023-12-19 11:55:59 -08:00
Erick Friis	9ef2feb674	cli[patch]: add embedding to integration template (#14881 )	2023-12-19 09:58:21 -08:00
Michael Feil	7b96de3d5d	community[patch]: update Gradient embeddings (#14846 ) - Description: Going forward, we have our own API, `pip install gradientai`. We are therefore gradually removing the self-built packages in llamaindex, haystack and langchain. - Issue: None. - Dependencies: `pip install gradientai` - Tag maintainer: @michaelfeil	2023-12-19 11:46:33 -05:00
Igor Dvorkin	6cc3c2452c	community[patch]: Enhance iMessage chat loader with timestamp parsing and message ownership (#14804 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 11:09:01 -05:00
Mohammad Mohtashim	e3abe12243	community[patch]: helpful error message for GitHubAPIWrapper (#14803 ) Very simple change in relation to the issue https://github.com/langchain-ai/langchain/issues/14550 @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 11:08:06 -05:00
Leonid Ganeline	922693caba	docs: `chunkviz` reference (#14802 ) Added a reference to the `Chunkviz` utility.	2023-12-19 10:58:16 -05:00
Dmitry Tyumentsev	50381abc42	community[patch]: Add retry logic to Yandex GPT API Calls (#14907 ) Description: Added logic for re-calling the YandexGPT API in case of an error --------- Co-authored-by: Dmitry Tyumentsev <dmitry.tyumentsev@raftds.com>	2023-12-19 10:51:42 -05:00
Sirjanpreet Singh Banga	425e5e1791	community[minor]: rename ChatGPTRouter to GPTRouter (#14913 ) Description: Rename integration to GPTRouter Tag maintainer: @Gupta-Anubhav12 @samanyougarg @sirjan-ws-ext Twitter handle: [@SamanyouGarg](https://twitter.com/SamanyouGarg)	2023-12-19 10:48:52 -05:00
JaguarDB	992b04e475	community[minor]: added jaguar vector store (#14838 ) Description: A new vector store, Jaguar, is being added. Class, test scripts, and documentation are added. Issue: None -- This is the first PR contributing to LangChain Dependencies: This depends on the "pip install -U jaguardb-http-client" HTTP client package Tag maintainer: @baskaryan, @eyurtsev, @hwchase1 Twitter handle: @workbot --------- Co-authored-by: JY <jyjy@jaguardb> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 10:40:18 -05:00
Bagatur	a5be9f9475	mistralai: Add langchain-mistralai partner package (#14783 ) Co-authored-by: Chad Phillips <chad@apartmentlines.com>	2023-12-19 10:34:19 -05:00
Sirjanpreet Singh Banga	44cb899a93	community[minor]: Integrating GPTRouter (#14900 ) Description: Adding a langchain integration for [GPTRouter](https://gpt-router.writesonic.com/) 🚀 Tag maintainer: @Gupta-Anubhav12 @samanyougarg @sirjan-ws-ext Twitter handle: [@SamanyouGarg](https://twitter.com/SamanyouGarg) Integration Tests Passing	2023-12-19 10:08:36 -05:00
Bagatur	1069a93d18	langchain[patch]: export sagemaker LLMContentHandler (#14906 ) Resolves #14904	2023-12-19 10:00:32 -05:00
Kostas Botsas	4f4b078bf3	docs: add reference for XataVectorStore constructor (#14903 ) Adds doc reference to the XataVectorStore constructor for use with existing Xata table contents. @tsg @philkra	2023-12-19 09:04:46 -05:00
Leonid Ganeline	b2fd41331e	docs: docstrings `langchain_community` update (#14889 ) Added missing docstrings. Fixed inconsistencies in docstrings. Note CC @efriis There were PR errors on `langchain_experimental/prompt_injection_identifier/hugging_face_identifier.py`, but I didn't touch this file in this PR! Could it be a cache problem? I fixed this error.	2023-12-19 08:58:24 -05:00
William FH	583696732c	[Partner] NVIDIA TRT Package (#14733 ) Simplify #13976 and add as a separate package. - [] Add README - [X] Add doc notebook - [X] Add simple LLM integration --------- Co-authored-by: Jeremy Dyer <jdye64@gmail.com>	2023-12-18 19:08:25 -08:00
William FH	0d4cbbcc85	[Partner] Update google integration test (#14883 ) Gemini has decided that pickle rick is unsafe: https://github.com/langchain-ai/langchain/actions/runs/7256642294/job/19769249444#step:8:189	2023-12-18 18:46:24 -08:00
William FH	f88af1f1cd	[Partner] Google GenAi new release (#14882 ) to support the system message merging Also fix integration tests that weren't passing	2023-12-18 18:35:57 -08:00
Leonid Kuligin	2d0f1cae8c	added history and support for system_message as param (#14824 ) - Description: added support for chat_history for Google GenerativeAI (to actually use the `chat` API). Since Gemini currently doesn't support SystemMessage, support for it is added only if a user provides the additional `convert_system_message_to_human` flag during model initialization (in this case, the SystemMessage is prepended to the first HumanMessage) - Issue: #14710 - Twitter handle: lkuligin --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2023-12-18 18:23:14 -08:00
Leonid Ganeline	2861766d0d	Docs `tencent` pages update (#14879 ) - updated `Tencent` provider page: added a chat model and document loader references; company description - updated Chat model and Document loader pages with descriptions, links - renamed files to consistent formats; redirected file names Note: I was getting this linting error on code that was not changed in my PR! > Error: docs/docs/guides/safety/hugging_face_prompt_injection.ipynb:1:1: I001 Import block is un-sorted or un-formatted > make: *** [Makefile:47: lint_package] Error 1 I've fixed this error in the notebook	2023-12-18 18:21:39 -08:00
Timothy Ji	c5a685b10b	OPENAI_PROXY not working (#14833 ) - Description: OPENAI_PROXY is not working for openai==1.3.9. The `proxies` argument is deprecated; the `http_client` argument should be passed instead. - Issue: OPENAI_PROXY is not working - Dependencies: None - Tag maintainer: @hwchase17 - Twitter handle: timothy66666	2023-12-18 18:06:14 -08:00
Oleksandr Yaremchuk	d82a3828f2	Improve prompt injection detection (#14842 ) - Description: This is an addition to [my previous PR](https://github.com/langchain-ai/langchain/pull/13930), with improvements to flexibility that allow different models and let the notebook use the ONNX runtime for faster speed. Since the last PR, [our model](https://huggingface.co/laiyer/deberta-v3-base-prompt-injection) got more than 660k downloads, and the [public benchmark](https://huggingface.co/spaces/laiyer/prompt-injection-benchmark) showed far fewer false positives than the previous one from deepset. Additionally, on the ONNX runtime, it can run 3x faster on the CPU, which might be handy for builders using Langchain. - Issue: N/A - Dependencies: N/A - Tag maintainer: N/A - Twitter handle: `@laiyer_ai`	2023-12-18 17:50:21 -08:00
Harrison Chase	f8dccaa027	Harrison/agent docs custom (#14877 )	2023-12-18 17:49:32 -08:00
abhjaw	6fbd068b3f	Update kendra.py to avoid Kendra query ValidationException (#14866 ) Fixing issue - https://github.com/langchain-ai/langchain/issues/14494 to avoid Kendra query ValidationException --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-12-18 17:46:18 -08:00
Michael Landis	7b2a68ac72	docs: fix typo in contributing re installing integration test deps (#14861 ) Description The contributing docs lists a poetry command to install community for dev work that includes a poetry group called `integration_tests`. This is a mistake: the poetry group for integration tests is called `test_integration`, not `integration_tests`. See here: https://github.com/langchain-ai/langchain/blob/master/libs/community/pyproject.toml#L119	2023-12-18 17:43:56 -08:00
Bin	07ba030a4e	docs: fixed tiktoken link error (#14840 ) - Description: fixed tiktoken link error, - Issue: no, - Dependencies: no, - Tag maintainer: @baskaryan, - Twitter handle: SignetCode!	2023-12-18 17:16:22 -08:00
Leonid Ganeline	6577b0d987	docstrings `langchain` update (#14870 ) Added missing docstrings	2023-12-18 17:16:08 -08:00
Kane Sweet	ea331f3136	Fix token text splitter duplicates (#14848 ) - Description: - Add a break case to `text_splitter.py::split_text_on_tokens()` to avoid unwanted item at the end of result. - Add a testcase to enforce the behavior. - Issue: - #14649 - #5897 - Dependencies: n/a, --- Quick illustration of change: ``` text = "foo bar baz 123" tokenizer = Tokenizer( chunk_overlap=3, tokens_per_chunk=7 ) output = split_text_on_tokens(text=text, tokenizer=tokenizer) ``` output before change: `["foo bar", "bar baz", "baz 123", "123"]` output after change: `["foo bar", "bar baz", "baz 123"]`	2023-12-18 17:15:57 -08:00
Leonid Ganeline	14d04180eb	docstrings `core` update (#14871 ) Added missing docstrings	2023-12-18 17:13:35 -08:00
Harrison Chase	d2cce54bf1	WIP: sql research assistant (#14240 )	2023-12-18 14:00:18 -08:00
Erick Friis	5f839beab9	community: replace deprecated davinci models (#14860 ) This is technically a breaking change because it'll switch out default models from `text-davinci-003` to `gpt-3.5-turbo-instruct`, but OpenAI is shutting off those endpoints on 1/4 anyways. Feels less disruptive to switch out the default instead.	2023-12-18 13:49:46 -08:00
Harrison Chase	193f107cb5	add methods to deserialize prompts that were old (#14857 )	2023-12-18 13:45:08 -08:00
Bagatur	714bef0cb6	langchain[patch]: Release 0.0.351 (#14867 )	2023-12-18 16:41:48 -05:00
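
The token text splitter fix described in ea331f3136 (#14848) can be illustrated with a minimal sketch. This is not the code from `text_splitter.py`: the function name `split_on_tokens`, the `stop_at_end` flag, and the character-level tokenizer (one token per character) are assumptions made here so that the example reproduces the before and after outputs quoted in that commit message.

```python
from typing import List


def split_on_tokens(
    text: str,
    tokens_per_chunk: int,
    chunk_overlap: int,
    stop_at_end: bool = True,
) -> List[str]:
    """Split text into overlapping chunks of at most tokens_per_chunk tokens.

    Assumption: a hypothetical character-level tokenizer (one token per
    character) stands in for a real tokenizer.
    """
    step = tokens_per_chunk - chunk_overlap
    if step <= 0:
        raise ValueError("chunk_overlap must be smaller than tokens_per_chunk")

    input_ids = [ord(ch) for ch in text]  # "encode"
    splits: List[str] = []
    start = 0
    while start < len(input_ids):
        end = min(start + tokens_per_chunk, len(input_ids))
        splits.append("".join(chr(i) for i in input_ids[start:end]))  # "decode"
        # The break case: once a chunk reaches the end of the input, stop.
        # Without it, the loop emits one more chunk made only of overlap.
        if stop_at_end and end == len(input_ids):
            break
        start += step
    return splits


text = "foo bar baz 123"
before = split_on_tokens(text, tokens_per_chunk=7, chunk_overlap=3, stop_at_end=False)
after = split_on_tokens(text, tokens_per_chunk=7, chunk_overlap=3)
print(before)  # ['foo bar', 'bar baz', 'baz 123', '123']
print(after)   # ['foo bar', 'bar baz', 'baz 123']
```

The trailing `"123"` in the first result consists entirely of tokens already covered by the previous chunk, which is why the break case drops it.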

6500 Commits