langchain

Author	SHA1	Message	Date
Tim Asp	fea5619ce9	Add title, lang, description to Web loader document metadata (#2955 ) Title, lang and description are on almost every web page, and are incredibly useful pieces of information that currently isn't captured with the current web base loader I thought about adding the title and description to the content of the document, as that content could be useful in search, but I left it out for right now. If you think it'd be worth adding, happy to add it. I've found it's nice to have the title/description in the metadata to have some structured data when retrieving rows from vectordbs for use with summary and source citation, so if we do want to add it to the `page_content`, i'd advocate for it to also be included in metadata.	2023-04-15 16:07:08 -07:00
Maciej Pióro	f7bf917baf	Fix missing docker-compose (#2899 ) Fix missing `docker-compose` command if only `docker compose` (note space) is available.	2023-04-15 16:05:11 -07:00
Harrison Chase	b634489b2e	bump version to 141 (#2950 )	2023-04-15 12:56:39 -07:00
Harrison Chase	274b25c010	SVM retriever (#2947 ) (#2949 ) Add SVM retriever class, based on https://github.com/karpathy/randomfun/blob/master/knn_vs_svm.ipynb. Testing still WIP, but the logic is correct (I have a local implementation outside of Langchain working). --------- Co-authored-by: Lance Martin <122662504+PineappleExpress808@users.noreply.github.com> Co-authored-by: rlm <31treehaus@31s-MacBook-Pro.local>	2023-04-15 12:49:59 -07:00
Harrison Chase	baf350e32b	parametrize redis (#2946 )	2023-04-15 12:47:36 -07:00
dev2049	36aa7f30e4	Move PythonRepl -> langchain.utilities (#2917 )	2023-04-15 10:50:25 -07:00
dev2049	7c73e9df5d	Add kwargs to VectorStore.maximum_marginal_relevance (#2921 ) Same as similarity_search, allows child classes to add vector store-specific args (this was technically already happening in couple places but now typing is correct).	2023-04-15 10:49:49 -07:00
Davit Buniatyan	b3a5b51728	[minor] Deep Lake auth improvements in docs, kwargs pass, faster tests (#2927 ) Minor cosmetic changes - Activeloop environment cred authentication in notebooks with `getpass.getpass` (instead of CLI which not always works) - much faster tests with Deep Lake pytest mode on - Deep Lake kwargs pass Notes - I put pytest environment creds inside `vectorstores/conftest.py`, but feel free to suggest a better location. For context, if I put in `test_deeplake.py`, `ruff` doesn't let me to set them before import deeplake --------- Co-authored-by: Davit Buniatyan <d@activeloop.ai>	2023-04-15 10:49:16 -07:00
Harrison Chase	c4ae8c1d24	bump ver to 140 (#2895 )	2023-04-15 09:23:19 -07:00
Nahin Khan	ad3973a3b8	Fix typo (#2942 )	2023-04-15 08:53:25 -07:00
Harrison Chase	cf2789d86d	delete antropic chat notebook (#2945 )	2023-04-15 08:48:51 -07:00
Hai Nguyen Mau	0aa828b1dc	typo fix (#2937 ) missing w in link	2023-04-15 08:31:43 -07:00
Ankush Gola	ec59e9d886	Fix ChatAnthropic stop_sequences error (#2919 ) (#2920 ) Note to self: Always run integration tests, even on "that last minute change you thought would be safe" :) --------- Co-authored-by: Mike Lambert <mike.lambert@anthropic.com>	2023-04-14 17:22:01 -07:00
Akash NP	13a0ed064b	add encoding to avoid UnicodeDecodeError (#2908 ) About Specify encoding to avoid UnicodeDecodeError when reading .txt for users who are following the tutorial. Reference ``` return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 1205: character maps to <undefined> ``` Environment OS: Win 11 Python: 3.8	2023-04-14 16:36:03 -07:00
Mike Lambert	392f1b3218	Add Anthropic ChatModel to langchain (#2293 ) * Adds an Anthropic ChatModel * Factors out common code in our LLMModel and ChatModel * Supports streaming llm-tokens to the callbacks on a delta basis (until a future V2 API does that for us) * Some fixes	2023-04-14 15:09:07 -07:00
Kwuang Tang	66bef1d7ed	Ignore files from .gitignore in Git loader (#2909 ) fixes #2905 extends #2851	2023-04-14 15:02:21 -07:00
Boris Feld	7ee87eb0c8	Comet callback updates (#2889 ) I'm working with @DN6 and I made some small fixes and improvements after playing with the integration.	2023-04-14 13:19:58 -07:00
dev2049	634358db5e	Fix OpenAI LLM docstring (#2910 )	2023-04-14 11:09:36 -07:00
pranjaldoshi96	30573b2e30	Correct instruction to use openweathermap utility in docstring (#2906 ) Co-authored-by: Pranjal Doshi <pranjald@nvidia.com>	2023-04-14 10:46:20 -07:00
Kwuang Tang	a508afa91c	Add file filter param to Git loader (#2904 ) Allows users to specify what files should be loaded instead of indiscriminately loading the entire repo. extends #2851 NOTE: for reviewers, `hide whitespace` option recommended since I changed the indentation of an if-block to use `continue` instead so it looks less like a Christmas tree :)	2023-04-14 10:45:54 -07:00
Ismail Pelaseyed	7e525a3b91	Add link to repo for deploying LangChain to Digitalocean App Platform (#2894 ) This PR adds a link to a minimal example of deploying `LangChain` to `Digitalocean App Platform`.	2023-04-14 08:55:21 -07:00
Peter Stolz	ccacf804a8	Fix format string in pinecone error handling (#2897 )	2023-04-14 08:53:02 -07:00
Francis Felici	86189cdcf9	Update load_qa_chain() docstring (#2900 ) Seems to be missing `map_rerank` as a potential argument of `chain_type`	2023-04-14 08:51:30 -07:00
Harrison Chase	8fef69296d	nits (#2873 )	2023-04-14 07:55:12 -07:00
Harrison Chase	0a38bbc750	updates to vectorstore memory (#2875 )	2023-04-14 07:54:57 -07:00
Ikko Eltociear Ashimine	203c0eb2ae	docs: update getting_started.ipynb (#2883 ) HuggingFace -> Hugging Face	2023-04-14 07:40:26 -07:00
ecneladis	1a44b71ddf	Fix Baby AGI notebooks (#2882 ) - fix broken notebook cell in `ae485b623d` - Python Black formatting	2023-04-14 07:40:04 -07:00
Nicolas	3c7204d604	docs: Quick fix to Mendable Search (#2876 ) Fixed a small issue on the icon UI when using in Safari.	2023-04-13 23:15:57 -07:00
Harrison Chase	1e9378d0a8	Harrison/weaviate fixes (#2872 ) Co-authored-by: cs0lar <cristiano.solarino@gmail.com> Co-authored-by: cs0lar <cristiano.solarino@brightminded.com>	2023-04-13 22:37:34 -07:00
Harrison Chase	07d7096de6	Harrison/playwright (#2871 ) Co-authored-by: Manuel Saelices <msaelices@gmail.com>	2023-04-13 22:15:03 -07:00
Jon Luo	5565f56273	Use SQL dialect-specific prompts for SQLDatabaseChain (#2748 ) Mentioned the idea here initially: https://github.com/hwchase17/langchain/pull/2106#issuecomment-1487509106 Since there have been dialect-specific issues, we should use dialect-specific prompts. This way, each prompt can be separately modified to best suit each dialect as needed. This adds a prompt for each dialect supported in sqlalchemy (mssql, mysql, mariadb, postgres, oracle, sqlite). For this initial implementation, the only differencse between the prompts is the instruction for the clause to use to limit the number of rows queried for, and the instruction for wrapping column names using each dialect's identifier quote character.	2023-04-13 22:10:49 -07:00
drod	9907cb0485	Refactor similarity_search function in elastic_vector_search.py (#2761 ) Optimization :Limit search results when k < 10 Fix issue when k > 10: Elasticsearch will return only 10 docs [default-search-result](https://www.elastic.co/guide/en/elasticsearch/reference/current/paginate-search-results.html) By default, searches return the top 10 matching hits Add size parameter to the search request to limit the number of returned results from Elasticsearch. Remove slicing of the hits list, since the response will already contain the desired number of results.	2023-04-13 22:09:00 -07:00
rafael	1cc7ea333c	chat_models.openai: Set tenacity timeout to openai's recommendation (#2768 ) [OpenAI's cookbook](https://github.com/openai/openai-cookbook/blob/main/examples/How_to_handle_rate_limits.ipynb) suggest a tenacity backoff between 1 and 60 seconds. Currently langchain's backoff is between 4 and 10 seconds, which causes frequent timeout errors on my end. This PR changes the timeout to the suggested values.	2023-04-13 22:08:46 -07:00
Harrison Chase	705596b46a	Harrison/fix create sql agent (#2870 ) Co-authored-by: Timothé Pearce <timothe.pearce@gmail.com>	2023-04-13 22:07:58 -07:00
Harrison Chase	8a98e5b50b	Harrison/index name (#2869 ) Co-authored-by: Mesum Raza Hemani <mes.javacca@gmail.com>	2023-04-13 22:01:32 -07:00
Andrey Vasnetsov	dcb17503f2	Update qdrant.py (#2750 ) At the moment of upload we should already know the format of data, therefore we can skip the costly pydantic validation.	2023-04-13 21:57:05 -07:00
ecneladis	74abeb8c53	Update output in Git notebook (#2868 ) Supplemental to https://github.com/hwchase17/langchain/pull/2851. Updates one notebook cell that I forgot to commit before.	2023-04-13 21:56:17 -07:00
Nicolas	0226b375d9	docs: Mendable Search integration (#2803 ) Mendable Seach Integration is Finally here! Hey yall, After various requests for Mendable in Python docs, we decided to get our hands dirty and try to implement it. Here is a version where we implement our floating button that sits on the bottom right of the screen that once triggered (via press or CMD K) will work the same as the js langchain docs. Super excited about this and hopefully the community will be too. @hwchase17 will send you the admin details via dm etc. The anon_key is fine to be public. Let me know if you need any further customization. I added the langchain logo to it.	2023-04-13 21:52:25 -07:00
sergerdn	04c458a270	feat: improve pinecone tests (#2806 ) Improve the integration tests for Pinecone by adding an `.env.example` file for local testing. Additionally, add some dev dependencies specifically for integration tests. This change also helps me understand how Pinecone deals with certain things, see related issues https://github.com/hwchase17/langchain/issues/2484 https://github.com/hwchase17/langchain/issues/2816	2023-04-13 21:49:31 -07:00
ecneladis	016738e676	Add GitLoader (#2851 )	2023-04-13 21:39:20 -07:00
lizelive	8cfec2c5fe	torch 2 support (#2865 ) Lang-chain seems to work with torch 2	2023-04-13 21:38:49 -07:00
vowelparrot	bf0887c486	Add Slack Directory Loader (#2841 ) Fixes linting issue from #2835 Adds a loader for Slack Exports which can be a very valuable source of knowledge to use for internal QA bots and other use cases. ```py # Export data from your Slack Workspace first. from langchain.document_loaders import SLackDirectoryLoader SLACK_WORKSPACE_URL = "https://awesome.slack.com" loader = ("Slack_Exports", SLACK_WORKSPACE_URL) docs = loader.load() ```	2023-04-13 21:31:59 -07:00
Harrison Chase	ed2ef5cbe4	Harrison/rwkv utf8 (#2867 ) Co-authored-by: Akihiro <ueyama0105@gmail.com>	2023-04-13 21:31:18 -07:00
Adam McCabe	6be5d7c612	Update reduce_openapi_spec for PATCH and DELETE (#2861 ) My recent pull request (#2729) neglected to update the `reduce_openapi_spec` in spec.py to also accommodate PATCH and DELETE added to planner.py and prompt_planner.py.	2023-04-13 20:27:40 -07:00
Benjamin Tan Wei Hao	c26a259ba6	Fix tiny typo (#2863 )	2023-04-13 20:26:26 -07:00
Jon Luo	f3180f05f9	Update sql chain notebook to clarify use of SQLAlchemy for connections (#2850 ) Have seen questions about whether or not the `SQLDatabaseChain` supports more than just sqlite, which was unclear in the docs, so tried to clarify that and how to connect to other dialects.	2023-04-13 11:46:59 -07:00
leo-gan	ecc1a0c051	added code-analysis-deeplake.ipynb (#2844 ) This notebook is heavily copied from the `twitter-the-algorithm-analysis-deeplake.ipynb`	2023-04-13 11:29:59 -07:00
Tim Asp	70ffe470aa	Add easy print method to openai callback (#2848 ) Found myself constantly copying the snippet outputting all the callback tracking details. so adding a simple way to output the full context	2023-04-13 11:28:42 -07:00
Tim Asp	be4fb24b32	OpenAI LLM: update `modelname_to_contextsize` with new models (#2843 ) Token counts pulled from https://openai.com/pricing	2023-04-13 11:13:34 -07:00
vowelparrot	82d1d5f24e	Fix grammar in Vector Memory Docs (#2847 )	2023-04-13 11:00:09 -07:00

... 3 4 5 6 7 ...

1522 Commits