langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-31 15:20:26 +00:00

Author	SHA1	Message	Date
Klein Tahiraj	8a0751dadd	adding .ipynb loader and documentation Fixes #1248 (#1252 ) `NotebookLoader.load()` loads the `.ipynb` notebook file into a `Document` object. Parameters: * `include_outputs` (bool): whether to include cell outputs in the resulting document (default is False). * `max_output_length` (int): the maximum number of characters to include from each cell output (default is 10). * `remove_newline` (bool): whether to remove newline characters from the cell sources and outputs (default is False). * `traceback` (bool): whether to include full traceback (default is False).	2023-02-24 07:10:35 -08:00
Harrison Chase	4b5d427421	Harrison/source docs (#1275 ) Co-authored-by: Tushar Dhadiwal <tushardhadiwal@users.noreply.github.com>	2023-02-24 07:09:10 -08:00
Enrico Shippole	9becdeaadf	Add Writer, Banana, Modal, StochasticAI (#1270 ) Add LLM wrappers and examples for Banana, Writer, Modal, Stochastic AI Added rigid json format for Banana and Modal	2023-02-24 06:58:58 -08:00
blob42	5457d48416	searx: add `query_suffix` parameter (#1259 ) - allows building tools that dynamically inject an extra searx suffix into the query. example: `search.run("python library", query_suffix="site:github.com")` resulting query: `python library site:github.com` Co-authored-by: blob42 <spike@w530>	2023-02-23 16:00:40 -08:00
Harrison Chase	9381005098	fix bug with length function (#1257 )	2023-02-23 16:00:15 -08:00
Matt Robinson	10e73a3723	docs: remove nltk download steps (#1253 ) ### Summary Updates the docs to remove the `nltk` download steps from `unstructured`. As of `unstructured` `0.4.14`, this is handled automatically in the relevant modules within `unstructured`.	2023-02-23 12:34:44 -08:00
Justin Torre	5bc6dc076e	added caching and properties docs (#1255 )	2023-02-23 11:03:04 -08:00
Harrison Chase	6d37d089e9	bump version to 0093 (#1251 )	2023-02-23 08:00:42 -08:00
Iskren Ivov Chernev	8e3cd3e0dd	Add DeepInfra LLM support (#1232 ) DeepInfra is an Inference-as-a-Service provider. Add a simple wrapper using HTTPS requests.	2023-02-23 07:37:15 -08:00
Dmitri Melikyan	b7765a95a0	docs: add Graphsignal ecosystem page (#1228 ) Adds a Graphsignal ecosystem page	2023-02-23 07:33:00 -08:00
Satoru Sakamoto	d480330fae	fix to specific language transcript (#1231 ) Currently youtube loader only seems to support English audio. Changed to load videos in the specified language.	2023-02-23 07:32:46 -08:00
Harrison Chase	6085fe18d4	add ifttt tool (#1244 )	2023-02-22 22:29:43 -08:00
Jon Luo	8a35811556	Don't instruct LLM to use the LIMIT clause, which is incompatible with SQL Server (#1242 ) The current prompt specifically instructs the LLM to use the `LIMIT` clause. This will cause issues with MS SQL Server, which uses `SELECT TOP` instead of `LIMIT`. The generated SQL will use `LIMIT`; the instruction to "always limit... using the LIMIT clause" seems to override the "create a syntactically correct mssql query to run" portion. Reported here: https://github.com/hwchase17/langchain/issues/1103#issuecomment-1441144224 I don't have access to a SQL Server instance to test, but removing that part of the prompt in OpenAI Playground results in the correct `SELECT TOP` syntax, whereas keeping it in results in the `LIMIT` clause, even when instructing it to generate syntactically correct mssql. It's also still correctly using `LIMIT` in my MariaDB database. I think in this case we can assume that the model will select the appropriate method based on the dialect specified. In general, it would be nice to be able to test a suite of SQL dialects for things like dialect-specific syntax and other issues we've run into in the past, but I'm not quite sure how to best approach that yet.	2023-02-22 22:21:26 -08:00
Harrison Chase	71709ad5d5	Update key_concepts.md (#1209 ) (#1237 ) Link for easier navigation (it's not immediately clear where to find more info on SimpleSequentialChain (3 clicks away) --------- Co-authored-by: Larry Fisherman <l4rryfisherman@protonmail.com>	2023-02-22 13:30:53 -08:00
Dennis Antela Martinez	53c67e04d4	add aleph alpha llm (#1207 ) Integrate Aleph Alpha's client into Langchain to provide access to the luminous models - more info on latest benchmarks here: https://www.aleph-alpha.com/luminous-performance-benchmarks	2023-02-22 10:37:36 -08:00
Klein Tahiraj	c6ab1bb3cb	Fixing typo in loading.py (#1235 ) Just fixing a typo I found in loading.py	2023-02-22 10:36:14 -08:00
Ikko Eltociear Ashimine	334b553260	Update petals.md (#1225 ) Huggingface -> Hugging Face	2023-02-22 10:34:16 -08:00
Jon Luo	ac1320aae8	fix sqlite internal tables breaking table_info (#1224 ) With the current method used to get the SQL table info, sqlite internal schema tables are being included and are not being handled correctly by sqlalchemy because the columns have no types. This is easy to see with the Chinook database: ```python db = SQLDatabase.from_uri("sqlite:///Chinook.db") print(db.table_info) ``` ```python ... sqlalchemy.exc.CompileError: (in table 'sqlite_sequence', column 'name'): Can't generate DDL for NullType(); did you forget to specify a type on this Column? ``` SQLAlchemy 2.0 [ignores these by default](`63d90b0f44/lib/sqlalchemy/dialects/sqlite/base.py (L856-L880)`): `63d90b0f44/lib/sqlalchemy/dialects/sqlite/base.py (L2096-L2123)`	2023-02-22 10:34:05 -08:00
djacobs7	4e28982d2b	Fix typo in constitutional_ai base.py (#1216 ) Found a typo in the documentation code for the constitutional_ai module	2023-02-21 17:03:44 -08:00
Sason	cc7d2e5621	Correct typo in "Question Answering" How-To Guide (#1221 )	2023-02-21 17:02:58 -08:00
blob42	424e71705d	searx: remove duplicate param (#1219 ) Co-authored-by: blob42 <spike@w530>	2023-02-21 17:02:42 -08:00
Harrison Chase	4e43b0efe9	bump version 0092 (#1204 )	2023-02-21 08:56:07 -08:00
Matt Robinson	3d5f56a8a1	docs: add quotes to `unstructured[local-inference]` install instructions (#1208 ) ### Summary Corrects the install instruction for local inference to `pip install "unstructured[local-inference]"`	2023-02-21 08:06:43 -08:00
Harrison Chase	047231840d	add docs for chroma persistence (#1202 )	2023-02-20 23:04:17 -08:00
Harrison Chase	5bdb8dd6fe	Harrison/unstructured io (#1200 )	2023-02-20 22:54:49 -08:00
Harrison Chase	d90a287d8f	Harrison/updating docs (#1196 )	2023-02-20 22:54:26 -08:00
Harrison Chase	b7708bbec6	rfc: callback changes (#1165 ) conceptually, no reason a tool should know what an "agent action" is unless any objections, can change in all callback handlers	2023-02-20 22:54:15 -08:00
Harrison Chase	fb83cd4ff4	catch networkx error (#1201 )	2023-02-20 21:43:02 -08:00
Harrison Chase	44c8d8a9ac	move serpapi wrapper (#1199 ) Co-authored-by: Tim Asp <707699+timothyasp@users.noreply.github.com>	2023-02-20 21:15:45 -08:00
Konstantin Hebenstreit	af94f1dd97	HuggingFaceEndpoint: Correct Example for ImportError (#1176 ) When I try to import the Class HuggingFaceEndpoint I get an Import Error: cannot import name 'HuggingFaceEndpoint' from 'langchain'. (langchain version 0.0.88) These two imports work fine: from langchain import HuggingFacePipeline and from langchain import HuggingFaceHub. So I corrected the import statement in the example. There is probably a better solution to this, but this fixes the Error for me.	2023-02-20 21:09:39 -08:00
Harrison Chase	0c84ce1082	Harrison/add documents (#1197 ) Co-authored-by: OmriNach <32659330+OmriNach@users.noreply.github.com>	2023-02-20 21:02:28 -08:00
Francisco Ingham	0b6a650cb4	added ability to override default verbose and memory when load chain … (#1153 ) It is useful to be able to specify `verbose` or `memory` while still keeping the chain's overall structure. --------- Co-authored-by: Francisco Ingham <>	2023-02-20 21:00:32 -08:00
Anton Troynikov	d2ef5d6167	Default Chroma collection name (#1198 ) For persistence, it's convenient to have a default collection name which gets used everywhere.	2023-02-20 20:59:34 -08:00
Dennis Antela Martinez	23243ae69c	add gitbook document loader (#1180 ) Added a GitBook document loader. It lets you either (1) fetch text from any single GitBook page, or (2) fetch all relative paths and return their respective content in Documents. I've modified the `scrape` method in the `WebBaseLoader` to accept custom web paths if given, but happy to remove it and move that logic into the `GitbookLoader` itself.	2023-02-20 20:05:04 -08:00
William FH	13ba0177d0	Add a StdIn "Interaction" Tool (#1193 ) Lets a chain prompt the user for more input as a part of its execution.	2023-02-20 18:40:02 -08:00
Naveen Tatikonda	0118706fd6	Add Support for OpenSearch Vector database (#1191 ) ### Description This PR adds a wrapper which adds support for the OpenSearch vector database. Using the opensearch-py client, we ingest the embeddings of the given text into an OpenSearch cluster using the Bulk API. We can perform the `similarity_search` on the index using the 3 popular search methods of the OpenSearch k-NN plugin: - `Approximate k-NN Search` uses approximate nearest neighbor (ANN) algorithms from the [nmslib](https://github.com/nmslib/nmslib), [faiss](https://github.com/facebookresearch/faiss), and [Lucene](https://lucene.apache.org/) libraries to power k-NN search. - `Script Scoring` extends OpenSearch’s script scoring functionality to execute a brute force, exact k-NN search. - `Painless Scripting` adds the distance functions as painless extensions that can be used in more complex combinations. It also supports brute force, exact k-NN search like Script Scoring. ### Issues Resolved https://github.com/hwchase17/langchain/issues/1054 --------- Signed-off-by: Naveen Tatikonda <navtat@amazon.com>	2023-02-20 18:39:34 -08:00
Andrew White	c5015d77e2	Allow k to be higher than doc size in max_marginal_relevance_search (#1187 ) Fixes issue #1186. For some reason, #1117 didn't seem to fix it.	2023-02-20 16:39:13 -08:00
Zach Schillaci	159c560c95	Refactor some loops into list comprehensions (#1185 )	2023-02-20 16:38:43 -08:00
Harrison Chase	926c121b98	Harrison/text splitter docs (#1188 )	2023-02-20 15:14:03 -08:00
Harrison Chase	91446a5e9b	clean up text splitting docs (#1184 )	2023-02-20 11:24:31 -08:00
Harrison Chase	a5a14405ad	bump version to 0091 (#1181 )	2023-02-20 08:53:45 -08:00
Harrison Chase	5a954efdd7	update gallery with slack bot (#1177 )	2023-02-20 08:21:00 -08:00
Harrison Chase	4766b20223	clean up loaders (#1178 )	2023-02-20 08:20:48 -08:00
blob42	9962bda70b	searx_search: docs updates (#1175 ) - fix notebook formatting, remove empty cells and add scrolling for long text --------- Co-authored-by: blob42 <spike@w530>	2023-02-20 06:46:44 -08:00
Harrison Chase	4f3fbd7267	improve docs for indexes (#1146 )	2023-02-19 23:14:50 -08:00
Harrison Chase	28781a6213	Harrison/markdown splitter (#1169 ) Co-authored-by: Michael Chen <flamingdescent@gmail.com> Co-authored-by: Michael Chen <michaelchen@stripe.com>	2023-02-19 21:31:58 -08:00
Harrison Chase	37dd34bea5	fix path (#1168 )	2023-02-19 21:28:49 -08:00
Nan Wang	e8f224fd3a	docs: add missing links to toc (#1163 ) add missing links to toc --------- Signed-off-by: Nan Wang <nan.wang@jina.ai>	2023-02-19 21:15:11 -08:00
Nick	afe884fb96	AI21 documentation incorrectly titled Cohere (#1167 )	2023-02-19 21:14:59 -08:00
Ji	ed37fbaeff	for ChatVectorDBChain, add top_k_docs_for_context to control how many chunks of context are retrieved (#1155 ) Given that users can already define the chunk size, it would be useful to let them define how many chunks of context are retrieved.	2023-02-19 20:48:23 -08:00
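
Commit ac1320aae8 (#1224) in the log above describes SQLite's internal schema tables (such as `sqlite_sequence`) leaking into the table listing. The sketch below illustrates the general idea of filtering them out, using only Python's standard `sqlite3` module. It is not the code from that commit: the helper name `user_table_names` and the `album`/`artist` tables are hypothetical, and the commit itself works through SQLAlchemy rather than raw `sqlite3`.

```python
import sqlite3


def user_table_names(conn: sqlite3.Connection) -> list[str]:
    """Return table names, skipping SQLite's internal `sqlite_*` tables.

    Hypothetical helper for illustration; not part of langchain.
    """
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    # SQLite reserves the `sqlite_` prefix for its own internal tables.
    return [name for (name,) in rows if not name.startswith("sqlite_")]


conn = sqlite3.connect(":memory:")
# AUTOINCREMENT makes SQLite create the internal `sqlite_sequence` table.
conn.execute("CREATE TABLE album (id INTEGER PRIMARY KEY AUTOINCREMENT, title TEXT)")
conn.execute("CREATE TABLE artist (id INTEGER PRIMARY KEY, name TEXT)")

# Unfiltered listing: includes the internal table alongside the user tables.
all_names = [
    name
    for (name,) in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
]
print(sorted(all_names))
# Filtered listing: user tables only.
print(user_table_names(conn))
```

Filtering on the reserved `sqlite_` prefix keeps the check independent of which internal tables a particular database happens to contain.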

720 Commits