langchain

Commit Graph

Author	SHA1	Message	Date
Harrison Chase	c96ac3e591	Harrison/semantic subset (#1079 ) Co-authored-by: Chen Wu (吴尘) <henrychenwu@cmu.edu>	2 years ago
Harrison Chase	19c2797bed	add anthropic example (#1041 ) Co-authored-by: Ivan Vendrov <ivendrov@gmail.com> Co-authored-by: Sasmitha Manathunga <70096033+mmz-001@users.noreply.github.com>	2 years ago
blob42	3ecdea8be4	SearxNG meta search api helper (#854 ) This is a work in progress PR to track my progres. ## TODO: - [x] Get results using the specifed searx host - [x] Prioritize returning an `answer` or results otherwise - [ ] expose the field `infobox` when available - [ ] expose `score` of result to help agent's decision - [ ] expose the `suggestions` field to agents so they could try new queries if no results are found with the orignial query ? - [ ] Dynamic tool description for agents ? - Searx offers many engines and a search syntax that agents can take advantage of. It would be nice to generate a dynamic Tool description so that it can be used many times as a tool but for different purposes. - [x] Limit number of results - [ ] Implement paging - [x] Miror the usage of the Google Search tool - [x] easy selection of search engines - [x] Documentation - [ ] update HowTo guide notebook on Search Tools - [ ] Handle async - [ ] Tests ### Add examples / documentation on possible uses with - [ ] getting factual answers with `!wiki` option and `infoboxes` - [ ] getting `suggestions` - [ ] getting `corrections` --------- Co-authored-by: blob42 <spike@w530> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2 years ago
Hasegawa Yuya	e08961ab25	Fixed openai embeddings to be safe by batching them based on token size calculation. (#991 ) I modified the logic of the batch calculation for embedding according to this cookbook https://github.com/openai/openai-cookbook/blob/main/examples/Embedding_long_inputs.ipynb	2 years ago
seanaedmiston	f0a258555b	Support similarity search by vector (in FAISS) (#961 ) Alternate implementation to PR #960 Again - only FAISS is implemented. If accepted can add this to other vectorstores or leave as NotImplemented? Suggestions welcome...	2 years ago
Jonathan Pedoeem	05ad399abe	Update PromptLayerOpenAI LLM to include support for ASYNC API (#1066 ) This PR updates `PromptLayerOpenAI` to now support requests using the [Async API](https://langchain.readthedocs.io/en/latest/modules/llms/async_llm.html) It also updates the documentation on Async API to let users know that PromptLayerOpenAI also supports this. `PromptLayerOpenAI` now redefines `_agenerate` a similar was to how it redefines `_generate`	2 years ago
Harrison Chase	98186ef180	Harrison/evernote nb (#1078 ) Co-authored-by: Akshay <64036106+akshayvkt@users.noreply.github.com>	2 years ago
rogerserper	e46cd3b7db	Google Search API integration with serper.dev (wrapper, tests, docs, … (#909 ) Adds Google Search integration with [Serper](https://serper.dev) a low-cost alternative to SerpAPI (10x cheaper + generous free tier). Includes documentation, tests and examples. Hopefully I am not missing anything. Developers can sign up for a free account at [serper.dev](https://serper.dev) and obtain an api key. ## Usage ```python from langchain.utilities import GoogleSerperAPIWrapper from langchain.llms.openai import OpenAI from langchain.agents import initialize_agent, Tool import os os.environ["SERPER_API_KEY"] = "" os.environ['OPENAI_API_KEY'] = "" llm = OpenAI(temperature=0) search = GoogleSerperAPIWrapper() tools = [ Tool( name="Intermediate Answer", func=search.run ) ] self_ask_with_search = initialize_agent(tools, llm, agent="self-ask-with-search", verbose=True) self_ask_with_search.run("What is the hometown of the reigning men's U.S. Open champion?") ``` ### Output ``` Entering new AgentExecutor chain... Yes. Follow up: Who is the reigning men's U.S. Open champion? Intermediate answer: Current champions Carlos Alcaraz, 2022 men's singles champion. Follow up: Where is Carlos Alcaraz from? Intermediate answer: El Palmar, Spain So the final answer is: El Palmar, Spain > Finished chain. 'El Palmar, Spain' ```	2 years ago
Harrison Chase	52753066ef	Harrison/handle stop tokens ai21 (#1077 ) Co-authored-by: Andrew Huang <jhuang16888@gmail.com>	2 years ago
Akshay	d8ed286200	Update and rename everynote.py to evernote.py (#1060 ) Updating this base file as well as the .ipynb file of the example on the website: https://github.com/hwchase17/langchain/compare/master...akshayvkt:langchain:patch-1 https://langchain.readthedocs.io/en/latest/modules/document_loaders/examples/everynote.html	2 years ago
Jeff Huber	34cba2da32	Fix typo in integration with Chroma (#1070 ) We introduced a breaking change but missed this call. This PR fixes `langchain` to work with upstream `chroma`.	2 years ago
Jonathan Pedoeem	05df480376	Update `PromptLayerOpenAI` LLM usage instructions in documentation (#1053 ) This PR updates the usage instructions for PromptLayerOpenAI in Langchain's documentation. The updated instructions provide more detail and conform better to the style of other LLM integration documentation pages. No code changes were made in this PR, only improvements to the documentation. This update will make it easier for users to understand how to use `PromptLayerOpenAI`	2 years ago
Matt Robinson	3ea1e5af1e	feat: added element metadata to unstructured loader (#1068 ) ### Summary Adds tracked metadata from `unstructured` elements to the document metadata when `UnstructuredFileLoader` is used in `"elements"` mode. Tracked metadata is available in `unstructured>=0.4.9`, but the code is written for backward compatibility with older `unstructured` versions. ### Testing Before running, make sure to upgrade to `unstructured==0.4.9`. In the code snippet below, you should see `page_number`, `filename`, and `category` in the metadata for each document. `doc[0]` should have `page_number: 1` and `doc[-1]` should have `page_number: 2`. The example document is `layout-parser-paper-fast.pdf` from the [`unstructured` sample docs](https://github.com/Unstructured-IO/unstructured/tree/main/example-docs). ```python from langchain.document_loaders import UnstructuredFileLoader loader = UnstructuredFileLoader(file_path=f"layout-parser-paper-fast.pdf", mode="elements") docs = loader.load() ```	2 years ago
Harrison Chase	bac676c8e7	bump version (#1057 )	2 years ago
Ankush Gola	d8ac274fc2	add to async chain notebook (#1056 )	2 years ago
Ankush Gola	caa8e4742e	Enable streaming for OpenAI LLM (#986 ) * Support a callback `on_llm_new_token` that users can implement when `OpenAI.streaming` is set to `True`	2 years ago
Harrison Chase	f05f025e41	bump version to 0086 (#1050 )	2 years ago
Sasmitha Manathunga	c67c5383fd	docs: fix typo in notebook (#1046 )	2 years ago
Harrison Chase	88bebb4caa	Harrison/llm integrations (#1039 ) Co-authored-by: jped <jonathanped@gmail.com> Co-authored-by: Justin Torre <justintorre75@gmail.com> Co-authored-by: Ivan Vendrov <ivan@anthropic.com>	2 years ago
Harrison Chase	ec727bf166	Align table info (#999 ) (#1034 ) Currently the chain is getting the column names and types on the one side and the example rows on the other. It is easier for the llm to read the table information if the column name and examples are shown together so that it can easily understand to which columns do the examples refer to. For an instantiation of this, please refer to the changes in the `sqlite.ipynb` notebook. Also changed `eval` for `ast.literal_eval` when interpreting the results from the sample row query since it is a better practice. --------- Co-authored-by: Francisco Ingham <> --------- Co-authored-by: Francisco Ingham <fpingham@gmail.com>	2 years ago
Harrison Chase	8c45f06d58	Harrison/standarize prompt loading (#1036 ) Co-authored-by: Ibis Prevedello <ibiscp@gmail.com>	2 years ago
Enrico Shippole	f30dcc6359	Add GooseAI, CerebriumAI, Petals, ForefrontAI (#981 ) Add GooseAI, CerebriumAI, Petals, ForefrontAI	2 years ago
Anton Troynikov	d43d430d86	Chroma persistence (#1028 ) This PR adds persistence to the Chroma vector store. Users can supply a `persist_directory` with any of the `Chroma` creation methods. If supplied, the store will be automatically persisted at that directory. If a user creates a new `Chroma` instance with the same persistence directory, it will get loaded up automatically. If they use `from_texts` or `from_documents` in this way, the documents will be loaded into the existing store. There is the chance of some funky behavior if the user passes a different embedding function from the one used to create the collection - we will make this easier in future updates. For now, we log a warning.	2 years ago
Harrison Chase	012a6dfb16	Harrison/makefile (#1033 ) Co-authored-by: blob42 <contact@blob42.xyz> Co-authored-by: blob42 <spike@w530>	2 years ago
Harrison Chase	6a31a59400	add links (#1027 )	2 years ago
Oliver Klingefjord	20889205e8	Added retry for openai.error.ServiceUnavailableError (#1022 ) Imho retries should be performed for ServiceUnavailableError (which tends to happen to me quite often).	2 years ago
Harrison Chase	fc2502cd81	bump version to 0085 (#1017 )	2 years ago
Harrison Chase	0f0e69adce	agent refactors (#997 )	2 years ago
Harrison Chase	7fb33fca47	chroma docs (#1012 )	2 years ago
Harrison Chase	0c553d2064	Harrion/kg (#1016 ) Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2 years ago
Anton Troynikov	78abd277ff	Chroma in LangChain (#1010 ) Chroma is a simple to use, open-source, zero-config, zero setup vectorstore. Simply `pip install chromadb`, and you're good to go. Out-of-the-box Chroma is suitable for most LangChain workloads, but is highly flexible. I tested to 1M embs on my M1 mac, with out issues and reasonably fast query times. Look out for future releases as we integrate more Chroma features with LangChain!	2 years ago
cragwolfe	05d8969c79	Unstructured example notebook: add a pdf, related deps (#1011 ) Updates the Unstructured example notebook with a PDF example. Includes additional dependencies for PDF processing (and images, etc).	2 years ago
Dhruv Anand	03e5794978	typo fix on chat vector db docs (#1007 ) simple typo fix: because --> between	2 years ago
Harrison Chase	6d44a2285c	bump version to 0084 (#1005 )	2 years ago
Harrison Chase	0998577dfe	Harrison/unstructured structured (#1004 )	2 years ago
Harrison Chase	bbb06ca4cf	pdfminer (#1003 )	2 years ago
Francisco Ingham	0b6aa6a024	Added initial capital letter to bullet points that had it missing (#1000 ) Co-authored-by: Francisco Ingham <>	2 years ago
Harrison Chase	10e7297306	Harrison/fake llm (#990 ) Co-authored-by: Stefan Keselj <skeselj@princeton.edu> Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net>	2 years ago
Harrison Chase	e51fad1488	Harrison/0083 (#996 ) Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net>	2 years ago
Shahriar Tajbakhsh	b7747017d7	Import of `declarative_base` when SQLAlchemy <1.4 (#883 ) In [pyproject.toml](https://github.com/hwchase17/langchain/blob/master/pyproject.toml), the expectation is `SQLAlchemy = "^1"`. But, the way `declarative_base` is imported in [cache.py](https://github.com/hwchase17/langchain/blob/master/langchain/cache.py) will only work with SQLAlchemy >=1.4. This PR makes sure Langchain can be run in environments with SQLAlchemy <1.4	2 years ago
Harrison Chase	2e96704d59	Harrison/airbyte (#989 ) Co-authored-by: zanderchase <zanderchase@gmail.com> Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MacBook-Pro.local>	2 years ago
Charles Frye	e9799d6821	improves huggingface_hub example (#988 ) The provided example uses the default `max_length` of `20` tokens, which leads to the example generation getting cut off. 20 tokens is way too short to show CoT reasoning, so I boosted it to `64`. Without knowing HF's API well, it can be hard to figure out just where those `model_kwargs` come from, and `max_length` is a super critical one.	2 years ago
zanderchase	c2d1d903fa	Zander/online pdf loader (#984 )	2 years ago
Harrison Chase	055a53c27f	add texts example (#985 ) Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MacBook-Pro.local>	2 years ago
Harrison Chase	231da14771	bump version to 0082 (#980 ) Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MacBook-Pro.local>	2 years ago
jeff	6ab432d62e	docs: update spelling typos (#982 ) Wonder why "with" is spelled "wiht" so many times by human	2 years ago
Matt Robinson	07a407d89a	feat: adds `UnstructuredURLLoader` for loading data from urls (#979 ) ### Summary Adds a `UnstructuredURLLoader` that supports loading data from a list of URLs. ### Testing ```python from langchain.document_loaders import UnstructuredURLLoader urls = [ "https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-8-2023", "https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-9-2023" ] loader = UnstructuredURLLoader(urls=urls) raw_documents = loader.load() ```	2 years ago
Harrison Chase	c64f98e2bb	Harrison/format agent instructions (#973 ) Co-authored-by: Andrew White <white.d.andrew@gmail.com> Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net> Co-authored-by: Peng Qu <82029664+pengqu123@users.noreply.github.com>	2 years ago
Harrison Chase	5469d898a9	Harrison/everynote (#974 ) Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net>	2 years ago
Harrison Chase	3d639d1539	update lint (#975 ) Co-authored-by: Harrison Chase <harrisonchase@Harrisons-MBP.attlocal.net>	2 years ago

... 66 67 68 69 70 ...

3935 Commits (2b663089b5f6f16890c134d14981db7a0eb446ba) All Branches Search

3935 Commits (2b663089b5f6f16890c134d14981db7a0eb446ba)

All Branches