langchain

Commit Graph

Author	SHA1	Message	Date
Alexander Hoyle	42b892c21b	Avoid IntegrityError for SQLiteCache updates (#1286)	1 year ago

While using a `SQLiteCache`, if there are duplicate `(prompt, llm, idx)` tuples passed to [`update_cache()`](`c5dd491a21/langchain/llms/base.py (L39)`), then an `IntegrityError` is thrown. This can happen when there are duplicated prompts within the same batch.

This PR changes the SQLAlchemy `session.add()` to a `session.merge()` in `cache.py`, [following the solution from this SO thread](https://stackoverflow.com/questions/10322514/dealing-with-duplicate-primary-keys-on-insert-in-sqlalchemy-declarative-style). I believe this fixes #983, but I'm not entirely sure, since that also involves async.

Here's a minimal example of the error:

```python
from pathlib import Path

import langchain
from langchain.cache import SQLiteCache

llm = langchain.OpenAI(model_name="text-ada-001", openai_api_key=Path("/.openai_api_key").read_text().strip())
langchain.llm_cache = SQLiteCache("test_cache.db")
llm.generate(['a'] * 5)
```

```
> IntegrityError: (sqlite3.IntegrityError) UNIQUE constraint failed: full_llm_cache.prompt, full_llm_cache.llm, full_llm_cache.idx
[SQL: INSERT INTO full_llm_cache (prompt, llm, idx, response) VALUES (?, ?, ?, ?)]
[parameters: ('a', "[('_type', 'openai'), ('best_of', 1), ('frequency_penalty', 0), ('logit_bias', {}), ('max_tokens', 256), ('model_name', 'text-ada-001'), ('n', 1), ('presence_penalty', 0), ('request_timeout', None), ('stop', None), ('temperature', 0.7), ('top_p', 1)]", 0, '\n\nA is for air.\n\nA is for atmosphere.')]
(Background on this error at: https://sqlalche.me/e/14/gkpj)
```

After the change, we now have the following:

```python
class Output:
    def __init__(self, text):
        self.text = text

# make dummy data
cache = SQLiteCache("test_cache_2.db")
cache.update(prompt="prompt_0", llm_string="llm_0", return_val=[Output("text_0")])
cache.engine.execute("SELECT * FROM full_llm_cache").fetchall()

# output
# > [('prompt_0', 'llm_0', 0, 'text_0')]
```

```python
# update data, before change this would have thrown an `IntegrityError`
cache.update(prompt="prompt_0", llm_string="llm_0", return_val=[Output("text_0_new")])
cache.engine.execute("SELECT * FROM full_llm_cache").fetchall()

# output
# > [('prompt_0', 'llm_0', 0, 'text_0_new')]
```
Zach Schillaci	159c560c95	Refactor some loops into list comprehensions (#1185)	1 year ago
Shahriar Tajbakhsh	b7747017d7	Import of `declarative_base` when SQLAlchemy <1.4 (#883) In [pyproject.toml](https://github.com/hwchase17/langchain/blob/master/pyproject.toml), the expectation is `SQLAlchemy = "^1"`. However, the way `declarative_base` is imported in [cache.py](https://github.com/hwchase17/langchain/blob/master/langchain/cache.py) only works with SQLAlchemy >=1.4. This PR makes sure Langchain can be run in environments with SQLAlchemy <1.4.	1 year ago
Amos Ng	fa6826e417	Fix sqlalchemy warnings when running tests (#733) This has been bugging me when running my own tests that call langchain methods :P	1 year ago
Harrison Chase	54d7f1c933	fix caching (#658)	1 year ago
Harrison Chase	73f7ebd9d1	Harrison/sqlalchemy cache store (#536) Co-authored-by: Jason Gill <jasongill@gmail.com>	1 year ago
Samantha Whitmore	6bc8ae63ef	Add Redis cache implementation (#397) I'm using a hash function for the key just to make sure its length doesn't get out of hand, otherwise the implementation is quite similar.	1 year ago
Harrison Chase	3474f39e21	Harrison/improve cache (#368) make it so everything goes through generate, which removes the need for two types of caches	1 year ago
Benjamin	85c1bd2cd0	add sqlalchemy generic cache (#361) Created a generic SQLAlchemyCache class to plug in any database supported by SQLAlchemy. (I am using Postgres.) I also based the SQLiteCache class on this SQLAlchemyCache class. As a side note, I'm questioning the need for two distinct classes, LLMCache and FullLLMCache. Shouldn't we merge them?	1 year ago
Harrison Chase	78b31e5966	Harrison/cache (#343)	1 year ago
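
The duplicate-key failure described in 42b892c21b can be reproduced without langchain or SQLAlchemy. The following is a minimal sketch using only the standard-library `sqlite3` module. The table name and columns are taken from the error message in that commit, while the composite primary key is an assumption inferred from the `UNIQUE constraint failed` text. `INSERT OR REPLACE` stands in for `session.merge()` here; it is only roughly analogous, since SQLAlchemy's merge looks up the row and updates it rather than issuing this statement.

```python
import sqlite3

# In-memory database; the schema is an assumption based on the commit's error message.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE full_llm_cache ("
    "prompt TEXT, llm TEXT, idx INTEGER, response TEXT, "
    "PRIMARY KEY (prompt, llm, idx))"
)

row = ("prompt_0", "llm_0", 0, "text_0")
conn.execute("INSERT INTO full_llm_cache VALUES (?, ?, ?, ?)", row)

# A second plain INSERT with the same (prompt, llm, idx) violates the key.
try:
    conn.execute("INSERT INTO full_llm_cache VALUES (?, ?, ?, ?)", row)
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# INSERT OR REPLACE overwrites the existing row instead of raising.
conn.execute(
    "INSERT OR REPLACE INTO full_llm_cache VALUES (?, ?, ?, ?)",
    ("prompt_0", "llm_0", 0, "text_0_new"),
)
rows = conn.execute("SELECT * FROM full_llm_cache").fetchall()
```

After this runs, `duplicate_rejected` is `True` and `rows` holds a single row with the updated response, mirroring the before and after behavior shown in the commit message.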

10 Commits (main)
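
Commit 6bc8ae63ef mentions hashing the Redis key so that its length stays bounded. A minimal sketch of that idea follows. The function name `cache_key`, the choice of SHA-256, and the separator byte are all hypothetical; the commit message does not say which hash function or key layout the Redis cache uses.

```python
import hashlib


def cache_key(prompt: str, llm_string: str, idx: int) -> str:
    """Hypothetical helper: derive a fixed-length key from the cache fields."""
    # Join the fields with a NUL separator so ("ab", "c") and ("a", "bc") differ.
    raw = f"{prompt}\x00{llm_string}\x00{idx}"
    # SHA-256 is an assumption; any stable hash gives a bounded key length.
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()


short_key = cache_key("a", "llm_0", 0)
long_key = cache_key("a" * 100_000, "llm_0", 0)
```

Both keys are 64 hexadecimal characters long regardless of prompt size, which is the property the commit message is after.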