langchain/tests/integration_tests
Caleb Ellington c5a7a85a4e
fix chroma update_document to embed entire documents, fixes a characer-wise embedding bug (#5584)
# Chroma update_document full document embeddings bugfix

Chroma update_document takes a single document, but treats the
page_content sting of that document as a list when getting the new
document embedding.

This is a two-fold problem, where the resulting embedding for the
updated document is incorrect (it's only an embedding of the first
character in the new page_content) and it calls the embedding function
for every character in the new page_content string, using many tokens in
the process.

Fixes #5582


Co-authored-by: Caleb Ellington <calebellington@Calebs-MBP.hsd1.ca.comcast.net>
2023-06-02 11:12:48 -07:00
..
agent Add Multi-CSV/DF support in CSV and DataFrame Toolkits (#5009) 2023-05-25 14:23:11 -07:00
cache feat: add Momento as a standard cache and chat message history provider (#5221) 2023-05-25 19:13:21 -07:00
callbacks feat(integrations): Add WandbTracer (#4521) 2023-06-01 00:01:19 -07:00
chains Harrison/neo4j (#5078) 2023-05-22 07:31:48 -07:00
chat_models Harrison/vertex (#5049) 2023-05-24 15:51:12 -07:00
client Add Feedback Methods + Evaluation examples (#5166) 2023-05-31 11:14:27 -07:00
document_loaders add maxcompute (#5533) 2023-06-01 00:54:42 -07:00
embeddings encoding_kwargs for InstructEmbeddings (#5450) 2023-05-30 11:57:04 -07:00
examples fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 2023-05-16 14:35:25 -07:00
llms Harrison/prediction guard update (#5404) 2023-05-29 07:14:59 -07:00
memory feat: add Momento as a standard cache and chat message history provider (#5221) 2023-05-25 19:13:21 -07:00
prompts Cleanup integration test dir (#3308) 2023-04-21 09:44:09 -07:00
retrievers tfidf retriever (#5114) 2023-05-24 10:02:09 -07:00
utilities Tedma4/twilio tool (#5136) 2023-05-25 19:19:22 -07:00
vectorstores fix chroma update_document to embed entire documents, fixes a characer-wise embedding bug (#5584) 2023-06-02 11:12:48 -07:00
__init__.py
.env.example adding MongoDBAtlasVectorSearch (#5338) 2023-05-30 07:59:01 -07:00
conftest.py feat: improve pinecone tests (#2806) 2023-04-13 21:49:31 -07:00
test_document_transformers.py Contextual compression retriever (#2915) 2023-04-20 17:01:14 -07:00
test_nlp_text_splitters.py
test_pdf_pagesplitter.py
test_schema.py Add 'get_token_ids' method (#4784) 2023-05-22 13:17:26 +00:00
test_text_splitter.py Fix TextSplitter.from_tiktoken(#4361) 2023-05-08 16:36:38 -07:00