langchain/tests/integration_tests
German Martin 736a1819aa
LOTR: Lord of the Retrievers. A retriever that merge several retrievers together applying document_formatters to them. (#5798)
"One Retriever to merge them all, One Retriever to expose them, One
Retriever to bring them all and in and process them with Document
formatters."

Hi @dev2049! Here bothering people again!

I'm using this simple idea to deal with merging the output of several
retrievers into one.
I'm aware of DocumentCompressorPipeline and
ContextualCompressionRetriever but I don't think they allow us to do
something like this. Also I was getting in trouble to get the pipeline
working too. Please correct me if i'm wrong.

This allow to do some sort of "retrieval" preprocessing and then using
the retrieval with the curated results anywhere you could use a
retriever.
My use case is to generate diff indexes with diff embeddings and sources
for a more colorful results then filtering them with one or many
document formatters.

I saw some people looking for something like this, here:
https://github.com/hwchase17/langchain/issues/3991
and something similar here:
https://github.com/hwchase17/langchain/issues/5555

This is just a proposal I know I'm missing tests , etc. If you think
this is a worth it idea I can work on tests and anything you want to
change.
Let me know!

---------

Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
2023-06-10 08:41:02 -07:00
..
agent Add Multi-CSV/DF support in CSV and DataFrame Toolkits (#5009) 2023-05-25 14:23:11 -07:00
cache feat: add Momento as a standard cache and chat message history provider (#5221) 2023-05-25 19:13:21 -07:00
callbacks Tracing Group (#5326) 2023-06-05 19:18:43 -07:00
chains Implement saving and loading of RetrievalQA chain (#5818) 2023-06-07 21:07:13 -07:00
chat_models top_k and top_p transposed in vertexai (#5673) 2023-06-04 16:59:53 -07:00
document_loaders Harrison/fauna loader (#5864) 2023-06-07 21:32:23 -07:00
embeddings Add DeepInfra embeddings integration with tests and examples, better exception handling for Deep Infra LLM (#5854) 2023-06-07 19:14:30 -07:00
examples feat: Add UnstructuredCSVLoader for CSV files (#5844) 2023-06-07 19:18:01 -07:00
llms Baseten integration (#5862) 2023-06-08 23:05:57 -07:00
memory feat: add Momento as a standard cache and chat message history provider (#5221) 2023-05-25 19:13:21 -07:00
prompts Cleanup integration test dir (#3308) 2023-04-21 09:44:09 -07:00
retrievers LOTR: Lord of the Retrievers. A retriever that merge several retrievers together applying document_formatters to them. (#5798) 2023-06-10 08:41:02 -07:00
utilities Harrison/pubmed integration (#5664) 2023-06-03 16:25:28 -07:00
vectorstores Added SingleStoreDB Vector Store (#5619) 2023-06-07 20:45:33 -07:00
__init__.py
.env.example adding MongoDBAtlasVectorSearch (#5338) 2023-05-30 07:59:01 -07:00
conftest.py
test_document_transformers.py Contextual compression retriever (#2915) 2023-04-20 17:01:14 -07:00
test_nebulagraph.py Harrison/nebula graph (#5865) 2023-06-07 21:56:43 -07:00
test_nlp_text_splitters.py
test_pdf_pagesplitter.py
test_schema.py Add 'get_token_ids' method (#4784) 2023-05-22 13:17:26 +00:00
test_text_splitter.py refactor: extract token text splitter function (#5179) 2023-06-04 14:41:44 -07:00