langchain/tests/integration_tests
Pau Ramon Revilla 87802c86d9
Added a MHTML document loader (#6311)
MHTML is an interesting format because it is used both for emails and
for archived webpages. Some scraping projects want to store pages on
disk to process them later, and MHTML is well suited to that use case.

This is heavily inspired by the BeautifulSoup HTML loader, but it
extracts the HTML part from the MHTML file.
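
The idea of pulling the HTML part out of an MHTML file can be sketched roughly as follows. This is not the loader added in the commit: the function names `extract_html_from_mhtml` and `mhtml_to_text` are hypothetical, and the standard library's `html.parser` stands in for BeautifulSoup so the example has no third-party dependencies. It assumes the MHTML file is a MIME multipart message containing at least one `text/html` part.

```python
import email
from email import policy
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_html_from_mhtml(raw: bytes) -> str:
    """Return the first text/html part of an MHTML (MIME) message.

    Hypothetical helper: MHTML is parsed with the stdlib email package,
    which also decodes quoted-printable or base64 transfer encodings.
    """
    message = email.message_from_bytes(raw, policy=policy.default)
    for part in message.walk():
        if part.get_content_type() == "text/html":
            return part.get_content()
    raise ValueError("no text/html part found in MHTML data")


def mhtml_to_text(raw: bytes) -> str:
    """Extract the HTML part and reduce it to newline-separated text."""
    parser = _TextExtractor()
    parser.feed(extract_html_from_mhtml(raw))
    parser.close()
    return "\n".join(parser.chunks)
```

A caller would read the archived page as bytes, for example `mhtml_to_text(open("page.mhtml", "rb").read())`, where `page.mhtml` is a placeholder path. Reading bytes rather than text lets the MIME parser handle each part's declared charset.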

---------

Co-authored-by: rlm <pexpresss31@gmail.com>
1 year ago
agent Add Multi-CSV/DF support in CSV and DataFrame Toolkits (#5009) 1 year ago
cache feat: add Momento as a standard cache and chat message history provider (#5221) 1 year ago
callbacks split up batch llm calls into separate runs (#5804) 1 year ago
chains fix neo4j schema query (#6381) 1 year ago
chat_models fix anthropic chat model mutating input list (#6457) 1 year ago
document_loaders feat: Add `UnstructuredRSTLoader` (#6594) 1 year ago
embeddings feat: interfaces for async embeddings, implement async openai (#6563) 1 year ago
examples Added a MHTML document loader (#6311) 1 year ago
llms split up batch llm calls into separate runs (#5804) 1 year ago
memory feat: add Momento as a standard cache and chat message history provider (#5221) 1 year ago
prompts Cleanup integration test dir (#3308) 1 year ago
retrievers DocArray as a Retriever (#6031) 1 year ago
utilities ArxivAPIWrapper - doc_content_chars_max (#6063) 1 year ago
vectorstores Clarifai integration (#5954) 1 year ago
.env.example adding MongoDBAtlasVectorSearch (#5338) 1 year ago
__init__.py initial commit 2 years ago
conftest.py feat: improve pinecone tests (#2806) 1 year ago
test_document_transformers.py Contextual compression retriever (#2915) 1 year ago
test_kuzu.py Add KuzuQAChain (#6454) 1 year ago
test_nebulagraph.py Harrison/nebula graph (#5865) 1 year ago
test_nlp_text_splitters.py OptimizedPrompt -- k-shot example choice backed by semantic search (#91) 2 years ago
test_pdf_pagesplitter.py cleanup: unify 3 different pdf loaders, rename PagedPDFSplitter (#1615) 2 years ago
test_schema.py Add 'get_token_ids' method (#4784) 1 year ago
test_text_splitter.py chore: speed up integration test by using smaller model (#6044) 1 year ago