You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/tests/integration_tests
Yifei Song 7d29bb2c02
Add Xorbits Dataframe as a Document Loader (#7319)
- [Xorbits](https://doc.xorbits.io/en/latest/) is an open-source
computing framework that makes it easy to scale data science and machine
learning workloads in parallel. Xorbits can leverage multi cores or GPUs
to accelerate computation on a single machine, or scale out up to
thousands of machines to support processing terabytes of data.

- This PR added support for the Xorbits document loader, which allows
langchain to leverage Xorbits to parallelize and distribute the loading
of data.
- Dependencies: This change requires the Xorbits library to be installed
in order to be used.
`pip install xorbits`
- Request for review: @rlancemartin, @eyurtsev
- Twitter handle: https://twitter.com/Xorbitsio

Co-authored-by: Bagatur <baskaryan@gmail.com>
1 year ago
..
agent Add Multi-CSV/DF support in CSV and DataFrame Toolkits (#5009) 1 year ago
cache feat: add Momento as a standard cache and chat message history provider (#5221) 1 year ago
callbacks support adding custom metadata to runs (#7120) 1 year ago
chains Support for SPARQL (#7165) 1 year ago
chat_models integrate JinaChat (#6927) 1 year ago
client Load Run Evaluator (#7101) 1 year ago
document_loaders Add Xorbits Dataframe as a Document Loader (#7319) 1 year ago
embeddings Harrison/octo ml (#6897) 1 year ago
evaluation/embedding_distance Add String Distance and Embedding Evaluators (#7123) 1 year ago
examples feat: Add `UnstructuredTSVLoader` (#7367) 1 year ago
graphs Add HugeGraphQAChain to support gremlin generating chain (#7132) 1 year ago
llms feat: ctransformers support async chain (#6859) 1 year ago
memory Harrison/split schema dir (#7025) 1 year ago
prompts Cleanup integration test dir (#3308) 1 year ago
retrievers Add serialized object to retriever start callback (#7074) 1 year ago
utilities Harrison/dataforseo (#7214) 1 year ago
vectorstores adding max_marginal_relevance_search method to MongoDBAtlasVectorSearch (#7310) 1 year ago
.env.example adding MongoDBAtlasVectorSearch (#5338) 1 year ago
__init__.py initial commit 2 years ago
conftest.py feat: improve pinecone tests (#2806) 1 year ago
test_document_transformers.py The Fellowship of the Vectors: New Embeddings Filter using clustering. (#7015) 1 year ago
test_kuzu.py Add KuzuQAChain (#6454) 1 year ago
test_nebulagraph.py Harrison/nebula graph (#5865) 1 year ago
test_nlp_text_splitters.py Add spacy sentencizer (#7442) 1 year ago
test_pdf_pagesplitter.py cleanup: unify 3 different pdf loaders, rename PagedPDFSplitter (#1615) 2 years ago
test_schema.py Base language model docstrings (#7104) 1 year ago
test_text_splitter.py chore: spedd up integration test by using smaller model (#6044) 1 year ago