You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/libs/community/tests/unit_tests/document_loaders
WilliamEspegren 804390ba4b
community: Spider integration (#20937)
Added the [Spider.cloud](https://spider.cloud) document loader.
[Spider](https://github.com/spider-rs/spider) is the
[fastest](https://github.com/spider-rs/spider/blob/main/benches/BENCHMARKS.md)
and cheapest crawler that returns LLM-ready data.

```
- **Description:** Adds Spider data loader
- **Dependencies:** spider-client
- **Twitter handle:** @WilliamEspegren 
```

---------

Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: = <=>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
2 months ago
..
blob_loaders community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
loaders community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
parsers community: Add PHP language parser to document_loaders (#19850) 2 months ago
sample_documents community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_docs community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
__init__.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_airbyte.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_arcgis_loader.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_assemblyai.py Merge pull request #18421 4 months ago
test_bibtex.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_bshtml.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_confluence.py Merge pull request #18436 4 months ago
test_couchbase.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_csv_loader.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_cube_semantic.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_detect_encoding.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_directory.py community[minor]: Implement DirectoryLoader lazy_load function (#19537) 3 months ago
test_directory_loader.py infra: test directory loader multithreaded (#20281) 2 months ago
test_evernote_loader.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_generic_loader.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_git.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_github.py community[patch]: Change github endpoint in GithubLoader (#17622) 4 months ago
test_hugging_face.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_hugging_face_model.py community[minor]: add hugging_face_model document loader (#17323) 4 months ago
test_imports.py community: Spider integration (#20937) 2 months ago
test_json_loader.py community[minor]: use jq schema for content_key in json_loader (#18003) 4 months ago
test_lakefs.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_mediawikidump.py infra: add print rule to ruff (#16221) 4 months ago
test_mhtml.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_mongodb.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_notebook.py community[patch]: add NotebookLoader unit test (#17721) 3 months ago
test_obsidian.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_onenote.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_oracleadb.py community[minor]: add oracle autonomous database doc loader integration (#19536) 3 months ago
test_pebblo.py community[minor]: Add support for Pebblo cloud_api_key in PebbloSafeLoader (#19855) 2 months ago
test_psychic.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_readthedoc.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_rspace_loader.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_rss.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_trello.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_web_base.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago
test_youtube.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 6 months ago