langchain/libs/community/tests/unit_tests/document_loaders
Mazen Ramadan 3c1d77dd64
community[minor]: Add Scrapfly Loader community integration (#22036)
Added [Scrapfly](https://scrapfly.io/) Web Loader integration. Scrapfly
is a web scraping API that allows extracting web page data into
accessible markdown or text datasets.

- __Description__: Added Scrapfly web loader for retrieving web page
data as markdown or text.
- Dependencies: scrapfly-sdk
- Twitter: @thealchemi1st

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2024-05-22 21:29:13 +00:00
..
blob_loaders community[patch]: upgrade to recent version of mypy (#21616) 2024-05-13 14:55:07 -04:00
loaders community[patch]: upgrade to recent version of mypy (#21616) 2024-05-13 14:55:07 -04:00
parsers community: Add PHP language parser to document_loaders (#19850) 2024-04-08 11:30:28 -04:00
sample_documents
test_docs community: Fix CSVLoader columns is None (#20701) 2024-05-22 12:57:46 -07:00
__init__.py
test_airbyte.py
test_arcgis_loader.py
test_assemblyai.py Merge pull request #18421 2024-03-06 13:16:05 -05:00
test_bibtex.py
test_bshtml.py
test_confluence.py Merge pull request #18436 2024-03-06 13:15:24 -05:00
test_couchbase.py
test_csv_loader.py community: Fix CSVLoader columns is None (#20701) 2024-05-22 12:57:46 -07:00
test_cube_semantic.py
test_detect_encoding.py
test_directory_loader.py community: Fix CSVLoader columns is None (#20701) 2024-05-22 12:57:46 -07:00
test_directory.py community[minor]: Implement DirectoryLoader lazy_load function (#19537) 2024-03-29 14:46:52 +00:00
test_evernote_loader.py
test_generic_loader.py
test_git.py
test_github.py community[patch]: upgrade to recent version of mypy (#21616) 2024-05-13 14:55:07 -04:00
test_hugging_face_model.py community[minor]: add hugging_face_model document loader (#17323) 2024-02-28 20:05:35 +00:00
test_hugging_face.py
test_imports.py community[minor]: Add Scrapfly Loader community integration (#22036) 2024-05-22 21:29:13 +00:00
test_json_loader.py community[minor]: use jq schema for content_key in json_loader (#18003) 2024-03-05 15:51:24 -08:00
test_lakefs.py community[minor]: import fix (#20995) 2024-04-29 10:32:50 -04:00
test_mediawikidump.py infra: add print rule to ruff (#16221) 2024-02-09 16:13:30 -08:00
test_mhtml.py
test_mongodb.py
test_notebook.py community[patch]: add NotebookLoader unit test (#17721) 2024-03-29 00:27:46 +00:00
test_obsidian.py
test_onenote.py
test_oracleadb.py community[minor]: add oracle autonomous database doc loader integration (#19536) 2024-03-26 17:02:18 -07:00
test_pebblo.py community[minor]: Add support for Pebblo cloud_api_key in PebbloSafeLoader (#19855) 2024-04-08 11:10:04 -04:00
test_psychic.py
test_readthedoc.py
test_rspace_loader.py
test_rss.py
test_trello.py
test_web_base.py
test_youtube.py