You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/tests/integration_tests/document_loaders
Matt Robinson bf3f554357
feat: batch multiple files in a single Unstructured API request (#4525)
### Submit Multiple Files to the Unstructured API

Enables batching multiple files into a single Unstructured API requests.
Support for requests with multiple files was added to both
`UnstructuredAPIFileLoader` and `UnstructuredAPIFileIOLoader`. Note that
if you submit multiple files in "single" mode, the result will be
concatenated into a single document. We recommend using this feature in
"elements" mode.

### Testing

The following should load both documents, using two of the example docs
from the integration tests folder.

```python
    from langchain.document_loaders import UnstructuredAPIFileLoader

    file_paths = ["examples/layout-parser-paper.pdf",  "examples/whatsapp_chat.txt"]

    loader = UnstructuredAPIFileLoader(
        file_paths=file_paths,
        api_key="FAKE_API_KEY",
        strategy="fast",
        mode="elements",
    )
    docs = loader.load()
```
1 year ago
..
parsers Add html parsers (#4874) 1 year ago
__init__.py Add new iFixit document loader (#1333) 2 years ago
test_arxiv.py `Arxiv` document loader (#3627) 1 year ago
test_bigquery.py Harrison/big query (#2100) 2 years ago
test_bilibili.py Remove unnecessary spaces from document object’s page_content of BiliBiliLoader (#4619) 1 year ago
test_blockchain.py Enhancement: option to Get All Tokens with a single Blockchain Document Loader call (#3797) 1 year ago
test_confluence.py Several confluence loader improvements (#3300) 1 year ago
test_dataframe.py rm pandas dependency (#2102) 2 years ago
test_duckdb.py Harrison/duckdb (#2064) 2 years ago
test_email.py Harrison/msg files (#2375) 2 years ago
test_facebook_chat.py Refactor TelegramChatLoader and FacebookChatLoader classes and add tests (#3863) 1 year ago
test_figma.py Harrison/figma doc loader (#1908) 2 years ago
test_gitbook.py Harrison/gitbook (#2044) 2 years ago
test_ifixit.py Add new iFixit document loader (#1333) 2 years ago
test_json_loader.py JSON loader (#4067) 1 year ago
test_modern_treasury.py Dev2049/add modern treasury (#3924) 1 year ago
test_odt.py feat: add loader for open office odt files (#4405) 1 year ago
test_pdf.py Dev2049/pypdfium2 (#4209) 1 year ago
test_python.py Add PythonLoader which auto-detects encoding of Python files (#3311) 1 year ago
test_sitemap.py Harrison/sitemap local (#4704) 1 year ago
test_slack.py Add Slack Directory Loader (#2841) 1 year ago
test_spreedly.py Harrison/spreedly (#3937) 1 year ago
test_stripe.py Dev2049/add modern treasury (#3924) 1 year ago
test_unstructured.py feat: batch multiple files in a single Unstructured API request (#4525) 1 year ago
test_url.py add continue to fix 'continue_on_failure' parameter for URL doc loader (#2735) 1 year ago
test_url_playwright.py Harrison/playwright selector (#3185) 1 year ago
test_whatsapp_chat.py Update WhatsAppChatLoader to include the character ~ in the sender name (#4420) 1 year ago