langchain/tests/integration_tests
Matt Robinson bf3f554357
feat: batch multiple files in a single Unstructured API request (#4525)
### Submit Multiple Files to the Unstructured API

Enables batching multiple files into a single Unstructured API requests.
Support for requests with multiple files was added to both
`UnstructuredAPIFileLoader` and `UnstructuredAPIFileIOLoader`. Note that
if you submit multiple files in "single" mode, the result will be
concatenated into a single document. We recommend using this feature in
"elements" mode.

### Testing

The following should load both documents, using two of the example docs
from the integration tests folder.

```python
    from langchain.document_loaders import UnstructuredAPIFileLoader

    file_paths = ["examples/layout-parser-paper.pdf",  "examples/whatsapp_chat.txt"]

    loader = UnstructuredAPIFileLoader(
        file_paths=file_paths,
        api_key="FAKE_API_KEY",
        strategy="fast",
        mode="elements",
    )
    docs = loader.load()
```
2023-05-21 20:48:20 -07:00
..
agent power bi api wrapper integration tests & bug fix (#4983) 2023-05-19 11:25:52 -04:00
cache Optimize the initialization method of GPTCache (#4522) 2023-05-11 16:15:23 -07:00
callbacks add tracing v2 env var (#4465) 2023-05-10 11:08:29 -07:00
chains Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
chat_models add alias for model (#4553) 2023-05-18 09:12:23 -07:00
document_loaders feat: batch multiple files in a single Unstructured API request (#4525) 2023-05-21 20:48:20 -07:00
embeddings Dev2049/hf emb encode kwargs (#3925) 2023-05-01 20:27:41 -07:00
examples fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 2023-05-16 14:35:25 -07:00
llms add alias for model (#4553) 2023-05-18 09:12:23 -07:00
memory Cassandra support for chat history (#4378) (#4764) 2023-05-15 23:43:09 -07:00
prompts
retrievers Remove unnecessary comment (#4845) 2023-05-17 11:53:03 -04:00
utilities Fix graphql tool (#4984) 2023-05-19 15:27:50 -07:00
vectorstores Adds 'IN' metadata filter for pgvector for checking set presence (#4982) 2023-05-19 13:53:23 -07:00
__init__.py
.env.example power bi api wrapper integration tests & bug fix (#4983) 2023-05-19 11:25:52 -04:00
conftest.py
test_document_transformers.py
test_nlp_text_splitters.py
test_pdf_pagesplitter.py
test_schema.py Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
test_text_splitter.py Fix TextSplitter.from_tiktoken(#4361) 2023-05-08 16:36:38 -07:00