langchain/docs/ecosystem
Matt Robinson c51dec5101
feat: add Unstructured API loaders (#3906)
### Summary

Adds `UnstructuredAPIFileLoader` and `UnstructuredAPIFileIOLoader`,
which partition documents through the Unstructured API. They default to
the URL of the hosted Unstructured API, but can be pointed at a
self-hosted or locally running API using the `url` kwarg. Currently, the
Unstructured API is open and does not require an API key, but it will
soon. A note about that was added to the Unstructured ecosystem page.
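
As a rough sketch of the `url` switch described above: the helper names and the local endpoint below are hypothetical, chosen only for illustration. Substitute the address of your own deployment. Only the `file_path` and `url` keyword arguments come from this change.

```python
from typing import Optional

# Hypothetical address of a locally running Unstructured API (an assumption,
# not taken from this change); replace with your own deployment's URL.
LOCAL_API_URL = "http://localhost:8000/general/v0/general"


def build_loader_kwargs(file_path: str, url: Optional[str] = None) -> dict:
    """Collect keyword arguments for UnstructuredAPIFileLoader.

    When `url` is None the key is omitted, so the loader falls back to its
    default (the hosted Unstructured API).
    """
    kwargs = {"file_path": file_path}
    if url is not None:
        kwargs["url"] = url
    return kwargs


def load_via_api(file_path: str, url: Optional[str] = None):
    """Partition a file through the hosted or a self-hosted Unstructured API."""
    # Imported lazily so the helper above can be used without langchain installed.
    from langchain.document_loaders import UnstructuredAPIFileLoader

    loader = UnstructuredAPIFileLoader(**build_loader_kwargs(file_path, url))
    return loader.load()
```

Calling `load_via_api("fake-email.eml")` would use the hosted API, while `load_via_api("fake-email.eml", url=LOCAL_API_URL)` would send the file to the local instance instead.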

### Testing


```python
from langchain.document_loaders import UnstructuredAPIFileIOLoader

filename = "fake-email.eml"

with open(filename, "rb") as f:
    loader = UnstructuredAPIFileIOLoader(file=f, file_filename=filename)
    docs = loader.load()

docs[0]
```

```python
from langchain.document_loaders import UnstructuredAPIFileLoader

filename = "fake-email.eml"
loader = UnstructuredAPIFileLoader(file_path=filename, mode="elements")
docs = loader.load()

docs[0]
```
2023-05-01 20:37:35 -07:00
ai21.md Docs refactor (#480) 2023-01-02 08:24:09 -08:00
aim_tracking.ipynb Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
analyticdb.md Add a full PostgresSQL syntax database 'AnalyticDB' as vector store. (#3135) 2023-04-22 08:25:41 -07:00
apify.md Harrison/apify (#2215) 2023-03-30 20:58:14 -07:00
atlas.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
bananadev.md [DOCS] Assorted wording, punctuation, and consistency revisions (#1443) 2023-03-08 20:16:09 -08:00
cerebriumai.md Add GooseAI, CerebriumAI, Petals, ForefrontAI (#981) 2023-02-13 21:20:19 -08:00
chroma.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
clearml_tracking.ipynb Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
cohere.md fix: numerous broken documentation links (#2070) 2023-03-27 23:07:03 -07:00
comet_tracking.ipynb Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
databerry.md Harrison/databerry (#2688) 2023-04-10 18:49:47 -07:00
deepinfra.md Add DeepInfra LLM support (#1232) 2023-02-23 07:37:15 -08:00
deeplake.md Harrison/docs cleanup (#2633) 2023-04-09 12:55:22 -07:00
forefrontai.md Add GooseAI, CerebriumAI, Petals, ForefrontAI (#981) 2023-02-13 21:20:19 -08:00
google_search.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
google_serper.md bump version to 131 (#2391) 2023-04-04 07:21:50 -07:00
gooseai.md Add GooseAI, CerebriumAI, Petals, ForefrontAI (#981) 2023-02-13 21:20:19 -08:00
gpt4all.md Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
graphsignal.md Update Graphsignal ecosystem page (#2662) 2023-04-10 08:00:26 -07:00
hazy_research.md Docs refactor (#480) 2023-01-02 08:24:09 -08:00
helicone.md [DOCS] Assorted wording, punctuation, and consistency revisions (#1443) 2023-03-08 20:16:09 -08:00
huggingface.md fix: numerous broken documentation links (#2070) 2023-03-27 23:07:03 -07:00
jina.md fix: add jina jupyter notebook (#2477) 2023-04-06 12:42:01 -07:00
lancedb.md Harrison/lancedb (#3634) 2023-04-27 08:14:36 -07:00
llamacpp.md cr 2023-04-04 07:25:28 -07:00
metal.md add metal to ecosystem (#3613) 2023-04-26 15:57:48 -07:00
milvus.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
modal.md Add Writer, Banana, Modal, StochasticAI (#1270) 2023-02-24 06:58:58 -08:00
myscale.md Harrison/myscale (#3352) 2023-04-22 09:17:38 -07:00
nlpcloud.md Docs refactor (#480) 2023-01-02 08:24:09 -08:00
openai.md fix: numerous broken documentation links (#2070) 2023-03-27 23:07:03 -07:00
opensearch.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
petals.md Update petals.md (#1225) 2023-02-22 10:34:16 -08:00
pgvector.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
pinecone.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
pipelineai.md Add PipelineAI LLM integration (#3644) 2023-04-27 08:22:26 -07:00
predictionguard.md Harrison/prediction guard (#3490) 2023-04-24 22:27:22 -07:00
promptlayer.md fix: numerous broken documentation links (#2070) 2023-03-27 23:07:03 -07:00
qdrant.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
redis.md Harrison/redis cache (#3766) 2023-04-28 20:47:18 -07:00
replicate.md Fix Replicate llm response to handle iterator / multiple outputs (#3614) 2023-04-26 14:26:33 -07:00
runhouse.md fixed the Cohere example title (#3053) 2023-04-17 21:02:52 -07:00
rwkv.md Add support for rwkv (#2422) 2023-04-06 14:41:06 -07:00
searx.md searx: implement async + helper tool providing json results (#2129) 2023-03-28 22:49:02 -07:00
serpapi.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
stochasticai.md Add Writer, Banana, Modal, StochasticAI (#1270) 2023-02-24 06:58:58 -08:00
tair.md Harrison/tair (#3770) 2023-04-28 21:25:33 -07:00
unstructured.md feat: add Unstructured API loaders (#3906) 2023-05-01 20:37:35 -07:00
wandb_tracking.ipynb Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
weaviate.md Updated incorrect link to Weaviate notebook (#3362) 2023-04-22 20:47:41 -07:00
wolfram_alpha.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
writer.md Add Writer, Banana, Modal, StochasticAI (#1270) 2023-02-24 06:58:58 -08:00
yeagerai.md Ecosystem/Yeager.ai (#3239) 2023-04-20 15:20:21 -07:00
zilliz.md Adding milvus/zilliz into docs (#2686) 2023-04-10 18:08:41 -07:00