# Nuclia

>[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal
> and external source, providing optimized search results and generative answers.
> It can handle video and audio transcription, image content extraction, and document parsing.

>The `Nuclia Understanding API` document transformer splits text into paragraphs and sentences,
> identifies entities, provides a summary of the text, and generates embeddings for all the sentences.

## Installation and Setup

We need to install the `nucliadb-protos` package to use the `Nuclia Understanding API`.
```bash
pip install nucliadb-protos
```

To use the `Nuclia Understanding API`, we need a Nuclia account.
We can create one for free at [https://nuclia.cloud](https://nuclia.cloud),
and then [create a NUA key](https://docs.nuclia.dev/docs/docs/using/understanding/intro).

To use the Nuclia document transformer, we need to instantiate a `NucliaUnderstandingAPI`
tool with `enable_ml` set to `True`:
```python
from langchain.tools.nuclia import NucliaUnderstandingAPI

# enable_ml=True is required when the tool backs the document transformer
nua = NucliaUnderstandingAPI(enable_ml=True)
```
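The tool needs the NUA key at instantiation time. The page does not say how the key is passed, so the sketch below assumes it is read from the `NUCLIA_ZONE` and `NUCLIA_NUA_KEY` environment variables, which must be set before `NucliaUnderstandingAPI` is constructed. The helper `configure_nuclia` and the placeholder values are hypothetical, for illustration only.

```python
import os


def configure_nuclia(zone: str, nua_key: str) -> None:
    """Export the credentials the Nuclia tool is assumed to read from the environment."""
    if not nua_key:
        raise ValueError("A NUA key is required")
    # Assumption: these are the variable names the tool looks up.
    os.environ["NUCLIA_ZONE"] = zone
    os.environ["NUCLIA_NUA_KEY"] = nua_key


# Placeholder values; replace them with your own zone and key.
configure_nuclia(zone="europe-1", nua_key="<YOUR_NUA_KEY>")
```

Keeping the key in the environment rather than in source code also makes it easier to keep it out of version control.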

## Document Transformer

See a [usage example](/docs/integrations/document_transformers/nuclia_transformer).

```python
from langchain.document_transformers.nuclia_text_transform import NucliaTextTransformer
```
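As a rough sketch of how the import above might be used, the snippet below wraps a few strings in `Document` objects and passes them through the transformer. It assumes that `NucliaTextTransformer` takes the `NucliaUnderstandingAPI` tool as its constructor argument and that the work is done by the asynchronous `atransform_documents` method. The function names `transform_texts` and `main` are hypothetical, and the linked usage example remains the authoritative reference.

```python
import asyncio


async def transform_texts(texts):
    """Send raw strings through the Nuclia transformer and return the enriched documents."""
    # Imported lazily so this module still loads where langchain is not installed.
    from langchain.document_transformers.nuclia_text_transform import NucliaTextTransformer
    from langchain.schema import Document
    from langchain.tools.nuclia import NucliaUnderstandingAPI

    nua = NucliaUnderstandingAPI(enable_ml=True)
    transformer = NucliaTextTransformer(nua)
    documents = [Document(page_content=text, metadata={}) for text in texts]
    # Assumption: the transformer is async-only and exposes atransform_documents.
    return await transformer.atransform_documents(documents)


def main():
    """Needs a valid NUA key and network access; call it once credentials are set."""
    for doc in asyncio.run(transform_texts(["<TEXT 1>", "<TEXT 2>"])):
        print(doc.metadata)
```

Running `main()` contacts the Nuclia service, so it is left uncalled here.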