2023-05-15 14:53:00 +00:00
|
|
|
# Docugami
|
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
>[Docugami](https://docugami.com) converts business documents into a Document XML Knowledge Graph, generating forests
|
|
|
|
> of XML semantic trees representing entire documents. This is a rich representation that includes the semantic and
|
2023-05-30 20:58:16 +00:00
|
|
|
> structural characteristics of various chunks in the document as an XML tree.
|
2023-05-15 14:53:00 +00:00
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
## Installation and Setup
|
2023-05-15 14:53:00 +00:00
|
|
|
|
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
```bash
|
|
|
|
pip install lxml
|
|
|
|
```
|
2023-05-15 14:53:00 +00:00
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
## Document Loader
|
2023-05-15 14:53:00 +00:00
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
See a [usage example](../modules/indexes/document_loaders/examples/docugami.ipynb).
|
2023-05-15 14:53:00 +00:00
|
|
|
|
2023-06-01 00:54:05 +00:00
|
|
|
```python
|
|
|
|
from langchain.document_loaders import DocugamiLoader
|
|
|
|
```
|