You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/docs/integrations/docugami.md

559 B

Docugami

Docugami converts business documents into a Document XML Knowledge Graph, generating forests of XML semantic trees representing entire documents. This is a rich representation that includes the semantic and structural characteristics of various chunks in the document as an XML tree.

Installation and Setup

pip install lxml

Document Loader

See a usage example.

from langchain.document_loaders import DocugamiLoader