langchain/docs
Eugene Yurtsev 3c490b5ba3
Docugami DataLoader (#4727)
### Adds a document loader for Docugami

Specifically:

1. Adds a data loader that talks to the [Docugami](http://docugami.com)
API to download processed documents as semantic XML
2. Parses the semantic XML into chunks, with additional metadata
capturing chunk semantics
3. Adds a detailed notebook showing how you can use additional metadata
returned by Docugami for techniques like the [self-querying
retriever](https://python.langchain.com/en/latest/modules/indexes/retrievers/examples/self_query_retriever.html)
4. Adds an integration test, and related documentation

Here is an example of a result that is not possible without the
capabilities added by Docugami (from the notebook):

<img width="1585" alt="image"
src="https://github.com/hwchase17/langchain/assets/749277/bb6c1ce3-13dc-4349-a53b-de16681fdd5b">

---------

Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>
Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>
2023-05-15 10:53:00 -04:00
..
_static docs: Mendable Fixes and Improvements (#4184) 2023-05-05 13:04:24 -07:00
ecosystem Docugami DataLoader (#4727) 2023-05-15 10:53:00 -04:00
getting_started docs: tutorials are moved on the top-level of docs (#4464) 2023-05-14 21:22:25 -07:00
modules Docugami DataLoader (#4727) 2023-05-15 10:53:00 -04:00
reference Move Generative Agent definition to Experimental (#3245) 2023-04-23 18:32:37 -07:00
tracing Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
use_cases Add Steamship Image Generation Tool (#4580) 2023-05-12 10:35:01 -07:00
conf.py docs: Mendable Search integration (#2803) 2023-04-13 21:52:25 -07:00
deployments.md [Docs]: Add Kinsta to the list of deployment providers (#4445) 2023-05-11 00:29:48 -07:00
ecosystem.rst added integration links to the ecosystem.rst (#3453) 2023-04-24 12:17:44 -07:00
gallery.rst Update gallery.rst with chatpdf opensource (#4342) 2023-05-14 19:43:16 -07:00
glossary.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
index.rst docs: tutorials are moved on the top-level of docs (#4464) 2023-05-14 21:22:25 -07:00
make.bat
Makefile Feature: linkcheck-action (#534) (#542) 2023-01-04 21:39:50 -08:00
model_laboratory.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
reference.rst Move Generative Agent definition to Experimental (#3245) 2023-04-23 18:32:37 -07:00
requirements.txt Harrison/docs reqs (#2199) 2023-03-30 08:20:30 -07:00
tracing.md Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
youtube.md docs: tutorials are moved on the top-level of docs (#4464) 2023-05-14 21:22:25 -07:00