langchain/docs
Eugene Yurtsev 3c490b5ba3
Docugami DataLoader (#4727)
### Adds a document loader for Docugami

Specifically:

1. Adds a data loader that talks to the [Docugami](http://docugami.com)
API to download processed documents as semantic XML
2. Parses the semantic XML into chunks, with additional metadata
capturing chunk semantics
3. Adds a detailed notebook showing how you can use additional metadata
returned by Docugami for techniques like the [self-querying
retriever](https://python.langchain.com/en/latest/modules/indexes/retrievers/examples/self_query_retriever.html)
4. Adds an integration test, and related documentation
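
As a rough illustration of step 2, the sketch below turns semantic XML into chunks that carry structural metadata. It is not the loader's actual implementation: the sample XML, the `Chunk` class, the `chunk_semantic_xml` function, and the metadata keys (`source`, `tag`, `xpath`) are all hypothetical and chosen only for this example. The real Docugami output follows its own schema.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical semantic XML, invented for this example.
SAMPLE_XML = """
<Lease>
  <Landlord>Acme Properties LLC</Landlord>
  <Tenant>Birch Street Bakery</Tenant>
  <Term>
    <StartDate>2023-01-01</StartDate>
    <EndDate>2025-12-31</EndDate>
  </Term>
</Lease>
"""


@dataclass
class Chunk:
    """A piece of text plus metadata describing where it came from."""
    text: str
    metadata: Dict[str, str] = field(default_factory=dict)


def chunk_semantic_xml(xml_text: str, source: str = "sample.xml") -> List[Chunk]:
    """Emit one chunk per leaf element, tagged with its semantic tag and path."""
    root = ET.fromstring(xml_text.strip())
    chunks: List[Chunk] = []

    def walk(node: ET.Element, parent_path: str) -> None:
        path = f"{parent_path}/{node.tag}"
        children = list(node)
        text = (node.text or "").strip()
        # Only leaf elements with text become chunks in this sketch.
        if not children and text:
            chunks.append(
                Chunk(text=text, metadata={"source": source, "tag": node.tag, "xpath": path})
            )
        for child in children:
            walk(child, path)

    walk(root, "")
    return chunks


if __name__ == "__main__":
    for chunk in chunk_semantic_xml(SAMPLE_XML):
        print(chunk.metadata["xpath"], "->", chunk.text)
```

Keeping the tag and path alongside each chunk is what lets a downstream retriever filter on structure (for example, only `Tenant` chunks) rather than on text similarity alone.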

Here is an example of a result that is not possible without the
capabilities added by Docugami (from the notebook):

<img width="1585" alt="image"
src="https://github.com/hwchase17/langchain/assets/749277/bb6c1ce3-13dc-4349-a53b-de16681fdd5b">

---------

Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>
Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>
1 year ago
| Name | Last commit | Updated |
| --- | --- | --- |
| _static | docs: Mendable Fixes and Improvements (#4184) | 1 year ago |
| ecosystem | Docugami DataLoader (#4727) | 1 year ago |
| getting_started | docs: tutorials are moved on the top-level of docs (#4464) | 1 year ago |
| modules | Docugami DataLoader (#4727) | 1 year ago |
| reference | Move Generative Agent definition to Experimental (#3245) | 1 year ago |
| tracing | Callbacks Refactor [base] (#3256) | 1 year ago |
| use_cases | Add Steamship Image Generation Tool (#4580) | 1 year ago |
| Makefile | Feature: linkcheck-action (#534) (#542) | 2 years ago |
| conf.py | docs: Mendable Search integration (#2803) | 1 year ago |
| deployments.md | [Docs]: Add Kinsta to the list of deployment providers (#4445) | 1 year ago |
| ecosystem.rst | added integration links to the ecosystem.rst (#3453) | 1 year ago |
| gallery.rst | Update gallery.rst with chatpdf opensource (#4342) | 1 year ago |
| glossary.md | big docs refactor (#1978) | 2 years ago |
| index.rst | docs: tutorials are moved on the top-level of docs (#4464) | 1 year ago |
| make.bat | initial commit | 2 years ago |
| model_laboratory.ipynb | big docs refactor (#1978) | 2 years ago |
| reference.rst | Move Generative Agent definition to Experimental (#3245) | 1 year ago |
| requirements.txt | Harrison/docs reqs (#2199) | 2 years ago |
| tracing.md | Callbacks Refactor [base] (#3256) | 1 year ago |
| youtube.md | docs: tutorials are moved on the top-level of docs (#4464) | 1 year ago |