langchain/tests/unit_tests/document_loaders/sample_documents
Eugene Yurtsev 5cfa72a130
Bibtex integration for document loader and retriever (#5137)
# Bibtex integration

Wrap bibtexparser to retrieve a list of docs from a bibtex file.
* Get the metadata from the bibtex entries
* `page_content` get from the local pdf referenced in the `file` field
of the bibtex entry using `pymupdf`
* If no valid pdf file, `page_content` set to the `abstract` field of
the bibtex entry
* Support Zotero flavour using regex to get the file path
* Added usage example in
`docs/modules/indexes/document_loaders/examples/bibtex.ipynb`
---------

Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr>
Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>
2023-05-25 00:21:31 -07:00
..
__init__.py
bibtex.bib
empty_export.enex
layout-parser-paper.pdf
sample_notebook_2.enex
sample_notebook_emptynote.enex
sample_notebook_missingcontenttag.enex
sample_notebook_missingmetadata.enex
sample_notebook_with_media.enex
sample_notebook.enex