langchain/tests/unit_tests/document_loaders/sample_documents
Eugene Yurtsev 5cfa72a130
Bibtex integration for document loader and retriever (#5137)
# Bibtex integration

Wrap `bibtexparser` to retrieve a list of documents from a BibTeX file.
* Metadata is taken from the BibTeX entries
* `page_content` is read from the local PDF referenced in the `file` field
of the BibTeX entry, using `pymupdf`
* If there is no valid PDF file, `page_content` is set to the `abstract` field of
the BibTeX entry
* Supports the Zotero flavour, using a regex to get the file path
* Adds a usage example in
`docs/modules/indexes/document_loaders/examples/bibtex.ipynb`
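
The fallback described in the list above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function names, the regex, and the assumed Zotero `file` layout (`Description:/path/to/file.pdf:application/pdf`, with several attachments separated by `;`) are all assumptions made for the example. The PDF reader is passed in as a callable so the selection logic can be tried without `pymupdf` installed.

```python
import re
from pathlib import Path
from typing import Callable, Dict, List

# Assumed Zotero-style value: "Description:/path/to/file.pdf:application/pdf".
# Illustrative pattern only; the commit's actual regex may differ.
ZOTERO_FILE = re.compile(
    r"^[^:]*:(?P<path>.+?\.pdf):application/pdf$", re.IGNORECASE
)


def extract_pdf_paths(file_field: str) -> List[str]:
    """Return candidate PDF paths found in a BibTeX `file` field."""
    paths = []
    for part in file_field.split(";"):
        part = part.strip()
        match = ZOTERO_FILE.match(part)
        if match:
            # Zotero flavour: keep only the path between the two colons.
            paths.append(match.group("path"))
        elif part.lower().endswith(".pdf"):
            # Plain flavour: the field is already a path.
            paths.append(part)
    return paths


def choose_page_content(
    entry: Dict[str, str], read_pdf: Callable[[str], str]
) -> str:
    """Use the text of the first existing PDF, else fall back to `abstract`."""
    for path in extract_pdf_paths(entry.get("file", "")):
        if Path(path).is_file():
            return read_pdf(path)
    return entry.get("abstract", "")


def read_pdf_with_pymupdf(path: str) -> str:
    """One possible reader; requires the optional `pymupdf` package."""
    import fitz  # PyMuPDF, imported lazily so the rest runs without it

    with fitz.open(path) as doc:
        return "\n".join(page.get_text() for page in doc)
```

With an entry whose `file` field points at a missing PDF, `choose_page_content(entry, read_pdf_with_pymupdf)` returns the entry's `abstract`; the reader is only called once a referenced file is found on disk.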
---------

Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr>
Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>
2023-05-25 00:21:31 -07:00
__init__.py feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
bibtex.bib Bibtex integration for document loader and retriever (#5137) 2023-05-25 00:21:31 -07:00
empty_export.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
layout-parser-paper.pdf Bibtex integration for document loader and retriever (#5137) 2023-05-25 00:21:31 -07:00
sample_notebook_2.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
sample_notebook_emptynote.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
sample_notebook_missingcontenttag.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
sample_notebook_missingmetadata.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
sample_notebook_with_media.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00
sample_notebook.enex feature/4493 Improve Evernote Document Loader (#4577) 2023-05-19 14:28:17 -07:00