mirror of
https://github.com/hwchase17/langchain
synced 2024-10-29 17:07:25 +00:00
5cfa72a130
# Bibtex integration Wrap bibtexparser to retrieve a list of docs from a bibtex file. * Get the metadata from the bibtex entries * `page_content` get from the local pdf referenced in the `file` field of the bibtex entry using `pymupdf` * If no valid pdf file, `page_content` set to the `abstract` field of the bibtex entry * Support Zotero flavour using regex to get the file path * Added usage example in `docs/modules/indexes/document_loaders/examples/bibtex.ipynb` --------- Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr> Co-authored-by: Dev 2049 <dev.dev2049@gmail.com> |
||
---|---|---|
.. | ||
__init__.py | ||
bibtex.bib | ||
empty_export.enex | ||
layout-parser-paper.pdf | ||
sample_notebook_2.enex | ||
sample_notebook_emptynote.enex | ||
sample_notebook_missingcontenttag.enex | ||
sample_notebook_missingmetadata.enex | ||
sample_notebook_with_media.enex | ||
sample_notebook.enex |