Mirror of https://github.com/hwchase17/langchain, synced 2024-11-18 09:25:54 +00:00.
5cfa72a130
# Bibtex integration

Wrap bibtexparser to retrieve a list of docs from a bibtex file.

* Get the metadata from the bibtex entries
* `page_content` is read from the local PDF referenced in the `file` field of the bibtex entry, using `pymupdf`
* If there is no valid PDF file, `page_content` is set to the `abstract` field of the bibtex entry
* Support the Zotero flavour, using a regex to get the file path
* Added usage example in `docs/modules/indexes/document_loaders/examples/bibtex.ipynb`

---------

Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr>
Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>
| Name |
|---|
| blob_loaders |
| loaders |
| parsers |
| sample_documents |
| test_docs |
| __init__.py |
| test_base.py |
| test_bibtex.py |
| test_bshtml.py |
| test_confluence.py |
| test_csv_loader.py |
| test_detect_encoding.py |
| test_evernote_loader.py |
| test_generic_loader.py |
| test_json_loader.py |
| test_psychic.py |
| test_readthedoc.py |
| test_telegram.py |
| test_web_base.py |
| test_youtube.py |
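
The behaviour listed in the commit message (metadata from the entry, `page_content` from the referenced PDF, fallback to `abstract`, Zotero-style `file` parsing) could be sketched roughly as follows. This is an illustration, not the loader's actual code: the function names, the regex, the assumed `description:path:mime/type` layout of the Zotero `file` field, and the injected `read_pdf` callable are all hypothetical. The sketch works on plain dicts such as a BibTeX parser might produce, and leaves PDF extraction to the caller so that it runs with the standard library alone.

```python
import re
from pathlib import Path
from typing import Callable, Mapping, Optional

# Assumption: a Zotero-style `file` field looks like
# "description:path:mime/type", with several attachments joined by ";".
# This pattern is illustrative and does not handle escaped colons.
ZOTERO_FILE_RE = re.compile(
    r"(?:^|;)[^:;]*:(?P<path>[^;]*?\.pdf):application/pdf", re.IGNORECASE
)


def pdf_path_from_entry(entry: Mapping[str, str]) -> Optional[str]:
    """Return the PDF path named in the entry's `file` field, if any."""
    raw = entry.get("file", "").strip()
    if not raw:
        return None
    match = ZOTERO_FILE_RE.search(raw)
    # If the field is not in the assumed Zotero layout, treat it as a plain path.
    return match.group("path") if match else raw


def page_content_for(
    entry: Mapping[str, str], read_pdf: Callable[[str], str]
) -> str:
    """Use the PDF text when the file exists, otherwise the abstract."""
    path = pdf_path_from_entry(entry)
    if path and Path(path).is_file():
        try:
            return read_pdf(path)
        except Exception:
            pass  # unreadable PDF: fall through to the abstract
    return entry.get("abstract", "")


def entry_to_doc(entry: Mapping[str, str], read_pdf: Callable[[str], str]) -> dict:
    """Build a document-like dict: text plus the remaining fields as metadata."""
    # Assumption: every field except `file` is kept as metadata.
    metadata = {key: value for key, value in entry.items() if key != "file"}
    return {"page_content": page_content_for(entry, read_pdf), "metadata": metadata}


if __name__ == "__main__":
    example = {
        "ID": "doe2020",
        "title": "A Title",
        "abstract": "Short abstract.",
        "file": "Full Text PDF:/no/such/dir/paper.pdf:application/pdf",
    }
    # The PDF does not exist here, so the abstract is used as page_content.
    print(entry_to_doc(example, read_pdf=lambda path: ""))
```

In practice `read_pdf` might be a small wrapper around PyMuPDF that opens the file and joins the text of each page. Passing it in as a parameter keeps the fallback logic testable without any PDF on disk.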