You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/tests/unit_tests/document_loaders
Eugene Yurtsev 5cfa72a130
Bibtex integration for document loader and retriever (#5137)
# Bibtex integration

Wrap bibtexparser to retrieve a list of docs from a bibtex file.
* Get the metadata from the bibtex entries
* `page_content` get from the local pdf referenced in the `file` field
of the bibtex entry using `pymupdf`
* If no valid pdf file, `page_content` set to the `abstract` field of
the bibtex entry
* Support Zotero flavour using regex to get the file path
* Added usage example in
`docs/modules/indexes/document_loaders/examples/bibtex.ipynb`
---------

Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr>
Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>
1 year ago
..
blob_loaders fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
loaders fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
parsers Add html parsers (#4874) 1 year ago
sample_documents Bibtex integration for document loader and retriever (#5137) 1 year ago
test_docs Allow readthedoc loader to pass custom html tag (#5175) 1 year ago
__init__.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_base.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_bibtex.py Bibtex integration for document loader and retriever (#5137) 1 year ago
test_bshtml.py Add html parsers (#4874) 1 year ago
test_confluence.py Add Confluence Loader unit tests (#3333) 1 year ago
test_csv_loader.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_detect_encoding.py feat #4479: TextLoader auto detect encoding and improved exceptions (#4927) 1 year ago
test_evernote_loader.py feature/4493 Improve Evernote Document Loader (#4577) 1 year ago
test_generic_loader.py Add a generic document loader (#4875) 1 year ago
test_json_loader.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_psychic.py Harrison/psychic (#5063) 1 year ago
test_readthedoc.py Allow readthedoc loader to pass custom html tag (#5175) 1 year ago
test_telegram.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_web_base.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago
test_youtube.py fix(document_loaders/telegram): fix pandas calls + add tests (#4806) 1 year ago