langchain/docs/modules/indexes
Eugene Yurtsev 5cfa72a130
Bibtex integration for document loader and retriever (#5137)
# Bibtex integration

Wrap bibtexparser to retrieve a list of docs from a bibtex file.
* Get the metadata from the bibtex entries
* `page_content` get from the local pdf referenced in the `file` field
of the bibtex entry using `pymupdf`
* If no valid pdf file, `page_content` set to the `abstract` field of
the bibtex entry
* Support Zotero flavour using regex to get the file path
* Added usage example in
`docs/modules/indexes/document_loaders/examples/bibtex.ipynb`
---------

Co-authored-by: Sébastien M. Popoff <sebastien.popoff@espci.fr>
Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>
2023-05-25 00:21:31 -07:00
..
document_loaders/examples Bibtex integration for document loader and retriever (#5137) 2023-05-25 00:21:31 -07:00
retrievers/examples tfidf retriever (#5114) 2023-05-24 10:02:09 -07:00
text_splitters docs: text splitters improvements (#4490) 2023-05-17 21:33:34 -07:00
vectorstores Improve weaviate vectorstore docs (#5201) 2023-05-24 09:31:48 -07:00
document_loaders.rst nit (#5208) 2023-05-24 12:52:20 -07:00
getting_started.ipynb Update getting_started.ipynb (#4850) 2023-05-17 13:19:14 -07:00
retrievers.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
text_splitters.rst docs: text splitters improvements (#4490) 2023-05-17 21:33:34 -07:00
vectorstores.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00