langchain/docs/modules/indexes/document_loaders/examples
leo-gan 36c59e0c25
Arxiv document loader (#3627)
It makes sense to use `arxiv` as another source of the documents for
downloading.
- Added the `arxiv` document_loader, based on the
`utilities/arxiv.py:ArxivAPIWrapper`
- added tests
- added an example notebook
- sorted `__all__` in `__init__.py` (otherwise it is hard to find a
class in the very long list)
2023-04-26 21:04:56 -07:00
..
example_data Add ChatGPT Data Loader (#3336) 2023-04-22 09:06:24 -07:00
airbyte_json.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
apify_dataset.ipynb Harrison/apify (#2215) 2023-03-30 20:58:14 -07:00
arxiv.ipynb Arxiv document loader (#3627) 2023-04-26 21:04:56 -07:00
azlyrics.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
azure_blob_storage_container.ipynb Harrison/site map (#2061) 2023-03-27 16:28:08 -07:00
azure_blob_storage_file.ipynb Harrison/site map (#2061) 2023-03-27 16:28:08 -07:00
bigquery.ipynb Harrison/big query (#2100) 2023-03-28 08:17:22 -07:00
bilibili.ipynb Added bilibili loader (#2673) (#2724) 2023-04-11 10:40:32 -07:00
blackboard.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
blockchain.ipynb Update Alchemy Key URL (#3559) 2023-04-25 16:08:42 -07:00
chatgpt_loader.ipynb Add ChatGPT Data Loader (#3336) 2023-04-22 09:06:24 -07:00
college_confidential.ipynb Harrison/site map (#2061) 2023-03-27 16:28:08 -07:00
confluence.ipynb Harrison/output error (#3094) 2023-04-18 08:59:56 -07:00
CoNLL-U.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
copypaste.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
csv.ipynb [Docs] minor fixes to loaders links and rst warnings (#2846) 2023-04-13 10:54:40 -07:00
dataframe.ipynb Harrison/document cleanup (#2062) 2023-03-27 16:32:55 -07:00
diffbot.ipynb Harrison/diffbot (#2984) 2023-04-16 09:11:24 -07:00
directory_loader.ipynb Adds progress bar using tqdm to directory_loader (#3349) 2023-04-24 21:42:42 -07:00
discord_loader.ipynb update nnotebook title 2023-04-20 11:53:23 -07:00
duckdb.ipynb Harrison/duckdb (#2064) 2023-03-27 19:51:34 -07:00
email.ipynb Harrison/msg files (#2375) 2023-04-04 06:48:34 -07:00
epub.ipynb bump version to 128 (#2236) 2023-03-31 11:16:21 -07:00
evernote.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
facebook_chat.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
figma.ipynb [Documents] Updated Figma docs and added example (#2172) 2023-03-29 22:11:45 -07:00
gcs_directory.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
gcs_file.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
git.ipynb Add file filter param to Git loader (#2904) 2023-04-14 10:45:54 -07:00
gitbook.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
googledrive.ipynb Fix docs error for google drive loader (#3574) 2023-04-25 22:52:59 -07:00
gutenberg.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
hn.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
html.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
hugging_face_dataset.ipynb Harrison/hf document loader (#3394) 2023-04-23 10:17:43 -07:00
ifixit.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
image_captions.ipynb Harrison/image caption loader (#3051) 2023-04-17 20:49:10 -07:00
image.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
imsdb.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
markdown.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
notebook.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
notion.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
notiondb.ipynb feat: Add Notion database document loader (#2056) 2023-03-28 08:07:09 -07:00
obsidian.ipynb Harrison/obsidian (#3060) 2023-04-17 21:57:32 -07:00
pdf.ipynb Add an example tutorial for using PDFMinerPDFasHTMLLoader (#2960) 2023-04-16 08:34:39 -07:00
powerpoint.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
readthedocs_documentation.ipynb GuessedAtParserWarning from RTD document loader documentation example (#3397) 2023-04-24 21:54:39 -07:00
roam.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
s3_directory.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
s3_file.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
sitemap.ipynb docs: tiny fix on docs verbiage (#2124) 2023-03-28 22:56:29 -07:00
slack_directory.ipynb Add Slack Directory Loader (#2841) 2023-04-13 21:31:59 -07:00
srt.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
telegram.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
twitter.ipynb Harrison/output error (#3094) 2023-04-18 08:59:56 -07:00
unstructured_file.ipynb Update unstructured_file.ipynb (#3377) 2023-04-23 21:22:38 -07:00
url.ipynb Harrison/playwright (#2871) 2023-04-13 22:15:03 -07:00
web_base.ipynb [Docs] minor fixes to loaders links and rst warnings (#2846) 2023-04-13 10:54:40 -07:00
whatsapp_chat.ipynb Harrison/whatsapp loader (#2085) 2023-03-27 23:43:45 -07:00
word_document.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
youtube.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00