You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/libs/community/langchain_community/document_loaders
Bagatur 5efb5c099f
text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346)
7 months ago
..
blob_loaders community[patch]: doc loaders mypy fixes (#17368) 7 months ago
parsers text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346) 7 months ago
__init__.py community[minor]: Add `SQLDatabaseLoader` document loader (#18281) 7 months ago
acreom.py
airbyte.py
airbyte_json.py
airtable.py
apify_dataset.py
arcgis_loader.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
arxiv.py Update arxiv.py with get_summaries_as_docs inside of Arxivloader (#14953) 9 months ago
assemblyai.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
astradb.py community[patch]: Add AstraDBLoader docstring (#17873) 7 months ago
async_html.py
athena.py community[patch]: document_loaders: modified athena key logic to handle s3 uris without a prefix (#17526) 7 months ago
azlyrics.py
azure_ai_data.py
azure_blob_storage_container.py
azure_blob_storage_file.py
baiducloud_bos_directory.py
baiducloud_bos_file.py
base.py text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346) 7 months ago
base_o365.py
bibtex.py
bigquery.py
bilibili.py
blackboard.py infra: add print rule to ruff (#16221) 8 months ago
blockchain.py
brave_search.py
browserless.py
cassandra.py community: Fix some mypy types in cassandra doc loader (#17570) 7 months ago
chatgpt.py
chm.py community[patch]: docstrings (#16810) 8 months ago
chromium.py
college_confidential.py
concurrent.py
confluence.py infra: add print rule to ruff (#16221) 8 months ago
conllu.py
couchbase.py
csv_loader.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
cube_semantic.py
datadog_logs.py
dataframe.py
diffbot.py
directory.py community[minor]: add exclude parameter to DirectoryLoader (#17316) 7 months ago
discord.py
doc_intelligence.py infra: add -p to mkdir in lint steps (#17013) 8 months ago
docugami.py
docusaurus.py
dropbox.py infra: add print rule to ruff (#16221) 8 months ago
duckdb_loader.py
email.py
epub.py
etherscan.py infra: add print rule to ruff (#16221) 8 months ago
evernote.py
excel.py Docs: fix excel document loader typo (#15470) 9 months ago
facebook_chat.py
fauna.py
figma.py
gcs_directory.py
gcs_file.py fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647) 9 months ago
generic.py text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346) 7 months ago
geodataframe.py
git.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
gitbook.py
github.py community[patch]: Add Pagination to GitHubIssuesLoader for Efficient GitHub Issues Retrieval (#16934) 7 months ago
google_speech_to_text.py
googledrive.py infra: add print rule to ruff (#16221) 8 months ago
gutenberg.py
helpers.py
hn.py
html.py
html_bs.py fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647) 9 months ago
hugging_face_dataset.py
hugging_face_model.py community[minor]: add hugging_face_model document loader (#17323) 7 months ago
ifixit.py
image.py
image_captions.py
imsdb.py
iugu.py
joplin.py
json_loader.py
lakefs.py
larksuite.py
markdown.py corrected outdated link (#15053) 9 months ago
mastodon.py
max_compute.py
mediawikidump.py text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346) 7 months ago
merge.py langchain[minor],community[minor]: Add async methods in BaseLoader (#16634) 8 months ago
mhtml.py fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647) 9 months ago
modern_treasury.py
mongodb.py
news.py
notebook.py
notion.py
notiondb.py community[patch]: support query filters for NotionDBLoader (#17217) 7 months ago
nuclia.py infra: add print rule to ruff (#16221) 8 months ago
obs_directory.py
obs_file.py
obsidian.py
odt.py
onedrive.py
onedrive_file.py
onenote.py infra: add print rule to ruff (#16221) 8 months ago
open_city_data.py
org_mode.py
pdf.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
pebblo.py community[patch]: Fix pwd import that is not available on windows (#17532) 7 months ago
polars_dataframe.py
powerpoint.py
psychic.py
pubmed.py
pyspark_dataframe.py
python.py
quip.py
readthedocs.py
recursive_url_loader.py community[patch]: doc loaders mypy fixes (#17368) 7 months ago
reddit.py
roam.py
rocksetdb.py
rspace.py fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647) 9 months ago
rss.py
rst.py
rtf.py
s3_directory.py
s3_file.py
sharepoint.py
sitemap.py
slack_directory.py
snowflake_loader.py infra: add print rule to ruff (#16221) 8 months ago
spreedly.py
sql_database.py community[minor]: Add `SQLDatabaseLoader` document loader (#18281) 7 months ago
srt.py
stripe.py
surrealdb.py community[patch]: SurrealDB fix for asyncio (#16092) 8 months ago
telegram.py text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346) 7 months ago
tencent_cos_directory.py
tencent_cos_file.py fix: correct spelling mistakes of "seperate, intialise, pre-defined" (#14647) 9 months ago
tensorflow_datasets.py
text.py
tidb.py community[minor]: Add tidb loader support (#17788) 7 months ago
tomarkdown.py
toml.py infra: add print rule to ruff (#16221) 8 months ago
trello.py
tsv.py
twitter.py
unstructured.py community[patch]: Load list of files using UnstructuredFileLoader (#16216) 8 months ago
url.py
url_playwright.py community[proxy]: Enhancement/add proxy support playwrighturlloader 16751 (#16822) 7 months ago
url_selenium.py
vsdx.py community[minor]: New documents loader for visio files (with extension .vsdx) (#16171) 8 months ago
weather.py
web_base.py community[patch]: Add Cookie Support to Fetch Method (#16673) 8 months ago
whatsapp_chat.py
wikipedia.py
word_document.py
xml.py
xorbits.py
youtube.py community[patch]: docstrings (#16810) 8 months ago