You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/libs/community/langchain_community/document_loaders
Dominik Fladung ffa0c838d8
Allow ConfluenceLoader authorization via Personal Access Tokens (#25096)
- community: Allow authorization to Confluence with bearer token

- **Description:** Allow authorization to Confluence with [Personal
Access
Token](https://confluence.atlassian.com/enterprise/using-personal-access-tokens-1026032365.html)
by checking for the keys `['client_id', token: ['access_token',
'token_type']]`

- **Issue:** 

Currently the following error occurs when using an personal access token
for authorization.

```python
loader = ConfluenceLoader(
    url=os.getenv('CONFLUENCE_URL'),
    oauth2={
        'token': {"access_token": os.getenv("CONFLUENCE_ACCESS_TOKEN"), "token_type": "bearer"},
        'client_id': 'client_id',
    },
    page_ids=['12345678'], 
)
```

```
ValueError: Error(s) while validating input: ["You have either omitted require keys or added extra keys to the oauth2 dictionary. key values should be `['access_token', 'access_token_secret', 'consumer_key', 'key_cert']`"]
```

With this PR the loader runs as expected.

---------

Co-authored-by: Chester Curme <chester.curme@gmail.com>
2 months ago
..
blob_loaders infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
parsers community[patch]: Add missing annotations (#24890) 2 months ago
__init__.py community[minor]: Add ScrapingAnt Loader Community Integration (#24514) 2 months ago
acreom.py community[patch]: Add missing annotations (#24890) 2 months ago
airbyte.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
airbyte_json.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
airtable.py docs: fix kwargs docstring (#25010) 2 months ago
apify_dataset.py community[patch]: Update root_validators to use explicit pre=True or pre=False (#23736) 3 months ago
arcgis_loader.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
arxiv.py docs: Arxiv docs update (#23871) 3 months ago
assemblyai.py community[patch]: docstrings update (#20301) 6 months ago
astradb.py (all): update removal in deprecation warnings from 0.2 to 0.3 (#21265) 5 months ago
async_html.py community[patch]: Release 0.2.11 (#24989) 2 months ago
athena.py community: make AthenaLoader profile_name optional and fix type hint (#24958) 2 months ago
azlyrics.py
azure_ai_data.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
azure_blob_storage_container.py community[patch]: type ignore fixes (#18395) 7 months ago
azure_blob_storage_file.py
baiducloud_bos_directory.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
baiducloud_bos_file.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
base.py core: Move document loader interfaces to core (#17723) 7 months ago
base_o365.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
bibtex.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
bigquery.py (all): update removal in deprecation warnings from 0.2 to 0.3 (#21265) 5 months ago
bilibili.py community[patch]: docstrings update (#20301) 6 months ago
blackboard.py community: add flag to toggle progress bar (#24463) 2 months ago
blockchain.py
brave_search.py
browserbase.py community: updated Browserbase loader (#21757) 4 months ago
browserless.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
cassandra.py community[minor]: Add Cassandra ByteStore (#22064) 4 months ago
chatgpt.py
chm.py
chromium.py community[minor]: add user agent for web scraping loaders (#22480) 4 months ago
college_confidential.py
concurrent.py community[patch]: import flattening fix (#20110) 6 months ago
confluence.py Allow ConfluenceLoader authorization via Personal Access Tokens (#25096) 2 months ago
conllu.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
couchbase.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
csv_loader.py docs: Standardize DocumentLoader docstrings (#22932) 3 months ago
cube_semantic.py community[patch]: Implement lazy_load() for CubeSemanticLoader (#18535) 7 months ago
datadog_logs.py
dataframe.py community[patch]: support modin document loader (#18866) 7 months ago
dedoc.py community[minor]: added new document loaders based on dedoc library (#24303) 2 months ago
diffbot.py
directory.py community: glob multiple patterns when using DirectoryLoader (#22852) 3 months ago
discord.py
doc_intelligence.py docs: community docstring updates (#21040) 5 months ago
docugami.py community[patch]: Add missing annotations (#24890) 2 months ago
docusaurus.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
dropbox.py community[patch]: Add linter to catch @root_validator (#24070) 3 months ago
duckdb_loader.py
email.py community[patch]: Small Fix in OutlookMessageLoader (Close the Message once Open) (#22744) 4 months ago
epub.py
etherscan.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
evernote.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
excel.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
facebook_chat.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
fauna.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
figma.py
firecrawl.py community:Add support for specifying document_loaders.firecrawl api url. (#24747) 2 months ago
gcs_directory.py (all): update removal in deprecation warnings from 0.2 to 0.3 (#21265) 5 months ago
gcs_file.py (all): update removal in deprecation warnings from 0.2 to 0.3 (#21265) 5 months ago
generic.py community[patch]: import flattening fix (#20110) 6 months ago
geodataframe.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
git.py Merge pull request #18539 7 months ago
gitbook.py community: add flag to toggle progress bar (#24463) 2 months ago
github.py community: Implement lazy_load() for GithubFileLoader (#18584) 7 months ago
glue_catalog.py community[minor]: Add glue catalog loader (#20220) 5 months ago
google_speech_to_text.py (all): update removal in deprecation warnings from 0.2 to 0.3 (#21265) 5 months ago
googledrive.py fix: load google credentials properly in GoogleDriveLoader (#12871) 2 months ago
gutenberg.py
helpers.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
hn.py
html.py
html_bs.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
hugging_face_dataset.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
hugging_face_model.py community[patch]: Add missing annotations (#24890) 2 months ago
ifixit.py
image.py
image_captions.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
imsdb.py
iugu.py
joplin.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
json_loader.py docs: Standardize DocumentLoader docstrings (#22932) 3 months ago
kinetica_loader.py community[patch]: Kinetica Integrations handled error in querying; quotes in table names; updated gpudb API (#22724) 3 months ago
lakefs.py
larksuite.py community[minor]: Add LarkSuite wiki document loader. (#21016) 5 months ago
llmsherpa.py community[minor]: add support for llmsherpa (#19741) 6 months ago
markdown.py
mastodon.py Merge pull request #18671 7 months ago
max_compute.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
mediawikidump.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
merge.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
mhtml.py community[patch]: upgrade to recent version of mypy (#21616) 4 months ago
mintbase.py community[minor]: add mintbase loader to langchain (#20089) 5 months ago
modern_treasury.py
mongodb.py community: fix issue with nested field extraction in MongodbLoader (#22801) 3 months ago
news.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
notebook.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
notion.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
notiondb.py community: Fix KeyError in NotionDB loader when 'name' is missing (#24224) 2 months ago
nuclia.py
obs_directory.py
obs_file.py
obsidian.py community[patch]: Add missing annotations (#24890) 2 months ago
odt.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
onedrive.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
onedrive_file.py
onenote.py community[patch]: Add missing annotations (#24890) 2 months ago
open_city_data.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
oracleadb_loader.py community[minor]: add oracle autonomous database doc loader integration (#19536) 6 months ago
oracleai.py community[minor]: Oraclevs integration (#21123) 5 months ago
org_mode.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
pdf.py community[minor]: added new document loaders based on dedoc library (#24303) 2 months ago
pebblo.py community[minor]: [PebbloSafeLoader] Implement content-size-based batching (#24871) 2 months ago
polars_dataframe.py
powerpoint.py
psychic.py multiple: Remove unnecessary Ruff suppression comments (#21050) 5 months ago
pubmed.py community[patch]: upgrade to recent version of mypy (#21616) 4 months ago
pyspark_dataframe.py
python.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
quip.py community[major]: lint for usage of xml library (#22132) 4 months ago
readthedocs.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
recursive_url_loader.py docs, cli[patch]: document loaders doc template (#22862) 3 months ago
reddit.py
roam.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
rocksetdb.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
rspace.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
rss.py multiple: Remove unnecessary Ruff suppression comments (#21050) 5 months ago
rst.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
rtf.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
s3_directory.py community[patch]: Skip nested directories when using S3DirectoryLoader (#17829) 7 months ago
s3_file.py community[patch]: support unstructured_kwargs for s3 loader (#15473) 6 months ago
scrapfly.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
scrapingant.py community[minor]: Add ScrapingAnt Loader Community Integration (#24514) 2 months ago
sharepoint.py Enhance metadata of sharepointLoader. (#22248) 3 months ago
sitemap.py community[patch]: SitemapLoader restrict depth of parsing sitemap (CVE-2024-2965) (#22903) 3 months ago
slack_directory.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
snowflake_loader.py community[patch]: upgrade to recent version of mypy (#21616) 4 months ago
spider.py doc list not empty (#21208) 4 months ago
spreedly.py
sql_database.py community[patch]: restore compatibility with SQLAlchemy 1.x (#22546) 3 months ago
srt.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
stripe.py
surrealdb.py
telegram.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
tencent_cos_directory.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
tencent_cos_file.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
tensorflow_datasets.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
text.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
tidb.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
tomarkdown.py community[patch]: Update URL to the 2markdown API (#24546) 2 months ago
toml.py community: Use default load() implementation in doc loaders (#18385) 7 months ago
trello.py community: Implement lazy_load() for TrelloLoader (#18658) 7 months ago
tsv.py community: better support of pathlib paths in document loaders (#18396) 6 months ago
twitter.py
unstructured.py unstructured, community, initialize langchain-unstructured package (#22779) 2 months ago
url.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
url_playwright.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
url_selenium.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
vsdx.py community[patch]: import flattening fix (#20110) 6 months ago
weather.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
web_base.py community[patch]: Release 0.2.11 (#24989) 2 months ago
whatsapp_chat.py community: Implement lazy_load() for WhatsAppChatLoader (#18677) 7 months ago
wikipedia.py community[patch]: upgrade to recent version of mypy (#21616) 4 months ago
word_document.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
xml.py infra: update mypy 1.10, ruff 0.5 (#23721) 3 months ago
xorbits.py
youtube.py community[minor]: [GoogleApiYoutubeLoader] Replace API used in _get_document_for_channel from search to playlistItem (#24034) 2 months ago
yuque.py community[minor]: add Yuque document loader (#17924) 7 months ago