mirror of
https://github.com/hwchase17/langchain
synced 2024-10-29 17:07:25 +00:00
b4de839ed8
This PR addresses several improvements: - Previously it was not possible to load spaces of more than 100 pages. The `limit` was being used both as an overall page limit *and* as a per request pagination limit. This, in combination with the fact that atlassian seem to use a server-side hard limit of 100 when page content is expanded, meant it wasn't possible to download >100 pages. Now `limit` is used *only* as a per-request pagination limit and `max_pages` is introduced as the way to limit the total number of pages returned by the paginator. - Document metadata now includes `source` (the source url), making it compatible with `RetrievalQAWithSourcesChain`. - It is now possible to include inline and footer comments. - It is now possible to pass `verify_ssl=False` and other parameters to the confluence object for use cases that require it. |
||
---|---|---|
.. | ||
__init__.py | ||
test_bigquery.py | ||
test_bilibili.py | ||
test_bshtml.py | ||
test_confluence.py | ||
test_dataframe.py | ||
test_duckdb.py | ||
test_email.py | ||
test_figma.py | ||
test_gitbook.py | ||
test_ifixit.py | ||
test_pdf.py | ||
test_python.py | ||
test_sitemap.py | ||
test_slack.py | ||
test_url_playwright.py | ||
test_url.py |