bugfix: return whole document when loading with ConfluenceLoader.load by label (#3980)
The method confluence.get_all_pages_by_label returns only metadata (page id, title, ...) for the pages that carry a given label, not their content. To load the full documents for a label, collect the page ids returned for that label and then fetch the page content by those ids.
---------
Co-authored-by: Andrea Biondo <a.biondo@reply.it>
parent 5f30cc8713
commit d18e788ee3
@@ -219,9 +219,11 @@ class ConfluenceLoader(BaseLoader):
                 max_pages=max_pages,
                 expand="body.storage.value",
             )
-            docs += self.process_pages(
-                pages, include_restricted_content, include_attachments, include_comments
-            )
+            ids_by_label = [page["id"] for page in pages]
+            if page_ids:
+                page_ids = list(set(page_ids + ids_by_label))
+            else:
+                page_ids = list(set(ids_by_label))
 
         if cql:
             pages = self.paginate_request(
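The id-merging step added by this change can be sketched in isolation as follows. This is a minimal illustration, not the loader's code: the helper name merge_page_ids and the sample data are hypothetical, and the label lookup is assumed to return dicts that carry an "id" key, as the diff suggests. The sketch de-duplicates with dict.fromkeys to keep first-seen order, whereas the commit uses list(set(...)), which gives no ordering guarantee.

```python
from typing import Dict, List, Optional


def merge_page_ids(
    page_ids: Optional[List[str]], pages_by_label: List[Dict[str, str]]
) -> List[str]:
    """Combine explicitly requested ids with ids found via a label lookup.

    Hypothetical helper for illustration; it is not part of ConfluenceLoader.
    """
    # Assumption: each metadata entry from the label lookup has an "id" key.
    ids_by_label = [page["id"] for page in pages_by_label]
    # dict.fromkeys removes duplicates while keeping first-seen order.
    # The commit itself uses list(set(...)), which does not preserve order.
    return list(dict.fromkeys((page_ids or []) + ids_by_label))


# Made-up sample metadata, shaped like a label lookup result.
label_hits = [{"id": "101", "title": "Intro"}, {"id": "102", "title": "Setup"}]

merged = merge_page_ids(["102", "200"], label_hits)
only_label = merge_page_ids(None, label_hits)
```

With the merged list in hand, the loader can fetch full page content by id, so label-based loading returns whole documents instead of metadata only.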