langchain/libs/community/langchain_community/document_loaders/parsers
Edwin Wenink 9fb09c1c30
community: fix the "page" mode in the AzureAIDocumentIntelligenceParser (bug) (#15958)
**Description**: the "page" mode in the
AzureAIDocumentIntelligenceParser is not accessible due to a wrong
membership test. The mode argument can only be a string (also see the
assertion in the `__init__`: `assert self.mode in ["single", "page",
"object", "markdown"]`, so the check `elif self.mode == ["page"]:`
always fails.
As a result, effectively the "object" mode is used when selecting the
"page" mode, which may lead to errors.

The docstring of the `AzureAIDocumentIntelligenceLoader` also ommitted
the `mode` parameter alltogether, so I added it.

**Issue**: I could not find a related issue (this class is only 3 weeks
old anyways)

**Dependencies**: this PR does not introduce or affect dependencies.

The current demo notebook and examples are not affected because they all
use the default markdown mode.
2024-01-12 11:01:28 -08:00
..
html
language
__init__.py community[minor]: Azure DocumentIntelligenceLoader/Parser support update with latest SDK (#14389) 2023-12-21 16:40:27 -08:00
audio.py
doc_intelligence.py community: fix the "page" mode in the AzureAIDocumentIntelligenceParser (bug) (#15958) 2024-01-12 11:01:28 -08:00
docai.py
generic.py
grobid.py
msword.py
pdf.py docs, experimental[patch], langchain[patch], community[patch]: update storage imports (#15429) 2024-01-02 16:47:11 -05:00
registry.py
txt.py