langchain/docs/extras/integrations
Lars von Wedel 6d82503eb1
Add parser and loader for Azure document intelligence service. (#10136)
Hi,

this PR contains loader / parser for Azure Document intelligence which
is a ML-based service to ingest arbitrary PDFs / images, even if
scanned. The loader generates Documents by pages of the original
document. This is my first contribution to LangChain.

Unfortunately I could not find the correct place for test cases. Happy
to add one if you can point me to the location, but as this is a
cloud-based service, a test would require network access and credentials
- so might be of limited help.

Dependencies: The needed dependency was already part of pyproject.toml,
no change.
Twitter: feel free to mention @LarsAC on the announcement
2023-09-03 14:25:39 -07:00
..
callbacks rm base64 images from docs (#10110) 2023-09-01 15:15:12 -07:00
chat feat: add bedrock chat model (#8017) 2023-09-01 13:16:57 -07:00
chat_loaders iMessage loader (#9832) 2023-08-28 13:43:59 -07:00
document_loaders Add parser and loader for Azure document intelligence service. (#10136) 2023-09-03 14:25:39 -07:00
document_transformers Added a pdf parser based on DocAI (#9579) 2023-08-24 21:44:49 -07:00
llms added support for inference from Model Garden (#9367) 2023-09-01 15:58:21 -07:00
memory Pre-release Xata version no longer required (#9915) 2023-08-29 07:21:22 -07:00
providers rm base64 images from docs (#10110) 2023-09-01 15:15:12 -07:00
retrievers Update google drive notebooks (#9851) 2023-08-28 19:29:35 -07:00
text_embedding EdenAI LLM update. Add models name option (#8963) 2023-09-01 12:11:33 -07:00
toolkits docs: google-drive-tool example fix (#10000) 2023-09-01 13:31:26 -07:00
tools rm base64 images from docs (#10110) 2023-09-01 15:15:12 -07:00
vectorstores implement vectorstores by tencent vectordb (#9989) 2023-08-31 00:48:25 -07:00