forked from Archives/langchain
You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
3830d900bf
### Summary Adds a document loader for MS Word Documents. Works with both `.docx` and `.doc` files as longer as the user has installed `unstructured>=0.4.11`. ### Testing The follow workflow test the loader for both `.doc` and `.docx` files using example docs from the `unstructured` repo. #### `.docx` ```python from langchain.document_loaders import UnstructuredWordDocumentLoader filename = "../unstructured/example-docs/fake.docx" loader = UnstructuredWordDocumentLoader(filename) loader.load() ``` #### `.doc` ```python from langchain.document_loaders import UnstructuredWordDocumentLoader filename = "../unstructured/example-docs/fake.doc" loader = UnstructuredWordDocumentLoader(filename) loader.load() ``` |
1 year ago | |
---|---|---|
.. | ||
_static | 1 year ago | |
ecosystem | 1 year ago | |
getting_started | 1 year ago | |
modules | 1 year ago | |
reference | 1 year ago | |
tracing | 1 year ago | |
use_cases | 1 year ago | |
Makefile | 1 year ago | |
conf.py | 1 year ago | |
deployments.md | 1 year ago | |
ecosystem.rst | 1 year ago | |
gallery.rst | 1 year ago | |
glossary.md | 1 year ago | |
index.rst | 1 year ago | |
make.bat | 2 years ago | |
reference.rst | 1 year ago | |
requirements.txt | 1 year ago | |
tracing.md | 1 year ago |