forked from Archives/langchain
2f15c11b87
### Summary

Adds a document loader for MS Word documents. Works with both `.docx` and `.doc` files as long as the user has installed `unstructured>=0.4.11`.

### Testing

The following workflow tests the loader for both `.doc` and `.docx` files using example docs from the `unstructured` repo.

#### `.docx`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

# Path to the example .docx file in a local checkout of the unstructured repo
filename = "../unstructured/example-docs/fake.docx"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```

#### `.doc`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

# Path to the example .doc file in a local checkout of the unstructured repo
filename = "../unstructured/example-docs/fake.doc"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```

| Name |
|---|
| agents |
| chains |
| document_loaders |
| indexes |
| llms |
| memory |
| prompts |
| utils |
| agents.rst |
| chains.rst |
| document_loaders.rst |
| indexes.rst |
| llms.rst |
| memory.rst |
| prompts.rst |
| state_of_the_union.txt |
| utils.rst |
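
The commit message above says the Word loader needs `unstructured>=0.4.11`. As a rough sketch of how a caller might check that requirement before constructing the loader, the snippet below compares the installed version against that minimum using only the standard library. The helper names (`parse_version`, `meets_minimum`, `unstructured_is_ready`) are hypothetical and are not part of langchain or `unstructured`; the version parsing is deliberately simple and treats pre-release suffixes crudely.

```python
from importlib.metadata import PackageNotFoundError, version

# Minimum version stated in the commit message (unstructured>=0.4.11).
MIN_UNSTRUCTURED = (0, 4, 11)


def parse_version(text):
    """Turn '0.4.11' into (0, 4, 11); stop at the first non-numeric piece."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum=MIN_UNSTRUCTURED):
    """Return True if the installed version string is at least the minimum."""
    return parse_version(installed) >= minimum


def unstructured_is_ready():
    """Return True only if unstructured is installed and new enough."""
    try:
        return meets_minimum(version("unstructured"))
    except PackageNotFoundError:
        return False
```

A caller could run `unstructured_is_ready()` first and show an install hint when it returns `False`, rather than failing at load time. For strict version semantics, a dedicated version-parsing library would be a safer choice than this hand-rolled comparison.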