langchain/langchain
Matt Robinson 2f15c11b87
feat: document loader for MS Word documents (#1282)
### Summary

Adds a document loader for MS Word Documents. Works with both `.docx`
and `.doc` files as long as the user has installed
`unstructured>=0.4.11`.

### Testing

The following workflow tests the loader for both `.doc` and `.docx` files
using example docs from the `unstructured` repo.

#### `.docx`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.docx"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```

#### `.doc`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.doc"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```
1 year ago
agents searx: remove duplicate param (#1219) 1 year ago
callbacks rfc: callback changes (#1165) 1 year ago
chains Harrison/source docs (#1275) 1 year ago
docstore Harrison/wiki update (#622) 1 year ago
document_loaders feat: document loader for MS Word documents (#1282) 1 year ago
embeddings Harrison/cohere params (#1278) 1 year ago
evaluation Refactor some loops into list comprehensions (#1185) 1 year ago
graphs catch networkx error (#1201) 1 year ago
indexes Harrion/kg (#1016) 1 year ago
llms cleanup (#1274) 1 year ago
prompts Harrison/semantic subset (#1079) 1 year ago
tools add ifttt tool (#1244) 1 year ago
utilities Harrison/errors (#1276) 1 year ago
vectorstores Harrison/add documents (#1197) 1 year ago
__init__.py Add Writer, Banana, Modal, StochasticAI (#1270) 1 year ago
cache.py Refactor some loops into list comprehensions (#1185) 1 year ago
docker-compose.yaml add tracing support to langchain (#741) 1 year ago
example_generator.py Harrison/improve cache (#368) 1 year ago
formatting.py initial commit 2 years ago
input.py Add asyncio support for LLM (OpenAI), Chain (LLMChain, LLMMathChain), and Agent (#841) 1 year ago
model_laboratory.py Harrison/improve cache (#368) 1 year ago
py.typed Add py.typed marker to package (#121) 2 years ago
python.py Harrison/tools exp (#372) 1 year ago
requests.py LLMRequestsChain (#267) 2 years ago
schema.py add tracing support to langchain (#741) 1 year ago
serpapi.py move serpapi wrapper (#1199) 1 year ago
server.py add tracing support to langchain (#741) 1 year ago
sql_database.py fix sqlite internal tables breaking table_info (#1224) 1 year ago
text_splitter.py fix bug with length function (#1257) 1 year ago
utils.py Harrison/bing wrapper (#656) 1 year ago