langchain/docs/modules/indexes
qued e4224a396b
feat: Add UnstructuredXMLLoader for .xml files (#5955)
# Unstructured XML Loader
Adds an `UnstructuredXMLLoader` class for .xml files. Works with
unstructured>=0.6.7. A plain text representation of the text with the
XML tags will be available under the `page_content` attribute in the
doc.

### Testing
```python
from langchain.document_loaders import UnstructuredXMLLoader

loader = UnstructuredXMLLoader(
    "example_data/factbook.xml",
)
docs = loader.load()
```


## Who can review?

@hwchase17 
@eyurtsev
2023-06-10 16:24:42 -07:00
..
document_loaders/examples feat: Add UnstructuredXMLLoader for .xml files (#5955) 2023-06-10 16:24:42 -07:00
retrievers/examples Fixed typo missing "use" (#5991) 2023-06-10 14:31:58 -07:00
text_splitters Add start index to metadata in TextSplitter (#5912) 2023-06-08 23:09:32 -07:00
vectorstores Add a new vector store - AwaDB (#5971) (#5992) 2023-06-10 15:42:32 -07:00
document_loaders.rst Create Airtable loader (#5958) 2023-06-10 15:43:18 -07:00
getting_started.ipynb Update getting_started.ipynb (#4850) 2023-05-17 13:19:14 -07:00
retrievers.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
text_splitters.rst code splitter docs (#5480) 2023-05-31 07:11:53 -07:00
vectorstores.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00