Mirror of https://github.com/hwchase17/langchain (synced 2024-11-18 09:25:54 +00:00)
a97e4252e3
# Unstructured Excel Loader

Adds an `UnstructuredExcelLoader` class for `.xlsx` and `.xls` files. Works with `unstructured>=0.6.7`. A plain text representation of the Excel file is available under the `page_content` attribute of each document. If you use the loader in `"elements"` mode, an HTML representation of the Excel file is available under the `text_as_html` metadata key. Each sheet in the Excel document is its own document.

### Testing

```python
from langchain.document_loaders import UnstructuredExcelLoader

loader = UnstructuredExcelLoader(
    "example_data/stanley-cups.xlsx", mode="elements"
)
docs = loader.load()
```

## Who can review?

@hwchase17 @eyurtsev
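As a rough illustration of how the `text_as_html` metadata key described above might be consumed, the sketch below collects the HTML value from each document that carries one. It does not import langchain: `Doc` is a hypothetical stand-in exposing only the `page_content` and `metadata` attributes, `html_tables` is a helper name made up for this example, and the sample values are invented.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable, List


@dataclass
class Doc:
    """Hypothetical stand-in for a loaded document (not the langchain class)."""

    page_content: str
    metadata: Dict[str, Any] = field(default_factory=dict)


def html_tables(docs: Iterable[Doc]) -> List[str]:
    """Return the `text_as_html` value of every document that has one."""
    return [d.metadata["text_as_html"] for d in docs if "text_as_html" in d.metadata]


# Invented sample data: one document with an HTML rendering, one without.
docs = [
    Doc(
        page_content="Team Location Stanley Cups",
        metadata={"text_as_html": "<table><tr><td>Team</td></tr></table>"},
    ),
    Doc(page_content="plain text only"),
]

tables = html_tables(docs)
print(len(tables))
```

With real loader output, the same helper could presumably be applied to the list returned by `loader.load()` in `"elements"` mode, since only the two attributes above are used.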
| Name |
|---|
| `parsers` |
| `__init__.py` |
| `test_arxiv.py` |
| `test_bigquery.py` |
| `test_bilibili.py` |
| `test_blockchain.py` |
| `test_confluence.py` |
| `test_dataframe.py` |
| `test_duckdb.py` |
| `test_email.py` |
| `test_excel.py` |
| `test_facebook_chat.py` |
| `test_figma.py` |
| `test_gitbook.py` |
| `test_github.py` |
| `test_ifixit.py` |
| `test_joplin.py` |
| `test_json_loader.py` |
| `test_mastodon.py` |
| `test_max_compute.py` |
| `test_modern_treasury.py` |
| `test_odt.py` |
| `test_pdf.py` |
| `test_pyspark_dataframe_loader.py` |
| `test_python.py` |
| `test_sitemap.py` |
| `test_slack.py` |
| `test_spreedly.py` |
| `test_stripe.py` |
| `test_unstructured.py` |
| `test_url_playwright.py` |
| `test_url.py` |
| `test_whatsapp_chat.py` |