langchain/docs/modules/indexes/document_loaders.rst

Document Loaders
==========================

.. note::
   `Conceptual Guide <https://docs.langchain.com/docs/components/indexing/document-loaders>`_


Combining language models with your own text data is a powerful way to differentiate them.
The first step in doing this is to load the data into "documents" - a fancy way of say some pieces of text.
This module is aimed at making this easy.

A primary driver of a lot of this is the `Unstructured <https://github.com/Unstructured-IO/unstructured>`_ python package.
This package is a great way to transform all types of files - text, powerpoint, images, html, pdf, etc - into text data.

For detailed instructions on how to get set up with Unstructured, see installation guidelines `here <https://github.com/Unstructured-IO/unstructured#coffee-getting-started>`_.

The following document loaders are provided:


.. toctree::
   :maxdepth: 1
   :glob:

   ./document_loaders/examples/*
Harrison/unstructured support (#903) 2023-02-06 07:02:07 +00:00			`Document Loaders`
			`==========================`

big docs refactor (#1978) Co-authored-by: Ankush Gola <ankush.gola@gmail.com> 2023-03-27 02:49:46 +00:00			`.. note::`
			`Conceptual Guide <https://docs.langchain.com/docs/components/indexing/document-loaders>`_


Harrison/unstructured support (#903) 2023-02-06 07:02:07 +00:00			`Combining language models with your own text data is a powerful way to differentiate them.`
			`The first step in doing this is to load the data into "documents" - a fancy way of say some pieces of text.`
			`This module is aimed at making this easy.`

			A primary driver of a lot of this is the `Unstructured <https://github.com/Unstructured-IO/unstructured>`_ python package.
			`This package is a great way to transform all types of files - text, powerpoint, images, html, pdf, etc - into text data.`

			For detailed instructions on how to get set up with Unstructured, see installation guidelines `here <https://github.com/Unstructured-IO/unstructured#coffee-getting-started>`_.

big docs refactor (#1978) Co-authored-by: Ankush Gola <ankush.gola@gmail.com> 2023-03-27 02:49:46 +00:00			`The following document loaders are provided:`
Harrison/unstructured support (#903) 2023-02-06 07:02:07 +00:00

			`.. toctree::`
			`:maxdepth: 1`
big docs refactor (#1978) Co-authored-by: Ankush Gola <ankush.gola@gmail.com> 2023-03-27 02:49:46 +00:00			`:glob:`
Harrison/unstructured support (#903) 2023-02-06 07:02:07 +00:00
big docs refactor (#1978) Co-authored-by: Ankush Gola <ankush.gola@gmail.com> 2023-03-27 02:49:46 +00:00			`./document_loaders/examples/*`