langchain/docs
Chetanya Rastogi 50c511d75f
Add new loader to load pdf as html content (#2607)
Adds a new PDF loader using the existing dependency on PDFMiner.

The new loader can help with chunking text semantically into sections:
the output HTML can be parsed with `BeautifulSoup` to recover structured
information such as font sizes, page numbers, and PDF headers/footers,
which other PDF loaders may not expose.
2023-04-09 17:57:25 -07:00
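
As a rough illustration of the idea in the commit message, the sketch below groups text into sections wherever the font size changes. It is not the loader's code. The sample markup (spans carrying an inline `font-size:Npx` style) is an assumption about what PDFMiner-style HTML looks like, `FontSizeParser` and `group_by_font_size` are hypothetical names, and the standard library's `html.parser` stands in for `BeautifulSoup` so the example has no third-party dependencies.

```python
import re
from html.parser import HTMLParser

# Assumption: font sizes appear as inline styles such as "font-size:12px".
FONT_SIZE_RE = re.compile(r"font-size:\s*(\d+)px")


class FontSizeParser(HTMLParser):
    """Collect (font_size, text) pairs from <span style="...font-size:Npx"> elements."""

    def __init__(self):
        super().__init__()
        self._size_stack = []  # font size of each currently open span (None if unknown)
        self.snippets = []     # (font_size, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            style = dict(attrs).get("style") or ""
            match = FONT_SIZE_RE.search(style)
            self._size_stack.append(int(match.group(1)) if match else None)

    def handle_endtag(self, tag):
        if tag == "span" and self._size_stack:
            self._size_stack.pop()

    def handle_data(self, data):
        text = data.strip()
        # Keep only text inside a span whose font size is known.
        if text and self._size_stack and self._size_stack[-1] is not None:
            self.snippets.append((self._size_stack[-1], text))


def group_by_font_size(html):
    """Merge consecutive snippets that share a font size into one section."""
    parser = FontSizeParser()
    parser.feed(html)
    parser.close()
    sections = []
    for size, text in parser.snippets:
        if sections and sections[-1][0] == size:
            sections[-1] = (size, sections[-1][1] + " " + text)
        else:
            sections.append((size, text))
    return sections


# Hypothetical sample resembling PDF-to-HTML output; real output may differ.
SAMPLE_HTML = """
<div><span style="font-family: Times; font-size:18px">Introduction</span></div>
<div><span style="font-family: Times; font-size:10px">First paragraph.</span>
<span style="font-family: Times; font-size:10px">Second paragraph.</span></div>
<div><span style="font-family: Times; font-size:18px">Methods</span></div>
"""

if __name__ == "__main__":
    for size, text in group_by_font_size(SAMPLE_HTML):
        print(size, text)
```

In this sketch a larger font size would typically mark a heading and a smaller one body text, so a change in size is treated as a section boundary. Real documents may need extra heuristics, for example to filter out headers and footers.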
_static Harrison/apify (#2215) 2023-03-30 20:58:14 -07:00
ecosystem Add Streaming for GPT4All (#2642) 2023-04-09 17:54:26 -07:00
getting_started bump version to 131 (#2391) 2023-04-04 07:21:50 -07:00
modules Add new loader to load pdf as html content (#2607) 2023-04-09 17:57:25 -07:00
reference Harrison/llama (#2314) 2023-04-02 14:57:45 -07:00
tracing bump version to 131 (#2391) 2023-04-04 07:21:50 -07:00
use_cases Harrison/docs cleanup (#2633) 2023-04-09 12:55:22 -07:00
conf.py Corrects copyright year (#1762) 2023-03-18 19:55:05 -07:00
deployments.md docs(deployment): add langchain-serve (#2006) 2023-03-27 23:32:04 -07:00
ecosystem.rst
gallery.rst docs: update gpt index references to LlamaIndex (#1856) 2023-03-21 22:01:05 -07:00
glossary.md big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
index.rst cr 2023-04-09 13:10:46 -07:00
make.bat
Makefile Feature: linkcheck-action (#534) (#542) 2023-01-04 21:39:50 -08:00
model_laboratory.ipynb big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
reference.rst Feature: linkcheck-action (#534) (#542) 2023-01-04 21:39:50 -08:00
requirements.txt Harrison/docs reqs (#2199) 2023-03-30 08:20:30 -07:00
tracing.md Harrison/tracing docs (#806) 2023-01-29 20:49:35 -08:00