mirror of
https://github.com/hwchase17/langchain
synced 2024-10-31 15:20:26 +00:00
5a084e1b20
New HTML loader that asynchronously loader a list of urls. New transformer using [HTML2Text](https://github.com/Alir3z4/html2text/) for HTML to clean, easy-to-read plain ASCII text (valid Markdown). |
||
---|---|---|
.. | ||
_category_.yml | ||
doctran_extract_properties.ipynb | ||
doctran_interrogate_document.ipynb | ||
doctran_translate_document.ipynb | ||
html2text.ipynb | ||
openai_metadata_tagger.ipynb |