You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/libs/community/langchain_community/document_loaders/parsers
hulitaitai 7d0a008744
community[minor]: Add audio-parser "faster-whisper" in audio.py (#20012)
faster-whisper is a reimplementation of OpenAI's Whisper model using
CTranslate2, which is up to 4 times faster than enai/whisper for the
same accuracy while using less memory. The efficiency can be further
improved with 8-bit quantization on both CPU and GPU.

It can automatically detect the following 14 languages and transcribe
the text into their respective languages: en, zh, fr, de, ja, ko, ru,
es, th, it, pt, vi, ar, tr.

The gitbub repository for faster-whisper is :
    https://github.com/SYSTRAN/faster-whisper

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
5 months ago
..
html community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 9 months ago
language community: Add PHP language parser to document_loaders (#19850) 5 months ago
__init__.py community[patch]: speed up import times in the community package (#18928) 6 months ago
audio.py community[minor]: Add audio-parser "faster-whisper" in audio.py (#20012) 5 months ago
doc_intelligence.py community[patch]: Microsoft Azure Document Intelligence updates (#16932) 6 months ago
docai.py deprecating integrations moved to langchain_google_community (#19841) 5 months ago
generic.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 9 months ago
grobid.py community[patch]: Update grobid.py (#16298) 8 months ago
msword.py community[patch]: import flattening fix (#20110) 5 months ago
pdf.py community[patch]: import flattening fix (#20110) 5 months ago
registry.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 9 months ago
txt.py community[patch]: import flattening fix (#20110) 5 months ago
vsdx.py infra: add print rule to ruff (#16221) 7 months ago