langchain/docs/extras/integrations
Patrick Loeber 5990651070
Add new document_loader: AssemblyAIAudioTranscriptLoader (#9667)
This PR adds a new document loader `AssemblyAIAudioTranscriptLoader`
that allows to transcribe audio files with the [AssemblyAI
API](https://www.assemblyai.com) and loads the transcribed text into
documents.

- Add new document_loader with class `AssemblyAIAudioTranscriptLoader`
- Add optional dependency `assemblyai`
- Add unit tests (using a Mock client)
- Add docs notebook

This is the equivalent to the JS integration already available in
LangChain.js. See the [LangChain JS docs AssemblyAI
page](https://js.langchain.com/docs/modules/data_connection/document_loaders/integrations/web_loaders/assemblyai_audio_transcription).

At its simplest, you can use the loader to get a transcript back from an
audio file like this:

```python
from langchain.document_loaders.assemblyai import AssemblyAIAudioTranscriptLoader

loader =  AssemblyAIAudioTranscriptLoader(file_path="./testfile.mp3")
docs = loader.load()
```

To use it, it needs the `assemblyai` python package installed, and the
environment variable `ASSEMBLYAI_API_KEY` set with your API key.
Alternatively, the API key can also be passed as an argument.

Twitter handles to shout out if so kindly 🙇
[@AssemblyAI](https://twitter.com/AssemblyAI) and
[@patloeber](https://twitter.com/patloeber)

---------

Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2023-08-23 22:51:19 -07:00
..
callbacks docs:misc fixes (#9671) 2023-08-23 22:36:54 -07:00
chat Adds ChatOllama (#9628) 2023-08-23 13:02:26 -07:00
document_loaders Add new document_loader: AssemblyAIAudioTranscriptLoader (#9667) 2023-08-23 22:51:19 -07:00
document_transformers Added new use case docs for Web Scraping, Chromium loader, BS4 transformer (#8732) 2023-08-11 11:46:59 -07:00
llms Updates to Nomic Atlas and GPT4All documentation (#9414) 2023-08-23 17:49:44 -07:00
memory Integrate Rockset as a chat history store (#8940) 2023-08-08 18:54:07 -07:00
providers docs:misc fixes (#9671) 2023-08-23 22:36:54 -07:00
retrievers Add support for structured data sources with google enterprise search (#9037) 2023-08-22 23:18:10 -04:00
text_embedding Updates to Nomic Atlas and GPT4All documentation (#9414) 2023-08-23 17:49:44 -07:00
toolkits Add AINetwork blockchain toolkit integration (#9527) 2023-08-22 08:03:33 -07:00
tools Add LangChain utility for real-time crypto exchange prices (#4501) 2023-08-11 14:45:06 -07:00
vectorstores Updates to Nomic Atlas and GPT4All documentation (#9414) 2023-08-23 17:49:44 -07:00