langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-29 17:07:25 +00:00

Patrick Loeber 5990651070 Add new document_loader: AssemblyAIAudioTranscriptLoader (#9667)	2023-08-23 22:51:19 -07:00

This PR adds a new document loader `AssemblyAIAudioTranscriptLoader` that allows to transcribe audio files with the [AssemblyAI API](https://www.assemblyai.com) and loads the transcribed text into documents.

- Add new document_loader with class `AssemblyAIAudioTranscriptLoader`
- Add optional dependency `assemblyai`
- Add unit tests (using a Mock client)
- Add docs notebook

This is the equivalent to the JS integration already available in LangChain.js. See the [LangChain JS docs AssemblyAI page](https://js.langchain.com/docs/modules/data_connection/document_loaders/integrations/web_loaders/assemblyai_audio_transcription).

At its simplest, you can use the loader to get a transcript back from an audio file like this:

```python
from langchain.document_loaders.assemblyai import AssemblyAIAudioTranscriptLoader

loader = AssemblyAIAudioTranscriptLoader(file_path="./testfile.mp3")
docs = loader.load()
```

To use it, it needs the `assemblyai` python package installed, and the environment variable `ASSEMBLYAI_API_KEY` set with your API key. Alternatively, the API key can also be passed as an argument.

Twitter handles to shout out if so kindly 🙇 [@AssemblyAI](https://twitter.com/AssemblyAI) and [@patloeber](https://twitter.com/patloeber)

---------

Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
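
The commit message above says the API key can come either from the `ASSEMBLYAI_API_KEY` environment variable or from an argument. The following is a minimal sketch of how a caller might resolve the key before constructing the loader. The helper name `resolve_api_key`, its `env` parameter, and the choice to let an explicit argument take precedence over the environment are all assumptions made for illustration; none of them is taken from the `langchain` or `assemblyai` packages.

```python
import os
from typing import Mapping, Optional

# Environment variable name taken from the commit message.
ENV_VAR = "ASSEMBLYAI_API_KEY"


def resolve_api_key(
    api_key: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Hypothetical helper: return the API key to use.

    Assumed precedence: an explicitly passed key wins, otherwise the
    value of ASSEMBLYAI_API_KEY is used. `env` defaults to os.environ
    and exists only so the lookup can be exercised without touching
    the real environment.
    """
    source = os.environ if env is None else env
    if api_key:
        return api_key
    key = source.get(ENV_VAR)
    if not key:
        raise ValueError(
            f"No API key found: pass one explicitly or set {ENV_VAR}."
        )
    return key
```

Failing early with a clear error, rather than letting the request fail later, is a design choice of this sketch and not documented behaviour of the loader.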
callbacks	docs:misc fixes (#9671)	2023-08-23 22:36:54 -07:00
chat	Adds ChatOllama (#9628)	2023-08-23 13:02:26 -07:00
document_loaders	Add new document_loader: AssemblyAIAudioTranscriptLoader (#9667)	2023-08-23 22:51:19 -07:00
document_transformers	Added new use case docs for Web Scraping, Chromium loader, BS4 transformer (#8732)	2023-08-11 11:46:59 -07:00
llms	Updates to Nomic Atlas and GPT4All documentation (#9414)	2023-08-23 17:49:44 -07:00
memory	Integrate Rockset as a chat history store (#8940)	2023-08-08 18:54:07 -07:00
providers	docs:misc fixes (#9671)	2023-08-23 22:36:54 -07:00
retrievers	Add support for structured data sources with google enterprise search (#9037)	2023-08-22 23:18:10 -04:00
text_embedding	Updates to Nomic Atlas and GPT4All documentation (#9414)	2023-08-23 17:49:44 -07:00
toolkits	Add AINetwork blockchain toolkit integration (#9527)	2023-08-22 08:03:33 -07:00
tools	Add LangChain utility for real-time crypto exchange prices (#4501)	2023-08-11 14:45:06 -07:00
vectorstores	Updates to Nomic Atlas and GPT4All documentation (#9414)	2023-08-23 17:49:44 -07:00