To transcribe audio recordings using ChatGPT, you can leverage the Speech-to-Text function powered by OpenAI’s Whisper API. This feature allows you to convert audio files into written text in over 50 languages, making it a versatile tool for various applications.

Here’s a step-by-step guide on how to transcribe audio recordings using ChatGPT:
- Upload Audio File: You can upload the audio file directly to the ChatGPT playground, which will transcribe the audio flawlessly.
- Utilize Whisper API: The Whisper API, part of ChatGPT, offers speech-to-text capabilities. After uploading an audio file, ChatGPT will process the speech and generate a corresponding text output.
- Supported File Types: The Whisper API supports various file types, including mp3, mp4, mpeg, mpga, m4a, wav, and webm. However, the default audio size limit is 25 MB
- Language Support: ChatGPT’s Whisper API is trained in 98 languages, allowing it to transcribe audio files in multiple languages to industry-standard benchmarks.
- Transcription Automation: You can set up an automation to transcribe audio recordings using ChatGPT and Notion. This involves using Whisper to transcribe the audio, summarize it using the ChatGPT API, and then send the transcripts and summaries to Notion. I’ve been using the Thomas Frank Pipedream automation routine for months, and the results are nothing short of amazing using MP3 or M4A audio file formats.
Following these steps will enable you to effectively transcribe audio recordings using ChatGPT’s Speech to Text function, providing a seamless way to convert spoken content into written text for various purposes.