03/11/2024 | Press release | Distributed by Public on 03/11/2024 10:32
The Oracle Cloud Infrastructure (OCI) Speech service now supports the Whisper model from OpenAI. Trained on a large corpus of multilingual data, Whisper is a speech-to-text model that supports file-based transcription for over 50 languages. It uses the same service endpoints, API, and software development kit (SDK) interfaces as the OCI Speech model, for maximum flexibility and compatibility. The Whisper model also supports speaker diarization, a feature that distinguishes and labels the different voices in an audio stream so that each part of the transcription is attributed to a speaker.
The Whisper model comes in five sizes: tiny, base, small, medium, and large-V2. For the best cost-performance trade-off, the medium Whisper model is available in all OC1 regions from both the Oracle Cloud Console and the SDK.
The large-V2 model is supported when submitting a service request in the Ashburn and Phoenix regions. We plan to make more regions and models available in the future, based on customer feedback.
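To make the model and region rules above concrete, here is a minimal Python sketch that picks a Whisper size for a region and assembles a request body for a file-based transcription job with diarization turned on. It is a sketch under stated assumptions, not the service's definitive contract: the model-type strings (`WHISPER_MEDIUM`, `WHISPER_LARGE_V2`), the region identifiers, and the camelCase field names are assumptions modeled on the style of the OCI Speech API and should be checked against the current API reference. The helper names `pick_whisper_model` and `build_job_request` are hypothetical.

```python
from typing import Any, Dict, List

# Assumption: region identifiers for Ashburn and Phoenix, where the
# announcement says the large-V2 model is supported.
LARGE_V2_REGIONS = {"us-ashburn-1", "us-phoenix-1"}


def pick_whisper_model(region: str, prefer_large: bool = False) -> str:
    """Return an assumed model-type string for the given region.

    Medium is described as available in all OC1 regions; large-V2 only
    in Ashburn and Phoenix, so we fall back to medium everywhere else.
    """
    if prefer_large and region in LARGE_V2_REGIONS:
        return "WHISPER_LARGE_V2"  # assumed identifier
    return "WHISPER_MEDIUM"  # assumed identifier


def build_job_request(
    compartment_id: str,
    namespace: str,
    bucket: str,
    object_names: List[str],
    region: str,
    language_code: str = "en",
    diarization: bool = True,
    prefer_large: bool = False,
) -> Dict[str, Any]:
    """Assemble a request body for a file-based transcription job.

    All field names below are assumptions for illustration only.
    """
    return {
        "compartmentId": compartment_id,
        "modelDetails": {
            "modelType": pick_whisper_model(region, prefer_large),
            "languageCode": language_code,
            "transcriptionSettings": {
                "diarization": {"isDiarizationEnabled": diarization}
            },
        },
        "inputLocation": {
            "locationType": "OBJECT_LIST_INLINE_INPUT_LOCATION",
            "objectLocations": [
                {
                    "namespaceName": namespace,
                    "bucketName": bucket,
                    "objectNames": object_names,
                }
            ],
        },
        "outputLocation": {
            "namespaceName": namespace,
            "bucketName": bucket,
            "prefix": "transcripts/",
        },
    }


if __name__ == "__main__":
    # Placeholder values; replace with your own tenancy details.
    body = build_job_request(
        "ocid1.compartment.oc1..example", "my-namespace", "audio-bucket",
        ["meeting.mp3"], region="eu-frankfurt-1", prefer_large=True,
    )
    print(body["modelDetails"]["modelType"])
```

Because the Whisper model shares endpoints and SDK interfaces with the OCI Speech model, switching an existing job to Whisper should mostly come down to changing the model type and language code in a body like this one.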
The Whisper model in OCI Speech offers the following features and benefits:
| Feature | OCI Speech model | The Whisper model in OCI Speech |
| --- | --- | --- |
| Real-time transcription | Supported | Not supported |
| Large file size | Up to 2GB | Up to 2GB |
| Word-level timestamp | Supported | Supported |
| File format | AAC, AC3, AMR, AU, FLAC, M4A, MKV, MP3, MP4, OGA, OGG, WAV, WEBM | AAC, AC3, AMR, AU, FLAC, M4A, MKV, MP3, MP4, OGA, OGG, WAV, WEBM |
| Multilingual support | EN, ES, FR, DE, PT, HI, IT | Same as Oracle ASR model plus 50 other languages |
| Diarization | Supported | Supported |
| English translation | Not supported | Coming soon |
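The comparison above can be turned into a small pre-flight check before a file is submitted. The Python sketch below validates the file extension and size against the table and lists which model family could handle a request. It rests on a few assumptions: "2GB" is read as 2 GiB, language codes are lowercase two-letter codes, and any language outside the seven Oracle-supported ones is assumed to be among Whisper's additional languages, since the table does not enumerate them. The function names are hypothetical.

```python
from pathlib import Path
from typing import List

# File formats listed in the table (identical for both models).
SUPPORTED_FORMATS = {
    "aac", "ac3", "amr", "au", "flac", "m4a", "mkv",
    "mp3", "mp4", "oga", "ogg", "wav", "webm",
}

# Assumption: "Up to 2GB" is interpreted as 2 GiB.
MAX_FILE_BYTES = 2 * 1024 ** 3

# Languages the table lists for the OCI Speech model.
ORACLE_LANGUAGES = {"en", "es", "fr", "de", "pt", "hi", "it"}


def check_file(name: str, size_bytes: int) -> List[str]:
    """Return a list of problems; an empty list means the file looks OK."""
    problems = []
    ext = Path(name).suffix.lower().lstrip(".")
    if ext not in SUPPORTED_FORMATS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("file exceeds the 2 GB limit")
    return problems


def eligible_models(language: str, needs_real_time: bool) -> List[str]:
    """List the model families that fit the request, per the table.

    Assumption: a language outside ORACLE_LANGUAGES is one of the
    additional languages Whisper supports.
    """
    lang = language.lower()
    if needs_real_time:
        # Real-time transcription is only supported by the OCI Speech model.
        return ["oci-speech"] if lang in ORACLE_LANGUAGES else []
    if lang in ORACLE_LANGUAGES:
        return ["oci-speech", "whisper"]
    return ["whisper"]


if __name__ == "__main__":
    print(check_file("interview.MP3", 150_000_000))
    print(eligible_models("ja", needs_real_time=False))
```

Returning a list rather than a single choice keeps the decision with the caller, who may weigh cost or accuracy when both model families qualify.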
The OCI Speech service team is committed to empowering you with tools that redefine possibilities, and we look forward to you benefiting from the Whisper model's newly introduced multilingual support and diarization capabilities. Contact your Oracle representative to discuss how OCI Speech with diarization can help you unlock the value of your multimedia data and gain the insight you need to take your business to the next level.
If you're new to Oracle Cloud Infrastructure, try Oracle Cloud Free Trial, a free 30-day trial with US$300 in credits.