OpenAI Launches Whisper API With New Speech-To-Text Capabilities – SlashGear
For English transcription, Whisper is designed to recognize words across a much wider range of accents, and it's also trained to filter out the background noise that can often throw these systems off. Whisper also aims to be better at transcribing technical jargon that competing systems might not yet recognize. Whisper API users can transcribe audio in English or in other supported languages, and can translate speech from those languages into English.
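For developers wondering what the two modes look like in practice, here is a minimal sketch of how a client might pick between them. The endpoint paths, the `whisper-1` model name, and the optional `language` hint come from OpenAI's public API documentation rather than from this article, and the `build_request` helper and `WhisperRequest` type are hypothetical names used only for illustration. The sketch only describes the request; it does not send audio or contact the API.

```python
from dataclasses import dataclass, field

# Assumptions taken from OpenAI's public docs, not from the article:
API_BASE = "https://api.openai.com/v1/audio"
MODEL = "whisper-1"


@dataclass(frozen=True)
class WhisperRequest:
    """Hypothetical container describing a request (nothing is sent)."""
    url: str
    fields: dict = field(default_factory=dict)


def build_request(task, language=None):
    """Describe a Whisper API call for the given task.

    task="transcribe": text comes back in the language that was spoken.
    task="translate":  text comes back in English.
    """
    if task == "transcribe":
        fields = {"model": MODEL}
        if language:
            # Optional ISO-639-1 hint such as "de" or "ja".
            fields["language"] = language
        return WhisperRequest(f"{API_BASE}/transcriptions", fields)
    if task == "translate":
        return WhisperRequest(f"{API_BASE}/translations", {"model": MODEL})
    raise ValueError(f"unknown task: {task!r}")
```

A real call would also attach the audio as a multipart `file` field and an `Authorization` header carrying an API key.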
The model was trained on 98 different languages, but only a subset of those are available in this API. Supported languages include:
Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.
While today’s news doesn’t come with a ChatGPT-like component for the everyday user to enjoy, it does pave the way for existing apps to tap into this technology more easily and pass its benefits on to their users. Language learning app Speak is among the first to leverage its capabilities. For others, applying for an API license is easy, and the costs don’t sound too prohibitive: OpenAI offers a rate of just $0.006 per minute of on-demand usage.
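To put the $0.006-per-minute rate in perspective, the arithmetic can be sketched in a few lines. This assumes cost scales linearly with audio duration; any rounding OpenAI applies when billing is not modeled here, and the `estimate_cost` helper is a hypothetical name for illustration.

```python
from decimal import Decimal

# Rate quoted in the article: $0.006 per minute of audio.
RATE_PER_MINUTE = Decimal("0.006")


def estimate_cost(seconds):
    """Estimate the USD cost of processing `seconds` of audio.

    Assumes strictly proportional pricing (an assumption, not a billing spec).
    """
    minutes = Decimal(str(seconds)) / Decimal(60)
    # Keep four decimal places so sub-cent amounts stay visible.
    return (minutes * RATE_PER_MINUTE).quantize(Decimal("0.0001"))


print(estimate_cost(3600))  # one hour of audio -> 0.3600
print(estimate_cost(90))    # a 90-second clip -> 0.0090
```

At that rate, an hour of audio works out to about 36 cents.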