Speech Recognition
We often take out our phones and say, “Hey Siri, play Perfect by Ed Sheeran” or “Ok Google, set an alarm at 7.30 in the morning.” And the work is
In this article, we fine-tune the Whisper ASR model on a custom dataset to recognize air traffic control audio.
This article presents ASR with speaker diarization using OpenAI Whisper and the NVIDIA NeMo toolkit.