Speech Recognition

Introduction to Speech to Speech: Most Efficient Form of NLP

We often take out our phones and say, “Hey Siri, play Perfect by Ed Sheeran” or “Ok Google, set an alarm at 7.30 in the morning.” And the work is

Computer Vision, Deep Learning, LLMs, Speech AI, Speech Recognition, Voice AI

Fine Tuning Whisper on Custom Dataset

In this article, we fine tune the Whisper ASR model on a custom dataset to recognize Air Traffic Control audio.

Hugging Face Transformers, Speech Recognition, Training Neural Networks

WhisperX Automatic Speech Recognition (ASR) with Nemo Speaker Diarization : Speech-to-Text

This article presents ASR with Diarization using OpenAI Whisper and Nvidia Nemo Toolkit.

Artificial Intelligence, Deep Learning, Speech Recognition, Transformer Neural Networks