The NVIDIA AI Summit 2024, held from October 23 to 25 at the Jio World Convention Centre in Mumbai, marked a significant milestone in India's journey toward becoming a global leader in artificial ...
Handwritten Text Recognition using OCR
Handwritten text documents are ubiquitous in the field of research and study. They are personalized to the user’s needs and often contain a style of writing difficult to comprehend by others. This ...
Fine Tuning Whisper on Custom Dataset
Whisper is a leading open-source model used for converting speech to text. Developed by OpenAI, Whisper has been trained on a diverse array of languages and speech conditions using extensive data. ...
SAM 2 – Promptable Segmentation for Images and Videos
Image segmentation is one of the most fundamental tasks in Computer Vision. With their Segment Anything Model (SAM), last year, Meta AI put forth the world's first foundation model for image ...
Dreambooth using Diffusers
Ever wanted to generate different images of your pet but in different settings and locations? Dreambooth training using the Diffuser library can help you achieve this. In the evolving landscape of ...
Introduction to Hugging Face Diffusers
Since the advent of diffusion models, Computer Vision has seen tremendous growth in image generation capabilities. This spans image generation models, techniques, datasets, pipelines, and libraries. ...