Sovit Rath

Model Context Protocol (MCP) is a new standard by Anthropic to connect LLMs with different applications via a server-client protocol.
In this article, we explore OmniParser, a UI screen parsing pipeline that combines a fine-tuned YOLO model for icon detection with Florence2 for icon recognition and icon description generation.

The NVIDIA AI Summit 2024, held from October 23 to 25 at the Jio World Convention Centre in Mumbai, marked a significant milestone in India’s journey toward becoming a global

In this article, we carry out handwritten text recognition using OCR. We fine-tune the TrOCR model on the GNHK dataset.
In this article, we fine-tune the Whisper ASR model on a custom dataset to recognize Air Traffic Control audio.
In this article, we explore SAM 2 (Segment Anything Model 2) for Promptable Visual Segmentation of objects in images and videos.
In this article, we use the DreamBooth technique to train Stable Diffusion 1.5 and teach it to generate images of a very specific species of cat.
In this article, we cover the Hugging Face Diffusers library for text-to-image, image-to-image, and image inpainting.
In this article, we do text summarization using T5 and fine-tune the model to build a Text Summarization Gradio app.
In this article, we fine-tune the T5 model for Stack Overflow tag generation using the Hugging Face Transformers library.
In this post, we fine-tune BERT on the arXiv abstract classification dataset using the Hugging Face Transformers library.
