Search Results for: c

YOLO11: Redefining Real-Time Object Detection

October 8, 2024 2 Comments

YOLO11 is finally here, revealed at the exciting Ultralytics YOLO Vision 2024 (YV24) event. 2024 is a year of YOLO models. After the release of YOLOv8 in 2023, we got YOLOv9 and YOLOv10 this year, and ...

soumyadip

October 1, 2024 8 Comments

Computer Vision Deep Learning Image Segmentation

DINO is a self-supervised learning (SSL) framework that uses the Vision Transformer (ViT) as its core architecture. While SSL initially gained popularity through its use in natural language ...

Jaykumaran

September 24, 2024 Leave a Comment

3D Computer Vision Computer Vision Deep Learning Generative AI SpatialAI-Depth

Sapiens, a family of foundational Human Vision Models by Khirodkar et al. from Meta, achieves state-of-the-art results for human-centric tasks like 2D pose estimation, body-part segmentation, depth ...

Jaykumaran

September 17, 2024 Leave a Comment

Computer Vision LLMs RAGs Vision Language Models

ColPali multimodal RAG offers a novel approach for efficient retrieval of elements such as images, tables, charts, and text by treating each page as an image. This method takes advantage of Vision ...

soumyadip

September 10, 2024 2 Comments

Autonomous Vehicle Robotics

Robotics, once a specialized and niche field, has surged into the mainstream with the rapid development of autonomous vehicles, quadruped robots, and humanoids. What’s fueling this revolution? The ...

Sovit Rath

September 3, 2024 4 Comments

Deep Learning Hugging Face Transformers OCR

Handwritten text documents are ubiquitous in research and study. They are personalized to the writer's needs and are often written in a style that others find difficult to read. This ...

YOLO11: Redefining Real-Time Object Detection

Exploring DINO: Self-Supervised Transformers for Road Segmentation with ResNet50 and U-Net

Sapiens: Foundation for Human Vision Models by Meta

ColPali: Enhancing Financial Report Analysis with Multimodal RAG and Gemini

Building Autonomous Vehicle in Carla: Path Following with PID Control & ROS 2

Handwritten Text Recognition using OCR
