deep learning

ColPali: Enhancing Financial Report Analysis with Multimodal RAG and Gemini

Performing RAG on Unstructured elements that too in complex pdfs like finance, law reports is challenging. ColPali a novel document retrieval approach achieves SOTA results with high quality retrieval. This

Computer Vision, LLMs, RAGs, Vision Language Models

Introduction to Feature Matching Using Neural Networks

Feature matching using deep learning is a game-changer for computer vision tasks like panorama stitching, video stabilization, and face recognition, providing greater accuracy and reliability. Dive into how this technology

Computer Vision, Deep Learning, Feature Detection, Neural Network

CVPR 2024 Key Research & Dataset Papers – Part 2

This article gives an overview about the key research papers and dataset from CVPR 2024 along with repository links.

AI Research Papers, Computer Vision, Deep Learning

YOLOv10: The Dual-Head OG of YOLO Series

YOLOv10 introduces a dual-head architecture for NMS-free training and efficiency-accuracy driven model design. It combines one-to-one and one-to-many label assignments to improve performance without extra computation. YOLOv10 uses lightweight classification

Computer Vision, Deep Learning, Object Detection, YOLO

Fine-tuning Faster R-CNN on Sea Rescue Dataset – Small Object Detection: PyTorch

This research article discusses about how data preparation matters for Fine-tuning Faster R-CNN on aerial small object detection.

Computer Vision, Deep Learning, Object Detection

Mastering Recommendation System: A Complete Guide

Recommendation systems (recommender systems) suggest content based on user preferences and behaviors. This guide explores their types, traditional ML techniques like matrix factorization, and advanced deep learning methods like neural

Beginners, Deep Learning, Tutorial

Building MobileViT Image Classification Model from Scratch In Keras 3

In the rapidly evolving field of deep learning, the challenge often lies not just in designing powerful models but also in making them accessible and efficient for practical use, especially

AI Research Papers, CNN, Computer Vision, Convolution, Deep Learning, Keras, Transformer Neural Networks, Vision Transformer

YOLOv9: Advancing the YOLO Legacy

This article introduces the YOLOv9 model, which addresses the core challenges in object detection through deep learning.

Computer Vision, Object Detection, YOLO

YOLO Loss Function Part 2: GFL and VFL Loss

In the preceding article, YOLO Loss Functions Part 1, we focused exclusively on SIoU and Focal Loss as the primary loss functions used in the YOLO series of models. In

Computer Vision, Deep Learning, Focal Loss, GFL, Loss Function, Object Detection, SIoU Loss Functions, VFL, YOLO

Introducing YOLO-NAS Pose: A Leap in Pose Estimation Technology

Unveiling a significant breakthrough in computer vision, Deci introduces YOLO-NAS Pose, the latest evolution in Pose Estimation technology. Building on the foundations of the acclaimed YOLO-NAS, this advanced model stands

Pose Estimation, YOLO

Facial Emotion Recognition: Decoding Expressions

In this article, we explore the real-time facial emotion recognition using the RFB-320 SSD face detection model and the VGG-13 emotion recognition model.

Face Application, Face Detection, Facial Expression Recognition

Real Time Deep SORT with Torchvision Detectors

In this article, we explore several Re-ID models for tracking along with object detection models from Torchvision to create a small modular codebase.

Deep Learning, DeepSORT, Object Detection, Object Tracking