Deep Learning

The Annotated NeRF – Training on Custom Dataset from Scratch in PyTorch

In recent years, 3D reconstruction from multi-view images has become one of the most popular topics at computer vision conferences, with a large number of papers submitted each year.

3D Computer Graphics, 3D Computer Vision, 3D Reconstruction, Deep Learning, PyTorch, Robotics, SLAM

LightRAG: Simple and Fast Alternative to GraphRAG for Legal Doc Analysis

This article discusses the architecture of LightRAG from HKU, exploring its internal workings in depth and comparing it with GraphRAG and NaiveRAG for local document analysis.

Deep Learning, Generative AI, LLMs, RAGs

NVIDIA AI Summit 2024 – India Overview

The NVIDIA AI Summit 2024, held from October 23 to 25 at the Jio World Convention Centre in Mumbai, marked a significant milestone in India’s journey toward becoming a global

Artificial Intelligence, Deep Learning, NVIDIA

Introduction to Speech to Speech: Most Efficient Form of NLP

We often take out our phones and say, “Hey Siri, play Perfect by Ed Sheeran” or “Ok Google, set an alarm at 7.30 in the morning.” And the work is

Computer Vision, Deep Learning, LLMs, Speech AI, Speech Recognition, Voice AI

Training 3D U-Net for Brain Tumor Segmentation Challenge – Medical Imaging

This article discusses training a 3D U-Net for glioma detection on the Brain Tumor Segmentation challenge (BraTS2023). It touches upon the importance of 3D U-Net over 2D U-Net for MRI brain scans.

3D Computer Vision, Computer Vision, Deep Learning, Medical Imaging

Exploring DINO: Self-Supervised Transformers for Road Segmentation with ResNet50 and U-Net

DINO is a self-supervised learning (SSL) framework that uses the Vision Transformer (ViT) as its core architecture. While SSL initially gained popularity through its use in natural language processing (NLP)

Computer Vision, Deep Learning, Image Segmentation

Sapiens: Foundation for Human Vision Models by Meta

The article primarily discusses the capabilities of Sapiens, a foundational human vision model by Meta that achieves state-of-the-art performance in tasks like 2D pose estimation, body-part segmentation, and normal and depth estimation.

3D Computer Vision, Computer Vision, Deep Learning, Generative AI, SpatialAI-Depth

Handwritten Text Recognition using OCR

In this article, we carry out handwritten text recognition using OCR. We fine-tune the TrOCR model on the GNHK dataset.

Deep Learning, Hugging Face Transformers, OCR

Training CLIP Model from Scratch for a Fashion Image Retrieval App

This article discusses how to train a CLIP-like model from scratch in PyTorch. It presents a Gradio app for fashion e-commerce image retrieval using text search.

Computer Vision, Deep Learning, Similarity Measure

Recommendation System using Vector Search with Qdrant

In this article, we explore how to build a movie recommendation system using vector search with Qdrant. You'll learn about vector databases, sparse and dense vectors, and how the Retrieval-Augmented

Deep Learning, NLP

Introduction to Feature Matching Using Neural Networks

Feature matching using deep learning is a game-changer for computer vision tasks like panorama stitching, video stabilization, and face recognition, providing greater accuracy and reliability. Dive into how this technology

Computer Vision, Deep Learning, Feature Detection, Neural Network

CVPR 2024 Key Research & Dataset Papers – Part 2

This article gives an overview of the key research papers and datasets from CVPR 2024, along with repository links.

AI Research Papers, Computer Vision, Deep Learning