VGGT (Visual Geometry Grounded Transformer) leverages deep learning based representations to infer 3D structures from an image rather than traditional 2D based SfM pipelines. It provides a simplified, ...
MASt3R and MASt3R-SfM Explanation: Image Matching and 3D Reconstruction Results
MASt3R (Matching and Stereo 3D Reconstruction) aims to treat image matching as a 3D problem leveraging dense correspondences and understanding the 3D scene rather than a traditional 2D approach. This ...
DUSt3R: Geometric 3D Vision Made Easy : Explanation and Results
DUSt3R (Dense and Unconstrained Stereo 3D Reconstruction) introduces a novel paradigm in multi-view 3D reconstruction, eliminating the need for predefined camera poses and intrinsics. 3D ...
Understanding Visual SLAM for Robotics Perception: Building Monocular SLAM from Scratch in Python OpenCV
Introduction to Monocular SLAM: Have you ever wondered how Tesla's Autonomous Vehicle views its surroundings and understands its position, and makes smart decisions to reach its target ...
Camera Calibration using OpenCV
A camera, when used as a visual sensor, is an integral part of several domains like robotics, surveillance, space exploration, social media, industrial automation, and even the entertainment ...
Geometry of Image Formation
In this post, we will explain the image formation from a geometrical point of view. Specifically, we will cover the math behind how a point in 3D gets projected on the image plane. This post ...