3D Gaussian Splatting (3DGS) is redefining the landscape of 3D computer graphics and vision — but here’s a twist: it achieves groundbreaking results without relying on any neural networks, not even a ...
The Annotated NeRF – Training on Custom Dataset from Scratch in Pytorch
In recent years, the field of 3D from multi-view has become one of the most popular topics in computer vision conferences, with a high number of submitted papers each year. A groundbreaking paper in ...
Training 3D U-Net for Brain Tumor Segmentation (BraTS2023-GLI) Challenge
3D U-Net, an efficient paradigm in medical segmentation, excels at analyzing 3D volumetric data, allowing it to capture a holistic view of brain scans. In many parts of the world, ...
Introduction to LiDAR SLAM: LOAM and LeGO-LOAM Paper and Code Explanation with ROS 2 Implementation
LiDAR SLAM is a crucial component in robotics perception, widely used in both industry and academia for its efficiency and robustness in localization and mapping. In robotics perception research, ...
CVPR 2024: An Overview and Key Papers
AI research made great strides in 2023-2024, including VLLMs like GPT4-O and Gemini; Text-to-Video Diffusion Models like SORA and Veo; and Humanoids like Atlas V2, Figure -01, and Tesla Optimus. ...
Understanding Visual SLAM for Robotics Perception: Building Monocular SLAM from Scratch in Python OpenCV
Introduction to Monocular SLAM: Have you ever wondered how Tesla's Autonomous Vehicle views its surroundings and understands its position, and makes smart decisions to reach its target ...