Robotics

SimLingo: Vision-Language-Action Model for Autonomous Driving

SimLingo is a remarkable model that combines autonomous driving language understanding and instruction aware control all in one unified camera only framework It not only delivered top rankings on CARLA

Advanced Driver Assistance Systems, Autonomous Vehicle, Computer Vision, Robotics, VLMs

V-JEPA 2: Meta’s Breakthrough in AI for the Physical World

The ultimate goal for many in artificial intelligence is to build agents that can perceive reason and act in our complex physical world Meta AI has made a significant stride

Computer Vision, Generative AI, Generative Models, Hugging Face Transformers, Multimodal Models, Robotics, Vision Language Models

GR00T N1.5 Explained: NVIDIA’s VLA Model for Humanoids

Dive into NVIDIA s GR00T N1 5 a groundbreaking open foundation model poised to revolutionize humanoid robotics Discover how this advanced Vision Language Action VLA model with its smarter architecture

Robotics, Vision Language Models, Vision Transformer

SmolVLA: Affordable & Efficient VLA Robotics on Consumer GPUs

Imagine you 8217 re a robotics enthusiast a student or even a seasoned developer and you 8217 ve been captivated by the idea of robots that can see understand our

Robotics, Vision Language Models, Vision Transformer

MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors

MASt3R SLAM is a truly plug and play monocular dense SLAM pipeline that operates in the wild It is first of its kind real time SLAM system that leverages MASt3R

3D Computer Vision, 3D Reconstruction, Robotics, SLAM

Vision Language Action Models (VLA) Overview: LeRobot Policies Demo

The advent of Generative AI has fundamentally transformed robotic intelligence enabling significant strides in how advanced humanoid robots 8220 perceive reason and act 8221 in the physical world This huge

Generative AI, Robotics, Vision Language Models