VLMs
Get a comprehensive overview of VLM Evaluation Metrics Benchmarks and various datasets for tasks like VQA OCR and Image Captioning
Learn how to setup a pipeline to run VLM on Jetson Nano using Huggingface Transformers Run models like LiquidAI Moondream2 FastVLM and SmolVLM
Testing Vision Language Models VLM on edge devices Check how small VLMs perform on our custom Raspberry Pi Cluster and Jetson Nanos
Video Anomaly Detection VAD is one of the most challenging problems in computer vision It involves identifying rare abnormal events in videos 8211 such as burglary fighting or accidents 8211
Learn how Video RAG boosts training free and low compute long video understanding by pairing OCR ASR and open vocabulary detection with any long video LVLMs
What if object detection wasn t just about drawing boxes but about having a conversation with an image Dive deep into the world of Vision Language Models VLMs and see