Computer Vision
A comprehensive step by step guide on fine tuning RetinaNet using PyTorch to achieve 79 accuracy on wildlife detection tasks In this tutorial we dive deep into RetinaNet s architecture
Real time object detection has become essential for many practical applications and the YOLO You Only Look Once series by Ultralytics has always been a state of the art model
Leaf diseases reduce crop yields and impact food security Finetuning SAM2 helps detect and segment diseased areas using deep learning With a small dataset we achieved 74 IoU making early
3D Gaussian splatting 3DGS has recently gained recognition as a groundbreaking approach in radiance fields and computer graphics It stands out as a jack of all trades addressing challenges that
Apple s DepthPro is quite impressive producing pixel perfect high resolution metric depth maps with sharp boundaries through monocular depth estimation It outperforms all of its contenders like Metric3D v2
Image Captioning using ResNet and LSTM bridges vision and language enabling machines to see images and describe them in text This model powers applications like accessibility for visually impaired users