Computer Vision
This articles discussed Training 3D U Net for Brain Tumor Segmentation BraTS2023 Glioma Detection It touches upon the importance of 3D U Net over 2D U Net for MRI Brain
This blog goes through the architecture of DETR
YOLO11 is here Continuing the legacy of the YOLO series YOLO11 sets new standards in speed and efficiency With enhanced architecture and multi task capabilities it outperforms previous models making
DINO is a self supervised learning SSL framework that uses the Vision Transformer ViT as it 8217 s core architecture While SSL initially gained popularity through its use in natural
The article primarily discusses capabilities Sapiens a foundational human vision model by meta achieves state of the art performance in tasks like 2D pose estimation body part segmentation normal and
Performing RAG on Unstructured elements that too in complex pdfs like finance law reports is challenging ColPali a novel document retrieval approach achieves SOTA results with high quality retrieval This