OCR
Traditional Optical Character Recognition (OCR) systems are primarily designed to extract plain text from scanned documents or images. While useful, such systems often ignore semantic structure, layout, and visual cues
In this article, we explore OmniParser a UI screen parsing pipeline combining fine-tuned YOLO model for icon detection and Florence2 for icon recognition and icon description generation.
In this article, we carry out handwritten text recognition using OCR. We fine tune the TrOCR model on the GNHK dataset.
In this article, we are fine tuning the TrOCR Small Printed model on the SCUT CTW1500 dataset to improve its performance on curved text.
In this article, we explore TrOCR architecture, models, training strategy and run inference using HuggingFace.
Optical Character Recognition is the process of recognizing text from an image by understanding and analyzing its underlying patterns. We will implement and compare various OCR algorithms provided by PaddleOCR