OCR

DeepSeek OCR Paper Explanation and Test using Transformers and vLLM Pipeline. Understanding Context Optical Compression and model architecture in depth.

Traditional Optical Character Recognition (OCR) systems are primarily designed to extract plain text from scanned documents or images. While useful, such systems often ignore semantic structure, layout, and visual cues

In this article, we explore OmniParser a UI screen parsing pipeline combining fine-tuned YOLO model for icon detection and Florence2 for icon recognition and icon description generation.
In this article, we carry out handwritten text recognition using OCR. We fine tune the TrOCR model on the GNHK dataset.
In this article, we are fine tuning the TrOCR Small Printed model on the SCUT CTW1500 dataset to improve its performance on curved text.
In this article, we explore TrOCR architecture, models, training strategy and run inference using HuggingFace.
Optical Character Recognition is the process of recognizing text from an image by understanding and analyzing its underlying patterns. We will implement and compare various OCR algorithms provided by PaddleOCR

Deep learning has been one of the fastest-growing technologies in the modern world. Deep learning has become part of our everyday life, from voice-assistant to self-driving cars, it is everywhere.

In this article, we will learn deep learning based OCR and how to recognize text in images using an open-source tool called Tesseract and OpenCV. The method of extracting text

Subscribe to receive the download link, receive updates, and be notified of bug fixes

Which email should I send you the download link?

 

Get Started with OpenCV

Subscribe To Receive

We hate SPAM and promise to keep your email address safe.​