OCR

What Makes DeepSeek OCR So Powerful?

DeepSeek OCR Paper Explanation and Test using Transformers and vLLM Pipeline. Understanding Context Optical Compression and model architecture in depth.

Generative AI, OCR, Text Recognition

Nanonets-OCR-s: Enabling Rich, Structured Markdown for Document Understanding

Traditional Optical Character Recognition (OCR) systems are primarily designed to extract plain text from scanned documents or images. While useful, such systems often ignore semantic structure, layout, and visual cues

OCR, VLMs

OmniParser: Vision Based GUI Agent

In this article, we explore OmniParser a UI screen parsing pipeline combining fine-tuned YOLO model for icon detection and Florence2 for icon recognition and icon description generation.

Agentic AI, Generative AI, OCR, Vision Language Models

Handwritten Text Recognition using OCR

In this article, we carry out handwritten text recognition using OCR. We fine tune the TrOCR model on the GNHK dataset.

Deep Learning, Hugging Face Transformers, OCR

Fine Tuning TrOCR – Training TrOCR to Recognize Curved Text

In this article, we are fine tuning the TrOCR Small Printed model on the SCUT CTW1500 dataset to improve its performance on curved text.

Hugging Face Transformers, OCR, Transformer Neural Networks, Vision Transformer

TrOCR – Getting Started with Transformer Based OCR

In this article, we explore TrOCR architecture, models, training strategy and run inference using HuggingFace.

Hugging Face Transformers, OCR, Transformer Neural Networks

PaddleOCR: Unveiling the Power of Optical Character Recognition

Optical Character Recognition is the process of recognizing text from an image by understanding and analyzing its underlying patterns. We will implement and compare various OCR algorithms provided by PaddleOCR

OCR, Text Detection, Text Recognition

Automatic License Plate Recognition using Deep Learning

Deep learning has been one of the fastest-growing technologies in the modern world. Deep learning has become part of our everyday life, from voice-assistant to self-driving cars, it is everywhere.

Object Detection, OCR, Text Recognition

Deep Learning Based OCR Text Recognition Using Tesseract and OpenCV

In this article, we will learn deep learning based OCR and how to recognize text in images using an open-source tool called Tesseract and OpenCV. The method of extracting text

Deep Learning, OCR, OpenCV 3, Text Recognition, Tutorial

OCR