Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

Vishnu Kant Soni; Vivek Shukla; S. R. Tandan; Amit Pimpalkar; Neetesh Kumar Nema; Muskan Naik

doi:10.14569/IJACSA.2025.0160144

DOI: 10.14569/IJACSA.2025.0160144

PDF

Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

Author 1: Vishnu Kant Soni

Author 2: Vivek Shukla

Author 3: S. R. Tandan

Author 4: Amit Pimpalkar

Author 5: Neetesh Kumar Nema

Author 6: Muskan Naik

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 16 Issue 1, 2025.

Abstract and Keywords
How to Cite this Article
{} BibTeX Source

Abstract: Scene texts refer to arbitrary text found in images captured by cameras in real-world settings. The tasks of text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advancements in deep learning methods, achieving accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to these challenges. First, the diversity of text shapes, fonts, colors, and styles complicates detection efforts. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, effective coordination of multiple sub-tasks in end-to-end learning is essential for success. This research aimed to tackle these challenges by enhancing text discriminative representation. This study focused on two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. This research focuses on implementing and evaluating the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images. The study aims to compare the performance of three prominent Optical Character Recognition (OCR) techniques—TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visually represented with bounding boxes highlighting the detected text regions. The inference times for each image were recorded, highlighting the algorithm's efficiency, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real-time, making it suitable for applications requiring immediate text recognition.

Keywords: Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection

Vishnu Kant Soni, Vivek Shukla, S. R. Tandan, Amit Pimpalkar, Neetesh Kumar Nema and Muskan Naik, “Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion” International Journal of Advanced Computer Science and Applications(IJACSA), 16(1), 2025. http://dx.doi.org/10.14569/IJACSA.2025.0160144

@article{Soni2025,
title = {Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2025.0160144},
url = {http://dx.doi.org/10.14569/IJACSA.2025.0160144},
year = {2025},
publisher = {The Science and Information Organization},
volume = {16},
number = {1},
author = {Vishnu Kant Soni and Vivek Shukla and S. R. Tandan and Amit Pimpalkar and Neetesh Kumar Nema and Muskan Naik}
}

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

Upcoming Conferences