Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos

Lakshmi Haritha Medida; Kasarapu Ramani

doi:10.14569/IJACSA.2021.0120442

DOI: 10.14569/IJACSA.2021.0120442

PDF

Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos

Author 1: Lakshmi Haritha Medida

Author 2: Kasarapu Ramani

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 12 Issue 4, 2021.

Abstract and Keywords
How to Cite this Article
{} BibTeX Source

Abstract: Now-a-days, the video recording technologies have turned out to be more and more forceful and easier to utilize. Therefore, numerous universities are recording and publishing their lectures online in order to make them reachable for learners or students. These lecture videos encapsulate the handwritten text written either on a paper or blackboard or on a tablet using a stylus. On the other hand, this mechanism of recording the lecture videos consumes huge quantity of multimedia data in a faster manner. Thus, handwritten text recognition on the lecture video portals has turned out to be an incredibly significant and demanding task. Thus, this paper intends to develop a novel handwritten text detection and recognition approach on the video lecture dataset by following four major phases, viz. (a) Text Localization, (b) Segmentation (c) Pre-processing and (d) Recognition. The text localization in the lecture video frames is the initial phase and here the arbitrarily oriented text on video frames is localized using the Modified Region Growing (MRG) algorithm. Then, the localized words are subjected to segmentation via the K-means clustering, in which the words from the detected text regions are segmented out. Subsequently, the segmented words are pre-processed to avoid the blurriness artifacts as well. Finally, the pre-processed words are recognized using the Deep Convolutional Neural Network (DCNN). The performance of the proposed model is analyzed in terms of the performance measures like accuracy, precision, sensitivity and specificity to exhibit the supremacy of the text detection and recognition in lecture video. Experimental results reveal that at Learning Percentage of 70, the presented work has the highest accuracy of 89.3% for 500 count of frames.

Keywords: Lecture video; text localization; segmentation; word recognition; deep convolutional neural network (DCNN)

Lakshmi Haritha Medida and Kasarapu Ramani, “Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos” International Journal of Advanced Computer Science and Applications(IJACSA), 12(4), 2021. http://dx.doi.org/10.14569/IJACSA.2021.0120442

@article{Medida2021,
title = {Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2021.0120442},
url = {http://dx.doi.org/10.14569/IJACSA.2021.0120442},
year = {2021},
publisher = {The Science and Information Organization},
volume = {12},
number = {4},
author = {Lakshmi Haritha Medida and Kasarapu Ramani}
}

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos

Upcoming Conferences