Future of Information and Communication Conference (FICC) 2024
4-5 April 2024
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 11 Issue 4, 2020.
Abstract: Developing systems for interpreting visuals, such as images, videos is really challenging but important task to be developed and applied on benchmark datasets. This study solves the very challenge by using STN-OCR model consisting of deep neural networks (DNN) and Spatial Transformer Networks (STNs). The network architecture of this study consists of two stages: localization network and recognition network. In the localization network it finds and localizes text regions and generates sampling grid. Whereas, in the recognition network, text regions will be input and then this network learns to recognize text including low resolution, curved and multi-oriented text. Deep learning-based approaches require a lot of data for training effectively, therefore, this study has used two benchmark datasets, Street View House Numbers (SVHN) and International Conference on Document Analysis and Recognition (ICDAR) 2015 to evaluate the system. The STN-OCR model achieves better results than literature on these datasets.
Saif Hassan Katper, Abdul Rehman Gilal, Abdullah Alshanqiti, Ahmad Waqas, Aeshah Alsughayyir and Jafreezal Jaafar, “Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition” International Journal of Advanced Computer Science and Applications(IJACSA), 11(4), 2020. http://dx.doi.org/10.14569/IJACSA.2020.0110424
@article{Katper2020,
title = {Deep Neural Networks Combined with STN for Multi-Oriented Text Detection and Recognition},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2020.0110424},
url = {http://dx.doi.org/10.14569/IJACSA.2020.0110424},
year = {2020},
publisher = {The Science and Information Organization},
volume = {11},
number = {4},
author = {Saif Hassan Katper and Abdul Rehman Gilal and Abdullah Alshanqiti and Ahmad Waqas and Aeshah Alsughayyir and Jafreezal Jaafar}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.