International Journal of Advanced Computer Science and Applications (IJACSA), Volume 7 Issue 5, 2016.
Abstract: Optical Character Recognition (OCR) has been an attractive research area for the last three decades, and mature OCR systems reporting recognition rates close to 100% are available for many scripts and languages today. Despite these developments, research on recognition of text in many languages, Urdu among them, is still in its early days. The sparse existing literature on Urdu OCR either deals only with isolated characters or considers limited vocabularies in fixed font sizes. This research presents a segmentation-free and size-invariant technique for recognition of Urdu words in Nastaliq font using ligatures as units of recognition. Ligatures, separated into primary ligatures and diacritics, are recognized using right-to-left HMMs. Diacritics are then associated with the main body using position information, and the resulting ligatures are validated using a dictionary. Evaluated on Urdu words, the system achieved promising recognition rates at the ligature and word levels.
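The diacritic association and dictionary validation steps mentioned in the abstract can be illustrated with a minimal sketch. The abstract says only that "position information" is used, so the horizontal-overlap rule below, the `Box` class, the ligature labels, and the dictionary layout are all assumptions made for illustration and not the authors' method. HMM recognition is omitted; the sketch assumes each component already carries a recognized label.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Hypothetical recognized component: a label plus its horizontal extent."""
    label: str
    x_min: int
    x_max: int

    @property
    def x_center(self) -> float:
        return (self.x_min + self.x_max) / 2.0


def associate_diacritics(primaries, diacritics):
    """Attach each diacritic to one primary ligature.

    Assumed rule: pick the primary with the largest horizontal overlap,
    breaking ties (including zero overlap) by the nearest horizontal center.
    Returns (primary_label, diacritic_labels) pairs in right-to-left order.
    """
    if not primaries:
        return []
    groups = {i: [] for i in range(len(primaries))}
    for d in diacritics:
        def score(i, d=d):
            p = primaries[i]
            overlap = min(p.x_max, d.x_max) - max(p.x_min, d.x_min)
            return (max(overlap, 0), -abs(p.x_center - d.x_center))
        best = max(range(len(primaries)), key=score)
        groups[best].append(d.label)
    # Urdu is read right to left, so the rightmost primary comes first.
    order = sorted(range(len(primaries)), key=lambda i: -primaries[i].x_max)
    return [(primaries[i].label, tuple(groups[i])) for i in order]


def is_valid_word(ligatures, dictionary):
    """Accept the ligature sequence only if the (hypothetical) dictionary lists it."""
    return tuple(ligatures) in dictionary
```

In this sketch a word is a right-to-left sequence of (primary, diacritics) pairs, and the dictionary is a plain set of such sequences. A practical system would likely also use vertical position, since the same diacritic shape can appear above or below the main body.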
Safia Shabbir and Imran Siddiqi, “Optical Character Recognition System for Urdu Words in Nastaliq Font,” International Journal of Advanced Computer Science and Applications (IJACSA), 7(5), 2016. http://dx.doi.org/10.14569/IJACSA.2016.070575
@article{Shabbir2016,
title = {Optical Character Recognition System for Urdu Words in Nastaliq Font},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2016.070575},
url = {http://dx.doi.org/10.14569/IJACSA.2016.070575},
year = {2016},
publisher = {The Science and Information Organization},
volume = {7},
number = {5},
author = {Safia Shabbir and Imran Siddiqi}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially, as long as the original work is properly cited.