Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.
Digital Object Identifier (DOI) : 10.14569/SpecialIssue.2014.040102
Article Published in International Journal of Advanced Computer Science and Applications(IJACSA), Special Issue on Natural Language Processing 2014, 2014.
Abstract: Splitting of compound Telugu words into its components or root words is one of the important, tedious and yet inaccurate tasks of Natural Language Processing (NLP). Except in few special cases, at least one vowel is necessarily involved in Telugu conjunctions. In the result, vowels are often repeated as they are or are converted into other vowels or consonants. This paper describes issues involved in vowel based splitting of a Telugu bigram into proper root words using Telugu grammar conjunction (‘sandhi’) rules for MT.
T. Kameswara Rao and Dr. T. V.Prasad, “Key Issues in Vowel Based Splitting of Telugu Bigrams” International Journal of Advanced Computer Science and Applications(IJACSA), Special Issue on Natural Language Processing 2014, 2014. http://dx.doi.org/10.14569/SpecialIssue.2014.040102