Future of Information and Communication Conference (FICC) 2025
28-29 April 2025
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 15 Issue 10, 2024.
Abstract: This study addresses the challenge of Native Language Identification (NLI) in ultra-short English for Academic Purposes (EAP) texts by proposing an innovative two-stage recognition method. Conventional views suggest that ultra-short texts lack sufficient linguistic features for effective NLI. However, we have found that even in such brief texts, subtle linguistic cues—such as syntactic structures, lexical choices, and grammatical errors—can still reveal the author’s native language background. Our approach involves fine-tuning the granularity of first language (L1) labels and refining deep learning models to more accurately capture the subtle differences in second language (L2) English texts written by individuals from similar cultural backgrounds. To validate the effectiveness of this method, we designed and conducted a series of scientific experiments using advanced Natural Language Processing (NLP) techniques. The results demonstrate that models adjusted for granular L1 distinctions exhibit greater sensitivity and accuracy in identifying language variations caused by nuanced cultural differences. Furthermore, this method is not only applicable to ultra-short texts but can also be extended to texts of varying lengths, offering new perspectives and tools for handling diverse language inputs. By integrating in-depth linguistic analysis with advanced computational techniques, our research opens up new possibilities for enhancing the performance and adaptability of NLI models in complex linguistic environments. It also provides fresh insights for future efforts aimed at optimizing the capture of linguistic features.
Zhendong Du and Kenji Hashimoto, “Pioneering Granularity: Advancing Native Language Identification in Ultra-Short EAP Texts” International Journal of Advanced Computer Science and Applications(IJACSA), 15(10), 2024. http://dx.doi.org/10.14569/IJACSA.2024.01510100
@article{Du2024,
title = {Pioneering Granularity: Advancing Native Language Identification in Ultra-Short EAP Texts},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2024.01510100},
url = {http://dx.doi.org/10.14569/IJACSA.2024.01510100},
year = {2024},
publisher = {The Science and Information Organization},
volume = {15},
number = {10},
author = {Zhendong Du and Kenji Hashimoto}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.