Computer Vision Conference (CVC) 2026
21-22 May 2026
Publication Links
IJACSA
Special Issues
Computer Vision Conference (CVC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 17 Issue 3, 2026.
Abstract: Sentiment analysis for low-resource languages remains challenging due to limited annotated data, orthographic instability, informal writing practices, and the lack of dedicated linguistic resources, challenges that are particularly acute for Tarifit (Tamazight of the Rif), an under-resourced Amazigh language characterized by strong dialectal variation, pervasive multi-script usage, and highly noisy user-generated content on social media. This study introduces D-LexeCan, a dynamic lexicon-based sentiment analysis framework that infers polarity directly from annotated corpus evidence without relying on predefined sentiment dictionaries or computationally intensive pretrained deep learning and transformer-based models. The framework combines deterministic multi-script normalization, unifying Arabic script, Tifinagh, and Arabizi into a single Tarifit Latin representation with automatic induction of sentiment-bearing unigrams and bigrams, while explicitly modeling negation and amplification phenomena through linguistically motivated operators and preserving emojis as meaningful discourse-level sentiment cues. The approach is evaluated on a manually annotated social media corpus collected from multiple online platforms, where it achieves an accuracy of 0.8800 and a Macro-F1 score of 0.8798. The results outperform a static lexicon baseline with an accuracy of 0.5275, a classical machine-learning model based on TF–IDF and SVM with an accuracy of 0.8525, and neural architectures including BiLSTM with an accuracy of 0.7950. Experiments with frozen multilingual transformer encoders show accuracy ranging from 0.6725 to 0.7650. Fine-tuned multilingual transformers such as mBERT achieve competitive performance, reaching an accuracy of 0.8175. Overall, the results demonstrate that adaptive and linguistically grounded dynamic lexicon induction constitutes an effective, interpretable, and computationally efficient alternative for sentiment analysis in low-resource, noisy, and multi-script African language contexts.
Amar Amakssoum, Fadwa Bouhafer, Anass El Haddadi and Abdelkhalak Bahri. “D-LexeCan: A Dynamic Lexicon-Based Framework for Sentiment Analysis in Tarifit, a Low-Resource Multiscript Language”. International Journal of Advanced Computer Science and Applications (IJACSA) 17.3 (2026). http://dx.doi.org/10.14569/IJACSA.2026.0170372
@article{Amakssoum2026,
title = {D-LexeCan: A Dynamic Lexicon-Based Framework for Sentiment Analysis in Tarifit, a Low-Resource Multiscript Language},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2026.0170372},
url = {http://dx.doi.org/10.14569/IJACSA.2026.0170372},
year = {2026},
publisher = {The Science and Information Organization},
volume = {17},
number = {3},
author = {Amar Amakssoum and Fadwa Bouhafer and Anass El Haddadi and Abdelkhalak Bahri}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.