Future of Information and Communication Conference (FICC) 2025
28-29 April 2025
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 15 Issue 6, 2024.
Abstract: Sentiment analysis is vital for understanding public opinion, but improving its performance is challenging due to the complexities of high-dimensional text data and diverse user-generated content. We propose a novel framework based on Dimensionality Reduction for Machine Learning (DRML) that enhances the classification performance by 21.55% while reducing the dimension of the feature matrix by 99.63%. Our research addresses the fundamental question of whether it is possible to reduce the feature space significantly while improving sentiment analysis performance. Our approach employs Principal Component Analysis (PCA) to effectively capture essential textual features and includes the development of an algorithm for identifying principal components from positive and negative reviews. We then create a supervised dataset by combining these components. Furthermore, we integrate a range of state-of-the-art machine learning algorithms (Decision Tree, K-Nearest Neighbours, Bernoulli Naïve Bayes, and Majority Voting Ensemble) into our framework, along with a custom tokenizer, to harness the full potential of reduced-dimensional data for sentiment classification. We have conducted extensive experiments using gold standard multi-domain benchmark datasets from Amazon to show that DRML outperforms other state-of-the-art approaches. Our proposed methodology gives superior performance with an average performance of 98.38% which is a significant increase in performance by 21.55% compared to the baseline methodology using Bag of Words (BoW). In terms of individual evaluation parameters, DRML shows an increase of 21.84% in Accuracy, 20.4% in Precision, 21.84% in Recall, and 22.11% in F1-score. In comparison with the state-of-the-art (SOTA) methodologies applied to the same benchmark dataset in recent years, our framework demonstrates a significant average increase in Accuracy for Sentiment Analysis by 10.96%. This substantial improvement underscores the effectiveness of our approach. To conclude, our research contributes to the field of sentiment analysis by introducing an innovative framework that not only improves the efficiency of sentiment analysis but also paves the way for the analysis of extensive textual data in diverse real-world applications.
Dhamayanthi N and Lavanya B, “A Novel Framework for Sentiment Analysis: Dimensionality Reduction for Machine Learning (DRML)” International Journal of Advanced Computer Science and Applications(IJACSA), 15(6), 2024. http://dx.doi.org/10.14569/IJACSA.2024.0150678
@article{N2024,
title = {A Novel Framework for Sentiment Analysis: Dimensionality Reduction for Machine Learning (DRML)},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2024.0150678},
url = {http://dx.doi.org/10.14569/IJACSA.2024.0150678},
year = {2024},
publisher = {The Science and Information Organization},
volume = {15},
number = {6},
author = {Dhamayanthi N and Lavanya B}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.