Future of Information and Communication Conference (FICC) 2025
28-29 April 2025
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 10 Issue 10, 2019.
Abstract: Automatic image caption generation is a challenging AI problem since it requires utilization of several techniques from different computer science domains such as computer vision and natural language processing. Deep learning techniques have demonstrated outstanding results in many different applications. However, data augmentation in deep learning, which replicates the amount and the variety of training data available for learning models without the burden of collecting new data, is a promising field in machine learning. Generating textual description for a given image is a challenging task for computers. Nowadays, deep learning performs a significant role in the manipulation of visual data with the help of Convolutional Neural Networks (CNN). In this study, CNNs are employed to train prediction models which will help in automatic image caption generation. The proposed method utilizes the concept of data augmentation to overcome the fuzziness of well-known image caption generation models. Flickr8k dataset is used in the experimental work of this study and the BLEU score is applied to evaluate the reliability of the proposed method. The results clearly show the stability of the outcomes generated through the proposed method when compared to others.
Hamza Aldabbas, Muhammad Asad, Mohammad Hashem Ryalat, Kaleem Razzaq Malik and Muhammad Zubair Akbar Qureshi, “Data Augmentation to Stabilize Image Caption Generation Models in Deep Learning” International Journal of Advanced Computer Science and Applications(IJACSA), 10(10), 2019. http://dx.doi.org/10.14569/IJACSA.2019.0101074
@article{Aldabbas2019,
title = {Data Augmentation to Stabilize Image Caption Generation Models in Deep Learning},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2019.0101074},
url = {http://dx.doi.org/10.14569/IJACSA.2019.0101074},
year = {2019},
publisher = {The Science and Information Organization},
volume = {10},
number = {10},
author = {Hamza Aldabbas and Muhammad Asad and Mohammad Hashem Ryalat and Kaleem Razzaq Malik and Muhammad Zubair Akbar Qureshi}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.