The Science and Information (SAI) Organization
  • Home
  • About Us
  • Journals
  • Conferences
  • Contact Us

Publication Links

  • IJACSA
  • Author Guidelines
  • Publication Policies

IJACSA

  • About the Journal
  • Call for Papers
  • Editorial Board
  • Author Guidelines
  • Submit your Paper
  • Current Issue
  • Archives
  • Indexing
  • Fees/ APC
  • Reviewers
  • Apply as a Reviewer

IJARAI

  • About the Journal
  • Archives
  • Indexing & Archiving

Special Issues

  • Home
  • Archives
  • Proposals
  • GIDP 2026
  • ICONS_BA 2025

Computer Vision Conference (CVC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Computing Conference

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Intelligent Systems Conference (IntelliSys)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Future Technologies Conference (FTC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact
  • Home
  • Call for Papers
  • Editorial Board
  • Guidelines
  • Submit
  • Current Issue
  • Archives
  • Indexing
  • Fees
  • Reviewers
  • RSS Feed

DOI: 10.14569/IJACSA.2024.0150704
PDF

Enhancing Audio Classification Through MFCC Feature Extraction and Data Augmentation with CNN and RNN Models

Author 1: Karim Mohammed Rezaul
Author 2: Md. Jewel
Author 3: Md Shabiul Islam
Author 4: Kazy Noor e Alam Siddiquee
Author 5: Nick Barua
Author 6: Muhammad Azizur Rahman
Author 7: Mohammad Shan-A-Khuda
Author 8: Rejwan Bin Sulaiman
Author 9: Md Sadeque Imam Shaikh
Author 10: Md Abrar Hamim
Author 11: F.M Tanmoy
Author 12: Afraz Ul Haque
Author 13: Musarrat Saberin Nipun
Author 14: Navid Dorudian
Author 15: Amer Kareem
Author 16: Ahmmed Khondokar Farid
Author 17: Asma Mubarak
Author 18: Tajnuva Jannat
Author 19: Umme Fatema Tuj Asha

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 15 Issue 7, 2024.

  • Abstract and Keywords
  • How to Cite this Article
  • {} BibTeX Source

Abstract: Sound classification is a multifaceted task that necessitates the gathering and processing of vast quantities of data, as well as the construction of machine learning models that can accurately distinguish between various sounds. In our project, we implemented a novel methodology for classifying both musical instruments and environmental sounds, utilizing convolutional and recurrent neural networks. We used the Mel Frequency Cepstral Coefficient (MFCC) method to extract features from audio, which emulates the human auditory system and produces highly distinct features. Knowing how important data processing is, we implemented distinctive approaches, including a range of data augmentation and cleaning techniques, to achieve an optimized solution. The outcomes were noteworthy, as both the convolutional and recurrent neural network models achieved a commendable level of accuracy. As machine learning and deep learning continue to revolutionize image classification, it is high time to explore the development of adaptable models for audio classification. Despite the challenges associated with a small dataset, we successfully crafted our models using convolutional and recurrent neural networks. Overall, our strategy for sound classification bears significant implications for diverse domains, encompassing speech recognition, music production, and healthcare. We hold the belief that with further research and progress, our work can pave the way for breakthroughs in audio data classification and analysis.

Keywords: Deep learning (artificial intelligence); data augmentation; audio segmentation; signal processing; frame blocking; fast fourier transform; discrete cosine transform; feature extraction; MFCC; CNN; RNN

Karim Mohammed Rezaul, Md. Jewel, Md Shabiul Islam, Kazy Noor e Alam Siddiquee, Nick Barua, Muhammad Azizur Rahman, Mohammad Shan-A-Khuda, Rejwan Bin Sulaiman, Md Sadeque Imam Shaikh, Md Abrar Hamim, F.M Tanmoy, Afraz Ul Haque, Musarrat Saberin Nipun, Navid Dorudian, Amer Kareem, Ahmmed Khondokar Farid, Asma Mubarak, Tajnuva Jannat and Umme Fatema Tuj Asha. “Enhancing Audio Classification Through MFCC Feature Extraction and Data Augmentation with CNN and RNN Models”. International Journal of Advanced Computer Science and Applications (IJACSA) 15.7 (2024). http://dx.doi.org/10.14569/IJACSA.2024.0150704

@article{Rezaul2024,
title = {Enhancing Audio Classification Through MFCC Feature Extraction and Data Augmentation with CNN and RNN Models},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2024.0150704},
url = {http://dx.doi.org/10.14569/IJACSA.2024.0150704},
year = {2024},
publisher = {The Science and Information Organization},
volume = {15},
number = {7},
author = {Karim Mohammed Rezaul and Md. Jewel and Md Shabiul Islam and Kazy Noor e Alam Siddiquee and Nick Barua and Muhammad Azizur Rahman and Mohammad Shan-A-Khuda and Rejwan Bin Sulaiman and Md Sadeque Imam Shaikh and Md Abrar Hamim and F.M Tanmoy and Afraz Ul Haque and Musarrat Saberin Nipun and Navid Dorudian and Amer Kareem and Ahmmed Khondokar Farid and Asma Mubarak and Tajnuva Jannat and Umme Fatema Tuj Asha}
}



Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

IJACSA

Upcoming Conferences

Computer Vision Conference (CVC) 2026

21-22 May 2026

  • Amsterdam, The Netherlands

Computing Conference 2026

9-10 July 2026

  • London, United Kingdom

Artificial Intelligence Conference 2026

3-4 September 2026

  • Amsterdam, The Netherlands

Future Technologies Conference (FTC) 2026

15-16 October 2026

  • Berlin, Germany
The Science and Information (SAI) Organization
BACK TO TOP

Computer Science Journal

  • About the Journal
  • Call for Papers
  • Submit Paper
  • Indexing

Our Conferences

  • Computer Vision Conference
  • Computing Conference
  • Intelligent Systems Conference
  • Future Technologies Conference

Help & Support

  • Contact Us
  • About Us
  • Terms and Conditions
  • Privacy Policy

The Science and Information (SAI) Organization Limited is a company registered in England and Wales under Company Number 8933205.