Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages

Zafi Sherhan Syed; Sajjad Ali Memon; Muhammad Shehram Shah; Abbas Shah Syed

doi:10.14569/IJACSA.2020.01104104

DOI: 10.14569/IJACSA.2020.01104104

PDF

Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages

Author 1: Zafi Sherhan Syed

Author 2: Sajjad Ali Memon

Author 3: Muhammad Shehram Shah

Author 4: Abbas Shah Syed

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 11 Issue 4, 2020.

Abstract and Keywords
How to Cite this Article
{} BibTeX Source

Abstract: Speech emotion recognition is one of the most active areas of research in the field of affective computing and social signal processing. However, most research is directed towards a select group of languages such as English, German, and French. This is mainly due to a lack of available datasets in other languages. Such languages are called low-resource languages given that there is a scarcity of publicly available datasets. In the recent past, there has been a concerted effort within the research community to create and introduce datasets for emotion recognition for low-resource languages. To this end, we introduce in this paper the Urdu-Sindhi Speech Emotion Corpus, a novel dataset consisting of 1,435 speech recordings for two widely spoken languages of South Asia, that is Urdu and Sindhi. Furthermore, we also trained machine learning models to establish a baseline for classification performance, with accuracy being measured in terms of unweighted average recall (UAR). We report that the best performing model for Urdu language achieves a UAR = 65.00% on the validation partition and a UAR = 56.96% on the test partition. Meanwhile, the model for Sindhi language achieved UARs of 66.50% and 55.29% on the validation and test partitions, respectively. This classification performance is considerably better than the chance level UAR of 16.67%. The dataset can be accessed via https://zenodo.org/record/3685274.

Keywords: Speech emotion recognition; affective computing; social signal processing

Zafi Sherhan Syed, Sajjad Ali Memon, Muhammad Shehram Shah and Abbas Shah Syed, “Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages” International Journal of Advanced Computer Science and Applications(IJACSA), 11(4), 2020. http://dx.doi.org/10.14569/IJACSA.2020.01104104

@article{Syed2020,
title = {Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2020.01104104},
url = {http://dx.doi.org/10.14569/IJACSA.2020.01104104},
year = {2020},
publisher = {The Science and Information Organization},
volume = {11},
number = {4},
author = {Zafi Sherhan Syed and Sajjad Ali Memon and Muhammad Shehram Shah and Abbas Shah Syed}
}

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages

Upcoming Conferences