Design of Hierarchical Classifier to Improve Speech Emotion Recognition

P. Vasuki

doi:10.32604/csse.2023.024441

Open Access icon Open Access

ARTICLE

Design of Hierarchical Classifier to Improve Speech Emotion Recognition

P. Vasuki^*

Department of IT, Sri Sivasubramaniya Nadar College of Engineering, Chennai, 603110, India

* Corresponding Author: P. Vasuki. Email: email

Computer Systems Science and Engineering 2023, 44(1), 19-33. https://doi.org/10.32604/csse.2023.024441

Received 17 October 2021; Accepted 13 December 2021; Issue published 01 June 2022

Abstract

Automatic Speech Emotion Recognition (SER) is used to recognize emotion from speech automatically. Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influenced by the variations in gender, age, the cultural and acoustical background of the speaker. The acoustical resemblance between emotional expressions further increases the complexity of recognition. Many recent research works are concentrated to address these effects individually. Instead of addressing every influencing attribute individually, we would like to design a system, which reduces the effect that arises on any factor. We propose a two-level Hierarchical classifier named Interpreter of responses (IR). The first level of IR has been realized using Support Vector Machine (SVM) and Gaussian Mixer Model (GMM) classifiers. In the second level of IR, a discriminative SVM classifier has been trained and tested with meta information of first-level classifiers along with the input acoustical feature vector which is used in primary classifiers. To train the system with a corpus of versatile nature, an integrated emotion corpus has been composed using emotion samples of 5 speech corpora, namely; EMO-DB, IITKGP-SESC, SAVEE Corpus, Spanish emotion corpus, CMU's Woogle corpus. The hierarchical classifier has been trained and tested using MFCC and Low-Level Descriptors (LLD). The empirical analysis shows that the proposed classifier outperforms the traditional classifiers. The proposed ensemble design is very generic and can be adapted even when the number and nature of features change. The first-level classifiers GMM or SVM may be replaced with any other learning algorithm.

Keywords

Speech emotion recognition; hierarchical classifier design; ensemble; emotion speech corpora

Cite This Article

APA Style

Vasuki, P. (2023). Design of Hierarchical Classifier to Improve Speech Emotion Recognition. Computer Systems Science and Engineering, 44(1), 19–33. https://doi.org/10.32604/csse.2023.024441

Vancouver Style

Vasuki P. Design of Hierarchical Classifier to Improve Speech Emotion Recognition. Comput Syst Sci Eng. 2023;44(1):19–33. https://doi.org/10.32604/csse.2023.024441

IEEE Style

P. Vasuki, “Design of Hierarchical Classifier to Improve Speech Emotion Recognition,” Comput. Syst. Sci. Eng., vol. 44, no. 1, pp. 19–33, 2023. https://doi.org/10.32604/csse.2023.024441

BibTex EndNote RIS

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Design of Hierarchical Classifier to Improve Speech Emotion Recognition

Abstract

Keywords

Cite This Article

2481

955

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link