TY - JOUR
T1 - Acoustic feature selection for automatic emotion recognition from speech
AU - Rong, Jia
AU - Li, Gang
AU - Chen, Yi Ping Phoebe
PY - 2009/5/1
Y1 - 2009/5/1
N2 - Emotional expression and understanding are normal instincts of human beings, but automatical emotion recognition from speech without referring any language or linguistic information remains an unclosed problem. The limited size of existing emotional data samples, and the relative higher dimensionality have outstripped many dimensionality reduction and feature selection algorithms. This paper focuses on the data preprocessing techniques which aim to extract the most effective acoustic features to improve the performance of the emotion recognition. A novel algorithm is presented in this paper, which can be applied on a small sized data set with a high number of features. The presented algorithm integrates the advantages from a decision tree method and the random forest ensemble. Experiment results on a series of Chinese emotional speech data sets indicate that the presented algorithm can achieve improved results on emotional recognition, and outperform the commonly used Principle Component Analysis (PCA)/Multi-Dimensional Scaling (MDS) methods, and the more recently developed ISOMap dimensionality reduction method.
AB - Emotional expression and understanding are normal instincts of human beings, but automatical emotion recognition from speech without referring any language or linguistic information remains an unclosed problem. The limited size of existing emotional data samples, and the relative higher dimensionality have outstripped many dimensionality reduction and feature selection algorithms. This paper focuses on the data preprocessing techniques which aim to extract the most effective acoustic features to improve the performance of the emotion recognition. A novel algorithm is presented in this paper, which can be applied on a small sized data set with a high number of features. The presented algorithm integrates the advantages from a decision tree method and the random forest ensemble. Experiment results on a series of Chinese emotional speech data sets indicate that the presented algorithm can achieve improved results on emotional recognition, and outperform the commonly used Principle Component Analysis (PCA)/Multi-Dimensional Scaling (MDS) methods, and the more recently developed ISOMap dimensionality reduction method.
KW - Emotion recognition
KW - Feature selection
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=64549147125&partnerID=8YFLogxK
U2 - 10.1016/j.ipm.2008.09.003
DO - 10.1016/j.ipm.2008.09.003
M3 - Article
AN - SCOPUS:64549147125
SN - 0306-4573
VL - 45
SP - 315
EP - 328
JO - Information Processing and Management
JF - Information Processing and Management
IS - 3
ER -