Survey on speech emotion recognition: Features, classification schemes, and databases

作者:

Highlights:

摘要

Recently, increasing attention has been directed to the study of the emotional content of speech signals, and hence, many systems have been proposed to identify the emotional content of a spoken utterance. This paper is a survey of speech emotion classification addressing three important aspects of the design of a speech emotion recognition system. The first one is the choice of suitable features for speech representation. The second issue is the design of an appropriate classification scheme and the third issue is the proper preparation of an emotional speech database for evaluating system performance. Conclusions about the performance and limitations of current speech emotion recognition systems are discussed in the last section of this survey. This section also suggests possible ways of improving speech emotion recognition systems.

论文关键词:Archetypal emotions,Speech emotion recognition,Statistical classifiers,Dimensionality reduction techniques,Emotional speech databases

论文评审过程:Received 4 February 2009, Revised 25 July 2010, Accepted 1 September 2010, Available online 13 October 2010.

论文官网地址:https://doi.org/10.1016/j.patcog.2010.09.020