Toward effective automatic recognition systems of emotion in speech

Abstract

Humans are emotional beings, and emotions are one of the main drivers of human thoughts and actions. Therefore, for all environments designed for humans, it is essential that emotion processing capabilities such as analysis, recognition, and synthesis be incorporated. Naturally, any type of information, such as audio, visual, written, mental, or physiological, can be used for these tasks.
In this chapter, our concentration will be on emotion recognition from speech. Specifically, this chapter discusses the collection and organization of databases and emotional descriptors; the calculation, selection, and normalization of relevant speech features; and the models used to recognize emotions. We outline achievements, open questions, and future challenges in building Effective Automatic Speech Emotion Recognition (EASER) systems. It is known that emotions cause mental and physiological changes that also reflect in uttered speech.

Date: November 1, 2013
Authors: Carlos Busso, Murtaza Bulut, Shrikanth Narayanan, J Gratch, S Marsella
Journal: Social emotions in nature and artifact: emotions in human and human-computer interaction
Volume: 7
Issue: 17
Pages: 110-127
Publisher: New York, NY, USA: Oxford Univ. Press

View Paper

Information Sciences Institute

Publications

Toward effective automatic recognition systems of emotion in speech

Abstract