Shaking acoustic spectral sub-bands can Letxer regularize learning in affective computing

Abstract

In this work, we investigate a recently proposed regularization technique based on multi-branch architectures, called Shake-Shake regularization, for the task of speech emotion recognition. In addition, we also propose variants to incorporate domain knowledge into model configurations. The experimental results demonstrate: 1) independently shaking subbands delivers favorable models compared to shaking the entire spectral-temporal feature maps. 2) with proper patience in early stopping, the proposed models can simultaneously outperform the baseline and maintain a smaller performance gap between training and validation.

Date: 2018
Authors: Che-Wei Huang, Shrikanth Narayanan
Conference: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Pages: 6827-6831
Publisher: IEEE

View Paper

Information Sciences Institute

Publications

Shaking acoustic spectral sub-bands can Letxer regularize learning in affective computing

Abstract