Publications

An overview on perceptually motivated audio indexing and classification

Abstract

An audio indexing system aims at describing audio content by identifying, labeling, or categorizing different acoustic events. Since the resulting audio classification and indexing is meant for direct human consumption, it is highly desirable that it produces perceptually relevant results. This can be achieved by integrating specific knowledge of the human auditory system into the design process to varying extents. In this paper, we highlight some of the important concepts used in audio classification and indexing that are perceptually motivated or that exploit some principles of perception. In particular, we discuss several different strategies to integrate human perception, including: 1) the use of generic audition models; 2) the use of perceptually relevant features for the analysis stage that are perceptually justified either as a component of a hearing model or as being correlated with a perceptual dimension of sound similarity …

Date: 2013
Authors: Gaël Richard, Shiva Sundaram, Shrikanth Narayanan
Source: Proceedings of the IEEE
Volume: 101
Issue: 9
Pages: 1939-1954
Publisher: IEEE