Publications

Overlapped speech detection using long-term spectro-temporal similarity in stereo recording

Abstract

Detecting overlapped speech in stereo recordings made with close-talk microphones is important for a variety of applications, including the identification of back-channels, interruptions, etc., in dyadic or multi-party interactions. For detecting overlapped speech, we propose a feature derived from the spectral similarity of the two channels over a range of acoustic frames. During overlapped speech frames, the values of the proposed spectro-temporal similarity-based feature decrease; during non-overlapped speech frames, they increase due to the presence of cross-talk. The proposed feature thus helps to discriminate overlapped speech frames from non-overlapped ones. Using overlapped speech detection experiments on a dyadic interaction corpus, it is shown that the proposed feature provides a significant improvement, ~26% absolute, in the accuracy of detecting the overlapped speech …
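
The idea in the abstract, comparing the spectra of the two channels over a span of frames, can be illustrated with a rough sketch. This is not the paper's feature definition, which the abstract does not give. The correlation measure, the frame and hop lengths, the context width, and the function names `frame_log_spectra` and `long_term_similarity` are all assumptions chosen for illustration.

```python
import numpy as np

def frame_log_spectra(signal, frame_len=400, hop=160):
    """Log-magnitude spectrum of each Hann-windowed frame.

    frame_len and hop are illustrative values (25 ms / 10 ms at 16 kHz),
    not taken from the paper.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # Small constant avoids log(0) on silent bins.
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-10)

def long_term_similarity(spec_a, spec_b, context=10):
    """Per-frame correlation between the two channels' spectra,
    computed over a window of +/- context frames (assumed measure)."""
    n_frames = spec_a.shape[0]
    sim = np.zeros(n_frames)
    for t in range(n_frames):
        lo, hi = max(0, t - context), min(n_frames, t + context + 1)
        a = spec_a[lo:hi].ravel()
        b = spec_b[lo:hi].ravel()
        a = a - a.mean()
        b = b - b.mean()
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        sim[t] = float(a @ b) / denom if denom > 0 else 0.0
    return sim

# Synthetic stand-ins for two talkers (white noise, not real speech).
rng = np.random.default_rng(0)
talker_a = rng.standard_normal(16000)
talker_b = rng.standard_normal(16000)

spec_a = frame_log_spectra(talker_a)
# Cross-talk case: channel B only picks up an attenuated copy of talker A.
sim_crosstalk = long_term_similarity(spec_a, frame_log_spectra(0.1 * talker_a))
# Overlap case: channel B carries a different, simultaneous talker.
sim_overlap = long_term_similarity(spec_a, frame_log_spectra(talker_b))
print(sim_crosstalk.mean(), sim_overlap.mean())
```

In this toy setup the similarity is high when one channel only contains leakage from the other and low when each channel carries its own talker, which matches the direction of the effect described in the abstract. A detector would then threshold or classify the per-frame values; that step is not sketched here.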

Date
2011
Authors
Bo Xiao, Prasanta Kumar Ghosh, Panayiotis Georgiou, Shrikanth S Narayanan
Conference
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Pages
5216-5219
Publisher
IEEE