Long-term SNR estimation of speech signals in known and unknown channel conditions

Abstract

Many speech processing algorithms and applications rely on the explicit knowledge of signal-to-noise ratio (SNR) in their design and implementation. Estimating the SNR of a signal can enhance the performance of such technologies. We propose a novel method for estimating the long-term SNR of speech signals based on features, from which we can approximately detect regions of speech presence in a noisy signal. By measuring the energy in these regions, we create sets of energy ratios, from which we train regression models for different types of noise. If the type of noise that corrupts a signal is known, we use the corresponding regression model to estimate the SNR. When the noise is unknown, we use a deep neural network to find the “closest” regression model to estimate the SNR. Evaluations were done based on the TIMIT speech corpus, using noises from the NOISEX-92 noise database. Furthermore, we …

Date: 2016
Authors: Pavlos Papadopoulos, Andreas Tsiartas, Shrikanth Narayanan
Journal: IEEE/ACM Transactions on audio, speech, and language processing
Volume: 24
Issue: 12
Pages: 2495-2506
Publisher: IEEE

View Paper

Information Sciences Institute

Publications

Long-term SNR estimation of speech signals in known and unknown channel conditions

Abstract