Publications

Long-term SNR estimation of speech signals in known and unknown channel conditions

Abstract

Many speech processing algorithms and applications rely on the explicit knowledge of signal-to-noise ratio (SNR) in their design and implementation. Estimating the SNR of a signal can enhance the performance of such technologies. We propose a novel method for estimating the long-term SNR of speech signals based on features, from which we can approximately detect regions of speech presence in a noisy signal. By measuring the energy in these regions, we create sets of energy ratios, from which we train regression models for different types of noise. If the type of noise that corrupts a signal is known, we use the corresponding regression model to estimate the SNR. When the noise is unknown, we use a deep neural network to find the “closest” regression model to estimate the SNR. Evaluations were done based on the TIMIT speech corpus, using noises from the NOISEX-92 noise database. Furthermore, we …

Date
2016
Authors
Pavlos Papadopoulos, Andreas Tsiartas, Shrikanth Narayanan
Journal
IEEE/ACM Transactions on audio, speech, and language processing
Volume
24
Issue
12
Pages
2495-2506
Publisher
IEEE