Rapid language identification

Abstract

A critical challenge to automatic language identification (LID) is achieving accurate performance with the shortest possible speech segment in a rapid fashion. The accuracy to correctly identify the spoken language is highly sensitive to the duration of speech and is bounded by the amount of information available. The proposed approach for rapid language identification transforms the utterances to a low dimensional i-vector representation upon which language classification methods are applied. In order to meet the challenges involved in rapidly making reliable decisions about the spoken language, a highly accurate and computationally efficient framework of i-vector extraction is proposed. The LID framework integrates the approach of universal background model (UBM) fused total variability modeling. UBM-fused modeling yields the estimation of a more discriminant, single i-vector space. This way, it is also a …

Date: 2015
Authors: Maarten Van Segbroeck, Ruchir Travadi, Shrikanth S Narayanan
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Volume: 23
Issue: 7
Pages: 1118-1129
Publisher: IEEE

Information Sciences Institute

Publications

Rapid language identification

Abstract