Publications
Rapid language identification
Abstract
A critical challenge to automatic language identification (LID) is achieving accurate performance with the shortest possible speech segment in a rapid fashion. The accuracy to correctly identify the spoken language is highly sensitive to the duration of speech and is bounded by the amount of information available. The proposed approach for rapid language identification transforms the utterances to a low dimensional i-vector representation upon which language classification methods are applied. In order to meet the challenges involved in rapidly making reliable decisions about the spoken language, a highly accurate and computationally efficient framework of i-vector extraction is proposed. The LID framework integrates the approach of universal background model (UBM) fused total variability modeling. UBM-fused modeling yields the estimation of a more discriminant, single i-vector space. This way, it is also a …
- Date
- April 6, 2015
- Authors
- Maarten Van Segbroeck, Ruchir Travadi, Shrikanth S Narayanan
- Journal
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Volume
- 23
- Issue
- 7
- Pages
- 1118-1129
- Publisher
- IEEE