Publications

Recognition of voice onset time for use in pronunciation modeling

Abstract

This study examines methods for recognizing native and accented voiceless stops based on voice onset time (VOT). These methods are tested on data from the Tball corpus of early elementary school children, which includes both native English speakers and Spanish speakers learning English, and which is transcribed to highlight pronunciation variation. We examine the English voiceless stop series, which have long VOT and aspiration, and the corresponding voiceless stops in Spanish accented English, which have short VOT and little aspiration. The methods tested are : (1) to train hidden Markov models (HMMs) based on native speech and then extract the VOT times by post‐processing phone‐level alignments, (2) to train HMMs with explicit aspiration models, and (3) to train, for each phoneme, different HMMs for native and accented variants. Error rates of 23%–53% for distinguishing phone VOT …

Date
September 1, 2005
Authors
Abe Kazemzadeh, Sungbok Lee, Shrikanth Narayanan
Journal
The Journal of the Acoustical Society of America
Volume
118
Issue
3_Supplement
Pages
2026-2026
Publisher
Acoustical Society of America