Publications

Dynamic 3-D visualization of vocal tract shaping during speech

Abstract

Noninvasive imaging is widely used in speech research as a means to investigate the shaping and dynamics of the vocal tract during speech production. 3-D dynamic MRI would be a major advance, as it would provide 3-D dynamic visualization of the entire vocal tract. We present a novel method for the creation of 3-D dynamic movies of vocal tract shaping based on the acquisition of 2-D dynamic data from parallel slices and temporal alignment of the image sequences using audio information. Multiple sagittal 2-D real-time movies with synchronized audio recordings are acquired for English vowel-consonant-vowel stimuli /ala/, /aa/, /asa/, and /a$\! \smallint \!$a/. Audio data are aligned using mel-frequency cepstral coefficients (MFCC) extracted from windowed intervals of the speech signal. Sagittal image sequences acquired from all slices are then aligned using dynamic time warping (DTW). The aligned image …

Date
2012
Authors
Yinghua Zhu, Yoon-Chul Kim, Michael I Proctor, Shrikanth S Narayanan, Krishna S Nayak
Journal
IEEE transactions on medical imaging
Volume
32
Issue
5
Pages
838-848
Publisher
IEEE