Publications

Leveraging linguistic context in dyadic interactions to improve automatic speech recognition for children

Abstract

Automatic speech recognition for child speech has been long considered a more challenging problem than for adult speech. Various contributing factors have been identified such as larger acoustic speech variability including mispronunciations due to continuing biological changes in growth, developing vocabulary and linguistic skills, and scarcity of training corpora. A further challenge arises when dealing with spontaneous speech of children involved in a conversational interaction, and especially when the child may have limited or impaired communication ability. This includes health applications, one of the motivating domains of this paper, that involve goal-oriented dyadic interactions between a child and clinician/adult social partner as a part of behavioral assessment. In this work, we use linguistic context information from the interaction to adapt speech recognition models for children speech. Specifically …

Date
2020
Authors
Manoj Kumar, So Hyun Kim, Catherine Lord, Thomas D Lyon, Shrikanth Narayanan
Journal
Computer speech & language
Volume
63
Pages
101101
Publisher
Academic Press