Publications
Leveraging linguistic context in dyadic interactions to improve automatic speech recognition for children
Abstract
Automatic speech recognition for child speech has been long considered a more challenging problem than for adult speech. Various contributing factors have been identified such as larger acoustic speech variability including mispronunciations due to continuing biological changes in growth, developing vocabulary and linguistic skills, and scarcity of training corpora. A further challenge arises when dealing with spontaneous speech of children involved in a conversational interaction, and especially when the child may have limited or impaired communication ability. This includes health applications, one of the motivating domains of this paper, that involve goal-oriented dyadic interactions between a child and clinician/adult social partner as a part of behavioral assessment. In this work, we use linguistic context information from the interaction to adapt speech recognition models for children speech. Specifically …
- Date
- 2020
- Authors
- Manoj Kumar, So Hyun Kim, Catherine Lord, Thomas D Lyon, Shrikanth Narayanan
- Journal
- Computer speech & language
- Volume
- 63
- Pages
- 101101
- Publisher
- Academic Press