Hierarchical classification for speech-to-speech translation.

Abstract

Concept classifiers have been used in speech to speech translation systems. Their effectiveness, however, depends on the size of the domain that they cover. The main bottleneck in expanding the classifier domain is the degradation in accuracy as the number of classes increase. Here we introduce a hierarchical classification process that aims to scale up the domain without compromising the accuracy. We propose to exploit the categorical associations that naturally appear in the training data to split the domain into sub-domains with fewer classes. We use two methods of language model based classification and topic modeling with latent Dirichlet allocation to use the discourse information for sub-domain detection. The classification task is performed in two steps. First the best category for the discourse is detected using one of the above methods. Then a sub-domain classifier—limited to that category—is …

Date: September 27, 2025
Authors: Emil Ettelaie, Panayiotis G Georgiou, Shrikanth S Narayanan
Conference: INTERSPEECH
Pages: 2530-2533

View Paper

Information Sciences Institute

Publications

Hierarchical classification for speech-to-speech translation.

Abstract