Name translation in statistical machine translation-learning when to transliterate

Abstract

We present a method to transliterate names in the framework of end-to-end statistical machine translation. The system is trained to learn when to transliterate. For Arabic to English MT, we developed and trained a transliterator on a bitext of 7 million sentences and Google’s English terabyte ngrams and achieved better name translation accuracy than 3 out of 4 professional translators. The paper also includes a discussion of challenges in name translation evaluation.

Date: 2008
Authors: Ulf Hermjakob, Kevin Knight, Hal Daumé III
Conference: Proceedings of ACL-08: HLT
Pages: 389-397

View Paper

Information Sciences Institute

Publications

Name translation in statistical machine translation-learning when to transliterate

Abstract