Publications
Name translation in statistical machine translation-learning when to transliterate
Abstract
We present a method to transliterate names in the framework of end-to-end statistical machine translation. The system is trained to learn when to transliterate. For Arabic to English MT, we developed and trained a transliterator on a bitext of 7 million sentences and Google’s English terabyte ngrams and achieved better name translation accuracy than 3 out of 4 professional translators. The paper also includes a discussion of challenges in name translation evaluation.
- Date
- 2008
- Authors
- Ulf Hermjakob, Kevin Knight, Hal Daumé III
- Conference
- Proceedings of ACL-08: HLT
- Pages
- 389-397