JOURNAL ARTICLE

HMM Word and Phrase Alignment for Statistical Machine Translation

Yonggang DengBill Byrne

Year: 2008 Journal:   IEEE Transactions on Audio Speech and Language Processing Vol: 16 (3)Pages: 494-507   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Estimation and alignment procedures for word and phrase alignment hidden Markov models (HMMs) are developed for the alignment of parallel text. The development of these models is motivated by an analysis of the desirable features of IBM Model 4, one of the original and most effective models for word alignment. These models are formulated to capture the desirable aspects of Model 4 in an HMM alignment formalism. Alignment behavior is analyzed and compared to human-generated reference alignments, and the ability of these models to capture different types of alignment phenomena is evaluated. In analyzing alignment performance, Chinese-English word alignments are shown to be comparable to those of IBM Model 4 even when models are trained over large parallel texts. In translation performance, phrase-based statistical machine translation systems based on these HMM alignments can equal and exceed systems based on Model 4 alignments, and this is shown in Arabic-English and Chinese-English translation. These alignment models can also be used to generate posterior statistics over collections of parallel text, and this is used to refine and extend phrase translation tables with a resulting improvement in translation quality.

Keywords:
Computer science Hidden Markov model Phrase Machine translation Natural language processing Artificial intelligence Word (group theory) Translation (biology) Speech recognition Rule-based machine translation Statistical model Linguistics

Metrics

44
Cited By
7.58
FWCI (Field Weighted Citation Impact)
36
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Bayesian Word Alignment and Phrase Table Training for Statistical Machine Translation

Zezhong LiHideto IkedaJunichi Fukumoto

Journal:   IEICE Transactions on Information and Systems Year: 2013 Vol: E96.D (7)Pages: 1536-1543
JOURNAL ARTICLE

Statistical machine translation using hierarchical phrase alignment

Taro WatanabeKenji ImamuraEiichiro SumitaHiroshi G. Okuno

Journal:   Systems and Computers in Japan Year: 2007 Vol: 38 (6)Pages: 70-79
BOOK-CHAPTER

Statistical Machine Translation and Word Alignment

Machine Translation Year: 2017 Pages: 121-146
© 2026 ScienceGate Book Chapters — All rights reserved.