JOURNAL ARTICLE

Bayesian Word Alignment and Phrase Table Training for Statistical Machine Translation

Zezhong LiHideto IkedaJunichi Fukumoto

Year: 2013 Journal:   IEICE Transactions on Information and Systems Vol: E96.D (7)Pages: 1536-1543   Publisher: Institute of Electronics, Information and Communication Engineers

Abstract

In most phrase-based statistical machine translation (SMT) systems, the translation model relies on word alignment, which serves as a constraint for the subsequent building of a phrase table. Word alignment is usually inferred by GIZA++, which implements all the IBM models and HMM model in the framework of Expectation Maximum (EM). In this paper, we present a fully Bayesian inference for word alignment. Different from the EM approach, the Bayesian inference makes use of all possible parameter values rather than estimating a single parameter value, from which we expect a more robust inference. After inferring the word alignment, current SMT systems usually train the phrase table from Viterbi word alignment, which is prone to learn incorrect phrases due to the word alignment mistakes. To overcome this drawback, a new phrase extraction method is proposed based on multiple Gibbs samples from Bayesian inference for word alignment. Empirical results show promising improvements over baselines in alignment quality as well as the translation performance.

Keywords:
Computer science Phrase Word (group theory) Artificial intelligence Machine translation Natural language processing Inference Bayesian inference Viterbi algorithm Translation (biology) Hidden Markov model Table (database) Bayesian probability Speech recognition Data mining Mathematics

Metrics

3
Cited By
0.94
FWCI (Field Weighted Citation Impact)
28
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

HMM Word and Phrase Alignment for Statistical Machine Translation

Yonggang DengBill Byrne

Journal:   IEEE Transactions on Audio Speech and Language Processing Year: 2008 Vol: 16 (3)Pages: 494-507
JOURNAL ARTICLE

Statistical machine translation using hierarchical phrase alignment

Taro WatanabeKenji ImamuraEiichiro SumitaHiroshi G. Okuno

Journal:   Systems and Computers in Japan Year: 2007 Vol: 38 (6)Pages: 70-79
BOOK-CHAPTER

Statistical Machine Translation and Word Alignment

Machine Translation Year: 2017 Pages: 121-146
© 2026 ScienceGate Book Chapters — All rights reserved.