SMT systems rely on sufficient amount of parallel corpora to train the translation model. This paper investigates possibilities to use word-to-word and phrase-to-phrase translations extracted not only from clean parallel corpora but also from noisy comparable corpora. Translation results for a Chinese to English translation task are given.
Rafael E. BanchsJosep CregoAdrià de GispertPatrik LambertJosé Bernardo Mariño Acebal
Chung‐Chi HuangWei-Teh ChenJason S. Chang
Zhixiang RenYajuan LüJie CaoQun LiuYun Huang