J FinkelC ManningV YadavS BethardP KoehnM JohnsonE MatusovP WilkenY GeorgakopoulouM GraaY KimJ SchamperS KhadiviH NeyN PhuocC.-Y OckM.-T LuongC ManningQ NgoW WiniwarterG LampleM BallesterosS SubramanianK KawakamiC DyerS HochreiterJ SchmidhuberH MayerF GomezD WierstraI NagyA KnollJ SchmidhuberF.-F LiP PeronaG QiuL GetoorB TaskarQ.-P NguyenA.-D VoJ.-C ShinP TranC.-Y OckT LuongH PhamC ManningA VaswaniH BahuleyanL MouO VechtomovaP PoupartS HochreiterH KamigaitoK HayashiT HiraoH TakamuraM OkumuraM NagataG KleinY KimY DengJ SenellartA RushD NguyenD NguyenT VuM DrasM JohnsonK PapineniS RoukosT WardW.-J ZhuM SnoverB DorrR SchwartzL Micciulla
Translators are becoming more and more popular and achieving reliable results since deep learning was born.English-Vietnamese machines translation (MT) still have limitations due to Vietnamese contain words with many different meanings, thus resulting in the lower accuracy of automatic MT systems.Our study applied Named Entity Recognition (NER) tool for Vietnamese sentences to determine the category of words in the English-Vietnamese parallel corpus with over 900K sentence pairs.Then, we performed experiments to assess the effect of NER on English-Vietnamese MT systems.The results showed that NER had a positive effect on MT with averagely 1.24 Bi-Lingual Evaluation Understudy (BLEU) scores and averagely 1.8 Translation Error Rate (TER) scores increased comparing to data without using NER.
Van-Hai VuQuang-Phuoc NguyenKiem-Hieu NguyenJoon-Choul ShinCheol-Young Ock
Dat Ba NguyenSon Huu HoangSon Bao PhamThai Phuong Nguyen
Azmat AnwarLi XiaoYating YangRui DongTurghun Osman
Roman GrundkiewiczKenneth Heafield