Unsupervised Multi-Modal Neural Machine Translation

Yuanhang Su; Kai Fan; Nguyễn Bách; C.‐C. Jay Kuo; Fei Huang

doi:10.1109/cvpr.2019.01073

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Multi-Modal Neural Machine Translation

Yuanhang Su Kai Fan Nguyễn Bách C.‐C. Jay Kuo Fei Huang

Year: 2019 Pages: 10474-10483

DOI: 10.1109/cvpr.2019.01073

Get Full-Text PDF Get Analytical Report

Abstract

Unsupervised neural machine translation (UNMT) has recently achieved remarkable results \cite{lample2018phrase} with only large monolingual corpora in each language. However, the uncertainty of associating target with source sentences makes UNMT theoretically an ill-posed problem. This work investigates the possibility of utilizing images for disambiguation to improve the performance of UNMT. Our assumption is intuitively based on the invariant property of image, i.e., the description of the same visual content by different languages should be approximately similar. We propose an unsupervised multi-modal machine translation (UMNMT) framework based on the language translation cycle consistency loss conditional on the image, targeting to learn the bidirectional multi-modal translation simultaneously. Through an alternate training between multi-modal and uni-modal, our inference model can translate with or without the image. On the widely used Multi30K dataset, the experimental results of our approach are significantly better than those of the text-only UNMT on the 2016 test dataset.

Keywords:

Machine translation Computer science Modal Artificial intelligence Inference Translation (biology) Consistency (knowledge bases) Natural language processing Artificial neural network Image translation Invariant (physics) Machine learning Image (mathematics) Pattern recognition (psychology) Mathematics

Metrics

Cited By

5.13

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Unsupervised Multi-Modal Neural Machine Translation

Abstract

Metrics

Citation History

Topics

Related Documents

Video Pivoting Unsupervised Multi-Modal Machine Translation

Multi-Domain Neural Machine Translation through Unsupervised Adaptation

Unsupervised dialectal neural machine translation

Multi-modal graph contrastive encoding for neural machine translation

Doubly-Attentive Decoder for Multi-modal Neural Machine Translation