Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting

Po-Yao Huang; Junjie Hu; Xiaojun Chang; Alexander G. Hauptmann

doi:10.18653/v1/2020.acl-main.731

JOURNAL ARTICLE

Year: 2020 Pages: 8226-8237

Abstract

Unsupervised machine translation (MT) has recently achieved impressive results using only monolingual corpora. However, associating source and target sentences in the latent space remains challenging. Because people who speak different languages share biologically similar visual systems, visual content offers a promising yet under-explored route to better alignment in unsupervised multimodal MT (MMT). In this paper, we investigate how to use visual content for disambiguation and for promoting latent space alignment in unsupervised MMT. Our model employs multimodal back-translation and features pseudo visual pivoting, in which we learn a shared multilingual visual-semantic embedding space and incorporate visually-pivoted captioning as additional weak supervision. Experimental results on the widely used Multi30K dataset show that the proposed model significantly improves over state-of-the-art methods and generalizes well when images are not available at test time.
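To make the idea of a shared multilingual visual-semantic embedding space more concrete, the sketch below scores how well sentence vectors from two languages align with the same image vectors. It is an illustration only: the abstract does not specify the training objective, so the max-margin ranking loss, the margin value, and the function names `cosine`, `ranking_loss`, and `vse_loss` are all assumptions, and the toy 2-dimensional vectors stand in for learned encoder outputs.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def ranking_loss(images, sentences, margin=0.2):
    """Sum-of-hinges ranking loss (assumed objective, not from the paper).

    images[i] is treated as the image paired with sentences[i]; every
    other item in the batch serves as a negative example.
    """
    loss = 0.0
    for i, img in enumerate(images):
        pos = cosine(img, sentences[i])
        for j in range(len(sentences)):
            if j == i:
                continue
            # Penalize a mismatched sentence scoring close to the true pair.
            loss += max(0.0, margin - pos + cosine(img, sentences[j]))
            # Penalize a mismatched image scoring close to the true pair.
            loss += max(0.0, margin - pos + cosine(images[j], sentences[i]))
    return loss


def vse_loss(images, src_sentences, tgt_sentences, margin=0.2):
    """Tie both languages to the same images, which act as the pivot."""
    return (ranking_loss(images, src_sentences, margin)
            + ranking_loss(images, tgt_sentences, margin))


# Toy batch: two images, with hypothetical sentence vectors per language.
images = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 0.0], [0.0, 1.0]]   # each sentence matches its image
swapped = [[0.0, 1.0], [1.0, 0.0]]   # each sentence matches the wrong image

print(vse_loss(images, aligned, aligned))
print(vse_loss(images, aligned, swapped))
```

Under these assumptions, the loss is zero only when sentences in both languages sit near their paired image, so two sentences describing the same image are pulled toward each other without any parallel text.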

Keywords: Computer science; Machine translation; Closed captioning; Embedding; Artificial intelligence; Translation (biology); Natural language processing; Space (punctuation); Unsupervised learning; Visual space; Machine learning; Image (mathematics)

Metrics

FWCI (Field Weighted Citation Impact): 3.67

Citation Normalized Percentile: 0.94

Topics

Multimodal Machine Learning Applications (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)

Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)

Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)

Related Documents

Visual Pivoting Unsupervised Multimodal Machine Translation in Low-Resource Distant Language Pairs

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

Video Pivoting Unsupervised Multi-Modal Machine Translation

Enabling Unsupervised Neural Machine Translation with Word-level Visual Representations

Bilingual–Visual Consistency for Multimodal Neural Machine Translation