We investigate the integration of a planning mechanism into an encoder-decoder architecture with attention. We develop a model that plans ahead when computing alignments between the source and target sequences: rather than producing an alignment for a single time-step only, it constructs a matrix of proposed alignments for the next k time-steps, together with a commitment vector that governs whether to follow the existing plan or to recompute it. This mechanism is inspired by the strategic attentive reader and writer (STRAW) model, a recent neural architecture for planning with hierarchical reinforcement learning that can also learn higher-level temporal abstractions. Our proposed model is end-to-end trainable with differentiable operations. We show that it outperforms strong baselines on a character-level translation task from WMT'15 with fewer parameters, and that it computes alignments that are qualitatively intuitive.
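To make the mechanism concrete, below is a minimal NumPy sketch of one plan-ahead attention step. The class name `AlignmentPlanner`, the dot-product scorer in `_propose`, and the fixed cyclic commitment gate are illustrative assumptions, not the paper's learned parameterization (which predicts both the alignment plan and the commitment vector with differentiable operations):

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - x.max())
    return e / e.sum()


class AlignmentPlanner:
    """Illustrative plan-ahead attention: holds a k-step alignment plan
    and a commitment gate that decides when to recompute it."""

    def __init__(self, k, src_len):
        self.k = k
        # Alignment plan matrix: row t proposes alignments t steps ahead.
        self.plan = np.full((k, src_len), 1.0 / src_len)
        # Commitment vector: 1 at a step means "replan now". Here it is a
        # fixed k-step cycle; the trained model predicts it instead.
        self.commit = np.zeros(k)
        self.commit[0] = 1.0

    def _propose(self, decoder_state, annotations):
        # Stand-in scorer: dot products between the decoder state and the
        # source annotations, perturbed per row to give k distinct steps.
        scores = annotations @ decoder_state              # (src_len,)
        return np.stack([softmax(scores + 0.1 * t) for t in range(self.k)])

    def step(self, decoder_state, annotations):
        if self.commit[0] > 0.5:
            # Recompute the whole k-step plan from the current state.
            self.plan = self._propose(decoder_state, annotations)
        alignment = self.plan[0].copy()
        # Follow the plan: shift it (and the gate) one step forward.
        self.plan = np.roll(self.plan, -1, axis=0)
        self.commit = np.roll(self.commit, -1)
        return alignment


if __name__ == "__main__":
    k, src_len, dim = 4, 7, 16
    rng = np.random.default_rng(0)
    planner = AlignmentPlanner(k, src_len)
    annotations = rng.standard_normal((src_len, dim))
    for t in range(6):
        state = rng.standard_normal(dim)
        weights = planner.step(state, annotations)       # attention over source
        print(t, weights.round(3))
```

In this toy version the planner replans every k steps and otherwise follows the shifted plan; the actual model learns when to commit, which is what allows it to amortize alignment computation over several future time-steps.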