In this paper we show that a simple beam approximation of the joint distribution between attention and output is an easy, accurate, and efficient attention mechanism for sequence-to-sequence learning. The method combines the sharp focus of hard attention with the implementation ease of soft attention. On five translation tasks we show effortless and consistent gains in BLEU compared to existing attention mechanisms.