JOURNAL ARTICLE

Surprisingly Easy Hard-Attention for Sequence to Sequence Learning

Abstract

In this paper we show that a simple beam approximation of the joint distribution between attention and output is an easy, accurate, and efficient attention mechanism for sequence to sequence learning. The method combines the advantage of sharp focus in hard attention and the implementation ease of soft attention. On five translation tasks we show effortless and consistent gains in BLEU compared to existing attention mechanisms.

Keywords:
Computer science Sequence (biology) Focus (optics) Sequence learning Artificial intelligence Machine translation Simple (philosophy) Translation (biology)

Metrics

41
Cited By
6.16
FWCI (Field Weighted Citation Impact)
32
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Sequence learning is surprisingly fragile in visual search

Yi Ni TohRoger W. RemingtonVanessa G. Lee

Journal:   Journal of Vision Year: 2021 Vol: 21 (9)Pages: 2386-2386
JOURNAL ARTICLE

Sequence learning is surprisingly fragile in visual search.

Yi Ni TohRoger W. RemingtonVanessa G. Lee

Journal:   Journal of Experimental Psychology Human Perception & Performance Year: 2021 Vol: 47 (10)Pages: 1378-1394
JOURNAL ARTICLE

Attention and probabilistic sequence learning

Roger W. SchvaneveldtRebecca L. Gómez

Journal:   Psychological Research Year: 1998 Vol: 61 (3)Pages: 175-190
JOURNAL ARTICLE

Attention and structure in sequence learning.

Asher CohenRichard IvrySteven W. Keele

Journal:   Journal of Experimental Psychology Learning Memory and Cognition Year: 1990 Vol: 16 (1)Pages: 17-30
© 2026 ScienceGate Book Chapters — All rights reserved.