JOURNAL ARTICLE

Document-Level Neural Machine Translation with Hierarchical Attention Networks

Abstract

Neural Machine Translation (NMT) can be improved by including document-level contextual information. For this purpose, we propose a hierarchical attention model to capture the context in a structured and dynamic manner. The model is integrated in the original NMT architecture as another level of abstraction, conditioning on the NMT model's own previous hidden states. Experiments show that hierarchical attention significantly improves the BLEU score over a strong NMT baseline with the state-of-the-art in context-aware methods, and that both the encoder and decoder benefit from context in complementary ways.

Keywords:
Machine translation Computer science Abstraction Context (archaeology) Artificial intelligence Encoder Translation (biology) Baseline (sea) Context model Hierarchical database model Natural language processing Artificial neural network Machine learning Data mining

Metrics

275
Cited By
38.12
FWCI (Field Weighted Citation Impact)
42
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.