Document-Level Neural Machine Translation with Hierarchical Attention Networks

Lesly Miculicich; Dhananjay Ram; Nikolaos Pappas; James Henderson

doi:10.18653/v1/d18-1325

JOURNAL ARTICLE

Year: 2018

Abstract

Neural Machine Translation (NMT) can be improved by including document-level contextual information. For this purpose, we propose a hierarchical attention model to capture the context in a structured and dynamic manner. The model is integrated in the original NMT architecture as another level of abstraction, conditioning on the NMT model's own previous hidden states. Experiments show that hierarchical attention significantly improves the BLEU score over a strong NMT baseline with the state-of-the-art in context-aware methods, and that both the encoder and decoder benefit from context in complementary ways.
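The two-level structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes single-head scaled dot-product attention with no learned projections, and the names `attend` and `hierarchical_context` are hypothetical. A query (standing in for the current hidden state) first attends over the hidden states of each previous sentence, and then attends over the resulting per-sentence summaries.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(query, states):
    """Scaled dot-product attention of one query over a list of vectors.

    Returns the weighted sum of `states` and the attention weights.
    """
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, s) / scale for s in states])
    dim = len(states[0])
    summary = [sum(w * s[i] for w, s in zip(weights, states)) for i in range(dim)]
    return summary, weights


def hierarchical_context(query, prev_sentences):
    """Hypothetical helper: two-level attention over previous sentences.

    prev_sentences is a list of sentences, each a list of hidden-state vectors
    (assumed here to be saved from earlier translation steps).
    """
    # Word level: summarize each previous sentence with respect to the query.
    sentence_summaries = [attend(query, states)[0] for states in prev_sentences]
    # Sentence level: attend over the per-sentence summaries.
    context, sentence_weights = attend(query, sentence_summaries)
    return context, sentence_weights


# Toy usage with two previous sentences of 2-dimensional hidden states.
prev = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 1.0], [0.0, 0.0]],
]
context, weights = hierarchical_context([1.0, 0.0], prev)
```

In this sketch the returned `context` vector would then be combined with the current hidden state; how that combination is done is left out, since the abstract does not specify it.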

Keywords:

Machine translation; Computer science; Abstraction; Context (archaeology); Artificial intelligence; Encoder; Translation (biology); Baseline (sea); Context model; Hierarchical database model; Natural language processing; Artificial neural network; Machine learning; Data mining

Metrics

Cited By: 275

FWCI (Field Weighted Citation Impact): 38.12

Citation Normalized Percentile: 1.00

Is in top 1%

Is in top 10%

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Document-Level Neural Machine Translation with Hierarchical Attention Networks

Document-Level Machine Translation with Hierarchical Attention

Sparse Hierarchical Modeling of Deep Contextual Attention for Document-Level Neural Machine Translation

Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context

Document-Level Neural Machine Translation With Document Embeddings