Document-Level Machine Translation with Hierarchical Attention

Yu-Tang Shen

doi:10.31979/etd.k2wk-3tf8

ScienceGate Book Chapters

DISSERTATION

Document-Level Machine Translation with Hierarchical Attention

Yu-Tang Shen

Year: 2023

DOI: 10.31979/etd.k2wk-3tf8

Get Full-Text PDF Get Analytical Report

Abstract

Machine translation (MT) aims to translate texts with minimal human involvement, and the utilization of machine learning methods is pivotal to its success. Sentence-level and paragraph-level translations were well-explored in the past decade, such as the Transformer and its variations, but less research was done on the document level. From reading a piece of news in a different language to trying to understand foreign research, document-level translation can be helpful.\nThis project utilizes a hierarchical attention (HAN) mechanism to abstract context information making document-level translation possible. It further utilizes the Big Bird attention mask in the hope of reducing memory usage. The results from the experiments showed that the HAN models produced readable translations and had an average BLEU score of 0.75 (0.67 for full attention HAN, and 0.82 for Big Bird attention), whereas the Transformer model failed to comprehend the large input and had a score of 0.22 on the same dataset.

Keywords:

Paragraph Machine translation Computer science Natural language processing Sentence Transformer Artificial intelligence Reading (process) Example-based machine translation Linguistics Foreign language World Wide Web Engineering

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Document-Level Machine Translation with Hierarchical Attention

Abstract

Metrics

Topics

Related Documents

Document-Level Neural Machine Translation with Hierarchical Attention Networks

Document-Level Neural Machine Translation with Hierarchical Attention Networks

Sparse Hierarchical Modeling of Deep Contextual Attention for Document-Level Neural Machine Translation

Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context

Document-Level Machine Translation