JOURNAL ARTICLE

A data challenge for Vietnamese abstractive multi-document summarization

Minh Tran, Hoang Quynh Le, Duy Cat Can, Quoc Phong Nguyen

Year: 2024 | Journal: Journal of Computer Science and Cybernetics | Vol: 40 (4) | Pages: 347-362

Abstract

This paper provides an overview of the Vietnamese abstractive multi-document summarization shared task (AbMuSu) for Vietnamese news, hosted at the 9th annual workshop on Vietnamese Language and Speech Processing (VLSP 2022). The goal of the shared task is to develop automated summarization systems that generate an abstractive summary for a given set of documents on a specific topic: the input consists of several news documents on the same topic, and the output is a single abstractive summary of them. The AbMuSu shared task focuses solely on Vietnamese news summarization. To this end, a human-annotated dataset of 1,839 documents in 600 clusters, collected from Vietnamese news in 8 categories, has been developed. Participating models are evaluated and ranked by their ROUGE-2 F1 score, which is the most common evaluation metric for document summarization problems.
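The ranking metric named in the abstract can be illustrated with a minimal sketch. It assumes lowercased, whitespace-split tokens and a single reference summary; the shared task's official scorer may tokenize Vietnamese text differently (for example, with word segmentation), so this shows the general ROUGE-2 F1 computation and not the organizers' evaluation script. The function names `bigrams` and `rouge2_f1` are hypothetical.

```python
from collections import Counter


def bigrams(tokens):
    """Return a multiset (Counter) of adjacent token pairs."""
    return Counter(zip(tokens, tokens[1:]))


def rouge2_f1(candidate: str, reference: str) -> float:
    """ROUGE-2 F1 between one candidate and one reference summary.

    Assumption: tokens are lowercased and split on whitespace.
    """
    cand = bigrams(candidate.lower().split())
    ref = bigrams(reference.lower().split())

    # Clipped overlap: each bigram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand.values())  # share of candidate bigrams matched
    recall = overlap / sum(ref.values())      # share of reference bigrams matched
    return 2 * precision * recall / (precision + recall)
```

A system-level score would typically be obtained by averaging this value over all clusters in the test set.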

Keywords:
Automatic summarization, Vietnamese, Computer science, Natural language processing, Artificial intelligence, Multi-document summarization, Information retrieval, Linguistics, Philosophy

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 18
Citation Normalized Percentile: 0.29

Topics

Topic Modeling: Physical Sciences → Computer Science → Artificial Intelligence
Natural Language Processing Techniques: Physical Sciences → Computer Science → Artificial Intelligence
Advanced Text Analysis Techniques: Physical Sciences → Computer Science → Artificial Intelligence
