JOURNAL ARTICLE

Data Augmentation with Large Language Models for Vietnamese Abstractive Text Summarization

Abstract

Text summarization plays a crucial role in managing the overwhelming volume of information available today. This task aims to condense large amounts of information into summaries. However, the lack of large-scale annotated data in certain languages, such as Vietnamese, poses a substantial challenge for developing effective summarization models. With the recent advancements in large language models, such as GPT-3.5, there is an opportunity to leverage these models to augment data for improving the performance of deep learning models in Vietnamese text summarization. In this paper, we propose an automatic approach that utilizes a large language model to generate additional training examples and to enhance the summarization process for Vietnamese texts.

Keywords:
Automatic summarization Vietnamese Computer science Leverage (statistics) Natural language processing Artificial intelligence Language model Task (project management) Process (computing) Multi-document summarization Information retrieval Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
32
Refs
0.14
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.