JOURNAL ARTICLE

Exploiting User-Generated Content to Enrich Web Document Summarization

Minh-Tien Nguyen, Vu Tran, Chien-Xuan Tran, Le-Minh Nguyen

Year: 2017   Journal: International Journal of Artificial Intelligence Tools   Vol: 26 (05)   Pages: 1760017-1760017   Publisher: World Scientific

Abstract

User-generated content such as comments or tweets (also called social information) following a Web document provides additional information that enriches the content of an event mentioned in its sentences. This paper presents a framework named SoSVMRank, which integrates the user-generated content of a Web document to generate a high-quality summary. To do so, summarization is formulated as a learning-to-rank task in which comments or tweets are exploited to support sentences in a mutual reinforcement fashion. To model the sentence-comment (or sentence-tweet) relation, a set of local and social features is proposed. After ranking, the top m ranked sentences and comments (or tweets) are selected as the summary. To validate the efficiency of our framework, sentence and story highlight extraction tasks were taken as case studies on three datasets in two languages, English and Vietnamese. Experimental results indicate that (i) our new features improve the summarization performance of the framework in terms of ROUGE scores compared to state-of-the-art baselines and (ii) the integration of user-generated content benefits single-document summarization.

Keywords:
Automatic summarization; Computer science; Multi-document summarization; Ranking (information retrieval); Information retrieval; Sentence; Task (project management); Set (abstract data type); Social media; Rank (graph theory); User-generated content; Natural language processing; Web page; Artificial intelligence; World Wide Web

Metrics

Cited By: 6
FWCI (Field Weighted Citation Impact): 0.92
Refs: 9
Citation Normalized Percentile: 0.80

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence