Multi-document summarization based on lexical chains

Yanmin Chen; Xiaolong Wang; Bingquan Liu

doi:10.1109/icmlc.2005.1527262

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-document summarization based on lexical chains

Yanmin Chen Xiaolong Wang Bingquan Liu

Year: 2005 Pages: 1937-1942 Vol. 3

DOI: 10.1109/icmlc.2005.1527262

Get Full-Text PDF Get Analytical Report

Abstract

This paper for the first time investigates using lexical chains as a model of multiple documents written in Chinese to generate an indicative, moderately fluent summary. The algorithm which computes lexical chains based on the HowNet knowledge database is modified to improve the performance and suit Chinese summarization. Based on an analysis of semanteme, the algorithm can remove redundant similarities and remain differences in information content among multiple documents. The method pre-processes the text first, then constructs lexical chains and identifies strong chains. Then significant sentences are extracted from each document and are ordered, and redundant information are recognized and removed. Finally, the summary is generated in chronological order, and the anaphora resolution technology is applied to improve the fluency of the summary. Evaluation results show that the performance of the presented system is obviously better than that of the baseline system, and lexical chains are effective for multidocument summarization.

Keywords:

Automatic summarization Computer science Natural language processing Information retrieval Artificial intelligence Fluency Multi-document summarization Resolution (logic) Linguistics

Metrics

Cited By

1.15

FWCI (Field Weighted Citation Impact)

Refs

0.83

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Multi-document summarization based on lexical chains

Abstract

Metrics

Citation History

Topics

Related Documents

Extractive Summarization of a Document Using Lexical Chains

Research on Multi-document Summarization Using Lexical Cohesion

Automatic Text Summarization Based on Lexical Chains

Novel Algorithm for Multi-document Summarization using Lexical Concept

Applying Lexical-Conceptual Knowledge for Multilingual Multi-document Summarization