This paper proposes a novel approach to multi-document summarization based on subtopic segmentation. It first detects the subtopics within a topic and then finds the central sentence of each subtopic. Sentences are scored by their importance both in the document and in the subtopic. Two anti-redundancy strategies are used when extracting sentences to form the summary. Because our approach is intrinsically incremental, it remains effective when new documents are added to the document set. Experimental results indicate that the proposed approach is effective and efficient.
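The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`bag`, `cosine`, `add_sentence`, `summarize`), the bag-of-words representation, and the threshold values are all assumptions made for illustration. The abstract does not specify the scoring formula or the two anti-redundancy strategies, so the sketch substitutes two simple stand-ins: taking at most one central sentence per subtopic, and skipping any candidate that is too similar to a sentence already selected.

```python
import math
import re
from collections import Counter


def bag(text):
    """Lowercased bag-of-words vector for one sentence (assumed representation)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two Counter vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def add_sentence(subtopics, sentence, threshold=0.3):
    """Incrementally place a sentence in its most similar subtopic, or open a new one."""
    vec = bag(sentence)
    best = max(subtopics, key=lambda s: cosine(s["centroid"], vec), default=None)
    if best is not None and cosine(best["centroid"], vec) >= threshold:
        best["sentences"].append(sentence)
        best["centroid"].update(vec)  # centroid is the summed word counts
    else:
        subtopics.append({"sentences": [sentence], "centroid": Counter(vec)})


def summarize(sentences, max_sentences=2, redundancy=0.5):
    """Pick central sentences from the largest subtopics, skipping near-duplicates."""
    subtopics = []
    for s in sentences:
        add_sentence(subtopics, s)
    # Larger subtopics are treated as more important (an illustrative choice).
    subtopics.sort(key=lambda s: len(s["sentences"]), reverse=True)
    summary = []
    for topic in subtopics:
        # Central sentence: the one closest to the subtopic centroid.
        central = max(topic["sentences"], key=lambda s: cosine(bag(s), topic["centroid"]))
        # Stand-in redundancy check: reject candidates too similar to chosen ones.
        if all(cosine(bag(central), bag(chosen)) < redundancy for chosen in summary):
            summary.append(central)
        if len(summary) == max_sentences:
            break
    return summary
```

Because `add_sentence` only updates the subtopic it touches, sentences from a newly arrived document can be passed to it one at a time without re-clustering the existing set, which is one way to read the incremental property claimed above.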
Xin Zheng, Aixin Sun, Jing Li, Karthik Muthuswamy
Shu Gong, Youli Qu, Shengfeng Tian
Rafael Ribaldo, Paula Christina Figueira Cardoso, Thiago Alexandre Salgueiro Pardo