JOURNAL ARTICLE

Document-Specific Keyphrase Extraction Using Sequential Patterns with Wildcards

Abstract

Finding good keyphrases for a document is beneficial for many applications, such as text summarization, browsing, and indexing. In this paper, we propose a sequential pattern mining based document-specific keyphrase extraction method. Our key innovation is to use wildcards (or gap constraints) to help extract sequential patterns, where the flexible wildcard constraints within a pattern can capture semantic relationships between words. To achieve this goal, we regard each single document as a sequential dataset, and propose an efficient algorithm to mine sequential patterns with wildcard and one-off conditions that allows important keyphrases to be captured during the mining process. For each extracted keyphrase candidate, we use some statistical pattern features to characterize it. A supervised learning classifier is trained to identify keyphrases from a test document. Comparisons on keyphrase benchmark datasets confirm that our document-specific keyphrase extraction method is effective in improving the quality of extracted keyphrases.

Keywords:
Computer science Automatic summarization Artificial intelligence Search engine indexing Classifier (UML) Benchmark (surveying) Merge (version control) Information retrieval Natural language processing

Metrics

17
Cited By
4.35
FWCI (Field Weighted Citation Impact)
27
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Efficient sequential pattern mining with wildcards for keyphrase extraction

Fei XieXindong WuXingquan Zhu

Journal:   Knowledge-Based Systems Year: 2016 Vol: 115 Pages: 27-39
JOURNAL ARTICLE

Single-Document Keyphrase Extraction for Multi-Document Keyphrase Extraction

Gábor BerendRichárd Farkas

Journal:   Computación y Sistemas Year: 2013 Vol: 17 (2)Pages: 179-186
JOURNAL ARTICLE

Document Specific Supervised Keyphrase Extraction With Strong Semantic Relations

Huiting LiuLili WangPeng ZhaoXindong Wu

Journal:   IEEE Access Year: 2019 Vol: 7 Pages: 167507-167520
JOURNAL ARTICLE

MAIL: mining sequential patterns with wildcards

Fei XieXindong WuXuegang HuJun GaoDan GuoYulian FeiErtian Hua

Journal:   International Journal of Data Mining and Bioinformatics Year: 2013 Vol: 8 (1)Pages: 1-1
JOURNAL ARTICLE

Keyphrase Extraction with Sequential Pattern Mining

Qingren WangVictor S. ShengXindong Wu

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2017 Vol: 31 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.