Abstract

Sequential pattern mining is a very significant data mining project. In this area, most of the previous studies require us to provide a support threshold to accomplish the mining. However, in reality providing an appropriate threshold is very difficult if we did not acquaint with some background information relevant to the data. In addition, there exist many useless sequential patterns when the least support is too low. An alternative task is proposed to solve the above problems: mining top-k frequent closed sequences with the least length constraint, that is, mining k most frequent closed sequences whose length are equal or more than min_len. However, most of the previous algorithms are based on the framework of candidate and generation, thus leading too much space usage and running time. To this end, in this paper, we propose a very efficient algorithm named BI-TSP(Mining top-k closed sequential patterns with BI-Directional checking scheme) without candidate and generation for mining top-k frequent closed sequences with the least length. Specifically, we adopt BI-Directional Extension for frequent closed sequential patterns enumeration. Based on BI-Directional Extension, we can directly use the closure checking scheme and effectively raise the minimum support threshold without candidate maintenance. In addition, we also propose two novel pruning strategies by exploiting the properties of minimum length constraint. Our extensive performance test with synthetic and real datasets demonstrates that BI-TSP outperforms the baselines in both memory and running time.

Keywords:
Pruning Computer science Extension (predicate logic) Constraint (computer-aided design) Data mining Sequential Pattern Mining Enumeration Sequence (biology) Closure (psychology) Algorithm Scheme (mathematics) Task (project management) Mathematics

Metrics

2
Cited By
0.81
FWCI (Field Weighted Citation Impact)
18
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

TSP: Mining top-k closed sequential patterns

TzvetkovPetreYanXifengHanJiawei

Journal:   Knowledge and Information Systems Year: 2005
JOURNAL ARTICLE

TSP: Mining top-k closed sequential patterns

P. TzvetkovXifeng YanJiawei Han

Journal:   Knowledge and Information Systems Year: 2004 Vol: 7 (4)Pages: 438-457
JOURNAL ARTICLE

Mining Top-k Closed Sequential Patterns in Sequential Databases

K. Sohini

Journal:   IOSR Journal of Computer Engineering Year: 2013 Vol: 15 (4)Pages: 20-23
© 2026 ScienceGate Book Chapters — All rights reserved.