Maximal Frequent Item Sequences Mining

Li Zhou; Zhang Zhang

doi:10.4028/www.scientific.net/amr.108-111.1211

ScienceGate Book Chapters

JOURNAL ARTICLE

Maximal Frequent Item Sequences Mining

Li Zhou Zhang Zhang

Year: 2010 Journal: Advanced materials research Vol: 108-111 Pages: 1211-1216 Publisher: Trans Tech Publications

DOI: 10.4028/www.scientific.net/amr.108-111.1211

Get Full-Text PDF Get Analytical Report

Abstract

This work proposes a new fast algorithm finding maximal frequent item sequences from transaction database. Itemset is defined as item sequence (IS) for mining. Two lists called ISL (Item Sequence List) and FISL (Frequent Item Sequence List) are created by scanning database once for dividing n-IS into two categories depending on whether the IS to achieve minimum support number (n is the number of attributes). Sub item sequences (SIS) whose n-superset is in ISL are generated by recursion to make sure that each k-SIS appeared before its (k+1)-superset. As current k-SIS being joined to FISL, its (k-1)-SIS are pruned (k range from 2 to n-1). At last, all SISs whose n-superset is in FISL are pruned from FISL. We compare our new algorithm and FP-Growth by experiment to prove its superiority.

Keywords:

Database transaction Sequence (biology) Computer science Recursion (computer science) Range (aeronautics) Combinatorics Data mining Mathematics Algorithm Database Engineering

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.12

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Rough Sets and Fuzzy Logic

Physical Sciences → Computer Science → Computational Theory and Mathematics

Data Management and Algorithms

Physical Sciences → Computer Science → Signal Processing

Maximal Frequent Item Sequences Mining

Abstract

Metrics

Topics

Related Documents

Mining Maximal Frequent Item Sets

Maximal Frequent Item Sequences Mining of Datasets with few Attributes and Large Instances

Mining Maximal Frequent Contiguous Sequences in Biological Data Sequences

Mining Maximal Frequent Contiguous Sequences in Biological Data Sequences

An Efficient Algorithm for Mining Maximal Frequent Item Sets