For mining new pattern from evolving data streams, most algorithms are inherited from DenStream framework which is realized via a sliding window. So at the early stage of a pattern emerges, its knowledge points can be easily mistaken as outliers and dropped. In most cases, these points can be ignored, but in some special applications which need to quickly and precisely master the emergence rule of some patterns, these points must play their rules. Based on DenStream, this paper proposes a three-step clustering algorithm, rDenStream, which presents the concept of outlier retrospect. In rDenStream clustering, dropped micro-clusters are stored on outside memory temporarily, and will be given new chance to attend clustering to improve the clustering accuracy. Experiments modeled the arrival of data stream in Poisson process, and the results over standard data set showed its advantage over other methods in the early phase of new pattern discovery.
Lixiong LiuYunfei GuoKang JingHai Huang
Mohammed Oualid AttaouiMustapha LebbahNabil KeskesHanene AzzagMohammed Ghesmoune
Renxia WanJingchao ChenLixin WangXiaoke Su
Chunhua YangCong WangXiao HuNiankang YouXuguang Yang