JOURNAL ARTICLE

SumPA: Efficient Pattern-Centric Graph Mining with Pattern Abstraction

Abstract

Graph mining aims to explore interesting structural information of a graph. Pattern-centric systems typically transform a generic-purpose graph mining problem into a series of subgraph matching problems for high performance. Existing pattern-centric mining systems reduce the substantial search space towards a single pattern by exploring a highly-optimized matching order, but inherent computational redundancies of such a matching order itself still suffer severely, leading to significant performance degradation. The key innovation of this work lies in a general redundancy criterion that characterizes computational redundancies arising in not only handing a single pattern but also matching multiple patterns simultaneously. In this paper, we present SumPA, a high-performance pattern-centric graph mining system that can sufficiently remove redundant computations for any complex graph mining problems. SumPA features three key designs: (1) a pattern abstraction technique that can simplify numerous complex patterns into a few simple abstract patterns based on pattern similarity, (2) abstraction-guided pattern matching that completely eliminates (totally and partially) redundant computations during subgraph enumeration, and (3) a suite of system optimizations to maximize storage and computation efficiency. Our evaluation on a wide variety of real-world graphs shows that SumPA outperforms the two state-of-the-art systems Peregrine and GraphPi by up to 61.89× and 8.94×, respectively. For many mining problems on large graphs, Peregrine takes hours or even days while SumPA finishes in only a few minutes.

Keywords:
Computer science Pattern matching Computation Redundancy (engineering) Graph Theoretical computer science Abstraction Matching (statistics) Data mining Algorithm Artificial intelligence Mathematics

Metrics

13
Cited By
0.92
FWCI (Field Weighted Citation Impact)
53
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Graph Theory and Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems

Related Documents

BOOK-CHAPTER

Pattern-Based Graph Abstraction

Arend RensinkEduardo Zambon

Lecture notes in computer science Year: 2012 Pages: 66-80
BOOK-CHAPTER

Graph abstraction for closed pattern mining in attributed networks

Henry SoldanoGuillaume Santini

Frontiers in artificial intelligence and applications Year: 2014 Pages: 849-854
BOOK-CHAPTER

Graph Pattern Mining

Year: 2014 Pages: 660-660
BOOK-CHAPTER

Graph Pattern Mining

Mohammed J. ZakiWagner Meira

Cambridge University Press eBooks Year: 2020 Pages: 282-302
BOOK-CHAPTER

Graph Pattern Mining

Mohammed J. ZakiWagner Meira, Jr

Cambridge University Press eBooks Year: 2018 Pages: 280-300
© 2026 ScienceGate Book Chapters — All rights reserved.