JOURNAL ARTICLE

Exploiting and Evaluating MapReduce for Large-Scale Graph Mining

Abstract

Graph mining is a popular technique for discovering the hidden structures or important instances in a graph, but the computational efficiency is usually a cause for concern when dealing with large-scale graphs containing billions of entities. Cloud computing is widely regarded as a feasible solution to the problem. In this work, we present an open source graph mining library called the MapReduce Graph Mining Framework (MGMF) to be a robust and efficient MapReduce-based graph mining tool. We start from dividing graph mining algorithms into four categories and designing a MapReduce framework for algorithms in each category. The experimental results show that MGMF is 3 to 20 times more efficient than PEGASUS, a state-of-the-art library for graph mining on MapReduce. Moreover, it provides better coverage of different graph mining algorithms. We also validate our framework on billion-scaled networks to demonstrate that it is scalable to the number of machines. Fur-thermore, we test and compare the feasibility between single ma-chine and the cloud computing technique. The effects of different file input formats for MapReduce are investigated as well. Our implemented open-source library can be downloaded from http://mslab.csie.ntu.edu.tw/~noahsark/MGMF/.

Keywords:
Computer science Scalability Cloud computing Graph database Graph Theoretical computer science Data mining Database Operating system

Metrics

5
Cited By
0.55
FWCI (Field Weighted Citation Impact)
23
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Graph Theory and Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Large Scale Graph Mining with MapReduce

Charalampos E. Tsourakakis

Advances in data mining and database management book series Year: 2011 Pages: 299-314
BOOK-CHAPTER

Large Scale Graph Mining with MapReduce

Charalampos E. Tsourakakis

IGI Global eBooks Year: 2011 Pages: 66-78
JOURNAL ARTICLE

MapReduce in MPI for Large-scale graph algorithms

Steven J. PlimptonKaren Devine

Journal:   Parallel Computing Year: 2011 Vol: 37 (9)Pages: 610-632
DISSERTATION

Large-scale data mining analytics based on MapReduce

Sunny Ranjan

University:   OPUS Publication Server of the University of Stuttgart (University of Stuttgart) Year: 2014
© 2026 ScienceGate Book Chapters — All rights reserved.