JOURNAL ARTICLE

Multi-label Feature Selection for Graph Classification

Abstract

Nowadays, the classification of graph data has become an important and active research topic in the last decade, which has a wide variety of real world applications, e.g. drug activity predictions and kinase inhibitor discovery. Current research on graph classification focuses on single-label settings. However, in many applications, each graph data can be assigned with a set of multiple labels simultaneously. Extracting good features using multiple labels of the graphs becomes an important step before graph classification. In this paper, we study the problem of multi-label feature selection for graph classification and propose a novel solution, called gMLC, to efficiently search for optimal sub graph features for graph objects with multiple labels. Different from existing feature selection methods in vector spaces which assume the feature set is given, we perform multi-label feature selection for graph data in a progressive way together with the sub graph feature mining process. We derive an evaluation criterion, named gHSIC, to estimate the dependence between sub graph features and multiple labels of graphs. Then a branch-and-bound algorithm is proposed to efficiently search for optimal sub graph features by judiciously pruning the sub graph search space using multiple labels. Empirical studies on real-world tasks demonstrate that our feature selection approach can effectively boost multi-label graph classification performances and is more efficient by pruning the sub graph search space using multiple labels.

Keywords:
Feature selection Computer science Graph Feature vector Pattern recognition (psychology) Artificial intelligence Minimum redundancy feature selection Data mining Theoretical computer science

Metrics

34
Cited By
2.81
FWCI (Field Weighted Citation Impact)
32
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning in Bioinformatics
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

gMLC: a multi-label feature selection framework for graph classification

Xiangnan KongPhilip S. Yu

Journal:   Knowledge and Information Systems Year: 2011 Vol: 31 (2)Pages: 281-305
BOOK-CHAPTER

Feature Selection Algorithm for Multi-label Classification Based on Graph Operations

Qianyao TangFuyi WeiZhihong LiuHang ZhangYing GuoPeiwei SuDongxin Li

Lecture notes on data engineering and communications technologies Year: 2024 Pages: 335-342
BOOK-CHAPTER

Graph-Margin Based Multi-label Feature Selection

Peng YanYun Li

Lecture notes in computer science Year: 2016 Pages: 540-555
BOOK-CHAPTER

Feature Selection for Hierarchical Multi-label Classification

Luan V. M. da SilvaRicardo Cerri

Lecture notes in computer science Year: 2021 Pages: 196-208
BOOK-CHAPTER

Feature Selection for Multi-label Classification Problems

Gauthier DoquireMichel Verleysen

Lecture notes in computer science Year: 2011 Pages: 9-16
© 2026 ScienceGate Book Chapters — All rights reserved.