JOURNAL ARTICLE

GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification

Mengting ZhouZhiguo Gong

Year: 2023 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 37 (4)Pages: 4954-4962   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Graph neural networks (GNNs) have achieved great success in node classification tasks. However, existing GNNs naturally bias towards the majority classes with more labelled data and ignore those minority classes with relatively few labelled ones. The traditional techniques often resort over-sampling methods, but they may cause overfitting problem. More recently, some works propose to synthesize additional nodes for minority classes from the labelled nodes, however, there is no any guarantee if those generated nodes really stand for the the corresponding minority classes. In fact, improperly synthesized nodes may result in insufficient generalization of the algorithm. To resolve the problem, in this paper we seek to automatically augment the minority classes from the massive unlabelled nodes of the graph. Specifically, we propose \textit{GraphSR}, a novel self-training strategy to augment the minority classes with significant diversity of unlabelled nodes, which is based on a Similarity-based selection module and a Reinforcement Learning(RL) selection module. The first module finds a subset of unlabelled nodes which are most similar to those labelled minority nodes, and the second one further determines the representative and reliable nodes from the subset via RL technique. Furthermore, the RL-based module can adaptively determine the sampling scale according to current training data. This strategy is general and can be easily combined with different GNNs models. Our experiments demonstrate the proposed approach outperforms the state-of-the-art baselines on various class-imbalanced datasets.

Keywords:
Overfitting Computer science Generalization Artificial intelligence Machine learning Selection (genetic algorithm) Graph Node (physics) Class (philosophy) Similarity (geometry) Algorithm Artificial neural network Theoretical computer science Mathematics

Metrics

27
Cited By
3.89
FWCI (Field Weighted Citation Impact)
52
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence
Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
Online Learning and Analytics
Physical Sciences →  Computer Science →  Computer Science Applications

Related Documents

JOURNAL ARTICLE

Data Augmentation Classifier for Imbalanced Fault Classification

Xiaoyu JiangZhiqiang Ge

Journal:   IEEE Transactions on Automation Science and Engineering Year: 2020 Vol: 18 (3)Pages: 1206-1217
JOURNAL ARTICLE

Graph Data Augmentation for Node Classification

Ziyu WeiXi XiaoBin ZhangGuangwu HuQing LiShu‐Tao Xia

Journal:   2022 26th International Conference on Pattern Recognition (ICPR) Year: 2022 Vol: 80 Pages: 4899-4905
© 2026 ScienceGate Book Chapters — All rights reserved.