JOURNAL ARTICLE

Pre-training of Heterogeneous Graph Neural Networks for Multi-label Document Classification

WU Jiawei, FANG Quan, HU Jun, QIAN Shengsheng

Year: 2024 Journal:   DOAJ (DOAJ: Directory of Open Access Journals)

Abstract

Multi-label document classification aims to associate document instances with relevant labels,which has received increasing research attention in recent years.Existing multi-label document classification methods attempt to explore the fusion of information beyond the text,such as document metadata or label structure.However,these methods either simply use the semantic information of metadata or do not consider the long-tail distribution of labels,thereby ignoring higher-order relationships between documents and their metadata and the distribution pattern of labels,which affects the accuracy of multi-label document classification.Therefore,this paper proposes a new multi-label document classification method based on the pre-training of hete-rogeneous graph neural networks.The method constructs a heterogeneous graph based on documents and their metadata,adopts two contrastive pre-training methods to capture the relationship between documents and their metadata,and improves the accuracy of multi-label document classification by balancing the problem of long-tail distribution of labels through a loss function.Experimental results on the benchmark dataset show that the proposed method outperforms Transformer BertXML and MATCH by 8%,4.75%,1.3%,respectively.

Keywords:
Metadata Document classification Graph Document clustering Artificial neural network Profiling (computer programming) Benchmark (surveying) Document retrieval

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Cervical Cancer and HPV Research
Health Sciences →  Medicine →  Epidemiology
Endometrial and Cervical Cancer Treatments
Health Sciences →  Medicine →  Obstetrics and Gynecology
Global Cancer Incidence and Screening
Health Sciences →  Medicine →  Oncology

Related Documents

JOURNAL ARTICLE

PHGNN: Pre-Training Heterogeneous Graph Neural Networks

Xin LiHao WeiYu Ding

Journal:   IEEE Access Year: 2024 Vol: 12 Pages: 135411-135418
BOOK-CHAPTER

Label-Wise Document Pre-training for Multi-label Text Classification

Han LiuCaixia YuanXiaojie Wang

Lecture notes in computer science Year: 2020 Pages: 641-653
BOOK-CHAPTER

Deep Neural Networks for Czech Multi-label Document Classification

Ladislav LencPavel Král

Lecture notes in computer science Year: 2018 Pages: 460-471
BOOK-CHAPTER

Combination of Neural Networks for Multi-label Document Classification

Ladislav LencPavel Král

Lecture notes in computer science Year: 2017 Pages: 278-282
© 2026 ScienceGate Book Chapters — All rights reserved.