JOURNAL ARTICLE

Hierarchical Multi-Granularity Interaction Graph Convolutional Network for Long Document Classification

Tengfei LiuYongli HuJunbin GaoYanfeng SunBaocai Yin

Year: 2024 Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Vol: 32 Pages: 1762-1775   Publisher: Institute of Electrical and Electronics Engineers

Abstract

With the growing demand for text analytics, long document classification (LDC) has received extensive attention, and great progress has been made. To reveal the complex structure and extract the intrinsic feature, the current approaches focus on modeling a long sequence with sparse attention or representing word-sentence or word-section relations partially. However, the thorough hierarchical structure from words, sentences to sections of long documents remains relatively unexplored. For this purpose, we propose a novel Hierarchical Multi-granularity Interaction Graph Convolutional Network (HMIGCN) for long document classification, in which three different granularity graphs, i.e., section graph, sentence graph and word graph, are constructed hierarchically. The section graph encapsulates the macrostructure of a long document, while the sentence and word graphs delve into the document's microstructure. Notably, within the sentence graph, we introduce a Global-Local Graph Convolutional (GLGC) block to adaptively capture both global and local dependency structures among sentence nodes. Additionally, to integrate the three graph networks as a whole, two well-designed techniques, namely section-guided pooling block and transfer fusion block, are proposed to train the model jointly by promoting each other. Extensive experiments on five long document datasets show that our model outperforms the existing state-of-the-art LDC models.

Keywords:
Computer science Sentence Granularity Pooling Graph Artificial intelligence Natural language processing Theoretical computer science

Metrics

1
Cited By
0.64
FWCI (Field Weighted Citation Impact)
52
Refs
0.63
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Hierarchical Multiple Granularity Attention Network for Long Document Classification

Yongli HuWen Xiang DingTengfei LiuJunbin GaoYanfeng SunBaocai Yin

Journal:   2022 International Joint Conference on Neural Networks (IJCNN) Year: 2022 Pages: 1-7
JOURNAL ARTICLE

Hierarchical Graph Convolutional Networks for Structured Long Document Classification

Tengfei LiuYongli HuBoyue WangYanfeng SunJunbin GaoBaocai Yin

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2022 Vol: 34 (10)Pages: 8071-8085
JOURNAL ARTICLE

MG-GCN: Multi-Granularity Graph Convolutional Neural Network for Multi-Label Classification in Multi-Label Information System

Bin YuHengjie XieMingjie CaiWeiping Ding

Journal:   IEEE Transactions on Emerging Topics in Computational Intelligence Year: 2023 Vol: 8 (1)Pages: 288-299
© 2026 ScienceGate Book Chapters — All rights reserved.