JOURNAL ARTICLE

Multi-level Convolutional Transformer with Adaptive Ranking for Semi-supervised Crowd Counting

Xin DengSongjian ChenYifan ChenJie-Fang Xu

Year: 2021 Journal:   2021 4th International Conference on Algorithms, Computing and Artificial Intelligence Pages: 1-7

Abstract

Most existing crowd counting methods have focused on pure convolutional neural network based supervised algorithms. Although these methods have attained good results on some datasets, they still encounter several common problems. The cost of labeling annotations for supervised methods is huge and the shortage of labeled datasets limits the further development of supervised algorithms for crowd counting. Meanwhile, pure CNN-based algorithms have certain limitations in building the connections among these features. To overcome those problems, we proposed a semi-supervised crowd counting algorithm that is a mixture model of CNN and transformer. Specifically, our method consists of two parts Multi-Level Convolutional Transformer (MLCT) and Adaptive Scale Module (ASM). MLCT is the counting branch, with its front end and back end being the CNN and the transformer, respectively. ASM outputs an adaptive scale factor for the unlabeled crowd images. We generate a ranking list based on this factor, which is fed into the MLCT and computes loss by the order of the list. Different from most crowd counting methods, we use a region-level regression target for labeled images, which is a weaker regression approach than the location regression. Furthermore, We train the entire model using a novel loss function that combines L1 loss and ranking loss. Experimental results on the three challenging datasets ShanghaiTech Part A, ShanghaiTech Part B, and UCF-QNRF have all demonstrated the effectiveness of the proposed approach.

Keywords:
Computer science Convolutional neural network Transformer Artificial intelligence Economic shortage Regression Ranking (information retrieval) Machine learning Pattern recognition (psychology) Data mining Mathematics Statistics

Metrics

6
Cited By
0.32
FWCI (Field Weighted Citation Impact)
22
Refs
0.71
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Level Dynamic Graph Convolutional Networks for Weakly Supervised Crowd Counting

Zhuangzhuang MiaoYong ZhangHao RenYongli HuBaocai Yin

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2023 Vol: 25 (5)Pages: 3483-3495
JOURNAL ARTICLE

Crowd Counting Model with Convolutional Neural Networks and Transformer

Yonghui WangYang LiKe Tu

Journal:   2022 4th International Conference on Communications, Information System and Computer Engineering (CISCE) Year: 2022 Pages: 490-493
JOURNAL ARTICLE

Multi-scale features fused network with multi-level supervised path for crowd counting

Yongjie WangWei ZhangDongxiao HuangYanyan LiuJianghua Zhu

Journal:   Expert Systems with Applications Year: 2022 Vol: 200 Pages: 116949-116949
© 2026 ScienceGate Book Chapters — All rights reserved.