Multi-level Convolutional Transformer with Adaptive Ranking for Semi-supervised Crowd Counting

Xin Deng; Songjian Chen; Yifan Chen; Jie-Fang Xu

doi:10.1145/3508546.3508548

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-level Convolutional Transformer with Adaptive Ranking for Semi-supervised Crowd Counting

Xin Deng Songjian Chen Yifan Chen Jie-Fang Xu

Year: 2021 Journal: 2021 4th International Conference on Algorithms, Computing and Artificial Intelligence Pages: 1-7

DOI: 10.1145/3508546.3508548

Get Full-Text PDF Get Analytical Report

Abstract

Most existing crowd counting methods have focused on pure convolutional neural network based supervised algorithms. Although these methods have attained good results on some datasets, they still encounter several common problems. The cost of labeling annotations for supervised methods is huge and the shortage of labeled datasets limits the further development of supervised algorithms for crowd counting. Meanwhile, pure CNN-based algorithms have certain limitations in building the connections among these features. To overcome those problems, we proposed a semi-supervised crowd counting algorithm that is a mixture model of CNN and transformer. Specifically, our method consists of two parts Multi-Level Convolutional Transformer (MLCT) and Adaptive Scale Module (ASM). MLCT is the counting branch, with its front end and back end being the CNN and the transformer, respectively. ASM outputs an adaptive scale factor for the unlabeled crowd images. We generate a ranking list based on this factor, which is fed into the MLCT and computes loss by the order of the list. Different from most crowd counting methods, we use a region-level regression target for labeled images, which is a weaker regression approach than the location regression. Furthermore, We train the entire model using a novel loss function that combines L1 loss and ranking loss. Experimental results on the three challenging datasets ShanghaiTech Part A, ShanghaiTech Part B, and UCF-QNRF have all demonstrated the effectiveness of the proposed approach.

Keywords:

Computer science Convolutional neural network Transformer Artificial intelligence Economic shortage Regression Ranking (information retrieval) Machine learning Pattern recognition (psychology) Data mining Mathematics Statistics

Metrics

Cited By

0.32

FWCI (Field Weighted Citation Impact)

Refs

0.71

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multi-level Convolutional Transformer with Adaptive Ranking for Semi-supervised Crowd Counting

Abstract

Metrics

Citation History

Topics

Related Documents

DTCC: Multi-level dilated convolution with transformer for weakly-supervised crowd counting

Semi-Supervised Crowd Counting via Swin Transformer with Adaptive Soft Threshold and Contrastive Learning

Multi-Level Dynamic Graph Convolutional Networks for Weakly Supervised Crowd Counting

Crowd Counting Model with Convolutional Neural Networks and Transformer

Multi-scale features fused network with multi-level supervised path for crowd counting