JOURNAL ARTICLE

Multi-scale Multi-clue Crowd Counting Network

Lianyi Bao, Songjian Chen, Fei Han

Year: 2021 Journal: 2021 4th International Conference on Algorithms, Computing and Artificial Intelligence Pages: 1-7

Abstract

Crowd counting against complex backgrounds remains challenging, yet it is an important task for public safety. We address this problem with a multi-scale multi-clue crowd counting network (MMNet), composed of a feature encoder backbone and four stacked multi-clue crowd estimation modules (MCEMs) that act as decoders at multiple scales. Each module consists of three predictors: a shared attention predictor (SAP), a density map predictor (DMP), and a local counting map predictor (LCMP). The DMP uses the per-pixel information of the image, while the LCMP divides the image into patches and predicts the number of people in each patch. These two predictors address inaccurate counting under complex backgrounds from the perspective of the training target, using the microscopic (pixel-level) and macroscopic (patch-level) information of the image, respectively. The SAP addresses the same problem from the perspective of feature extraction: it generates multi-scale shared attention maps that help both predictors concentrate on the head regions in the image. Furthermore, we design a multi-task joint training strategy that automatically adjusts the loss weights of the different tasks to improve training and the robustness of the model. Extensive experiments on three challenging datasets (ShanghaiTech, UCF_CC_50, and UCF-QNRF) show the superior performance of MMNet.
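
The relationship between the two training targets named in the abstract, a per-pixel density map and a per-patch local counting map, can be illustrated with a minimal sketch. It assumes the local counts are obtained by summing the density map over non-overlapping square patches; the abstract does not state how the authors build this target, and the function name `local_count_map`, the patch size, and the toy data are all hypothetical.

```python
import numpy as np

def local_count_map(density: np.ndarray, patch: int) -> np.ndarray:
    """Sum a per-pixel density map over non-overlapping patch x patch cells.

    Assumption for illustration: the patch-level target is the integral of
    the density map inside each patch. The paper may construct it differently.
    """
    h, w = density.shape
    if h % patch or w % patch:
        raise ValueError("image size must be divisible by the patch size")
    # Reshape to (rows, patch, cols, patch) and sum inside each cell.
    return density.reshape(h // patch, patch, w // patch, patch).sum(axis=(1, 3))

# Toy 4x4 density map holding two heads, each with a total mass of 1.
density = np.zeros((4, 4))
density[0, 0] = 1.0   # head entirely inside the top-left patch
density[2, 1] = 0.5   # head whose mass straddles the two bottom patches
density[2, 2] = 0.5

counts = local_count_map(density, patch=2)
print(counts)         # per-patch (macro) counts
print(counts.sum())   # total count, equal to density.sum()
```

Under this assumption both maps integrate to the same total count, so they differ only in the granularity at which a prediction error is penalized.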

Keywords:
Computer science; Robustness (evolution); Artificial intelligence; Perspective (graphical); Feature extraction; Task (project management); Encoder; Pixel; Macro; Focus (optics); Pattern recognition (psychology); Image (mathematics); Public security; Scale (ratio); Feature (linguistics); Machine learning; Computer vision

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 22
Citation Normalized Percentile: 0.22

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Fire Detection and Safety Systems
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Multi‐scale supervised network for crowd counting

Yongjie Wang, Wei Zhang, Dongxiao Huang, Yanyan Liu, Jianghua Zhu

Journal: IET Image Processing Year: 2020 Vol: 14 (17) Pages: 4701-4707

JOURNAL ARTICLE

The Multi-channel and Multi-scale Network for Crowd Counting

Pengze Wang, Wei Wu, Yang Su, Xin Li, Duan Yong-sheng

Journal: Journal of Physics: Conference Series Year: 2020 Vol: 1650 (3) Pages: 032070-032070