Lianyi Bao, Songjian Chen, Fei Han
At present, crowd counting in complex backgrounds remains a significant challenge, yet a meaningful task for public safety. We focus on this problem and propose a multi-scale multi-clue crowd counting network (MMNet), which is composed of a feature-encoder backbone and, as the decoder, four stacked multi-clue crowd estimation modules (MCEM) operating at multiple scales. Each module consists of three predictors: a shared attention predictor (SAP), a density map predictor (DMP), and a local counting map predictor (LCMP). DMP utilizes the information of each pixel of the image, while LCMP divides the image into patches and counts the number of people in each patch. These two predictors address inaccurate crowd counting under complex backgrounds from the perspective of the training target: they exploit the microscopic and macroscopic information of the image for model training, respectively. SAP helps them concentrate on the human-head regions of the image by generating multi-scale shared attention maps, from the perspective of feature extraction. Furthermore, we design a multi-task joint training strategy that automatically adjusts the loss weights of the different tasks to promote training and improve the robustness of the model. Extensive experiments on three challenging datasets (ShanghaiTech, UCF_CC_50, UCF-QNRF) demonstrate the superior performance of MMNet.
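The LCMP training target described above can be derived from the same annotations as the density map: summing the density map over non-overlapping patches yields per-patch counts, and the map's total still equals the image-level crowd count. A minimal NumPy sketch of this relationship (the function name and the divisibility assumption are illustrative, not taken from the paper):

```python
import numpy as np

def local_counting_map(density, patch):
    """Sum a density map over non-overlapping patch x patch cells.

    Each output cell holds the (possibly fractional) head count inside
    that patch, so the local counting map's total equals the image-level
    count. Assumes height and width are divisible by `patch`; otherwise
    pad or crop the density map first.
    """
    h, w = density.shape
    assert h % patch == 0 and w % patch == 0, "pad or crop first"
    return density.reshape(h // patch, patch, w // patch, patch).sum(axis=(1, 3))

# Toy 4x4 density map with two "heads", each of total mass 1.
d = np.zeros((4, 4))
d[0, 0] = 1.0          # head concentrated in the top-left patch
d[2:4, 2:4] = 0.25     # head spread over the bottom-right patch
lcm = local_counting_map(d, 2)
print(lcm)             # [[1. 0.] [0. 1.]]
print(lcm.sum())       # 2.0 == total count
```

This is only the target-construction side; how MMNet's LCMP predicts such maps from features is defined by the network itself.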