JOURNAL ARTICLE

Crowd Counting by Multi-Scale Dilated Convolution Networks

Jingwei DongZiqi ZhaoTongxin Wang

Year: 2023 Journal:   Electronics Vol: 12 (12)Pages: 2624-2624   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The number of people in a crowd is crucial information in public safety, intelligent monitoring, traffic management, architectural design, and other fields. At present, the counting accuracy in public spaces remains compromised by some unavoidable situations, such as the uneven distribution of a crowd and the difference in head scale caused by people’s differing distances from the camera. To solve these problems, we propose a deep learning crowd counting model, multi-scale dilated convolution networks (MSDCNet), based on crowd density map estimation. MSDCNet consists of three parts. The front-end network uses the truncated VGG16 to obtain preliminary features of the input image, with a proposed spatial pyramid pooling (SPP) module replacing the max-pooling layer to extract features with scale invariance. The core network is our proposed multi-scale feature extraction network (MFENet) for extracting features in three different scales. The back-end network consists of consecutive dilation convolution layers instead of traditional alternate convolution and pooling to expand the receptive field, extract high-level semantic information and avoid the spatial feature loss of small-scale heads. The experimental results on three public datasets show that the proposed model solved the above problems satisfactorily and obtained better counting accuracy than representative models in terms of mean absolute error (MAE) and mean square error (MSE).

Keywords:
Pooling Convolution (computer science) Computer science Scale (ratio) Artificial intelligence Pyramid (geometry) Mean squared error Feature extraction Feature (linguistics) Convolutional neural network Pattern recognition (psychology) Dilation (metric space) Data mining Algorithm Computer vision Mathematics Artificial neural network Statistics Geometry Geography Cartography

Metrics

3
Cited By
0.55
FWCI (Field Weighted Citation Impact)
27
Refs
0.60
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Fire Detection and Safety Systems
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality
Human Mobility and Location-Based Analysis
Social Sciences →  Social Sciences →  Transportation

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.