JOURNAL ARTICLE

Crowd Counting with Dilated Inception Convolution

Abstract

Convolutional neural network (CNN) has been successfully applied to image-based crowd density estimation. However, large computational resources are required in previous CNN-based methods. Therefore, to overcome these drawbacks, this paper proposes a lightweight crowd density map estimation architecture with Dilated Inception Convolution Neural Network (DICNN). The proposed method not only extracts scale-aware informative features, but also effectively reduces the number of parameters of the CNN architecture. In addition, the proposed method is trained along with DICNN in an end-to-end fashion via both pixel-wise Euclidean distance and density-level relevant (DLR) loss for global optimization. Extensive experiments on several publicly available datasets have shown that the proposed method outperforms state-of-the-art approaches in almost all datasets with far fewer parameters.

Keywords:
Computer science Convolutional neural network Convolution (computer science) Artificial intelligence Euclidean distance Pixel Pattern recognition (psychology) Scale (ratio) Architecture Image (mathematics) Artificial neural network Computer vision

Metrics

2
Cited By
0.20
FWCI (Field Weighted Citation Impact)
13
Refs
0.50
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.