JOURNAL ARTICLE

Crowd Counting via Multi-view Scale Aggregation Networks

Abstract

Crowd counting, aiming at estimating the total number of people in unconstrained crowded scenes, has increasingly received attention. But it is greatly challenged by the huge variation in people scale. In this paper, we propose a novel Multi-View Scale Aggregation Network (MVSAN), which handle the scale variation from feature, input and criterion view comprehensively. Firstly, we design a simple but effective Multi-Scale Feature Encoder, which exploits dilated convolution layers with various dilation rates to improve the representation ability and scale diversity of features. Secondly, we feed multiple scales of input images into networks to generate high-quality density maps in a coarse-to-fine manner. Finally, we propose a Multi-Scale Structural Similarity loss to force our networks to learn the local correlation of density maps. Extensive experiments on two standard benchmarks show that the proposed method can generate high-quality crowd density map and accurate count estimation, outperforming the state-of-the-art methods with a large margin.

Keywords:
Computer science Scale (ratio) Margin (machine learning) Encoder Convolution (computer science) Feature (linguistics) Artificial intelligence Pattern recognition (psychology) Representation (politics) Feature extraction Dilation (metric space) Similarity (geometry) Density estimation Inference Exploit Convolutional neural network Data mining Artificial neural network Machine learning Image (mathematics) Mathematics Estimator Statistics

Metrics

33
Cited By
2.46
FWCI (Field Weighted Citation Impact)
43
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.