Crowd counting is a challenging task in computer vison field and haven't been well addressed until now. In this paper, we intend to develop an end to end multi-scale deep convolutional neural network(CNN) model that can accurately estimate the crowd count from an individual image with arbitrary crowd density and perspective. The proposed model extract multi-scale deep CNN features from the input image and regress the crwod count directly, without any post-processing . Hence our model could handle muti-scale targets well in various crowd scene. We evaluate our model on several benchmark datasets and the performance outperforms some state-of-the-art methods. What's more, due to the end-to-end characteristics, our model demonstrates good practical application performance.
Siqi TangYijia WuWei BaiZhisong Pan
Yanjie WangShiyu HuGuodong WangChenglizhao ChenZhenkuan Pan
Lingke ZengXiangmin XuBolun CaiSuo QiuTong Zhang
Guoquan JiangRui WuZhanqiang HuoCuijun ZhaoJunwei Luo