Object detection in very-high-resolution (VHR) remote sensing images is one of the important technical means in many fields. In recent years, conventional object detection methods have been completely replaced by Convolutional Neural Network (CNN)-based methods, which are more accurate and efficient. However, most current CNN-based methods applied in VHR image sets have certain defects: (1) Scale preference is common in the framework designs, and the representation ability of feature maps for large and small objects is quite different, so accuracy promotion can hardly be made comprehensively in the detection of different objects. (2) The scale difference of the objects leads to training difficulties. (3) Some high-precision methods require high hardware costs, and the overall frameworks lack practicality. To address such problems, we propose a new object detection method in this paper, namely Balanced Multi-Scale Fusion-based CNN (BMF-CNN). It is a redesigned two-stage detection framework according to the region-based object detection methods, which enabled the detection accuracy of both large and small objects to reach a high level. Through the evaluation in the open VHR remote sensing image sets, we found that BMF-CNN showed a better integrative performance than the current mainstream detection methods.
Chun LiuSixuan ZhangMengjie HuQing Song
Sumin LiJinhua LinYijin GangXiuqin Pan