Ruolei LiYilong ZengJianfeng WuYongli WangXiaoli Zhang
Few-shot object detection (FSOD), which aims at detecting rare objects based on few training samples has attracted significant research interest. Previous approaches find that the performance degradation of FSOD is mainly caused by category confusion (high false positives). To solve this issue, we propose a two-stage fine-tuning approach via classification score calibration for remote sensing images, named TFACSC, which follows the flowchart of base training and few-shot fine-tuning to train the detector. First, the backbone with strong representation ability is employed to extract the multi-scale features of the query image. Then, these features are aggregated by a novel multi-head scaled cosine non-local module (MSCN). Next, the aggregated features are used to generate the objectiveness proposals by a region proposal network. Eventually, the generated proposals are refined by a bounding box prediction head for the final category and position prediction, and the category score is calibrated by a novel classification score calibration module (CSCM). Extensive experiments conducted on NWPU VHR 10 and DIOR benchmarks demonstrate the effectiveness of our model. Particularly, for any shot cases, our method greatly outperforms the baseline and achieves state-of-the-art performance.
Xingyu ZhangHaopeng ZhangZhiguo Jiang
Yu CaoJingyi ChenHaoyu WangLei ZhangChen DingWei WeiShiqi CaoMeilin Xie
Sixu LiuYanan YouHaozheng SuGang MengWei YangFang Liu
Lian ZhouC. HeDaosheng WANGZiqi Guo