Keping WangBingqian SuoQian WeiGaopeng ZhangTian WangYi Yang
Abstract The essence of small object detection is to establish a mapping from the image pixels to the location and classification of targets. It is well known that a few valid pixels and complex backgrounds are the greatest challenges. This is because the intricate mapping cannot be formulated in a small object pixel space while facing a huge disturbance produced by background pixel spaces. To address these issues, this paper proposes a novel small object detection network named synchronous attention granularity Swin Transformer (SAG-ST). A synchronous attention ST block is proposed to elegantly integrate information from deep and shallow features. And the granularity adaptive ST block employs a channel granularity adaptive mechanism to mitigate background interference by adaptively applying self-attention with varying granularities for different channels. Finally, this paper creates a small object detection dataset based on unmanned aerial vehicles with different flight altitudes. The experiments are carried out on the created dataset and VisDrone dataset, and the experimental results show that our SAG-ST algorithm achieves the best detection accuracy.
Hang GongTingkui MuQiuxia LiHaishan DaiChunlai LiZhiping HeWenjing WangFeng HanAbudusalamu TuniyaziHaoyang LiXuechan LangZhiyuan LiBin Wang
Yuqi SunXuan WangYi ZhengLin YaoShuhan QiLinlin TangHong YiKunlei Dong
Xu FengchangRayner AlfredJackel Vui Lung ChewShifeng DuGe LvRayner Henry Pailus
Dung NguyenVan-Dung HoangVan-Tuong-Lan LeNhat-Duy Nguyen
Meiling ShiDongling ZhengTianhao WuWenjing ZhangRuijie FuKailiang Huang