Deep learning has been applied to segment buildings from high-resolution images with promising results. However, there still exist the problems stemming from training on split patches and class imbalances. To overcome these problems, we propose a dual-resolution U-Net that uses pairs of images as inputs to capture both high and low resolution features. We also employ a soft Jaccard loss to place more emphasis on the sparse and low accuracy samples. The images from different regions are further balanced according to their building densities. With our architecture, we achieved state-of-the-art results on the Inria aerial image labeling dataset without any post-processing.
Chaohui LiYingjian LiuHaoyu YinYue LiPengting DuLimin ZhangQingxiang Guo
Renhe ZhangZhechun WanQian ZhangGuixu Zhang
Yaohui LiuJie ZhouWenhua QiXiaoli LiLutz GrossQi ShaoZhengguang ZhaoLi NiXiwei FanZhiqiang Li
M. SobhanaGudapati Satya Dinesh KumarPavithra PakkiruYarramreddy Tejaswi