Ziyang GongZhixiang WeiDi WangXiaoxing HuXianzheng MaHongruixuan ChenYuru JiaYun DengZhenming JiXiangwei ZhuX. Jessie YangNaoto YokoyaJing ZhangBo DuJunchi YanLiangpei Zhang
Due to the substantial domain gaps in Remote Sensing (RS) images that are characterized by variabilities such as location, wavelength, and sensor type, Remote Sensing Domain Generalization (RSDG) has emerged as a critical and valuable research frontier, focusing on developing models that generalize effectively across diverse scenarios. However, research in this area remains underexplored: (1) Current cross-domain methods primarily focus on Domain Adaptation (DA), which adapts models to predefined domains rather than to unseen ones; (2) Few studies target the RSDG issue, especially for semantic segmentation tasks. Existing related models are developed for specific unknown domains, struggling with issues of underfitting on other unseen scenarios; (3) Existing RS foundation models tend to prioritize in-domain performance over cross-domain generalization. To this end, we introduce the first vision foundation model for RSDG semantic segmentation, CrossEarth. CrossEarth demonstrates strong cross-domain generalization through a specially designed data-level Earth-Style Injection pipeline and a model-level Multi-Task Training pipeline. In addition, for the semantic segmentation task, we have curated an RSDG benchmark comprising 32 semantic segmentation scenarios across various regions, spectral bands, platforms, and climates, providing comprehensive evaluations of the generalizability of future RSDG models. Extensive experiments on this collection demonstrate the superiority of CrossEarth over existing state-of-the-art methods.
Muying LuoYujie ZanKourosh KhoshelhamShunping Ji
Pan ChenXijian FanTardi TjahjadiHaiyan GuanLiyong FuQiaolin YeRuili Wang
Jiayuan LiZhen WangNan XuZhu‐Hong YouDe-Shuang Huang
Christian GeiβYue ZhuChunping QiuLichao MouXiao Xiang ZhuHannes Taubenböck
Qiuyue ZhangZhiwang ZhangShiting WenChaoyi PangFangyu Wu