With increasing globalization, automatic emotion recognition faces a new challenge in multi-culture scenarios: generalizing across different cultures. Previous works mainly rely on multi-cultural datasets, which are expensive to collect, to address the cross-culture discrepancy. In this paper, we propose an adversarial learning framework to alleviate the influence of culture on multimodal emotion recognition. We treat emotion recognition and culture recognition as two adversarial tasks: the emotion feature embedding is trained to improve emotion recognition while confusing culture recognition, so that it becomes more emotion-salient and culture-invariant for cross-culture emotion recognition. Our approach is applicable to both mono-culture and multi-culture emotion datasets. Extensive experiments demonstrate that the proposed method significantly outperforms previous baselines in both cross-culture and multi-culture evaluations.
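The adversarial relationship between the two tasks can be illustrated with a minimal sketch. The abstract does not specify the exact objective, so the code assumes one common formulation: the culture classifier minimizes its own cross-entropy, while the embedding minimizes the emotion cross-entropy minus a weighted culture cross-entropy. The function names, the trade-off weight `lam`, and the toy logits are all hypothetical and chosen only for illustration.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example (numerically stable)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[label]


def adversarial_losses(emotion_logits, emotion_label,
                       culture_logits, culture_label, lam=0.5):
    """Return (embedding_loss, culture_head_loss) for one example.

    Assumed formulation, not taken from the paper:
      - the culture head minimizes its own cross-entropy;
      - the embedding minimizes emotion loss minus lam * culture loss,
        so it is rewarded when the culture head is confused.
    """
    l_emo = cross_entropy(emotion_logits, emotion_label)
    l_cul = cross_entropy(culture_logits, culture_label)
    embedding_loss = l_emo - lam * l_cul
    culture_head_loss = l_cul
    return embedding_loss, culture_head_loss


# Same emotion prediction, two different culture-head outcomes (toy values).
confident = adversarial_losses([2.0, 0.0, 0.0], 0, [3.0, -3.0], 0)
confused = adversarial_losses([2.0, 0.0, 0.0], 0, [0.0, 0.0], 0)

# The embedding objective is lower when the culture head cannot tell
# the cultures apart, which is the culture-invariance pressure.
print(confused[0] < confident[0])
```

In practice the two objectives would be optimized alternately or through a gradient reversal layer; which of these the proposed framework uses is not stated in the abstract.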