RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval

Yanglin Feng; Hongyuan Zhu; Dezhong Peng; Xi Peng; Peng Hu

doi:10.1109/cvpr52729.2023.01117

JOURNAL ARTICLE

Year: 2023 Pages: 11610-11619

Abstract

Recently, with the advent of the Metaverse and AI-generated content, cross-modal retrieval has become popular amid a burst of 2D and 3D data. However, the problem is challenging given the heterogeneous structures and semantic discrepancies involved. Moreover, imperfect annotations are ubiquitous given the ambiguity of 2D and 3D content, inevitably producing noisy labels that degrade learning performance. To tackle this problem, this paper proposes a robust 2D-3D retrieval framework (RONO) that learns robustly from noisy multimodal data. Specifically, a novel Robust Discriminative Center Learning mechanism (RDCL) is proposed in RONO to adaptively distinguish clean from noisy samples and provide them with positive and negative optimization directions, respectively, thus mitigating the negative impact of noisy labels. In addition, we present a Shared Space Consistency Learning mechanism (SSCL) to capture the intrinsic information inside the noisy data by simultaneously minimizing the cross-modal and semantic discrepancies between the common space and the label space. Comprehensive mathematical analyses are given to theoretically prove the noise tolerance of the proposed method. Furthermore, we conduct extensive experiments on four 3D-model multimodal datasets to verify the effectiveness of our method by comparing it with 15 state-of-the-art methods. Code is available at https://github.com/penghu-cs/RONO.

Keywords:

Discriminative model; Computer science; Consistency (knowledge bases); Noise (video); Artificial intelligence; Modal; Robustness (evolution); Code (set theory); Imperfect; Source code; Noisy data; Machine learning; Pattern recognition (psychology); Image (mathematics); Set (abstract data type)

Metrics

FWCI (Field Weighted Citation Impact): 4.91

Citation Normalized Percentile: 0.94

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

Related Documents

Learning Cross-Modal Retrieval with Noisy Labels

Cross-Modal Retrieval With Noisy Labels

Robust Self-Paced Hashing for Cross-Modal Retrieval with Noisy Labels

Neighborhood Learning from Noisy Labels for Cross-Modal Retrieval

Noise-Robust Cross-modal Learning for Reliable 2D-3D Retrieval