JOURNAL ARTICLE

Deep multi-modal data analysis and fusion for robust scene understanding in CAVs

Abstract

Deep learning (DL) is becoming an integral part of Autonomous Vehicles (AVs). The development of scene analysis modules that are robust to vulnerabilities such as adversarial inputs or cyber-attacks is therefore becoming imperative for future AV perception systems. In this paper, we address this issue by exploring recent progress in Artificial Intelligence (AI) and Machine Learning (ML) to provide holistic situational awareness and eliminate the effect of such attacks on the scene analysis modules. We propose novel multi-modal approaches that achieve robustness to adversarial attacks by appropriately modifying the analysis neural networks and by utilizing late fusion methods. More specifically, we propose a holistic approach that adds new layers to a 2D segmentation DL model, enhancing its robustness to adversarial noise. Then, a novel late fusion technique is applied: features are extracted directly from the 3D space and projected into the 2D segmented space to identify inconsistencies. Extensive evaluation studies using the KITTI odometry dataset provide promising performance results under various types of noise.
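The abstract does not spell out how the 3D features are projected or how inconsistencies are scored, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the 3D points are already expressed in the camera frame, that each point carries a class label obtained from the 3D branch, and that a 3 x 4 camera projection matrix P is available. The function names project_points and inconsistency_rate are hypothetical.

```python
import numpy as np


def project_points(points_xyz, P):
    """Project N x 3 camera-frame points to pixel coordinates.

    P is an assumed 3 x 4 camera projection matrix. Returns the N x 2
    pixel coordinates and a boolean mask of points in front of the camera.
    """
    n = points_xyz.shape[0]
    homog = np.hstack([points_xyz, np.ones((n, 1))])  # N x 4 homogeneous
    uvw = homog @ P.T                                  # N x 3
    in_front = uvw[:, 2] > 1e-6                        # positive depth only
    uv = np.full((n, 2), -1.0)                         # -1 marks invalid
    uv[in_front] = uvw[in_front, :2] / uvw[in_front, 2:3]
    return uv, in_front


def inconsistency_rate(points_xyz, point_labels, seg_mask, P):
    """Fraction of visible 3D points whose label disagrees with the 2D mask.

    seg_mask is an H x W array of class ids from the 2D segmentation model;
    point_labels holds one class id per 3D point (an assumption here).
    """
    h, w = seg_mask.shape
    uv, in_front = project_points(points_xyz, P)
    cols = np.floor(uv[:, 0]).astype(int)
    rows = np.floor(uv[:, 1]).astype(int)
    visible = in_front & (cols >= 0) & (cols < w) & (rows >= 0) & (rows < h)
    if not visible.any():
        return 0.0, visible
    mismatch = seg_mask[rows[visible], cols[visible]] != point_labels[visible]
    return float(mismatch.mean()), visible


# Toy usage with made-up numbers: a 4 x 4 mask and four 3D points.
P = np.array([[100.0, 0.0, 2.0, 0.0],
              [0.0, 100.0, 2.0, 0.0],
              [0.0, 0.0, 1.0, 0.0]])
points = np.array([[0.0, 0.0, 10.0],    # lands inside the image
                   [0.15, 0.0, 10.0],   # lands inside the image
                   [0.0, 0.0, -5.0],    # behind the camera
                   [1.0, 0.0, 10.0]])   # outside the image bounds
labels_3d = np.zeros(4, dtype=int)
mask_2d = np.zeros((4, 4), dtype=int)
mask_2d[2, 3] = 1                       # 2D model disagrees at this pixel
rate, visible = inconsistency_rate(points, labels_3d, mask_2d, P)
print(rate, visible)
```

A high disagreement rate between the two modalities could then serve as a signal that one of them has been perturbed. Any threshold on that rate would be a design choice that the abstract does not specify.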

Keywords:
Robustness (evolution); Computer science; Artificial intelligence; Adversarial system; Odometry; Sensor fusion; Deep learning; Machine learning; Noise (video); Situation awareness; Artificial neural network; Modal; Robot; Engineering; Mobile robot; Image (mathematics)

Metrics

Cited By: 2
FWCI (Field Weighted Citation Impact): 0.28
Refs: 53
Citation Normalized Percentile: 0.66

Topics

Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Forensic Toxicology and Drug Analysis
Life Sciences →  Pharmacology, Toxicology and Pharmaceutics →  Toxicology

Related Documents

JOURNAL ARTICLE

Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding

Yi Liu, Chengxin Li, Shoukun Xu, Jungong Han

Journal: International Journal of Computer Vision; Year: 2025; Vol: 133 (7); Pages: 4483-4503
JOURNAL ARTICLE

Multi-modal Medical Data Fusion using Deep Learning

D. Haritha, B. Sandhya

Journal: 2022 9th International Conference on Computing for Sustainable Global Development (INDIACom); Year: 2022; Pages: 500-505
JOURNAL ARTICLE

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

Ling‐Dong Kong, Xiang Xu, Jiawei Ren, Wenwei Zhang, Liang Pan, Kai Chen, Wei Tsang Ooi, Ziwei Liu

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence; Year: 2025; Vol: 47 (5); Pages: 3748-3765