IFE-CMT: Instance-Aware Fine-Grained Feature Enhancement Cross Modal Transformer for 3D Object Detection

Xiaona Song; Haonan Zhang; Haichao Liu; Xinxin Wang; Lijun Wang

doi:10.3390/s25185685

ScienceGate Book Chapters

JOURNAL ARTICLE

IFE-CMT: Instance-Aware Fine-Grained Feature Enhancement Cross Modal Transformer for 3D Object Detection

Xiaona Song Haonan Zhang Haichao Liu Xinxin Wang Lijun Wang

Year: 2025 Journal: Sensors Vol: 25 (18)Pages: 5685-5685 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/s25185685

Get Full-Text PDF Get Analytical Report

Abstract

In recent years, multi-modal 3D object detection algorithms have experienced significant development. However, current algorithms primarily focus on designing overall fusion strategies for multi-modal features, neglecting finer-grained representations, which leads to a decline in the detection accuracy of small objects. To address this issue, this paper proposes the Instance-aware Fine-grained feature Enhancement Cross Modal Transformer (IFE-CMT) model. We designed an Instance feature Enhancement Module (IE-Module), which can accurately extract object features from multi-modal data and use them to enhance overall features while avoiding view transformations and maintaining low computational overhead. Additionally, we design a new point cloud branch network that effectively expands the network’s receptive field, enhancing the model’s semantic expression capabilities while preserving texture details of the objects. Experimental results on the nuScenes dataset demonstrate that compared to the CMT model, our proposed IFE-CMT model improves mAP and NDS by 2.1% and 0.8% on the validation set, respectively. On the test set, it improves mAP and NDS by 1.9% and a 0.7%. Notably, for small object categories such as bicycles and motorcycles, the mAP improved by 6.6% and 3.7%, respectively, significantly enhancing the detection accuracy of small objects.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.37

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

IFE-CMT: Instance-Aware Fine-Grained Feature Enhancement Cross Modal Transformer for 3D Object Detection

Abstract

Metrics

Topics

Related Documents

FFEDet: Fine-Grained Feature Enhancement for Small Object Detection

Semantic-aware Fine-grained Point Augmentation for 3D Multi-modal Object Detection

FINE-GRAINED FEATURE ENHANCEMENT FOR OBJECT DETECTION IN REMOTE SENSING IMAGES

Fine-Grained Feature Enhancement for Object Detection in Remote Sensing Images

Spatially-Aware Human-Object Interaction Detection with Cross-Modal Enhancement