Yu Gao, Chenwei Deng, Liang Chen, Zicong Zhu
With the rapid development of remote sensing imaging and deep learning technology, fine-grained recognition of rigid targets has gradually emerged. Rigid targets in remote sensing scenes usually retain relatively stable scale information and apparent structure, providing an adequate basis for their discrimination. However, existing methods fail to fully exploit this scale information and apparent structure, which results in scale neglect and insufficient discriminative feature extraction (DFE). In response to these challenges, we propose SD-Net, a training framework for fine-grained recognition of rigid objects in remote sensing scenes. It consists of a fused label learning process based on a probability distribution function (PDF) and a DFE branch. The PDF module gathers the objects' scale statistics by category, builds a probability model from the sample distribution, and finally converts it into a soft-label form to guide model learning. The DFE branch extracts discriminative features along the channel and spatial dimensions of the feature map through deep feature mining over a wide receptive range. Finally, we propose the FAIR1M-OR dataset, containing 37 fine-grained categories and about 600 000 instances, to verify the method's effectiveness. The experimental results show that, while introducing only a small number of parameters during training, SD-Net improves the performance of ResNet- and ViT-based models by about 4.6 points. The code and dataset will be open-sourced in the future.
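To make the fused-label idea concrete, the following is a minimal sketch (not the authors' implementation) of how per-class scale statistics could be turned into a soft label. All names and the Gaussian modeling choice are assumptions; the sketch models each category's scale as a Gaussian, evaluates the likelihood of an observed object scale under every category, and blends the normalized likelihoods with the one-hot label.

```python
import numpy as np

def scale_soft_label(scale, class_means, class_stds, hard_label, alpha=0.7):
    """Hypothetical sketch of PDF-based fused label learning.

    scale       -- observed scale of the object (e.g., sqrt of box area)
    class_means -- per-category mean scale, shape (C,)
    class_stds  -- per-category scale std, shape (C,)
    hard_label  -- ground-truth category index
    alpha       -- mixing weight between the one-hot label and the scale prior
    """
    class_means = np.asarray(class_means, dtype=float)
    class_stds = np.asarray(class_stds, dtype=float)
    # Gaussian likelihood of the observed scale under each category's PDF
    likelihood = np.exp(-0.5 * ((scale - class_means) / class_stds) ** 2) / class_stds
    prior = likelihood / likelihood.sum()  # normalize to a distribution
    one_hot = np.eye(len(class_means))[hard_label]
    # Soft label: mostly the true label, softened by the scale prior
    return alpha * one_hot + (1 - alpha) * prior
```

Such a soft label can replace the one-hot target in a cross-entropy loss, so that categories with overlapping scale distributions receive correlated supervision.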