JOURNAL ARTICLE

Learning Semantics-Guided Visual Attention for Few-Shot Image Classification

Abstract

We propose a deep learning framework for few-shot image classification, which exploits information across label semantics and image domains, so that regions of interest can be properly attended for improved classification. The proposed semantics-guided attention module is able to focus on most relevant regions in an image, while the attended image samples allow data augmentation and alleviate possible overfitting during FSL training. Promising performances are presented in our experiments, in which we consider both closed and open-world settings. The former considers the test input belong to the categories of few shots only, while the latter requires recognition of all categories of interest.

Keywords:
Overfitting Semantics (computer science) Computer science Focus (optics) Artificial intelligence Image (mathematics) Contextual image classification Shot (pellet) Exploit Pattern recognition (psychology) Machine learning Natural language processing Artificial neural network Programming language

Metrics

14
Cited By
2.18
FWCI (Field Weighted Citation Impact)
24
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.