JOURNAL ARTICLE

Multilabel Deep Visual-Semantic Embedding

Mei-Chen YehYinan Li

Year: 2019 Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 42 (6)Pages: 1530-1536   Publisher: IEEE Computer Society

Abstract

Inspired by the great success from deep convolutional neural networks (CNNs) for single-label visual-semantic embedding, we exploit extending these models for multilabel images. We propose a new learning paradigm for multilabel image classification, in which labels are ranked according to its relevance to the input image. In contrast to conventional CNN models that learn a latent vector representation (i.e., the image embedding vector), the developed visual model learns a mapping (i.e., a transformation matrix) from an image in an attempt to differentiate between its relevant and irrelevant labels. Despite the conceptual simplicity of our approach, the proposed model achieves state-of-the-art results on three public benchmark datasets.

Keywords:
Artificial intelligence Computer science Embedding Convolutional neural network Pattern recognition (psychology) Benchmark (surveying) Deep learning Image (mathematics) Exploit Simplicity Relevance (law) Semantics (computer science) Contextual image classification Image retrieval Representation (politics) Transformation (genetics) Machine learning

Metrics

22
Cited By
1.84
FWCI (Field Weighted Citation Impact)
53
Refs
0.88
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.