Multilabel Deep Visual-Semantic Embedding

Mei-Chen Yeh; Yinan Li

doi:10.1109/tpami.2019.2911065

ScienceGate Book Chapters

JOURNAL ARTICLE

Multilabel Deep Visual-Semantic Embedding

Mei-Chen Yeh Yinan Li

Year: 2019 Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 42 (6)Pages: 1530-1536 Publisher: IEEE Computer Society

DOI: 10.1109/tpami.2019.2911065

Get Full-Text PDF Get Analytical Report

Abstract

Inspired by the great success from deep convolutional neural networks (CNNs) for single-label visual-semantic embedding, we exploit extending these models for multilabel images. We propose a new learning paradigm for multilabel image classification, in which labels are ranked according to its relevance to the input image. In contrast to conventional CNN models that learn a latent vector representation (i.e., the image embedding vector), the developed visual model learns a mapping (i.e., a transformation matrix) from an image in an attempt to differentiate between its relevant and irrelevant labels. Despite the conceptual simplicity of our approach, the proposed model achieves state-of-the-art results on three public benchmark datasets.

Keywords:

Artificial intelligence Computer science Embedding Convolutional neural network Pattern recognition (psychology) Benchmark (surveying) Deep learning Image (mathematics) Exploit Simplicity Relevance (law) Semantics (computer science) Contextual image classification Image retrieval Representation (politics) Transformation (genetics) Machine learning

Metrics

Cited By

1.84

FWCI (Field Weighted Citation Impact)

Refs

0.88

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multilabel Deep Visual-Semantic Embedding

Abstract

Metrics

Citation History

Topics

Related Documents

Video thumbnail recommendation based on deep visual-semantic embedding

Deep Visual Semantic Embedding with Text Data Augmentation and Word Embedding Initialization

SLED: Semantic Label Embedding Dictionary Representation for Multilabel Image Annotation

Click-through-based Deep Visual-Semantic Embedding for Image Search

Multi-task deep visual-semantic embedding for video thumbnail selection