Abstract

We present a novel latent embedding model for learning a compatibility function between image and class embeddings, in the context of zero-shot classification. The proposed method augments the state-of-the-art bilinear compatibility model by incorporating latent variables. Instead of learning a single bilinear map, it learns a collection of maps with the selection, of which map to use, being a latent variable for the current image-class pair. We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image. We empirically demonstrate that our model improves the state-of-the-art for various class embeddings consistently on three challenging publicly available datasets for the zero-shot setting. Moreover, our method leads to visually highly interpretable results with clear clusters of different fine-grained object properties that correspond to different latent variable maps.

Keywords:
Computer science Bilinear interpolation Artificial intelligence Latent variable Bilinear map Embedding Pattern recognition (psychology) Class (philosophy) Machine learning Computer vision

Metrics

690
Cited By
62.01
FWCI (Field Weighted Citation Impact)
60
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
COVID-19 diagnosis using AI
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging

Related Documents

JOURNAL ARTICLE

Zero-Shot Audio Classification using Image Embeddings

Duygu DoğanHuang XieToni HeittolaTuomas Virtanen

Journal:   2022 30th European Signal Processing Conference (EUSIPCO) Year: 2022 Pages: 1-5
JOURNAL ARTICLE

Zero-Shot Audio Classification Via Semantic Embeddings

Huang XieTuomas Virtanen

Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Year: 2021 Vol: 29 Pages: 1233-1242
JOURNAL ARTICLE

Binary Attribute Embeddings for Zero-Shot Sound Event Classification

Yihan LinXunquan ChenRyoichi TakashimaTetsuya Takiguchi

Journal:   2022 IEEE 11th Global Conference on Consumer Electronics (GCCE) Year: 2022 Pages: 13-14
© 2026 ScienceGate Book Chapters — All rights reserved.