JOURNAL ARTICLE

Multimodal kernel learning for image retrieval

Abstract

We propose a semi-supervised learning technique to address the problem of fusing multimodal information sources for CBIR. In our approach, user's preferences in the form of reference feedback are treated as labeled data, and the key idea is to devise an on-line scheme to effectively transform the abstract semantics into useful training data for improving the query performance. Specifically, our method can be characterized with the following three advantages: 1) Kernel matrices are used to encode each modality of information so that the fusion can be conveniently carried out via boosting; 2) The base kernel matrices are derived from eigendecomposing the graph Laplacian, and further refined to satisfy a pivotal monotone property that ensures intrinsic structure will be reasonably maintained for each modality; 3) The adopted optimization criterion in boosting is to align with a target kernel matrix accounting for relevance feedback, and the learned multimodal kernel matrix can be used for training, and then for testing with those unlabeled ones in the database. To demonstrate the efficiency of the proposed framework, experimental results on CBIR are provided to illustrate several practical considerations.

Keywords:
Computer science Kernel (algebra) Boosting (machine learning) Artificial intelligence Relevance feedback ENCODE Machine learning Graph Image retrieval Laplacian matrix Pattern recognition (psychology) Theoretical computer science Image (mathematics) Mathematics

Metrics

1
Cited By
0.32
FWCI (Field Weighted Citation Impact)
46
Refs
0.60
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology

Related Documents

BOOK-CHAPTER

Online Kernel-Based Multimodal Similarity Learning with Application to Image Retrieval

Wenping ZhangHong Zhang

Lecture notes in computer science Year: 2015 Pages: 221-232
JOURNAL ARTICLE

Application of Diffusion Kernel in Multimodal Image Retrieval

Rajeev AgrawalWilliam I. GroskyFarshad FotouhiChanghua Wu

Journal:   Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007) Year: 2007 Pages: 271-276
JOURNAL ARTICLE

Application of Diffusion Kernel in Multimodal Image Retrieval

Rajeev AgrawalWilliam I. GroskyFarshad FotouhiChanghua Wu

Journal:   Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007) Year: 2007
JOURNAL ARTICLE

Application of Diffusion Kernel in Multimodal Image Retrieval

Rajeev AgrawalWilliam I. GroskyFarshad FotouhiChanghua Wu

Journal:   International Symposium on Multimedia Year: 2007 Pages: 271-276
JOURNAL ARTICLE

Learning Discriminative Fisher Kernel for Image Retrieval

Bin WangXiong LiYuncai Liu

Journal:   KSII Transactions on Internet and Information Systems Year: 2013 Vol: 7 (3)Pages: 522-538
© 2026 ScienceGate Book Chapters — All rights reserved.