JOURNAL ARTICLE

Multimodal Similarity Gaussian Process Latent Variable Model

Guoli SongShuhui WangQingming HuangQi Tian

Year: 2017 Journal:   IEEE Transactions on Image Processing Vol: 26 (9)Pages: 4168-4181   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Data from real applications involve multiple modalities representing content with the same semantics from complementary aspects. However, relations among heterogeneous modalities are simply treated as observation-to-fit by existing work, and the parameterized modality specific mapping functions lack flexibility in directly adapting to the content divergence and semantic complicacy in multimodal data. In this paper, we build our work based on the Gaussian process latent variable model (GPLVM) to learn the non-parametric mapping functions and transform heterogeneous modalities into a shared latent space. We propose multimodal Similarity Gaussian Process latent variable model (m-SimGP), which learns the mapping functions between the intra-modal similarities and latent representation. We further propose multimodal distance-preserved similarity GPLVM (m-DSimGP) to preserve the intra-modal global similarity structure, and multimodal regularized similarity GPLVM (m-RSimGP) by encouraging similar/dissimilar points to be similar/dissimilar in the latent space. We propose m-DRSimGP, which combines the distance preservation in m-DSimGP and semantic preservation in m-RSimGP to learn the latent representation. The overall objective functions of the four models are solved by simple and scalable gradient decent techniques. They can be applied to various tasks to discover the nonlinear correlations and to obtain the comparable low-dimensional representation for heterogeneous modalities. On five widely used real-world data sets, our approaches outperform existing models on cross-modal content retrieval and multimodal classification.

Keywords:
Probabilistic latent semantic analysis Computer science Latent variable Artificial intelligence Representation (politics) Similarity (geometry) Gaussian process Multimodal learning Pattern recognition (psychology) Machine learning Gaussian Image (mathematics)

Metrics

49
Cited By
3.69
FWCI (Field Weighted Citation Impact)
79
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Harmonized Multimodal Learning with Gaussian Process Latent Variable Models

Guoli SongShuhui WangQingming HuangQi Tian

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2019 Vol: 43 (3)Pages: 858-872
JOURNAL ARTICLE

Supervised Gaussian process latent variable model based on Gaussian mixture model

Jiayuan ZhangZiqi ZhuJixin Zou

Journal:   2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC) Year: 2017 Pages: 124-129
© 2026 ScienceGate Book Chapters — All rights reserved.