JOURNAL ARTICLE

Multi-modal deep distance metric learning

Seyed Mahdi RoostaiyanEhsan ImaniMahdieh Soleymani Baghshah

Year: 2017 Journal:   Intelligent Data Analysis Vol: 21 (6)Pages: 1351-1369   Publisher: IOS Press

Abstract

In many real-world applications, data contain heterogeneous input modalities (e.g., web pages include images, text, etc.). Moreover, data such as images are usually described using different views (i.e. different sets of features). Learning a distance metric or similarity measure that originates fr om all input modalities or views is essential for many tasks such as content-based retrieval ones. In these cases, similar and dissimilar pairs of data can be used to find a better representation of data in which similarity and dissimilarity constraints are better satisfied. In this paper, we incorporate supervision in the form of pairwise similarity and/or dissimilarity constraints into multi-modal deep networks to combine different modalities into a shared latent space. Using properties of multi-modal data, we design multi-modal deep networks and propose a pre-training algorithm for these networks. In fact, the proposed network has the ability of learning intra- and inter-modal high-order statistics from raw features and we control its high flexibility via an efficient multi-stage pre-training phase corresponding to properties of multi-modal data. Experimental results show that the proposed method outperforms recent methods on image retrieval tasks.

Keywords:
Computer science Modal Similarity (geometry) Metric (unit) Artificial intelligence Pairwise comparison Representation (politics) Modalities Deep learning Raw data Pattern recognition (psychology) Flexibility (engineering) Machine learning Data mining Image (mathematics) Mathematics

Metrics

6
Cited By
0.25
FWCI (Field Weighted Citation Impact)
50
Refs
0.58
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Modal Distance Metric Learning

Pengtao XieEric P. Xing

Journal:   Neurorehabilitation and neural repair Year: 2013 Vol: 18 (3)Pages: 1806-1812
JOURNAL ARTICLE

Operational Multi-Modal Distance Metric Learning to Image Reclamation

L LavanyaChebrolu Ujwala PavaniGadchanda VineethBorada Lavanya

Journal:   International Journal of Engineering & Technology Year: 2018 Vol: 7 (2.32)Pages: 405-405
JOURNAL ARTICLE

Multi-level Distance Regularization for Deep Metric Learning

Yonghyun KimWonpyo Park

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (3)Pages: 1827-1835
© 2026 ScienceGate Book Chapters — All rights reserved.