BOOK-CHAPTER

Cross-modal Deep Learning Applications: Audio-Visual Retrieval

Keywords:
Computer science Modal Similarity (geometry) Subspace topology Artificial intelligence Deep learning Modalities Feature learning Artificial neural network Feature (linguistics) Representation (politics) Pattern recognition (psychology) Speech recognition Image (mathematics)

Metrics

6
Cited By
2.79
FWCI (Field Weighted Citation Impact)
27
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

DCLMA: Deep correlation learning with multi-modal attention for visual-audio retrieval

Jiwei ZhangHirotaka Hachiya

Journal:   Machine Learning with Applications Year: 2025 Vol: 21 Pages: 100695-100695
BOOK-CHAPTER

Cross-Modal Retrieval Using Deep Learning

Shaily MalikNikhil BhardwajRahul BhardwajSaurabh Kumar

Lecture notes in networks and systems Year: 2022 Pages: 725-734
© 2026 ScienceGate Book Chapters — All rights reserved.