JOURNAL ARTICLE

Hierarchical cross-modal contextual attention network for visual grounding

Xin Xu, Gang Lv, Yining Sun, HU Yu-xia, Fudong Nian

Year: 2023
Journal: Multimedia Systems
Vol: 29 (4)
Pages: 2073-2083
Publisher: Springer Science+Business Media
Keywords: Computer science, Modal, Transformer, Sentence, Modality (human–computer interaction), Artificial intelligence, Encoder, Natural language processing, Task (project management), Visualization

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 0.73
Refs: 43
Citation Normalized Percentile: 0.65

Topics

Multimodal Machine Learning Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Visual Attention and Saliency Detection
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition