JOURNAL ARTICLE

Multimodal learning with feature fusion transformer for image captioning

Wenqing ZhuFeiniu Yuan

Year: 2025 Journal:   Displays Vol: 90 Pages: 103126-103126   Publisher: Elsevier BV
Keywords:
Closed captioning Computer science Transformer Artificial intelligence Feature (linguistics) Computer vision Image (mathematics) Pattern recognition (psychology) Engineering Linguistics Electrical engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
82
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.