BOOK-CHAPTER

Multimodal Interaction Fusion Network Based on Transformer for Video Captioning

Hui XuPengpeng ZengAbdullah Aman Khan

Year: 2022 Communications in computer and information science Pages: 21-36   Publisher: Springer Science+Business Media
Keywords:
Closed captioning Computer science Transformer Encoder Artificial intelligence Benchmark (surveying) Machine learning Natural language processing Image (mathematics)

Metrics

2
Cited By
0.33
FWCI (Field Weighted Citation Impact)
34
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.