JOURNAL ARTICLE

Event-centric multi-modal fusion method for dense video captioning

Keywords:
Computer science; Closed captioning; Event (particle physics); Benchmark (surveying); Exploit; Fuse (electrical); ENCODE; Modal; Artificial intelligence; Process (computing); Machine learning; Natural language processing; Image (mathematics)

Metrics

Cited By: 23
FWCI (Field Weighted Citation Impact): 1.84
Refs: 81
Citation Normalized Percentile: 0.87

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
