JOURNAL ARTICLE

Position embedding fusion on transformer for dense video captioning

Sixuan YangPengjie TangHanli WangQinyu Li

Year: 2020 Journal:   Developments of Artificial Intelligence Technologies in Computation and Robotics Pages: 792-799
Keywords:
Closed captioning Transformer Embedding Computer science Position (finance) Fusion Computer vision Artificial intelligence Electrical engineering Engineering Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Improving Dense Video Captioning with a Transformer-based Multimodal Fusion Model

Yixuan LiuZiwei ZhouShen HuiHaoyuan MaHong‐Ju LiZhibo Zhang

Journal:   Journal of industry and engineering management. Year: 2024 Vol: 2 (4)Pages: 33-40
JOURNAL ARTICLE

Accelerated masked transformer for dense video captioning

Yu ZhouNanjia Han

Journal:   Neurocomputing Year: 2021 Vol: 445 Pages: 72-80
BOOK-CHAPTER

Transformer and LLM-Based Captioning Module for Dense Video Captioning

Dvijesh BhattPriyank Thakkar

Lecture notes in networks and systems Year: 2025 Pages: 449-459
JOURNAL ARTICLE

Parallel Pathway Dense Video Captioning With Deformable Transformer

Wangyu ChoiJiasi ChenJongwon Yoon

Journal:   IEEE Access Year: 2022 Vol: 10 Pages: 129899-129910
© 2026 ScienceGate Book Chapters — All rights reserved.