BOOK-CHAPTER

LongVLM: Efficient Long Video Understanding via Large Language Models

Yuetian WengMingfei HanHaoyu HeXiaojun ChangBohan Zhuang

Year: 2024 Lecture notes in computer science Pages: 453-470   Publisher: Springer Science+Business Media
Keywords:
Computer science Programming language Computer graphics (images) Natural language processing

Metrics

19
Cited By
17.14
FWCI (Field Weighted Citation Impact)
47
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Multimodal Large Language Models for Video Understanding

Yi WangJiashuo YuYinan HeLimin WangYu Qiao

Advances in computer vision and pattern recognition Year: 2025 Pages: 59-91
JOURNAL ARTICLE

Large Language Models (LLMs) for Video Understanding

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2024 Vol: 34 (10)Pages: 9758-9758
JOURNAL ARTICLE

Large Language Models (LLMs) for Video Understanding

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2024 Vol: 34 (10)Pages: 10522-10522
JOURNAL ARTICLE

Large Language Models (LLMs) for Video Understanding

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2024 Vol: 34 (11)Pages: 12098-12098
© 2026 ScienceGate Book Chapters — All rights reserved.