Tiny-VPS: Tiny Video Panoptic Segmentation Standing on the Shoulder of Giant-VPS

Qingfeng Liu; Mostafa El‐Khamy; Kee-Bong Song

doi:10.1109/ojsp.2025.3581840

ScienceGate Book Chapters

JOURNAL ARTICLE

Tiny-VPS: Tiny Video Panoptic Segmentation Standing on the Shoulder of Giant-VPS

Qingfeng Liu Mostafa El‐Khamy Kee-Bong Song

Year: 2025 Journal: IEEE Open Journal of Signal Processing Vol: 6 Pages: 803-814 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/ojsp.2025.3581840

Get Full-Text PDF Get Analytical Report

Abstract

Video Panoptic Segmentation (VPS) is the most challenging video segmentation task, as it requires accurate labeling of every pixel in each frame, as well as identifying the multiple instances and tracking them across frames. In this paper, we explore state-of-the-art solutions for VPS at both the giant model regime for offline or server processing and the tiny model regime for online or edge computing. We designed Giant-VPS which achieved the first place solution in the 2024 Pixel Level Video Understanding in the Wild (PVUW) challenge. Our Giant-VPS builds on top of MinVIS and deploys the DINOv2-giant vision foundation model with a carefully designed ViT (Vision Transformer) adapter. For mobile and edge devices, we designed the Tiny-VPS model and show that our novel ViT-adapter distillation from the Giant-VPS model can further improve the accuracy of Tiny-VPS. Our Tiny-VPS is the first, in the sub-20 GFLOPS regime, to achieve competitive accuracy on VPS and VSS (Video Semantic Segmentation) benchmarks.

Keywords:

Panopticon Computer vision Computer science Artificial intelligence Sociology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.29

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Retinal Imaging and Analysis

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Industrial Vision Systems and Defect Detection

Physical Sciences → Engineering → Industrial and Manufacturing Engineering

Tiny-VPS: Tiny Video Panoptic Segmentation Standing on the Shoulder of Giant-VPS

Abstract

Metrics

Topics

Related Documents

Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation

VPS

Tiny giant

Tiny giant

Videotext — Prüfzeilen — VPS (Video-Programm-System)