JOURNAL ARTICLE

Visual and audio segmentation for video streams

Abstract

In order to achieve object detection/location and tracking in video streams, this paper describes the video scene segmentation using visual and audio cues. For visual segmentation, frame distance and two techniques for speed-up are introduced. For audio segmentation, Cepstrum Flux and Block Cepstrum Flux parameters are introduced. Furthermore, experimental results of segmentation in both cases are described.

Keywords:
Computer science Segmentation Computer vision Artificial intelligence Cepstrum Audio visual Frame (networking) Speech recognition Block (permutation group theory) Image segmentation Object (grammar) Scale-space segmentation Video tracking Pattern recognition (psychology) Multimedia Mathematics Telecommunications

Metrics

7
Cited By
1.14
FWCI (Field Weighted Citation Impact)
2
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.