JOURNAL ARTICLE

Audio-Based Semantic Concept Classification for Consumer Video

Keansub LeeDaniel P. W. Ellis

Year: 2009 Journal:   IEEE Transactions on Audio Speech and Language Processing Vol: 18 (6)Pages: 1406-1416   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen for their usefulness to users, viability of automatic detection and of annotator labeling, and sufficiency of representation in available video collections. A set of 1873 videos from real users has been annotated with these concepts. Starting with a basic representation of each video clip as a sequence of mel-frequency cepstral coefficient (MFCC) frames, we experiment with three clip-level representations: single Gaussian modeling, Gaussian mixture modeling, and probabilistic latent semantic analysis of a Gaussian component histogram. Using such summary features, we produce support vector machine (SVM) classifiers based on the Kullback-Leibler, Bhattacharyya, or Mahalanobis distance measures. Quantitative evaluation shows that our approaches are effective for detecting interesting concepts in a large collection of real-world consumer video clips.

Keywords:
Computer science Mel-frequency cepstrum Artificial intelligence Bhattacharyya distance Mixture model Mahalanobis distance Support vector machine Pattern recognition (psychology) Set (abstract data type) Representation (politics) Histogram Gaussian Speech recognition Natural language processing Feature extraction Image (mathematics)

Metrics

90
Cited By
5.91
FWCI (Field Weighted Citation Impact)
34
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.