JOURNAL ARTICLE

Exploring audio semantic concepts for event-based video retrieval

Abstract

Audio semantic concepts (sound events) play an important role in audio-based content analysis. Capturing semantic information effectively from the complex occurrence patterns of sound events in YouTube-quality videos is a challenging problem. This paper presents a novel framework for extracting semantic information from real-world videos under these complex conditions and evaluates it on the NIST multimedia event detection (MED) task. We calculate an occurrence confidence matrix of sound events and explore multiple strategies for generating clip-level semantic features from the matrix. We evaluate performance on the TRECVID2011 MED dataset. The proposed method outperforms a previous HMM-based system. Late fusion experiments with low-level features and a text feature (ASR) show that audio semantic concepts capture complementary information in the soundtrack.
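
The step from an occurrence confidence matrix to clip-level features can be illustrated with a minimal sketch. The abstract does not say which strategies the paper explores, so the max and mean pooling below are assumptions chosen for illustration, as are the function names, the example concepts, and the weighted-average form of late fusion. The matrix is assumed to have one row per audio segment and one column per sound event.

```python
from statistics import mean


def pool_clip_features(confidence, strategy="max"):
    """Collapse a segments-by-concepts confidence matrix into one clip-level vector.

    Hypothetical helper: the pooling strategies are illustrative assumptions,
    not the strategies reported in the paper.
    """
    if not confidence:
        raise ValueError("confidence matrix is empty")
    # Transpose so each entry holds all segment scores for one concept.
    columns = list(zip(*confidence))
    if strategy == "max":
        return [max(col) for col in columns]
    if strategy == "mean":
        return [mean(col) for col in columns]
    raise ValueError(f"unknown strategy: {strategy}")


def late_fusion(scores, weights):
    """Weighted average of per-system detection scores for one clip (assumed form)."""
    total = sum(weights)
    return sum(s * w for s, w in zip(scores, weights)) / total


# Hypothetical data: 3 segments x 2 concepts (say, "cheering" and "engine").
confidence = [
    [0.1, 0.8],
    [0.7, 0.2],
    [0.4, 0.5],
]
max_features = pool_clip_features(confidence, "max")
mean_features = pool_clip_features(confidence, "mean")

# Fuse a semantic-concept score with a low-level-feature score, equal weights.
fused = late_fusion([0.5, 1.0], [1, 1])
```

Max pooling keeps a concept's strongest evidence anywhere in the clip, while mean pooling reflects how persistently the concept occurs; which behaves better on real data is an empirical question this sketch does not answer.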

Keywords:
Event (particle physics); Task (project management); Semantics (computer science); Feature (linguistics); Semantic computing; Semantic similarity; Feature extraction; Quality (philosophy); NIST

