This paper presents an efficient and robust automatic process for large-scale sports video analysis. The proposed system firstly identifies the genre of the query video, and then accomplishes the interesting event detection task. The significance of this framework is its automatic characteristic in testing with minimum human involvement in training, as well as the scalability and expansibility in dealing with a large-scale dataset. Domain-knowledge independent local features are extracted from an input video sequence and a histogram based distribution representation is created using the bag-of-visual-words (BoW) model. In genre categorization, k-nearest neighbor (k-NN) classifiers with various dissimilarity measures are assessed and evaluated analytically. For the event detection, a hidden conditional random field (HCRF) structured prediction model is utilized. Overall, this framework demonstrates the efficiency and accuracy in processing voluminous data from sports collection and achieves various tasks in video analysis. It also demonstrates a potential technology transformation from the "laboratory bench" to commercial applications.
Yuan DongJiwei ZhangXiaofu ChangJian Zhao
Lingfang LiNing ZhangLing‐Yu DuanQingming HuangJun DuLing Guan
Xun YuanWei LaiTao MeiXian‐Sheng HuaXiuqing WuShipeng Li