Tuan Hue ThiLi ChengJian ZhangLi WangShin’ichi Satoh
In this paper, we present a robust framework for action recognition in video, that is able to perform competitively against the state-of-the-art methods, yet does not rely on sophisticated background subtraction preprocess to remove background features. In particular, we extend the Implicit Shape Modeling (ISM) of [10] for object recognition to 3D to integrate local spatiotemporal features, which are produced by a weakly supervised Bayesian kernel filter. Experiments on benchmark datasets (including KTH and Weizmann) verifies the effectiveness of our approach.
Haomin YanRuize HanWei FengJiewen ZhaoSong Wang
Naiyuan FanMing KongJing HuangBingdi ChenQiang Zhu
Hakan BoyrazSyed Zain MasoodBaoyuan LiuMarshall F. Tappen
Limin WangYuanjun XiongDahua LinLuc Van Gool