Bai Yang XIANG, BoKai LI, Huaijuan Zang, ZeLiang ZHAO, Shu ZHAN
Facial micro-expressions in video are hard to recognize because their short duration and small motion amplitude make features difficult to extract. To better combine the temporal and spatial information in the video, the model is divided into a local attention module, a global attention module, and a temporal module. First, the local attention module crops the key regions and, after processing, sends them to a network with channel attention. Next, the global attention module applies random erasure that avoids the key regions and sends the data to a network with spatial attention. Then the temporal module processes the micro-expression occurrence frames and sends them to a network with a temporal shift module and spatial attention. Finally, after feature fusion, the classification result is obtained through three fully connected layers. Experiments on the CASME II dataset with five-fold cross-validation give an average accuracy of 76.15% and an unweighted F1 score of 0.691, an improvement over mainstream algorithms.
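The two reported metrics can be illustrated with a minimal sketch. It assumes that "unweighted F1" means the macro-averaged F1 score (the per-class F1 averaged with equal weight per class), which is the usual reading in micro-expression work but is not spelled out in the abstract. The function names and the emotion labels below are hypothetical and are not taken from the paper or from the CASME II annotations.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def unweighted_f1(y_true, y_pred):
    """Macro-averaged F1 (assumed meaning of "unweighted F1").

    Each class contributes equally, regardless of how many samples it has.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    per_class_f1 = []
    for c in sorted(set(y_true)):
        denom = 2 * tp[c] + fp[c] + fn[c]
        per_class_f1.append(2 * tp[c] / denom if denom else 0.0)
    return sum(per_class_f1) / len(per_class_f1)


# Illustrative labels only (hypothetical, not real CASME II data).
y_true = ["happy", "happy", "disgust", "surprise"]
y_pred = ["happy", "disgust", "disgust", "surprise"]

acc = accuracy(y_true, y_pred)
uf1 = unweighted_f1(y_true, y_pred)
```

Under five-fold cross-validation, these values would be computed on each held-out fold and then averaged. Because class frequencies in micro-expression datasets are typically uneven, the unweighted F1 score can differ noticeably from the accuracy.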
Wei Song, Pei Yang, Ningning Liu, Guosheng Yang, Fuhong Lin
Xianghui Liu, Zhongdong Wu, Chunyang Tang
Zhiqiang Bao, Luping Yan, Mei Wang
Guocheng Hao, L. Bu, Mengyuan Lu, Hui Liu, Gang Liu, Juan Guo