In this paper, we focus on the problem of clustering faces in videos. Different from traditional clustering on a collection of facial images, a video provides some inherent benefits: faces from a face track must belong to the same person and faces from a video frame can not be the same person. These benefits can be used to enhance the clustering performance. More precisely, we convert the above benefits into must-link and cannot-link constraints. These constraints are further effectively incorporated into our novel algorithm, Video Face Clustering via Constrained Sparse Representation (CS-VFC). The CS-VFC utilizes the constraints in two stages, including sparse representation and spectral clustering. Experiments on real-world videos show the improvements of our algorithm over the state-of-the-art methods.
Xiaochun CaoChangqing ZhangChengju ZhouHuazhu FuHassan Foroosh
Vicky KalogeitonAndrew Zisserman
Xiaobing PeiYue GuanPingping CaiTao Dong
Imran NaseemRoberto TogneriMohammed Bennamoun