Chen WeiLuming LiangYuelong ZhaoShu Chen
Reconstructing three-dimensional (3-D) poses from matched feature correspondences is widely used in 3-D object tracking. The precision of correspondence matching plays a major role in the pose reconstruction. Without prior knowledge of the perspective camera model, state-of-the-art methods only deal with two-dimensional (2-D) planar affine transforms. An interest point’s detector and descriptor [perspective scale invariant feature transform (SIFT)] is proposed to overcome the side effects of viewpoint changing, i.e., our detector is invariant to viewpoint changing. Perspective SIFT is detected by the SIFT approach, where the sample region is determined by projecting the original sample region to the image plane based on the established camera model. An iterative algorithm then modifies the pose of the tracked object and it generally converges to a 3-D perspective invariant point. The pose of the tracked object is finally estimated by the combination of template warping and perspective SIFT correspondences. Thorough evaluations are performed on two public databases, the Biwi Head Pose dataset and the Boston University dataset. Comparisons illustrate that the proposed keypoint’s detector largely improves the tracking performance.
Xinli YangYang HuixianJinfang ZengHong Yu
Jeonggeun JoGeonsu YoonDaehwan KoE. C. LeeS. KimMinjun Choi