JOURNAL ARTICLE

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

Abstract

Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose estimation in the presence of occlusion. More recently researchers have shown improvements by learning object fragments as segmentation. In this work, we present a discrete descriptor, which can represent the object surface densely. By incorporating a hierarchical binary grouping, we can encode the object surface very efficiently. Moreover, we propose a coarse to fine training strategy, which enables fine-grained correspondence prediction. Finally, by matching predicted codes with object surface and using a PnP solver, we estimate the 6DoF pose. Results on the public LM-O and YCB-V datasets show major improvement over the state of the art w.r.t. ADD(-S) metric, even surpassing RGB-D based methods in some cases.

Keywords:
Artificial intelligence Computer science Pose Object (grammar) Computer vision Metric (unit) Segmentation Surface (topology) Solver Pattern recognition (psychology) Matching (statistics) Similarity (geometry) Encoding (memory) ENCODE Binary number 3D pose estimation RGB color model Image (mathematics) Mathematics

Metrics

149
Cited By
61.16
FWCI (Field Weighted Citation Impact)
88
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.