JOURNAL ARTICLE

PoseDiffusion: A Coarse-to-Fine Framework for Unseen Object 6-DoF Pose Estimation

Jiaming ZhouQing ZhuYaonan WangMingtao FengChengzhong WuXuebing LiuJianan HuangAjmal Mian

Year: 2024 Journal:   IEEE Transactions on Industrial Informatics Vol: 20 (9)Pages: 11127-11138   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Accurately estimating the six-degrees of freedom (DoF) pose of unseen objects is crucial for successful robotic manipulation in industrial automation. Some existing methods for this task rely on prior knowledge of individual objects, i.e., the model must be trained on the exact object instance or object category. Others perform unseen object pose estimation but are limited in their feature learning and pose refinement ability. To address these problems, we propose an unseen object pose estimation method that follows a coarse-to-fine framework and leverages the powerful learning ability of diffusion models. We introduce a diffusion model for generating object poses, and conduct a comparison between the generated poses and the original pose to determine the optimal one. We design a novel pose estimation module to provide coarse poses for the PoseDiffusion. This module comprises two feature extraction modules that extract global and masked features. In addition, we propose a strategy to estimate the pose by comparing the similarity between rendered and query poses. The renderings of an unseen object from various viewpoints are generated from its computer-aided design (CAD) model. Our method requires a CAD model of the unseen object only during inference, a scenario well suited to industrial applications. Experimental evaluation on benchmark datasets demonstrates that the proposed framework outperforms existing approaches, achieving state-of-the-art performance in six-DoF object pose estimation.

Keywords:
Pose Computer science Artificial intelligence 3D pose estimation Object (grammar) Benchmark (surveying) Articulated body pose estimation Inference Computer vision Feature (linguistics) Feature extraction Machine learning Object detection Similarity (geometry) Object model Pattern recognition (psychology) Image (mathematics)

Metrics

5
Cited By
3.18
FWCI (Field Weighted Citation Impact)
32
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Image and Object Detection Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

6 DOF Object Pose Estimation using CNN

Kota SuzuiYusuke YoshiyasuEiichi Yoshida

Journal:   The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) Year: 2018 Vol: 2018 (0)Pages: 2A2-K16
JOURNAL ARTICLE

A probabilistic framework for object search with 6-DOF pose estimation

Jeremy MaTimothy H. ChungJoel W. Burdick

Journal:   The International Journal of Robotics Research Year: 2011 Vol: 30 (10)Pages: 1209-1228
JOURNAL ARTICLE

Object 6-DoF pose estimation using auxiliary learning

Minjia CHENShaoyan GaiFeipeng DaJian Yu

Journal:   Optics and Precision Engineering Year: 2024 Vol: 32 (6)Pages: 901-914
JOURNAL ARTICLE

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

Yongzhi SuMahdi SalehTorben FetzerJason RambachNassir NavabBenjamin BusamDidier StrickerFederico Tombari

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 6728-6738
© 2026 ScienceGate Book Chapters — All rights reserved.