JOURNAL ARTICLE

Toward 6 DOF Object Pose Estimation with Minimum Dataset

Abstract

In this research, we propose a method for estimating the 6 DOF pose (3D orientation and position) of an object based on convolutional neural networks (CNNs). We propose RotationCNN, which predicts the 3D orientation of the object. The position of the object is estimated using an object detection CNN that predicts the class of the object and a bounding box around it. Unlike methods that train CNNs on a large-scale database, the proposed system is trained with a minimal dataset obtained in a local environment similar to the one where the robot is used. With the proposed semi-automated dataset collection techniques, based on a web camera and AR markers, users in different environments will be able to train a network suited to their own environment relatively easily and quickly. We believe that this approach is suitable for practical robotic applications. The results on 3D orientation prediction using RotationCNN show an average error of 18.9 degrees, which we empirically found to be low enough as an initial solution to successfully run the iterative closest point (ICP) algorithm, which uses depth data to refine the pose obtained with the CNNs. The effectiveness of the proposed method is validated by applying it to object grasping by a robot manipulator.
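
The abstract reports an average orientation error in degrees but does not say how that error is measured. As a minimal sketch, assuming the error is the geodesic angle between the predicted and ground-truth rotation matrices (a common choice, not confirmed by the abstract), it could be computed as follows. The function names rotation_z, rotation_error_deg, and mean_rotation_error_deg are hypothetical and are not taken from the paper.

```python
import math


def rotation_z(deg):
    """Build a 3x3 rotation matrix for a rotation of `deg` degrees about the z axis."""
    t = math.radians(deg)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def rotation_error_deg(R_pred, R_true):
    """Geodesic angle (degrees) between two rotation matrices.

    Assumption: this is only one plausible error metric; the abstract
    does not specify which one the authors used.
    """
    # trace(R_pred^T @ R_true) equals the sum of element-wise products.
    trace = sum(R_pred[i][j] * R_true[i][j] for i in range(3) for j in range(3))
    cos_theta = (trace - 1.0) / 2.0
    # Clamp to [-1, 1] to guard against floating-point round-off.
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


def mean_rotation_error_deg(pairs):
    """Average the geodesic error over (predicted, ground-truth) matrix pairs."""
    errors = [rotation_error_deg(R_pred, R_true) for R_pred, R_true in pairs]
    return sum(errors) / len(errors)


if __name__ == "__main__":
    # Illustrative values only; these are not results from the paper.
    pairs = [
        (rotation_z(10.0), rotation_z(0.0)),
        (rotation_z(50.0), rotation_z(30.0)),
    ]
    print(mean_rotation_error_deg(pairs))
```

Under this metric, a prediction that is off by a pure rotation of 18.9 degrees about any axis would score exactly 18.9 degrees, which is the kind of initial misalignment the abstract says ICP can still refine.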

Keywords:
Artificial intelligence; Computer science; Orientation (vector space); Pose; Minimum bounding box; Convolutional neural network; Computer vision; Object (grammar); Position (finance); 3D pose estimation; Iterative closest point; Robot; Bounding overwatch; Object detection; Point (geometry); Pattern recognition (psychology); Point cloud; Image (mathematics); Mathematics

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 0.66
Refs: 20
Citation Normalized Percentile: 0.69

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering