JOURNAL ARTICLE

Real-time grasp detection using convolutional neural networks

Abstract

We present an accurate, real-time approach to robotic grasp detection based on convolutional neural networks. Our network performs single-stage regression to graspable bounding boxes without using standard sliding window or region proposal techniques. The model outperforms state-of-the-art approaches by 14 percentage points and runs at 13 frames per second on a GPU. Our network can simultaneously perform classification so that in a single step it recognizes the object and finds a good grasp rectangle. A modification to this model predicts multiple grasps per object by using a locally constrained prediction mechanism. The locally constrained model performs significantly better, especially on objects that can be grasped in a variety of ways.

Keywords:
GRASP Computer science Bounding overwatch Convolutional neural network Artificial intelligence Rectangle Object (grammar) Object detection Sliding window protocol Pattern recognition (psychology) Artificial neural network Computer vision Window (computing) Mathematics

Metrics

888
Cited By
42.42
FWCI (Field Weighted Citation Impact)
31
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Soft Robotics and Applications
Physical Sciences →  Engineering →  Biomedical Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.