JOURNAL ARTICLE

Precision-enhanced object recognition for semantic SLAM fusion

Abstract

Recent developments in deep learning techniques have accelerated the growth of robotic vision systems for many applications. This thesis examines several object detection algorithms that can be combined with simultaneous localization and mapping (SLAM) to produce an annotated 3D map of the local environment. After the initial review, Deep Object Pose Estimation (DOPE) is examined further and modified to enable object recognition using a depth map as input. This modification allows for shape-based object recognition in low visibility conditions such as darkness or camouflage. The method starts with a dual threshold to isolate a detection window followed by median and sharpness filtering to reduce the noise in the depth map. Moreover, a novel method to quantify the uncertainty of predictions is used to evaluate a threshold applied to key points used to enhance precision of the algorithm. Precision of the method surpasses the baseline across all frame rates tested. The method maintains high precision when the depth of the convolution neural network is reduced from 86 layers to 21 layers promoting an increase in frame rate from 8.6 Hz to 22.6 Hz. Additionally, five datasets are tested to determine the optimal size that should be used for training. Results show that 100k images is ideal, achieving the highest position accuracy of 96.8%, and that overfitting becomes apparent above 300k images as accuracy starts to drop. Finally, DOPE is combined with Real-Time Appearance-Based Mapping (RTAB-Map) and the enhanced precision method is compared against the original version to evaluate object placement on the map generated by SLAM. The original version yields an average error of 79.8 cm distance from the ground truth compared to 30.5 cm using the proposed method.

Keywords:
Object (grammar) Overfitting Noise (video) Object detection Position (finance) Pattern recognition (psychology) Frame (networking) Convolutional neural network Convolution (computer science) Frame rate

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.43
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Mycorrhizal Fungi and Plant Interactions
Life Sciences →  Agricultural and Biological Sciences →  Plant Science
Genomics and Phylogenetic Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Plant Pathogens and Fungal Diseases
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Cell Biology

Related Documents

BOOK-CHAPTER

DRL-SLAM: Enhanced Object Detection Fusion with Improved YOLOv8

Farooq UsmanChaoxia ShiYanqing Wang

IFIP advances in information and communication technology Year: 2024 Pages: 257-272
DISSERTATION

Semantic Slam: A New Paradigm for Object Recognition and Scene Reconstruction

Tommaso Cavallari

University:   AMS Dottorato Institutional Doctoral Theses Repository (University of Bologna) Year: 2017
© 2026 ScienceGate Book Chapters — All rights reserved.