JOURNAL ARTICLE

Multi-Objective Neural Architecture Search for Efficient and Fast Semantic Segmentation on Edge

Dou ZiWen, Dong Ye

Year: 2023
Journal: IEEE Transactions on Intelligent Vehicles
Vol: 9 (1)
Pages: 1346-1357
Publisher: Institute of Electrical and Electronics Engineers

Abstract

Deploying efficient and fast semantic segmentation networks on edge computing platforms in real-world environments is both desirable and challenging. To address this challenge, we propose RealtimeSeg, one of the first semantic segmentation models discovered by neural architecture search (NAS) that runs at real-time speed on edge devices. Our search uses three objectives: inference time on the target edge device, FLOPs (floating-point operations), and semantic segmentation accuracy, which makes it a multi-objective NAS. Specifically, the multi-objective NAS loss function is decomposed into three sub-objective loss functions, which are weighted and summed. We employ knowledge distillation during the search to further improve the accuracy, latency, and FLOPs of the discovered architecture. The result of this search is the RealtimeSeg model. Finally, we use NVIDIA TensorRT to accelerate RealtimeSeg and deploy the accelerated model on the target platform for real-time semantic segmentation. RealtimeSeg can be obtained within 1.5 days on a single NVIDIA Titan XP GPU. Experimental results show that RealtimeSeg achieves 71.7% mIoU at 25.25 FPS on the NVIDIA Jetson NX with an input resolution of 1024 × 2048. RealtimeSeg also requires only 1.52 G FLOPs, 17-18× fewer than state-of-the-art (SOTA) methods. In realistic scenarios, RealtimeSeg has been successfully deployed on edge computing platforms, where it delivers efficient and fast semantic segmentation.
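
The abstract says only that the NAS loss is split into three sub-objective losses that are weighted and summed; it does not give the weights, the normalization, or the form of each term. The sketch below shows one plausible shape of such a weighted sum. Every name, default weight, and budget value in it is a hypothetical placeholder chosen for illustration, not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectiveWeights:
    """Hypothetical weights for the three sub-objectives (not from the paper)."""
    accuracy: float = 1.0
    latency: float = 0.1
    flops: float = 0.1


def fps_to_frame_time_ms(fps: float) -> float:
    """Convert a frame rate into the per-frame time budget in milliseconds."""
    return 1000.0 / fps


def weighted_sum_loss(
    seg_loss: float,
    latency_ms: float,
    flops_g: float,
    weights: ObjectiveWeights = ObjectiveWeights(),
    latency_budget_ms: float = 40.0,  # assumed budget, roughly 25 FPS
    flops_budget_g: float = 2.0,      # assumed budget in G FLOPs
) -> float:
    """Combine three sub-objective losses into one scalar by a weighted sum.

    Latency and FLOPs are divided by assumed budgets so that all three
    terms are dimensionless before they are added together.
    """
    latency_term = latency_ms / latency_budget_ms
    flops_term = flops_g / flops_budget_g
    return (
        weights.accuracy * seg_loss
        + weights.latency * latency_term
        + weights.flops * flops_term
    )


if __name__ == "__main__":
    # 25.25 FPS is the frame rate reported in the abstract.
    frame_time = fps_to_frame_time_ms(25.25)
    # seg_loss=0.5 is an arbitrary illustrative value.
    loss = weighted_sum_loss(seg_loss=0.5, latency_ms=frame_time, flops_g=1.52)
    print(f"frame time: {frame_time:.2f} ms, combined loss: {loss:.4f}")
```

In a differentiable search, the latency and FLOPs terms would typically be estimated per candidate operation (for example, from a lookup table measured on the target device) rather than passed in as fixed numbers; the scalar version here only illustrates how the three terms combine.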

Keywords:
Computer science; FLOPS; Segmentation; Inference; Edge device; Frame rate; Enhanced Data Rates for GSM Evolution; Architecture; Latency (audio); Artificial intelligence; Artificial neural network; Edge computing; Parallel computing; Computer architecture; Computer engineering; Cloud computing

Metrics

Cited By: 7
FWCI (Field Weighted Citation Impact): 1.27
Refs: 56
Citation Normalized Percentile: 0.77

Topics

Advanced Neural Network Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences → Computer Science → Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition