JOURNAL ARTICLE

A Multi-level Feature Fusion Network for Real-time Semantic Segmentation

Abstract

Recently, convolutional neural networks (CNNs) have made a big splash in the field of semantic segmentation, achieving very high segmentation accuracy. In order to meet the requirement of real-time inference, existing methods increase inference speed by reducing the image resolution, leading to lower segmentation performance. We propose in this work a multi-level feature fusion network referred to as MLFFNet that utilizes a novel deep neural network architecture for efficient and real-time semantic segmentation. To strike a balance between speed and performance, MLFFNet substantially reduces the computational complexity by using a lightweight feature extraction network to implement feature reuse through multi-level feature fusion. In addition, MLFFNet targets at excellent segmentation performance through a channel attention mechanism and dilated convolutions with different rates. Specifically, MLFFNet achieves 72.6% mIoU on Cityscapes with the speed of 68.3 FPS on one NVIDIA Titan X card, which is significantly faster than the existing methods with comparable performance.

Keywords:
Computer science Segmentation Convolutional neural network Artificial intelligence Feature extraction Inference Feature (linguistics) Pattern recognition (psychology) Image segmentation Artificial neural network Computer vision

Metrics

3
Cited By
0.21
FWCI (Field Weighted Citation Impact)
51
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.