JOURNAL ARTICLE

Video Semantic Segmentation Network with Low Latency Based on Deep Learning

Channappa Gowda D VR. Kanagavalli

Year: 2023 Journal:   International Journal of Communication Networks and Information Security (IJCNIS) Vol: 15 (3)Pages: 209-225   Publisher: Iran University of Science and Technology

Abstract

Recently, new advances in deep learning algorithms have yielded some fascinating results in the field of computer vision technology. As a result, it can now perform activities that formerly required the use of human vision and the brain. Classification, object identification, and semantic segmentation have all seen substantial advancements in deep learning architecture in the last few years. For still images and movies, there has been a major advancement in the field of semantic segmentation. In practical uses like autonomous vehicles, segmenting semantic video continues to be difficult due to high-performance standards, the high cost of convolutional neural networks (CNNs), and the significant need for low latency. An effective machine-learning environment will be developed to meet the performance and latency challenges outlined above. The use of deep learning architectures like SegNet and FlowNet2.0 on the CamVid dataset enables this environment to conduct pixel-wise semantic segmentation of video properties while maintaining low latency. As a result, it is ideally suited for real-world applications since it takes advantage of both SegNet and FlowNet topologies. The decision network determines whether an image frame should be processed by a segmentation network or an optical flow network based on the expected confidence score. In conjunction with adaptive scheduling of the key frame approach, this technique for decision-making can help to speed up the procedure. Using the ResNet50 SegNet model, a mean Intersection on Union (IoU) of "54.27 percent" and an average frame per second of "19.57" were observed. Aside from decision network and adaptive key frame sequencing, it was discovered that FlowNet2.0 increased the frames processed per second9(fps) to "30.19" on GPU with a mean IoU of "47.65%". Because the GPU was utilized "47.65%" of the time, this resulted. There has been an increase in the speed of the Video semantic segmentation network without sacrificing quality, as demonstrated by this improvement in performance.

Keywords:
Computer science Segmentation Deep learning Artificial intelligence Convolutional neural network Machine learning Latency (audio) Pattern recognition (psychology) Computer vision

Metrics

1
Cited By
0.18
FWCI (Field Weighted Citation Impact)
52
Refs
0.47
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Video semantic segmentation with low latency

Chinnappa Gowda D. V.R. Kanagavalli

Journal:   TELKOMNIKA (Telecommunication Computing Electronics and Control) Year: 2024 Vol: 22 (5)Pages: 1147-1147
JOURNAL ARTICLE

Low-Latency Video Semantic Segmentation

Yule LiJianping ShiDahua Lin

Year: 2018 Pages: 5997-6005
JOURNAL ARTICLE

Deep Video Dehazing With Semantic Segmentation

Wenqi RenJingang ZhangXiangyu XuLin MaXiaochun CaoGaofeng MengWei Liu

Journal:   IEEE Transactions on Image Processing Year: 2018 Vol: 28 (4)Pages: 1895-1908
© 2026 ScienceGate Book Chapters — All rights reserved.