Deep reinforcement learning enhanced optimization algorithm for adaptive bitrate video streaming

Jianwei Zhang; Yang Han; Zengyu Cai; Yuan Feng; Liang Zhu

doi:10.1063/5.0277381

ScienceGate Book Chapters

JOURNAL ARTICLE

Deep reinforcement learning enhanced optimization algorithm for adaptive bitrate video streaming

Jianwei Zhang Yang Han Zengyu Cai Yuan Feng Liang Zhu

Year: 2025 Journal: AIP Advances Vol: 15 (7) Publisher: American Institute of Physics

DOI: 10.1063/5.0277381

Get Full-Text PDF Get Analytical Report

Abstract

Driven by the digital era, video traffic is growing rapidly, and users’ demand for high-quality video experiences is increasing. Adaptive bitrate (ABR) algorithms, as a key technology to optimize the transmission performance of video streams, play an important role in improving the efficiency of communication networks and the quality of experience (QoE). However, existing ABR algorithms rely too much on fixed control rules and simplified environment models, which make it difficult to provide optimal performance under complex and changing physical network environments (e.g., bandwidth fluctuations, delays, and network congestion). To address these challenges, this paper proposes a new ABR algorithm, the deep reinforcement learning enhanced ABR video stream optimization algorithm (PLL-ABR), which adopts proximal policy optimization as a reinforcement learning training framework and combines the dual clipping mechanism and deep neural networks (long short-term memory and local attention mechanism) to improve the training efficiency and policy parameter optimization capability. In addition, this paper also introduces a nonlinear entropy weight dynamic adjustment mechanism to balance exploration and utilization and enhance the stability of strategy optimization. By training the neural network model through reinforcement learning, PLL-ABR can dynamically select the future video block bitrate based on the physical state of the client video player and network environment information. Through comparison experiments with seven representative ABR algorithms, the method shows significant superiority under different physical network conditions and QoE factors (bitrate utilization, rebuffering penalty, and video smoothness penalty), with an average QoE improvement of 28.50%.

Keywords:

Computer science Reinforcement learning Video streaming Optimization algorithm Streaming current Algorithm Real-time computing Artificial intelligence Mathematical optimization Mathematics Materials science

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.27

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Image and Video Quality Assessment

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Coding and Compression Technologies

Physical Sciences → Computer Science → Signal Processing

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Deep reinforcement learning enhanced optimization algorithm for adaptive bitrate video streaming

Abstract

Metrics

Topics

Related Documents

Optimized bitrate ladders for adaptive video streaming with deep reinforcement learning

PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming

Fastconv: Fast Learning Based Adaptive BitRate Algorithm for Video Streaming

Deep reinforced bitrate ladders for adaptive video streaming

Deep Reinforcement Learning-based Bitrate Adaptations in Dynamic Adaptive Streaming over HTTP