Adaptive bitrate (ABR) selection plays a crucial role in ensuring a satisfactory quality of experience (QoE) in video streaming applications. Recently, the authors of [1] proposed Pensieve, an ABR algorithm based on asynchronous advantage actor-critic (A3C), an on-policy reinforcement learning (RL) method, and showed that it achieves higher QoE than traditional ABR methods. However, Pensieve is sample inefficient and sensitive to random seeds and hyperparameters. In this paper, we present soft actor-critic based deep reinforcement learning for adaptive bitrate streaming (SAC-ABR), an off-policy method that improves QoE over existing state-of-the-art ABR algorithms under a wide variety of network conditions. Built on the maximum entropy RL framework, SAC-ABR maximizes policy entropy alongside the expected reward, thereby achieving a better exploration-exploitation tradeoff than on-policy ABR methods. We present the overall design of SAC-ABR together with its training and testing results, and evaluate its performance against other state-of-the-art ABR algorithms. Our results show that SAC-ABR provides up to 27.42% higher average QoE than Pensieve and substantially higher QoE than traditional fixed-rule based ABR algorithms.
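For reference, the maximum entropy RL objective underlying soft actor-critic (SAC) can be written in its standard form below; this is the generic formulation, not the paper's own notation, with temperature parameter $\alpha$ trading off reward maximization against policy entropy:

```latex
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
\left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right]
```

Here $\rho_\pi$ denotes the state-action marginal induced by the policy $\pi$, and $\mathcal{H}$ is the entropy; setting $\alpha = 0$ recovers the conventional expected-return objective used by on-policy methods such as A3C.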
Mandan Naresh, Paresh Saxena, Manik Gupta