In this paper, we propose a deep reinforcement learning(DRL) algorithm which combines Deep Deterministic Policy Gradient (DDPG) with expert demonstrations and supervised loss for decision making for autonomous driving. Training DRL agent with supervised learning is adopted to accelerate the exploration process and increase the stability. A supervised loss function is introduced in the algorithm to update the actor networks. In addition, reward construction is combined to make the training process more stable and efficient. The proposed algorithm is applied to a popular autonomous driving simulator called TORCS. The experimental results show that the training efficiency and stability are improved by utilizing our algorithm in autonomous driving.
Chenghao WangMinglei LiMuxiang ZhangMin Zhang
Haochen LiuZhiyu HuangJingda WuChen Lv
Yao WuLucai WangXiao LuYue WuHaojun Zhang
Feng WangDaiyin ZhuJiaan Zhang