In recent years, reinforcement learning has attracted considerable research attention, particularly since AlphaGo. In autonomous mobile robotics, it can be applied to the mapless navigation problem: the robot completes its assigned tasks and operates in different environments without a map or a ready-made path plan. However, for reinforcement learning in general, and for mapless navigation based on it in particular, the balance between exploitation and exploration must be considered carefully. Specifically, how well the agent (the robot) can discover and try actions in a given working environment plays a significant role in the performance of the learned policy. Several popular approaches address this problem by injecting noise during training of the convolutional neural network. Because of its advantages over these alternatives, we adopt the Boltzmann policy approach. It helps the robot explore complex environments more thoroughly and yields a better-optimized policy.
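As an illustration of the Boltzmann policy mentioned above, the following is a minimal sketch of Boltzmann (softmax) action selection over a set of action-value estimates. It is not the implementation used in this work: the function names, the `temperature` parameter, and the example values are assumptions made for illustration only.

```python
import math
import random


def boltzmann_probabilities(q_values, temperature=1.0):
    """Return softmax probabilities over action values (hypothetical helper)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the maximum before exponentiating for numerical stability.
    q_max = max(q_values)
    exps = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(q_values, temperature=1.0, rng=random):
    """Sample an action index in proportion to its Boltzmann probability."""
    probs = boltzmann_probabilities(q_values, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Illustrative action values for three discrete actions (assumed numbers).
example_q = [1.0, 2.0, 0.5]
example_probs = boltzmann_probabilities(example_q, temperature=1.0)
example_action = sample_action(example_q, temperature=1.0)
```

In this sketch, a low temperature concentrates probability on the highest-valued action (exploitation), while a high temperature pushes the distribution toward uniform (exploration).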
Anastasiya Slavova, Vladimir Hristov
Enrico Marchesini, Alessandro Farinelli
Jiajun Wu, Weihao Chen, Jiaming Ji, Xing Chen, Lumei Su, Houde Dai