JOURNAL ARTICLE

Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning

Abstract

Collision avoidance is a crucial task in vision-guided autonomous navigation. Solutions based on deep reinforcement learning (DRL) has become increasingly popular. In this work, we proposed several novel agent state and reward function designs to tackle two critical issues in DRL-based navigation solutions: 1) smoothness of the trained flight trajectories; and 2) model generalization to handle unseen environments.Formulated under a DRL framework, our model relies on margin reward and smoothness constraints to ensure UAVs fly smoothly while greatly reducing the chance of collision. The proposed smoothness reward minimizes a combination of first-order and second-order derivatives of flight trajectories, which can also drive the points to be evenly distributed, leading to stable flight speed. To enhance the agent's capability of handling new unseen environments, two practical setups are proposed to improve the invariance of both the state and reward function when deploying in different scenes. Experiments demonstrate the effectiveness of our overall design and individual components.

Keywords:
Collision avoidance Reinforcement learning Smoothness Computer science Trajectory Generalization Margin (machine learning) Artificial intelligence Function (biology) Obstacle avoidance Collision avoidance system Autonomous agent State (computer science) Collision Machine learning Mobile robot Robot Algorithm Mathematics Computer security

Metrics

7
Cited By
0.87
FWCI (Field Weighted Citation Impact)
26
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotic Path Planning Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.