• Electronics Optics & Control
  • Vol. 30, Issue 4, 23 (2023)
TANG Jianing, YANG Xin, ZHOU Sida, LI Luoyu, and AN Chengan
Author Affiliations
  • [in Chinese]
  • show less
    DOI: 10.3969/j.issn.1671-637x.2023.04.005 Cite this Article
    TANG Jianing, YANG Xin, ZHOU Sida, LI Luoyu, AN Chengan. UAV Exploration Trajectory Planning with Improved DDQN in Unknown Environment[J]. Electronics Optics & Control, 2023, 30(4): 23 Copy Citation Text show less

    Abstract

    For the exploration of unknown environments, such as search and rescue, chase and escape scenarios, the UAV needs to explore (perceive) the environment while completing current trajectory planning (action selection).Aiming at the above scenarios, in order to achieve efficient environment exploration, an improved Deep Double Q Network (DDQN) exploration trajectory planning method based on Long Short-Term Memory (LSTM) network is proposed.A simulation map is built, the environmental information in the UAVs field of view is taken as input, the LSTM network is introduced, and the choice of action direction is outputted.The priority of exploration experience samples is set to improve training efficiency.Flight dynamics constraints are added, and reasonable state, action space and one-step reward function are designed.Using the proposed algorithm, the UAV can autonomously plan a collision-free track with a wide range of environmental exploration.The simulation results show that the proposed algorithm is better than the traditional DDQN algorithm in the exploration area ratio and the average reward of one-step exploration in unknown environments.
    TANG Jianing, YANG Xin, ZHOU Sida, LI Luoyu, AN Chengan. UAV Exploration Trajectory Planning with Improved DDQN in Unknown Environment[J]. Electronics Optics & Control, 2023, 30(4): 23
    Download Citation