[7] YANG Q, ZHANG J, SHI G, et al. Maneuver decision of UAV in short-range air combat based on deep reinforcement learning[J]. IEEE Access, 2020, 8: 363-378.
[10] SUTTON R S, BARTO A G. Reinforcement learning: an introduction[M]. 2nd ed. Cambridge, Massachusetts: MIT Press, 2018.
[11] HOCHREITER S, SCHMIDHUBER J. Long short-term memory[J]. Neural Computation, 1997, 9(8): 1735-1780.