[5] XIAO Z LYAN Z Y.Radar emitter identification based on feedforward neural networks[C]//IEEE 4th Information TechnologyNetworkingElectronic and Automation Control Conference (ITNEC).ChongqingChina:IEEE 2020:555-558.
[10] TANG ZGAO X G.Research on the self-defence electronic jamming decision-making based on the discrete dynamic Bayesian network[J].Journal of Systems Engineering and Electronics200819(4):702-708.
[14] VISNEVSKI NKRISHNAMURTHY VHAYKIN Set al.Multi-function radar emitter modelling:a stochastic discrete event system approach[C]//The 42nd IEEE International Conference on Decision and Control.MauiHI:IEEE2003:6295-6300.
[15] VISNEVSKI NKRISHNAMURTHY VWANG Aet al.Syntactic modeling and signal processing of multifunction radars:a stochastic context-free grammar approach[J].Proceedings of the IEEE200795(5):1000-1025.
[26] XING QZHU W GJIA X.Intelligent countermeasure design of radar working-modes unknown[C]//IEEE International Conference on Signal ProcessingCommunications and Computing (ICSPCC).XiamenChina:IEEE2017:1-5.
[27] YOO JJANG DKIM H Jet al.Hybrid reinforcement learning control for a micro quadrotor flight[J].IEEE Control Systems Letters20205(2):505-510.
[28] ZHAO Z YWANG QLI X L.Deep reinforcement learning based lane detection and localization[J].Neurocomputing2020413:328-338.
[29] PARK HSIM M KCHOI D G.An intelligent financial portfolio trading strategy using deep Q-learning[J].Expert Systems with Applications2020158:113573.
[30] WATKINS C J C HDAYAN P.Technical note:Q-learning[J].Machine Learning19928:279-292.
[31] SILVER DHUANG AMADDISON C Jet al.Mastering the game of Go with deep neural networks and tree search[J].Nature2016529:484-489.
[32] VAN HASSELT HGUEZ ASILVER D.Deep reinforcement learning with double Q-learning[C]//Proceedings of the 30th AAAI Conference on Artificial Intelligence.PhoenixArizona:AAAI2016:2094-2100.