[7] MNIH V, KAVUKCUOGLU K, SILVER D, et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518(7540): 529-533.
[10] LIU X, XU Y H, JIA L L, et al. Anti-jamming communications using spectrum waterfall: a deep reinforcement learning approach[J]. IEEE Communications Letters, 2018, 22(5): 998-1001.
[12] VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double Q-learning[J]. arXiv e-prints, 2015: 1509.06461.