[1] RICHARD A P.Electronic warfare target location methods[M].Boston: Artech House 2005.
[2] KRISHNAMURTHY V.Emission management for low pro-bability intercept sensors in network centric warfare[J].IEEE Transactions on Aerospace and Electronic Systems 2005 41(1): 133-151.
[3] BAUM M L PASSINO K M.A search-theoretic approach to cooperative control for uninhabited air vehicles[C]//AIAA Guidance Navigation and Control Conference 2002: 1-8.
[4] CASBEER D W.Decentralized estimation using information consensus filters with a multi-static UAV radar tracking system[D].Hawaii: Brigham Young University 2009.
[5] PENG H SU F SHEN L C.Extended search map approach for multiple UAVs wide area target searching[J].Systems Engineering and Electronics 2010 32(4): 795-798.
[6] POLYCARPOU M M YANG Y L PASSINO K M.A cooperative search framework for distributed agents[C]//IEEE International Symposium on Intelligent Control 2001:1-6.
[7] SUJIT P BGHOSE D L.Multiple UAV search using agent based negotiation scheme[C]//American Control Conference2005:2995-3000.
[8] WIERING MSCHMIDHUBER J R.Fast online Q(λ)[J].Machine Learning199833(1):105-115.
[9] MILLAN J D RPOSENATO DDEDIEU E.Continuous-action Q-learning[J].Machine Learning200249(2/3):247-265.
[10] TORRIERI D J.Statistical theory of passive location systems[J].IEEE Transactions on Aerospace and Electro-nic Systems1984AES-20(2):183-198.
[11] TSITSIKLIS J NROY B V.Feature-based methods for large scale dynamic programming[J].Machine Lear-ning199622(1-3):59-94.
[12] GAO X, FANG Y W, HU S G, et al.Angle precision study on dual-aircraft cooperatively detecting remote target by passive locating method[C]//IEEE International Conference on Signal Processing, Communication and Computing, 2011:1174-1178.
[13] BUSONIU L, BABUSKA R, SCHUTTER B D, et al.Reinforcement learning and dynamic programming using function approximators[M].Florida:Automatic Control and Engineering Series, CRC Press, 2010:49-51.