[1] WANG R, XIONG Z, LIU J, et al. Chi-square and SPRT combined fault detection for multisensor navigation[J]. IEEE Transactions on Aerospace & Electronic Systems, 2016, 52(3): 1352-1365.
[2] WANG T, SHEN Y, MAZUELAS S, et al. Distributed scheduling for cooperative localization based on information evolution[C]//IEEE International Conference on Communications, 2012: 576-580.
[3] LI Y X. Deep reinforcement learning: an overview[DB/OL]. [2020-09-08]. https://arxiv.org/pdf/1701.07274v1.pdf.
[4] ATALLAH R F, ASSI C M, KHABBAZ M J. Scheduling the operation of a connected vehicular network using deep reinforcement learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2018: 1-14.
[5] HUANG W, WANG Y, YI X. A deep reinforcement learning approach to preserve connectivity for multi-robot systems[C]//The 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI). IEEE, 2017. doi:10.1109/CISP-BMEI.2017.8302157.
[6] SARTORETTI G, WU Y, PAIVINE W, et al. Distributed reinforcement learning for multi-robot decentralized collective construction[C]//International Symposium on Distributed Autonomous Robotic Systems (DARS), 2018: 35-49.
[9] MAA C. Information theory[M]. Heidelberg: Springer International Publishing, 2017.
[10] NAKAYA Y, OSANA Y. Deep Q-network using reward distribution[C]//ICAISC, 2018: 160-169.
[11] HEESS N, DHRUVA T B, SRIRAM S, et al. Emergence of locomotion behaviours in rich environments[DB/OL]. [2020-09-10]. https://arxiv.org/pdf/1707.02286.pdf.