A Collaborative Information Screening Method for UAV Swarm Based on Deep Reinforcement Learning

LI Xintong; XIONG Zhi; CHEN Mingxing; XIONG Jun; LI Wenlong

doi:10.3969/j.issn.1671-637x.2021.10.002

[1] WANG RXIONG ZLIU Jet al.Chi-square and SPRT com-bined fault detection for multisensor navigation［J］.IEEE Transactions on Aerospace & Electronic Systems201652(3):1352-1365.

[2] WANG TSHEN YMAZUELAS Set al.Distributed scheduling for cooperative localization based on information evolution［C］//IEEE International Conference on Communications2012:576-580.

[3] LI Y X. Deep reinforcement learning: an overview[DB/OL]. [2020-09-08]. https://arxiv.org/pdf/1701.07274v1.pdf.

[4] ATALLAH R F, ASSI C M, KHABBAZ M J. Scheduling the operation of a connected vehicular network using deep reinforcement learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2018: 1-14.

[5] HUANG W WANG Y YI X.A deep reinforcement learning approach to preserve connectivity for multi-robot systems［C］//The 10th International Congress on Image and Signal Processing BioMedical Engineering and Informatics (CISP-BMEI).IEEE 2017.doi:10.1109/CISP-BMEI.2017.8302157.

[6] SARTORETTI G, WU Y, PAIVINE W, et al. Distributed reinforcement learning for multi-robot decentralized collective construction[C]//International Symposium on Distributed Autonomous Robotic Systems (DARS), 2018: 35-49.

[9] MAA C.Information theory［M］.Heidelberg:Springer International Publishing2017.

[10] NAKAYA Y OSANA Y.Deep Q-network using reward distribution［C］//ICAISC,2018:160-169.

[11] HEESS N DHRUVA T B SRIRAM S et al.Emergence of locomotion behaviours in rich environments［DB/DL］.［2020-09-10］.https://arxiv.org/pdf/1707.02286.pdf.