[4] LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft COCO:common objects in context[C]//European Conference on Computer Vision.Cham:Springer,2014:740-755.
[5] VIOLA P,JONES M J.Robust real-time face detection[J].International Journal of Computer Vision,2004,57:137-154.
[6] GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE,2014:580-587.
[7] GIRSHICK R.Fast R-CNN[C]//IEEE International Conference on Computer Vision.Santiago:IEEE,2015:1440-1448.
[8] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[9] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unified,real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:779-788.
[10] LIU W,ANGUELOV D,ERHAN D,et al.SSD:single shot multibox detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[11] REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:6517-6525.
[12] REDMON J,FARHADI A.YOLOv3:an incremental improvement[EB/OL].(2018-04-08)[2022-07-05].https://arxiv.org/abs/1804.02767.
[13] BOCHKOVSKIY A,WANG C Y,LIAO H Y M.YOLOv4:optimal speed and accuracy of object detection[EB/OL].(2020-04-23)[2022-07-05].https://arxiv.org/abs/2004.10934.
[16] LIU Z,LIN Y T,CAO Y,et al.Swin Transformer:hierarchical vision transformer using shifted windows[C]//IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2021:9992-10002.
[17] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[EB/OL].(2017-12-06)[2022-07-05].https://arxiv.org/abs/1706.03762.
[18] TAN M X,PANG R M,LE Q V.EfficientDet:scalable and efficient object detection[C]//IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:10778-10787.
[19] LIU Y C,SHAO Z R,HOFFMANN N.Global attention mechanism:retain information to enhance channel-spatial interactions[EB/OL].(2021-12-10)[2022-07-05].https://arxiv.org/abs/2112.05561.