[2] Krizhevsky A, Sutskever I, Hinton G E et al. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 60, 84-90(2017).
[3] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017).
[4] He K M, Gkioxari G, Dollár P et al. Mask R-CNN[C]. //2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy., 2980-2988(2017).
[5] Liu W, Anguelov D, Erhan D et al. SSD: single shot MultiBox detector[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9905, 21-37(2016).
[6] Redmon J, Divvala S, Girshick R et al. You only look once: unified, real-time object detection[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 779-788(2016).
[7] Duan Z J, Li S B, Hu J J et al. Review of deep learning based object detection methods and their mainstream frameworks[J]. Laser & Optoelectronics Progress, 57, 120005(2020).
[8] Chen L L, Zhang Z D, Peng L et al. Real-time detection based on improved single shot MultiBox detector[J]. Laser & Optoelectronics Progress, 56, 011002(2019).
[9] Li C Y, Yao J M, Lin Z X et al. Object detection method based on improved YOLO lightweight network[J]. Laser & Optoelectronics Progress, 57, 141003(2020).
[10] Cui J H, Zhang Y Z, Wang Z et al. Light-weight object detection networks for embedded platform[J]. Acta Optica Sinica, 39, 0415006(2019).
[11] Sandler M, Howard A, Zhu M L et al. MobileNetV2: inverted residuals and linear bottlenecks[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 4510-4520(2018).
[12] Gao S H, Cheng M M, Zhao K et al. Res2Net: a new multi-scale backbone architecture[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 652-662(2021).
[15] Zhang X Y, Zhou X Y, Lin M X et al. ShuffleNet: an extremely efficient convolutional neural network for mobile devices[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, 6848-6856(2018).
[16] Lin T Y, Dollár P, Girshick R et al. Feature pyramid networks for object detection[C]. //2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA., 936-944(2017).
[17] Liu S, Qi L, Qin H F et al. Path aggregation network for instance segmentation[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA., 8759-8768(2018).
[18] Rezatofighi H, Tsoi N, Gwak J et al. Generalized intersection over union: a metric and a loss for bounding box regression[C]. //2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 15-20, 2019, Long Beach, CA, USA, 658-666(2019).
[19] Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 318-327(2017).
[23] Liu S T, Huang D, Wang Y H et al. Receptive field block net for accurate and fast object detection[M]. //Ferrari V, Hebert M, Sminchisescu C, et al. Computer vision-ECCV 2018. Lecture notes in computer science, 11215, 404-419(2018).