Lightweight Object Detection Network Based on Convolutional Neural Network

Yequn Cheng; Yan Wang; Yuying Fan; Baoqing Li

doi:10.3788/LOP202158.1610023

[1] Zou Z X, Shi Z W, Guo Y H et al. Object detection in 20 years: a survey[EB/OL]. (2019-05-16)[2020-07-28]. https://arxiv.org/abs/1905.05055

[2] Krizhevsky A, Sutskever I, Hinton G E et al. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 60, 84-90(2017).

[3] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017).

[4] He K M, Gkioxari G, Dollár P et al. Mask R-CNN[C]. //2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy., 2980-2988(2017).

[5] Liu W, Anguelov D, Erhan D et al. SSD: single shot MultiBox detector[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9905, 21-37(2016).

[6] Redmon J, Divvala S, Girshick R et al. You only look once: unified, real-time object detection[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 779-788(2016).

[7] Duan Z J, Li S B, Hu J J et al. Review of deep learning based object detection methods and their mainstream frameworks[J]. Laser & Optoelectronics Progress, 57, 120005(2020).

[8] Chen L L, Zhang Z D, Peng L et al. Real-time detection based on improved single shot MultiBox detector[J]. Laser & Optoelectronics Progress, 56, 011002(2019).

[9] Li C Y, Yao J M, Lin Z X et al. Object detection method based on improved YOLO lightweight network[J]. Laser & Optoelectronics Progress, 57, 141003(2020).

[10] Cui J H, Zhang Y Z, Wang Z et al. Light-weight object detection networks for embedded platform[J]. Acta Optica Sinica, 39, 0415006(2019).

[11] Sandler M, Howard A, Zhu M L et al. MobileNetV2: inverted residuals and linear bottlenecks[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 4510-4520(2018).

[12] Gao S H, Cheng M M, Zhao K et al. Res2Net: a new multi-scale backbone architecture[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 652-662(2021).

[13] He K M, Zhang X Y, Ren S Q et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916(2015). http://www.sciencedirect.com/science/article/pii/S0031320315004252

[14] Iandola F N, Han S, Moskewicz M W et al. SqueezeNet: AlexNet-level accuracy with 50× fewer parameters and <0.5 MB model size[EB/OL]. (2016-11-04)[2020-07-28]. https://arxiv.org/abs/1602.07360

[15] Zhang X Y, Zhou X Y, Lin M X et al. ShuffleNet: an extremely efficient convolutional neural network for mobile devices[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, 6848-6856(2018).

[16] Lin T Y, Dollár P, Girshick R et al. Feature pyramid networks for object detection[C]. //2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA., 936-944(2017).

[17] Liu S, Qi L, Qin H F et al. Path aggregation network for instance segmentation[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA., 8759-8768(2018).

[18] Rezatofighi H, Tsoi N, Gwak J et al. Generalized intersection over union: a metric and a loss for bounding box regression[C]. //2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 15-20, 2019, Long Beach, CA, USA, 658-666(2019).

[19] Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 318-327(2017).

[20] Zhang Z, He T, Zhang H et al. Bag of freebies for training object detection neural networks[EB/OL]. (2019-04-12)[2020-07-28]. https://arxiv.org/abs/1902.04103

[21] Fu C Y, Liu W, Ranga A et al. DSSD: deconvolutional single shot detector[EB/OL]. (2017-01-23)[2020-07-28]. https://arxiv.org/abs/1701.06659

[22] Dai J F, Li Y, He K M et al. R-FCN: object detection via region-based fully convolutional networks[EB/OL]. (2016-05-20)[2020-07-28]. https://arxiv.org/abs/1605.06409v2

[23] Liu S T, Huang D, Wang Y H et al. Receptive field block net for accurate and fast object detection[M]. //Ferrari V, Hebert M, Sminchisescu C, et al. Computer vision-ECCV 2018. Lecture notes in computer science, 11215, 404-419(2018).

[24] Redmon J, Farhadi A. YOLOv3: an incremental improvement[EB/OL]. (2018-04-08)[2020-07-28]. https://arxiv.org/abs/1804.02767