[3] LIENHART R, MAYDT J.An extended set of haar-like features for rapid object detection[C]//International Conference on Image Processing(ICIP), IEEE, 2002:900-903.
[4] VIOLA P, JONES M.Rapid object detection using a boosted cascade of simple features[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2001:511-518.
[5] DALAL N, TRIGGS B.Histograms of oriented gradients for human detection[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2005:886-893.
[6] FELZENSZWALB P F, MCALLESTER D, RAMANAN D.A discriminatively trained, multiscale, deformable part model[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2008:1-8.
[7] FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al.Object detection with discriminatively trained part-based models[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 32(9):1627-1645.
[8] HINTON G E, SALAKHUTDINOV R R.Reducing the dimensionality of data with neural networks[J].Science, 2006, 313(5786):504-507.
[9] ZEILER M D, FERGUS R.Visualizing and understanding convolutional networks[C]//European Conference on Computer Vision(ECCV), 2014:818-833.
[10] SIMONYAN K, ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[J].arXiv Preprint, arXiv:1409.1556, 2014.
[11] REDMON J, DIVVALA S, GIRSHICK R, et al.You only look once:unified, real-time object detection[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2016:779-788.
[12] REDMON J, FARHADI A.YOLO9000:better, faster, stronger[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2017:7263-7271.
[13] REDMON J, FARHADI A.Yolov3:an incremental improvement[J].arXiv Preprint, arXiv:1804.02767, 2018.
[14] LIU W, ANGUELOV D, ERHAN D, et al.SSD:single shot multibox detector[C]//European Conference on Computer Vision(ECCV), 2016:21-37.
[15] FU C Y, LIU W, RANGA A, et al.DSSD:deconvolutional single shot detector[J].arXiv Preprint, arXiv:1701.06659, 2017.
[16] HE K, ZHANG X, REN S, et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2016:770-778.
[17] JEONG J, PARK H, KAWK N.Enhancement of SSD by concatenating feature maps for object detection[J].arXiv Preprint, arXiv:1705.09587, 2017.
[18] LI Z, ZHOU F.FSSD:feature fusion single shot multibox detector[J].arXiv Preprint, arXiv:1712.00960, 2017.
[19] SERMANET P, EIGEN D, ZHANG X, et al.Overfeat:integrated recognition, localization and detection using convolutional networks[J].arXiv Preprint, arXiv:1312.6229, 2013.
[20] KRIZHEVSKY A, SUTSKEVER I, HINTON G E.Imagenet classification with deep convolutional neural networks[C]//Advances in Neural Information Processing Syste-ms, 2012:1097-1105.
[21] GIRSHICK R, DONAHUE J, DARRELL T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2014:580-587.
[22] FELZENSZWALB P F, HUTTENLOCHER D P.Efficient graph-based image segmentation[J].International Journal of Computer Vision, 2004, 59(2):167-181.
[23] HE K M, ZHANG X Y, REN S Q, et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9):1904-1916.
[24] GIRSHICK R.Fast R-CNN[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2015:1440-1448.
[25] REN S Q, HE K M, GIRSHICK R, et al.Faster R-CNN:towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems, 2015:91-99.
[26] KONG T, YAO A B, CHEN Y D, et al.Hypernet:towards accurate region proposal generation and joint object detection[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2016:845-853.
[27] CAI Z W, FAN Q F, FERIS R S, et al.A unified multi-scale deep convolutional neural network for fast object detection[C]//European Conference on Computer Vision(ECCV), 2016:354-370.
[28] KIM K H, HONG S, ROH B, et al.PVANET:deep but lightweight neural networks for real-time object detection[J].arXiv Preprint, arXiv:1608.08021, 2016.
[29] SHANG W, SOHN K, ALMEIDAl D, et al.Understanding and improving convolutional neural networks via concatenated rectified linear units[C]//International Conference on Machine Learning, 2016:2217-2225.
[30] LIN T Y, DOLLAR P, GIRSHICKR, et al.Feature pyramid networks for object detection[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2017:2117-2125.
[31] YANG B, YAN J, LEI Z, et al.Craft objects from images[C]//Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition(CVPR), 2016:6043-6051.
[32] HE K, GKIOXARI G, DOLLAR P, et al.Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017:2961-2969.