Backbone Network for Object Detection Task

Yalin Song; Yanwei Pang

doi:10.3788/LOP57.041021

[1] LeCun Y, Bengio Y, Hinton G. Deep learning[J]. Nature, 521, 436-444(2015).

[2] Li Z D, Zhong Y, Chen M et al. Fast face image retrieval based on depth feature[J]. Acta Optica Sinica, 38, 1010004(2018).

[3] Hua X, Wang X Q, Wang D et al. Multi-objective detection of traffic scenes based on improved SSD[J]. Acta Optica Sinica, 38, 1215003(2018).

[4] Cao Y J, Xu G M, Shi G C. Low altitude armored target detection based on rotation invariant Faster R-CNN[J]. Laser & Optoelectronics Progress, 55, 101501(2018).

[5] Girshick R, Donahue J, Darrell T et al. Rich feature hierarchies for accurate object detection and semantic segmentation. [C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 23-28, 2014, Columbus, Ohio. New York: IEEE, 580-587(2014).

[6] He K M, Zhang X Y, Ren S Q et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916(2015). http://www.sciencedirect.com/science/article/pii/S0031320315004252

[7] Girshick R. Fast R-CNN. [C]∥Proceedings of the IEEE International Conference on Computer Vision(ICCV), December 11-18, 2015, Santiago, Chile. New York: IEEE, 1440-1448(2015).

[8] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks. [C]∥Advances in Neural Information Processing Systems, December 7-12, 2015, Montreal, Quebec, Canada. Canada: NIPS, 91-99(2015).

[9] Chen J M, Jin J, Wang W F. Refine-FPN: an improvement based on FPN algorithm[J]. Laser & Optoelectronics Progress, 56, 211505(2019).

[10] Redmon J, Divvala S, Girshick R et al. You only look once: unified, real-time object detection. [C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 26-July 1, 2016, Las Vegas, Nevada. New York: IEEE, 779-788(2016).

[11] Liu W, Anguelov D, Erhan D et al. SSD: single shot MultiBox detector[M]. ∥Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science. Cham: Springer, 9905, 21-37(2016).

[12] Liu W J, Gao M Y, Qu H C et al. Light-weight multi-object detection network[J]. Laser & Optoelectronics Progress, 56, 221003(2019).

[13] Chen L L, Zhang Z D, Peng L. Real-time detection based on improved single shot MultiBox detector[J]. Laser & Optoelectronics Progress, 56, 011002(2019).

[14] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. [C]∥Advances in Neural Information Processing Systems, December 3-6, 2012, Lake Tahoe, Nevada, United States. Canada: NIPS, 1097-1105(2012).

[15] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. [C]∥Proceedings of the International Conference on Learning Representations, May 7-9, 2015, San Diego, CA, USA. (2015).

[16] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition. [C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR), June 26-July 1, 2016, Las Vegas, Nevada. New York: IEEE, 770-778(2016).

[17] Huang G, Liu Z, van der Maaten L et al. Densely connected convolutional networks. [C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition(CVPR), July 21-26, 2017, Honolulu, Hawaii. New York: IEEE, 4700-4708(2017).

[18] He K M, Girshick R, Dollár P. Rethinking ImageNet pre-training. [C]∥Proceedings of the IEEE International Conference on Computer Vision (ICCV), October 27-November 2, 2019, Seoul, Korea. New York: IEEE, 4918-4927(2019).

[19] Shen Z Q, Liu Z, Li J G et al. DSOD: learning deeply supervised object detectors from scratch. [C]∥Proceedings of the IEEE International Conference on Computer Vision(ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 1919-1927(2017).

[20] Zhu R, Zhang S F, Wang X B et al. ScratchDet: training single-shot object detectors from scratch. [C]∥Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 16-20, 2019, Long Beach, CA, USA. New York: IEEE, 2268-2277(2019).

[21] Everingham M, van Gool L, Williams C K I et al. The pascal visual object classes (VOC) challenge[J]. International Journal of Computer Vision, 88, 303-338(2010).

[22] Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks[C]∥Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, May 13-15, 2010, Sardinia, 249-256(2010).

[23] Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. [C]∥Proceedings of the International Conference on Machine Learning (ICML), July 6-11, 2015, Lille, France. New York: ACM, 448-456(2015).

[24] Glorot X, Bordes A, Bengio Y. Deep sparse rectifier neural networks[C]∥Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, April 11-13, 2011, Fort Lauderdale, USA, 315-323(2011).

[25] Liu S T, Huang D, Wang Y H. Receptive field block net for accurate and fast object detection. [C]∥Proceedings of the European Conference on Computer Vision(ECCV), September 8-14, 2018, Munich, Germany. New York: IEEE, 385-400(2018).

[26] Redmon J, Farhadi A. YOLO9000: better, faster, stronger. [C]∥Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE, 6517-6525(2017).

[27] Fu C Y, Liu W, Ranga A et al. DSSD: deconvolutional single shot detector[EB/OL]. (2017-01-23)[2019-06-09]. http://arxiv.org/abs/1701.06659

[28] Dai J F, Li Y, He K M et al. R-FCN: object detection via region-based fully convolutional networks. [C]∥Advances in Neural Information Processing Systems, December 5-10, 2016, Barcelona, Spain. Canada: NIPS, 379-387(2016).
