Head Detection Based on RDM-YOLOv3

Junwen Liu; Yongjun Zhang; Zhi Li; Yong Zhao; Xinyu Ran; Zhongwei Cui; Mengjia Niu

doi:10.3788/LOP202259.0815011
