Human’s Dangerous Action Recognition in Petrochemical Scene Using Machine Vision

Bin Yang; Xiao Yun; Kaiwen Dong; Xixiang Liu; Han Huang

doi:10.3788/LOP202158.2215001
