Human Action Recognition by Decision-Making Level Fusion Based on Spatial-Temporal Features

Yandi Li; Xiping Xu

doi:10.3788/AOS201838.0810001

[1] Li R F, Wang L L, Wang K. A survey of human body action recognition[J]. Pattern Recognition and Artificial Intelligence, 27, 35-48(2014).

[2] Lafferty J. McCallum A, Pereira F. Conditional random fields: probabilistic model for segmenting and labeling Sequence data. [C]∥Proceedings of the 18th International Conference on Machine Learning, 282-289(2001).

[3] Wang J, Liu P, She M et al. Human action categorization using conditional random field[C]. IEEE Workshop on Robotic Intelligence in Informationally Structured Space (RiiSS), 131-135(2011).

[4] Yamato J, Ohya J, Ishii K. Recognizing human action in time-sequential images using hidden markov model[C]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 379-385(1992).

[5] Peursum P, Venkatesh S, West G. Tracking-as-recognition for articulated full-body human motion analysis[C]. IEEE Conference on Computer Vision and Pattern Recognition, 1-8(2007).

[6] Natarajan P, Nevatia R. Online, real-time tracking and recognition of human actions[C]. IEEE Workshop on Motion and video Computing, 1-8(2008).

[7] Huang K, Zhang Y, Tan T. A discriminative model of motion and cross ratio for view-invariant action recognition[J]. IEEE Transactions on Image Processing, 21, 2187-2197(2012). http://europepmc.org/abstract/MED/22438508

[8] Niebles J C, Wang H, Li F. Unsupervised learning of human action categories using spatial-temporal words[J]. International Journal of Computer Vision, 79, 299-318(2008). http://link.springer.com/article/10.1007/s11263-007-0122-4

[9] Wang H, Schmid C. Action recognition with improved trajectories[C]. IEEE International Conference on Computer Vision, 3551-3558(2013).

[10] Schindler K, Van Gool L. Action snippets: how many frames does human action recognition require?[C]. IEEE Conference on Computer Vision and Pattern Recognition, 1-8(2008).

[11] Liu J, Ali S, Shah M. Recognizing human actions using multiple features[C]. IEEE Conference on Computer Vision and Pattern Recognition, 1-8(2008).

[12] Wang X Y, Zhang Y Z, Chen D Y. Face detection based on MB-LBP and eye tracking[J]. Chinese Journal of Scientific Instrument, 2739-2745(2014).

[13] Ando H, Fujiyoshi H. Human-area segmentation by selecting similar silhouette images based on weak-classifier response[C]. 20th International Conference on Pattern Recognition, 3444-3447(2010).

[14] Fu Y, Guo J Y. Dynamic time warping-based human action recognition[J]. Eletronic Measurement Technology, 69-72(2014).

[15] Zhang J, Gao W, Liu A A et al[J]. Modeling approach of the video semantic events based on motion trajectories Eletronic Measurement Technology, 2013, 31-36.

[16] An D, Rong C Q, Yang D et al[J]. Speaker recognition method basd on PSOA clustering and KMP algorithm Chinese Journal of Scientific Instrument, 2013, 107-112.

[17] Li Q H, Li A H, Wang T et al. Two-stream networks with sequence opticsl flow image for action recognition[J]. Acta Optica Sinica, 38, 0615002(2018).

[18] Ijjina E P, Mohan C K. Human action recognition based on motion capture information using fuzzy convolution neural networks[C]. Eighth International Conference on Advances in Pattern Recognition, 1-6(2015).

[19] Karpathy A, Toderici G, Shetty S et al. Large-scale video classification with convolutional neural networks. [C]∥Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1725-1732(2014).

[20] Baccouche M, Mamalet F, Wolf C et al. Sequential deep learning for human action recognition[C]. International Workshop on Human Behavior Understanding, 29-39(2011).

[21] Shou Z, Chan J, Zareian A et al. CDC: convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos[C]. IEEE Conference on Computer Vision and Pattern Recognition, 1417-1426(2017).

[22] Zhao S, Wang B, Tang C Y. Arm vein feature extraction and matching based on chain code[J]. Acta Optica Sinica, 36, 0515003(2016).

[23] Li Y D, Xu X P, Chen J et al. Background updating based on dynamic characteristic block matching in the application of the motion detection[J]. Chinese Journal of Scientific Instrument, 445-453(2017).

[24] Li Y D, Xu X P, Wang J Q. Feature extraction based on pyramid match kernel algorithm with adaptive partitioning[J]. Acta Photonica Sinica, 46, 1210001(2017).

[25] Rabiner L R, Juang B H. Fundamentals of speech recognition[M]. Englewood Cliffs: PTR Prentice Hall(1993).