[2] Ma M, Li Y B. Multi-level image sequences and convolutional neural networks based human action recognition method[J]. Journal of Jilin University Engineering and Technology Edition, 47, 1244-1252(2017).
[3] Subetha T, Chitrakala S. A survey on human activity recognition from videos. [C]∥Proceedings of IEEE International Conference on Information Communication and Embedded Systems, 1-7(2016).
[4] Karpathy A, Toderici G, Shetty S et al. Large-scale video classification with convolutional neural networks. [C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 1725-1732(2014).
[6] Tran D, Bourdev L, Fergus R et al. Learning spatiotemporal features with 3D convolutional networks. [C]∥Proceedings of IEEE International Conference on Computer Vision, 4489-4497(2015).
[8] Wang L M, Xiong Y J, Wang Z et al. Temporal segment networks: Towards good practices for deep action recognition[J]. ACM Transactions on Information Systems, 22, 20-36(2016).
[10] Bilen H, Fernando B, Gavves E et al. Dynamic image networks for action recognition. [C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 3034-3042(2016).
[11] Lin S Z, Zheng Y, Lu X F et al. Adaptive tracking algorithm for aerial small targets based on multi-domain convolutional neural networks and autoregression model[J]. Acta Optica Sinica, 37, 1215006(2017).
[12] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. [C]∥Proceedings of International Conference on Learning Representations, 1-14(2015).
[13] Qu L, Wang K R, Chen L L et al. Fast road detection based on RGBD images and convolutional neural network[J]. Acta Optica Sinica, 37, 1010003(2017).
[14] Feichtenhofer C, Pinz A, Wildes R P. Spatiotemporal residual networks for video action recognition. [C]∥Proceedings of Neural Information Processing Systems, 3468-3476(2016).
[15] Wang H, Schmid C. Action recognition with improved trajectories. [C]∥Proceedings of IEEE International Conference on Computer Vision, 3551-3558(2013).
[18] Zha SX, LuisierF, AndrewsW, et al. Exploiting image-trained CNN architectures for unconstrained video classification[J]. arXiv preprint arXiv:1503. 04144, 2015.
[19] Wang L M, Qiao Y, Tang X O. Action recognition with trajectory-pooled deep-convolutional descriptors. [C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 4305-4314(2015).