[1] Simonyan K. -11-12)[2019-12-21]. https:∥arxiv., org/abs/1406, 2199(2014).
[2] Ng JoeY H, Hausknecht M, Vijayanarasimhan S et al. Beyond short snippets: deep networks for video classification. [C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA. New York: IEEE, 4694-4702(2015).
[3] Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 9, 1735-1780(1997).
[4] Wang L M, Xiong Y J, Wang Z et al. Temporal segment networks: towards good practices for deep action recognition[M]. ∥ Leibe B, Matas J, Sebe N, et al. Computer Vision—— ECCV 2016. Lecture Notes in Computer Science. Cham: Springer, 9912, 20-36(2016).
[5] Tran D, Bourdev L, Fergus R et al. Learning spatiotemporal features with 3D convolutional networks[J]. 2015 IEEE International Conference on Computer Vision (ICCV), 4489-4497(2015).
[7] Hu T, Zhu X, Guo W et al. Human action recognition based on scene semantics[J]. Multimedia Tools and Applications, 78, 28515-28536(2019).
[9] Zhang K P, Zhang Z P, Li Z F et al. Joint face detection and alignment using multitask cascaded convolutional networks[J]. IEEE Signal Processing Letters, 23, 1499-1503(2016).
[10] Szegedy C, Vanhoucke V, Ioffe S et al. Rethinking the inception architecture for computer vision. [C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30,2016, Las Vegas, NV, USA. New York: IEEE, 2818-2826(2016).
[11] Ramachandran P, Zoph B. -10-16)[2019-12-21]. https:∥arxiv.org/abs/1710.05941v1.(2017).
[12] Howard A, Sandler M, Chen B et al. Searching for MobileNetV3. [C]∥2019 IEEE/CVF International Conference on Computer Vision (ICCV), October 27-November 2, 2019, Seoul, Korea (South). New York: IEEE, 1314-1324(2019).
[13] Glorot X, Bordes A. -01-17)[2019-12-21]. https:∥wenku.baidu.com/view/7822feb5770bf78a65295450.html.(2010).
[14] Liu W, Anguelov D, Erhan D et al. SSD: single shot MultiBox detector[M]. ∥ Leibe B, Matas J, Sebe N, et al. Computer Vision—— ECCV 2016. Lecture Notes in Computer Science. Cham: Springer, 9905, 21-37(2016).
[15] Redmon J. -04-08)[2019-12-21]. https:∥arxiv., org/abs/1804, 02767(2018).
[16] Sandler M, Howard A, Zhu M L et al. MobileNetV2: inverted residuals and linear bottlenecks. [C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 4510-4520(2018).
[18] Cui H, Dahnoun N. Real-time stereo vision implementation on Nvidia Jetson TX2. [C]∥2019 8th Mediterranean Conference on Embedded Computing (MECO), June 10-14, 2019, Budva, Montenegro. New York: IEEE, 1-5(2019).
[19] Jose E. M G, Haridas M T P, et al. Face recognition based surveillance system using FaceNet and MTCNN on jetson TX2. [C]∥2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), March 15-16 , 2019, Coimbatore, India. New York: IEEE, 608-613(2019).
[20] Giubilato R, Chiodini S, Pertile M et al. An evaluation of ROS-compatible stereo visual SLAM methods on a nVidia Jetson TX2[J]. Measurement, 140, 161-170(2019).
[22] He K, Zhang X, Ren S et al. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. [C]∥ 2015 IEEE International Conference on Computer Vision, December 7-13 , 2015, Santiago, Chile. New York: IEEE, 15802053(2015).