[1] Ma C, Huang J B, Yang X, et al. Hierarchical convolutional features f visual tracking[C]Proceedings of the IEEE International Conference on Computer Vision, 2015: 3074−3082.
[2] Danelljan M, Robinson A, Khan F S, et al. Beyond crelation filters: Learning continuous convolution operats f visual tracking[C]ECCV, 2016.
[3] Danelljan M, Bhat G, Khan F S, et al. Eco: Efficient convolution operats f tracking[C]CVPR, 2017.
[4] Tao Ran, Gavves E, Smeulders A W M. Siamese instance search f tracking[C]2016 IEEE Conference on Computer Vision Pattern Recognition, 2016: 1420−1429.
[5] Bertito L, Valmadre J, Henriques J F, et al. FullyConvolutional Siamese wks f Object Tracking[M]Hua G, Jegou H. Computer Version ECCV 2016 Wkshops. Cham: Springer, 2016, 9914: 850−865.
[6] Valmadre J, Bertito L, Henriques J F, et al. Endtoend representation learning f crelation filter based tracking[C]2017 IEEE Conference on Computer Vision Pattern Recognition, 2017: 5000−5008.
[7] Li Bo, Yan Junjie, Wu Wei, et al. High perfmance visual tracking with Siamese region proposal wk[C]2018 IEEECVF Conference on Computer Vision Pattern Recognition, 2018: 8971−8980.
[8] Zhu Zheng, Wang Qiang, Li Bo, et al. Distractaware Siamese wks f visual object tracking[C]The 15th European Conference on Computer Vision, 2018: 103−119.
[9] Simonyan K, Zisserman A. Very deep convolutional wks f largescale image recognition[EBOL]. (20150410)[20181215]. https:arxiv.gabs1409.1556.
[10] Bromley J, Guyon I, LeCun Y, et al. Signature verification using a “Siamese” time delay neural wk[C]Advances in Neural Infmation Processing Systems, 1994: 737−744.
[11] Zaguyko S, Komodakis N. Learning to compare image patches via convolutional neural wks[C]CVPR, 2015.
[12] Wang N, Shi J, Yeung D, et al. Understing Diagnosing Visual Tracking Systems[C]2015 IEEE International Conference on Computer Vision (ICCV), 2015: 3101−3109.
[13] Wanyi Li, Peng Wang, Hong Qiao. A survey of visual attention based methods for object tracking. Acta automatica sinica, 40, 561-576(2014).
[14] Woo S, Park J, Lee J Y, etal. CBAM: Convolutional Block Attention Module[C] European Conference on Computer Vision, 2018: 3−19.
[15] Russakovsky O, Deng J, Su H, et al. Image large scale visual recognition challenges[C]IJCV, 2015.
[16] Lin TY, Maire M, Belongie S, et al. Microsoft coco: Common objects in context[C]ECCV, 2014: 740−755.
[17] Real E, Shlens J, Mazzocchi S, et al. Youtube boundingboxes: A large highprecision humanannotated data set f object detection in video[C]2017 IEEE Conference on Computer Vision Pattern Recognition (CVPR), IEEE, 2017: 7464−7473.
[18] Selvaraju R R, Cogswell M, Das A, et al. Gradcam: Visual explanations from deep wks via gradientbased localization[C]Proceedings of the IEEE Conference on Computer Vision Pattern Recognition, 2017: 618−626.
[19] Y Wu, J Lim, M H Yang. Object tracking benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1834-1848(2015).