Light-Weight Siamese Attention Network Object Tracking for Unmanned Aerial Vehicle

Zhoujuan Cui; Junshe An; Yufeng Zhang; Tianshu Cui

doi:10.3788/AOS202040.1915001

Acta Optica Sinica
Vol. 40, Issue 19, 1915001 (2020)

Zhoujuan Cui¹,²,*, Junshe An¹, Yufeng Zhang¹,², and Tianshu Cui¹,²

Author Affiliations

¹Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China

²University of Chinese Academy of Sciences, Beijing 100049, China

Zhoujuan Cui, Junshe An, Yufeng Zhang, Tianshu Cui. Light-Weight Siamese Attention Network Object Tracking for Unmanned Aerial Vehicle[J]. Acta Optica Sinica, 2020, 40(19): 1915001

Fig. 1. Framework of Siamese network with channel spatial coordination attention module

Fig. 2. Convolutional blocks of MobileNetV2

Fig. 3. Abstract diagram of the convolution process in MobileNetV2

Fig. 4. Grad-CAM network visualization results. (a) No attention module; (b) with attention module

Fig. 5. Channel spatial coordination attention module

Fig. 6. Success rate comparison of different attention module combinations on OTB-2015

Fig. 7. Precision comparison of different attention module combinations on OTB-2015

Fig. 8. Qualitative results of the nine tracking algorithms on different video sequences. (a) car6_5; (b) car17; (c) person9; (d) person1_s; (e) uav4; (f) wakeboard6

Fig. 9. Results of problem sequence on uav1_1

Fig. 10. Results of the tracking algorithms on OTB-2015. (a) Success plot; (b) precision plot

Fig. 11. Tracking success plots for videos with different attributes. (a) Scale variation; (b) aspect ratio change; (c) low resolution; (d) fast motion; (e) full occlusion; (f) partial occlusion; (g) out-of-view; (h) background clutter; (i) illumination variation; (j) viewpoint change; (k) camera motion; (l) similar object

Fig. 12. Tracking precision plots for videos with different attributes. (a) Scale variation; (b) aspect ratio change; (c) low resolution; (d) fast motion; (e) full occlusion; (f) partial occlusion; (g) out-of-view; (h) background clutter; (i) illumination variation; (j) viewpoint change; (k) camera motion; (l) similar object

Fig. 13. Quantitative analysis of some video sequences. (a) Scale variation and aspect ratio change; (b) CLE

Layer name	Input size	Operator	Expansion factor	Output channels	Repeats	Stride	CSCAM
Input	255×255×3	Conv2d	-	32	1	2	No
Layer 1	127×127×32	Bottleneck	1	16	1	1	No
Layer 2	127×127×16	Bottleneck	6	24	2	2	No
Layer 3	63×63×24	Bottleneck	6	32	3	2	Yes
Layer 4	31×31×32	Bottleneck	6	64	4	1	No
Layer 5	31×31×64	Bottleneck	6	96	3	1	Yes
Layer 6	31×31×96	Bottleneck	6	160	3	1	Yes
Layer 7	31×31×160	Bottleneck	6	320	1	1	Yes
Output	31×31×320	-	-	-	-	-	-

Table 1. Architecture of Siamese network based on MobileNetV2
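
The rows of Table 1 can be checked for internal consistency with a short script. The following is a minimal sketch, not the authors' implementation: it only transcribes the table into a Python structure and verifies that each row's output matches the next row's input. The names `Row`, `TABLE_1`, `output_shape`, and `check_chain` are hypothetical, and the rule "output size = input size // stride" is an assumption that happens to reproduce the sizes listed in the table; the source does not state the kernel size or padding scheme. In MobileNetV2, the stride of a repeated group applies to its first block only, so the stride counts once per row.

```python
from typing import List, NamedTuple, Optional, Tuple


class Row(NamedTuple):
    """One row of Table 1 (hypothetical container, for illustration only)."""
    name: str
    size: int                 # input height and width (square feature map)
    in_ch: int                # input channels
    operator: str
    expansion: Optional[int]  # expansion factor; None where the table shows "-"
    out_ch: int               # output channels
    repeat: int               # number of repeated blocks
    stride: int               # stride of the first block in the group
    cscam: bool               # whether a CSCAM is attached


# Values transcribed directly from Table 1.
TABLE_1: List[Row] = [
    Row("Input",   255,   3, "Conv2d",     None,  32, 1, 2, False),
    Row("Layer 1", 127,  32, "Bottleneck", 1,     16, 1, 1, False),
    Row("Layer 2", 127,  16, "Bottleneck", 6,     24, 2, 2, False),
    Row("Layer 3",  63,  24, "Bottleneck", 6,     32, 3, 2, True),
    Row("Layer 4",  31,  32, "Bottleneck", 6,     64, 4, 1, False),
    Row("Layer 5",  31,  64, "Bottleneck", 6,     96, 3, 1, True),
    Row("Layer 6",  31,  96, "Bottleneck", 6,    160, 3, 1, True),
    Row("Layer 7",  31, 160, "Bottleneck", 6,    320, 1, 1, True),
]
FINAL_OUTPUT: Tuple[int, int] = (31, 320)  # the "Output" row: 31x31x320


def output_shape(row: Row) -> Tuple[int, int]:
    """Return (spatial size, channels) produced by a row.

    Assumption: size // stride reproduces the table (255 -> 127 -> 63 -> 31);
    the actual padding scheme is not given in the source.
    """
    return row.size // row.stride, row.out_ch


def check_chain(rows: List[Row], final: Tuple[int, int]) -> bool:
    """Check that every row's output equals the next row's listed input."""
    expected = [(r.size, r.in_ch) for r in rows[1:]] + [final]
    return all(output_shape(r) == e for r, e in zip(rows, expected))


# Layers that carry the attention module, and the overall downsampling factor.
cscam_layers = [r.name for r in TABLE_1 if r.cscam]
total_stride = 1
for r in TABLE_1:
    total_stride *= r.stride

if __name__ == "__main__":
    print("consistent:", check_chain(TABLE_1, FINAL_OUTPUT))
    print("CSCAM on:", cscam_layers)
    print("total stride:", total_stride)
```

Read this way, the table shows that all downsampling happens in the stem and in Layers 2 and 3; Layers 4 to 7 keep stride 1, so the 31×31 resolution is preserved through to the output.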
