• Acta Optica Sinica
  • Vol. 40, Issue 4, 0415001 (2020)
Xueli Xie, Chuanxiang Li, Xiaogang Yang, Jianxiang Xi*, and Tong Chen
Author Affiliations
  • College of Missile Engineering, Rocket Force University of Engineering, Xi'an, Shaanxi 710025, China
  • show less
    DOI: 10.3788/AOS202040.0415001 Cite this Article Set citation alerts
    Xueli Xie, Chuanxiang Li, Xiaogang Yang, Jianxiang Xi, Tong Chen. Dynamic Receptive Field-Based Object Detection in Aerial Imaging[J]. Acta Optica Sinica, 2020, 40(4): 0415001 Copy Citation Text show less
    Structure of RetinaNet
    Fig. 1. Structure of RetinaNet
    Internal structure of dual attention SE-ResNeXt module
    Fig. 2. Internal structure of dual attention SE-ResNeXt module
    Bottom-up short connection
    Fig. 3. Bottom-up short connection
    Structure of pixel-wise addition module
    Fig. 4. Structure of pixel-wise addition module
    Structure of GCU module
    Fig. 5. Structure of GCU module
    Structure of object detection subnet
    Fig. 6. Structure of object detection subnet
    Structure of DRF module
    Fig. 7. Structure of DRF module
    Partial sample of VISDrone-g dataset
    Fig. 8. Partial sample of VISDrone-g dataset
    Statistical of the VISDrone-g. (a) Object scale distribution characteristics; (b) object frame length and width proportional distribution characteristics
    Fig. 9. Statistical of the VISDrone-g. (a) Object scale distribution characteristics; (b) object frame length and width proportional distribution characteristics
    Detailed explanation of COCO object detection and evaluation indexes [26]
    Fig. 10. Detailed explanation of COCO object detection and evaluation indexes [26]
    Visual contrast between DRF-RetinaNet and RetinaNet*. (a)(c)(e) DRF-RetinaNet's detection result; (b)(d)(f) RetinaNet's detection result
    Fig. 11. Visual contrast between DRF-RetinaNet and RetinaNet*. (a)(c)(e) DRF-RetinaNet's detection result; (b)(d)(f) RetinaNet's detection result
    Detection results of dim light
    Fig. 12. Detection results of dim light
    Detection results of dense objects
    Fig. 13. Detection results of dense objects
    Detection results of oblique view
    Fig. 14. Detection results of oblique view
    Detection results of down view
    Fig. 15. Detection results of down view
    αtγAP /%AP50 /%F1-score
    0.202.023.9338.1847.24
    0.252.024.3739.9548.43
    0.253.025.1442.6252.47
    0.303.024.8241.2350.72
    Table 1. Focal Loss parameter tuning
    ModuleWhether or not it contains
    RetinaNet*
    SE-ResNeXt
    Bottom-up
    GCU
    DRF detection subnet
    AP /%18.9720.2221.3322.0523.1725.14
    AP50 /%28.6530.6432.7834.3337.8342.62
    Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    Table 2. Performance comparison of model components
    MethodInput sizeBasebone NetworkAP /%AP50 /%AP75 /%AR1 /%AR10 /%AR100 /%Time /ms
    Faster R-CNN600Resnet-5016.7224.3214.154.2112.4716.65137
    R-FCN600Resnet-10119.3530.1819.525.6518.7322.56178
    SSD512Vgg-1612.2317.2911.543.7111.2215.4154
    RFB-Net512Resnet-5014.8722.1712.064.3413.1517.3875
    YOLO v3416Darknet-5314.7521.8612.174.1212.9317.4167
    RetinaNet608Resnet-5016.3523.1813.924.8514.7518.3685
    RetinaNet*608Resnet-5018.9728.6517.424.9217.2520.5288
    DRF-RetinaNet608SE-ResNeXt-5025.1442.6224.717.8224.2231.24103
    Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    Table 3. Performance comparison of each algorithm
    MethodAPsmall /%APmedium /%APlarge /%ARsmall /%ARmedium /%ARlarge /%F1-score
    Faster R-CNN7.1424.4236.7310.6226.7541.4133.15
    R-FCN9.8526.1340.2514.5732.7147.7940.67
    SSD5.8520.0334.077.6324.9738.6826.41
    RFB-Net6.6222.1834.289.5525.7740.8233.13
    YOLO v36.2522.2636.179.7225.7240.2732.73
    RetinaNet7.2723.9536.7210.3126.6342.2332.69
    RetinaNet*9.8225.3538.3114.9331.9144.8237.92
    DRF-RetinaNet13.6240.3455.9517.4249.9761.5352.47
    Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    Table 4. Performance comparison of algorithms for different scales object detection
    Xueli Xie, Chuanxiang Li, Xiaogang Yang, Jianxiang Xi, Tong Chen. Dynamic Receptive Field-Based Object Detection in Aerial Imaging[J]. Acta Optica Sinica, 2020, 40(4): 0415001
    Download Citation