• Acta Optica Sinica
  • Vol. 39, Issue 11, 1128002 (2019)
Qunli Yao1、2、*, Xian Hu1、2, and Hong Lei1
Author Affiliations
  • 1Department of Space Microwave Remote Sensing Systems, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
  • 2School of Electronics, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    DOI: 10.3788/AOS201939.1128002 Cite this Article Set citation alerts
    Qunli Yao, Xian Hu, Hong Lei. Object Detection in Remote Sensing Images Using Multiscale Convolutional Neural Networks[J]. Acta Optica Sinica, 2019, 39(11): 1128002 Copy Citation Text show less
    Target detection framework of MSCNN
    Fig. 1. Target detection framework of MSCNN
    Structures of EFPN and dilated bottleneck. (a) Structure of EFPN module; (b) dilated bottleneck structure; (c) dilated bottleneck structure with 1×1 Conv
    Fig. 2. Structures of EFPN and dilated bottleneck. (a) Structure of EFPN module; (b) dilated bottleneck structure; (c) dilated bottleneck structure with 1×1 Conv
    Network structure of EFPN-NoProj
    Fig. 3. Network structure of EFPN-NoProj
    Visual detection results of MSCNN
    Fig. 4. Visual detection results of MSCNN
    SmallMediumLarge
    (0,60](60,120](120,+¥)
    Table 1. Definition of bounding box areas based on distribution of instance scales
    MethodRICNN[25]FRCN-VGG-16[16]YOLO[15]SSD[13]R-FCN[26]FRCN-Deform[27]FPN[18]MSDN[28]MSCNN
    Airplane0.8840.8300.8740.9560.9610.9830.9640.9980.994
    Ship0.7730.7760.8470.9370.9830.8920.9310.9720.953
    Storage tank0.8530.5250.4270.6170.7250.8170.9140.8380.918
    Baseball diamond0.8810.9630.9310.9950.9940.9840.9470.9910.963
    Tennis court0.4080.6290.6580.8600.9070.8590.9440.9730.954
    Basketball court0.5850.6880.8700.9440.9780.9270.9590.9990.967
    Ground track field0.8670.9840.9750.9870.9810.9880.9900.9860.993
    Harbor0.6860.8190.8000.9500.9240.9460.9210.9720.955
    Bridge0.6150.7930.9030.9660.9340.9470.8380.9270.972
    Vehicle0.7110.6390.7040.7450.8840.8160.9000.9010.933
    mAP0.7260.7640.7990.8940.9280.9170.9310.9560.960
    Table 2. Comparison of the detection precision of different algorithms on NWPU VHR-10 dataset
    BackboneAverage precision
    IOU=0.5:0.95IOU=0.5IOU=0.75SmallMediumLarge
    RetinaNet0.6900.9450.8090.5320.5590.682
    EFPN0.7060.9600.8240.5470.5780.701
    EFPN-NoProj0.7000.9500.8190.5440.5620.698
    Table 3. Ablation experimental parameters of MSCNN
    BackboneAverage precisionAverage precision(IOU =0.5:0.95)Average recall(IOU=0.5:0.95)
    IOU=0.5:0.95IOU=0.5IOU=0.75SmallMediumLargeSmallMediumLarge
    RetinaNet0.6900.9450.8090.5320.5590.6820.5730.5860.754
    MSCNN0.7060.9600.8240.5470.5780.7010.6000.6050.755
    EFPN-NoProj0.7000.9500.8190.5440.5620.6980.5970.6000.753
    Table 4. Average precision and average recall under different IOU thresholds and different bounding box areas
    Qunli Yao, Xian Hu, Hong Lei. Object Detection in Remote Sensing Images Using Multiscale Convolutional Neural Networks[J]. Acta Optica Sinica, 2019, 39(11): 1128002
    Download Citation