• Opto-Electronic Engineering
  • Vol. 49, Issue 4, 210363 (2022)
Liang Ma1、2、3, Yutao Gou1、2、3, Tao Lei1、2、*, Lei Jin1、2, and Yixuan Song1、2、3
Author Affiliations
  • 1Photoelectric Detection Technology Laboratory, Chinese Academy of Sciences, Chengdu, Sichuan 610209, China
  • 2Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu, Sichuan 610209, China
  • 3University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    DOI: 10.12086/oee.2022.210363 Cite this Article
    Liang Ma, Yutao Gou, Tao Lei, Lei Jin, Yixuan Song. Small object detection based on multi-scale feature fusion using remote sensing images[J]. Opto-Electronic Engineering, 2022, 49(4): 210363 Copy Citation Text show less
    Complex background in remote sensing images
    Fig. 1. Complex background in remote sensing images
    Network framework
    Fig. 2. Network framework
    Network structure
    Fig. 3. Network structure
    Feature weighting method based on grouped convolution
    Fig. 4. Feature weighting method based on grouped convolution
    (a) Schematic diagram of convolutional network receptive field; (b) Object classification strategy based on receptive field
    Fig. 5. (a) Schematic diagram of convolutional network receptive field; (b) Object classification strategy based on receptive field
    Object scale distribution of the dataset
    Fig. 6. Object scale distribution of the dataset
    Sample of plane and small-vehicle image of DOTA dataset used in the experiment. (a) Training set; (b) Testing set
    Fig. 7. Sample of plane and small-vehicle image of DOTA dataset used in the experiment. (a) Training set; (b) Testing set
    Objects cut and copy flow diagram
    Fig. 8. Objects cut and copy flow diagram
    The loss curve of the network trained on the DOTA plane training set
    Fig. 9. The loss curve of the network trained on the DOTA plane training set
    The loss curve of the network trained on the DOTA small-vehicle training set
    Fig. 10. The loss curve of the network trained on the DOTA small-vehicle training set
    Partial plane test results. Yellow circles represent false alarms and green circles represent missed detection.
    Fig. 11. Partial plane test results. Yellow circles represent false alarms and green circles represent missed detection.
    Partial small-vehicle test results. Yellow circles represent false alarms and green circles represent missed detection.
    Fig. 12. Partial small-vehicle test results. Yellow circles represent false alarms and green circles represent missed detection.
    Model convergence under different initial values of fusion factors
    Fig. 13. Model convergence under different initial values of fusion factors
    模型参数量M
    VGG16138
    ResNet5025.6
    ResNet10144.6
    Ours0.49
    Table 1. Parameters of different networks
    金字塔层数检测目标尺寸下采样倍数感受野感受野步长感受野/目标尺寸
    两层16~2545543.5
    225~5089582.5
    三层16~1022322.9
    210~2044743.1
    320~5087982.3
    Table 2. Receptive field of feature map and corresponding object size parameters
    Basic unitFPNmAPPrecisionRecall
    两层B_11-86.876.888.9
    B_1187.282.688.8
    三层B_10-87.447.491.8
    B_1088.583.890.4
    Table 3. Detection results of different feature fusion schemes
    B_13FPN分组数量(3)特征图通道(channel)常数融合因子[0.71,0.87]mAPPrecisionRecall
    ----80.563.482.8
    ---82.081.485.1
    --82.385.184.5
    --83.685.587.0
    --82.582.385.6
    Table 4. DOTA plane dataset test results under different network configurations
    B_12FPN分组数量(3)特征图通道(channel)常数融合因子[0.63,1.28]mAPPrecisionRecall
    ----63.756.873.9
    ---65.986.168.5
    --66.383.368.9
    --68.786.471.7
    --64.484.067.3
    Table 5. DOTA small-vehicle dataset test results under different network configurations
    B_10FPN分组数量(3)特征图通道(channel)常数融合因子[1.08,1.05]mAPPrecisionRecall
    ----89.944.293.7
    ---90.283.691.4
    --90.684.892.0
    --91.087.792.4
    --未收敛未收敛未收敛
    Table 6. Test results of our dataset under different network configurations
    数据集尺度目标数量常数融合因子 (Si+1Si)
    DOTA飞机训练集S1[6-12] 55030.87
    S2[12-30] 4807
    S3[30-70] 34280.71
    DOTA小汽车训练集S1[6-15] 598751.28
    S2[15-25] 76615
    S3[25-60] 482030.63
    自建数据训练集S1[6-10] 49631.05
    S2[10-20] 5196
    S3[20-50] 56251.08
    Table 7. Statistics of the distribution of each scale objects number
    融合因子初始值mAP
    183.6
    随机初始化80.7
    Table 8. Influence of initial value of fusion factor on detection performance
    模型+数据集mAPPrecisionRecall推理速度/(s/张)
    B_10+FPN+CBAM(自建数据集)90.583.890.60.036
    B_10+FPN+自适应融合模块(自建数据集)91.087.792.40.027
    B_13+FPN+CBAM(DOTA飞机数据集)83.082.685.80.048
    B_13+FPN+自适应融合模块(DOTA飞机数据集)83.685.587.00.037
    B_12+FPN+CBAM(DOTA小汽车数据集)67.683.071.10.043
    B_12+FPN+自适应融合模块(DOTA小汽车数据集)68.783.371.70.034
    Table 9. Influence of CBAM and adaptive fusion module on detection performance
    方法DOTA飞机数据集(mAP)DOTA小汽车数据集(mAP)自建数据集(mAP)
    SSD63.443.364.4
    RetinaNet55.245.162.7
    Yolov3-tiny70.858.374.3
    Faster R-CNN73.059.088.6
    Ours83.668.791.0
    Table 10. Comparison of detection performance of different methods
    Backbone+数据集mAP
    ResNet50+FPN(自建数据集)88.6
    ResNet50+自适应融合模块(自建数据集)89.7
    ResNet50+FPN(DOTA飞机数据集)73.0
    ResNet50+自适应融合模块(DOTA飞机数据集)73.8
    ResNet50+FPN(DOTA小汽车数据集)59.0
    ResNet50+自适应融合模块(DOTA小汽车数据集)63.2
    Table 11. Performance of FPN module based on adaptive feature weighted fusion on Faster R-CNN
    Liang Ma, Yutao Gou, Tao Lei, Lei Jin, Yixuan Song. Small object detection based on multi-scale feature fusion using remote sensing images[J]. Opto-Electronic Engineering, 2022, 49(4): 210363
    Download Citation