• Laser & Optoelectronics Progress
  • Vol. 59, Issue 12, 1215016 (2022)
Qisheng Wang1、2、3, Fengsui Wang1、2、3、*, Jingang Chen1、2、3, and Furong Liu1、2、3
Author Affiliations
  • 1School of Electrical Engineering, Anhui Polytechnic University, Wuhu 241000, Anhui , China
  • 2Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu 241000, Anhui , China
  • 3Key Laboratory of Advanced Perception and Intelligent Control of High-End Equipment, Ministry of Education, Wuhu 241000, Anhui , China
  • show less
    DOI: 10.3788/LOP202259.1215016 Cite this Article Set citation alerts
    Qisheng Wang, Fengsui Wang, Jingang Chen, Furong Liu. Faster R-CNN Target-Detection Algorithm Fused with Adaptive Attention Mechanism[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215016 Copy Citation Text show less
    Structure diagram of MA
    Fig. 1. Structure diagram of MA
    Comparison diagram of test results. (a) Original algorithm detection results; (b) Detection results of improved algorithm based on MA
    Fig. 2. Comparison diagram of test results. (a) Original algorithm detection results; (b) Detection results of improved algorithm based on MA
    Layer nameOutput sizeResNet-50ResNet-50_MA*ResNet-50_MA
    Conv1112×1127×7,64,stride 2
    Attention 0112×112MA
    Conv2_x56×563×3 max pool,stride 2
    1×1,643×3,641×1,256×3
    Conv3_x28×281×1,1283×3,1281×1,512×4
    Conv4_x14×141×1,2563×3,2561×1,1024×6
    Conv5_x7×71×1,5123×3,5121×1,2048×3
    Attention 17×7MA*
    1×1Average pool,fc,softmax
    Table 1. Comparison of modified ResNet-50 structure based on MA
    NameModel number
    SystemWindows 10
    CPUCore i9-10900 @ 3.7 GHz
    GPUNvidia GeForce RTX 2080Ti(11 GB)
    FrameworkPytorch 1.2.0
    LanguagePython
    Table 2. Experimental environment configuration
    DataTrainvalTest
    ImagesObjectsImagesObjects
    VOC2007501112608495212032
    VOC2012115402745000
    Total1655140058495212032
    Table 3. PASCAL VOC dataset training and test data statistics
    CategoryFRFR+MA*FR+MA
    mAP74.2575.01+0.7675.16+0.91
    Cat87.2387.31+0.0889.28+2.05
    Car86.6086.33-0.2785.85-0.75
    Horse84.8786.14+1.2786.50+1.63
    Dog83.6282.87-0.7584.29+0.67
    Bus80.2982.55+2.2683.45+3.16
    Train82.4983.12+0.6382.96+0.47
    Motorbike83.4883.62+0.1483.07-0.41
    Bicycle79.5683.14+3.5882.72+3.16
    Person79.7580.09+0.3480.38+0.63
    Aeroplane79.7576.09-3.6674.28-5.47
    Sheep75.2278.26+3.0473.20-2.02
    Bird74.1476.45+2.3177.53+3.39
    Cow74.7378.78+4.0574.70-0.03
    Tvmonitor74.3473.33-1.0173.09-1.25
    Diningtable72.2671.63-0.6373.96+1.70
    Sofa70.4475.12+4.6873.31+2.87
    Boat65.6962.92-2.7766.60+0.91
    Chair54.1951.53-2.6656.31+2.12
    Bottle52.0657.03+4.9756.48+4.42
    Pottedplant45.9643.85-2.1145.25-0.71
    Table 4. Comparison of detection accuracy of 20 target classes
    CategoryFRFR+MA1FR+MA2
    mAP74.2574.61+0.3674.70+0.45
    Cat87.2387.10-0.1390.07+2.84
    Car86.6085.94-0.6684.66-1.94
    Horse84.8785.20+0.3385.90+1.03
    Dog83.6286.41+2.7985.11+1.49
    Bus80.2983.17+2.8885.36+5.07
    Train82.4981.98-0.5182.09-0.40
    Motorbike83.4881.38-2.1081.37-2.11
    Bicycle79.5681.80+2.2483.38+3.82
    Person79.7578.64-1.1179.61-0.14
    Aeroplane79.7577.68-2.0776.20-3.55
    Sheep75.2275.62+0.4073.97-1.25
    Bird74.1475.58+1.4475.87+1.73
    Cow74.7378.09+3.3673.06-1.67
    Tvmonitor74.3472.20-2.1473.79-0.55
    Diningtable72.2670.78-1.4870.58-1.68
    Sofa70.4474.22+3.7873.24+2.80
    Boat65.6966.58+0.8966.29+0.66
    Chair54.1953.49-0.7051.81-2.38
    Bottle52.0651.17-0.8955.37+3.31
    Pottedplant45.9645.20-0.7646.21+0.25
    Table 5. Comparison of experimental results of ablation of MA
    Qisheng Wang, Fengsui Wang, Jingang Chen, Furong Liu. Faster R-CNN Target-Detection Algorithm Fused with Adaptive Attention Mechanism[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215016
    Download Citation