• Laser & Optoelectronics Progress
  • Vol. 58, Issue 24, 2420001 (2021)
Fengsui Wang1、2、3、*, Qisheng Wang1、2、3, Jingang Chen1、2、3, and Furong Liu1、2、3
Author Affiliations
  • 1School of Electrical Engineering, Anhui Polytechnic University, Wuhu, Anhui 241000, China
  • 2Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu, Anhui 241000, China
  • 3Key Laboratory of Advanced Perception and Intelligent Control of High-end Equipment, Ministry of Education, Wuhu, Anhui 241000, China
  • show less
    DOI: 10.3788/LOP202158.2420001 Cite this Article Set citation alerts
    Fengsui Wang, Qisheng Wang, Jingang Chen, Furong Liu. Improved Faster R-CNN Target Detection Algorithm Based on Attention Mechanism and Soft-NMS[J]. Laser & Optoelectronics Progress, 2021, 58(24): 2420001 Copy Citation Text show less
    Faster R-CNN target detection model
    Fig. 1. Faster R-CNN target detection model
    Structure of improved convolutional attention mechanism
    Fig. 2. Structure of improved convolutional attention mechanism
    Improved Faster R-CNN model
    Fig. 3. Improved Faster R-CNN model
    Comparison of false detection rate. (a) Error detection rate statistics of original algorithm; (b) error detection rate statistics of improved algorithm
    Fig. 4. Comparison of false detection rate. (a) Error detection rate statistics of original algorithm; (b) error detection rate statistics of improved algorithm
    Image detection comparison. (a) Detection results of original algorithm; (b) detection results of improved algorithm
    Fig. 5. Image detection comparison. (a) Detection results of original algorithm; (b) detection results of improved algorithm
    DatasetTrainingTest
    ImageObjectImageObject
    VOC2007501112608495212032
    VOC2012115402745000
    Total1655140058495212032
    Table 1. Statistics of training and test data in VOC2007 and VOC2012 datasets
    Serial numberModelBackboneChannel attentionSpatial attentionDimension reductionNMSSoft-NMS
    0FR[11]VGG-16--------
    1FRResNet-50--------
    2+SoftResNet-50--------
    3+SEResNet-50----
    4+ECAResNet-50------
    5+CBAMResNet-50--
    6Ours1ResNet-50----
    7Ours2ResNet-50----
    Table 2. Comparison of experimental model structure
    CategoryFROurs1VariationCategorySSDSSD+VariationCategoryYOLOv4YOLOv4+Variation
    Cat87.289.2+2.0Cat89.789.6-0.1Cat90.390.2-0.1
    Car86.685.4-1.2Car84.985.1+0.2Car94.895.1+0.3
    Horse84.986.6+1.7Horse89.189.4+0.3Horse91.791.0-0.7
    Dog83.685.6+2.0Dog85.786.1+0.4Dog87.188.9+1.8
    Bus80.385.3+5.0Bus84.283.9-0.3Bus91.392.6+1.3
    Train82.582.50Train87.187.2+0.1Train93.192.6-0.5
    Motorbike83.581.0-2.5Motorbike84.284.5+0.3Motorbike92.192.10
    Bicycle79.680.9+1.3Bicycle86.787.1+0.4Bicycle90.791.1+0.5
    Person79.879.7-0.1Person81.481.2-0.2Person91.291.1-0.1
    Aeroplane78.177.6-0.5Aeroplane76.277.5+1.3Aeroplane87.990.4+2.5
    CategoryFROurs1VariationCategorySSDSSD+VariationCategoryYOLOv4YOLOv4+Variation
    Sheep75.276.9+1.7Sheep75.377.1+1.8Sheep87.487.7+0.3
    Bird74.175.3+1.2Bird75.475.0-0.4Bird87.486.0-1.4
    Cow74.775.1+0.4Cow78.579.6+1.1Cow91.592.1+0.6
    Tvmonitor73.372.5-0.8Tvmonitor76.676.7+0.1Tvmonitor89.090.1+1.1
    Diningtable72.373.0+0.7Diningtable76.380.1+3.8Diningtable80.981.9+1.0
    Sofa70.475.2+4.8Sofa78.080.4+1.6Sofa77.179.2+2.1
    Boat65.766.4+1.3Boat67.366.4-0.9Boat75.479.0+3.6
    Chair54.253.4-0.8Chair59.761.2+0.5Chair73.271.9-1.3
    Bottle52.153.6+1.5Bottle49.248.8-0.4Bottle82.180.8-1.3
    Pottedplant46.044.4-1.6Pottedplant47.247.6+0.4Pottedplant60.060.6+0.6
    mAP74.275.0+0.8mAP76.677.2+0.6mAP85.786.2+0.5
    Table 3. Results of validation experiment of improved attention mechanism unit: %
    ModelBackboneInput image size /(pixel×pixel)mAP /%
    FR[11]VGG-16~600×100073.2
    FRResNet-50~600×100074.2
    +SoftResNet-50~600×100074.5
    +SEResNet-50~600×100075.2
    +ECAResNet-50~600×100074.8
    +CBAMResNet-50~600×100074.8
    Ours1ResNet-50~600×100075.0
    Ours2ResNet-50~600×100075.9
    Table 4. MAP comparison of experimental models on VOC2007 test set
    CategoryFR[11]FR+Soft+SE+ECA+CBAMOurs1Ours2
    Cat86.487.287.489.988.687.489.290.1
    Car84.786.687.185.685.285.185.486.1
    Horse84.684.985.286.686.186.686.686.8
    Dog84.883.684.087.884.182.685.686.8
    Bus83.180.380.383.184.984.785.385.3
    Train83.082.584.284.181.983.082.584.4
    Motorbike77.583.582.982.583.480.881.082.5
    Bicycle79.079.680.481.583.381.580.982.6
    Person76.779.880.581.079.680.879.781.3
    Aeroplane76.578.178.975.978.476.977.678.7
    Sheep73.675.274.473.073.570.576.977.3
    Bird70.974.174.276.075.975.875.376.8
    Cow81.974.775.077.574.077.275.175.9
    Tvmonitor72.673.374.472.173.075.472.573.1
    Diningtable65.772.372.071.772.572.473.073.6
    Sofa73.970.470.973.571.171.675.276.2
    Boat65.565.765.966.166.466.166.466.7
    Chair52.054.254.153.653.754.153.454.1
    Bottle52.152.152.659.154.856.053.654.0
    Pottedplant38.846.046.044.245.945.644.446.1
    mAP73.274.274.575.274.874.875.075.9
    Table 5. AP comparison of 20 target classes unit: %
    AlgotithmBackbonemAP /%
    Ref. [11]VGG-1673.2
    Ref. [22]ResNet-10176.4
    Ref. [23]ResNet-5074.4
    Ref. [24]ResNet-10174.4
    YOLO[6]Darknet63.4
    SSD300[7]VGG-1674.3
    Ref. [25]DiCENet68.4
    Ref. [26]ResNet-5074.4
    Ours2ResNet-5075.9
    Table 6. mAP comparison of different algorithms
    Fengsui Wang, Qisheng Wang, Jingang Chen, Furong Liu. Improved Faster R-CNN Target Detection Algorithm Based on Attention Mechanism and Soft-NMS[J]. Laser & Optoelectronics Progress, 2021, 58(24): 2420001
    Download Citation