• Laser & Optoelectronics Progress
  • Vol. 59, Issue 18, 1815015 (2022)
Yu Li1、2, Shaoyan Gai1、2、3, Feipeng Da1、2、3、*, and Ru Hong1、2
Author Affiliations
  • 1School of Automation, Southeast University, Nanjing 210096, Jiangsu, China
  • 2Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education,Southeast University, Nanjing 210096, Jiangsu, China
  • 3Shenzhen Research Institute, Southeast University, Shenzhen 518063, Guangdong, China
  • show less
    DOI: 10.3788/LOP202259.1815015 Cite this Article Set citation alerts
    Yu Li, Shaoyan Gai, Feipeng Da, Ru Hong. Object Detection Based on Semantic Sampling and Localization Refinement[J]. Laser & Optoelectronics Progress, 2022, 59(18): 1815015 Copy Citation Text show less
    Comparison of object detection architectures. (a) Architecture of RepPoints[14]; (b) architecture of proposed detector
    Fig. 1. Comparison of object detection architectures. (a) Architecture of RepPoints[14]; (b) architecture of proposed detector
    Overall architecture of proposed object detector. Dashed box: semantic based positioning module; dashed box: feature enhancement module
    Fig. 2. Overall architecture of proposed object detector. Dashed box: semantic based positioning module; dashed box: feature enhancement module
    Architecture of localization subnet and classification subnet
    Fig. 3. Architecture of localization subnet and classification subnet
    Training samplers for object detection. (a) Center-based training sampler; (b) semantic-based training sampler
    Fig. 4. Training samplers for object detection. (a) Center-based training sampler; (b) semantic-based training sampler
    Qualitative results of proposed method on different datasets. (a), (b) VOC 2007; (c), (d) MS COCO
    Fig. 5. Qualitative results of proposed method on different datasets. (a), (b) VOC 2007; (c), (d) MS COCO
    MethodBackboneAPAP50AP75APsAPmAPlTime /ms
    RetinaNet22ResNet-10139.159.142.321.842.750.290
    FCOS6ResNet-10141.560.745.024.444.851.691
    FSAF5ResNet-10140.961.544.024.044.251.3138
    ATSS10ResNet-10143.662.147.426.147.053.693
    RepPoints14ResNet-10141.062.944.323.644.151.787
    CornerNet4Hourglass-10440.556.543.119.442.753.9227
    ExtremeNet7Hourglass-10440.255.543.220.443.253.1348
    CenterNet8Hourglass-10442.161.145.924.145.552.8298
    LSNet21ResNeXt-10143.963.147.826.647.155.4138
    ProposedResNet-10142.865.146.326.147.355.878
    Table 1. Experimental result comparison of proposed method to other methods on MS COCO test-dev
    MethodAPAP50AP75APsAPmAPlTime /ms
    Baseline RepPoints39.160.542.121.241.250.085
    RepPoints + LB40.461.943.623.143.852.485
    RepPoints + LB + FE41.062.644.324.344.653.386
    RepPoints + EO39.261.742.221.442.850.676
    Proposed(LB + FE+EO)41.263.044.224.645.053.977
    Table 2. Experimental result comparison of integrating three modules into RepPoints
    Conversion function NLAPAP50AP75Time /ms
    1×1 conv40.460.243.685
    Max-pooling40.360.143.485
    Max-pooling + avg-pooling40.460.343.886
    Table 3. Comparison of different transformation conversions
    ϵLAPAP50AP75
    0.00039.159.942.1
    0.00539.860.142.1
    0.01040.460.243.6
    0.05040.260.143.3
    0.10039.859.942.9
    Table 4. Detection results with different location thresholds
    MethodEOFLOPs /GParam /MFPS /(frame·s-1Time /msAP
    RepPoints190.1636.6211.88539.1
    190.0635.9713.17639.2
    Proposed190.1836.6211.68641.0
    190.0735.9712.97741.2
    Table 5. Impact of applying efficient refinement optimization to different networks
    Yu Li, Shaoyan Gai, Feipeng Da, Ru Hong. Object Detection Based on Semantic Sampling and Localization Refinement[J]. Laser & Optoelectronics Progress, 2022, 59(18): 1815015
    Download Citation