• Acta Optica Sinica
  • Vol. 40, Issue 5, 0504001 (2020)
Bin Zhao, Chunping Wang*, Qiang Fu, and Yichao Chen
Author Affiliations
  • Department of Electronic and Optical Engineering, Shijiazhuang Campus of Army Engineering University, Shijiazhuang, Hebei 050003, China
  • show less
    DOI: 10.3788/AOS202040.0504001 Cite this Article Set citation alerts
    Bin Zhao, Chunping Wang, Qiang Fu, Yichao Chen. Multi-Scale Infrared Pedestrian Detection Based on Deep Attention Mechanism[J]. Acta Optica Sinica, 2020, 40(5): 0504001 Copy Citation Text show less
    Characteristic of pedestrian in U-FOV infrared images. (a) Large and medium scale pedestrians; (b) small scale pedestrians
    Fig. 1. Characteristic of pedestrian in U-FOV infrared images. (a) Large and medium scale pedestrians; (b) small scale pedestrians
    Architecture of multi-scale infrared pedestrian detection network based on Darknet53
    Fig. 2. Architecture of multi-scale infrared pedestrian detection network based on Darknet53
    Architecture of attention module
    Fig. 3. Architecture of attention module
    Residual module
    Fig. 4. Residual module
    Principle of pedestrian detection
    Fig. 5. Principle of pedestrian detection
    Learning rate and loss curves. (a) Learning rate on Caltech dataset; (b) loss on Caltech dataset; (c) learning rate on U-FOV dataset; (d) loss on U-FOV dataset
    Fig. 6. Learning rate and loss curves. (a) Learning rate on Caltech dataset; (b) loss on Caltech dataset; (c) learning rate on U-FOV dataset; (d) loss on U-FOV dataset
    Salient coefficient and feature maps
    Fig. 7. Salient coefficient and feature maps
    Distribution of pedestrian size in U-FOV test set
    Fig. 8. Distribution of pedestrian size in U-FOV test set
    Visualization results of infrared pedestrian detection
    Fig. 9. Visualization results of infrared pedestrian detection
    P-R curves under different IoU thresholds. (a) IoU threshold is 0.3; (b) IoU threshold is 0.45; (c) IoU threshold is 0.5; (d) IoU threshold is 0.7
    Fig. 10. P-R curves under different IoU thresholds. (a) IoU threshold is 0.3; (b) IoU threshold is 0.45; (c) IoU threshold is 0.5; (d) IoU threshold is 0.7
    Visualization results of infrared pedestrian detection on LTIR dataset at different scenes
    Fig. 11. Visualization results of infrared pedestrian detection on LTIR dataset at different scenes
    MethodBackboneDatasetAP
    IoU is 0.3IoU is 0.45IoU is 0.5IoU is 0.7
    Faster R-CNNResNet101U-FOV IR0.5932
    SSDMobilenet_v1U-FOV IR0.5584
    R-FCNResNet101U-FOV IR0.6312
    YOLOv3Darknet53U-FOV IR0.65950.66710.66280.6461
    MS-IRPDDarknet53U-FOV IR0.88800.88700.88280.8511
    MS-IRPDDarknet53Caltech+IR0.90570.90780.90840.8961
    MS-IRPD-attentionDarknet53Caltech+IR0.92950.93450.93400.9120
    Table 1. Average precision of pedestrian detection under different IoU thresholds
    MethodYOLOv3YOLOv3+AttentionMS-IRPD-Attention
    Total time /s90.7595.32125.21
    FPS7.286.935.28
    Table 2. Total times of U-FOV test set
    Bin Zhao, Chunping Wang, Qiang Fu, Yichao Chen. Multi-Scale Infrared Pedestrian Detection Based on Deep Attention Mechanism[J]. Acta Optica Sinica, 2020, 40(5): 0504001
    Download Citation