• Laser & Optoelectronics Progress
  • Vol. 61, Issue 10, 1037004 (2024)
Guoli Zhang1,2, Shuai Chang1,2,*, Yansong Song1,2, and Tianci Liu1,2
Author Affiliations
  • 1College of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • 2Institute of Space Photoelectric Technology, Changchun University of Science and Technology, Changchun 130022, Jilin, China
    DOI: 10.3788/LOP232131
    Guoli Zhang, Shuai Chang, Yansong Song, Tianci Liu. Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1037004

    Abstract

    Most current multi-spectral pedestrian detection algorithms focus on methods for fusing visible-light and infrared images, but fully fusing multi-spectral images requires a large number of parameters, which lowers detection speed. To address this problem, we propose a real-time multi-spectral pedestrian detection algorithm based on YOLOv5s. To preserve detection speed, the visible-light and infrared images are merged along the channel dimension and used as the network input, and detection accuracy is raised by improving the original algorithm. First, some standard convolutions are replaced with deformable convolutions to strengthen the network's ability to extract features from irregularly shaped objects. Second, the spatial pyramid pooling module is replaced with a multi-scale residual attention module, which weakens background interference with pedestrian targets and improves detection accuracy. Finally, by changing the connection mode and adding a large-scale feature concatenation layer, the minimum detection scale of the network is increased and the detection of small targets is improved. Experimental results show that the improved algorithm has a clear advantage in detection speed and improves mAP@0.5 and mAP@0.5:0.95 by 5.1 and 1.9 percentage points, respectively, over the original algorithm.
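The channel-direction merging described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the helper names `merge_channels` and `conv_weight_count`, the 3-channel visible plus 1-channel infrared layout, and the first-layer shape (32 output channels, 6x6 kernel) are all assumptions chosen for illustration. The sketch shows that stacking the two modalities along the channel axis only widens the first convolution, which is why this kind of input fusion adds few parameters.

```python
def merge_channels(visible, infrared):
    """Concatenate two images along the channel axis.

    Each image is a list of channels; each channel is a list of rows.
    """
    if not visible or not infrared:
        raise ValueError("both images need at least one channel")
    vis_shape = (len(visible[0]), len(visible[0][0]))
    ir_shape = (len(infrared[0]), len(infrared[0][0]))
    if vis_shape != ir_shape:
        raise ValueError("visible and infrared images must share height and width")
    # List concatenation stacks the infrared channels after the visible ones.
    return visible + infrared


def conv_weight_count(in_channels, out_channels, kernel_size):
    """Weights in a bias-free 2D convolution: k * k * C_in * C_out."""
    return kernel_size * kernel_size * in_channels * out_channels


# Hypothetical 2x2 images: 3 visible channels and 1 infrared channel.
visible = [[[10, 20], [30, 40]] for _ in range(3)]
infrared = [[[1, 2], [3, 4]]]
merged = merge_channels(visible, infrared)
print(len(merged))  # 4 channels after merging

# Hypothetical first layer (values assumed, not taken from the paper):
# 32 output channels with a 6x6 kernel.
extra = conv_weight_count(4, 32, 6) - conv_weight_count(3, 32, 6)
print(extra)  # 1152 additional weights for the extra input channel
```

Under these assumed sizes, the merged input costs only 1152 extra weights in the first layer, while the rest of the network is unchanged.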