• Opto-Electronic Engineering
  • Vol. 48, Issue 12, 210358 (2021)
Gou Yutao1、2、3, Ma Liang1、2、3, Song Yixuan1、2、3, Jin Lei1、2, and Lei Tao1、2、*
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • show less
    DOI: 10.12086/oee.2021.210358 Cite this Article
    Gou Yutao, Ma Liang, Song Yixuan, Jin Lei, Lei Tao. Multi-task learning for thermal pedestrian detection[J]. Opto-Electronic Engineering, 2021, 48(12): 210358 Copy Citation Text show less


    Compared with high-quality RGB images, thermal images tend to have a higher false alarm rate in pede-strian detection tasks. The main reason is that thermal images are limited by imaging resolution and spectral cha-racteristics, lacking clear texture features, while some samples have poor feature quality, which interferes with the network training. We propose a thermal pedestrian algorithm based on a multi-task learning framework, which makes the following improvements based on the multiscale detection framework. First, saliency detection tasks are introduced as an auxiliary branch with the target detection network to form a multitask learning framework, which side-step the detector's attention to illuminate salient regions and their edge information in a co-learning manner. Second, the learning weight of noisy samples is suppressed by introducing the saliency strength into the classifica-tion loss function. The detection results on the publicly available KAIST dataset confirm that our learning method can effectively reduce the log-average miss rate by 4.43% compared to the baseline, RetinaNet.