• Acta Photonica Sinica
  • Vol. 49, Issue 8, 0810002 (2020)
Hao-nan AN1, Ming ZHAO1、2, Sheng-da PAN1, and Chang-qing LIN2
Author Affiliations
  • 1College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
  • 2Key Laboratory of Intelligent Infrared Perception, Chinese Academy of Sciences, Shanghai 200083, China
  • show less
    DOI: 10.3788/gzxb20204908.0810002 Cite this Article
    Hao-nan AN, Ming ZHAO, Sheng-da PAN, Chang-qing LIN. Infrared Target Detection Algorithm Based on Pseudo Multimodal Images[J]. Acta Photonica Sinica, 2020, 49(8): 0810002 Copy Citation Text show less
    Structure diagram of the proposed algorithm
    Fig. 1. Structure diagram of the proposed algorithm
    Process of generating pseudo visible light image using CycleGAN dual cycle countermeasure
    Fig. 2. Process of generating pseudo visible light image using CycleGAN dual cycle countermeasure
    Residual module in bimodal feature extraction network
    Fig. 3. Residual module in bimodal feature extraction network
    Improved residual network of bimodal feature extraction
    Fig. 4. Improved residual network of bimodal feature extraction
    Feature vector of image obtained by residual network
    Fig. 5. Feature vector of image obtained by residual network
    Dataset sample images
    Fig. 6. Dataset sample images
    Infrared image and its corresponding pseudo visible image
    Fig. 7. Infrared image and its corresponding pseudo visible image
    Average accuracy of each iteration of training vehicle and person on training set
    Fig. 8. Average accuracy of each iteration of training vehicle and person on training set
    Detection result on FLIR-ADAS data set
    Fig. 9. Detection result on FLIR-ADAS data set
    Detection result on SODA data set
    Fig. 10. Detection result on SODA data set
    Algorithm:PMFD: Pse-model fused detection
    Input:(1)Infrared image training set:{(xiyi)}i=1n
    (2)Generator of I2I framework :WI2R
    (3)stage1:IR Pre-trained:WIR
    (4)stage2:FRGB Pre-trained:WFRGB
    (5)stage3:Fusion Pre-train:WADD
    Output: Trained PMFD model, F (g)
    for num_epoches do
        for xii = 1,…,n do
            Through I2I framework generate a pseudo RGB $\hat{x}_{i}$ using WI2R
            Then the infrared image xi and its corresponding pseudo RGB image $\hat{x}_{i}$ are input into the respective training channels
            using WIR and WFRGB, generate fusion vector (Tensor in Fig. 1)
            Pass the fusion vector to Fusion Pre-train network using WADD
            Update WI2R, WIR, WFRGB, WADD by minimizing Loss function of the PMFD model
        end
    end
    Table 1. Algorithm detailed process
    Data setsNumber of samples
    Training set8 140
    Validation set2 326
    Testing set1 163
    Total11 629
    Table 2. Dataset distribution
    NetworksPrecisionRecallF1-scoremAP
    SSDAll0.5640.8360.6740.571
    Car0.6030.8810.7160.628
    Person0.5250.7900.6310.514
    Faster-RCNNAll0.5810.8410.6870.612
    Car0.6250.8800.7310.676
    Person0.5360.8010.6420.547
    BaselineAll0.6080.8920.7230.786
    Car0.6300.9090.7440.82
    Person0.5860.8740.7010.752
    MMTODAll0.6290.8930.7380.800
    Car0.6400.9020.7490.835
    Person0.6180.8840.7270.765
    PMFDAll0.6250.9090.7410.813
    Car0.6380.9230.7540.839
    Person0.6110.8940.7260.786
    Table 3. Experimental results on test set
    Hao-nan AN, Ming ZHAO, Sheng-da PAN, Chang-qing LIN. Infrared Target Detection Algorithm Based on Pseudo Multimodal Images[J]. Acta Photonica Sinica, 2020, 49(8): 0810002
    Download Citation