• Journal of Infrared and Millimeter Waves
  • Vol. 42, Issue 2, 276 (2023)
Liang-Qin CHEN1, Ming-Xuan ZENG1, Zhi-Meng XU1、*, and Zhi-Zhang CHEN1、2
Author Affiliations
  • 1College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China
  • 2Department of Electrical and Computer Engineering, Dalhousie University, Halifax, NS B3J 1Z1, Canada
  • show less
    DOI: 10.11972/j.issn.1001-9014.2023.02.019 Cite this Article
    Liang-Qin CHEN, Ming-Xuan ZENG, Zhi-Meng XU, Zhi-Zhang CHEN. Head motion detection based on low resolution infrared array sensor[J]. Journal of Infrared and Millimeter Waves, 2023, 42(2): 276 Copy Citation Text show less
    Infrared array sensor,(a)physical view of the sensor,(b)infrared temperature image(palm)
    Fig. 1. Infrared array sensor,(a)physical view of the sensor,(b)infrared temperature image(palm)
    System composition design
    Fig. 2. System composition design
    Flow chart of the head motion detection algorithm
    Fig. 3. Flow chart of the head motion detection algorithm
    Original image and pseudo-color image(the region of human head and shoulder)(a)origin image(32×32),(b)pseudo-color image(32×32),(c)pseudo-color image(64×64)
    Fig. 4. Original image and pseudo-color image(the region of human head and shoulder)(a)origin image(32×32),(b)pseudo-color image(32×32),(c)pseudo-color image(64×64)
    Flowchart of the head salient region extraction algorithm
    Fig. 5. Flowchart of the head salient region extraction algorithm
    Comparison of preprocessed results
    Fig. 6. Comparison of preprocessed results
    3D image fusion of a sequence of frames
    Fig. 7. 3D image fusion of a sequence of frames
    Residual learning structure of ResNeXt network,(a)BottleNeck structure of ResNet network,(b)split-transform-merge structure of Inception network,(c)block structure of ResNeXt network
    Fig. 8. Residual learning structure of ResNeXt network,(a)BottleNeck structure of ResNet network,(b)split-transform-merge structure of Inception network,(c)block structure of ResNeXt network
    Pro-ResNeXt50 network
    Fig. 9. Pro-ResNeXt50 network
    The training accuracy and loss of three network
    Fig. 10. The training accuracy and loss of three network
    Experience scenarios(a)Experiments in an indoor hall,(b)Experiments in a car:Experiments were conducted in an indoor hall to simulate a driving and online learning environment,as shown in Fig. 11(a). The test user is sitting on a chair,and the sensor is fixed at the height of 1.2 m above the ground by a tripod so that it is aligned with the user’s front face. The collection distance ranges of 0.5 m to 1 m,and the collected lighting environment includes both day and night conditions.
    Fig. 11. Experience scenarios(a)Experiments in an indoor hall,(b)Experiments in a car:Experiments were conducted in an indoor hall to simulate a driving and online learning environment,as shown in Fig. 11(a). The test user is sitting on a chair,and the sensor is fixed at the height of 1.2 m above the ground by a tripod so that it is aligned with the user’s front face. The collection distance ranges of 0.5 m to 1 m,and the collected lighting environment includes both day and night conditions.
    Accuracy using different methods
    Fig. 12. Accuracy using different methods
    Random continuous head movement steering
    Fig. 13. Random continuous head movement steering
    Recognition accuracy in different detection distances and light conditions
    Fig. 14. Recognition accuracy in different detection distances and light conditions

    Algorithm 1Adaptive Threshold

    Input the image of Canny edge detection:IM(x,y)

                the local window size: ws

    1. Obtain the threshold(C)value by the OTSU method

    C  thresh_OTSU(IM(x,y))

    2. Obtain the image(mIMx,y)after mean filtering:

    mIMx,y  Mean_filter(IM(x,y),ws)

    3. Obtain the continuous boundary image(sIMx,y

    xIMx,y IMx,y-mIMx,y-C

    sIMx,yOTSU(xIMx,y)

    Output the continuous boundary image sIMx,y

    Table 0. [in Chinese]
    ItemSpecification
    Infrared sensor modelHTPA 32×32
    Camera1
    Temperature range of object-40~85℃
    Viewing angle66°
    Number of pixels1024(32×32)
    Temperature output modeI2C
    Frame rate5 frames/s
    Table 1. HTPA infrared sensor specification parameters
    NetworkResNet50ResNeXt50Pro-ResNeXt50
    #params.25.5×10625.0×10622.6×106
    FLOPs4.1×1094.2×1094.8×109
    Table 2. Comparison of the three networks
    ActivityFrontBowL45°L90°LCLTR45°R90°RCRT
    Total300280159154160170173167171182
    Precision0.9470.9890.9620.9610.9810.9700.9480.9640.9820.962
    Table 3. The Precision for each activity
    Method

    Original+

    Pro-ResNeXt50

    Original

    +CBAM

    +ResNeXt50

    Channel(1)+

    Pro-ResNeXt50

    Channel(1,2)+

    Pro-ResNeXt50

    Channel(1,3)+

    Pro-ResNeXt50

    Channel(1,2,3)+

    Pro-ResNeXt50

    Accuracy87.73%94.47%89.35%92.06%87.31%96.76%
    Table 4. Accuracy using different channels
    MethodAccuracyTimes
    ResNet5094.10%7f/s
    ResNeXt5094.83%8f/s
    Pro-ResNeXt5096.76%9f/s
    Table 5. Accuracy and time using different network
    ActivityFrontBowL90°R45°RCRTAverage Acc
    Total267184205195189192
    Precision0.9480.9020.9420.9100.9630.9580.937
    Table 6. The Precision for each activity in the car
    Liang-Qin CHEN, Ming-Xuan ZENG, Zhi-Meng XU, Zhi-Zhang CHEN. Head motion detection based on low resolution infrared array sensor[J]. Journal of Infrared and Millimeter Waves, 2023, 42(2): 276
    Download Citation