Head motion detection based on low resolution infrared array sensor

Liang-Qin CHEN; Ming-Xuan ZENG; Zhi-Meng XU; Zhi-Zhang CHEN

doi:10.11972/j.issn.1001-9014.2023.02.019

Journals >Journal of Infrared and Millimeter Waves >Volume 42 >Issue 2 >Page 276 > Article

Journal of Infrared and Millimeter Waves
Vol. 42, Issue 2, 276 (2023)

Head motion detection based on low resolution infrared array sensor

Liang-Qin CHEN¹, Ming-Xuan ZENG¹, Zhi-Meng XU^1、*, and Zhi-Zhang CHEN^1、2

Author Affiliations

¹College of Physics and Information Engineering, Fuzhou University, Fuzhou 350108, China

²Department of Electrical and Computer Engineering, Dalhousie University, Halifax, NS B3J 1Z1, Canada

show less

DOI: 10.11972/j.issn.1001-9014.2023.02.019 Cite this Article

Liang-Qin CHEN, Ming-Xuan ZENG, Zhi-Meng XU, Zhi-Zhang CHEN. Head motion detection based on low resolution infrared array sensor[J]. Journal of Infrared and Millimeter Waves, 2023, 42(2): 276 Copy Citation Text

EndNote(RIS)

BibTex

Plain Text

show less

Fig. 1. Infrared array sensor，（a）physical view of the sensor，（b）infrared temperature image（palm）

Download full size | View in the Article

Fig. 2. System composition design

Download full size | View in the Article

Fig. 3. Flow chart of the head motion detection algorithm

Download full size | View in the Article

Fig. 4. Original image and pseudo-color image（the region of human head and shoulder）（a）origin image（32×32），（b）pseudo-color image（32×32），（c）pseudo-color image（64×64）

Download full size | View in the Article

Fig. 5. Flowchart of the head salient region extraction algorithm

Download full size | View in the Article

Fig. 6. Comparison of preprocessed results

Download full size | View in the Article

Fig. 7. 3D image fusion of a sequence of frames

Download full size | View in the Article

Fig. 8. Residual learning structure of ResNeXt network，（a）BottleNeck structure of ResNet network，（b）split-transform-merge structure of Inception network，（c）block structure of ResNeXt network

Download full size | View in the Article

Fig. 9. Pro-ResNeXt50 network

Download full size | View in the Article

Fig. 10. The training accuracy and loss of three network

Download full size | View in the Article

Fig. 11. Experience scenarios（a）Experiments in an indoor hall，（b）Experiments in a car：Experiments were conducted in an indoor hall to simulate a driving and online learning environment，as shown in Fig. 11（a）. The test user is sitting on a chair，and the sensor is fixed at the height of 1.2 m above the ground by a tripod so that it is aligned with the user’s front face. The collection distance ranges of 0.5 m to 1 m，and the collected lighting environment includes both day and night conditions.

Download full size | View in the Article

Fig. 12. Accuracy using different methods

Download full size | View in the Article

Fig. 13. Random continuous head movement steering

Download full size | View in the Article

Fig. 14. Recognition accuracy in different detection distances and light conditions

Download full size | View in the Article

Algorithm 1Adaptive Threshold
Input： the image of Canny edge detection： $I M (x, y)$
the local window size $: w s$
1. Obtain the threshold（C）value by the OTSU method $C \leftarrow t h r e s h_O T S U (I M (x, y))$
2. Obtain the image $(m I M (x, y))$ after mean filtering： $m I M (x, y) \leftarrow M e a n_f i l t e r (I M (x, y), w s)$
3. Obtain the continuous boundary image（ $s I M (x, y)$ ） $x I M (x, y) \leftarrow I M (x, y) - m I M (x, y) - C$ $s I M (x, y) \leftarrow O T S U (x I M (x, y))$
Output： the continuous boundary image $s I M (x, y)$

Table 0. [in Chinese]

View in the Article

Item	Specification
Infrared sensor model	HTPA 32×32
Camera	1
Temperature range of object	-40~85℃
Viewing angle	66°
Number of pixels	1024（32×32）
Temperature output mode	$I^{2} C$
Frame rate	5 frames/s

Table 1. HTPA infrared sensor specification parameters

View in the Article

Network	ResNet50	ResNeXt50	Pro-ResNeXt50
#params.	25.5 $\times 10^{6}$	25.0 $\times 10^{6}$	22 $. 6 \times 10^{6}$
FLOPs	4 $. 1 \times 10^{9}$	4 $. 2 \times 10^{9}$	4 $. 8 \times 10^{9}$

Table 2. Comparison of the three networks

View in the Article

Activity	Front	Bow	L45°	L90°	LC	LT	R45°	R90°	RC	RT
Total	300	280	159	154	160	170	173	167	171	182
Precision	0.947	0.989	0.962	0.961	0.981	0.970	0.948	0.964	0.982	0.962

Table 3. The Precision for each activity

View in the Article

Method

Original+

Pro-ResNeXt50

Original

+CBAM

+ResNeXt50

Channel（1）+

Pro-ResNeXt50

Channel（1，2）+

Pro-ResNeXt50

Channel（1，3）+

Pro-ResNeXt50

Channel（1，2，3）+

Pro-ResNeXt50

Accuracy

87.73%

94.47%

89.35%

92.06%

87.31%

96.76%

Table 4. Accuracy using different channels

View in the Article

Method	Accuracy	Times
ResNet50	94.10%	7f/s
ResNeXt50	94.83%	8f/s
Pro-ResNeXt50	96.76%	9f/s

Table 5. Accuracy and time using different network

View in the Article

Activity	Front	Bow	L90°	R45°	RC	RT	Average Acc
Total	267	184	205	195	189	192	—
Precision	0.948	0.902	0.942	0.910	0.963	0.958	0.937

Table 6. The Precision for each activity in the car

Liang-Qin CHEN, Ming-Xuan ZENG, Zhi-Meng XU, Zhi-Zhang CHEN. Head motion detection based on low resolution infrared array sensor[J]. Journal of Infrared and Millimeter Waves, 2023, 42(2): 276

Download Citation

Save the article for my favorites

Paper Information