Video Classification Based on Three-Dimensional Squeeze Excitation Module

Ningxiao Li; Guodong Wang; Yanjie Wang; Shiyu Hu; Liangliang Wang

doi:10.3788/LOP56.121004

Journals >Laser & Optoelectronics Progress >Volume 56 >Issue 12 >Page 121004 > Article

Laser & Optoelectronics Progress
Vol. 56, Issue 12, 121004 (2019)

Video Classification Based on Three-Dimensional Squeeze Excitation Module

Ningxiao Li, Guodong Wang^*, Yanjie Wang, Shiyu Hu, and Liangliang Wang

Author Affiliations

College of Computer Science & Technology, Qingdao University, Qingdao, Shandong 266071, China

show less

DOI: 10.3788/LOP56.121004 Cite this Article Set citation alerts

Ningxiao Li, Guodong Wang, Yanjie Wang, Shiyu Hu, Liangliang Wang. Video Classification Based on Three-Dimensional Squeeze Excitation Module[J]. Laser & Optoelectronics Progress, 2019, 56(12): 121004 Copy Citation Text

show less

Abstract

To address the fusion problem of time sequence features in video classification, this paper proposes a new three-dimensional (3D) squeezing excitation (SE) network structure module that is constructed by combining the SE network in a two-dimensional convolutional neural network (CNN) with a 3D convolutional residual network. The new module adds an extra time-dimension coefficient to the coefficient set of a directly transformed 3D SE module, allowing it to record the changes in the motion trajectories of the research objects on time trajectories. The proposed module can not only record the characteristics of a specific time point, but also strengthen the relevance of multiple time points. To assess the effectiveness of the module, an SE network with a spatial and temporal latitude was used to perform character-action-behavior recognition. The experimental results indicate that the module can accelerate the loss convergence and effectively improve the accuracy of video classification.

Keywords

deep learning image processing residual network signal processing squeeze excitation three-dimensional convolution video classification

Ningxiao Li, Guodong Wang, Yanjie Wang, Shiyu Hu, Liangliang Wang. Video Classification Based on Three-Dimensional Squeeze Excitation Module[J]. Laser & Optoelectronics Progress, 2019, 56(12): 121004

Download Citation

Set citation alerts for the article

Tools

Set citation alerts for the article

Save the article for my favorites

Paper Information