• Laser & Optoelectronics Progress
  • Vol. 58, Issue 2, 0210007 (2021)
Wenqiang Zhang, Zengqiang Wang, and Liang Zhang*
Author Affiliations
  • Tianjin Key Laboratory of Advanced Signal and Image Processing, Civil Aviation University of China, Tianjin 300300, China
  • show less
    DOI: 10.3788/LOP202158.0210007 Cite this Article Set citation alerts
    Wenqiang Zhang, Zengqiang Wang, Liang Zhang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 0210007 Copy Citation Text show less
    Overall flow diagram of action representation
    Fig. 1. Overall flow diagram of action representation
    Static video frames and corresponding timing dynamic diagrams. (a) Static images; (b) timing dynamic diagrams; (c) optical flow diagrams
    Fig. 2. Static video frames and corresponding timing dynamic diagrams. (a) Static images; (b) timing dynamic diagrams; (c) optical flow diagrams
    TS-CNN network framework
    Fig. 3. TS-CNN network framework
    Recognition results of different subsequence lengths
    Fig. 4. Recognition results of different subsequence lengths
    MethodSplit1Split2Split3Accuracy
    SI84.684.985.084.8
    SOF87.389.991.089.4
    FSDI83.983.883.183.6
    BSDI84.183.384.383.9
    SDI85.786.285.585.8
    ESDI87.286.887.687.2
    SI+SOF93.294.094.293.8
    ESDI+SOF94.894.695.394.9
    Table 1. Recognition accuracy of UCF101 dataset with different input modes unit: %
    MethodSplit1Split2Split3Accuracy
    SI54.850.449.651.6
    SOF64.263.662.763.5
    FSDI50.751.453.651.9
    BSDI51.651.554.152.4
    SDI54.552.953.753.7
    ESDI53.655.555.654.9
    SI+SOF68.767.568.468.2
    ESDI+SOF69.671.271.670.8
    Table 2. Recognition accuracy of HMDB51 dataset with different input modes unit: %
    Consensus functionUCF101HMDB51
    Max93.069.1
    Average94.970.8
    Weighted average93.869.7
    Table 3. Recognition accuracy of different fusion methods on dataset unit: %
    Network structureUCF101HMDB51
    Resnet10193.668.4
    Bn-inception94.268.2
    InceptionV394.970.8
    Table 4. Recognition accuracy of different network models on dataset unit: %
    NetworkUCF101HMDB51
    Spatial stream84.851.4
    Temproral stream89.463.5
    Original two-stream88.059.4
    Ref. [19]94.069.4
    Appearance and long-sequential stream87.254.9
    Short sequential stream89.964
    TS-CNN94.970.8
    Table 5. Recognition accuracy of different human behavior recognition models unit: %
    Feature extractionMethodUCF101HMDB51
    TraditionRef. [7]84.857.2
    Ref. [8]87.961.1
    Deep learningRef. [17]88.059.4
    Ref. [21]88.6--
    Ref. [22]91.565.9
    Ref. [23]93.163.3
    Ref. [24]93.466.4
    Ref. [19]94.069.4
    Proposed94.970.8
    Table 6. Recognition accuracy of different algorithms unit: %
    Wenqiang Zhang, Zengqiang Wang, Liang Zhang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 0210007
    Download Citation