• Acta Optica Sinica
  • Vol. 38, Issue 8, 0810001 (2018)
Yandi Li* and Xiping Xu*
Author Affiliations
  • College of Photoelectrical Engineering, Changchun University of Science and Technology, Changchun 130022, China
  • show less
    DOI: 10.3788/AOS201838.0810001 Cite this Article Set citation alerts
    Yandi Li, Xiping Xu. Human Action Recognition by Decision-Making Level Fusion Based on Spatial-Temporal Features[J]. Acta Optica Sinica, 2018, 38(8): 0810001 Copy Citation Text show less
    Target contour extraction process. (a) Original video; (b) motion detection; (c) target contour extraction
    Fig. 1. Target contour extraction process. (a) Original video; (b) motion detection; (c) target contour extraction
    (a) Template image and (b) test image of POSER 3D simulation samples
    Fig. 2. (a) Template image and (b) test image of POSER 3D simulation samples
    Sampling results
    Fig. 3. Sampling results
    Distribution of contour points in polar coordinates
    Fig. 4. Distribution of contour points in polar coordinates
    Shape histogram of different contour sampling points. (a) Sampling point 1; (b) sampling point 2; (c) sampling point 3
    Fig. 5. Shape histogram of different contour sampling points. (a) Sampling point 1; (b) sampling point 2; (c) sampling point 3
    Matching results
    Fig. 6. Matching results
    Matching results of different sampling points
    Fig. 7. Matching results of different sampling points
    Video sequences
    Fig. 8. Video sequences
    Matching process of feature sequence in spatial domain
    Fig. 9. Matching process of feature sequence in spatial domain
    Match process of feature sequence in time domain
    Fig. 10. Match process of feature sequence in time domain
    Schematic diagram of DTW algorithm
    Fig. 11. Schematic diagram of DTW algorithm
    Schematic of typical local path constraint
    Fig. 12. Schematic of typical local path constraint
    Diagram of "Morbid" twisting path
    Fig. 13. Diagram of "Morbid" twisting path
    Schematic of parameter of elliptic band
    Fig. 14. Schematic of parameter of elliptic band
    Recognition rate of different weight distributions
    Fig. 15. Recognition rate of different weight distributions
    Influence of decision threshold values on recognition rate
    Fig. 16. Influence of decision threshold values on recognition rate
    Schematic of three global constraint boundaries with the same warping window size
    Fig. 17. Schematic of three global constraint boundaries with the same warping window size
    Comparison of searching efficiency of different bands
    Fig. 18. Comparison of searching efficiency of different bands
    Searching efficiency of different global boundaries on large time series
    Fig. 19. Searching efficiency of different global boundaries on large time series
    Classification accuracies of different boundaries sharps and sizes
    Fig. 20. Classification accuracies of different boundaries sharps and sizes
    Classification accuracies of all warping window sizes with different frames
    Fig. 21. Classification accuracies of all warping window sizes with different frames
    Confusion matrix of classification results on KTH dataset. (a) Shape feature; (b) motion feature; (c) fusion feature
    Fig. 22. Confusion matrix of classification results on KTH dataset. (a) Shape feature; (b) motion feature; (c) fusion feature
    Confusion matrix of classification results on Weizmann dataset. (a) Shape feature; (b) motion feature; (c) fusion feature
    Fig. 23. Confusion matrix of classification results on Weizmann dataset. (a) Shape feature; (b) motion feature; (c) fusion feature
    AlgorithmAverageaccuracy /%Computationtime /ms
    Method in Ref.[7]89.7023.9
    Method in Ref.[10]85.6730.3
    Proposed method (motion)84.8918.9
    Proposed method (shape)81.1119.3
    Proposed method (fusion)92.7021.7
    Table 1. Comparison ofaccuracy and computation time of different algorithms on KTH dataset
    AlgorithmAverageaccuracy /%Computationtime /ms
    Method in Ref.[11]89.2635.7
    Method in Ref.[8]90.0026.8
    Proposed method (motion)83.6917.9
    Proposed method (shape)82.8019.4
    Proposed method (fusion)93.2020.9
    Table 2. Comparison of average accuracy and computation time of different algorithms on Weizmann dataset
    AlgorithmAverage accuracy /%
    Method in Ref.[9]78.67
    Method in Ref.[19]81.60
    Proposed method (motion)77.90
    Proposed method (shape)76.50
    Proposed method (fusion)81.20
    Table 3. Comparison of average accuracy of different algorithms on UCF101 dataset
    Yandi Li, Xiping Xu. Human Action Recognition by Decision-Making Level Fusion Based on Spatial-Temporal Features[J]. Acta Optica Sinica, 2018, 38(8): 0810001
    Download Citation