• Laser & Optoelectronics Progress
  • Vol. 57, Issue 2, 21006 (2020)
Yan Fenting, Wang Peng*, Lü Zhigang, Ding Zhe, and Qiao Mengyu
Author Affiliations
  • School of Electronics and Information Engineering, Xi''an Technological University, Xi''an, Shaanxi 710021, China
  • show less
    DOI: 10.3788/LOP57.021006 Cite this Article Set citation alerts
    Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 21006 Copy Citation Text show less
    Structural diagram of symmetric space transformation network
    Fig. 1. Structural diagram of symmetric space transformation network
    Real-time multi-person pose estimation model
    Fig. 2. Real-time multi-person pose estimation model
    Pose estimation network model
    Fig. 3. Pose estimation network model
    [in Chinese]
    Fig. 4. [in Chinese]
    Comparison of results in different scenarios for each model. (a)-(d) Scale change; (e)-(h) dense population; (i)-(l) occlusion; (m)-(p) complex pose
    Fig. 4. Comparison of results in different scenarios for each model. (a)-(d) Scale change; (e)-(h) dense population; (i)-(l) occlusion; (m)-(p) complex pose
    ModelFrameworkProgramminglanguage
    CMU-Pose[10]CaffePython3.6.2
    MaskR-CNN[11]TensorFlow1.3.0+Keras2.2.6Python3.6.2
    RMPE[14]Pytorch0.4.0Python3.6.2
    Proposed modelPytorch0.4.0Python3.6.2
    Table 1. Algorithm model and environment configuration
    ModelAPAP@0.5AP@0.75APmAPl
    CMU-Pose61.884.967.557.168.2
    Mask R-CNN63.187.368.757.871.4
    RMPE72.389.279.168.078.6
    Proposed model74.192.580.570.679.5
    Table 2. Comparison of performance of each pose estimation model
    ModelData setRunning speed /(frame·s-1)Parametersize /MBCalculatedamount /109
    YOLOv3[15]MS COCO5123765.86
    Proposed modelMS COCO6419544.32
    Table 3. Comparison of parameters of each human detection algorithm
    InputAPAP@0.5AP@0.75APmAPlARAR@0.5AR@0.75ARmARl
    256 pixel×192 pixel71.291.478.368.575.274.392.280.971.378.9
    384 pixel×288 pixel74.192.580.570.679.576.893.282.573.082.6
    Table 4. AP-AR values of model under different inputs
    Yan Fenting, Wang Peng, Lü Zhigang, Ding Zhe, Qiao Mengyu. Real-Time Multi-Person Video-Based Pose Estimation[J]. Laser & Optoelectronics Progress, 2020, 57(2): 21006
    Download Citation