• Optoelectronics Letters
  • Vol. 18, Issue 5, 313 (2022)
Chaonan LI, Sheng LIU*, Lu YAO, and Siyu ZOU
Author Affiliations
  • College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
  • show less
    DOI: 10.1007/s11801-022-2015-8 Cite this Article
    LI Chaonan, LIU Sheng, YAO Lu, ZOU Siyu. Video-based body geometric aware network for 3D human pose estimation[J]. Optoelectronics Letters, 2022, 18(5): 313 Copy Citation Text show less

    Abstract

    Three-dimensional human pose estimation (3D HPE) has broad application prospects in the fields of trajectory prediction, posture tracking and action analysis. However, the frequent self-occlusions and the substantial depth ambiguity in two-dimensional (2D) representations hinder the further improvement of accuracy. In this paper, we propose a novel video-based human body geometric aware network to mitigate the above problems. Our network can implicitly be aware of the geometric constraints of the human body by capturing spatial and temporal context information from 2D skeleton data. Specifically, a novel skeleton attention (SA) mechanism is proposed to model geometric context dependencies among different body joints, thereby improving the spatial feature representation ability of the network. To enhance the temporal consistency, a novel multilayer perceptron (MLP)-Mixer based structure is exploited to comprehensively learn temporal context information from input sequences. We conduct experiments on publicly available challenging datasets to evaluate the proposed approach. The results outperform the previous best approach by 0.5 mm in the Human3.6m dataset. It also demonstrates significant improvements in HumanEva-I dataset.
    LI Chaonan, LIU Sheng, YAO Lu, ZOU Siyu. Video-based body geometric aware network for 3D human pose estimation[J]. Optoelectronics Letters, 2022, 18(5): 313
    Download Citation