Abstract
Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe self-
occlusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method
to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both
tasks.
occlusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method
to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both
tasks.
Original language | English |
---|---|
Title of host publication | Proceedings - IEEE International Conference on Computer Vision, ICCV 2019 |
Editors | In So Kweon, Nikos Paragios, Ming-Hsuan Yang, Svetlana Lazebnik |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 2272-2281 |
Number of pages | 10 |
ISBN (Electronic) | 9781728148038 |
ISBN (Print) | 9781728148045 |
DOIs | |
Publication status | Published - 2019 |
Event | IEEE International Conference on Computer Vision 2019 - Seoul, Korea, South Duration: 27 Oct 2019 → 2 Nov 2019 Conference number: 17th http://iccv2019.thecvf.com/ https://ieeexplore.ieee.org/xpl/conhome/8972782/proceeding (Proceedings) |
Publication series
Name | Proceedings of the IEEE International Conference on Computer Vision |
---|---|
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Volume | 2019-October |
ISSN (Print) | 1550-5499 |
ISSN (Electronic) | 2380-7504 |
Conference
Conference | IEEE International Conference on Computer Vision 2019 |
---|---|
Abbreviated title | ICCV 2019 |
Country/Territory | Korea, South |
City | Seoul |
Period | 27/10/19 → 2/11/19 |
Internet address |