{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T23:46:52Z","timestamp":1780444012416,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":64,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62106177"],"award-info":[{"award-number":["62106177"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ministry of Education of China","award":["8091B032156"],"award-info":[{"award-number":["8091B032156"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547773","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:01Z","timestamp":1665416581000},"page":"5102-5113","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":34,"title":["Uncertainty-Aware 3D Human Pose Estimation from Monocular Video"],"prefix":"10.1145","author":[{"given":"Jinlu","family":"Zhang","sequence":"first","affiliation":[{"name":"Wuhan University, Wuhan, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yujin","family":"Chen","sequence":"additional","affiliation":[{"name":"Technical University of Munich, Munich, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhigang","family":"Tu","sequence":"additional","affiliation":[{"name":"Wuhan University, Wuhan, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3386569.3392462"},{"key":"e_1_3_2_2_2_1","first-page":"14927","article-title":"Deep evidential regression","volume":"33","author":"Amini Alexander","year":"2020","unstructured":"Alexander Amini , Wilko Schwarting , Ava Soleimany , and Daniela Rus . 2020 . Deep evidential regression . Advances in Neural Information Processing Systems , Vol. 33 (2020), 14927 -- 14937 . Alexander Amini, Wilko Schwarting, Ava Soleimany, and Daniela Rus. 2020. Deep evidential regression. Advances in Neural Information Processing Systems , Vol. 33 (2020), 14927--14937.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01310"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00236"},{"key":"e_1_3_2_2_5_1","volume-title":"Anatomy-aware 3D Human Pose Estimation with Bone-based Pose Decomposition","author":"Chen Tianlang","year":"2021","unstructured":"Tianlang Chen , Chen Fang , Xiaohui Shen , Yiheng Zhu , Zhili Chen , and Jiebo Luo . 2021a. Anatomy-aware 3D Human Pose Estimation with Bone-based Pose Decomposition . IEEE Transactions on Circuits and Systems for Video Technology ( 2021 ). Tianlang Chen, Chen Fang, Xiaohui Shen, Yiheng Zhu, Zhili Chen, and Jiebo Luo. 2021a. Anatomy-aware 3D Human Pose Estimation with Bone-based Pose Decomposition. IEEE Transactions on Circuits and Systems for Video Technology (2021)."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00706"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01031"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2021.3068645"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00742"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i07.6689"},{"key":"e_1_3_2_2_11_1","volume-title":"Generating long sequences with sparse transformers. arXiv preprint arXiv:1904.10509","author":"Child Rewon","year":"2019","unstructured":"Rewon Child , Scott Gray , Alec Radford , and Ilya Sutskever . 2019. Generating long sequences with sparse transformers. arXiv preprint arXiv:1904.10509 ( 2019 ). Rewon Child, Scott Gray, Alec Radford, and Ilya Sutskever. 2019. Generating long sequences with sparse transformers. arXiv preprint arXiv:1904.10509 (2019)."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00235"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.256"},{"key":"e_1_3_2_2_14_1","volume-title":"Concrete dropout. Advances in neural information processing systems","author":"Gal Yarin","year":"2017","unstructured":"Yarin Gal , Jiri Hron , and Alex Kendall . 2017. Concrete dropout. Advances in neural information processing systems , Vol. 30 ( 2017 ). Yarin Gal, Jiri Hron, and Alex Kendall. 2017. Concrete dropout. Advances in neural information processing systems , Vol. 30 (2017)."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00847"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.322"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475219"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.248"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.248"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00781"},{"key":"e_1_3_2_2_21_1","volume-title":"Variational dropout and the local reparameterization trick. Advances in neural information processing systems","author":"Kingma Durk P","year":"2015","unstructured":"Durk P Kingma , Tim Salimans , and Max Welling . 2015. Variational dropout and the local reparameterization trick. Advances in neural information processing systems , Vol. 28 ( 2015 ). Durk P Kingma, Tim Salimans, and Max Welling. 2015. Variational dropout and the local reparameterization trick. Advances in neural information processing systems , Vol. 28 (2015)."},{"key":"e_1_3_2_2_22_1","volume-title":"Kipf and Max Welling","author":"Thomas","year":"2017","unstructured":"Thomas N. Kipf and Max Welling . 2017 . Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Conference Track Proceedings. OpenReview .net. https:\/\/openreview.net\/forum?id=SJU4ayYgl Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Conference Track Proceedings. OpenReview.net. https:\/\/openreview.net\/forum?id=SJU4ayYgl"},{"key":"e_1_3_2_2_23_1","volume-title":"Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems","author":"Lakshminarayanan Balaji","year":"2017","unstructured":"Balaji Lakshminarayanan , Alexander Pritzel , and Charles Blundell . 2017. Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems , Vol. 30 ( 2017 ). Balaji Lakshminarayanan, Alexander Pritzel, and Charles Blundell. 2017. Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems , Vol. 30 (2017)."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01084"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00621"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2022.3141231"},{"key":"e_1_3_2_2_27_1","volume-title":"Mhformer: Multi-hypothesis transformer for 3d human pose estimation. arXiv preprint arXiv:2111.12707","author":"Li Wenhao","year":"2021","unstructured":"Wenhao Li , Hong Liu , Hao Tang , Pichao Wang , and Luc Van Gool . 2021 b. Mhformer: Multi-hypothesis transformer for 3d human pose estimation. arXiv preprint arXiv:2111.12707 (2021). Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, and Luc Van Gool. 2021b. Mhformer: Multi-hypothesis transformer for 3d human pose estimation. arXiv preprint arXiv:2111.12707 (2021)."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01112"},{"key":"e_1_3_2_2_29_1","volume-title":"Trajectory space factorization for deep video-based 3d human pose estimation. arXiv preprint arXiv:1908.08289","author":"Lin Jiahao","year":"2019","unstructured":"Jiahao Lin and Gim Hee Lee . 2019. Trajectory space factorization for deep video-based 3d human pose estimation. arXiv preprint arXiv:1908.08289 ( 2019 ). Jiahao Lin and Gim Hee Lee. 2019. Trajectory space factorization for deep video-based 3d human pose estimation. arXiv preprint arXiv:1908.08289 (2019)."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58607-2_19"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00511"},{"key":"e_1_3_2_2_32_1","unstructured":"Ilya Loshchilov and Frank Hutter. 2018. Fixing weight decay regularization in adam. (2018).  Ilya Loshchilov and Frank Hutter. 2018. Fixing weight decay regularization in adam. (2018)."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00617"},{"key":"e_1_3_2_2_34_1","volume-title":"Predictive uncertainty estimation via prior networks. Advances in neural information processing systems","author":"Malinin Andrey","year":"2018","unstructured":"Andrey Malinin and Mark Gales . 2018. Predictive uncertainty estimation via prior networks. Advances in neural information processing systems , Vol. 31 ( 2018 ). Andrey Malinin and Mark Gales. 2018. Predictive uncertainty estimation via prior networks. Advances in neural information processing systems , Vol. 31 (2018)."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.288"},{"key":"e_1_3_2_2_36_1","volume-title":"2017 Fifth International Conference on. IEEE. https:\/\/doi.org\/10","author":"Mehta Dushyant","year":"2017","unstructured":"Dushyant Mehta , Helge Rhodin , Dan Casas , Pascal Fua , Oleksandr Sotnychenko , Weipeng Xu , and Christian Theobalt . 2017 a. Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 3D Vision (3DV) , 2017 Fifth International Conference on. IEEE. https:\/\/doi.org\/10 .1109\/3dv.2017.00064 Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, and Christian Theobalt. 2017a. Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 3D Vision (3DV), 2017 Fifth International Conference on. IEEE. https:\/\/doi.org\/10.1109\/3dv.2017.00064"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073596"},{"key":"e_1_3_2_2_38_1","volume-title":"International Conference on Machine Learning. PMLR, 2498--2507","author":"Molchanov Dmitry","year":"2017","unstructured":"Dmitry Molchanov , Arsenii Ashukha , and Dmitry Vetrov . 2017 . Variational dropout sparsifies deep neural networks . In International Conference on Machine Learning. PMLR, 2498--2507 . Dmitry Molchanov, Arsenii Ashukha, and Dmitry Vetrov. 2017. Variational dropout sparsifies deep neural networks. In International Conference on Machine Learning. PMLR, 2498--2507."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"e_1_3_2_2_40_1","volume-title":"Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , 2019 . Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems , Vol. 32 (2019). Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems , Vol. 32 (2019)."},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00763"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.139"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00794"},{"key":"e_1_3_2_2_44_1","volume-title":"Advances in Neural Information Processing Systems","volume":"31","author":"Sensoy Murat","year":"2018","unstructured":"Murat Sensoy , Lance Kaplan , and Melih Kandemir . 2018 . Evidential deep learning to quantify classification uncertainty . Advances in Neural Information Processing Systems , Vol. 31 (2018). Murat Sensoy, Lance Kaplan, and Melih Kandemir. 2018. Evidential deep learning to quantify classification uncertainty. Advances in Neural Information Processing Systems , Vol. 31 (2018)."},{"key":"e_1_3_2_2_45_1","volume-title":"Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International journal of computer vision","author":"Sigal Leonid","year":"2010","unstructured":"Leonid Sigal , Alexandru O Balan , and Michael J Black . 2010 . Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International journal of computer vision , Vol. 87 , 1--2 (2010), 4. Leonid Sigal, Alexandru O Balan, and Michael J Black. 2010. Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International journal of computer vision , Vol. 87, 1--2 (2010), 4."},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00584"},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_33"},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.113"},{"key":"e_1_3_2_2_49_1","volume-title":"Consistent 3D Hand Reconstruction in Video via self-supervised Learning. arXiv preprint arXiv:2201.09548","author":"Tu Zhigang","year":"2022","unstructured":"Zhigang Tu , Zhisheng Huang , Yujin Chen , Di Kang , Linchao Bao , Bisheng Yang , and Junsong Yuan . 2022. Consistent 3D Hand Reconstruction in Video via self-supervised Learning. arXiv preprint arXiv:2201.09548 ( 2022 ). Zhigang Tu, Zhisheng Huang, Yujin Chen, Di Kang, Linchao Bao, Bisheng Yang, and Junsong Yuan. 2022. Consistent 3D Hand Reconstruction in Video via self-supervised Learning. arXiv preprint arXiv:2201.09548 (2022)."},{"key":"e_1_3_2_2_50_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008."},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58601-0_45"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01101"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_29"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01584"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12328"},{"key":"e_1_3_2_2_56_1","first-page":"8163","article-title":"Chirality nets for human pose regression","volume":"32","author":"Yeh Raymond","year":"2019","unstructured":"Raymond Yeh , Yuan-Ting Hu , and Alexander Schwing . 2019 . Chirality nets for human pose regression . Advances in Neural Information Processing Systems , Vol. 32 (2019), 8163 -- 8173 . Raymond Yeh, Yuan-Ting Hu, and Alexander Schwing. 2019. Chirality nets for human pose regression. Advances in Neural Information Processing Systems , Vol. 32 (2019), 8163--8173.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01479"},{"key":"e_1_3_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58568-6_30"},{"key":"e_1_3_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01124"},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01288"},{"key":"e_1_3_2_2_61_1","volume-title":"A spatial attentive and temporal dilated (SATD) GCN for skeleton-based action recognition. CAAI Transactions on Intelligence Technology","author":"Zhang Jiaxu","year":"2020","unstructured":"Jiaxu Zhang , Gaoxiang Ye , Zhigang Tu , Yongtao Qin , Jinlu Zhang , Xiangjian Liu , and Shixu Luo . 2020. A spatial attentive and temporal dilated (SATD) GCN for skeleton-based action recognition. CAAI Transactions on Intelligence Technology ( 2020 ). Jiaxu Zhang, Gaoxiang Ye, Zhigang Tu, Yongtao Qin, Jinlu Zhang, Xiangjian Liu, and Shixu Luo. 2020. A spatial attentive and temporal dilated (SATD) GCN for skeleton-based action recognition. CAAI Transactions on Intelligence Technology (2020)."},{"key":"e_1_3_2_2_62_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Zhao Long","unstructured":"Long Zhao , Xi Peng , Yu Tian , Mubbasir Kapadia , and Dimitris N. Metaxas . 2019. Semantic Graph Convolutional Networks for 3D Human Pose Regression . In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, and Dimitris N. Metaxas. 2019. Semantic Graph Convolutional Networks for 3D Human Pose Regression. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01145"},{"key":"e_1_3_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.51"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547773","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547773","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:41Z","timestamp":1750188641000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547773"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":64,"alternative-id":["10.1145\/3503161.3547773","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547773","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}