{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:49:15Z","timestamp":1777657755071,"version":"3.51.4"},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2019,7,12]],"date-time":"2019-07-12T00:00:00Z","timestamp":1562889600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100011199","name":"European Research Council","doi-asserted-by":"publisher","award":["StG-2013-335373"],"award-info":[{"award-number":["StG-2013-335373"]}],"id":[{"id":"10.13039\/100011199","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100006785","name":"Google","doi-asserted-by":"crossref","award":["Faculty Award"],"award-info":[{"award-number":["Faculty Award"]}],"id":[{"id":"10.13039\/100006785","id-type":"DOI","asserted-by":"crossref"}]},{"name":"ERC","award":["SemanticCity 825706"],"award-info":[{"award-number":["SemanticCity 825706"]}]},{"DOI":"10.13039\/501100000288","name":"Royal Society","doi-asserted-by":"crossref","award":["Advanced Newton Fellowship"],"award-info":[{"award-number":["Advanced Newton Fellowship"]}],"id":[{"id":"10.13039\/501100000288","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100004344","name":"Adobe Systems","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100004344","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2019,8,31]]},"abstract":"<jats:p>\n            Next generation smart and augmented reality systems demand a computational understanding of monocular footage that captures humans in physical spaces to reveal plausible object arrangements and human-object interactions. Despite recent advances, both in scene layout and human motion analysis, the above setting remains challenging to analyze due to regular occlusions that occur between objects and human motions. We observe that the\n            <jats:italic>interaction<\/jats:italic>\n            between object arrangements and human actions is often strongly correlated, and hence can be used to help recover from these occlusions. We present iMapper, a data-driven method to identify such human-object interactions and utilize them to infer layouts of occluded objects. Starting from a monocular video with detected 2D human joint positions that are potentially noisy and occluded, we first introduce the notion of\n            <jats:italic>interaction-saliency<\/jats:italic>\n            as space-time snapshots where informative human-object interactions happen. Then, we propose a global optimization to retrieve and fit interactions from a database to the detected salient interactions in order to best explain the input video. We extensively evaluate the approach, both quantitatively against manually annotated ground truth and through a user study, and demonstrate that iMapper produces plausible scene layouts for scenes with medium to heavy occlusion. Code and data are available on the project page.\n          <\/jats:p>","DOI":"10.1145\/3306346.3322961","type":"journal-article","created":{"date-parts":[[2019,7,12]],"date-time":"2019-07-12T19:04:08Z","timestamp":1562958248000},"page":"1-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":47,"title":["iMapper"],"prefix":"10.1145","volume":"38","author":[{"given":"Aron","family":"Monszpart","sequence":"first","affiliation":[{"name":"University College London"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paul","family":"Guerrero","sequence":"additional","affiliation":[{"name":"University College London"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Duygu","family":"Ceylan","sequence":"additional","affiliation":[{"name":"Adobe Research"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ersin","family":"Yumer","sequence":"additional","affiliation":[{"name":"Uber ATG"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Niloy J.","family":"Mitra","sequence":"additional","affiliation":[{"name":"University College London and Adobe Research"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,7,12]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Richard H Byrd Peihuang Lu Jorge Nocedal and Ciyou Zhu. 1995. A limited memory algorithm for bound constrained optimization. In SISC.  Richard H Byrd Peihuang Lu Jorge Nocedal and Ciyou Zhu. 1995. A limited memory algorithm for bound constrained optimization. In SISC."},{"key":"e_1_2_1_2_1","volume-title":"Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields","author":"Cao Zhe","unstructured":"Zhe Cao , Tomas Simon , Shih-En Wei , and Yaser Sheikh . 2017. Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields . In IEEE CVPR. Zhe Cao, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2017. Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. In IEEE CVPR."},{"key":"e_1_2_1_3_1","unstructured":"Ayan Chakrabarti Jingyu Shao and Greg Shakhnarovich. 2016. Depth from a Single Image by Harmonizing Overcomplete Local Network Predictions. In NIPS.   Ayan Chakrabarti Jingyu Shao and Greg Shakhnarovich. 2016. Depth from a Single Image by Harmonizing Overcomplete Local Network Predictions. In NIPS."},{"key":"e_1_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Angel Chang Angela Dai Thomas Funkhouser Maciej Halber Matthias Niessner Manolis Savva Shuran Song Andy Zeng and Yinda Zhang. 2017. Matterport3D: Learning from RGB-D Data in Indoor Environments. In 3DV.  Angel Chang Angela Dai Thomas Funkhouser Maciej Halber Matthias Niessner Manolis Savva Shuran Song Andy Zeng and Yinda Zhang. 2017. Matterport3D: Learning from RGB-D Data in Indoor Environments. In 3DV.","DOI":"10.1109\/3DV.2017.00081"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661239"},{"key":"e_1_2_1_6_1","volume-title":"ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes","author":"Dai Angela","unstructured":"Angela Dai , Angel X. Chang , Manolis Savva , Maciej Halber , Thomas Funkhouser , and Matthias Nie\u00dfner . 2017a. ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes . In IEEE CVPR. Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, and Matthias Nie\u00dfner. 2017a. ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes. In IEEE CVPR."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3054739"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.27"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33783-3_21"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366154"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964921.1964929"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818057"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33715-4_53"},{"key":"e_1_2_1_14_1","volume-title":"3D-reconstruction of indoor environments from human activity","author":"Frank Barbara","unstructured":"Barbara Frank , Michael Ruhnke , Maxim Tatarchenko , and Wolfram Burgard . 2015. 3D-reconstruction of indoor environments from human activity . In IEEE ICRA. Barbara Frank, Michael Ruhnke, Maxim Tatarchenko, and Wolfram Burgard. 2015. 3D-reconstruction of indoor environments from human activity. In IEEE ICRA."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.229"},{"key":"e_1_2_1_16_1","volume-title":"Pose-Inspired Shape Synthesis and Functional Hybrid","author":"Fu Qiang","unstructured":"Qiang Fu , Xiaowu Chen , Xiaoyu Su , and Hongbo Fu. 2017a. Pose-Inspired Shape Synthesis and Functional Hybrid . In IEEE TVCG. Qiang Fu, Xiaowu Chen, Xiaoyu Su, and Hongbo Fu. 2017a. Pose-Inspired Shape Synthesis and Functional Hybrid. In IEEE TVCG."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130800.3130805"},{"key":"e_1_2_1_18_1","volume-title":"Detecting and Recognizing Human-Object Interactions","author":"Gkioxari Georgia","unstructured":"Georgia Gkioxari , Ross Girshick , Piotr Doll\u00e1r , and Kaiming He. 2018. Detecting and Recognizing Human-Object Interactions . In IEEE CVPR. Georgia Gkioxari, Ross Girshick, Piotr Doll\u00e1r, and Kaiming He. 2018. Detecting and Recognizing Human-Object Interactions. In IEEE CVPR."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.83"},{"key":"e_1_2_1_20_1","volume-title":"Mask R-CNN","author":"He Kaiming","unstructured":"Kaiming He , Georgia Gkioxari , Piotr Doll\u00e1r , and Ross Girshick . 2017. Mask R-CNN . In IEEE ICCV. Kaiming He, Georgia Gkioxari, Piotr Doll\u00e1r, and Ross Girshick. 2017. Mask R-CNN. In IEEE ICCV."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925870"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766914"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.440"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12307-8_5"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.gmod.2016.03.004"},{"key":"e_1_2_1_26_1","volume":"201","author":"Hueting Moos","unstructured":"Moos Hueting , Pradyumna Reddy , Ersin Yumer , Vladimir G. Kim , Nathan Carr , and Niloy J. Mitra. 201 8. SeeThrough: Finding Objects in Heavily Occluded Indoor Scene Images. In 3DV. Moos Hueting, Pradyumna Reddy, Ersin Yumer, Vladimir G. Kim, Nathan Carr, and Niloy J. Mitra. 2018. SeeThrough: Finding Objects in Heavily Occluded Indoor Scene Images. In 3DV.","journal-title":"Niloy J. Mitra."},{"key":"e_1_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Eldar Insafutdinov Leonid Pishchulin Bjoern Andres Mykhaylo Andriluka and Bernt Schiele. 2016. DeeperCut: A Deeper Stronger and Faster Multi-Person Pose Estimation Model. In ECCV.  Eldar Insafutdinov Leonid Pishchulin Bjoern Andres Mykhaylo Andriluka and Bernt Schiele. 2016. DeeperCut: A Deeper Stronger and Faster Multi-Person Pose Estimation Model. In ECCV.","DOI":"10.1007\/978-3-319-46466-4_3"},{"key":"e_1_2_1_28_1","doi-asserted-by":"crossref","unstructured":"Hamid Izadinia Qi Shan and Steven M Seitz. 2017. IM2CAD. In CVPR.  Hamid Izadinia Qi Shan and Steven M Seitz. 2017. IM2CAD. In CVPR.","DOI":"10.1109\/CVPR.2017.260"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2501811"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.gmod.2017.10.002"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2601097.2601117"},{"key":"e_1_2_1_32_1","volume-title":"Environmental Design and Human Behavior","author":"Krasner Leonard","unstructured":"Leonard Krasner . 2013. Environmental Design and Human Behavior . Elsevier . Leonard Krasner. 2013. Environmental Design and Human Behavior. Elsevier."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661243"},{"key":"e_1_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Diogo C. Luvizon David Picard and Hedi Tabia. 2018. 2D\/3D Pose Estimation and Action Recognition Using Multitask Deep Learning. In IEEE CVPR.  Diogo C. Luvizon David Picard and Hedi Tabia. 2018. 2D\/3D Pose Estimation and Action Recognition Using Multitask Deep Learning. In IEEE CVPR.","DOI":"10.1109\/CVPR.2018.00539"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2980223"},{"key":"e_1_2_1_36_1","volume-title":"Seitz","author":"Newcombe Richard A.","year":"2015","unstructured":"Richard A. Newcombe , Dieter Fox , and Steven M . Seitz . 2015 . DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time. In IEEE CVPR. Richard A. Newcombe, Dieter Fox, and Steven M. Seitz. 2015. DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time. In IEEE CVPR."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISMAR.2011.6092378"},{"key":"e_1_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Dushyant Mehta Helge Rhodin Dan Casas Pascal Fua Oleksandr Sotnychenko Weipeng Xu and Christian Theobalt. 2017a. Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 3DV.  Dushyant Mehta Helge Rhodin Dan Casas Pascal Fua Oleksandr Sotnychenko Weipeng Xu and Christian Theobalt. 2017a. Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision. In 3DV.","DOI":"10.1109\/3DV.2017.00064"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073596"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366156"},{"key":"e_1_2_1_41_1","volume-title":"Environmental Design and Human Behavior","author":"Neisser Ulric","unstructured":"Ulric Neisser . 1976. Environmental Design and Human Behavior . W. H. Freeman . Ulric Neisser. 1976. Environmental Design and Human Behavior. W. H. Freeman."},{"key":"e_1_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. In ECCV.  Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. In ECCV.","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"e_1_2_1_43_1","volume":"2017","author":"Pirk S\u00f6ren","unstructured":"S\u00f6ren Pirk , Olga Diamanti , Boris Thibert , Danfei Xu , and Leonidas J. Guibas. 2017 a. Shape-Aware Spatio-Temporal Descriptors for Interaction Classification. In IEEE ICIP. S\u00f6ren Pirk, Olga Diamanti, Boris Thibert, Danfei Xu, and Leonidas J. Guibas. 2017a. Shape-Aware Spatio-Temporal Descriptors for Interaction Classification. In IEEE ICIP.","journal-title":"Leonidas J. Guibas."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3083725"},{"key":"e_1_2_1_45_1","volume-title":"Berg","author":"Poirson Patrick","year":"2016","unstructured":"Patrick Poirson , Phil Ammirato , Cheng-Yang Fu , Wei Liu , Jana Koseck\u00e1 , and Alexander C . Berg . 2016 . Fast Single Shot Detection and Pose Estimation. In 3DV. Patrick Poirson, Phil Ammirato, Cheng-Yang Fu, Wei Liu, Jana Koseck\u00e1, and Alexander C. Berg. 2016. Fast Single Shot Detection and Pose Estimation. In 3DV."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"e_1_2_1_47_1","volume-title":"Multi-person 2D and 3D Pose Detection in Natural Images","author":"Rogez Gr\u00e9gory","unstructured":"Gr\u00e9gory Rogez , Philippe Weinzaepfel , and Cordelia Schmid . 2019. LCR-Net++ : Multi-person 2D and 3D Pose Detection in Natural Images . In IEEE PAMI. Gr\u00e9gory Rogez, Philippe Weinzaepfel, and Cordelia Schmid. 2019. LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images. In IEEE PAMI."},{"key":"e_1_2_1_48_1","volume-title":"3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding","author":"Satkin Scott","unstructured":"Scott Satkin and Martial Hebert . 2013. 3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding . In IEEE CVPR. Scott Satkin and Martial Hebert. 2013. 3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding. In IEEE CVPR."},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661230"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2992138.2992147"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.51"},{"key":"e_1_2_1_52_1","volume-title":"A Large Scale Dataset for 3D Human Activity Analysis","author":"Shahroudy Amir","unstructured":"Amir Shahroudy , Jun Liu , Tian-Tsong Ng , and Gang Wang . 2016. NTU RGB+D : A Large Scale Dataset for 3D Human Activity Analysis . In IEEE CVPR. Amir Shahroudy, Jun Liu, Tian-Tsong Ng, and Gang Wang. 2016. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. In IEEE CVPR."},{"key":"e_1_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Tianjia Shao Aron Monszpart Youyi Zheng Bongjin Koo Weiwei Xu Kun Zhou and Niloy Mitra. 2014. Imagining the Unseen: Stability-based Cuboid Arrangements for Scene Understanding. In ACM SIGGRAPH Asia. Joint first authors.  Tianjia Shao Aron Monszpart Youyi Zheng Bongjin Koo Weiwei Xu Kun Zhou and Niloy Mitra. 2014. Imagining the Unseen: Stability-based Cuboid Arrangements for Scene Understanding. In ACM SIGGRAPH Asia. Joint first authors.","DOI":"10.1145\/2661229.2661288"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366155"},{"key":"e_1_2_1_55_1","volume-title":"Direct Prediction of 3D Body Poses from Motion Compensated Sequences","author":"Tekin Bugra","unstructured":"Bugra Tekin , Artem Rozantsev , Vincent Lepetit , and Pascal Fua . 2016. Direct Prediction of 3D Body Poses from Motion Compensated Sequences . In IEEE CVPR. Bugra Tekin, Artem Rozantsev, Vincent Lepetit, and Pascal Fua. 2016. Direct Prediction of 3D Body Poses from Motion Compensated Sequences. In IEEE CVPR."},{"key":"e_1_2_1_56_1","volume-title":"Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image","author":"Tom\u00e8 Denis","unstructured":"Denis Tom\u00e8 , Chris Russell , and Lourdes Agapito . 2017. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image . In IEEE CVPR. Denis Tom\u00e8, Chris Russell, and Lourdes Agapito. 2017. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image. In IEEE CVPR."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.214"},{"key":"e_1_2_1_58_1","doi-asserted-by":"crossref","unstructured":"Shubham Tulsiani Saurabh Gupta David Fouhey Alexei A. Efros and Jitendra Malik. 2018. Factoring Shape Pose and Layout from the 2D Image of a 3D Scene. In IEEE CVPR.  Shubham Tulsiani Saurabh Gupta David Fouhey Alexei A. Efros and Jitendra Malik. 2018. Factoring Shape Pose and Layout from the 2D Image of a 3D Scene. In IEEE CVPR.","DOI":"10.1109\/CVPR.2018.00039"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13131"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201362"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.406"},{"key":"e_1_2_1_62_1","volume-title":"Convolutional pose machines","author":"Wei Shih-En","unstructured":"Shih-En Wei , Varun Ramakrishna , Takeo Kanade , and Yaser Sheikh . 2016. Convolutional pose machines . In IEEE CVPR. Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2016. Convolutional pose machines. In IEEE CVPR."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778779"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366207"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2601097.2601109"},{"key":"e_1_2_1_66_1","doi-asserted-by":"crossref","unstructured":"Bangpeng Yao Aditya Khosla and Li Fei-Fei. 2011. Classifying Actions and Measuring Action Similarity by Modeling the Mutual Context of Objects and Human Poses. In ICML.  Bangpeng Yao Aditya Khosla and Li Fei-Fei. 2011. Classifying Actions and Measuring Action Similarity by Modeling the Mutual Context of Objects and Human Poses. In ICML.","DOI":"10.1109\/TPAMI.2012.67"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185552"},{"key":"e_1_2_1_68_1","doi-asserted-by":"crossref","unstructured":"Hong-Bo Zhang Qing Lei Bi-Neng Zhong Ji-Xiang Du and JiaLin Peng. 2016. A Survey on Human Pose Estimation. In Intelligent Automation and Soft Computing.  Hong-Bo Zhang Qing Lei Bi-Neng Zhong Ji-Xiang Du and JiaLin Peng. 2016. A Survey on Human Pose Estimation. In Intelligent Automation and Soft Computing.","DOI":"10.1080\/10798587.2015.1095419"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2982410"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/2574860"},{"key":"e_1_2_1_71_1","volume-title":"Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video","author":"Zhou Xiaowei","unstructured":"Xiaowei Zhou , Menglong Zhu , Spyridon Leonardos , Kosta Derpanis , and Kostas Daniilidis . 2016. Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video . In IEEE CVPR. Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Kosta Derpanis, and Kostas Daniilidis. 2016. Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video. In IEEE CVPR."}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3306346.3322961","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3306346.3322961","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:25:44Z","timestamp":1750206344000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3306346.3322961"}},"subtitle":["interaction-guided scene mapping from monocular videos"],"short-title":[],"issued":{"date-parts":[[2019,7,12]]},"references-count":71,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2019,8,31]]}},"alternative-id":["10.1145\/3306346.3322961"],"URL":"https:\/\/doi.org\/10.1145\/3306346.3322961","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,7,12]]},"assertion":[{"value":"2019-07-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}