{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T04:40:51Z","timestamp":1769834451206,"version":"3.49.0"},"reference-count":32,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2018,7,30]],"date-time":"2018-07-30T00:00:00Z","timestamp":1532908800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,8,31]]},"abstract":"<jats:p>\n            To carry out autonomous 3D scanning and online reconstruction of unknown indoor scenes, one has to find a balance between global exploration of the entire scene and local scanning of the objects within it. In this work, we propose a novel approach, which provides object-aware guidance for autoscanning, for exploring, reconstructing, and understanding an unknown scene within\n            <jats:italic>one navigation pass.<\/jats:italic>\n            Our approach interleaves between object analysis to identify the\n            <jats:italic>next best object<\/jats:italic>\n            (NBO) for global exploration, and object-aware information gain analysis to plan the\n            <jats:italic>next best view<\/jats:italic>\n            (NBV) for local scanning. First, an objectness-based segmentation method is introduced to extract semantic objects from the current scene surface via a multi-class graph cuts minimization. Then, an object of interest (OOI) is identified as the NBO which the robot aims to visit and scan. The robot then conducts fine scanning on the OOI with views determined by the NBV strategy. When the OOI is recognized as a full object, it can be replaced by its most similar 3D model in a shape database. The algorithm iterates until all of the objects are recognized and reconstructed in the scene. Various experiments and comparisons have shown the feasibility of our proposed approach.\n          <\/jats:p>","DOI":"10.1145\/3197517.3201295","type":"journal-article","created":{"date-parts":[[2018,7,31]],"date-time":"2018-07-31T15:56:23Z","timestamp":1533052583000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["Object-aware guidance for autonomous scene reconstruction"],"prefix":"10.1145","volume":"37","author":[{"given":"Ligang","family":"Liu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China"}]},{"given":"Xi","family":"Xia","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China"}]},{"given":"Han","family":"Sun","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China"}]},{"given":"Qi","family":"Shen","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China"}]},{"given":"Juzhan","family":"Xu","sequence":"additional","affiliation":[{"name":"Shenzhen University"}]},{"given":"Bin","family":"Chen","sequence":"additional","affiliation":[{"name":"Shenzhen University"}]},{"given":"Hui","family":"Huang","sequence":"additional","affiliation":[{"name":"Shenzhen University"}]},{"given":"Kai","family":"Xu","sequence":"additional","affiliation":[{"name":"Shenzhen University and National University of Defense Technology"}]}],"member":"320","published-online":{"date-parts":[[2018,7,30]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Bundle Adjustment in the Large. In European Conference on Computer Vision. 29--42","author":"Agarwal Sameer","year":"2010","unstructured":"Sameer Agarwal , Noah Snavely , Steven M. Seitz , and Richard Szeliski . 2010 . Bundle Adjustment in the Large. In European Conference on Computer Vision. 29--42 . Sameer Agarwal, Noah Snavely, Steven M. Seitz, and Richard Szeliski. 2010. Bundle Adjustment in the Large. In European Conference on Computer Vision. 29--42."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.28"},{"key":"e_1_2_2_3_1","volume-title":"Joint 2D-3D-Semantic Data for Indoor Scene Understanding. arXiv preprint arXiv:1702.01105","author":"Armeni Iro","year":"2017","unstructured":"Iro Armeni , Sasha Sax , Amir R Zamir , and Silvio Savarese . 2017. Joint 2D-3D-Semantic Data for Indoor Scene Understanding. arXiv preprint arXiv:1702.01105 ( 2017 ). Iro Armeni, Sasha Sax, Amir R Zamir, and Silvio Savarese. 2017. Joint 2D-3D-Semantic Data for Indoor Scene Understanding. arXiv preprint arXiv:1702.01105 (2017)."},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2015.XI.003"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1531326.1531379"},{"key":"e_1_2_2_6_1","volume-title":"Proc. CVPR. 5556--5565","author":"Choi Sungjoon","year":"2015","unstructured":"Sungjoon Choi , Qian-Yi Zhou , and Vladlen Koltun . 2015 . Robust reconstruction of indoor scenes . In Proc. CVPR. 5556--5565 . Sungjoon Choi, Qian-Yi Zhou, and Vladlen Koltun. 2015. Robust reconstruction of indoor scenes. In Proc. CVPR. 5556--5565."},{"key":"e_1_2_2_7_1","volume-title":"Scannet: Richly-annotated 3d reconstructions of indoor scenes. arXiv preprint arXiv:1702.04405","author":"Dai Angela","year":"2017","unstructured":"Angela Dai , Angel X Chang , Manolis Savva , Maciej Halber , Thomas Funkhouser , and Matthias Nie\u00dfner . 2017 . Scannet: Richly-annotated 3d reconstructions of indoor scenes. arXiv preprint arXiv:1702.04405 (2017). Angela Dai, Angel X Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, and Matthias Nie\u00dfner. 2017. Scannet: Richly-annotated 3d reconstructions of indoor scenes. arXiv preprint arXiv:1702.04405 (2017)."},{"key":"e_1_2_2_8_1","volume-title":"Proc. of the RGB-D Workshop on 3D Perception in Robotics at the European Robotics Forum","author":"Engelhard Nikolas","year":"2011","unstructured":"Nikolas Engelhard , Felix Endres , J\u00fcrgen Hess J\u00fcrgen Sturm , and Wolfram Burgard . 2011 . Real-time 3D visual SLAM with a hand-held RGB-D camera . In Proc. of the RGB-D Workshop on 3D Perception in Robotics at the European Robotics Forum , Vasteras, Sweden, Vol 180. 1--15. Nikolas Engelhard, Felix Endres, J\u00fcrgen Hess J\u00fcrgen Sturm, and Wolfram Burgard. 2011. Real-time 3D visual SLAM with a hand-held RGB-D camera. In Proc. of the RGB-D Workshop on 3D Perception in Robotics at the European Robotics Forum, Vasteras, Sweden, Vol 180. 1--15."},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2980225"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366154"},{"key":"e_1_2_2_11_1","unstructured":"Gazebo. 2013. The Gazebo Project http:\/\/wiki.ros.org\/gazebo. (2013).  Gazebo. 2013. The Gazebo Project http:\/\/wiki.ros.org\/gazebo. (2013)."},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-012-9321-0"},{"key":"e_1_2_2_13_1","volume-title":"IEEE International Conference on Robotics and Automation. 5031--5037","author":"Krainin M","year":"2012","unstructured":"M Krainin , B Curless , and D Fox . 2012 . Autonomous generation of complete 3D object models using next best view manipulation planning . In IEEE International Conference on Robotics and Automation. 5031--5037 . M Krainin, B Curless, and D Fox. 2012. Autonomous generation of complete 3D object models using next best view manipulation planning. In IEEE International Conference on Robotics and Automation. 5031--5037."},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6385624"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_2_2_16_1","volume-title":"Proceedings of 14th Pacific Conference on Computer Graphics and Applications (Pacific Graphics","author":"Low Kok-Lim","year":"2006","unstructured":"Kok-Lim Low and Anselmo Lastra . 2006 . An adaptive hierarchical next-best-view algorithm for 3d reconstruction of indoor scenes . In Proceedings of 14th Pacific Conference on Computer Graphics and Applications (Pacific Graphics 2006). Kok-Lim Low and Anselmo Lastra. 2006. An adaptive hierarchical next-best-view algorithm for 3d reconstruction of indoor scenes. In Proceedings of 14th Pacific Conference on Computer Graphics and Applications (Pacific Graphics 2006)."},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366156"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISMAR.2011.6092378"},{"key":"e_1_2_2_19_1","volume-title":"Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv preprint arXiv:1706.02413","author":"Qi Charles R","year":"2017","unstructured":"Charles R Qi , Li Yi , Hao Su , and Leonidas J Guibas . 2017. PointNet++ : Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv preprint arXiv:1706.02413 ( 2017 ). Charles R Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. arXiv preprint arXiv:1706.02413 (2017)."},{"key":"e_1_2_2_20_1","volume-title":"Motion Planning Strategies for Autonomously Mapping 3D Structures. arXiv preprint arXiv:1602.06667","author":"Ramanagopal Manikandasriram Srinivasan","year":"2016","unstructured":"Manikandasriram Srinivasan Ramanagopal and Jerome Le Ny. 2016. Motion Planning Strategies for Autonomously Mapping 3D Structures. arXiv preprint arXiv:1602.06667 ( 2016 ). Manikandasriram Srinivasan Ramanagopal and Jerome Le Ny. 2016. Motion Planning Strategies for Autonomously Mapping 3D Structures. arXiv preprint arXiv:1602.06667 (2016)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.178"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.28"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7354011"},{"key":"e_1_2_2_24_1","volume-title":"Robotic mapping: a survey","author":"Thrun Sebastian","year":"2002","unstructured":"Sebastian Thrun . 2002. Robotic mapping: a survey . Morgan Kaufmann Publishers Inc . 2002 pages. Sebastian Thrun. 2002. Robotic mapping: a survey. Morgan Kaufmann Publishers Inc. 2002 pages."},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751556"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2015.XI.001"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661242"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818075"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2980224"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3130800.3130812"},{"key":"e_1_2_2_31_1","volume-title":"3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions. arXiv preprint arXiv:1603.08182","author":"Zeng Andy","year":"2016","unstructured":"Andy Zeng , Shuran Song , Matthias Nie\u00dfner , Matthew Fisher , Jianxiong Xiao , and Thomas Funkhouser . 2016. 3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions. arXiv preprint arXiv:1603.08182 ( 2016 ). Andy Zeng, Shuran Song, Matthias Nie\u00dfner, Matthew Fisher, Jianxiong Xiao, and Thomas Funkhouser. 2016. 3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions. arXiv preprint arXiv:1603.08182 (2016)."},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2768821"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197517.3201295","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3197517.3201295","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:44Z","timestamp":1750210784000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197517.3201295"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,30]]},"references-count":32,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2018,8,31]]}},"alternative-id":["10.1145\/3197517.3201295"],"URL":"https:\/\/doi.org\/10.1145\/3197517.3201295","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,7,30]]},"assertion":[{"value":"2018-07-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}