{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T12:45:56Z","timestamp":1769690756590,"version":"3.49.0"},"reference-count":50,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2014,11,19]],"date-time":"2014-11-19T00:00:00Z","timestamp":1416355200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Beijing Higher Institution Engineering Research Center"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2014,11,19]]},"abstract":"<jats:p>We present a novel solution to automatic semantic modeling of indoor scenes from a sparse set of low-quality RGB-D images. Such data presents challenges due to noise, low resolution, occlusion and missing depth information. We exploit the knowledge in a scene database containing 100s of indoor scenes with over 10,000 manually segmented and labeled mesh models of objects. In seconds, we output a visually plausible 3D scene, adapting these models and their parts to fit the input scans. Contextual relationships learned from the database are used to constrain reconstruction, ensuring semantic compatibility between both object models and parts. Small objects and objects with incomplete depth information which are difficult to recover reliably are processed with a two-stage approach. Major objects are recognized first, providing a known scene structure. 2D contour-based model retrieval is then used to recover smaller objects. Evaluations using our own data and two public datasets show that our approach can model typical real-world indoor scenes efficiently and robustly.<\/jats:p>","DOI":"10.1145\/2661229.2661239","type":"journal-article","created":{"date-parts":[[2014,11,18]],"date-time":"2014-11-18T14:21:03Z","timestamp":1416320463000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":97,"title":["Automatic semantic modeling of indoor scenes from low-quality RGB-D data using contextual information"],"prefix":"10.1145","volume":"33","author":[{"given":"Kang","family":"Chen","sequence":"first","affiliation":[{"name":"Tsinghua University, Beijing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu-Kun","family":"Lai","sequence":"additional","affiliation":[{"name":"Cardiff University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu-Xin","family":"Wu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ralph","family":"Martin","sequence":"additional","affiliation":[{"name":"Cardiff University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shi-Min","family":"Hu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2014,11,19]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proc. CVPR, 2703--2710","author":"Bao S. Y.","unstructured":"Bao , S. Y. , Bagra , M. , Chao , Y.-W. , and Savarese , S . 2012. Semantic structure from motion with points, regions, and objects . In Proc. CVPR, 2703--2710 . Bao, S. Y., Bagra, M., Chao, Y.-W., and Savarese, S. 2012. Semantic structure from motion with points, regions, and objects. In Proc. CVPR, 2703--2710."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2007.09.014"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.993558"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.121791"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Bo L. Ren X. and Fox D. 2011. Depth kernel descriptors for object recognition. In IROS 821--826.  Bo L. Ren X. and Fox D. 2011. Depth kernel descriptors for object recognition. In IROS 821--826.","DOI":"10.1109\/IROS.2011.6095119"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.969114"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/0262-8856(92)90066-C"},{"key":"e_1_2_1_9_1","volume-title":"Proc. CVPR, 1271--1278","author":"Divvala S. K.","unstructured":"Divvala , S. K. , Hoiem , D. , Hays , J. H. , Efros , A. A. , and Hebert , M . 2009. An empirical study of context in object detection . In Proc. CVPR, 1271--1278 . Divvala, S. K., Hoiem, D., Hays, J. H., Efros, A. A., and Hebert, M. 2009. An empirical study of context in object detection. In Proc. CVPR, 1271--1278."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/MRA.2006.1638022"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185527"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1882261.1866204"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964929"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366154"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007465528199"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/588272.588279"},{"key":"e_1_2_1_18_1","volume-title":"Proc. ICCV, 80--87","author":"Furukawa Y.","unstructured":"Furukawa , Y. , Curless , B. , Seitz , S. M. , and Szeliski , R . 2009. Reconstructing building interiors from images . In Proc. ICCV, 80--87 . Furukawa, Y., Curless, B., Seitz, S. M., and Szeliski, R. 2009. Reconstructing building interiors from images. In Proc. ICCV, 80--87."},{"key":"e_1_2_1_19_1","volume-title":"Proc. CVPR, 1--8.","author":"Galleguillos C.","unstructured":"Galleguillos , C. , Rabinovich , A. , and Belongie , S . 2008. Object categorization using co-occurrence, location and appearance . In Proc. CVPR, 1--8. Galleguillos, C., Rabinovich, A., and Belongie, S. 2008. Object categorization using co-occurrence, location and appearance. In Proc. CVPR, 1--8."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2006.233"},{"key":"e_1_2_1_21_1","volume-title":"Proc. Intl. Symp. Experimental Robotics, 22--25","author":"Henry P.","unstructured":"Henry , P. , Krainin , M. , Herbst , E. , Ren , X. , and Fox , D . 2010. RGB-D mapping: Using depth cameras for dense 3D modeling of indoor environments . In Proc. Intl. Symp. Experimental Robotics, 22--25 . Henry, P., Krainin, M., Herbst, E., Ren, X., and Fox, D. 2010. RGB-D mapping: Using depth cameras for dense 3D modeling of indoor environments. In Proc. Intl. Symp. Experimental Robotics, 22--25."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-37331-2_42"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2047196.2047270"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.765655"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366157"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.409"},{"key":"e_1_2_1_27_1","volume-title":"Proc. ICRA.","author":"Lai K.","unstructured":"Lai , K. , Bo , L. , Ren , X. , and Fox , D . 2012. Detection-based Object Labeling in 3D Scenes . In Proc. ICRA. Lai, K., Bo, L., Ren, X., and Fox, D. 2012. Detection-based Object Labeling in 3D Scenes. In Proc. ICRA."},{"key":"e_1_2_1_28_1","volume-title":"Proc. ICRA.","author":"Lai K.","unstructured":"Lai , K. , Bo , L. , and Fox , D . 2014. Unsupervised Feature Learning for 3D Scene Labeling . In Proc. ICRA. Lai, K., Bo, L., and Fox, D. 2014. Unsupervised Feature Learning for 3D Scene Labeling. In Proc. ICRA."},{"key":"e_1_2_1_29_1","first-page":"1150","article-title":"Object recognition from local scale-invariant features","volume":"2","author":"Lowe D. G.","year":"1999","unstructured":"Lowe , D. G. 1999 . Object recognition from local scale-invariant features . In Proc. ICCV , vol. 2 , 1150 -- 1157 . Lowe, D. G. 1999. Object recognition from local scale-invariant features. In Proc. ICCV, vol. 2, 1150--1157.","journal-title":"Proc. ICCV"},{"key":"e_1_2_1_30_1","unstructured":"Malisiewicz T. and Efros A. A. 2009. Beyond categories: The visual memex model for reasoning about object relationships. In NIPS.  Malisiewicz T. and Efros A. A. 2009. Beyond categories: The visual memex model for reasoning about object relationships. In NIPS ."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964982"},{"key":"e_1_2_1_32_1","volume-title":"Multiple Images: Part 1: Principles","author":"Moons T.","year":"2009","unstructured":"Moons , T. , van Gool , L. , and Vergauwen , M . 2009 . 3D Reconstruction from Multiple Images: Part 1: Principles . Now Publishers . Moons, T., van Gool, L., and Vergauwen, M. 2009. 3D Reconstruction from Multiple Images: Part 1: Principles. Now Publishers."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778830"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366156"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1983.4767405"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015720"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.178"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.235"},{"key":"e_1_2_1_39_1","volume-title":"Proc. BMVC.","author":"Satkin S.","unstructured":"Satkin , S. , Lin , J. , and Hebert , M . 2012. Data-driven scene understanding from 3D models . In Proc. BMVC. Satkin, S., Lin, J., and Hebert, M. 2012. Data-driven scene understanding from 3D models. In Proc. BMVC."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.51"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366155"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2366145.2366199"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"e_1_2_1_44_1","volume-title":"Proc. NIPS, 17","author":"Torralba A.","unstructured":"Torralba , A. , Murphy , K. P. , and Freeman , W. T . 2004. Contextual models for object detection using boosted random fields . In Proc. NIPS, 17 . Torralba, A., Murphy, K. P., and Freeman, W. T. 2004. Contextual models for object detection using boosted random fields. In Proc. NIPS, 17."},{"key":"e_1_2_1_45_1","volume-title":"Proc. ICRA.","author":"Trevor A. J. B.","year":"2012","unstructured":"Trevor , A. J. B. 2012 . Fast segmentation of organized point cloud data . In Proc. ICRA. Trevor, A. J. B. 2012. Fast segmentation of organized point cloud data. In Proc. ICRA."},{"key":"e_1_2_1_46_1","volume-title":"Proc. Intl. Conf. 3-D Digital Imaging and Modeling, 348--357","author":"Whitaker R. T.","unstructured":"Whitaker , R. T. , Gregor , J. , and Chen , P. F . 1999. Indoor scene reconstruction from sets of noisy range images . In Proc. Intl. Conf. 3-D Digital Imaging and Modeling, 348--357 . Whitaker, R. T., Gregor, J., and Chen, P. F. 1999. Indoor scene reconstruction from sets of noisy range images. In Proc. Intl. Conf. 3-D Digital Imaging and Modeling, 348--357."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185553"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461912.2461968"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2010324.1964981"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.402"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2661229.2661239","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2661229.2661239","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:19:47Z","timestamp":1750231187000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2661229.2661239"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,11,19]]},"references-count":50,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2014,11,19]]}},"alternative-id":["10.1145\/2661229.2661239"],"URL":"https:\/\/doi.org\/10.1145\/2661229.2661239","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,11,19]]},"assertion":[{"value":"2014-11-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}