{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T09:33:05Z","timestamp":1769765585294,"version":"3.49.0"},"reference-count":30,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2008,12,1]],"date-time":"2008-12-01T00:00:00Z","timestamp":1228089600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Singapore FRC","award":["R-263-000-477-112"],"award-info":[{"award-number":["R-263-000-477-112"]}]},{"DOI":"10.13039\/501100002920","name":"Research Grants Council, University Grants Committee, Hong Kong","doi-asserted-by":"publisher","award":["618908619107619006RGC\/NSFC N-HKUST602\/05"],"award-info":[{"award-number":["618908619107619006RGC\/NSFC N-HKUST602\/05"]}],"id":[{"id":"10.13039\/501100002920","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:p>We propose in this paper a semi-automatic image-based approach to fa\u00e7ade modeling that uses images captured along streets and relies on structure from motion to recover camera positions and point clouds automatically as the initial stage for modeling. We start by considering a building fa\u00e7ade as a flat rectangular plane or a developable surface with an associated texture image composited from the multiple visible images. A fa\u00e7ade is then decomposed and structured into a Directed Acyclic Graph of rectilinear elementary patches. The decomposition is carried out top-down by a recursive subdivision, and followed by a bottom-up merging with the detection of the architectural bilateral symmetry and repetitive patterns. Each subdivided patch of the flat fa\u00e7ade is augmented with a depth optimized using the 3D points cloud. Our system also allows for an easy user feedback in the 2D image space for the proposed decomposition and augmentation. Finally, our approach is demonstrated on a large number of fa\u00e7ades from a variety of street-side images.<\/jats:p>","DOI":"10.1145\/1409060.1409114","type":"journal-article","created":{"date-parts":[[2008,12,3]],"date-time":"2008-12-03T21:56:04Z","timestamp":1228341364000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":98,"title":["Image-based fa\u00e7ade modeling"],"prefix":"10.1145","volume":"27","author":[{"given":"Jianxiong","family":"Xiao","sequence":"first","affiliation":[{"name":"The Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tian","family":"Fang","sequence":"additional","affiliation":[{"name":"The Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ping","family":"Tan","sequence":"additional","affiliation":[{"name":"National University of Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peng","family":"Zhao","sequence":"additional","affiliation":[{"name":"The Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eyal","family":"Ofek","sequence":"additional","affiliation":[{"name":"Microsoft"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Long","family":"Quan","sequence":"additional","affiliation":[{"name":"The Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2008,12]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141966"},{"key":"e_1_2_2_2_1","volume-title":"Proceedings of IEEE International Conference on Computer Vision, 1--8.","author":"Berg A. C.","unstructured":"Berg , A. C. , Grabler , F. , and Malik , J . 2007. Parsing images of architectural scenes . In Proceedings of IEEE International Conference on Computer Vision, 1--8. Berg, A. C., Grabler, F., and Malik, J. 2007. Parsing images of architectural scenes. In Proceedings of IEEE International Conference on Computer Vision, 1--8."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1986.4767851"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0081-9"},{"key":"e_1_2_2_5_1","first-page":"721","article-title":"Object removal by exemplar-based inpainting","volume":"2","author":"Criminisi A.","year":"2003","unstructured":"Criminisi , A. , Perez , P. , and Toyama , K. 2003 . Object removal by exemplar-based inpainting . In Proceedings of IEEE Computer Vision and Pattern Recognition , vol. 2 , 721 -- 728 . Criminisi, A., Perez, P., and Toyama, K. 2003. Object removal by exemplar-based inpainting. In Proceedings of IEEE Computer Vision and Pattern Recognition, vol. 2, 721--728.","journal-title":"Proceedings of IEEE Computer Vision and Pattern Recognition"},{"key":"e_1_2_2_6_1","volume-title":"Proceedings of ACM SIGGRAPH, 11--20","author":"Debevec P.","unstructured":"Debevec , P. , Taylor , C. , and Malik , J . 1996. Modeling and rendering architecture from photographs: a hybrid geometry-and image-based approach . In Proceedings of ACM SIGGRAPH, 11--20 . 10.1145\/237170.237191 Debevec, P., Taylor, C., and Malik, J. 1996. Modeling and rendering architecture from photographs: a hybrid geometry-and image-based approach. In Proceedings of ACM SIGGRAPH, 11--20. 10.1145\/237170.237191"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029665.07652.61"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/361237.361242"},{"key":"e_1_2_2_9_1","first-page":"562","article-title":"Constructing 3d city models by merging ground-based and airborne views","volume":"2","author":"Fr\u00fch C.","year":"2003","unstructured":"Fr\u00fch , C. , and Zakhor , A. 2003 . Constructing 3d city models by merging ground-based and airborne views . In Proceedings of IEEE Computer Vision and Pattern Recognition , vol. 2 , 562 -- 569 . Fr\u00fch, C., and Zakhor, A. 2003. Constructing 3d city models by merging ground-based and airborne views. In Proceedings of IEEE Computer Vision and Pattern Recognition, vol. 2, 562--569.","journal-title":"Proceedings of IEEE Computer Vision and Pattern Recognition"},{"key":"e_1_2_2_10_1","volume-title":"Proceedings of IEEE Conference Computer Vision and Pattern Recognition, 1--8.","author":"Furukawa Y.","unstructured":"Furukawa , Y. , and Ponce , J . 2007. Accurate, dense, and robust multi-view stereopsis . In Proceedings of IEEE Conference Computer Vision and Pattern Recognition, 1--8. Furukawa, Y., and Ponce, J. 2007. Accurate, dense, and robust multi-view stereopsis. In Proceedings of IEEE Conference Computer Vision and Pattern Recognition, 1--8."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1984.4767596"},{"key":"e_1_2_2_12_1","volume-title":"Proceeding of IEEE International Conference in Computer Vision, 1--8.","author":"Goesele M.","unstructured":"Goesele , M. , Snavely , N. , Curless , B. , Seitz , S. M. , and Hoppe , H . 2007. Multi-view stereo for community photo collections . In Proceeding of IEEE International Conference in Computer Vision, 1--8. Goesele, M., Snavely, N., Curless, B., Seitz, S. M., and Hoppe, H. 2007. Multi-view stereo for community photo collections. In Proceeding of IEEE International Conference in Computer Vision, 1--8."},{"key":"e_1_2_2_13_1","doi-asserted-by":"crossref","unstructured":"Hartley R. I. and Zisserman A. 2004. Multiple View Geometry in Computer Vision second ed. Cambridge University Press ISBN: 0521540518.   Hartley R. I. and Zisserman A. 2004. Multiple View Geometry in Computer Vision second ed. Cambridge University Press ISBN: 0521540518.","DOI":"10.1017\/CBO9780511811685"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177729694"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2005.44"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015719"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.isprsjprs.2006.06.005"},{"key":"e_1_2_2_18_1","volume-title":"Proceedings of the European Conference on Computer Vision. 10","author":"Lukas Z.","unstructured":"Lukas , Z. , Joachim , B. , Konrad , K. , and Horst , B . 2008. Fusion of feature- and area-based information for urban buildings modeling from aerial imagery . In Proceedings of the European Conference on Computer Vision. 10 .1007\/978-3-540-88693-8_64 Lukas, Z., Joachim, B., Konrad, K., and Horst, B. 2008. Fusion of feature- and area-based information for urban buildings modeling from aerial imagery. In Proceedings of the European Conference on Computer Vision. 10.1007\/978-3-540-88693-8_64"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141931"},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276484"},{"key":"e_1_2_2_21_1","first-page":"433","article-title":"Image-based modeling and photo editing","volume":"1","author":"Oh B. M.","year":"2001","unstructured":"Oh , B. M. , Chen , M. , Dorsey , J. , and Durand , F. 2001 . Image-based modeling and photo editing . ACM Transactions on Graphics 1 , 433 -- 442 . Oh, B. M., Chen, M., Dorsey, J., and Durand, F. 2001. Image-based modeling and photo editing. ACM Transactions on Graphics 1, 433--442.","journal-title":"ACM Transactions on Graphics"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-007-0086-4"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1014573219977"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141964"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1073204.1073274"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276485"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2007.01.006"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/18.910585"},{"key":"e_1_2_2_29_1","volume-title":"Proceedings of the European Conference on Computer Vision","volume":"2","author":"Werner T.","unstructured":"Werner , T. , and Zisserman , A . 2002. New techniques for automated architectural reconstruction from photographs . In Proceedings of the European Conference on Computer Vision , vol. 2 , 541--555. Werner, T., and Zisserman, A. 2002. New techniques for automated architectural reconstruction from photographs. In Proceedings of the European Conference on Computer Vision, vol. 2, 541--555."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/882262.882324"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1409060.1409114","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1409060.1409114","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:47:21Z","timestamp":1750258041000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1409060.1409114"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12]]},"references-count":30,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["10.1145\/1409060.1409114"],"URL":"https:\/\/doi.org\/10.1145\/1409060.1409114","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,12]]},"assertion":[{"value":"2008-12-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}