{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:27:40Z","timestamp":1760149660054,"version":"build-2065373602"},"reference-count":37,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2023,9,6]],"date-time":"2023-09-06T00:00:00Z","timestamp":1693958400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The acquisition of building structures has broad applications across various fields. However, existing methods for inferring building structures predominantly depend on manual expertise, lacking sufficient automation. To tackle this challenge, we propose a building structure inference network that utilizes UAV remote sensing images, with the PIX2PIX network serving as the foundational framework. We enhance the generator by incorporating an additive attention module that performs multi-scale feature fusion, enabling the combination of features from diverse spatial resolutions of the feature map. This modification enhances the model\u2019s capability to emphasize global relationships during the mapping process. To ensure the completeness of line elements in the generator\u2019s output, we design a novel loss function based on the Hough transform. A line penalty term is introduced that transforms the output of the generator and ground truth to the Hough domain due to the original loss function\u2019s inability to effectively constrain the completeness of straight-line elements in the generated results in the spatial domain. A dataset of the appearance features obtained from UAV remote sensing images and the internal floor plan structure is made. Using UAV remote sensing images of multi-story residential buildings, high-rise residential buildings, and office buildings as test collections, the experimental results show that our method has better performance in inferring a room\u2019s layout and the locations of load-bearing columns, achieving an average improvement of 11.2% and 21.1% over PIX2PIX in terms of the IoU and RMSE, respectively.<\/jats:p>","DOI":"10.3390\/rs15184390","type":"journal-article","created":{"date-parts":[[2023,9,6]],"date-time":"2023-09-06T10:23:42Z","timestamp":1693995822000},"page":"4390","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["A Generative Adversarial Network with Spatial Attention Mechanism for Building Structure Inference Based on Unmanned Aerial Vehicle Remote Sensing Images"],"prefix":"10.3390","volume":"15","author":[{"given":"Hao","family":"Chen","sequence":"first","affiliation":[{"name":"School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150006, China"}]},{"given":"Zhixiang","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150006, China"}]},{"given":"Xing","family":"Meng","sequence":"additional","affiliation":[{"name":"Institute of Defense Engineering, Academy of Military Sciences, Beijing 100036, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8220-2877","authenticated-orcid":false,"given":"Fachuan","family":"He","sequence":"additional","affiliation":[{"name":"School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150006, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,9,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Jia, H., Song, Y., Chen, X., Liu, S., and Zhang, B. (2022). Seismic Performance Evaluation of a High-Rise Building with Structural Irregularities. Buildings, 12.","DOI":"10.3390\/buildings12091484"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1002\/tal.172","article-title":"Energy appproach in peformance-based seismic design of steel moment resisting frames for basic safety objective","volume":"10","author":"Akbas","year":"2001","journal-title":"Struct. Des. Tall Build."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/j.compositesb.2010.10.008","article-title":"Seismic performance of composite reinforced concrete and steel moment frame structures\u2013state-of-the-art","volume":"42","author":"Li","year":"2011","journal-title":"Compos. B Eng."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1002\/suco.202200378","article-title":"Seismic vulnerability assessment of reinforced concrete bridge piers with corroded bars","volume":"24","author":"Messina","year":"2023","journal-title":"Struct. Concr."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1144\/qjegh2013-004","article-title":"Sourcing stone for the conservation and repair of historical buildings in Britain","volume":"46","author":"Lott","year":"2013","journal-title":"Q. J. Eng. Geol. Hydrogeol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/j.isprsjprs.2014.02.013","article-title":"Unmanned aerial systems for photogrammetry and remote sensing: A review","volume":"92","author":"Colomina","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"7125","DOI":"10.1080\/01431161.2018.1523832","article-title":"Drones\u2013the third generation source of remote sensing data","volume":"39","author":"Cracknell","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1016\/j.rse.2015.12.029","article-title":"UAVs as remote sensing platform in glaciology: Present applications and future prospects","volume":"175","author":"Bhardwaj","year":"2016","journal-title":"Remote Sens. Environ."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Wang, J., Li, Y., and Chen, W. (2023). UAV Aerial Image Generation of Crucial Components of High-Voltage Transmission Lines Based on Multi-Level Generative Adversarial Network. Remote Sens., 15.","DOI":"10.3390\/rs15051412"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Mac\u00e9, S., Locteau, H., Valveny, E., and Tabbone, S. (2010, January 9\u201311). A system to detect rooms in architectural floor plan images. Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, Boston, MA, USA.","DOI":"10.1145\/1815330.1815352"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ahmed, S., Liwicki, M., Weber, M., and Dengel, A. (2011, January 18\u201321). Improved automatic analysis of architectural floor plans. Proceedings of the 2011 International Conference on Document Analysis and Recognition, Beijing, China.","DOI":"10.1109\/ICDAR.2011.177"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"De Las Heras, L.-P., Fern\u00e1ndez, D., Valveny, E., Llad\u00f3s, J., and S\u00e1nchez, G. (2013, January 25\u201328). Unsupervised wall detector in architectural floor plans. Proceedings of the 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA.","DOI":"10.1109\/ICDAR.2013.252"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.autcon.2015.12.008","article-title":"Automatic reconstruction of 3D building models from scanned 2D floor plans","volume":"63","author":"Gimenez","year":"2016","journal-title":"Autom. Constr."},{"key":"ref_14","unstructured":"Jang, H., Yang, J.H., and Kiyun, Y. (2018, January 28\u201331). Automatic wall detection and building topology and property of 2D floor plan (short paper). Proceedings of the 10th International Conference on Geographic Information Science (GIScience 2018), Melbourne, Australia."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"109096","DOI":"10.1016\/j.patcog.2022.109096","article-title":"IDA: Improving distribution analysis for reducing data complexity and dimensionality in hyperspectral images","volume":"134","author":"Dalal","year":"2023","journal-title":"Pattern Recognit."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Dodge, S., Xu, J., and Stenger, B. (2017, January 8\u201312). Parsing floor plan images. Proceedings of the 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA), Nagoya, Japan.","DOI":"10.23919\/MVA.2017.7986875"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, C., Wu, J., Kohli, P., and Furukawa, Y. (2017, January 22\u201329). Raster-to-vector: Revisiting floorplan transformation. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.241"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Lee, C.-Y., Badrinarayanan, V., Malisiewicz, T., and Rabinovich, A. (2017, January 22\u201329). Roomnet: End-to-end room layout estimation. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.521"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Huang, W., and Zheng, H. (2018, January 18\u201320). Architectural drawings recognition and generation through machine learning. Proceedings of the 38th Annual Conference of the Association for Computer Aided Design in Architecture, Mexico City, Mexico.","DOI":"10.52842\/conf.acadia.2018.156"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Yamasaki, T., Zhang, J., and Takada, Y. (2018, January 11). Apartment structure estimation using fully convolutional networks and graph model. Proceedings of the 2018 ACM Workshop on Multimedia for Real Estate Tech, Yokohama, Japan.","DOI":"10.1145\/3210499.3210528"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1111\/j.1365-3091.1971.tb00220.x","article-title":"Micritic envelopes of carbonate grains are not exclusively of photosynthetic algal origin","volume":"16","author":"Friedman","year":"1971","journal-title":"Sedimentology"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1068\/b070343","article-title":"Introduction to shape and shape grammars","volume":"7","author":"Stiny","year":"1980","journal-title":"Environ. Plann. B Plann. Des."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1145\/2766956","article-title":"Advanced procedural modeling of architecture","volume":"34","author":"Schwarz","year":"2015","journal-title":"ACM Trans. Graph."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1145\/3422622","article-title":"Generative adversarial networks","volume":"63","author":"Goodfellow","year":"2020","journal-title":"Commun. ACM"},{"key":"ref_25","unstructured":"Nauata, N., Chang, K.-H., Cheng, C.-Y., Mori, G., and Furukawa, Y. (2020). Computer Vision\u2014ECCV 2020 Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK, 23\u201328 August 2020, Proceedings, Part I 16, Springer."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Chang, K.-H., Cheng, C.-Y., Luo, J., Murata, S., Nourbakhsh, M., and Tsuji, Y. (2021, January 11\u201317). Building-GAN: Graph-conditioned architectural volumetric design generation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.01174"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Nauata, N., Hosseini, S., Chang, K.-H., Chu, H., Cheng, C.-Y., and Furukawa, Y. (2021, January 20\u201325). House-gan++: Generative adversarial layout refinement network towards intelligent computational agent for professional architects. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01342"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"104888","DOI":"10.1016\/j.autcon.2023.104888","article-title":"Building layout generation using site-embedded GAN model","volume":"151","author":"Jiang","year":"2023","journal-title":"Autom. Constr."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Isola, P., Zhu, J.-Y., Zhou, T., and Efros, A.A. (2017, January 21\u201326). Image-to-image translation with conditional adversarial networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.632"},{"key":"ref_30","unstructured":"Mirza, M., and Osindero, S. (2014). Conditional generative adversarial nets. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., and Efros, A.A. (2016, January 27\u201330). Context encoders: Feature learning by inpainting. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.278"},{"key":"ref_32","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2015, Proceedings of the 8th International Conference, Munich, Germany, 5\u20139 October 2015, Proceedings, Part III 18, Springer."},{"key":"ref_33","unstructured":"Li, C., and Wand, M. (2016). Computer Vision\u2014ECCV 2016, Proceedings of the 14th European Conference, Amsterdam, The Netherlands, 11\u201314 October 2016, Proceedings, Part III 14, Springer."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Milletari, F., Navab, N., and Ahmadi, S.-A. (2016, January 25\u201328). V-net: Fully convolutional neural networks for volumetric medical image segmentation. Proceedings of the 2016 Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA.","DOI":"10.1109\/3DV.2016.79"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1145\/361237.361242","article-title":"Use of the Hough transformation to detect lines and curves in pictures","volume":"15","author":"Duda","year":"1972","journal-title":"Commun. ACM"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1145\/3355089.3356556","article-title":"Data-driven interior plan generation for residential buildings","volume":"38","author":"Wu","year":"2019","journal-title":"ACM Trans. Graph."},{"key":"ref_37","unstructured":"Peters, N. (2018). Harvard University Graduate School of Design."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/18\/4390\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:46:20Z","timestamp":1760129180000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/18\/4390"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,6]]},"references-count":37,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2023,9]]}},"alternative-id":["rs15184390"],"URL":"https:\/\/doi.org\/10.3390\/rs15184390","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2023,9,6]]}}}