{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T05:04:06Z","timestamp":1780981446894,"version":"3.54.1"},"reference-count":31,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T00:00:00Z","timestamp":1605744000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012550","name":"National Research, Development and Innovation Fund","doi-asserted-by":"publisher","award":["K-120233 and 2018-2.1.3-EUREKA-2018-00032"],"award-info":[{"award-number":["K-120233 and 2018-2.1.3-EUREKA-2018-00032"]}],"id":[{"id":"10.13039\/501100012550","id-type":"DOI","asserted-by":"publisher"}]},{"name":"European Union and the Hungarian Government from the project \u2018Intensification of the activities of HU-MATHS-IN\u2014Hungarian Service Network of Mathematics for Industry and Innovation","award":["EFOP-3.6.2-16-2017-00015"],"award-info":[{"award-number":["EFOP-3.6.2-16-2017-00015"]}]},{"name":"National Excellence Program","award":["2018-1.2.1-NKP-00008"],"award-info":[{"award-number":["2018-1.2.1-NKP-00008"]}]},{"name":"New National Excellence Program of the Ministry of Human Capacities","award":["\u00daNKP-20-4"],"award-info":[{"award-number":["\u00daNKP-20-4"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Automatic building categorization and analysis are particularly relevant for smart city applications and cultural heritage programs. Taking a picture of the facade of a building and instantly obtaining information about it can enable the automation of processes in urban planning, virtual city tours, and digital archiving of cultural artifacts. 
In this paper, we go beyond traditional convolutional neural networks (CNNs) for image classification and propose the HierarchyNet: a new hierarchical network for the classification of urban buildings from all across the globe into different main and subcategories from images of their facades. We introduce a coarse-to-fine hierarchy on the dataset and the model learns to simultaneously extract features and classify across both levels of hierarchy. We propose a new multiplicative layer, which is able to improve the accuracy of the finer prediction by considering the feedback signal of the coarse layers. We have quantitatively evaluated the proposed approach both on our proposed building datasets, as well as on various benchmark databases to demonstrate that the model is able to efficiently learn hierarchical information. The HierarchyNet model is able to outperform the state-of-the-art convolutional neural networks in urban building classification as well as in other multi-label classification tasks while using significantly fewer parameters.<\/jats:p>","DOI":"10.3390\/rs12223794","type":"journal-article","created":{"date-parts":[[2020,11,19]],"date-time":"2020-11-19T06:23:52Z","timestamp":1605767032000},"page":"3794","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":43,"title":["HierarchyNet: Hierarchical CNN-Based Urban Building Classification"],"prefix":"10.3390","volume":"12","author":[{"given":"Salma","family":"Taoufiq","sequence":"first","affiliation":[{"name":"Department of Mathematics and Its Applications, Central European University, 1051 Budapest, Hungary"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0338-863X","authenticated-orcid":false,"given":"Bal\u00e1zs","family":"Nagy","sequence":"additional","affiliation":[{"name":"Institute for Computer Science and Control, Machine Perception Research Laboratory, 1111 Budapest, 
Hungary"},{"name":"Faculty of Information Technology and Bionics, P\u00e9ter P\u00e1zm\u00e1ny Catholic University, 1083 Budapest, Hungary"},{"name":"Faculty of Informatics, University of Debrecen, 4028 Debrecen, Hungary"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3203-0741","authenticated-orcid":false,"given":"Csaba","family":"Benedek","sequence":"additional","affiliation":[{"name":"Institute for Computer Science and Control, Machine Perception Research Laboratory, 1111 Budapest, Hungary"},{"name":"Faculty of Information Technology and Bionics, P\u00e9ter P\u00e1zm\u00e1ny Catholic University, 1083 Budapest, Hungary"},{"name":"Faculty of Informatics, University of Debrecen, 4028 Debrecen, Hungary"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2020,11,19]]},"reference":[{"key":"ref_1","unstructured":"(2020, December 08). 2018 Revision of World Urbanization Prospects. Available online: https:\/\/www.un.org\/development\/desa\/publications\/2018-revision-of-world-urbanization-prospects.html."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"You, Y., Wang, S., Ma, Y., Chen, G., Wang, B., Shen, M., and Liu, W. (2018). Building detection from VHR remote sensing imagery based on the morphological building index. Remote Sens., 10.","DOI":"10.3390\/rs10081287"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Zhang, P., Ke, Y., Zhang, Z., Wang, M., Li, P., and Zhang, S. (2018). Urban land use and land cover classification using novel deep learning models based on high spatial resolution satellite imagery. Sensors, 18.","DOI":"10.3390\/s18113717"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Xu, Y., Wu, L., Xie, Z., and Chen, Z. (2018). Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. 
Remote Sens., 10.","DOI":"10.3390\/rs10010144"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhang, Q., Wang, Y., Liu, Q., Liu, X., and Wang, W. (2016, January 10\u201315). CNN based suburban building detection using monocular high resolution Google Earth images. Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China.","DOI":"10.1109\/IGARSS.2016.7729166"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Vakalopoulou, M., Karantzalos, K., Komodakis, N., and Paragios, N. (2015, January 25\u201329). Building detection in very high resolution multispectral data with deep learning features. Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Seoul, Korea.","DOI":"10.1109\/IGARSS.2015.7326158"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Huang, Y., Zhuo, L., Tao, H., Shi, Q., and Liu, K. (2017). A novel building type classification scheme based on integrated LiDAR and high-resolution images. Remote Sens., 9.","DOI":"10.3390\/rs9070679"},{"key":"ref_8","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote. Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1007\/s10489-016-0762-6","article-title":"Rapid building detection using machine learning","volume":"45","author":"Cohen","year":"2016","journal-title":"Appl. Intell."},{"key":"ref_11","unstructured":"Muhr, V., Despotovic, M., Koch, D., D\u00f6ller, M., and Zeppelzauer, M. (2017, January 29\u201330). Towards Automated Real Estate Assessment from Satellite Images with CNNs. 
Proceedings of the Forum Media Technology, P\u00f6lten, Austria."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Hoffmann, E.J., Wang, Y., Werner, M., Kang, J., and Zhu, X.X. (2019). Model fusion for building type classification from aerial and street view images. Remote Sens., 11.","DOI":"10.3390\/rs11111259"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Cao, R., Zhu, J., Tu, W., Li, Q., Cao, J., Liu, B., Zhang, Q., and Qiu, G. (2018). Integrating aerial and street view images for urban land use classification. Remote Sens., 10.","DOI":"10.3390\/rs10101553"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.isprsjprs.2017.05.002","article-title":"Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks","volume":"130","author":"Alshehhi","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Xu, Y., Xie, Z., Feng, Y., and Chen, Z. (2018). Road extraction from high-resolution remote sensing imagery using deep learning. Remote Sens., 10.","DOI":"10.3390\/rs10091461"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Law, S., Shen, Y., and Seresinhe, C. (2017, January 7\u201310). An application of convolutional neural network in street image classification: The case study of London. Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery, Redondo Beach, CA, USA.","DOI":"10.1145\/3149808.3149810"},{"key":"ref_17","first-page":"1","article-title":"Street-Frontage-Net: Urban image classification using deep convolutional neural networks","volume":"34","author":"Law","year":"2018","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Bebis, G., Boyle, R., Parvin, B., Koracin, D., Wang, S., Kyungnam, K., Benes, B., Moreland, K., Borst, C., and DiVerdi, S. (2011). 
Architectural Style Classification of Building Facade Windows. Advances in Visual Computing, Springer.","DOI":"10.1007\/978-3-642-24031-7"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Bebis, G., Boyle, R., Parvin, B., Koracin, D., Fowlkes, C., Wang, S., Choi, M.H., Mantler, S., Schulze, J., and Acevedo, D. (2012). Architectural Style Classification of Domes. Advances in Visual Computing, Springer.","DOI":"10.1007\/978-3-642-33191-6"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"011016","DOI":"10.1117\/1.JEI.26.1.011016","article-title":"Architectural style classification of Mexican historical buildings using deep convolutional neural networks and sparse features","volume":"26","author":"Ramirez","year":"2016","journal-title":"J. Electron. Imaging"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1080\/15481603.2017.1338389","article-title":"Building block level urban land-use information retrieval based on Google Street View images","volume":"54","author":"Li","year":"2017","journal-title":"GISci. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Yan, Z., Zhang, H., Piramuthu, R., Jagadeesh, V., DeCoste, D., Di, W., and Yu, Y. (2015, January 7\u201310). HD-CNN: Hierarchical deep convolutional neural networks for large scale visual recognition. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.314"},{"key":"ref_23","unstructured":"Zhu, X., and Bain, M. (2017). B-CNN: Branch Convolutional Neural Network for Hierarchical Classification. 
arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/0010-0277(93)90058-4","article-title":"Learning and development in neural networks: The importance of starting small","volume":"48","author":"Elman","year":"1993","journal-title":"Cognition"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Bengio, Y., Louradour, J., Collobert, R., and Weston, J. (2009, January 14\u201318). Curriculum learning. Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, QC, Canada.","DOI":"10.1145\/1553374.1553380"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (2014). Architectural Style Classification Using Multinomial Latent Logistic Regression. Computer Vision\u2014ECCV 2014, Springer.","DOI":"10.1007\/978-3-319-10590-1"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2014, January 6\u201312). Visualizing and understanding convolutional networks. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_30","unstructured":"Krizhevsky, A., and Hinton, G. (2009). Learning Multiple Layers of Features From Tiny Images, Department of Computer Science, University of Toronto. Technical report."},{"key":"ref_31","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. 
(2020, October 21). Deep Learning. Available online: http:\/\/www.deeplearningbook.org."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/22\/3794\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:34:16Z","timestamp":1760178856000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/22\/3794"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,19]]},"references-count":31,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2020,11]]}},"alternative-id":["rs12223794"],"URL":"https:\/\/doi.org\/10.3390\/rs12223794","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,11,19]]}}}