{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T13:34:41Z","timestamp":1773149681743,"version":"3.50.1"},"reference-count":41,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2021,10,29]],"date-time":"2021-10-29T00:00:00Z","timestamp":1635465600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Automated extraction of buildings from Earth observation (EO) data is important for various applications, including updating of maps, risk assessment, urban planning, and policy-making. Combining data from different sensors, such as high-resolution multispectral images (HRI) and light detection and ranging (LiDAR) data, has shown great potential in building extraction. Deep learning (DL) is increasingly used in multi-modal data fusion and urban object extraction. However, DL-based multi-modal fusion networks may under-perform due to insufficient learning of \u201cjoint features\u201d from multiple sources and oversimplified approaches to fusing multi-modal features. Recently, a hybrid attention-aware fusion network (HAFNet) has been proposed for building extraction from a dataset, including co-located Very-High-Resolution (VHR) optical images and light detection and ranging (LiDAR) joint data. The system reported good performances thanks to the adaptivity of the attention mechanism to the features of the information content of the three streams but suffered from model over-parametrization, which inevitably leads to long training times and heavy computational load. In this paper, the authors propose a restructuring of the scheme, which involved replacing VGG-16-like encoders with the recently proposed EfficientNet, whose advantages counteract exactly the issues found with the HAFNet scheme. The novel configuration was tested on multiple benchmark datasets, reporting great improvements in terms of processing times, and also in terms of accuracy. The new scheme, called HAFNetE (HAFNet with EfficientNet integration), appears indeed capable of achieving good results with less parameters, translating into better computational efficiency. Based on these findings, we can conclude that, given the current advancements in single-thread schemes, the classical multi-thread HAFNet scheme could be effectively transformed by the HAFNetE scheme by replacing VGG-16 with EfficientNet blocks on each single thread. The remarkable reduction achieved in computational requirements moves the system one step closer to on-board implementation in a possible, future \u201curban mapping\u201d satellite constellation.<\/jats:p>","DOI":"10.3390\/rs13214361","type":"journal-article","created":{"date-parts":[[2021,11,1]],"date-time":"2021-11-01T22:24:22Z","timestamp":1635805462000},"page":"4361","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["Integrating EfficientNet into an HAFNet Structure for Building Mapping in High-Resolution Optical Earth Observation Data"],"prefix":"10.3390","volume":"13","author":[{"given":"Luca","family":"Ferrari","sequence":"first","affiliation":[{"name":"CNIT, Pavia Unit, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, 27100 Pavia, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0044-2998","authenticated-orcid":false,"given":"Fabio","family":"Dell\u2019Acqua","sequence":"additional","affiliation":[{"name":"CNIT, Pavia Unit, Department of Electrical, Computer and Biomedical Engineering, University of Pavia, 27100 Pavia, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peng","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Geographic Information Science, University of Nanjing, Nanjing 210093, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2488-2656","authenticated-orcid":false,"given":"Peijun","family":"Du","sequence":"additional","affiliation":[{"name":"Department of Geographic Information Science, University of Nanjing, Nanjing 210093, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,10,29]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.isprsjprs.2018.06.005","article-title":"Developing a multi-filter convolutional neural network for semantic segmentation using high-resolution aerial imagery and LiDAR data","volume":"143","author":"Sun","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Xu, Y., Du, B., and Zhang, L. (2018, January 22\u201327). Multi-source remote sensing data classification via fully convolutional networks and post-classification processing. Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518295"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Hazirbas, C., Ma, L., Domokos, C., and Cremers, D. (2016, January 20\u201324). Fusenet: Incorporating depth into semantic segmentation via fusion-based cnn architecture. Proceedings of the Asian Conference on Computer Vision, Taipei, Taiwan.","DOI":"10.1007\/978-3-319-54181-5_14"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhang, W., Huang, H., Schmitz, M., Sun, X., Wang, H., and Mayer, H. (2018). Effective fusion of multi-modal remote sensing data in a fully convolutional network for semantic labeling. Remote Sens., 10.","DOI":"10.3390\/rs10010052"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1016\/j.isprsjprs.2017.11.009","article-title":"Classification with an edge: Improving semantic image segmentation with boundary detection","volume":"135","author":"Marmanis","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Marcos, D., Hamid, R., and Tuia, D. (2016, January 27\u201330). Geospatial correspondences for multimodal registration. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.550"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zhang, P., Du, P., Lin, C., Wang, X., Li, E., Xue, Z., and Bai, X. (2020). A Hybrid Attention-Aware Fusion Network (HAFNet) for Building Extraction from High-Resolution Imagery and LiDAR Data. Remote Sens., 12.","DOI":"10.3390\/rs12223764"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1109\/MAES.2020.3008468","article-title":"Towards the Use of Artificial Intelligence on the Edge in Space Systems: Challenges and Opportunities","volume":"35","author":"Furano","year":"2020","journal-title":"IEEE Aerosp. Electron. Syst. Mag."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Kothari, V., Liberis, E., and Lane, N.D. (2020, January 3\u20134). The final frontier: Deep learning in space. Proceedings of the 21st International Workshop on Mobile Computing Systems and Applications, Austin, TX, USA.","DOI":"10.1145\/3376897.3377864"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"7249","DOI":"10.1038\/s41598-021-86650-z","article-title":"Towards global flood mapping onboard low cost satellites with machine learning","volume":"11","author":"Smith","year":"2021","journal-title":"Sci. Rep."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Giuffrida, G., Diana, L., de Gioia, F., Benelli, G., Meoni, G., Donati, M., and Fanucci, L. (2020). CloudScout: A Deep Neural Network for On-Board Cloud Detection on Hyperspectral Images. Remote Sens., 12.","DOI":"10.3390\/rs12142205"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"103952","DOI":"10.1016\/j.engappai.2020.103952","article-title":"CubeSatNet: Ultralight Convolutional Neural Network designed for on-orbit binary image classification on a 1U CubeSat","volume":"96","author":"Maskey","year":"2020","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_14","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.C. (2018, January 18\u201323). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_16","unstructured":"Tan, M., and Le, Q. (2019, January 9\u201315). Efficientnet: Rethinking model scaling for convolutional neural networks. Proceedings of the International Conference on Machine Learning, PMLR, Long Beach, CA, USA."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Bazi, Y., Al Rahhal, M.M., Alhichri, H., and Alajlan, N. (2019). Simple Yet Effective Fine-Tuning of Deep CNNs Using an Auxiliary Classification Loss for Remote Sensing Scene Classification. Remote Sens., 11.","DOI":"10.3390\/rs11242908"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"14078","DOI":"10.1109\/ACCESS.2021.3051085","article-title":"Classification of Remote Sensing Images Using EfficientNet-B3 CNN Model With Attention","volume":"9","author":"Alhichri","year":"2021","journal-title":"IEEE Access"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Lasloum, T., Alhichri, H., Bazi, Y., and Alajlan, N. (2021). SSDAN: Multi-Source Semi-Supervised Domain Adaptation Network for Remote Sensing Scene Classification. Remote Sens., 13.","DOI":"10.3390\/rs13193861"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Salas, J., Vera, P., Zea-Ortiz, M., Villase\u00f1or, E.A., Pulido, D., and Figueroa, A. (2021). Fine-Grained Large-Scale Vulnerable Communities Mapping via Satellite Imagery and Population Census Using Deep Learning. Remote Sens., 13.","DOI":"10.3390\/rs13183603"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1016\/j.isprsjprs.2020.07.002","article-title":"Cross-regional oil palm tree counting and detection via a multi-level attention domain adaptation network","volume":"167","author":"Zheng","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Cai, W., and Wei, Z. (2020). Remote Sensing Image Classification Based on a Cross-Attention Mechanism and Graph Convolution. IEEE Geosci. Remote Sens. Lett., 1\u20135.","DOI":"10.1109\/LGRS.2020.3026587"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Huang, X., He, B., Tong, M., Wang, D., and He, C. (2021). Few-Shot Object Detection on Remote Sensing Images via Shared Attention Module and Balanced Fine-Tuning Strategy. Remote Sens., 13.","DOI":"10.3390\/rs13193816"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Shi, H., Fan, J., Wang, Y., and Chen, L. (2021). Dual Attention Feature Fusion and Adaptive Context for Accurate Segmentation of Very High-Resolution Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13183715"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"2825","DOI":"10.1109\/TIP.2019.2891104","article-title":"Three-stream attention-aware network for RGB-D salient object detection","volume":"28","author":"Chen","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_29","unstructured":"(2021, May 10). ImageNet. Available online: https:\/\/image-net.org\/index.php."},{"key":"ref_30","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_32","unstructured":"(2021, May 10). ISPRS 2D Semantic Labeling Contest. Available online: https:\/\/www2.isprs.org\/commissions\/comm2\/wg4\/benchmark\/semantic-labeling\/."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"022003","DOI":"10.1088\/1742-6596\/1213\/2\/022003","article-title":"Exploring An Easy Way for Imbalanced Data Sets in Semantic Image Segmentation","volume":"1213","author":"Xia","year":"2019","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_34","unstructured":"Yakubovskiy, P. (2021, May 10). Segmentation Models Pytorch. Available online: https:\/\/github.com\/qubvel\/segmentation_models.pytorch."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Shang, R., Zhang, J., Jiao, L., Li, Y., Marturi, N., and Stolkin, R. (2020). Multi-scale adaptive feature fusion network for semantic segmentation in remote sensing images. Remote Sens., 12.","DOI":"10.3390\/rs12050872"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1766","DOI":"10.1109\/LGRS.2019.2907009","article-title":"End-to-end DSM fusion networks for semantic segmentation in high-resolution aerial images","volume":"16","author":"Cao","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"2612","DOI":"10.1109\/JSTARS.2019.2906387","article-title":"Densely based multi-scale and multi-modal fully convolutional networks for high-resolution remote-sensing image semantic segmentation","volume":"12","author":"Peng","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Liu, C., Zeng, D., Wu, H., Wang, Y., Jia, S., and Xin, L. (2020). Urban land cover classification of high-resolution aerial imagery using a relation-enhanced multiscale convolutional network. Remote Sens., 12.","DOI":"10.3390\/rs12020311"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Lei, T., Li, L., Lv, Z., Zhu, M., Du, X., and Nandi, A.K. (2021). Multi-Modality and Multi-Scale Attention Fusion Network for Land Cover Classification from VHR Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13183771"},{"key":"ref_41","unstructured":"Tan, M., and Le, Q.V. (2021). Efficientnetv2: Smaller models and faster training. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/21\/4361\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:22:59Z","timestamp":1760167379000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/21\/4361"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,29]]},"references-count":41,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["rs13214361"],"URL":"https:\/\/doi.org\/10.3390\/rs13214361","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,29]]}}}