{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,8]],"date-time":"2026-06-08T04:57:45Z","timestamp":1780894665541,"version":"3.54.1"},"reference-count":56,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2020,11,16]],"date-time":"2020-11-16T00:00:00Z","timestamp":1605484800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["No. 41631176"],"award-info":[{"award-number":["No. 41631176"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Automated extraction of buildings from earth observation (EO) data has long been a fundamental but challenging research topic. Combining data from different modalities (e.g., high-resolution imagery (HRI) and light detection and ranging (LiDAR) data) has shown great potential in building extraction. Recent studies have examined the role that deep learning (DL) could play in both multimodal data fusion and urban object extraction. However, DL-based multimodal fusion networks may encounter the following limitations: (1) the individual modal and cross-modal features, which we consider both useful and important for final prediction, cannot be sufficiently learned and utilized and (2) the multimodal features are fused by a simple summation or concatenation, which appears ambiguous in selecting cross-modal complementary information. In this paper, we address these two limitations by proposing a hybrid attention-aware fusion network (HAFNet) for building extraction. It consists of RGB-specific, digital surface model (DSM)-specific, and cross-modal streams to sufficiently learn and utilize both individual modal and cross-modal features. Furthermore, an attention-aware multimodal fusion block (Att-MFBlock) was introduced to overcome the fusion problem by adaptively selecting and combining complementary features from each modality. Extensive experiments conducted on two publicly available datasets demonstrated the effectiveness of the proposed HAFNet for building extraction.<\/jats:p>","DOI":"10.3390\/rs12223764","type":"journal-article","created":{"date-parts":[[2020,11,16]],"date-time":"2020-11-16T21:48:52Z","timestamp":1605563332000},"page":"3764","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":62,"title":["A Hybrid Attention-Aware Fusion Network (HAFNet) for Building Extraction from High-Resolution Imagery and LiDAR Data"],"prefix":"10.3390","volume":"12","author":[{"given":"Peng","family":"Zhang","sequence":"first","affiliation":[{"name":"Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China"},{"name":"Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2488-2656","authenticated-orcid":false,"given":"Peijun","family":"Du","sequence":"additional","affiliation":[{"name":"Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China"},{"name":"Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Cong","family":"Lin","sequence":"additional","affiliation":[{"name":"Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China"},{"name":"Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6653-2585","authenticated-orcid":false,"given":"Xin","family":"Wang","sequence":"additional","affiliation":[{"name":"Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China"},{"name":"Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Erzhu","family":"Li","sequence":"additional","affiliation":[{"name":"School of Geography, Geomatics and Planning, Jiangsu Normal University, Xuzhou 221116, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6253-2967","authenticated-orcid":false,"given":"Zhaohui","family":"Xue","sequence":"additional","affiliation":[{"name":"School of Earth Sciences and Engineering, Hohai University, Nanjing 211100, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xuyu","family":"Bai","sequence":"additional","affiliation":[{"name":"Jiangsu Provincial Key Laboratory of Geographic Information Science and Technology, Key Laboratory for Land Satellite Remote Sensing Applications of Ministry of Natural Resources, School of Geography and Ocean Science, Nanjing University, Nanjing 210023, China"},{"name":"Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2020,11,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.isprsjprs.2019.02.019","article-title":"Automatic building extraction from high-resolution aerial images and LiDAR data using gated residual refinement network","volume":"151","author":"Huang","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2523","DOI":"10.1109\/TGRS.2006.874137","article-title":"Automatic Construction of Building Footprints From Airborne LIDAR Data","volume":"44","author":"Zhang","year":"2006","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"7393","DOI":"10.1109\/TGRS.2014.2311991","article-title":"Seamless fusion of LiDAR and aerial imagery for building extraction","volume":"52","author":"Zhou","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1416","DOI":"10.1109\/TGRS.2008.916480","article-title":"Fusion of Hyperspectral and LIDAR Remote Sensing Data for Classification of Complex Forest Areas","volume":"46","author":"Dalponte","year":"2008","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1080\/01490410306707","article-title":"Combining Lidar Elevation Data and IKONOS Multispectral Imagery for Coastal Classification Mapping","volume":"26","author":"Lee","year":"2003","journal-title":"Mar. Geod."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1109\/LGRS.2017.2704625","article-title":"Deep Fusion of Remote Sensing Data for Accurate Classification","volume":"14","author":"Chen","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Karsli, F., Dihkan, M., Acar, H., and Ozturk, A. (2016). Automatic building extraction from very high-resolution image and LiDAR data with SVM algorithm. Arabian J. Geosci., 9.","DOI":"10.1007\/s12517-016-2664-7"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1864","DOI":"10.1109\/JSTARS.2015.2470547","article-title":"A Novel Building and Tree Detection Method From LiDAR Data and Aerial Images","volume":"9","author":"Zarea","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Du, P., Bai, X., Tan, K., Xue, Z., Samat, A., Xia, J., Li, E., Su, H., and Liu, W. (2020). Advances of Four Machine Learning Methods for Spatial Data Handling: A Review. J. Geovis. Spat. Anal., 4.","DOI":"10.1007\/s41651-020-00048-5"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5653","DOI":"10.1109\/TGRS.2017.2711275","article-title":"Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification","volume":"55","author":"Li","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"6207","DOI":"10.1109\/TGRS.2015.2435801","article-title":"Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery","volume":"53","author":"Zhong","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1109\/LGRS.2015.2475299","article-title":"Deep Learning Based Feature Selection for Remote Sensing Scene Classification","volume":"12","author":"Zou","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2019.09.016","article-title":"Combining Sentinel-1 and Sentinel-2 Satellite Image Time Series for land cover mapping via a multi-source deep learning architecture","volume":"158","author":"Ienco","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Storie, C.D., and Henry, C.J. (2018, January 22\u201327). Deep Learning Neural Networks for Land Use Land Cover Mapping. Proceedings of the Igarss 2018\u20142018 Ieee International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518619"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Abdollahi, A., Pradhan, B., Shukla, N., Chakraborty, S., and Alamri, A. (2020). Deep Learning Approaches Applied to Remote Sensing Datasets for Road Extraction: A State-Of-The-Art Review. Remote Sens., 12.","DOI":"10.3390\/rs12091444"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pan, X., Yang, F., Gao, L., Chen, Z., Zhang, B., Fan, H., and Ren, J. (2019). Building Extraction from High-Resolution Aerial Imagery Using a Generative Adversarial Network with Spatial and Channel Attention Mechanisms. Remote Sens., 11.","DOI":"10.3390\/rs11080917"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Sun, G., Huang, H., Zhang, A., Li, F., Zhao, H., and Fu, H. (2019). Fusion of Multiscale Convolutional Neural Networks for Building Extraction in Very High-Resolution Images. Remote Sens., 11.","DOI":"10.3390\/rs11030227"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.isprsjprs.2019.09.018","article-title":"Multi-modal deep learning for landform recognition","volume":"158","author":"Du","year":"2019","journal-title":"J. Photogramm. Remote Sens."},{"key":"ref_19","first-page":"640","article-title":"Fully Convolutional Networks for Semantic Segmentation","volume":"39","author":"Long","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xin, J., Zhang, X., Zhang, Z., and Fang, W. (2019). Road Extraction of High-Resolution Remote Sensing Images Derived from DenseUNet. Remote Sens., 11.","DOI":"10.3390\/rs11212499"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Yang, H., Wu, P., Yao, X., Wu, Y., Wang, B., and Xu, Y. (2018). Building Extraction in Very High Resolution Imagery by Dense-Attention Networks. Remote Sens., 10.","DOI":"10.3390\/rs10111768"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Liu, W., Yang, M., Xie, M., Guo, Z., Li, E., Zhang, L., Pei, T., and Wang, D. (2019). Accurate Building Extraction from Fused DSM and UAV Images Using a Chain Fully Convolutional Neural Network. Remote Sens., 11.","DOI":"10.3390\/rs11242912"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.isprsjprs.2018.06.005","article-title":"Developing a multi-filter convolutional neural network for semantic segmentation using high-resolution aerial imagery and LiDAR data","volume":"143","author":"Sun","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Xu, Y., Du, B., and Zhang, L. (2018, January 22\u201327). Multi-source remote sensing data classification via fully convolutional networks and post-classification processing. Proceedings of the IGARSS 2018\u20132018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518295"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/978-3-319-54181-5_14","article-title":"FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-Based CNN Architecture","volume":"Volume 10111","author":"Lai","year":"2017","journal-title":"Computer Vision\u2014Accv 2016, Pt I"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, W., Huang, H., Schmitz, M., Sun, X., Wang, H., and Mayer, H. (2018). Effective Fusion of Multi-Modal Remote Sensing Data in a Fully Convolutional Network for Semantic Labeling. Remote Sens., 10.","DOI":"10.3390\/rs10010052"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1016\/j.isprsjprs.2017.11.009","article-title":"Classification with an edge: Improving semantic image segmentation with boundary detection","volume":"135","author":"Marmanis","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Marcos, D., Hamid, R., and Tuia, D. (2016, January 27\u201330). Geospatial Correspondences for Multimodal Registration. Proceedings of the 2016 Ieee Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.550"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-Excitation Networks. Proceedings of the 2018 Ieee\/Cvf Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Chen, H., and Li, Y. (2019). Three-stream Attention-aware Network for RGB-D Salient Object Detection. IEEE Trans. Image Process.","DOI":"10.1109\/TIP.2019.2891104"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Mohla, S., Pande, S., Banerjee, B., and Chaudhuri, S. (2020, January 14\u201319). FusAtNet: Dual Attention based SpectroSpatial Multimodal Fusion Network for Hyperspectral and LiDAR Classification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.21203\/rs.3.rs-32802\/v1"},{"key":"ref_33","unstructured":"Badrinarayanan, V., Handa, A., and Cipolla, R. (2015). SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2016, January 20\u201324). Semantic segmentation of earth observation data using multimodal and multi-scale deep networks. Proceedings of the Asian Conference on Computer Vision, Taipei, Taiwan.","DOI":"10.1007\/978-3-319-54181-5_12"},{"key":"ref_35","first-page":"234","article-title":"U-Net: Convolutional Networks for Biomedical Image Segmentation","volume":"Volume 9351","author":"Navab","year":"2015","journal-title":"Medical Image Computing and Computer-Assisted Intervention, Pt III"},{"key":"ref_36","unstructured":"Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. (2014). Semantic image segmentation with deep convolutional nets and fully connected crfs. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_38","unstructured":"Chen, L.-C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Guo, M., Liu, H., Xu, Y., and Huang, Y. (2020). Building Extraction Based on U-Net with an Attention Block and Multiple Losses. Remote Sens., 12.","DOI":"10.3390\/rs12091400"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Wagner, F.H., Dalagnol, R., Tarabalka, Y., Segantine, T.Y., Thom\u00e9, R., and Hirye, M. (2020). U-Net-Id, an Instance Segmentation Model for Building Extraction from Satellite Images\u2014Case Study in the Joan\u00f3polis City, Brazil. Remote Sens., 12.","DOI":"10.3390\/rs12101544"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lin, Y., Xu, D., Wang, N., Shi, Z., and Chen, Q. (2020). Road Extraction from Very-High-Resolution Remote Sensing Images via a Nested SE-Deeplab Model. Remote Sens., 12.","DOI":"10.3390\/rs12182985"},{"key":"ref_43","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.media.2019.01.012","article-title":"Attention gated networks: Learning to leverage salient regions in medical images","volume":"53","author":"Schlemper","year":"2019","journal-title":"Med. Image Anal."},{"key":"ref_45","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (1996). Attention is all you need. Advances in Neural Information Processing Systems, Mit Press."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Chen, L., Zhang, H., Xiao, J., Nie, L., Shao, J., Liu, W., and Chua, T.-S. (2017, January 21\u201326). Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.667"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., and Lu, H. (2019, January 16\u201320). Dual Attention Network for Scene Segmentation. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Lin, G., Shen, C., van den Hengel, A., and Reid, I. (2016, January 27\u201330). Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.348"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201323). Non-local neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_50","unstructured":"Yuan, Y., and Wang, J. (2018). Ocnet: Object context network for scene parsing. arXiv."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhang, Y., Liu, S., Shi, J., Loy, C.C., Lin, D., and Jia, J. (2018, January 8\u201314). Psanet: Point-wise spatial attention network for scene parsing. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01240-3_17"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Jin, Y., Xu, W., Hu, Z., Jia, H., Luo, X., and Shao, D. (2020). GSCA-UNet: Towards Automatic Shadow Detection in Urban Aerial Imagery with Global-Spatial-Context Attention Module. Remote Sens., 12.","DOI":"10.3390\/rs12172864"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Tian, Z., Zhan, R., Hu, J., Wang, W., He, Z., and Zhuang, Z. (2020). Generating Anchor Boxes Based on Attention Mechanism for Object Detection in Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12152416"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Li, L., Liang, P., Ma, J., Jiao, L., Guo, X., Liu, F., and Sun, C. (2020). A Multiscale Self-Adaptive Attention Network for Remote Sensing Scene Classification. Remote Sens., 12.","DOI":"10.3390\/rs12142209"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Hazirbas, C., Ma, L., Domokos, C., and Cremers, D. (2016, January 20\u201324). Fusenet: Incorporating depth into semantic segmentation via fusion-based cnn architecture. Proceedings of the Asian Conference on Computer Vision, Taipei, Taiwan.","DOI":"10.1007\/978-3-319-54181-5_14"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/22\/3764\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:34:16Z","timestamp":1760178856000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/22\/3764"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,16]]},"references-count":56,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2020,11]]}},"alternative-id":["rs12223764"],"URL":"https:\/\/doi.org\/10.3390\/rs12223764","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,11,16]]}}}