{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T20:31:49Z","timestamp":1769113909087,"version":"3.49.0"},"reference-count":32,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2021,12,16]],"date-time":"2021-12-16T00:00:00Z","timestamp":1639612800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["92038301"],"award-info":[{"award-number":["92038301"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41771363"],"award-info":[{"award-number":["41771363"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Accurate building extraction from remotely sensed images is essential for topographic mapping, cadastral surveying and many other applications. Fully automatic segmentation methods still remain a great challenge due to the poor generalization ability and the inaccurate segmentation results. In this work, we are committed to robust click-based interactive building extraction in remote sensing imagery. We argue that stability is vital to an interactive segmentation system, and we observe that the distance of the newly added click to the boundaries of the previous segmentation mask contains progress guidance information of the interactive segmentation process. To promote the robustness of the interactive segmentation, we exploit this information with the previous segmentation mask, positive and negative clicks to form a progress guidance map, and feed it to a convolutional neural network (CNN) with the original RGB image, we name the network as PGR-Net. In addition, an adaptive zoom-in strategy and an iterative training scheme are proposed to further promote the stability of PGR-Net. Compared with the latest methods FCA and f-BRS, the proposed PGR-Net basically requires 1\u20132 fewer clicks to achieve the same segmentation results. Comprehensive experiments have demonstrated that the PGR-Net outperforms related state-of-the-art methods on five natural image datasets and three building datasets of remote sensing images.<\/jats:p>","DOI":"10.3390\/rs13245111","type":"journal-article","created":{"date-parts":[[2021,12,16]],"date-time":"2021-12-16T21:32:40Z","timestamp":1639690360000},"page":"5111","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Progress Guidance Representation for Robust Interactive Extraction of Buildings from Remotely Sensed Images"],"prefix":"10.3390","volume":"13","author":[{"given":"Zhen","family":"Shu","sequence":"first","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiangyun","family":"Hu","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"},{"name":"Institute of Artificial Intelligence in Geomatics, Wuhan University, Wuhan 430079, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hengming","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,12,16]]},"reference":[{"key":"ref_1","first-page":"234","article-title":"U-Net: Convolutional Networks for Biomedical Image Segmentation","volume":"Volume 9351","author":"Ronneberger","year":"2015","journal-title":"Lecture Notes in Computer Science, Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2015\u201418th International Conference, Munich, Germany, 5\u20139 October 2015"},{"key":"ref_2","unstructured":"Badrinarayanan, V., Handa, A., and Cipolla, R. (2015). Segnet: A deep convolutional encoder-decoder architecture for robust semantic pixel-wise labelling. arXiv."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_4","unstructured":"Mair, S.G., and Cook, R. (1995, January 6\u201311). Intelligent scissors for image composition. Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH 1995, Los Angeles, CA, USA."},{"key":"ref_5","unstructured":"Boykov, Y., and Jolly, M. (2001, January 7\u201314). Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images. Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, BC, Canada."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1109\/TPAMI.2004.60","article-title":"An experimental comparison of min-cut\/max-flow algorithms for energy minimization in vision","volume":"26","author":"Boykov","year":"2004","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","first-page":"454","article-title":"Star Shape Prior for Graph-Cut Image Segmentation","volume":"Volume 5304","author":"Forsyth","year":"2008","journal-title":"Lecture Notes in Computer Science, Proceedings of the Computer Vision\u2014ECCV 2008, 10th European Conference on Computer Vision, Marseille, France, 12\u201318 October 2008"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Gulshan, V., Rother, C., Criminisi, A., Blake, A., and Zisserman, A. (2010, January 13\u201318). Geodesic star convexity for interactive image segmentation. Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2010, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5540073"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1145\/1015706.1015720","article-title":"\u201cGrabCut\u201d: Interactive foreground extraction using iterated graph cuts","volume":"23","author":"Rother","year":"2004","journal-title":"ACM Trans. Graph."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Yu, H., Zhou, Y., Qian, H., Xian, M., and Wang, S. (2017, January 17\u201320). Loosecut: Interactive image segmentation with loosely bounded boxes. Proceedings of the 2017 IEEE International Conference on Image Processing, ICIP 2017, Beijing, China.","DOI":"10.1109\/ICIP.2017.8296900"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1768","DOI":"10.1109\/TPAMI.2006.233","article-title":"Random walks for image segmentation","volume":"28","author":"Grady","year":"2006","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Xu, N., Price, B., Cohen, S., Yang, J., and Huang, T.S. (2016, January 27\u201330). Deep interactive object selection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.47"},{"key":"ref_13","unstructured":"Mahadevan, S., Voigtlaender, P., and Leibe, B. (2018). Iteratively Trained Interactive Segmentation. Proceedings of the British Machine Vision Conference 2018, BMVC 2018, Newcastle, UK, 3\u20136 September 2018, BMVA Press."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Li, Z., Chen, Q., and Koltun, V. (2018, January 18\u201323). Interactive Image Segmentation with Latent Diversity. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00067"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Majumder, S., and Yao, A. (2019, January 15\u201320). Content-Aware Multi-Level Guidance for Interactive Instance Segmentation. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01187"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Jang, W.D., and Kim, C.S. (2019, January 15\u201320). Interactive Image Segmentation via Backpropagating Refinement Scheme. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00544"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Sofiiuk, K., Petrov, I., Barinova, O., and Konushin, A. (2020, January 14\u201319). F-brs: Rethinking backpropagating refinement for interactive segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00865"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Lin, Z., Zhang, Z., Chen, L.Z., Cheng, M.M., and Lu, S.P. (2020, January 14\u201319). Interactive Image Segmentation With First Click Attention. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01335"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Liew, J.H., Wei, Y., Xiong, W., Ong, S.H., and Feng, J. (2017, January 22\u201329). Regional Interactive Image Segmentation Networks. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.297"},{"key":"ref_20","unstructured":"Mohanty, S.P. (2018, June 12). CrowdAI Dataset. Available online: https:\/\/www.crowdai.org\/challenges\/mapping-challenge\/dataset_files."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Hariharan, B., Arbelaez, P., Bourdev, L.D., Maji, S., and Malik, J. (2011, January 6\u201313). Semantic Contours from Inverse Detectors. Proceedings of the International Conference on Computer Vision, Washington, DC, USA.","DOI":"10.1109\/ICCV.2011.6126343"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1016\/j.patcog.2009.03.008","article-title":"A comparative evaluation of interactive segmentation algorithms","volume":"43","author":"Mcguinness","year":"2010","journal-title":"Pattern Recognit."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Perazzi, F., Pont-Tuset, J., McWilliams, B., Gool, L.V., Gross, M.H., and Sorkine-Hornung, A. (2016, January 27\u201330). A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.85"},{"key":"ref_24","first-page":"740","article-title":"Microsoft COCO: Common Objects in Context","volume":"Volume 8693","author":"Fleet","year":"2014","journal-title":"Lecture Notes in Computer Science, Proceedings of the Computer Vision\u2014ECCV 2014\u201413th European Conference, Zurich, Switzerland, 6\u201312 September 2014"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.isprsjprs.2018.11.011","article-title":"Aerial Imagery for Roof Segmentation: A Large-Scale Dataset towards Automatic Mapping of Buildings","volume":"147","author":"Chen","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs","volume":"40","author":"Chen","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid Scene Parsing Network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Cheng, H.K., Chung, J., Tai, Y., and Tang, C. (2020, January 13\u201319). CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00891"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_31","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1007\/s11263-008-0191-z","article-title":"Geodesic Matting: A Framework for Fast Interactive Image and Video Segmentation and Matting","volume":"82","author":"Bai","year":"2009","journal-title":"Int. J. Comput. Vis."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5111\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:49:30Z","timestamp":1760168970000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5111"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,16]]},"references-count":32,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["rs13245111"],"URL":"https:\/\/doi.org\/10.3390\/rs13245111","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,16]]}}}