{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,10]],"date-time":"2026-05-10T08:01:46Z","timestamp":1778400106028,"version":"3.51.4"},"reference-count":48,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2021,12,14]],"date-time":"2021-12-14T00:00:00Z","timestamp":1639440000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61675036"],"award-info":[{"award-number":["61675036"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"13th Five-year Plan Equipment Pre-research","award":["6140415020312"],"award-info":[{"award-number":["6140415020312"]}]},{"name":"Chinese Academy of Sciences Key Laboratory of Beam Control","award":["2017LBC006"],"award-info":[{"award-number":["2017LBC006"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Semi-autonomous learning for object detection has attracted more and more attention in recent years, which usually tends to find only one object instance with the highest score in each image. However, this strategy usually highlights the most representative part of the object instead of the whole object, which may lead to the loss of a lot of important information. To solve this problem, a novel end-to-end aggregate-guided semi-autonomous learning residual network is proposed to perform object detection. Firstly, a progressive modified residual network (MRN) is applied to the backbone network to make the detector more sensitive to the boundary features of the object. Then, an aggregate-based region-merging strategy (ARMS) is designed to select high-quality instances by selecting aggregation areas and merging these regions. The ARMS selects the aggregation areas that are highly related to the object through association coefficient, and then evaluates the aggregation areas through a similarity coefficient and fuses them to obtain high-quality object instance areas. Finally, a regression-locating branch is further developed to refine the location of the object, which can be optimized jointly with regional classification. Extensive experiments demonstrate that the proposed method is superior to state-of-the-art methods.<\/jats:p>","DOI":"10.3390\/rs13245065","type":"journal-article","created":{"date-parts":[[2021,12,14]],"date-time":"2021-12-14T22:06:10Z","timestamp":1639519570000},"page":"5065","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Semi-Autonomous Learning Algorithm for Remote Image Object Detection Based on Aggregation Area Instance Refinement"],"prefix":"10.3390","volume":"13","author":[{"given":"Bei","family":"Cheng","sequence":"first","affiliation":[{"name":"College of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhengzhou","family":"Li","sequence":"additional","affiliation":[{"name":"College of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China"},{"name":"Key Laboratory of Beam Control, Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hui","family":"Li","sequence":"additional","affiliation":[{"name":"College of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiquan","family":"Ding","sequence":"additional","affiliation":[{"name":"Sichuan Institute of Aerospace Electronic Equipment, Chengdu 610100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tianqi","family":"Qin","sequence":"additional","affiliation":[{"name":"Sichuan Institute of Aerospace Electronic Equipment, Chengdu 610100, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,12,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Cheng, B., Li, Z., Xu, B., Yao, X., Ding, Z., and Qin, T. (2021). Structured Object-Level Relational Reasoning CNN-Based Target Detection Algorithm in a Remote Sensing Image. Remote Sens., 13.","DOI":"10.3390\/rs13020281"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1109\/TGRS.2017.2778300","article-title":"Rotation-Insensitive and Context-Augmented Object Detection in Remote Sensing Images","volume":"56","author":"Li","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.isprsjprs.2020.12.015","article-title":"PBNet: Part-based convolutional neural network for complex composite object detection in remote sensing imagery","volume":"173","author":"Sun","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"9362","DOI":"10.1109\/TGRS.2019.2926397","article-title":"Multi-Scale and Multi-Task Deep Learning Framework for Automatic Road Extraction","volume":"57","author":"Lu","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"8919","DOI":"10.1109\/TGRS.2020.2991733","article-title":"Simultaneous Road Surface and Centerline Extraction from Large-Scale Remote Sensing Images Using CNN-Based Segmentation and Tracing","volume":"58","author":"Wei","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"8819","DOI":"10.1109\/TGRS.2020.2991006","article-title":"Topology-Enhanced Urban Road Extraction via a Geographic Feature-Enhanced Network","volume":"58","author":"Li","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"5384","DOI":"10.1109\/JSTARS.2020.3022210","article-title":"Improved Mapping Results of 10 m Resolution Land Cover Classification in Guangdong, China Using Multisource Remote Sensing Data with Google Earth Engine","volume":"13","author":"Tu","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"5233","DOI":"10.1109\/JSTARS.2019.2956318","article-title":"Urban Land Use and Land Cover Change Prediction via Self-Adaptive Cellular Based Deep Learning with Multisourced Data","volume":"12","author":"Mu","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1109\/LGRS.2018.2803259","article-title":"Very High Resolution Object-Based Land Use\u2013Land Cover Urban Classification Using Extreme Gradient Boosting","volume":"15","author":"Georganos","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"4823","DOI":"10.1109\/TGRS.2019.2893180","article-title":"Learning Compact and Discriminative Stacked Autoencoder for Hyperspectral Image Classification","volume":"57","author":"Zhou","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"3660","DOI":"10.1109\/TGRS.2016.2523563","article-title":"Semantic Annotation of High-Resolution Satellite Images via Weakly Supervised Learning","volume":"54","author":"Yao","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2473","DOI":"10.1109\/TCSVT.2017.2706264","article-title":"A Unified Metric Learning-Based Framework for Co-Saliency Detection","volume":"28","author":"Han","year":"2018","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2327","DOI":"10.1109\/JSTARS.2013.2242846","article-title":"Airborne Vehicle Detection in Dense Urban Areas Using HoG Features and Disparity Maps","volume":"6","author":"Tuermer","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3978","DOI":"10.1109\/TGRS.2007.907109","article-title":"A Multiple Conditional Random Fields Ensemble Model for Urban Area Detection in Remote Sensing Optical Images","volume":"45","author":"Zhong","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"97228","DOI":"10.1109\/ACCESS.2021.3094201","article-title":"Yolo V4 for Advanced Traffic Sign Recognition with Synthetic Training Data Generated by Various GAN","volume":"9","author":"Dewi","year":"2021","journal-title":"IEEE Access"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhang, X., Feng, J., Xiong, H., and Tian, Q. (2018, January 18\u201323). Zigzag Learning for Weakly Supervised Object Detection. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00448"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"112045","DOI":"10.1016\/j.rse.2020.112045","article-title":"Accurate cloud detection in high-resolution remote sensing imagery by weakly supervised deep learning","volume":"250","author":"Li","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1016\/j.isprsjprs.2018.09.014","article-title":"Deep networks under scene-level supervision for multi-class geospatial object detection from remote sensing images","volume":"146","author":"Li","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Jie, Z., Wei, Y., Jin, X., Feng, J., and Liu, W. (2017, January 21\u201326). Deep Self-Taught Learning for Weakly Supervised Object Localization. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.457"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2021.02.009","article-title":"Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation","volume":"175","author":"Li","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Bilen, H., and Vedaldi, A. (2016, January 27\u201330). Weakly Supervised Deep Detection Networks. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.311"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1109\/TPAMI.2015.2456908","article-title":"Weakly Supervised Large Scale Object Localization with Multiple Instance Learning and Bag Splitting","volume":"38","author":"Ren","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1109\/TPAMI.2016.2535231","article-title":"Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning","volume":"39","author":"Cinbis","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Li, D., Huang, J.-B., Li, Y., Wang, S., and Yang, M.-H. (2016, January 27\u201330). Weakly Supervised Object Localization with Progressive Domain Adaptation. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.382"},{"key":"ref_26","unstructured":"Song, H.O., Lee, Y.J., Jegelka, S., and Darrell, T. (2014, January 23\u201328). Weakly-supervised Discovery of Visual Pattern Configurations. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Wang, C., Ren, W., Huang, K., and Tan, T. (2014, January 6\u201312). Weakly Supervised Object Localization with Latent Category Learning. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10599-4_28"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Tang, P., Wang, X., Bai, X., and Liu, W. (2017, January 21\u201326). Multiple Instance Detection Network with Online Instance Classifier Refinement. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.326"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1109\/TPAMI.2018.2876304","article-title":"PCL: Proposal Cluster Learning for Weakly Supervised Object Detection","volume":"42","author":"Tang","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"8002","DOI":"10.1109\/TGRS.2020.2985989","article-title":"Progressive Contextual Instance Refinement for Weakly Supervised Object Detection in Remote Sensing Images","volume":"58","author":"Feng","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zitnick, C.L., and Doll\u00e1r, P. (2014, January 6\u201312). Edge boxes- Locating object proposals from edges. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_26"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely Connected Convolutional Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_35","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u201312). Very Deep Convolutional Networks for Large-Scale Image Recognition. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Shen, Y., Ji, R., Wang, Y., Chen, Z., Zheng, F., Huang, F., and Wu, Y. (2020, January 23\u201328). Enabling Deep Residual Networks for Weakly Supervised Object Detection. Proceedings of the Computer Vision\u2014ECCV 2020, Glasgow, UK.","DOI":"10.1007\/978-3-030-58598-3_8"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"9709","DOI":"10.1109\/TIM.2020.3002277","article-title":"EDRNet: Encoder\u2013Decoder Residual Network for Salient Object Detection of Strip Steel Surface Defects","volume":"69","author":"Song","year":"2020","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"3763","DOI":"10.1109\/TIP.2020.2965989","article-title":"Reverse Attention Based Residual Network for Salient Object Detection","volume":"29","author":"Chen","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"7831","DOI":"10.1109\/TGRS.2020.3043267","article-title":"Attention-Based Adaptive Spectral\u2013Spatial Kernel ResNet for Hyperspectral Image Classification","volume":"59","author":"Roy","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1109\/TCBB.2020.2986544","article-title":"ResNet-SCDA-50 for Breast Abnormality Classification","volume":"18","author":"Yu","year":"2021","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"7076","DOI":"10.1109\/TGRS.2020.3030324","article-title":"Training a Seismogram Discriminator Based on ResNet","volume":"59","author":"Zhu","year":"2020","journal-title":"IEEE Trans. Geosci. Remote. Sens."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Yang, K., Li, D.-S., and Dou, Y. (November, January 27). Towards Precise End-to-end Weakly Supervised Object Detection Network. Proceedings of the International Conference on Computer Vision (ICCV2019), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00846"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1100","DOI":"10.1109\/TIP.2017.2773199","article-title":"Random Access Memories: A New Paradigm for Target Detection in High Resolution Aerial Remote Sensing Images","volume":"27","author":"Zou","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_46","unstructured":"Andermatt, P., and Timofte, R. (December, January 30). A Weakly Supervised Convolutional Network for Change Segmentation and Classification. Proceedings of the Asian Conference on Computer Vision, Kyoto, Japan."},{"key":"ref_47","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020, January 16\u201318). YOLOv4: Optimal Speed and Accuracy of Object Detection. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR2020), Seattle, WA, USA."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2020, January 16\u201318). Scaled-YOLOv4: Scaling Cross Stage Partial Network. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR2020), Seattle, WA, USA.","DOI":"10.1109\/CVPR46437.2021.01283"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5065\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:47:14Z","timestamp":1760168834000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5065"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,14]]},"references-count":48,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["rs13245065"],"URL":"https:\/\/doi.org\/10.3390\/rs13245065","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,14]]}}}