{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:29:39Z","timestamp":1760236179985,"version":"build-2065373602"},"reference-count":53,"publisher":"MDPI AG","issue":"21","license":[{"start":{"date-parts":[[2021,10,25]],"date-time":"2021-10-25T00:00:00Z","timestamp":1635120000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61773377"],"award-info":[{"award-number":["61773377"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["NO.NJ2020014"],"award-info":[{"award-number":["NO.NJ2020014"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Rotated object detection is an extension of object detection that uses an oriented bounding box instead of a general horizontal bounding box to define the object position. It is widely used in remote sensing images, scene text, and license plate recognition. The existing rotated object detection methods usually add an angle prediction channel in the bounding box prediction branch, and smooth L1 loss is used as the regression loss function. However, we argue that smooth L1 loss causes a sudden change in loss and slow convergence due to the angle solving mechanism of open CV (the angle between the horizontal line and the first side of the bounding box in the counter-clockwise direction is defined as the rotation angle), and this problem exists in most existing regression loss functions. To solve the above problems, we propose a decoupling modulation mechanism to overcome the problem of sudden changes in loss. On this basis, we also proposed a constraint mechanism, the purpose of which is to accelerate the convergence of the network and ensure optimization toward the ideal direction. In addition, the proposed decoupling modulation mechanism and constraint mechanism can be integrated into the popular regression loss function individually or together, which further improves the performance of the model and makes the model converge faster. The experimental results show that our method achieves 75.2% performance on the aerial image dataset DOTA (OBB task), and saves more than 30% of computing resources. The method also achieves a state-of-the-art performance in HRSC2016, and saved more than 40% of computing resources, which confirms the applicability of the approach.<\/jats:p>","DOI":"10.3390\/rs13214291","type":"journal-article","created":{"date-parts":[[2021,10,25]],"date-time":"2021-10-25T21:42:05Z","timestamp":1635198125000},"page":"4291","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Constraint Loss for Rotated Object Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4310-7786","authenticated-orcid":false,"given":"Luyang","family":"Zhang","sequence":"first","affiliation":[{"name":"College of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China"}]},{"given":"Haitao","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China"}]},{"given":"Lingfeng","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China"},{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Chunhong","family":"Pan","sequence":"additional","affiliation":[{"name":"National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China"}]},{"given":"Qiang","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China"}]},{"given":"Xinyao","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Automation Engineering, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,10,25]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Rabbi, J., Ray, N., Schubert, M., Chowdhury, S., and Chao, D. (2020). Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network. Remote Sens., 12.","DOI":"10.20944\/preprints202003.0313.v2"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2014, January 6\u201312). Spatial pyramid pooling in deep convolutional networks for visual recognition. Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10578-9_23"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 20\u201323). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 11\u201318). Fast rcnn. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster rcnn: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016, January 5\u20138). R-FCN: Object Detection via Region-based Fully Convolutional Networks. Proceedings of the Advances in Neural Information Processing Systems (NIPS), Barcelona, Spain."},{"key":"ref_7","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (July, January 26). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A.C. (2016, January 8\u201316). SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll\u2019ar, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). Cornernet: Detecting objects as paired keypoints. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_11","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). FCOS: Fully convolutional one-stage object detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_12","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). Centernet: Keypoint triplets for object detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhou, X., Zhuo, J., and Krahenbuhl, P. (2019, January 16\u201320). Bottom-up object detection by grouping extreme and center points. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00094"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"7389","DOI":"10.1109\/TIP.2020.3002345","article-title":"FoveaBox: Beyound Anchor-Based Object Detection","volume":"29","author":"Kong","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_15","first-page":"1","article-title":"Self-Supervised Feature Augmentation for Large Image Object Detection","volume":"99","author":"Pan","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2104","DOI":"10.1109\/TGRS.2019.2953119","article-title":"Object Detection in High Resolution Remote Sensing Imagery Based on Convolutional Neural Networks with Suitable Object Scale Features","volume":"58","author":"Dong","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3388","DOI":"10.1109\/TPAMI.2020.2981890","article-title":"Imbalance Problems in Object Detection: A Review","volume":"43","author":"Oksuz","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"5693","DOI":"10.1109\/TGRS.2020.2968802","article-title":"Region-Enhanced Convolutional Neural Network for Object Detection in Remote Sensing Images","volume":"58","author":"Lei","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u2019ar, P., and Zitnic, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"4312","DOI":"10.1080\/01431161.2020.1717666","article-title":"Moving vehicle detection in aerial infrared image sequences via fast image registration and improved YOLOv3 network","volume":"41","author":"Zhang","year":"2020","journal-title":"Int. J. Remote Sens."},{"key":"ref_22","first-page":"381","article-title":"LR-CNN: Local aware region CNN for vehicle detection in aerial imagery","volume":"2","author":"Liao","year":"2020","journal-title":"ISPRS Annals. Photogram. Remote Sens. Spat. Inf."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"4110","DOI":"10.1080\/01431161.2021.1887542","article-title":"Tiny moving vehicle detection in satellite video with constraints of multiple prior information","volume":"42","author":"Lei","year":"2020","journal-title":"Int. J. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1921","DOI":"10.3390\/rs13101921","article-title":"Multi-Sector Oriented Object Detector for Accurate Localization in Optical Remote Sensing Images","volume":"13","author":"Everingham","year":"2021","journal-title":"Remote Sens."},{"key":"ref_25","first-page":"1960","article-title":"Priority Branches for Ship Detection in Optical Remote Sensing Images","volume":"12","author":"Zhang","year":"2020","journal-title":"Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"7247","DOI":"10.1109\/TGRS.2020.2981203","article-title":"Adaptive Period Embedding for Representing Oriented Objects in Aerial Images","volume":"58","author":"Zhu","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/LGRS.2020.2965629","article-title":"Rotated Feature Network for Multiorientation Object Detection of Remote-Sensing Images","volume":"18","author":"Zhou","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"106964","DOI":"10.1016\/j.patcog.2019.106964","article-title":"Rotated cascade R-CNN: A shape robust detector with coordinate regression","volume":"96","author":"Zhu","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Ming, Q., Miao, L., Zhou, Z., Song, J., and Yang, X. (2021). Sparse Label Assignment for Oriented Object Detection in Aerial Images. Remote Sens., 13.","DOI":"10.3390\/rs13142664"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 16\u201320). Learning roi transformer for oriented object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Han, J., Ding, J., Li, J., and Xia, G.S. (2021). Align Deep Features for Oriented Object Detection. IEEE Trans. Geosci. Remote Sens., 1\u201311.","DOI":"10.1109\/TGRS.2021.3062048"},{"key":"ref_33","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Fu, K. (November, January 27). SCRDet: Towards more robust detection for small, cluttered and rotated objects. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Chen, Z., Chen, K., Lin, W., See, J., Yu, H., Ke, Y., and Yang, C. (2020, January 23). PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments. Proceedings of the European Conference on Computer Vision (ECCV), Virtual.","DOI":"10.1007\/978-3-030-58558-7_12"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yang, X., Yan, J., Feng, Z., and He, T. (2021, January 2\u20139). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Virtual.","DOI":"10.1609\/aaai.v35i4.16426"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1109\/TPAMI.2020.2974745","article-title":"Gliding vertex on the horizontal bounding box for multi-oriented object detection","volume":"43","author":"Xu","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Yang, X., and Yan, J. (2020, January 23). Arbitrary-oriented object detection with circular smooth label. Proceedings of the European Conference on Computer Vision (ECCV), Virtual.","DOI":"10.1007\/978-3-030-58598-3_40"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Qian, W., Yang, X., Peng, S., Yan, J., and Guo, Y. (2021, January 2\u20139). Learning modulated loss for rotated object detection. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Virtual.","DOI":"10.1609\/aaai.v35i3.16347"},{"key":"ref_39","unstructured":"Jiang, L., Meng, D., Yu, S.I., Lan, Z., Shan, S., and Hauptmann, A. (2014, January 13). Self-paced learning with diversity. Proceedings of the Advances in Neural Information Processing Systems (NIPS), Montreal, QC, Canada."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1483","DOI":"10.1109\/TPAMI.2019.2956516","article-title":"Cascade R-CNN: High Quality Object Detection and Instance Segmentation","volume":"43","author":"Cai","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1214\/aoms\/1177703732","article-title":"Robust Estimation of a Location Parameter","volume":"35","author":"Huber","year":"1964","journal-title":"Ann. Math. Stat."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/0022-247X(87)90347-7","article-title":"Means that minimize relative error, and an associated integral equation","volume":"122","author":"Porta","year":"1987","journal-title":"J. Math. Anal. Appl."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1257\/jep.15.4.143","article-title":"Quantile regression","volume":"15","author":"Roger","year":"2001","journal-title":"J. Econ. Perspect."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Yu, J., Jiang, Y., Wang, Z., Cao, Z., and Huang, T. (2016, January 15\u201319). UnitBox: An Advanced Object Detection Network. Proceedings of the 24th ACM international conference on Multimedia, Suzhou, China.","DOI":"10.1145\/2964284.2967274"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Tsoi, N., Gwak, J.Y., Sadeghian, A., and Savarese, S. (2019, January 16\u201320). Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., and Ren, D. (2020, January 7\u201312). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., and Ding, J. (2018, January 18\u201323). Dota: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/LGRS.2016.2565705","article-title":"Ship rotated bounding box space for ship extraction from high resolution optical satellite images with complex backgrounds","volume":"13","author":"Liu","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_49","unstructured":"Yang, X. (2020, October 10). Rotation Detection Benchmark. Available online: https:\/\/github.com\/yangxue0827\/RotationDetection."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., and Luo, Z. (2017). R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection. arXiv.","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Azimi, S.M., Vig, E., Bahmanyar, R., K\u00f6rner, M., and Reinartz, P. (2018, January 2\u20136). Towards multi-class object detection in unconstrained remote sensing imagery. Proceedings of the Asian Conference on Computer Vision (ACCV), Perth, Australia.","DOI":"10.1007\/978-3-030-20893-6_10"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1109\/LGRS.2018.2856921","article-title":"Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks","volume":"15","author":"Zhang","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Liao, M., Zhu, Z., Shi, B., Xia, G.S., and Bai, X. (2018, January 18\u201323). Rotation-Sensitive Regression for Oriented Scene Text Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake, UT, USA.","DOI":"10.1109\/CVPR.2018.00619"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/21\/4291\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:23:16Z","timestamp":1760167396000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/21\/4291"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,25]]},"references-count":53,"journal-issue":{"issue":"21","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["rs13214291"],"URL":"https:\/\/doi.org\/10.3390\/rs13214291","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,10,25]]}}}