{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,11]],"date-time":"2026-06-11T16:10:18Z","timestamp":1781194218017,"version":"3.54.1"},"reference-count":46,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2025,1,4]],"date-time":"2025-01-04T00:00:00Z","timestamp":1735948800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Current remote sensing (RS) detectors often rely on predefined anchor boxes with fixed angles to handle the multi-directional variations of targets. This approach makes it challenging to accurately select regions of interest and extract features that align with the direction of the targets. Most existing regression methods also adopt angle regression to match the attributes of remote sensing detectors. Due to the inconsistent regression direction and massive anchor boxes with a high aspect ratio, the extracted target features change greatly, the loss function changes drastically, and the training is unstable. However, existing RS detectors and regression techniques have not been able to effectively balance the precision of directional feature extraction with the complexity of the models. To address these challenges, this paper introduces a novel approach known as Dynamic Direction Learning R-CNN (DDL R-CNN), which comprises a dynamic direction learning (DDL) module and a boundary center region offset generation network (BC-ROPN). The DDL module pre-extracts the directional features of targets to provide a coarse estimation of their angles and the corresponding weights. This information is used to generate rotationally aligned anchor boxes that better model the directional features of the targets. BC-ROPN represents an innovative method for anchor box regression. It utilizes the central features of the maximum bounding rectangle\u2019s width and height, along with the coarse angle estimation and weights derived from DDL module, to refine the orientation of the anchor box. Our method has been proven to surpass existing rotating detection networks in extensive testing across two widely used remote sensing detection datasets, namely UCAS-AOD and HRSC2016.<\/jats:p>","DOI":"10.3390\/a18010021","type":"journal-article","created":{"date-parts":[[2025,1,6]],"date-time":"2025-01-06T06:43:04Z","timestamp":1736145784000},"page":"21","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["DDL R-CNN: Dynamic Direction Learning R-CNN for Rotated Object Detection"],"prefix":"10.3390","volume":"18","author":[{"given":"Weixian","family":"Su","sequence":"first","affiliation":[{"name":"Faculty of Engineering, University of Hong Kong, Hong Kong 999077, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3021-5371","authenticated-orcid":false,"given":"Donglin","family":"Jing","sequence":"additional","affiliation":[{"name":"The School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,1,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2019). DOTA: A Large-scale Dataset for Object Detection in Aerial Images. arXiv.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Xian, S., and Fu, K. (2019). SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. arXiv.","DOI":"10.1109\/ICCV.2019.00832"},{"key":"ref_3","unstructured":"Qian, W., Yang, X., Peng, S., Guo, Y., and Yan, J. (2019). Learning Modulated Loss for Rotated Object Detection. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016). SSD: Single Shot MultiBox Detector. Computer Vision\u2013ECCV 2016, Springer International Publishing.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_7","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-Oriented Scene Text Detection via Rotation Proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1109\/TPAMI.2020.2974745","article-title":"Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection","volume":"43","author":"Xu","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., and Luo, Z. (2017). R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection. arXiv.","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"ref_12","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as Points. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, L., Ouyang, W., Wang, X., Fieguth, P., Chen, J., Liu, X., and Pietik\u00e4inen, M. (2019). Deep Learning for Generic Object Detection: A Survey. arXiv.","DOI":"10.1007\/s11263-019-01247-4"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Tian, Z., Shen, C., Chen, H., and He, T. (November, January 27). FCOS: Fully Convolutional One-Stage Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00972"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2018). Learning RoI Transformer for Detecting Oriented Objects in Aerial Images. arXiv.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Han, J., Ding, J., Xue, N., and Xia, G.S. (2021). ReDet: A Rotation-equivariant Detector for Aerial Object Detection. arXiv.","DOI":"10.1109\/CVPR46437.2021.00281"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning RoI transformer for oriented object detection in aerial images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Pan, X., Ren, Y., Sheng, K., Dong, W., Yuan, H., Guo, X., Ma, C., and Xu, C. (2020, January 13\u201319). Dynamic Refinement Network for Oriented and Densely Packed Object Detection. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01122"},{"key":"ref_19","unstructured":"Yang, X., Liu, Q., Yan, J., and Li, A. (2019). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. arXiv."},{"key":"ref_20","first-page":"2458","article-title":"Learning Modulated Loss for Rotated Object Detection","volume":"35","author":"Qian","year":"2021","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_21","unstructured":"Yang, X., Yan, J., Ming, Q., Wang, W., Zhang, X., and Tian, Q. (2021, January 18\u201324). Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss. Proceedings of the 38th International Conference on Machine Learning, Virtual Event."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhu, H., Chen, X., Dai, W., Fu, K., Ye, Q., and Jiao, J. (2015, January 27\u201330). Orientation robust object detection in aerial images using deep convolutional neural network. Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada.","DOI":"10.1109\/ICIP.2015.7351502"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1109\/LGRS.2016.2565705","article-title":"Ship Rotated Bounding Box Space for Ship Extraction From High-Resolution Optical Satellite Images with Complex Backgrounds","volume":"13","author":"Liu","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","unstructured":"Chen, K., Wang, J., Pang, J., Cao, Y., Xiong, Y., Li, X., Sun, S., Feng, W., Liu, Z., and Xu, J. (2019). MMDetection: Open mmlab detection toolbox and benchmark. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Verelst, T., and Tuytelaars, T. (2020, January 13\u201319). Dynamic convolutions: Exploiting spatial sparsity for faster inference. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00239"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"8021505","DOI":"10.1109\/LGRS.2021.3115110","article-title":"Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss","volume":"19","author":"Ming","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Ming, Q., Miao, L., Zhou, Z., Song, J., and Yang, X. (2021). Sparse label assignment for oriented object detection in aerial images. Remote Sens., 13.","DOI":"10.3390\/rs13142664"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/j.isprsjprs.2023.01.001","article-title":"Task interleaving and orientation estimation for high-precision oriented object detection in aerial images","volume":"196","author":"Ming","year":"2023","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2021.3095186","article-title":"CFC-Net: A critical feature capturing network for arbitrary-oriented object detection in remote-sensing images","volume":"60","author":"Ming","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Ming, Q., Zhou, Z., Miao, L., Zhang, H., and Li, L. (2021, January 2\u20139). Dynamic anchor learning for arbitrary-oriented object detection. Proceedings of the AAAI Conference on Artificial Intelligence, Virtually.","DOI":"10.1609\/aaai.v35i3.16336"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Chen, Z., Chen, K., Lin, W., See, J., Yu, H., Ke, Y., and Yang, C. (2020, January 23\u201328). Piou loss: Towards accurate oriented object detection in complex environments. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part V 16.","DOI":"10.1007\/978-3-030-58558-7_12"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yang, X., Hou, L., Zhou, Y., Wang, W., and Yan, J. (2021, January 20\u201325). Dense label encoding for boundary discontinuity free rotation detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01556"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1340","DOI":"10.1007\/s11263-022-01593-w","article-title":"On the arbitrary-oriented object detection: Classification based approaches revisited","volume":"130","author":"Yang","year":"2022","journal-title":"Int. J. Comput. Vis."},{"key":"ref_37","first-page":"1","article-title":"Align deep features for oriented object detection","volume":"60","author":"Han","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"4307","DOI":"10.1109\/TGRS.2020.3010051","article-title":"Learning center probability map for detecting objects in aerial images","volume":"59","author":"Wang","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Guo, Z., Liu, C., Zhang, X., Jiao, J., Ji, X., and Ye, Q. (2021, January 20\u201325). Beyond bounding-box: Convex-hull feature adaptation for oriented and densely packed object detection. Proceedings of the IEEE\/CVF conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00868"},{"key":"ref_40","unstructured":"Hou, L., Lu, K., Xue, J., and Li, Y. (March, January 22). Shape-adaptive selection and measurement for oriented object detection. Proceedings of the AAAI Conference on Artificial Intelligence, Virtual."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"2342","DOI":"10.1109\/TCSVT.2022.3222906","article-title":"Ao2-detr: Arbitrary-oriented object detection transformer","volume":"33","author":"Dai","year":"2022","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_42","unstructured":"Lyu, C., Zhang, W., Huang, H., Zhou, Y., Wang, Y., Liu, Y., Zhang, S., and Chen, K. (2022). Rtmdet: An empirical study of designing real-time object detectors. arXiv."},{"key":"ref_43","first-page":"18381","article-title":"Learning high-precision bounding box for rotated object detection via kullback-leibler divergence","volume":"34","author":"Yang","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Xie, X., Cheng, G., Wang, J., Yao, X., and Han, J. (2021, January 10\u201317). Oriented R-CNN for object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00350"},{"key":"ref_45","unstructured":"Yang, X., Zhou, Y., Zhang, G., Yang, J., Wang, W., Yan, J., Zhang, X., and Tian, Q. (2022). The KFIoU loss for rotated object detection. arXiv."},{"key":"ref_46","first-page":"5607315","article-title":"Advancing plain vision transformer toward remote sensing foundation model","volume":"61","author":"Wang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/1\/21\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:22:58Z","timestamp":1759918978000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/1\/21"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,4]]},"references-count":46,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,1]]}},"alternative-id":["a18010021"],"URL":"https:\/\/doi.org\/10.3390\/a18010021","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,4]]}}}