{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T12:08:52Z","timestamp":1761739732669,"version":"build-2065373602"},"reference-count":46,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T00:00:00Z","timestamp":1761609600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>In the field of remote sensing (RS) object detection, efficient and accurate target recognition is crucial for applications such as national defense and maritime monitoring. However, existing detection methods either have high computational complexity, making them unsuitable for real-time applications, or suffer from feature redundancy issues that affect detection accuracy. To address these challenges, this paper proposes a Feature-Differentiated Perception (FDP) lightweight remote sensing object detection method, which optimizes computational efficiency while maintaining high detection accuracy. The proposed method introduces two critical innovations: (1) Dynamic mixed convolution (DM-Conv), which uses linear mapping to efficiently generate redundant feature maps, reducing convolutional computation. It combines features from different intermediate layers through weighted fusion, effectively reducing the number of channels and improving feature utilization. Channel refers to a single feature map in the multi-dimensional feature representation, where each channel corresponds to a specific feature pattern (e.g., edges, textures, or semantic information) learned by the network. (2) The Spatial Orthogonal Attention (SOA) mechanism, which enhances the ability to model long-range dependencies between distant pixels, thereby improving feature representation capability. Experiments on public remote sensing object detection datasets, including DOTA, HRSC2016, and UCMerced-LandUse, demonstrate that the proposed model achieves a significant reduction in computational complexity while maintaining nearly lossless detection accuracy. On the DOTA dataset, the proposed method achieves an mAP (mean Average Precision) of 79.37%, outperforming existing lightweight models in terms of both speed and accuracy. This study provides new insights and practical solutions for efficient remote sensing object detection in embedded and edge computing environments.<\/jats:p>","DOI":"10.3390\/a18110684","type":"journal-article","created":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T05:48:46Z","timestamp":1761716926000},"page":"684","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Feature-Differentiated Perception with Dynamic Mixed Convolution and Spatial Orthogonal Attention for Faster Aerial Object Detection"],"prefix":"10.3390","volume":"18","author":[{"given":"Yiming","family":"Ma","sequence":"first","affiliation":[{"name":"Computer Science and Information Technology, Universiti Putra Malaysia, Kuala Lumpur 43400, Malaysia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5188-3793","authenticated-orcid":false,"given":"Noridayu","family":"Manshor","sequence":"additional","affiliation":[{"name":"Computer Science and Information Technology, Universiti Putra Malaysia, Kuala Lumpur 43400, Malaysia"}]},{"given":"Fatimah binti","family":"Khalid","sequence":"additional","affiliation":[{"name":"Computer Science and Information Technology, Universiti Putra Malaysia, Kuala Lumpur 43400, Malaysia"}]}],"member":"1968","published-online":{"date-parts":[[2025,10,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Qiu, H., Li, H., Wu, Q., Meng, F., Ngan, K.N., and Shi, H. (2019). A2RMNet: Adaptively Aspect Ratio Multi-Scale Network for Object Detection in Remote Sensing Images. Remote Sens., 11.","DOI":"10.3390\/rs11131594"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). CenterNet: Keypoint Triplets for Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00667"},{"key":"ref_3","unstructured":"Wang, X., Zhang, R., Sun, Y., and Qi, J. (2018, January 3\u20138). Kdgan: Knowledge distillation with generative adversarial networks. Proceedings of the Advances in Neural Information Processing Systems 31 (NeurIPS 2018), Montreal, QC, Canada."},{"key":"ref_4","unstructured":"Chen, Y., Han, C., Wang, N., and Zhang, Z. (2019). Revisiting Feature Alignment for One-stage Object Detection. arXiv."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"5615515","DOI":"10.1109\/TGRS.2023.3294520","article-title":"Toward hierarchical adaptive alignment for aerial object detection in remote sensing images","volume":"61","author":"Deng","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","first-page":"5602511","article-title":"Align Deep Features for Oriented Object Detection","volume":"60","author":"Han","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wang, J., Chen, K., Yang, S., Loy, C.C., and Lin, D. (2019, January 16\u201320). Region proposal by guided anchoring. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00308"},{"key":"ref_8","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., and Sun, J. (2017). Light-Head R-CNN: In Defense of Two-Stage Object Detector. arXiv."},{"key":"ref_9","first-page":"6782","article-title":"Spatial transform decoupling for oriented object detection","volume":"38","author":"Yu","year":"2024","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_10","unstructured":"Yuan, X., Zheng, Z., Li, Y., Liu, X., Liu, L., Li, X., Hou, Q., and Cheng, M.M. (2025). Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"6505805","DOI":"10.1109\/LGRS.2022.3144513","article-title":"FAR-Net: Fast Anchor Refining for Arbitrary-Oriented Object Detection","volume":"19","author":"Deng","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 16\u201320). Learning roi transformer for oriented object detection in aerial images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_13","unstructured":"Yang, X., Yan, J., Feng, Z., and He, T. (2019). R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016, January 5\u201310). R-FCN: Object Detection via Region-based Fully Convolutional Networks. Proceedings of the Advances in Neural Information Processing Systems 29 (NIPS 2016), Barcelona, Spain."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 13\u201319). GhostNet: More Features From Cheap Operations. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhang, X., Zhou, X., Lin, M., and Sun, J. (2018, January 18\u201322). Shufflenet: An extremely efficient convolutional neural network for mobile devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"ref_18","unstructured":"Ma, N., Zhang, X., Zheng, H.T., and Sun, J. (2018, January 8\u201314). Shufflenet v2: Practical guidelines for efficient cnn architecture design. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany."},{"key":"ref_19","unstructured":"Zhuang, Z., Tan, M., Zhuang, B., Liu, J., Guo, Y., Wu, Q., Huang, J., and Zhu, J. (2018, January 3\u20138). Discrimination-aware channel pruning for deep neural networks. Proceedings of the Advances in Neural Information Processing Systems 31 (NeurIPS 2018), Montreal, QC, Canada."},{"key":"ref_20","first-page":"5612414","article-title":"Arbitrary-oriented ship detection through center-head point extraction","volume":"60","author":"Zhang","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"5623616","DOI":"10.1109\/TGRS.2022.3173610","article-title":"Ship Detection in High-Resolution Optical Remote Sensing Images Aided by Saliency Information","volume":"60","author":"Ren","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","first-page":"223373","article-title":"LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images","volume":"60","author":"Huang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Li, Y., Hou, Q., Zheng, Z., Cheng, M.M., Yang, J., and Li, X. (2023, January 2\u20136). Large selective kernel network for remote sensing object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.01540"},{"key":"ref_24","first-page":"5222316","article-title":"Fast Task-Specific Region Merging for SAR Image Segmentation","volume":"60","author":"Ma","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","first-page":"5214116","article-title":"Fast SAR Image Segmentation with Deep Task-Specific Superpixel Sampling and Soft Graph Convolution","volume":"60","author":"Ma","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Yi, X., Gu, S., Wu, X., and Jing, D. (2025). AFEDet: A Symmetry-Aware Deep Learning Model for Multi-Scale Object Detection in Aerial Images. Symmetry, 17.","DOI":"10.3390\/sym17040488"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Cheng, G., Zhou, P., and Han, J. (2016, January 27\u201330). RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.315"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/j.isprsjprs.2020.01.025","article-title":"Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images","volume":"161","author":"Fu","year":"2020","journal-title":"Isprs J. Photogramm. Remote Sens."},{"key":"ref_29","unstructured":"Lin, Y., Feng, P., and Guan, J. (2019). IENet: Interacting Embranchment One Stage Anchor Free Detector for Orientation Aerial Object Detection. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Lee, J.R., and Moon, Y.H. (2021, January 20\u201322). An Empirical Study on Channel Pruning through Residual Connections. Proceedings of the 2021 International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, Republic of Korea.","DOI":"10.1109\/ICTC52510.2021.9620978"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Xie, S., Girshick, R.B., Doll\u00e1r, P., Tu, Z., and He, K. (2017, January 21\u201326). Aggregated Residual Transformations for Deep Neural Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.634"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Dang, J., and Yang, J. (2021, January 6\u201311). HIGCNN: Hierarchical Interleaved Group Convolutional Neural Networks for Point Clouds Analysis. Proceedings of the ICASSP 2021\u20132021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada.","DOI":"10.1109\/ICASSP39728.2021.9413879"},{"key":"ref_33","unstructured":"Tan, M., and Le, Q. (2019, January 9\u201315). Efficientnet: Rethinking model scaling for convolutional neural networks. Proceedings of the International Conference on Machine Learning (PMLR), Long Beach, CA, USA."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ma, N., Zhang, X., Huang, J., and Sun, J. (2020, January 23\u201328). WeightNet: Revisiting the Design Space of Weight Networks. Proceedings of the Computer Vision\u2014ECCV 2020\u201416th European Conference, Glasgow, UK.","DOI":"10.1007\/978-3-030-58555-6_46"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Li, Y., Chen, Y., Dai, X., Chen, D., Liu, M., Yuan, L., Liu, Z., Zhang, L., and Vasconcelos, N. (2021, January 11\u201317). MicroNet: Improving Image Recognition with Extremely Low FLOPs. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision, ICCV, Online.","DOI":"10.1109\/ICCV48922.2021.00052"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Ma, S., Khader, A., and Xiao, L. (2022, January 21\u201323). Complementary features-aware attentive multi-adapter network for hyperspectral object tracking. Proceedings of the Fourteenth International Conference on Graphics and Image Processing (ICGIP 2022), Nanjing, China.","DOI":"10.1117\/12.2680541"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Han, J., Ding, J., Xue, N., and Xia, G.S. (2022). ReDet: A Rotation-equivariant Detector for Aerial Object Detection. arXiv.","DOI":"10.1109\/CVPR46437.2021.00281"},{"key":"ref_38","first-page":"4335","article-title":"Detecting Rotated Objects as Gaussian Distributions and its 3-D Generalization","volume":"45","author":"Yang","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"6414","DOI":"10.1109\/TCSVT.2022.3166803","article-title":"RB-Net: Training Highly Accurate and Efficient Binary Neural Networks With Reshaped Point-Wise Convolution and Balanced Activation","volume":"32","author":"Liu","year":"2022","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Liu, C., Ding, W., Xia, X., Zhang, B., Gu, J., Liu, J., Ji, R., and Doermann, D. (2019, January 16\u201320). Circulant binary convolutional networks: Enhancing the performance of 1-bit dcnns with circulant back propagation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00280"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., and Luo, Z. (2017). R2CNN: Rotational region CNN for orientation robust scene text detection. arXiv.","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Sun, X., and Fu, K. (2019, January 27\u201328). SCRDet: Towards more robust detection for small, cluttered and rotated objects. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00832"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"He, Y., Lin, J., Liu, Z., Wang, H., Li, L.J., and Han, S. (2018, January 8\u201314). Amc: Automl for model compression and acceleration on mobile devices. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_48"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"1895","DOI":"10.1109\/TIP.2022.3148874","article-title":"A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection","volume":"31","author":"Huang","year":"2022","journal-title":"IEEE Trans. Image Process."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"2384","DOI":"10.1109\/TPAMI.2022.3166956","article-title":"SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing","volume":"45","author":"Yang","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1109\/TPAMI.2020.2974745","article-title":"Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection","volume":"43","author":"Xu","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/11\/684\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T06:10:35Z","timestamp":1761718235000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/11\/684"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,28]]},"references-count":46,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2025,11]]}},"alternative-id":["a18110684"],"URL":"https:\/\/doi.org\/10.3390\/a18110684","relation":{},"ISSN":["1999-4893"],"issn-type":[{"type":"electronic","value":"1999-4893"}],"subject":[],"published":{"date-parts":[[2025,10,28]]}}}