{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T06:48:04Z","timestamp":1768459684807,"version":"3.49.0"},"reference-count":22,"publisher":"World Scientific Pub Co Pte Ltd","issue":"03","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2026,3,15]]},"abstract":"<jats:p>Remote sensing object detection faces persistent challenges from scale diversity, cluttered scenes, and resolution constraints, often leading to suboptimal performance on small or overlapping targets. To mitigate these limitations, we introduce MSAF-Net, a novel architecture designed to improve multi-resolution feature learning through refined attention mechanisms. The model incorporates coordinated spatial and channel-wise attention across network stages to adaptively highlight informative regions and suppress background interference. Furthermore, we develop a scale-aware feature fusion approach that aligns representations from different layers, enabling more accurate detection of objects across a wide range of scales. Experimental evaluations on standard benchmarks, including DOTA and NWPU VHR-10, confirm that MSAF-Net achieves notable gains over state-of-the-art methods in both precision and recall. These findings demonstrate the potential of attention-guided strategies in enhancing object detection for high-resolution remote sensing imagery.<\/jats:p>","DOI":"10.1142\/s0218001425550183","type":"journal-article","created":{"date-parts":[[2025,10,24]],"date-time":"2025-10-24T02:23:18Z","timestamp":1761272598000},"source":"Crossref","is-referenced-by-count":0,"title":["MSAF-Net: Enhancing Multi-Resolution Object Detection in Remote Sensing via Attention-Guided Feature Fusion"],"prefix":"10.1142","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-7737-2113","authenticated-orcid":false,"given":"Yiran","family":"Zhao","sequence":"first","affiliation":[{"name":"Faculty of Social Science and Humanities, The National University of Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4166-6338","authenticated-orcid":false,"given":"Rosniza Aznie Che","family":"Rose","sequence":"additional","affiliation":[{"name":"Faculty of Social Science and Humanities, The National University of Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0589-3237","authenticated-orcid":false,"given":"Kuok Choy","family":"Lam","sequence":"additional","affiliation":[{"name":"Faculty of Social Science and Humanities, The National University of Malaysia, Bangi, Selangor, Malaysia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-2764-7659","authenticated-orcid":false,"given":"Yujing","family":"Zhang","sequence":"additional","affiliation":[{"name":"Faculty of Mechanical, Delft University of Technology, Delft, Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2025,12,15]]},"reference":[{"key":"S0218001425550183BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2017.2675998"},{"key":"S0218001425550183BIB002","doi-asserted-by":"publisher","DOI":"10.3390\/rs14153735"},{"key":"S0218001425550183BIB003","unstructured":"A. Dosovitskiy\n                      et al.\n                      , An image is worth 16x16 words: Transformers for image recognition at scale, preprint (2020), arXiv:2010.11929."},{"key":"S0218001425550183BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00326"},{"key":"S0218001425550183BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.optlastec.2025.112652"},{"key":"S0218001425550183BIB006","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00720"},{"key":"S0218001425550183BIB007","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S0218001425550183BIB008","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"S0218001425550183BIB009","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01228-1_15"},{"key":"S0218001425550183BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/JSTARS.2023.3337132"},{"key":"S0218001425550183BIB011","doi-asserted-by":"publisher","DOI":"10.1016\/j.isprsjprs.2019.11.023"},{"key":"S0218001425550183BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.106"},{"key":"S0218001425550183BIB013","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00913"},{"key":"S0218001425550183BIB014","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"S0218001425550183BIB015","unstructured":"K. Sun, Y. Zhao, B. Jiang, T. Cheng, B. Xiao, D. Liu, Y. Mu, X. Wang, W. Liu and J. Wang, High-resolution representations for labeling pixels and regions, preprint (2019), arXiv:1904.04514."},{"key":"S0218001425550183BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"S0218001425550183BIB017","first-page":"10347","volume-title":"Proc. 38th Int. Conf. Machine Learning","author":"Touvron H.","year":"2021"},{"key":"S0218001425550183BIB018","doi-asserted-by":"publisher","DOI":"10.1007\/s00371-022-02503-4"},{"key":"S0218001425550183BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/JSTARS.2025.3576433"},{"key":"S0218001425550183BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"S0218001425550183BIB021","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"S0218001425550183BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/MGRS.2017.2762307"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001425550183","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T01:11:28Z","timestamp":1768266688000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001425550183"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,15]]},"references-count":22,"journal-issue":{"issue":"03","published-print":{"date-parts":[[2026,3,15]]}},"alternative-id":["10.1142\/S0218001425550183"],"URL":"https:\/\/doi.org\/10.1142\/s0218001425550183","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,15]]},"article-number":"2555018"}}