{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T22:27:07Z","timestamp":1776378427949,"version":"3.51.2"},"reference-count":40,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T00:00:00Z","timestamp":1646697600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61871307"],"award-info":[{"award-number":["61871307"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"the Fundamental Research Funds for the Central Universities","award":["JB210207"],"award-info":[{"award-number":["JB210207"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Large Unmanned Aerial Vehicle (UAV) clusters, containing hundreds of UAVs, have widely been used in the modern world. Therein, mission planning is the core of large UAV cluster collaborative systems. In this paper, we propose a mission planning method by introducing the Simple Attention Model (SAM) into Dynamic Information Reinforcement Learning (DIRL), named DIRL-SAM. To reduce the computational complexity of the original attention model, we derive the SAM with a lightweight interactive model to rapidly extract high-dimensional features of the cluster information. In DIRL, dynamic training conditions are considered to simulate different mission environments. Meanwhile, the data expansion in DIRL guarantees the convergence of the model in these dynamic environments, which improves the robustness of the algorithm. Finally, the simulation experiment results show that the proposed method can adaptively provide feasible mission planning schemes with second-level solution speed and that it exhibits excellent generalization performance in large-scale cluster planning problems.<\/jats:p>","DOI":"10.3390\/rs14061304","type":"journal-article","created":{"date-parts":[[2022,3,9]],"date-time":"2022-03-09T01:50:53Z","timestamp":1646790653000},"page":"1304","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["A Fast and Robust Algorithm with Reinforcement Learning for Large UAV Cluster Mission Planning"],"prefix":"10.3390","volume":"14","author":[{"given":"Lei","family":"Zuo","sequence":"first","affiliation":[{"name":"National Lab of Radar Signal Processing, Xidian University, Xi\u2019an 710000, China"}]},{"given":"Shan","family":"Gao","sequence":"additional","affiliation":[{"name":"National Lab of Radar Signal Processing, Xidian University, Xi\u2019an 710000, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6672-367X","authenticated-orcid":false,"given":"Yachao","family":"Li","sequence":"additional","affiliation":[{"name":"National Lab of Radar Signal Processing, Xidian University, Xi\u2019an 710000, China"}]},{"given":"Lianghai","family":"Li","sequence":"additional","affiliation":[{"name":"Beijing Research Institute of Telemetry, Beijing 100076, China"}]},{"given":"Ming","family":"Li","sequence":"additional","affiliation":[{"name":"National Lab of Radar Signal Processing, Xidian University, Xi\u2019an 710000, China"}]},{"given":"Xiaofei","family":"Lu","sequence":"additional","affiliation":[{"name":"Jiuquan Satelite Launch Centre, Jiuquan 735400, China"}]}],"member":"1968","published-online":{"date-parts":[[2022,3,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Fraser, B.T., and Congalton, R.G. (2021). Monitoring Fine-Scale Forest Health Using Unmanned Aerial Systems (UAS) Multispectral Models. Remote Sens., 13.","DOI":"10.3390\/rs13234873"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"105643","DOI":"10.1016\/j.asoc.2019.105643","article-title":"Adaptive Task Allocation for Multi-UAV Systems Based on Bacteria Foraging Behavior","volume":"83","author":"Kurdi","year":"2019","journal-title":"Appl. Soft Comput."},{"key":"ref_3","first-page":"3984","article-title":"Joint Deployment and Task Scheduling Optimization for Large-Scale Mobile Users in Multi-UAV-Enabled Mobile Edge Computing","volume":"9","author":"Wang","year":"2019","journal-title":"IEEE Trans. Cybern."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1016\/j.ast.2018.10.027","article-title":"Cooperative Load Transportation Using Multiple UAVs","volume":"84","author":"Shirani","year":"2019","journal-title":"Aerosp Sci Technol."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Sun, L., Chen, J., Feng, D., and Xing, M. (2021). Parallel Ensemble Deep Learning for Real-Time Remote Sensing Video Multi-Target Detection. Remote Sens., 13.","DOI":"10.3390\/rs13214377"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Milani, I., Bongioanni, C., Colone, F., and Lombardo, P. (2021). Fusing Measurements from Wi-Fi Emission-Based and Passive Radar Sensors for Short-Range Surveillance. Remote Sens., 13.","DOI":"10.3390\/rs13183556"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/j.physa.2017.08.094","article-title":"Modeling and Simulation of Dynamic Ant Colony\u2019s Labor Division for Task Allocation of UAV Swarm","volume":"491","author":"Wu","year":"2018","journal-title":"Phys. A Stat. Mech. Appl."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2825","DOI":"10.1016\/j.cja.2020.02.009","article-title":"Cooperative Task Assignment of Multi-UAV System","volume":"3","author":"Zhang","year":"2020","journal-title":"Chin. J. Aeronaut."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3363","DOI":"10.1109\/JSAC.2021.3088623","article-title":"Joint Resource, Trajectory, and Artificial Noise Optimization in Secure Driven 3-D UAVs with NOMA and Imperfect CSI","volume":"39","author":"Li","year":"2021","journal-title":"IEEE J. Sel. Area Commun."},{"key":"ref_10","first-page":"249","article-title":"A Convex Optimization Based Method for Multiple UAV Autonomous Formation Reconfiguration","volume":"47","author":"Wang","year":"2017","journal-title":"Sci China Technol. Sci."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mohr, H., Schroeder, K., and Black, J. (2019, January 2\u20139). Distributed Source Seeking and Robust Obstacle Avoidance through Hybrid Gradient Descent. Proceedings of the 2019 IEEE Aerospace Conference, Big Sky, MT, USA.","DOI":"10.1109\/AERO.2019.8741882"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"103447","DOI":"10.1016\/j.autcon.2020.103447","article-title":"An A-Star Algorithm for Semi-optimization of Crane Location and Configuration in Modular Construction","volume":"121","author":"Bagheri","year":"2021","journal-title":"Automat. Constr."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Martin, R.A., Rojas, I., Franke, K., and Hedengren, J.D. (2016). Evolutionary View Planning for Optimized UAV Terrain Modeling in a Simulated Environment. Remote Sens., 8.","DOI":"10.3390\/rs8010026"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Huang, X., Dong, X., Ma, J., Liu, K., Ahmed, S., Lin, J., and Qiu, B. (2021). The Improved A* Obstacle Avoidance Algorithm for the Plant Protection UAV with Millimeter Wave Radar and Monocular Camera Data Fusion. Remote Sens., 13.","DOI":"10.3390\/rs13173364"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Banerjee, B.P., and Raval, S. (2021). A Particle Swarm Optimization Based Approach to Pre-Tune Programmable Hyperspectral Sensors. Remote Sens., 13.","DOI":"10.36227\/techrxiv.14058233"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Alhaqbani, A., Kurdi, H., and Youcef-Toumi, K. (2021). Fish-Inspired Task Allocation Algorithm for Multiple Unmanned Aerial Vehicles in Search and Rescue Missions. Remote Sens., 13.","DOI":"10.3390\/rs13010027"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1016\/j.ast.2018.01.035","article-title":"Cooperative Search-Attack Mission Planning for Multi-UAV based on Intelligent Self-Organized Algorithm","volume":"76","author":"Zhen","year":"2018","journal-title":"Aerosp. Sci. Technol."},{"key":"ref_18","unstructured":"Vinyals, O., Fortunato, M., and Jaitly, N. (2015, January 7\u201312). Pointer Networks. Proceedings of the 2015 Neural Information Processing Systems (NIPS), Montreal, QC, Canada."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Meng, X., Wu, L., and Yu, S. (2019). Research on Resource Allocation Method of Space Information Networks Based on Deep Reinforcement Learning. Remote Sens., 11.","DOI":"10.3390\/rs11040448"},{"key":"ref_20","unstructured":"Nazari, M., Oroojlooy, A., Snyder, L.V., and Takac, M. (2018, January 2\u20138). Reinforcement Learning for Solving the Vehicle Routing Problem. Proceedings of the 2018 Neural Information Processing Systems (NIPS), Montreal, QC, Canada."},{"key":"ref_21","unstructured":"Bello, I., Pham, H.V., Le, Q.V., Norouzi, M., and Bengio, S. (2017, January 24\u201326). Neural Combinatorial Optimization with Reinforcement Learning. Proceedings of the 2017 International Conference on Learning Representations (ICLR), Toulon, France."},{"key":"ref_22","unstructured":"Kool, W., Herke, V.F., and Welling, M. (2017, January 6\u20139). Attention, Learn to Solve Routing Problems. Proceedings of the 2019 International Conference on Learning Representations (ICLR), New Orleans, LA, USA."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Huang, Y., Mu, Z., Wu, S., Cui, B., and Duan, Y. (2021). Revising the Observation Satellite Scheduling Problem Based on Deep Reinforcement Learning. Remote Sens., 13.","DOI":"10.3390\/rs13122377"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1016\/j.trb.2020.08.005","article-title":"An Actor-Critic Deep Reinforcement Learning Approach for Metro Train Scheduling with Rolling Stock Circulation under Stochastic Demand","volume":"140","author":"Ying","year":"2020","journal-title":"Transp. Res. B Meth."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1016\/j.future.2018.11.014","article-title":"Application of Reinforcement Learning in UAV Cluster Task Scheduling","volume":"95","author":"Jun","year":"2019","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"146264","DOI":"10.1109\/ACCESS.2019.2943253","article-title":"Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning","volume":"7","author":"Qie","year":"2019","journal-title":"IEEE Access"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"2930","DOI":"10.1016\/j.cja.2020.05.001","article-title":"Coactive Design of Explainable Agent-Based Task Planning and Deep Reinforcement Learning for Human-UAVs Teamwork","volume":"33","author":"Wang","year":"2020","journal-title":"Chin. J. Aeronaut."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"3103","DOI":"10.1109\/TCYB.2020.2977661","article-title":"Deep Reinforcement Learning for Multi-objective Optimization","volume":"51","author":"Li","year":"2021","journal-title":"IEEE Trans. Cybern."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1016\/j.swevo.2018.06.005","article-title":"Weighted Strategies to Guide a Multi-Objective Evolutionary Algorithm for Multi-UAV Mission Planning","volume":"44","author":"Atencia","year":"2019","journal-title":"Swarm Evol. Comput."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"105826","DOI":"10.1016\/j.ast.2020.105826","article-title":"An Intelligent Cooperative Mission Planning Scheme of UAV Swarm in Uncertain Dynamic Environment","volume":"100","author":"Zhen","year":"2020","journal-title":"Aerosp. Sci. Technol."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"588","DOI":"10.1016\/j.ast.2019.06.024","article-title":"Fast Task Allocation for Heterogeneous Unmanned Aerial Vehicles Through Reinforcement Learning","volume":"92","author":"Zhao","year":"2019","journal-title":"Aerosp. Sci. Technol."},{"key":"ref_32","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 2015 International Conference on Learning Representations (ICLR), Lille, France."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X.Y., Ren, S.Q., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_34","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A., Kaiser, L., and Polosukhin, I. (2017, January 3\u20139). Attention Is All You Need. Proceedings of the 2017 Neural Information Processing Systems (NIPS), Long Beach, CA, USA."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Hussain, R., Karbhari, Y., Ijaz, M.F., Wozniak, M., Singh, P.K., and Sarkar, R. (2021). Revise-Net: Exploiting Reverse Attention Mechanism for Salient Object Detection. Remote Sens., 13.","DOI":"10.3390\/rs13234941"},{"key":"ref_36","unstructured":"Guo, Q.P., Qiu, X.P., and Liu, P.F. (2019, January 5\u20137). Star-Transformer. Proceedings of the 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), Minneapolis, MN, USA."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Rennie, J.S., Marcheret, E., Mroueh, Y., Ross, J., and Goel, V. (2017, January 22\u201325). Self-Critical Sequence Training for Image Captioning. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.131"},{"key":"ref_38","unstructured":"Sutton, R.S., McAllester, D., Singh, S., and Mansour, Y. (December, January 29). Policy Gradient Methods for Reinforcement Learning with Function Approximation. Proceedings of the 1999 Neural Information Processing Systems (NIPS), Denver, CO, USA."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhou, N., Lau, L., Bai, R., and Moore, T. (2021). A Genetic Optimization Resampling Based Particle Filtering Algorithm for Indoor Target Tracking. Remote Sens., 13.","DOI":"10.3390\/rs13010132"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Huang, J., Xing, Y., You, H., Qin, L., Tian, J., and Ma, J. (2019). Particle Swarm Optimization-Based Noise Filtering Algorithm for Photon Cloud Data in Forest Area. Remote Sens., 11.","DOI":"10.3390\/rs11080980"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/6\/1304\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:33:56Z","timestamp":1760135636000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/6\/1304"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,8]]},"references-count":40,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2022,3]]}},"alternative-id":["rs14061304"],"URL":"https:\/\/doi.org\/10.3390\/rs14061304","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,8]]}}}