{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,5]],"date-time":"2026-07-05T10:44:41Z","timestamp":1783248281429,"version":"3.54.6"},"reference-count":31,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2023,6,1]],"date-time":"2023-06-01T00:00:00Z","timestamp":1685577600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"111 Project of China","award":["B14010"],"award-info":[{"award-number":["B14010"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Attacking a naval vessel with multiple missiles is an important way to improve the hit rate of missiles. Missile-borne radars need to complete detection and antijamming tasks to guide missiles, but communication between these radars is often difficult. In this paper, an optimization method based on multi-agent reinforcement learning is proposed for the collaborative detection and antijamming tasks of multiple radars against one naval vessel. We consider the collaborative radars as one player to make their confrontation with the naval vessel a two-person zero-sum game. With temporal constraints of the radar\u2019s and jammer\u2019s recognition and preparation interval, the game focuses on taking a favorable position at the end of the confrontation. It is assumed the total jamming capability of a shipborne jammer is constant and limited, and the shipborne jammer allocates the jamming capability in the radar\u2019s direction according to the radar threat assessment result and its probability of successful detection. The radars work collaboratively through prior centralized training and obtain a good performance by decentralized execution. The proposed method can make radars collaborate to detect the naval vessel, rather than only considering the detection result of each radar itself. Experimental results show that the proposed method in this paper is effective, improving the winning probability to 10% and 25% in the two-radar and four-radar scenarios, respectively.<\/jats:p>","DOI":"10.3390\/rs15112893","type":"journal-article","created":{"date-parts":[[2023,6,2]],"date-time":"2023-06-02T01:33:54Z","timestamp":1685669634000},"page":"2893","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9219-0865","authenticated-orcid":false,"given":"Cheng","family":"Feng","sequence":"first","affiliation":[{"name":"Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiongjun","family":"Fu","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing 100081, China"},{"name":"Tangshan Research Institute of BIT, Tangshan 063007, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ziyi","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jian","family":"Dong","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhichun","family":"Zhao","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, Beijing 100081, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Teng","family":"Pan","sequence":"additional","affiliation":[{"name":"Beijing Institute of Space Systems Engineering, Beijing 100094, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1109\/22.989948","article-title":"Electronic warfare systems","volume":"50","author":"Spezio","year":"2002","journal-title":"IEEE Trans. Microw. Theory Tech."},{"key":"ref_2","unstructured":"Skolnik, M. (2008). Radar Handbook, McGraw-Hill. [3rd ed.]."},{"key":"ref_3","first-page":"1","article-title":"Cooperative Combat of Missile Formation: Concepts and Key Technologies","volume":"29","author":"Xiao","year":"2013","journal-title":"Aerosp. Electron. Warf."},{"key":"ref_4","first-page":"22","article-title":"Review of Multi-Missile Cooperative Guidance","volume":"38","author":"Zhao","year":"2017","journal-title":"Acta Aeronaut. Et Astronaut. Sin."},{"key":"ref_5","first-page":"6","article-title":"Summary of Guidance Law based on Cooperative Attack of Multi-Missile method","volume":"29","author":"Wang","year":"2011","journal-title":"Flight Dyn."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/MSP.2011.940281","article-title":"Distributed and Decentralized Multicamera Tracking","volume":"28","author":"Taj","year":"2011","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1109\/JPROC.1997.554211","article-title":"Distributed fusion architectures and algorithms for target tracking","volume":"85","author":"Liggins","year":"1997","journal-title":"Proc. IEEE"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.inffus.2019.06.026","article-title":"Distributed estimation over a low-cost sensor network: A Review of state-of-the-art","volume":"54","author":"He","year":"2020","journal-title":"Inf. Fusion"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2200","DOI":"10.1109\/TAES.2011.5937292","article-title":"Adaptive MIMO Radar Design and Detection in Compound-Gaussian Clutter","volume":"47","author":"Akcakaya","year":"2011","journal-title":"IEEE Trans. Aerosp. Electron. Syst."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Chong, C.Y., Pascal, F., Ovarlez, J.-P., and Lesturgie, M. (September, January 31). Adaptive MIMO radar detection in non-Gaussian and heterogeneous clutter considering fluctuating targets. Proceedings of the 2009 IEEE\/SP 15th Workshop on Statistical Signal Processing, Cardiff, UK.","DOI":"10.1109\/SSP.2009.5278651"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2282","DOI":"10.1109\/TSP.2013.2245323","article-title":"A Parametric Moving Target Detector for Distributed MIMO Radar in Non-Homogeneous Environment","volume":"61","author":"Wang","year":"2013","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"5538","DOI":"10.1109\/TSP.2011.2162509","article-title":"Phase Synchronization for Coherent MIMO Radar: Algorithms and Their Analysis","volume":"59","author":"Yang","year":"2011","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2370","DOI":"10.1109\/TAES.2018.2816467","article-title":"Centralized Adaptive CFAR Detection with Registration Errors in Multistatic Radar","volume":"54","author":"Yang","year":"2018","journal-title":"IEEE Trans. Aerosp. Electron. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1109\/TSP.2011.2175386","article-title":"Diffusion Kalman Filtering Based on Covariance Intersection","volume":"60","author":"Hu","year":"2012","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1109\/7.640257","article-title":"Local SNR considerations in decentralized CFAR detection","volume":"34","author":"Mathur","year":"1998","journal-title":"IEEE Trans. Aerosp. Electron. Syst."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.sigpro.2018.06.003","article-title":"Spatial Resolution Cell Based Centralized Target Detection in Multistatic Radar","volume":"152","author":"Yang","year":"2018","journal-title":"Signal Process."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Panoui, A., Lambotharan, S., and Chambers, A. (2014, January 8\u20139). Game theoretic power allocation for a multistatic radar network in the presence of estimation error. Proceedings of the Sensor Signal Processing for Defence, Edinburgh, UK.","DOI":"10.1109\/SSPD.2014.6943316"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Panoui, A., Lambotharan, S., and Chambers, A. (2015, January 10\u201315). Waveform allocation for a MIMO radar network using potential games. Proceedings of the Radar Conference, Arlington, VA, USA.","DOI":"10.1109\/RADAR.2015.7131096"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1109\/TSP.2011.2174989","article-title":"Scheduling and Power Allocation in a Cognitive Radar Network for Multiple-Target Tracking","volume":"60","author":"Chavali","year":"2012","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1109\/TSMCC.2007.913919","article-title":"A Comprehensive Survey of Multiagent Reinforcement Learning","volume":"38","author":"Busoniu","year":"2008","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"163334","DOI":"10.1109\/ACCESS.2020.3022638","article-title":"Recurrent MADDPG for Object Detection and Assignment in Combat Tasks","volume":"8","author":"Wei","year":"2020","journal-title":"IEEE Access"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1109\/TVT.2021.3129504","article-title":"Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking","volume":"71","author":"Xia","year":"2022","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sutton, R.S., and Barto, A.G. (1998). Reinforcement Learning: An Introduction. IEEE Trans. Neural Netw., 9.","DOI":"10.1109\/TNN.1998.712192"},{"key":"ref_24","first-page":"86","article-title":"Research on Reinforcement Learning Technology: A Review","volume":"30","author":"Gao","year":"2004","journal-title":"Acta Autom. Sin."},{"key":"ref_25","first-page":"1","article-title":"Overview on Multi-Agent Reinforcement Learning","volume":"46","author":"Du","year":"2019","journal-title":"Comput. Sci."},{"key":"ref_26","unstructured":"Rashid, T., Samvelyan, M., De Witt, C.S., Farquhar, G., Foerster, J., and Whiteson, S. (2018). QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning. arXiv."},{"key":"ref_27","unstructured":"Lowe, R., Wu, Y.I., Tamar, A., Harb, J., Pieter Abbeel, O., and Mordatch, I. (2017, January 4\u20139). Multi-agent actor-critic for mixed cooperative-competitive environments. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_28","unstructured":"Foerster, J., Nardelli, N., Farquhar, G., Afouras, T., Torr, P.H.S., Kohli, P., and Whiteson, S. (2017, January 6\u201311). Stabilising experience replay for deep multi-agent reinforcement learning. Proceedings of the 34th International Conference on Machine Learning-Volume 70, Sydney, NSW, Australia."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"97429","DOI":"10.1109\/ACCESS.2022.3200761","article-title":"A Radar Anti-Jamming Strategy Based on Game Theory with Temporal Constraints","volume":"10","author":"Feng","year":"2022","journal-title":"IEEE Access"},{"key":"ref_30","unstructured":"Neumann, J.V., and Morgenstern, O. (1947). Theory of Games and Economic Behavior, Princeton University Press."},{"key":"ref_31","unstructured":"Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., and Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/11\/2893\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:47:21Z","timestamp":1760125641000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/11\/2893"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,1]]},"references-count":31,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2023,6]]}},"alternative-id":["rs15112893"],"URL":"https:\/\/doi.org\/10.3390\/rs15112893","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,1]]}}}