{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T14:08:49Z","timestamp":1753884529556,"version":"3.41.2"},"reference-count":38,"publisher":"World Scientific Pub Co Pte Ltd","issue":"18","funder":[{"DOI":"10.13039\/501100010023","name":"Natural Science Research of Jiangsu Higher Education Institutions of China","doi-asserted-by":"publisher","award":["1020220767"],"award-info":[{"award-number":["1020220767"]}],"id":[{"id":"10.13039\/501100010023","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100010023","name":"Natural Science Research of Jiangsu Higher Education Institutions of China","doi-asserted-by":"publisher","award":["22KJD590001"],"award-info":[{"award-number":["22KJD590001"]}],"id":[{"id":"10.13039\/501100010023","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J CIRCUIT SYST COMP"],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:p> In response to the poor decision-making caused by aerodynamic coupling and model linearization in the trajectory correction model, it is proposed to use a reinforcement learning method to establish a Constrained Markov Decision Process (CMDP) model between action prediction and action constraint. Taking a classical two-dimensional trajectory correction projectile for study, the cost function within constraints of thruster violations is built, so that the sequential prediction is transformed to solve the Constrained Reinforcement Learning (CRL) problem. Accordingly, a method combined with the policy gradient algorithm based on Lagrange operator with the model prediction algorithm is proposed. The test results show that the trained optimal policy model achieves a probability of over 65% and the correction projectile falls within a radius of 5 meters from the preset target point (CEP [Formula: see text]5[Formula: see text]m). Simulation examples have verified the decision-making of correction action sequences autonomously using the trained model, resulting in correction errors of 1.79 1.1 and 0.42[Formula: see text]m in the [Formula: see text] and z directions, respectively, demonstrating high correction accuracy and autonomous decision-making capability. <\/jats:p>","DOI":"10.1142\/s0218126625500306","type":"journal-article","created":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T03:45:57Z","timestamp":1723347957000},"source":"Crossref","is-referenced-by-count":0,"title":["Two-Dimensional Ballistic Correction Decision-Making Based on Constrained Reinforcement Learning"],"prefix":"10.1142","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0875-4336","authenticated-orcid":false,"given":"Xiaoyun","family":"Lei","sequence":"first","affiliation":[{"name":"School of Information Technology, Jiangsu Open University, Nanjing, Jiangsu 210036, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-5556-799X","authenticated-orcid":false,"given":"Lin","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information Technology, Jiangsu Open University, Nanjing, Jiangsu 210036, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-2080-1222","authenticated-orcid":false,"given":"Lihua","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering, Nanjing University of Science and Technology, Nanjing, Jiangsu 210096, P.\u00a0R.\u00a0China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2024,9,16]]},"reference":[{"key":"S0218126625500306BIB001","first-page":"634","volume":"42","author":"Shen Q.","year":"2022","journal-title":"Trans. Beijing Inst. Technol."},{"key":"S0218126625500306BIB002","first-page":"14","volume":"32","author":"Ke Z. F.","year":"2020","journal-title":"J. Ballist."},{"key":"S0218126625500306BIB003","first-page":"16","volume":"42","author":"Huo P. F.","year":"2020","journal-title":"J. Detect. Control"},{"key":"S0218126625500306BIB004","first-page":"323421","volume":"41","author":"Yang S. Z.","year":"2020","journal-title":"Acta Aeronaut. Astronaut. Sin."},{"key":"S0218126625500306BIB006","doi-asserted-by":"publisher","DOI":"10.2514\/2.3765"},{"key":"S0218126625500306BIB007","first-page":"22","volume":"32","author":"Wu H. Z.","year":"2020","journal-title":"J. Ballist."},{"key":"S0218126625500306BIB008","doi-asserted-by":"publisher","DOI":"10.1109\/IAEAC50856.2021.9391034"},{"volume-title":"Reinforcement Learning: An Introduction","year":"1998","author":"Sutton R. S.","key":"S0218126625500306BIB009"},{"key":"S0218126625500306BIB010","first-page":"415","volume":"42","author":"Zhang Q. H.","year":"2020","journal-title":"Syst. Eng. Electron."},{"key":"S0218126625500306BIB011","first-page":"3040","volume":"43","author":"Li Q.","year":"2022","journal-title":"Acta Armamentarii"},{"key":"S0218126625500306BIB012","doi-asserted-by":"publisher","DOI":"10.3390\/app10186567"},{"first-page":"1990","volume-title":"AIAA Guidance, Navigation, and Control Conf.","author":"Junell J. L.","key":"S0218126625500306BIB013"},{"key":"S0218126625500306BIB014","first-page":"611","volume":"42","author":"Liang C.","year":"2021","journal-title":"J. Astronaut."},{"first-page":"4470","volume-title":"AIAA Guidance, Navigation, and Control Conf.","author":"Gaudet B.","key":"S0218126625500306BIB015"},{"key":"S0218126625500306BIB016","doi-asserted-by":"publisher","DOI":"10.2514\/1.G005794"},{"key":"S0218126625500306BIB017","doi-asserted-by":"publisher","DOI":"10.1016\/j.actaastro.2020.01.007"},{"key":"S0218126625500306BIB018","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2021.3070252"},{"key":"S0218126625500306BIB019","first-page":"8378","volume":"33","author":"Ding D.","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"first-page":"1","volume-title":"7th Int. Conf. Learning Representations","author":"Tessler C.","key":"S0218126625500306BIB020"},{"first-page":"9797","volume-title":"Int. Conf. Machine Learning","author":"Wachi A.","key":"S0218126625500306BIB021"},{"key":"S0218126625500306BIB022","first-page":"6070","volume":"18","author":"Chow Y.","year":"2017","journal-title":"J. Mach. Learn. Res."},{"first-page":"9133","volume-title":"Int. Conf. Machine Learning","author":"Stooke A.","key":"S0218126625500306BIB023"},{"key":"S0218126625500306BIB024","first-page":"3304","volume-title":"Int. Conf. Artificial Intelligence and Statistics","author":"Ding D.","year":"2020"},{"first-page":"22","volume-title":"Int. Conf. Machine Learning","author":"Achiam J.","key":"S0218126625500306BIB025"},{"key":"S0218126625500306BIB026","doi-asserted-by":"publisher","DOI":"10.1109\/TSC.2024.3404347"},{"key":"S0218126625500306BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2023.3274631"},{"key":"S0218126625500306BIB028","first-page":"1","author":"Yu K.","year":"2024","journal-title":"IEEE Int. Things J."},{"key":"S0218126625500306BIB029","first-page":"1","author":"He Q.","year":"2024","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"S0218126625500306BIB030","doi-asserted-by":"publisher","DOI":"10.1109\/JSAC.2023.3310097"},{"key":"S0218126625500306BIB033","doi-asserted-by":"publisher","DOI":"10.21629\/JSEE.2019.01.17"},{"key":"S0218126625500306BIB034","first-page":"1057","volume-title":"Proc. 12th Int. Conf. Neural Information Processing Systems","volume":"12","author":"Sutton R. S."},{"key":"S0218126625500306BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8463189"},{"journal-title":"Artif. Intell.","year":"2017","author":"Henaff M.","key":"S0218126625500306BIB036"},{"key":"S0218126625500306BIB037","doi-asserted-by":"publisher","DOI":"10.1002\/aic.17601"},{"key":"S0218126625500306BIB038","first-page":"2829","volume-title":"Proc. Int. Conf. Machine Learning","volume":"48","author":"Gu S."},{"key":"S0218126625500306BIB039","doi-asserted-by":"publisher","DOI":"10.1109\/JAS.2023.123213"},{"first-page":"1","volume-title":"IEEE Intelligent Vehicles Sympos.","author":"Shen X.","key":"S0218126625500306BIB040"},{"key":"S0218126625500306BIB041","doi-asserted-by":"publisher","DOI":"10.1049\/cth2.12429"}],"container-title":["Journal of Circuits, Systems and Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218126625500306","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,24]],"date-time":"2024-12-24T03:19:25Z","timestamp":1735010365000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218126625500306"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,16]]},"references-count":38,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["10.1142\/S0218126625500306"],"URL":"https:\/\/doi.org\/10.1142\/s0218126625500306","relation":{},"ISSN":["0218-1266","1793-6454"],"issn-type":[{"type":"print","value":"0218-1266"},{"type":"electronic","value":"1793-6454"}],"subject":[],"published":{"date-parts":[[2024,9,16]]},"article-number":"2550030"}}