{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,17]],"date-time":"2026-07-17T12:06:48Z","timestamp":1784290008060,"version":"3.55.0"},"reference-count":24,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2021,1,18]],"date-time":"2021-01-18T00:00:00Z","timestamp":1610928000000},"content-version":"vor","delay-in-days":17,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61973101"],"award-info":[{"award-number":["61973101"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004750","name":"Aeronautical Science Foundation of China","doi-asserted-by":"publisher","award":["20180577005"],"award-info":[{"award-number":["20180577005"]}],"id":[{"id":"10.13039\/501100004750","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Complexity"],"published-print":{"date-parts":[[2021,1]]},"abstract":"<jats:p>A deep reinforcement learning\u2010based computational guidance method is presented, which is used to identify and resolve the problem of collision avoidance for a variable number of fixed\u2010wing UAVs in limited airspace. The cooperative guidance process is first analyzed for multiple aircraft by formulating flight scenarios using multiagent Markov game theory and solving it by machine learning algorithm. Furthermore, a self\u2010learning framework is established by using the actor\u2010critic model, which is proposed to train collision avoidance decision\u2010making neural networks. To achieve higher scalability, the neural network is customized to incorporate long short\u2010term memory networks, and a coordination strategy is given. Additionally, a simulator suitable for multiagent high\u2010density route scene is designed for validation, in which all UAVs run the proposed algorithm onboard. Simulated experiment results from several case studies show that the real\u2010time guidance algorithm can reduce the collision probability of multiple UAVs in flight effectively even with a large number of aircraft.<\/jats:p>","DOI":"10.1155\/2021\/8818013","type":"journal-article","created":{"date-parts":[[2021,1,19]],"date-time":"2021-01-19T03:35:32Z","timestamp":1611027332000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Reinforcement Learning\u2010Based Collision Avoidance Guidance Algorithm for Fixed\u2010Wing UAVs"],"prefix":"10.1155","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7229-0834","authenticated-orcid":false,"given":"Yu","family":"Zhao","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2840-9820","authenticated-orcid":false,"given":"Jifeng","family":"Guo","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0349-9869","authenticated-orcid":false,"given":"Chengchao","family":"Bai","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8685-9080","authenticated-orcid":false,"given":"Hongxing","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"311","published-online":{"date-parts":[[2021,1,18]]},"reference":[{"key":"e_1_2_10_1_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cja.2019.03.026"},{"key":"e_1_2_10_2_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/4757381"},{"key":"e_1_2_10_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2018.2885003"},{"key":"e_1_2_10_4_2","doi-asserted-by":"publisher","DOI":"10.1155\/2018\/8420294"},{"key":"e_1_2_10_5_2","doi-asserted-by":"crossref","unstructured":"AlzugarayI.andSanfeliuA. Learning the hidden human knowledge of UAV pilots when navigating in a cluttered environment for improving path planning Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) October 2016 Deajeon South Korea 1589\u20131594.","DOI":"10.1109\/IROS.2016.7759257"},{"key":"e_1_2_10_6_2","doi-asserted-by":"crossref","unstructured":"CampbellM. SukkariehS. andGoktoganA. Operator decision modeling in cooperative UAV systems Proceedings of the AIAA Guidance Navigation and Control Conference and Exhibit August 2006 Keystone CO USA 1\u201313.","DOI":"10.2514\/6.2006-6213"},{"key":"e_1_2_10_7_2","unstructured":"KopardekarP. RiosJ. PrevotT.et al. Unmanned aircraft system traffic management (UTM) concept of operations Proceedings of the16th AIAA Aviation Technology Integration and Operations Conference June 2016 Washington DC USA 1\u201316."},{"key":"e_1_2_10_8_2","doi-asserted-by":"publisher","DOI":"10.2514\/1.i010243"},{"key":"e_1_2_10_9_2","doi-asserted-by":"crossref","unstructured":"EverettM. ChenY. andHowJ. P. Motion planning among dynamic decision-making agents with deep reinforcement learning Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) October 2018 Madrid Spain 3052\u20133059.","DOI":"10.1109\/IROS.2018.8593871"},{"key":"e_1_2_10_10_2","doi-asserted-by":"publisher","DOI":"10.1243\/09544100jaero546"},{"key":"e_1_2_10_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/6979.898217"},{"key":"e_1_2_10_12_2","first-page":"91","article-title":"Multi-UAV cooperative collision avoidance against uncertain environment","volume":"21","author":"Zhou H.","year":"2014","journal-title":"Electronics Optics and Control"},{"key":"e_1_2_10_13_2","doi-asserted-by":"publisher","DOI":"10.2514\/1.g005000"},{"key":"e_1_2_10_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/9.664154"},{"key":"e_1_2_10_15_2","unstructured":"KravarisT. SpatharisC. BastasA.et al. Resolving congestions in the air traffic management domain via multiagent reinforcement learning methods 2019 https:\/\/arxiv.org\/abs\/1912.06860."},{"key":"e_1_2_10_16_2","doi-asserted-by":"publisher","DOI":"10.1504\/IJAM.2012.045736"},{"key":"e_1_2_10_17_2","doi-asserted-by":"crossref","unstructured":"KeongC. W. ShinH. andTsourdosA. Reinforcement learning for autonomous aircraft avoidance The 2019 International Workshop on Research Education and Development on Unmanned Aerial Systems 2019 Cranfield UK 126\u2013131.","DOI":"10.1109\/REDUAS47371.2019.8999689"},{"key":"e_1_2_10_18_2","first-page":"1","article-title":"Multiagent actor-critic for mixed cooperative-competitive environments","volume":"30","author":"Lowe R.","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_10_19_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_2_10_20_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ast.2020.105783"},{"key":"e_1_2_10_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ifacol.2019.11.062"},{"key":"e_1_2_10_22_2","doi-asserted-by":"crossref","unstructured":"LittmanM. L. Markov games as a framework for multiagent reinforcement learning Proceedings of the Eleventh International Conference on Machine Learning July 1994 Boca Raton FL USA 157\u2013163.","DOI":"10.1016\/B978-1-55860-335-6.50027-1"},{"key":"e_1_2_10_23_2","unstructured":"JangE. GuS. andPooleB. Categorical reparameterization with gumbel-softmax Proceedings of the 5th International Conference on Learning Representations (ICLR) April 2017 Toulon France 1\u201313."},{"key":"e_1_2_10_24_2","doi-asserted-by":"crossref","unstructured":"PradeepP.andWeiP. Energy optimal speed profile for arrival of tandem tilt-wing eVTOL aircraft with RTA constraint Proceedings of the 2018 IEEE CSAA Guidance Navigation and Control Conference August 2018 Beijing China.","DOI":"10.1109\/GNCC42960.2018.9018748"}],"container-title":["Complexity"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/8818013.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/8818013.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2021\/8818013","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,9]],"date-time":"2024-08-09T21:59:14Z","timestamp":1723240754000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2021\/8818013"}},"subtitle":[],"editor":[{"given":"Zhile","family":"Yang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2021,1]]},"references-count":24,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,1]]}},"alternative-id":["10.1155\/2021\/8818013"],"URL":"https:\/\/doi.org\/10.1155\/2021\/8818013","archive":["Portico"],"relation":{},"ISSN":["1076-2787","1099-0526"],"issn-type":[{"value":"1076-2787","type":"print"},{"value":"1099-0526","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1]]},"assertion":[{"value":"2020-08-06","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-01-04","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-01-18","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"8818013"}}