{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,15]],"date-time":"2026-07-15T19:29:11Z","timestamp":1784143751174,"version":"3.55.0"},"reference-count":42,"publisher":"Association for Computing Machinery (ACM)","issue":"1","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["U22B2057, 62471277"],"award-info":[{"award-number":["U22B2057, 62471277"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"ZTE Industry-University-Institute Cooperation","award":["HC-CN-20200923014-B004"],"award-info":[{"award-number":["HC-CN-20200923014-B004"]}]},{"name":"Young Elite Scientists Sponsorship Program by CAST","award":["ZB2025-293"],"award-info":[{"award-number":["ZB2025-293"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2026,1,31]]},"abstract":"<jats:p>The coordinated deployment of multiple Base Stations (BS) and tuning of antenna configuration plays a crucial role in ensuring high-quality communication services, especially in the context of dense 5G BS deployment in megacities. However, traditional optimization methods, such as heuristics and Reinforcement Learning (RL), face challenges in addressing such problems involving the coordination of hundreds of BSs due to their limitations in handling the complexity and scale of large-scale scenarios. To address these challenges, this article proposes the Hierarchical Multi-Agent Proximal Policy Optimization with Representation Learning (HMAPPO-RL). By employing a hierarchical structure, we effectively decouple the optimization problem into two sub-problems: BS deployment and antenna parameter tuning. Different from the step-by-step method of optimizing the BS location and antenna, HMAPPO-RL achieves joint optimization of the two problems through an ingenious interactive mechanism, fully considering the mutual influence of the BS location and antenna. To address the large-scale challenge posed by hundreds of BSs, we utilize the upsampling and downsampling mechanisms of the UNet network to integrate global and local information from large-scale state information for performance enhancement. Since complex environmental information will cause great difficulties for the agent to evaluate the state value in large-scale scenarios, we add a representation learning module to enhance the accuracy of the agent\u2019s state value estimation. The experiments using a precise mobile network simulator demonstrate the superiority of the proposed HMAPPO-RL, offering a comparative analysis with existing state-of-the-art methods. HMAPPO-RL achieves a coverage rate of 91.66% and an average throughput of 4,983,537\u2009bit\/s. These results represent improvements of 3.62% and 6.75% in coverage rate and throughput, respectively, when compared with the MAPPO algorithm.<\/jats:p>","DOI":"10.1145\/3763795","type":"journal-article","created":{"date-parts":[[2025,8,25]],"date-time":"2025-08-25T13:51:32Z","timestamp":1756129892000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Jointly Optimizing Deployment and Antenna of Base Stations Using Hierarchical Reinforcement Learning"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-7270-8522","authenticated-orcid":false,"given":"Weikang","family":"Su","sequence":"first","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9725-4730","authenticated-orcid":false,"given":"Haoqiang","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4343-703X","authenticated-orcid":false,"given":"Tong","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7039-4736","authenticated-orcid":false,"given":"Xingzai","family":"Lv","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Mobile Network and Mobile Multimedia Technology, ZTE Corporation, Shenzhen, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8134-8433","authenticated-orcid":false,"given":"Hua","family":"Rui","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Mobile Network and Mobile Multimedia Technology, ZTE Corporation, Shenzhen, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0454-7516","authenticated-orcid":false,"given":"Wenzhen","family":"Huang","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6150-3846","authenticated-orcid":false,"given":"Zhaocheng","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5617-1659","authenticated-orcid":false,"given":"Yong","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Electronic Engineering, Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2025,11,21]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"crossref","first-page":"105804","DOI":"10.1016\/j.cor.2022.105804","article-title":"A greedy randomized adaptive search procedure (GRASP) for the multi-vehicle prize collecting arc routing for connectivity problem","volume":"143","author":"Souza Almeida Luana","year":"2022","unstructured":"Luana Souza Almeida, Floris Goerlandt, Ronald Pelot, and Kenneth S\u00f6rensen. 2022. A greedy randomized adaptive search procedure (GRASP) for the multi-vehicle prize collecting arc routing for connectivity problem. Computers and Operations Research 143 (2022), 105804.","journal-title":"Computers and Operations Research"},{"issue":"2","key":"e_1_3_1_3_2","doi-asserted-by":"crossref","first-page":"429","DOI":"10.3390\/rs15020429","article-title":"Energy-efficient Multi-UAVs cooperative trajectory optimization for communication coverage: An MADRL approach","volume":"15","author":"Ao Tianyong","year":"2023","unstructured":"Tianyong Ao, Kaixin Zhang, Huaguang Shi, Zhanqi Jin, Yi Zhou, and Fuqiang Liu. 2023. Energy-efficient Multi-UAVs cooperative trajectory optimization for communication coverage: An MADRL approach. Remote Sensing 15, 2 (2023), 429.","journal-title":"Remote Sensing"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/TWC.2021.3123500"},{"issue":"8","key":"e_1_3_1_5_2","doi-asserted-by":"crossref","first-page":"5466","DOI":"10.1109\/TII.2021.3132041","article-title":"5G private network deployment optimization based on RWSSA in open-pit mine","volume":"18","author":"Chang Zhaozhao","year":"2021","unstructured":"Zhaozhao Chang, Qinghua Gu, Caiwu Lu, Yanhong Zhang, Shunling Ruan, and Song Jiang. 2021. 5G private network deployment optimization based on RWSSA in open-pit mine. IEEE Transactions on Industrial Informatics 18, 8 (2021), 5466\u20135476.","journal-title":"IEEE Transactions on Industrial Informatics"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVT.2021.3094273"},{"key":"e_1_3_1_7_2","unstructured":"Christian Schroeder de Witt Tarun Gupta Denys Makoviichuk Viktor Makoviychuk Philip H. S. Torr Mingfei Sun and Shimon Whiteson. 2020. Is independent learning all you need in the starcraft multi-agent challenge? arXiv:2011.09533. Retrieved from https:\/\/arxiv.org\/abs\/2011.09533"},{"issue":"5","key":"e_1_3_1_8_2","doi-asserted-by":"crossref","first-page":"3055","DOI":"10.1109\/TWC.2022.3215941","article-title":"Cell-free UAV networks: Asymptotic analysis and deployment optimization","volume":"22","author":"Diaz-Vilor Carles","year":"2022","unstructured":"Carles Diaz-Vilor, Angel Lozano, and Hamid Jafarkhani. 2022. Cell-free UAV networks: Asymptotic analysis and deployment optimization. IEEE Transactions on Wireless Communications 22, 5 (2022), 3055\u20133070.","journal-title":"IEEE Transactions on Wireless Communications"},{"issue":"12","key":"e_1_3_1_9_2","doi-asserted-by":"crossref","first-page":"10544","DOI":"10.1109\/TWC.2022.3185094","article-title":"Cost-optimal deployment of millimeter-wave base stations under outage requirement","volume":"21","author":"Dong Miaomiao","year":"2022","unstructured":"Miaomiao Dong, Minsung Cho, Kangeun Lee, Sungrok Yoon, and Taejoon Kim. 2022. Cost-optimal deployment of millimeter-wave base stations under outage requirement. IEEE Transactions on Wireless Communications 21, 12 (2022), 10544\u201310559.","journal-title":"IEEE Transactions on Wireless Communications"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/3581791.3597297"},{"issue":"10","key":"e_1_3_1_11_2","doi-asserted-by":"crossref","first-page":"10487","DOI":"10.1016\/j.jksuci.2022.11.004","article-title":"QoS-aware energy-efficient microBase station deployment for 5G-enabled HetNets","volume":"34","author":"Guo Wanying","year":"2022","unstructured":"Wanying Guo, Jahwan Koo, Isma Farah Siddiqui, Nawab Muhammad Faseeh Qureshi, and Dong Ryeol Shin. 2022. QoS-aware energy-efficient microBase station deployment for 5G-enabled HetNets. Journal of King Saud University-Computer and Information Sciences 34, 10 (2022), 10487\u201310495.","journal-title":"Journal of King Saud University-Computer and Information Sciences"},{"key":"e_1_3_1_12_2","first-page":"01044","volume-title":"SHS Web of Conferences","author":"Han Shifen","year":"2022","unstructured":"Shifen Han and Li Xiao. 2022. An improved adaptive genetic algorithm. In SHS Web of Conferences. EDP Sciences, 01044."},{"issue":"3","key":"e_1_3_1_13_2","first-page":"2727","article-title":"Efficient resource allocation for multi-beam satellite-terrestrial vehicular networks: A multi-agent actor-critic method with attention mechanism","volume":"23","author":"He Ying","year":"2021","unstructured":"Ying He, Yuhang Wang, F. Richard Yu, Qiuzhen Lin, Jianqiang Li, and Victor C. M. Leung. 2021. Efficient resource allocation for multi-beam satellite-terrestrial vehicular networks: A multi-agent actor-critic method with attention mechanism. IEEE Transactions on Intelligent Transportation Systems 23, 3 (2021), 2727\u20132738.","journal-title":"IEEE Transactions on Intelligent Transportation Systems"},{"key":"e_1_3_1_14_2","first-page":"1","volume-title":"2022 Antenna Measurement Techniques Association Symposium (AMTA)","author":"Kim Jaehoon","year":"2022","unstructured":"Jaehoon Kim. 2022. 5G base-station network optimization in urban wireless scenario using machine learning. In 2022 Antenna Measurement Techniques Association Symposium (AMTA). IEEE, 1\u20134."},{"issue":"7","key":"e_1_3_1_15_2","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1109\/LWC.2023.3266011","article-title":"Interference-aware deployment for maximizing user satisfaction in multi-UAV wireless networks","volume":"12","author":"Lai Chuan-Chi","year":"2023","unstructured":"Chuan-Chi Lai, Ang-Hsun Tsai, Chia-Wei Ting, Ko-Han Lin, Jing-Chi Ling, and Chia-En Tsai. 2023. Interference-aware deployment for maximizing user satisfaction in multi-UAV wireless networks. IEEE Wireless Communications Letters 12, 7 (2023), 1189\u20131193.","journal-title":"IEEE Wireless Communications Letters"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/WCNC51071.2022.9771979"},{"issue":"10","key":"e_1_3_1_17_2","doi-asserted-by":"crossref","first-page":"10894","DOI":"10.1109\/TVT.2022.3182908","article-title":"Autonomous non-terrestrial base station deployment for non-terrestrial networks: A reinforcement learning approach","volume":"71","author":"Lien Shao-Yu","year":"2022","unstructured":"Shao-Yu Lien and Der-Jiunn Deng. 2022. Autonomous non-terrestrial base station deployment for non-terrestrial networks: A reinforcement learning approach. IEEE Transactions on Vehicular Technology 71, 10 (2022), 10894\u201310909.","journal-title":"IEEE Transactions on Vehicular Technology"},{"issue":"19","key":"e_1_3_1_18_2","doi-asserted-by":"crossref","first-page":"18539","DOI":"10.1109\/JIOT.2022.3161260","article-title":"Optimal tethered-UAV deployment in A2G communication networks: Multi-agent Q-learning approach","volume":"9","author":"Lim Suhyeon","year":"2022","unstructured":"Suhyeon Lim, Heejung Yu, and Howon Lee. 2022. Optimal tethered-UAV deployment in A2G communication networks: Multi-agent Q-learning approach. IEEE Internet of Things Journal 9, 19 (2022), 18539\u201318549.","journal-title":"IEEE Internet of Things Journal"},{"issue":"5","key":"e_1_3_1_19_2","doi-asserted-by":"crossref","first-page":"1882","DOI":"10.1007\/s40815-023-01480-7","article-title":"An efficient intuitionistic fuzzy sets base stations deployment strategy in internet of things systems","volume":"25","author":"Lin Zhen-Yin","year":"2023","unstructured":"Zhen-Yin Lin, Jau-Yang Chang, and Jin-Tsong Jeng. 2023. An efficient intuitionistic fuzzy sets base stations deployment strategy in internet of things systems. International Journal of Fuzzy Systems 25, 5 (2023), 1882\u20131894.","journal-title":"International Journal of Fuzzy Systems"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCCN.2022.3167549"},{"key":"e_1_3_1_21_2","first-page":"6380","article-title":"Multi-agent actor-critic for mixed cooperative-competitive environments","author":"Lowe Ryan","year":"2017","unstructured":"Ryan Lowe, Yi. I. Wu, Aviv Tamar, Jean Harb, OpenAI Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in Neural Information Processing Systems 30 (2017), 6380\u20136391.","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"4","key":"e_1_3_1_22_2","doi-asserted-by":"crossref","first-page":"4729","DOI":"10.1109\/TAES.2023.3237994","article-title":"Base station antenna uptilt optimization for cellular-connected drone corridors","volume":"59","author":"Maeng Sung Joon","year":"2023","unstructured":"Sung Joon Maeng, Md Moin Uddin Chowdhury, \u0130smail G\u00fcven\u00e7, Arupjyoti Bhuyan, and Huaiyu Dai. 2023. Base station antenna uptilt optimization for cellular-connected drone corridors. IEEE Transactions on Aerospace and Electronic Systems 59, 4 (2023), 4729\u20134737.","journal-title":"IEEE Transactions on Aerospace and Electronic Systems"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11276-023-03273-0"},{"issue":"16","key":"e_1_3_1_24_2","doi-asserted-by":"crossref","first-page":"15372","DOI":"10.1109\/JIOT.2022.3150184","article-title":"Single-and multiagent actor\u2013critic for initial UAV\u2019s deployment and 3-D trajectory design","volume":"9","author":"Nasr-Azadani Maedeh","year":"2022","unstructured":"Maedeh Nasr-Azadani, Jamshid Abouei, and Konstantinos N. Plataniotis. 2022. Single-and multiagent actor\u2013critic for initial UAV\u2019s deployment and 3-D trajectory design. IEEE Internet of Things Journal 9, 16 (2022), 15372\u201315389.","journal-title":"IEEE Internet of Things Journal"},{"issue":"8","key":"e_1_3_1_25_2","doi-asserted-by":"crossref","first-page":"1590","DOI":"10.1109\/LWC.2022.3167568","article-title":"Optimizing energy efficiency in UAV-assisted networks using deep reinforcement learning","volume":"11","author":"Omoniwa Babatunji","year":"2022","unstructured":"Babatunji Omoniwa, Boris Galkin, and Ivana Dusparic. 2022. Optimizing energy efficiency in UAV-assisted networks using deep reinforcement learning. IEEE Wireless Communications Letters 11, 8 (2022), 1590\u20131594.","journal-title":"IEEE Wireless Communications Letters"},{"key":"e_1_3_1_26_2","first-page":"1","volume-title":"2022 International Symposium on Wireless Communication Systems (ISWCS)","author":"Ouyang Chongjun","year":"2022","unstructured":"Chongjun Ouyang, Hao Xu, Xujie Zang, and Hongwen Yang. 2022. Exploiting lens antenna arrays in uplink mmWave MU-MIMO networks: Joint beamforming optimization. In 2022 International Symposium on Wireless Communication Systems (ISWCS). IEEE, 1\u20136."},{"issue":"4","key":"e_1_3_1_27_2","doi-asserted-by":"crossref","first-page":"5049","DOI":"10.1109\/TVT.2022.3224304","article-title":"3-D deployment and trajectory planning for relay based UAV assisted cooperative communication for emergency scenarios using Dijkstra\u2019s algorithm","volume":"72","author":"Nelapati Lava Prasad","year":"2022","unstructured":"Nelapati Lava, Prasad and Barathram Ramkumar. 2022. 3-D deployment and trajectory planning for relay based UAV assisted cooperative communication for emergency scenarios using Dijkstra\u2019s algorithm. IEEE Transactions on Vehicular Technology 72, 4 (2022), 5049\u20135063.","journal-title":"IEEE Transactions on Vehicular Technology"},{"issue":"12","key":"e_1_3_1_28_2","doi-asserted-by":"crossref","first-page":"12290","DOI":"10.1109\/TVT.2021.3117792","article-title":"Distributed UAV-BSs trajectory optimization for user-level fair communication service with multi-agent deep reinforcement learning","volume":"70","author":"Qin Zhenquan","year":"2021","unstructured":"Zhenquan Qin, Zhonghao Liu, Guangjie Han, Chuan Lin, Linlin Guo, and Ling Xie. 2021. Distributed UAV-BSs trajectory optimization for user-level fair communication service with multi-agent deep reinforcement learning. IEEE Transactions on Vehicular Technology 70, 12 (2021), 12290\u201312301.","journal-title":"IEEE Transactions on Vehicular Technology"},{"issue":"3","key":"e_1_3_1_29_2","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1007\/s41870-023-01210-0","article-title":"Key performance indicators analysis for 4 G-LTE cellular networks based on real measurements","volume":"15","author":"Shakir Zaenab","year":"2023","unstructured":"Zaenab Shakir, Ahmed Yaseen Mjhool, Abbas Al-Thaedan, Ali Al-Sabbagh, and Ruaa Alsabah. 2023. Key performance indicators analysis for 4 G-LTE cellular networks based on real measurements. International Journal of Information Technology 15, 3 (2023), 1347\u20131355.","journal-title":"International Journal of Information Technology"},{"issue":"2","key":"e_1_3_1_30_2","doi-asserted-by":"crossref","first-page":"1884","DOI":"10.1109\/TNSM.2022.3218819","article-title":"Energy efficient clustering and resource allocation strategy for ultra-dense networks: A machine learning framework","volume":"20","author":"Sharma Nidhi","year":"2022","unstructured":"Nidhi Sharma and Krishan Kumar. 2022. Energy efficient clustering and resource allocation strategy for ultra-dense networks: A machine learning framework. IEEE Transactions on Network and Service Management 20, 2 (2022), 1884\u20131897.","journal-title":"IEEE Transactions on Network and Service Management"},{"key":"e_1_3_1_31_2","doi-asserted-by":"crossref","first-page":"2050","DOI":"10.1109\/GLOBECOM48099.2022.10001704","volume-title":"GLOBECOM 2022-2022 IEEE Global Communications Conference","author":"Shen Linzhi","year":"2022","unstructured":"Linzhi Shen and Shaowei Wang. 2022. An efficient codebook based radio parameter optimization method for mobile networks. In GLOBECOM 2022-2022 IEEE Global Communications Conference. IEEE, 2050\u20132055."},{"issue":"5","key":"e_1_3_1_32_2","first-page":"3057","article-title":"Multi-agent deep reinforcement learning for massive access in 5G and beyond ultra-dense NOMA system","volume":"21","author":"Shi Zhenjiang","year":"2021","unstructured":"Zhenjiang Shi, Jiajia Liu, Shangwei Zhang, and Nei Kato. 2021. Multi-agent deep reinforcement learning for massive access in 5G and beyond ultra-dense NOMA system. IEEE Transactions on Wireless Communications 21, 5 (2021), 3057\u20133070.","journal-title":"IEEE Transactions on Wireless Communications"},{"issue":"6","key":"e_1_3_1_33_2","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1109\/MWC.2006.275194","article-title":"Automated optimization of service coverage and base station antenna configuration in UMTS networks","volume":"13","author":"Siomina Iana","year":"2007","unstructured":"Iana Siomina, Peter Varbrand, and Di Yuan. 2007. Automated optimization of service coverage and base station antenna configuration in UMTS networks. IEEE Wireless Communications 13, 6 (2007), 16\u201325.","journal-title":"IEEE Wireless Communications"},{"issue":"2","key":"e_1_3_1_34_2","doi-asserted-by":"crossref","first-page":"1021","DOI":"10.1109\/TVT.2007.905347","article-title":"Comparison of MIMO antenna configurations: Methods and experimental results","volume":"57","author":"Suvikunnas Pasi","year":"2008","unstructured":"Pasi Suvikunnas, Jari Salo, Lasse Vuokko, Jarmo Kivinen, Kati Sulonen, and Pertti Vainikainen. 2008. Comparison of MIMO antenna configurations: Methods and experimental results. IEEE Transactions on Vehicular Technology 57, 2 (2008), 1021\u20131031.","journal-title":"IEEE Transactions on Vehicular Technology"},{"issue":"21","key":"e_1_3_1_35_2","doi-asserted-by":"crossref","first-page":"21899","DOI":"10.1109\/JIOT.2022.3182633","article-title":"Deep-reinforcement-learning-based drone base station deployment for wireless communication services","volume":"9","author":"Tarekegn Getaneh Berie","year":"2022","unstructured":"Getaneh Berie Tarekegn, Rong-Terng Juang, Hsin-Piao Lin, Yirga Yayeh Munaye, Li-Chun Wang, and Mekuanint Agegnehu Bitew. 2022. Deep-reinforcement-learning-based drone base station deployment for wireless communication services. IEEE Internet of Things Journal 9, 21 (2022), 21899\u201321915.","journal-title":"IEEE Internet of Things Journal"},{"issue":"7","key":"e_1_3_1_36_2","doi-asserted-by":"crossref","first-page":"5953","DOI":"10.1109\/TAP.2022.3161285","article-title":"Design of wideband base station antenna by involving fragment-type structures on dipole arms","volume":"70","author":"Wang Dong","year":"2022","unstructured":"Dong Wang, Gang Wang, Diqun Lu, Nan Yang, and Qingfu Zhang. 2022. Design of wideband base station antenna by involving fragment-type structures on dipole arms. IEEE Transactions on Antennas and Propagation 70, 7 (2022), 5953\u20135958.","journal-title":"IEEE Transactions on Antennas and Propagation"},{"key":"e_1_3_1_37_2","article-title":"Multiple aerial base station deployment and user association based on binary radio map","author":"Xia Xiaochen","year":"2023","unstructured":"Xiaochen Xia, Kui Xu, Wei Xie, Youyun Xu, Nan Sha, and Yurong Wang. 2023. Multiple aerial base station deployment and user association based on binary radio map. IEEE Internet of Things Journal (2023).","journal-title":"IEEE Internet of Things Journal"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/VTC2022-Fall57202.2022.10012752"},{"key":"e_1_3_1_39_2","article-title":"Multi-UAV navigation for partially observable communication coverage by graph reinforcement learning","author":"Ye Zhenhui","year":"2022","unstructured":"Zhenhui Ye, Ke Wang, Yining Chen, Xiaohong Jiang, and Guanghua Song. 2022. Multi-UAV navigation for partially observable communication coverage by graph reinforcement learning. IEEE Transactions on Mobile Computing (2022).","journal-title":"IEEE Transactions on Mobile Computing"},{"key":"e_1_3_1_40_2","first-page":"24611","article-title":"The surprising effectiveness of ppo in cooperative multi-agent games","volume":"35","author":"Yu Chao","year":"2022","unstructured":"Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, and Yi Wu. 2022. The surprising effectiveness of ppo in cooperative multi-agent games. Advances in Neural Information Processing Systems 35 (2022), 24611\u201324624.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_1_41_2","first-page":"330","volume-title":"2023 IEEE Symposium on Electromagnetic Compatibility and Signal\/Power Integrity (EMC+ SIPI)","author":"Zhang Chenxi","year":"2023","unstructured":"Chenxi Zhang, Feng Gao, Wentao Zhu, Fei Liu, Kankan Jin, and Jinpeng Xu. 2023. Research and application of coverage compensation rapid optimization based on antenna weights. In 2023 IEEE Symposium on Electromagnetic Compatibility and Signal\/Power Integrity (EMC+ SIPI). IEEE, 330\u2013334."},{"issue":"10","key":"e_1_3_1_42_2","doi-asserted-by":"crossref","first-page":"2309","DOI":"10.1109\/LWC.2021.3100388","article-title":"Joint 3D deployment and power allocation for UAV-BS: A deep reinforcement learning approach","volume":"10","author":"Zhang Meng","year":"2021","unstructured":"Meng Zhang, Shu Fu, and Qilin Fan. 2021. Joint 3D deployment and power allocation for UAV-BS: A deep reinforcement learning approach. IEEE Wireless Communications Letters 10, 10 (2021), 2309\u20132312.","journal-title":"IEEE Wireless Communications Letters"},{"issue":"8","key":"e_1_3_1_43_2","doi-asserted-by":"crossref","first-page":"5868","DOI":"10.1109\/JIOT.2021.3066368","article-title":"QoE-driven adaptive deployment strategy of multi-UAV networks based on hybrid deep reinforcement learning","volume":"9","author":"Zhou Yi","year":"2021","unstructured":"Yi Zhou, Xiaoyong Ma, Shuting Hu, Danyang Zhou, Nan Cheng, and Ning Lu. 2021. QoE-driven adaptive deployment strategy of multi-UAV networks based on hybrid deep reinforcement learning. IEEE Internet of Things Journal 9, 8 (2021), 5868\u20135881.","journal-title":"IEEE Internet of Things Journal"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3763795","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T20:09:08Z","timestamp":1763755748000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3763795"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,21]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,1,31]]}},"alternative-id":["10.1145\/3763795"],"URL":"https:\/\/doi.org\/10.1145\/3763795","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,21]]},"assertion":[{"value":"2024-01-20","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-15","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-11-21","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}