{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T21:03:21Z","timestamp":1767647001704,"version":"3.48.0"},"reference-count":67,"publisher":"Springer Science and Business Media LLC","issue":"10","license":[{"start":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T00:00:00Z","timestamp":1763337600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T00:00:00Z","timestamp":1763337600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100015401","name":"Key Research and Development Projects of Shaanxi Province","doi-asserted-by":"publisher","award":["2024GX-YBXM-001"],"award-info":[{"award-number":["2024GX-YBXM-001"]}],"id":[{"id":"10.13039\/501100015401","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J. King Saud Univ. Comput. Inf. Sci."],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Robotaxi services\u2013autonomous, on-demand ride-hailing using self-driving vehicles\u2013are rapidly expanding as technology and shared mobility models advance. However, robotaxi dispatching is challenged by the unpredictable and dynamic nature of urban traffic and fluctuating passenger demand. Existing ride-hailing platforms advocate for \u201cone city, one policy\u201d approaches, which tailor strategies to the unique supply, demand, and influencing factors of each city. They face significant barriers to rapid and large-scale scalability across diverse urban environments. This paper introduces the Cross-city Multi-Agent Policy Transfer (CcMAPT) method to generalize and adapt multi-vehicle dispatching policies for new cities while considering variations in local regulations and demand patterns. By employing matching and distillation techniques, CcMAPT identifies shared environmental dynamics across different cities, thereby complicating rapid scaling and consistent service quality. Experimental results based on real travel data indicate that CcMAPT significantly enhances both the total revenue for robotaxis and the order response rate in Chengdu, Beijing, and Haikou in China. Furthermore, the City Ride-hailing Transportation Simulator (CRTS) has been developed to simulate large-scale, dynamic urban travel environments, thus facilitating the testing and analysis of various vehicle dispatching policies. Validation via the CRTS also demonstrates that CcMAPT possesses robustness across varying city scales, vehicle quantities, and dispatch complexities.<\/jats:p>","DOI":"10.1007\/s44443-025-00342-6","type":"journal-article","created":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T10:37:48Z","timestamp":1763375868000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Cross-city robotaxi dispatch in ride-hailing platforms via policy distillation"],"prefix":"10.1007","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5465-610X","authenticated-orcid":false,"given":"Lei","family":"Tang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4920-1199","authenticated-orcid":false,"given":"Yaling","family":"Zhao","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0002-2383-3681","authenticated-orcid":false,"given":"Zeyu","family":"He","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2368-8947","authenticated-orcid":false,"given":"Zhu","family":"Wang","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0000-2860-2159","authenticated-orcid":false,"given":"Ying","family":"Liu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6944-7511","authenticated-orcid":false,"given":"Junchi","family":"Ma","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8802-753X","authenticated-orcid":false,"given":"Jiahao","family":"Yu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"key":"342_CR1","doi-asserted-by":"crossref","unstructured":"Ammar HB, Eaton E, Ruvolo P, Taylor M (2015) Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment. In: Proceedings of the AAAI conference on artificial intelligence, vol. 29","DOI":"10.1609\/aaai.v29i1.9631"},{"issue":"1","key":"342_CR2","doi-asserted-by":"publisher","first-page":"73","DOI":"10.3233\/AIC-201575","volume":"34","author":"A Castagna","year":"2021","unstructured":"Castagna A, Gu\u00e9riau M, Vizzari G, Dusparic I, Lujak M, Dusparic I, Kl\u00fcgl F, Vizzari G (2021) Demand-responsive rebalancing zone generation for reinforcement learning-based on-demand mobility. AI Commun 34(1):73\u201388. https:\/\/doi.org\/10.3233\/AIC-201575","journal-title":"AI Commun"},{"key":"342_CR3","doi-asserted-by":"publisher","unstructured":"Castagna A, Dusparic I (2022) Multi-agent transfer learning in reinforcement learning-based ride-sharing systems. In: Rocha A, Steels L, VandenHerik J (eds) ICAART: proceedings of the 14th international conference on agents and artificial intelligence - vol 2. ICAART. pp 120\u2013130. https:\/\/doi.org\/10.5220\/0010785200003116","DOI":"10.5220\/0010785200003116"},{"key":"342_CR4","doi-asserted-by":"publisher","unstructured":"Chaudhari HA, Byers JW, Terzi E (2020) Learn to earn: Enabling coordination within a ride-hailing fleet. In: 2020 IEEE international conference on big data (big data), pp 1127\u20131136. https:\/\/doi.org\/10.1109\/BigData50022.2020.9378416","DOI":"10.1109\/BigData50022.2020.9378416"},{"issue":"1","key":"342_CR5","doi-asserted-by":"publisher","first-page":"799","DOI":"10.1002\/int.22648","volume":"37","author":"Z Cheng","year":"2021","unstructured":"Cheng Z, Ye D, Zhu T, Zhou W, Yu PS, Zhu C (2021) Multi-agent reinforcement learning via knowledge transfer with differentially private noise. Int J Intell Syst 37(1):799\u2013828. https:\/\/doi.org\/10.1002\/int.22648","journal-title":"Int J Intell Syst"},{"key":"342_CR6","doi-asserted-by":"crossref","unstructured":"Clemente ML (2022) Building a real-world traffic micro-simulation scenario from scratch with sumo. In: SUMO conference proceedings, vol. 3, pp 215\u2013230","DOI":"10.52825\/scp.v3i.109"},{"key":"342_CR7","doi-asserted-by":"publisher","unstructured":"Dai J, Li R, Liu Z, Lin S (2021) Impacts of the introduction of autonomous taxi on travel behaviors of the experienced user: Evidence from a one-year paid taxi service in guangzhou, china. Transp Res Part C-Emerg Technol 130. https:\/\/doi.org\/10.1016\/j.trc.2021.103311","DOI":"10.1016\/j.trc.2021.103311"},{"key":"342_CR8","doi-asserted-by":"publisher","unstructured":"Dai G, Yi W, Cao J, Gong Z, Fu X, Zhang B (2025) Crrl: contrastive region relevance learning framework for cross-city traffic prediction. Inf Fusion 122(C). https:\/\/doi.org\/10.1016\/j.inffus.2025.103215","DOI":"10.1016\/j.inffus.2025.103215"},{"key":"342_CR9","unstructured":"De\u00a0Witt CS, Gupta T, Makoviichuk D, Makoviychuk V, Torr PH, Sun M, Whiteson S (2020) Is independent learning all you need in the starcraft multi-agent challenge? arXiv:2011.09533"},{"key":"342_CR10","unstructured":"Fujimoto S, Hoof H, Meger D (2018) Addressing function approximation error in actor-critic methods. In: Dy J, Krause A (eds) Proceedings of the 35th international conference on machine learning. proceedings of machine learning research, vol. 80. PMLR, Stockholmsm\u00e4ssan, Stockholm, Sweden, pp 1587\u20131596"},{"key":"342_CR11","doi-asserted-by":"publisher","unstructured":"Gao Z, Xu K, Ding B, Wang H (2021) Knowru: knowledge reuse via knowledge distillation in multi-agent reinforcement learning. Entropy 23(8). https:\/\/doi.org\/10.3390\/e23081043","DOI":"10.3390\/e23081043"},{"issue":"C","key":"342_CR12","doi-asserted-by":"publisher","first-page":"104","DOI":"10.1016\/j.neunet.2023.08.025","volume":"167","author":"H Gui","year":"2023","unstructured":"Gui H, Pang S, Yu S, Qiao S, Qi Y, He X, Wang M, Zhai X (2023) Cross-domain policy adaptation with dynamics alignment. Neural Netw 167(C):104\u2013117. https:\/\/doi.org\/10.1016\/j.neunet.2023.08.025","journal-title":"Neural Netw"},{"issue":"7","key":"342_CR13","doi-asserted-by":"publisher","first-page":"10780","DOI":"10.1109\/TITS.2025.3552111","volume":"26","author":"C Guo","year":"2025","unstructured":"Guo C, Hwang F-J, Chen C-H, Chang C-C, Chang C-C (2025) Cluster-granularity spatiotemporal transfer for cross-region graph-based traffic forecasting. IEEE Trans Intell Transp Syst 26(7):10780\u201310794. https:\/\/doi.org\/10.1109\/TITS.2025.3552111","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"342_CR14","unstructured":"Guo Y, Wang Y, Shi Y, Xu P, Liu A (2025) Off-dynamics reinforcement learning via domain adaptation and reward augmented imitation. In: Proceedings of the 38th international conference on neural information processing systems. NIPS \u201924. Curran Associates Inc., Red Hook, NY, USA"},{"key":"342_CR15","doi-asserted-by":"publisher","unstructured":"Herfort B, Lautenbach S, Albuquerque JP, Anderson J, Zipf A (2023) A spatio-temporal analysis investigating completeness and inequalities of global urban building data in openstreetmap. Nat Commun 14(1). https:\/\/doi.org\/10.1038\/s41467-023-39698-6","DOI":"10.1038\/s41467-023-39698-6"},{"issue":"12","key":"342_CR16","doi-asserted-by":"publisher","first-page":"14294","DOI":"10.1109\/TITS.2023.3302014","volume":"24","author":"X Huang","year":"2023","unstructured":"Huang X, Ling J, Yang X, Zhang X, Yang K (2023) Multi-agent mix hierarchical deep reinforcement learning for large-scale fleet management. IEEE Trans Intell Transp Syst 24(12):14294\u201314305. https:\/\/doi.org\/10.1109\/TITS.2023.3302014","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"342_CR17","doi-asserted-by":"publisher","unstructured":"Hu X, Wang Q, Zhang W, Xu C (2023) Rgmarl: vehicle dispatching based on road information and supply-demand distribution. In: Proceedings of the 2023 IEEE 26th international conference on intelligent transportation systems (ITSC). IEEE, Bilbao, Spain, pp 1487\u20131493. https:\/\/doi.org\/10.1109\/ITSC57777.2023.10422182","DOI":"10.1109\/ITSC57777.2023.10422182"},{"key":"342_CR18","doi-asserted-by":"publisher","unstructured":"Jin Y, Chen K, Yang Q (2022) Selective cross-city transfer learning for traffic prediction via source city region re-weighting. In: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining. KDD \u201922. Association for Computing Machinery, New York, NY, USA, pp 731\u2013741. https:\/\/doi.org\/10.1145\/3534678.3539250","DOI":"10.1145\/3534678.3539250"},{"key":"342_CR19","doi-asserted-by":"publisher","first-page":"111211","DOI":"10.1016\/j.engappai.2025.111211","volume":"158","author":"H Khan","year":"2025","unstructured":"Khan H, Ullah Khan S, Ullah W, Wook Baik S (2025) Optimal features driven hybrid attention network for effective video summarization. Eng Appl Artif Intell 158:111211. https:\/\/doi.org\/10.1016\/j.engappai.2025.111211","journal-title":"Eng Appl Artif Intell"},{"key":"342_CR20","unstructured":"Kim D-K, Liu M, Omidshafiei S, Lopez-Cot S, Riemer M, Habibi G, Tesauro G, Mourad S, Campbell M, How JP (2020) Learning hierarchical teaching policies for cooperative agents. In: Proceedings of the 19th international conference on Autonomous Agents and MultiAgent Systems. AAMAS \u201920. International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC, pp 620\u2013628"},{"key":"342_CR21","doi-asserted-by":"publisher","unstructured":"Kim J, Kim K (2021) Optimizing large-scale fleet management on a road network using multi-agent deep reinforcement learning with graph neural network. In: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC). pp 990\u2013995. https:\/\/doi.org\/10.1109\/ITSC48978.2021.9565029","DOI":"10.1109\/ITSC48978.2021.9565029"},{"key":"342_CR22","unstructured":"Lillicrap TP, Hunt JJ, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2015) Continuous control with deep reinforcement learning. arXiv:1509.02971"},{"key":"342_CR23","doi-asserted-by":"publisher","unstructured":"Lin K, Zhao R, Xu Z, Zhou J (2018) Efficient large-scale fleet management via multi-agent deep reinforcement learning. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. KDD \u201918. Association for Computing Machinery, New York, NY, USA, pp 1774\u20131783. https:\/\/doi.org\/10.1145\/3219819.3219993","DOI":"10.1145\/3219819.3219993"},{"key":"342_CR24","doi-asserted-by":"publisher","unstructured":"Li M, Qin Z, Jiao Y, Yang Y, Wang J, Wang C, Wu G, Ye J (2019) Efficient ridesharing order dispatching with mean field multi-agent reinforcement learning. In: The World Wide Web Conference. WWW \u201919. Association for Computing Machinery, New York, NY, USA, pp 983\u2013994. https:\/\/doi.org\/10.1145\/3308558.3313433","DOI":"10.1145\/3308558.3313433"},{"issue":"C","key":"342_CR25","doi-asserted-by":"publisher","first-page":"454","DOI":"10.1016\/j.ins.2021.11.054","volume":"585","author":"K Liu","year":"2022","unstructured":"Liu K, Zhao Y, Wang G, Peng B (2022) Self-attention-based multi-agent continuous control method in cooperative environments. Inf Sci 585(C):454\u2013470. https:\/\/doi.org\/10.1016\/j.ins.2021.11.054","journal-title":"Inf Sci"},{"key":"342_CR26","doi-asserted-by":"publisher","unstructured":"Liu C, Sun J, Jin H, Ai M, Li Q, Zhang C, Sheng K, Wu G, Qie X, Wang X (2020) Spatio-temporal hierarchical adaptive dispatching for ridesharing systems. In: Proceedings of the 28th international conference on advances in geographic information systems. SIGSPATIAL \u201920. Association for Computing Machinery, New York, NY, USA, pp 227\u2013238. https:\/\/doi.org\/10.1145\/3397536.3422212","DOI":"10.1145\/3397536.3422212"},{"issue":"1\u20132","key":"342_CR27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s44443-025-00016-3","volume":"37","author":"J Luo","year":"2025","unstructured":"Luo J, Li F, Jiao J (2025) A dynamic multiobjective recommendation method based on soft actor-critic with discrete actions. J King Saud University Comput Inf Sci 37(1\u20132):1. https:\/\/doi.org\/10.1007\/s44443-025-00016-3","journal-title":"J King Saud University Comput Inf Sci"},{"key":"342_CR28","unstructured":"Lyu J, Bai C, Yang J, Lu Z, Li X (2024) Cross-domain policy adaptation by capturing representation mismatch. In: International conference on machine learning"},{"key":"342_CR29","doi-asserted-by":"publisher","unstructured":"Mo H, Wei L (2025) Lightweight detection of cotton leaf diseases using stylegan2-ada and decoupled focused self-attention. J King Saud University \u2014 Comput Inf Sci 37(4):1\u201322. https:\/\/doi.org\/10.1007\/s44443-025-00054-x","DOI":"10.1007\/s44443-025-00054-x"},{"key":"342_CR30","unstructured":"Ng AY, Jordan MI, Weiss Y (2001) On spectral clustering: analysis and an algorithm. In: Proceedings of the 15th international conference on neural information processing systems: natural and synthetic. NIPS\u201901. MIT Press, Cambridge, MA, USA, pp 849\u2013856"},{"key":"342_CR31","doi-asserted-by":"publisher","unstructured":"Niu H, Chen Q, Liu T, Li J, Zhou G, Zhang Y, Hu J, Zhan X (2024) xted: Cross-domain adaptation via diffusion-based trajectory editing. In: NeurIPS 2024 workshop on open-world agents: synnergizing reasoning and decision-making in open-world environments (OWA). https:\/\/doi.org\/10.48550\/arXiv.2409.08687","DOI":"10.48550\/arXiv.2409.08687"},{"key":"342_CR32","doi-asserted-by":"publisher","unstructured":"Niu H, Hu J, Zhou G, Zhan X (2024) A comprehensive survey of cross-domain policy transfer for embodied agents. In: Larson K (ed) Proceedings of the thirty-third International Joint Conference on Artificial Intelligence (IJCAI-24). International Joint Conferences on Artificial Intelligence Organization, Jeju, Republic of Korea, pp 8197\u20138206. https:\/\/doi.org\/10.24963\/ijcai.2024\/906","DOI":"10.24963\/ijcai.2024\/906"},{"key":"342_CR33","doi-asserted-by":"publisher","unstructured":"Qin R, Chen F, Wang T, Yuan L, Wu X, Kang Y, Zhang Z, Zhang C, Yu Y (2024) Multi-agent policy transfer via task relationship modeling. Sci China-Inf Sci 67(8). https:\/\/doi.org\/10.1007\/s11432-023-3862-1","DOI":"10.1007\/s11432-023-3862-1"},{"issue":"3","key":"342_CR34","doi-asserted-by":"publisher","first-page":"1699","DOI":"10.1109\/TCYB.2021.3108237","volume":"53","author":"H Shi","year":"2023","unstructured":"Shi H, Li J, Mao J, Hwang K-S (2023) Lateral transfer learning for multiagent reinforcement learning. IEEE Trans Cybernet 53(3):1699\u20131711. https:\/\/doi.org\/10.1109\/TCYB.2021.3108237","journal-title":"IEEE Trans Cybernet"},{"key":"342_CR35","doi-asserted-by":"publisher","unstructured":"Shi J, Malik J (1997) Normalized cuts and image segmentation. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition. pp 731\u2013737. https:\/\/doi.org\/10.1109\/CVPR.1997.609407","DOI":"10.1109\/CVPR.1997.609407"},{"key":"342_CR36","doi-asserted-by":"publisher","first-page":"103551","DOI":"10.1016\/j.tre.2024.103551","volume":"186","author":"J Si","year":"2024","unstructured":"Si J, He F, Lin X, Tang X (2024) Vehicle dispatching and routing of on-demand intercity ride-pooling services: a multi-agent hierarchical reinforcement learning approach. Transp Res Part E Logistics Transp Rev 186:103551. https:\/\/doi.org\/10.1016\/j.tre.2024.103551","journal-title":"Transp Res Part E Logistics Transp Rev"},{"key":"342_CR37","doi-asserted-by":"crossref","unstructured":"Sohrabi S, Dadashova B, Khreis H, Sener IN, Zmud J (2021) Quantifying the health and health equity impacts of autonomous vehicles: A conceptual framework and literature review. J Transp Health 22","DOI":"10.1016\/j.jth.2021.101120"},{"issue":"7","key":"342_CR38","doi-asserted-by":"publisher","first-page":"8873","DOI":"10.1109\/JIOT.2024.3504544","volume":"12","author":"G Sun","year":"2025","unstructured":"Sun G, Maale GT, Zhang H, Ayepah-Mensah D, Jiang J (2025) Fedcruise: collaborative cruise guidance with federated policy distillation in multiple ride-hailing platforms. IEEE Internet Things J 12(7):8873\u20138884. https:\/\/doi.org\/10.1109\/JIOT.2024.3504544","journal-title":"IEEE Internet Things J"},{"issue":"1","key":"342_CR39","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1109\/TCE.2024.3476477","volume":"71","author":"M Talha Usman","year":"2025","unstructured":"Talha Usman M, Khan H, Kumar Singh S, Young Lee M, Koo J (2025) Efficient deepfake detection via layer-frozen assisted dual attention network for consumer imaging devices. IEEE Trans Consum Electron 71(1):281\u2013291. https:\/\/doi.org\/10.1109\/TCE.2024.3476477","journal-title":"IEEE Trans Consum Electron"},{"key":"342_CR40","doi-asserted-by":"crossref","unstructured":"Tang, X., Zhang, F., Qin, Z., Wang, Y., Shi, D., Song, B., Tong, Y., Zhu, H., Ye, J.: Value function is all you need: A unified learning framework for ride hailing platforms. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. KDD \u201921, pp. 3605\u20133615. Association for Computing Machinery, New York, NY, USA (2021). 10.1145\/3447548.3467096","DOI":"10.1145\/3447548.3467096"},{"issue":"8","key":"342_CR41","doi-asserted-by":"publisher","first-page":"12311","DOI":"10.1109\/TITS.2021.3113661","volume":"23","author":"L Tang","year":"2022","unstructured":"Tang L, Liu Z, Zhang R, Duan Z, Liang Y (2022) Who will travel with me? personalized ranking using attributed network embedding for pooling. IEEE Trans Intell Transp Syst 23(8):12311\u201312327. https:\/\/doi.org\/10.1109\/TITS.2021.3113661","journal-title":"IEEE Trans Intell Transp Syst"},{"issue":"2","key":"342_CR42","doi-asserted-by":"publisher","first-page":"1453","DOI":"10.1109\/JIOT.2024.3490598","volume":"12","author":"L Tang","year":"2025","unstructured":"Tang L, Wang R, Liu Z, Liang Y, Niu Y, Zhu W, Duan Z (2025) Scenario-based accelerated testing for sotif in autonomous driving: a review. IEEE Internet Things J 12(2):1453\u20131470. https:\/\/doi.org\/10.1109\/JIOT.2024.3490598","journal-title":"IEEE Internet Things J"},{"key":"342_CR43","doi-asserted-by":"publisher","unstructured":"Tang L, Liu Z, Zhao Y, Duan Z, Jia J (2020) Efficient ridesharing framework for ride-matching via heterogeneous network embedding. ACM Trans Knowl Discov Data 14(3). https:\/\/doi.org\/10.1145\/3373839","DOI":"10.1145\/3373839"},{"key":"342_CR44","unstructured":"Tseng W-C, Wang T-H, Yen-Chen L, Isola P (2022) offline multi-agent reinforcement learning with knowledge distillation. In: Proceedings of the 36th international conference on neural information processing systems. NIPS \u201922. Curran Associates Inc., Red Hook, NY, USA"},{"key":"342_CR45","doi-asserted-by":"publisher","unstructured":"Usman MT, Khan H, Rida I, Koo J (2025) Lightweight transformer-driven multi-scale trapezoidal attention network for saliency detection. Eng Appl Artif Intell 155(C). https:\/\/doi.org\/10.1016\/j.engappai.2025.110917","DOI":"10.1016\/j.engappai.2025.110917"},{"key":"342_CR46","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proceedings of the 31st international conference on Neural Information Processing Systems. NIPS\u201917. Curran Associates Inc., Red Hook, NY, USA, pp 6000\u20136010"},{"key":"342_CR47","doi-asserted-by":"publisher","unstructured":"Wadhwania S, Kim D-K, Omidshafiei S, How JP (2019) Policy distillation and value matching in multiagent reinforcement learning. In: 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). pp 8193\u20138200. https:\/\/doi.org\/10.1109\/IROS40897.2019.8967849","DOI":"10.1109\/IROS40897.2019.8967849"},{"issue":"9","key":"342_CR48","doi-asserted-by":"publisher","first-page":"2880","DOI":"10.1177\/09544070231167986","volume":"238","author":"B Wang","year":"2024","unstructured":"Wang B, He Z, Sheng J, Liu Y (2024) Multi-agent deep reinforcement learning with actor-attention-critic for traffic light control. Proc Inst Mech Eng Part D-J Automobile Eng 238(9):2880\u20132888. https:\/\/doi.org\/10.1177\/09544070231167986","journal-title":"Proc Inst Mech Eng Part D-J Automobile Eng"},{"key":"342_CR49","doi-asserted-by":"publisher","unstructured":"Wang J, Hao Q, Huang W, Fan X, Zhang Q, Tang Z, Wang B, Hao J, Li Y (2025) Coopride: Cooperate all grids in city-scale ride-hailing dispatching with multi-agent reinforcement learning. In: Proceedings of the 31st ACM SIGKDD conference on knowledge discovery and data mining V.1. KDD \u201925. Association for Computing Machinery, New York, NY, USA, pp 1457\u20131468. https:\/\/doi.org\/10.1145\/3690624.3709205","DOI":"10.1145\/3690624.3709205"},{"key":"342_CR50","doi-asserted-by":"publisher","unstructured":"Wang Z, Qin Z, Tang X, Ye J, Zhu H (2018) Deep reinforcement learning with knowledge transfer for online rides order dispatching. In: 2018 IEEE International Conference on Data Mining (ICDM). pp 617\u2013626 . https:\/\/doi.org\/10.1109\/ICDM.2018.00077","DOI":"10.1109\/ICDM.2018.00077"},{"key":"342_CR51","doi-asserted-by":"publisher","unstructured":"Wang Y, Tong Y, Zhou Z, Ren Z, Xu Y, Wu G, Lv W (2022) Fed-ltd: towards cross-platform ride hailing via federated learning to dispatch. In: Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining. KDD \u201922. Association for Computing Machinery, New York, NY, USA, pp 4079\u20134089. https:\/\/doi.org\/10.1145\/3534678.3539047","DOI":"10.1145\/3534678.3539047"},{"key":"342_CR52","doi-asserted-by":"publisher","unstructured":"Wan R, Zhang S, Shi C, Luo S, Song R (2021) Pattern transfer learning for reinforcement learning in order dispatching. In: Proceedings of the IJCAI 2021 workshop on reinforcement learning for intelligent transportation systems (RL4ITS), Montreal, Canada (virtual). https:\/\/doi.org\/10.48550\/arXiv.2105.13218","DOI":"10.48550\/arXiv.2105.13218"},{"key":"342_CR53","doi-asserted-by":"publisher","unstructured":"Watahiki H, Iwase R, Unno R, Tsuruoka Y (2024) Cross-domain policy transfer by representation alignment via multi-domain behavioral cloning. In: Proceedings of the 3rd Conference on Lifelong Learning Agents (CoLLAs 2024). proceedings of machine learning research, vol. 274. pp 301\u2013323. https:\/\/doi.org\/10.48550\/arXiv.2407.16912","DOI":"10.48550\/arXiv.2407.16912"},{"key":"342_CR54","doi-asserted-by":"publisher","unstructured":"Xia K, Lin L, Zhang X, Wang H, Wang S, He T (2025) A transferable spatio-temporal learning framework for cross-city logistics demand prediction. In: Proceedings of the 31st ACM SIGKDD conference on knowledge discovery and data mining V.2. KDD \u201925. Association for Computing Machinery, New York, NY, USA, pp 5071\u20135082. https:\/\/doi.org\/10.1145\/3711896.3737186","DOI":"10.1145\/3711896.3737186"},{"issue":"2","key":"342_CR55","doi-asserted-by":"publisher","first-page":"380","DOI":"10.1080\/13658816.2022.2119477","volume":"37","author":"M Xu","year":"2023","unstructured":"Xu M, Yue P, Yu F, Yang C, Zhang M, Li S, Li H (2023) Multi-agent reinforcement learning to unify order-matching and vehicle-repositioning in ride-hailing services. Int J Geograph Inf Sci 37(2):380\u2013402. https:\/\/doi.org\/10.1080\/13658816.2022.2119477","journal-title":"Int J Geograph Inf Sci"},{"key":"342_CR56","unstructured":"Xu K, Bai C, Ma X, Wang D, Zhao B, Wang Z, Li X, Li W (2023) Cross-domain policy adaptation via value-guided data filtering. In: Proceedings of the 37th international conference on Neural Information Processing Systems. NIPS \u201923. Curran Associates Inc., Red Hook, NY, USA"},{"key":"342_CR57","doi-asserted-by":"publisher","unstructured":"Xu D, Yen IE-H, Zhao J, Xiao Z (2021) Rethinking network pruning \u2014 under the pre-train and fine-tune paradigm. In: Proceedings of the 2021 conference of the north american chapter of the association for computational linguistics: human language technologies. Association for Computational Linguistics, Online, pp. 2376\u20132382. https:\/\/doi.org\/10.18653\/v1\/2021.naacl-main.188","DOI":"10.18653\/v1\/2021.naacl-main.188"},{"key":"342_CR58","doi-asserted-by":"publisher","unstructured":"Yang T, Hao J, Meng Z, Zhang Z, Hu Y, Chen Y, Fan C, Wang W, Liu W, Wang Z, Peng J (2020) Efficient deep reinforcement learning via adaptive policy transfer. In: Proceedings of the twenty-ninth International Joint Conference on Artificial Intelligence (IJCAI \u201920). pp 3094\u20133100. https:\/\/doi.org\/10.24963\/ijcai.2020\/428","DOI":"10.24963\/ijcai.2020\/428"},{"issue":"10","key":"342_CR59","doi-asserted-by":"publisher","first-page":"7934","DOI":"10.1109\/TNNLS.2022.3147221","volume":"34","author":"D Ye","year":"2023","unstructured":"Ye D, Zhu T, Zhu C, Zhou W, Yu PS (2023) Model-based self-advising for multi-agent learning. IEEE Trans Neural Netw Learn Syst 34(10):7934\u20137945. https:\/\/doi.org\/10.1109\/TNNLS.2022.3147221","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"342_CR60","unstructured":"You H, Yang T, Zheng Y, Hao J, Taylor E, Liu Z, Wang J (2022) Cross-domain adaptive transfer reinforcement learning based on state-action correspondence. In: Proceedings of the 38th conference on Uncertainty in Artificial Intelligence (UAI). Proceedings of machine learning research, vol. 180. pp 2299\u20132309"},{"key":"342_CR61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s44443-025-00009-2","volume":"37","author":"JX Zhang","year":"2025","unstructured":"Zhang JX, Wu CX (2025) Active lane-changing model based on cellular automata to promote cav platoon. J King Saud University Comput Inf Sci 37:1\u20132. https:\/\/doi.org\/10.1007\/s44443-025-00009-2","journal-title":"J King Saud University Comput Inf Sci"},{"key":"342_CR62","doi-asserted-by":"publisher","unstructured":"Zhang J, Tang L, Zhou X (2025) Robust detection for fisheye camera based on contrastive learning. CMC-Comput Mater Continua 83(2):2643\u20132658. https:\/\/doi.org\/10.32604\/cmc.2025.061690","DOI":"10.32604\/cmc.2025.061690"},{"key":"342_CR63","doi-asserted-by":"publisher","unstructured":"Zhao Y, Tang L, Liang Y, He Z, Ma J (2025) Optimizing matching for on-demand ride-pooling with stochastic day-to-day dynamics. ACM Trans Knowl Discov Data 19(4). https:\/\/doi.org\/10.1145\/3721434","DOI":"10.1145\/3721434"},{"key":"342_CR64","doi-asserted-by":"publisher","unstructured":"Zhou X, Wu L, Zhang Y, Chen Z-S, Jiang S (2023) A robust deep reinforcement learning approach to driverless taxi dispatching under uncertain demand. Inf. Sci. 646(C). https:\/\/doi.org\/10.1016\/j.ins.2023.119401","DOI":"10.1016\/j.ins.2023.119401"},{"issue":"11","key":"342_CR65","doi-asserted-by":"publisher","first-page":"13344","DOI":"10.1109\/TPAMI.2023.3292075","volume":"45","author":"Z Zhu","year":"2023","unstructured":"Zhu Z, Lin K, Jain AK, Zhou J (2023) Transfer learning in deep reinforcement learning: A survey. IEEE Trans Pattern Anal Mach Intell 45(11):13344\u201313362. https:\/\/doi.org\/10.1109\/TPAMI.2023.3292075","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"342_CR66","doi-asserted-by":"publisher","unstructured":"Zhu H, Feng J, Sun F, Tang K, Zang D, Kang Q (2025) Sharing control knowledge among heterogeneous intersections: a distributed arterial traffic signal coordination method using multi-agent reinforcement learning. IEEE Trans Intell Transp Syst 26(2):2760\u20132776. https:\/\/doi.org\/10.1109\/TITS.2024.3521514","DOI":"10.1109\/TITS.2024.3521514"},{"key":"342_CR67","doi-asserted-by":"crossref","unstructured":"Zhu R, Dai T, Celiktutan O (2024) Cross domain policy transfer with effect cycle-consistency. In: 2024 IEEE International Conference on Robotics and Automation (ICRA). pp 9471\u20139477","DOI":"10.1109\/ICRA57147.2024.10611110"}],"container-title":["Journal of King Saud University Computer and Information Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44443-025-00342-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44443-025-00342-6","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44443-025-00342-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T18:47:57Z","timestamp":1767638877000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44443-025-00342-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":67,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["342"],"URL":"https:\/\/doi.org\/10.1007\/s44443-025-00342-6","relation":{},"ISSN":["1319-1578","2213-1248"],"issn-type":[{"type":"print","value":"1319-1578"},{"type":"electronic","value":"2213-1248"}],"subject":[],"published":{"date-parts":[[2025,11,17]]},"assertion":[{"value":"4 July 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 October 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 November 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"319"}}