{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T02:49:59Z","timestamp":1772938199186,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T00:00:00Z","timestamp":1691107200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,6]]},"DOI":"10.1145\/3580305.3599811","type":"proceedings-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T18:13:58Z","timestamp":1691172838000},"page":"4628-4637","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-1748-7376","authenticated-orcid":false,"given":"Xiaowei","family":"Mao","sequence":"first","affiliation":[{"name":"Beijing Jiaotong University &amp; Cainiao Network, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6130-126X","authenticated-orcid":false,"given":"Haomin","family":"Wen","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University &amp; Cainiao Network, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0989-5803","authenticated-orcid":false,"given":"Hengrui","family":"Zhang","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University &amp; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0501-9363","authenticated-orcid":false,"given":"Huaiyu","family":"Wan","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University &amp; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1863-2316","authenticated-orcid":false,"given":"Lixia","family":"Wu","sequence":"additional","affiliation":[{"name":"Cainiao Network, Hangzhou, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0636-3905","authenticated-orcid":false,"given":"Jianbin","family":"Zheng","sequence":"additional","affiliation":[{"name":"Cainiao Network, Hangzhou, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0464-7736","authenticated-orcid":false,"given":"Haoyuan","family":"Hu","sequence":"additional","affiliation":[{"name":"Cainiao Network, Hangzhou, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1611-4323","authenticated-orcid":false,"given":"Youfang","family":"Lin","sequence":"additional","affiliation":[{"name":"Beijing Jiaotong University &amp; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086","author":"Bahdanau Dzmitry","year":"2016","unstructured":"Dzmitry Bahdanau , Philemon Brakel , Kelvin Xu , Anirudh Goyal , Ryan Lowe , Joelle Pineau , Aaron Courville , and Yoshua Bengio . 2016. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086 ( 2016 ). Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2016. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086 (2016)."},{"key":"e_1_3_2_2_2_1","unstructured":"Irwan Bello Hieu Pham Quoc V Le Mohammad Norouzi and Samy Bengio. 2017. Neural combinatorial optimization with reinforcement learning. In ICLR.  Irwan Bello Hieu Pham Quoc V Le Mohammad Norouzi and Samy Bengio. 2017. Neural combinatorial optimization with reinforcement learning. In ICLR."},{"key":"e_1_3_2_2_3_1","volume-title":"CrowdExpress: a probabilistic framework for on-time crowdsourced package deliveries","author":"Chen Chao","year":"2020","unstructured":"Chao Chen , Sen Yang , Yasha Wang , Bin Guo , and Daqing Zhang . 2020. CrowdExpress: a probabilistic framework for on-time crowdsourced package deliveries . IEEE transactions on big data 8, 3 ( 2020 ), 827--842. Chao Chen, Sen Yang, Yasha Wang, Bin Guo, and Daqing Zhang. 2020. CrowdExpress: a probabilistic framework for on-time crowdsourced package deliveries. IEEE transactions on big data 8, 3 (2020), 827--842."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"e_1_3_2_2_5_1","volume-title":"Fast abstractive summarization with reinforce-selected sentence rewriting. arXiv preprint arXiv:1805.11080","author":"Chen Yen-Chun","year":"2018","unstructured":"Yen-Chun Chen and Mohit Bansal . 2018. Fast abstractive summarization with reinforce-selected sentence rewriting. arXiv preprint arXiv:1805.11080 ( 2018 ). Yen-Chun Chen and Mohit Bansal. 2018. Fast abstractive summarization with reinforce-selected sentence rewriting. arXiv preprint arXiv:1805.11080 (2018)."},{"key":"e_1_3_2_2_6_1","volume-title":"Differentiable perturb-and-parse: Semisupervised parsing with a structured variational autoencoder. arXiv preprint arXiv:1807.09875","author":"Corro Caio","year":"2018","unstructured":"Caio Corro and Ivan Titov . 2018. Differentiable perturb-and-parse: Semisupervised parsing with a structured variational autoencoder. arXiv preprint arXiv:1807.09875 ( 2018 ). Caio Corro and Ivan Titov. 2018. Differentiable perturb-and-parse: Semisupervised parsing with a structured variational autoencoder. arXiv preprint arXiv:1807.09875 (2018)."},{"key":"e_1_3_2_2_7_1","volume-title":"Cold-start reinforcement learning with softmax policy gradient. Advances in Neural Information Processing Systems 30","author":"Ding Nan","year":"2017","unstructured":"Nan Ding and Radu Soricut . 2017. Cold-start reinforcement learning with softmax policy gradient. Advances in Neural Information Processing Systems 30 ( 2017 ). Nan Ding and Radu Soricut. 2017. Cold-start reinforcement learning with softmax policy gradient. Advances in Neural Information Processing Systems 30 (2017)."},{"key":"e_1_3_2_2_8_1","unstructured":"Chengliang Gao Fan Zhang Guanqun Wu Qiwan Hu Qiang Ru Jinghua Hao Renqing He and Zhizhao Sun. 2021. A Deep Learning Method for Route and Time Prediction in Food Delivery Service. In KDD. 2879--2889.  Chengliang Gao Fan Zhang Guanqun Wu Qiwan Hu Qiang Ru Jinghua Hao Renqing He and Zhizhao Sun. 2021. A Deep Learning Method for Route and Time Prediction in Food Delivery Service. In KDD. 2879--2889."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2018.00100"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330754"},{"key":"e_1_3_2_2_12_1","unstructured":"Guolin Ke Qi Meng Thomas Finley Taifeng Wang Wei Chen Weidong Ma Qiwei Ye and Tie-Yan Liu. 2017. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In NeurIPS. 3146--3154.  Guolin Ke Qi Meng Thomas Finley Taifeng Wang Wei Chen Weidong Ma Qiwei Ye and Tie-Yan Liu. 2017. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In NeurIPS. 3146--3154."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/30.1-2.81"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2019.2929141"},{"key":"e_1_3_2_2_15_1","first-page":"2469","article-title":"Deep Reinforcement Learning for Sequence-to-Sequence Models","volume":"31","author":"Keneshloo Yaser","year":"2020","unstructured":"Yaser Keneshloo , Tian Shi , Naren Ramakrishnan , and Chandan K. Reddy . 2020 . Deep Reinforcement Learning for Sequence-to-Sequence Models . IEEE Transactions on Neural Networks and Learning Systems 31 , 7 (2020), 2469 -- 2489 . https: \/\/doi.org\/10.1109\/TNNLS.2019.2929141 10.1109\/TNNLS.2019.2929141 Yaser Keneshloo, Tian Shi, Naren Ramakrishnan, and Chandan K. Reddy. 2020. Deep Reinforcement Learning for Sequence-to-Sequence Models. IEEE Transactions on Neural Networks and Learning Systems 31, 7 (2020), 2469--2489. https: \/\/doi.org\/10.1109\/TNNLS.2019.2929141","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"e_1_3_2_2_16_1","volume-title":"Reliability and learnability of human bandit feedback for sequence-to-sequence reinforcement learning. arXiv preprint arXiv:1805.10627","author":"Kreutzer Julia","year":"2018","unstructured":"Julia Kreutzer , Joshua Uyheng , and Stefan Riezler . 2018. Reliability and learnability of human bandit feedback for sequence-to-sequence reinforcement learning. arXiv preprint arXiv:1805.10627 ( 2018 ). Julia Kreutzer, Joshua Uyheng, and Stefan Riezler. 2018. Reliability and learnability of human bandit feedback for sequence-to-sequence reinforcement learning. arXiv preprint arXiv:1805.10627 (2018)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.100"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2018.2861864"},{"key":"e_1_3_2_2_19_1","volume-title":"Ranking sentences for extractive summarization with reinforcement learning. arXiv preprint arXiv:1802.08636","author":"Narayan Shashi","year":"2018","unstructured":"Shashi Narayan , Shay B Cohen , and Mirella Lapata . 2018. Ranking sentences for extractive summarization with reinforcement learning. arXiv preprint arXiv:1802.08636 ( 2018 ). Shashi Narayan, Shay B Cohen, and Mirella Lapata. 2018. Ranking sentences for extractive summarization with reinforcement learning. arXiv preprint arXiv:1802.08636 (2018)."},{"key":"e_1_3_2_2_20_1","volume-title":"Edit distance and dialect proximity. Time Warps, String Edits and Macromolecules: The theory and practice of sequence comparison 15","author":"Nerbonne John","year":"1999","unstructured":"John Nerbonne , Wilbert Heeringa , and Peter Kleiweg . 1999. Edit distance and dialect proximity. Time Warps, String Edits and Macromolecules: The theory and practice of sequence comparison 15 ( 1999 ). John Nerbonne, Wilbert Heeringa, and Peter Kleiweg. 1999. Edit distance and dialect proximity. Time Warps, String Edits and Macromolecules: The theory and practice of sequence comparison 15 (1999)."},{"key":"e_1_3_2_2_21_1","volume-title":"A deep reinforced model for abstractive summarization. arXiv preprint arXiv:1705.04304","author":"Paulus Romain","year":"2017","unstructured":"Romain Paulus , Caiming Xiong , and Richard Socher . 2017. A deep reinforced model for abstractive summarization. arXiv preprint arXiv:1705.04304 ( 2017 ). Romain Paulus, Caiming Xiong, and Richard Socher. 2017. A deep reinforced model for abstractive summarization. arXiv preprint arXiv:1705.04304 (2017)."},{"key":"e_1_3_2_2_22_1","volume-title":"Sequence level training with recurrent neural networks. arXiv preprint arXiv:1511.06732","author":"Ranzato Marc'Aurelio","year":"2015","unstructured":"Marc'Aurelio Ranzato , Sumit Chopra , Michael Auli , and Wojciech Zaremba . 2015. Sequence level training with recurrent neural networks. arXiv preprint arXiv:1511.06732 ( 2015 ). Marc'Aurelio Ranzato, Sumit Chopra, Michael Auli, and Wojciech Zaremba. 2015. Sequence level training with recurrent neural networks. arXiv preprint arXiv:1511.06732 (2015)."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.131"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539027"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00307"},{"key":"e_1_3_2_2_26_1","volume-title":"High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438","author":"Schulman John","year":"2015","unstructured":"John Schulman , Philipp Moritz , Sergey Levine , Michael Jordan , and Pieter Abbeel . 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 ( 2015 ). John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015)."},{"key":"e_1_3_2_2_27_1","volume-title":"Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, 725--733","author":"Stoyanov Veselin","year":"2011","unstructured":"Veselin Stoyanov , Alexander Ropson , and Jason Eisner . 2011 . Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure . In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, 725--733 . Veselin Stoyanov, Alexander Ropson, and Jason Eisner. 2011. Empirical risk minimization of graphical model parameters given approximate inference, decoding, and model structure. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings, 725--733."},{"key":"e_1_3_2_2_28_1","unstructured":"Richard S Sutton Andrew G Barto etal 1998. Introduction to reinforcement learning. Vol. 135. MIT press Cambridge.  Richard S Sutton Andrew G Barto et al. 1998. Introduction to reinforcement learning. Vol. 135. MIT press Cambridge."},{"key":"e_1_3_2_2_29_1","volume-title":"Connecting the dots between mle and rl for sequence prediction. arXiv preprint arXiv:1811.09740","author":"Tan Bowen","year":"2018","unstructured":"Bowen Tan , Zhiting Hu , Zichao Yang , Ruslan Salakhutdinov , and Eric Xing . 2018. Connecting the dots between mle and rl for sequence prediction. arXiv preprint arXiv:1811.09740 ( 2018 ). Bowen Tan, Zhiting Hu, Zichao Yang, Ruslan Salakhutdinov, and Eric Xing. 2018. Connecting the dots between mle and rl for sequence prediction. arXiv preprint arXiv:1811.09740 (2018)."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Paolo Toth and Daniele Vigo. 2002. The vehicle routing problem. SIAM.  Paolo Toth and Daniele Vigo. 2002. The vehicle routing problem. SIAM.","DOI":"10.1137\/1.9780898718515"},{"key":"e_1_3_2_2_31_1","volume-title":"Pointer networks. Advances in neural information processing systems 28","author":"Vinyals Oriol","year":"2015","unstructured":"Oriol Vinyals , Meire Fortunato , and Navdeep Jaitly . 2015. Pointer networks. Advances in neural information processing systems 28 ( 2015 ). Oriol Vinyals, Meire Fortunato, and Navdeep Jaitly. 2015. Pointer networks. Advances in neural information processing systems 28 (2015)."},{"key":"e_1_3_2_2_32_1","volume-title":"A reinforced topic-aware convolutional sequence-to-sequence model for abstractive text summarization. arXiv preprint arXiv:1805.03616","author":"Wang Li","year":"2018","unstructured":"Li Wang , Junlin Yao , Yunzhe Tao , Li Zhong , Wei Liu , and Qiang Du. 2018. A reinforced topic-aware convolutional sequence-to-sequence model for abstractive text summarization. arXiv preprint arXiv:1805.03616 ( 2018 ). Li Wang, Junlin Yao, Yunzhe Tao, Li Zhong, Wei Liu, and Qiang Du. 2018. A reinforced topic-aware convolutional sequence-to-sequence model for abstractive text summarization. arXiv preprint arXiv:1805.03616 (2018)."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304389"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539084"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3481006"},{"key":"e_1_3_2_2_36_1","volume-title":"Package Pick-up Route Prediction via Modeling Couriers' Spatial-Temporal Behaviors","author":"Wen Haomin","unstructured":"Haomin Wen , Youfang Lin , Fan Wu , Huaiyu Wan , Shengnan Guo , Lixia Wu , Chao Song , and Yinghui Xu. 2021. Package Pick-up Route Prediction via Modeling Couriers' Spatial-Temporal Behaviors . In ICDE. IEEE , 2141--2146. Haomin Wen, Youfang Lin, Fan Wu, Huaiyu Wan, Shengnan Guo, Lixia Wu, Chao Song, and Yinghui Xu. 2021. Package Pick-up Route Prediction via Modeling Couriers' Spatial-Temporal Behaviors. In ICDE. IEEE, 2141--2146."},{"key":"e_1_3_2_2_37_1","volume-title":"Simple statistical gradient-following algorithms for connectionist reinforcement learning. Reinforcement learning","author":"Williams Ronald J","year":"1992","unstructured":"Ronald J Williams . 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Reinforcement learning ( 1992 ), 5--32. Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Reinforcement learning (1992), 5--32."},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11987"},{"key":"e_1_3_2_2_39_1","volume-title":"International conference on machine learning. PMLR","author":"Xu Kelvin","year":"2015","unstructured":"Kelvin Xu , Jimmy Ba , Ryan Kiros , Kyunghyun Cho , Aaron Courville , Ruslan Salakhudinov , Rich Zemel , and Yoshua Bengio . 2015 . Show, attend and tell: Neural image caption generation with visual attention . In International conference on machine learning. PMLR , 2048--2057. Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning. PMLR, 2048--2057."},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368297"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3351282"}],"event":{"name":"KDD '23: The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Long Beach CA USA","acronym":"KDD '23","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599811","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3580305.3599811","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:23Z","timestamp":1750182563000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599811"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":41,"alternative-id":["10.1145\/3580305.3599811","10.1145\/3580305"],"URL":"https:\/\/doi.org\/10.1145\/3580305.3599811","relation":{},"subject":[],"published":{"date-parts":[[2023,8,4]]},"assertion":[{"value":"2023-08-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}