{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T18:14:55Z","timestamp":1770228895708,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,21]],"date-time":"2023-10-21T00:00:00Z","timestamp":1697846400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62177033"],"award-info":[{"award-number":["62177033"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,21]]},"DOI":"10.1145\/3583780.3614897","type":"proceedings-article","created":{"date-parts":[[2023,10,21]],"date-time":"2023-10-21T07:45:42Z","timestamp":1697874342000},"page":"1318-1327","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Graph Enhanced Hierarchical Reinforcement Learning for Goal-oriented Learning Path Recommendation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-3731-1548","authenticated-orcid":false,"given":"Qingyao","family":"Li","sequence":"first","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2544-775X","authenticated-orcid":false,"given":"Wei","family":"Xia","sequence":"additional","affiliation":[{"name":"Huawei Noah's Ark Lab, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5048-4619","authenticated-orcid":false,"given":"Li'ang","family":"Yin","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, 
China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5432-7303","authenticated-orcid":false,"given":"Jian","family":"Shen","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-7031-5786","authenticated-orcid":false,"given":"Renting","family":"Rui","sequence":"additional","affiliation":[{"name":"Shanghai Jiaotong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0127-2425","authenticated-orcid":false,"given":"Weinan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9368-9526","authenticated-orcid":false,"given":"Xianyu","family":"Chen","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9224-2431","authenticated-orcid":false,"given":"Ruiming","family":"Tang","sequence":"additional","affiliation":[{"name":"Huawei Noah's Ark Lab, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0281-8271","authenticated-orcid":false,"given":"Yong","family":"Yu","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2023,10,21]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2023. MindSpore. https:\/\/www.mindspore.cn\/  2023. MindSpore. https:\/\/www.mindspore.cn\/"},{"key":"e_1_3_2_1_2_1","first-page":"2277","article-title":"Adaptive learning path recommendation based on graph theory and an improved immune algorithm","volume":"13","author":"Bian Cun-Ling","year":"2019","unstructured":"Cun-Ling Bian , De-Liang Wang , Shi-Yu Liu , Wei-Gang Lu , and Jun-Yu Dong . 2019 . Adaptive learning path recommendation based on graph theory and an improved immune algorithm . KSII Transactions on Internet and Information Systems (TIIS) 13 , 5 (2019), 2277 -- 2298 . 
Cun-Ling Bian, De-Liang Wang, Shi-Yu Liu, Wei-Gang Lu, and Jun-Yu Dong. 2019. Adaptive learning path recommendation based on graph theory and an improved immune algorithm. KSII Transactions on Internet and Information Systems (TIIS) 13, 5 (2019), 2277--2298.","journal-title":"KSII Transactions on Internet and Information Systems (TIIS)"},{"key":"e_1_3_2_1_3_1","volume-title":"Translating embeddings for modeling multi-relational data. Advances in neural information processing systems 26","author":"Bordes Antoine","year":"2013","unstructured":"Antoine Bordes , Nicolas Usunier , Alberto Garcia-Duran , Jason Weston , and Oksana Yakhnenko . 2013. Translating embeddings for modeling multi-relational data. Advances in neural information processing systems 26 ( 2013 ). Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. Advances in neural information processing systems 26 (2013)."},{"key":"e_1_3_2_1_4_1","volume-title":"Openai gym. arXiv preprint arXiv:1606.01540","author":"Brockman Greg","year":"2016","unstructured":"Greg Brockman , Vicki Cheung , Ludwig Pettersson , Jonas Schneider , John Schulman , Jie Tang , and Wojciech Zaremba . 2016. Openai gym. arXiv preprint arXiv:1606.01540 ( 2016 ). Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and Wojciech Zaremba. 2016. Openai gym. arXiv preprint arXiv:1606.01540 (2016)."},{"key":"e_1_3_2_1_5_1","unstructured":"Haw-Shiuan Chang Hwai-Jung Hsu and Kuan-Ta Chen. 2015. Modeling Exercise Relationships in E-Learning: A Unified Approach.. In EDM. 532--535.  Haw-Shiuan Chang Hwai-Jung Hsu and Kuan-Ta Chen. 2015. Modeling Exercise Relationships in E-Learning: A Unified Approach.. In EDM. 532--535."},{"key":"e_1_3_2_1_6_1","unstructured":"Haw-Shiuan Chang Hwai-Jung Hsu and Kuan-Ta Chen. 2015. Modeling Exercise Relationships in E-Learning: A Unified Approach. In EDM.  
Haw-Shiuan Chang Hwai-Jung Hsu and Kuan-Ta Chen. 2015. Modeling Exercise Relationships in E-Learning: A Unified Approach. In EDM."},{"key":"e_1_3_2_1_7_1","volume-title":"Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio.","author":"Cho Kyunghyun","year":"2014","unstructured":"Kyunghyun Cho , Bart Van Merri\u00ebnboer , Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014 . Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014). Kyunghyun Cho, Bart Van Merri\u00ebnboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)."},{"key":"e_1_3_2_1_8_1","volume-title":"DAS3H: modeling student learning and forgetting for optimally scheduling distributed practice of skills. arXiv preprint arXiv:1905.06873","author":"Choffin Benoit","year":"2019","unstructured":"Benoit Choffin , Fabrice Popineau , Yolaine Bourda , and Jill-Jenn Vie . 2019. DAS3H: modeling student learning and forgetting for optimally scheduling distributed practice of skills. arXiv preprint arXiv:1905.06873 ( 2019 ). Benoit Choffin, Fabrice Popineau, Yolaine Bourda, and Jill-Jenn Vie. 2019. DAS3H: modeling student learning and forgetting for optimally scheduling distributed practice of skills. arXiv preprint arXiv:1905.06873 (2019)."},{"key":"e_1_3_2_1_9_1","volume-title":"Learning path recommendation based on modified variable length genetic algorithm. Education and information technologies 23, 2","author":"Dwivedi Pragya","year":"2018","unstructured":"Pragya Dwivedi , Vibhor Kant , and Kamal K Bharadwaj . 2018. Learning path recommendation based on modified variable length genetic algorithm. Education and information technologies 23, 2 ( 2018 ), 819--836. 
Pragya Dwivedi, Vibhor Kant, and Kamal K Bharadwaj. 2018. Learning path recommendation based on modified variable length genetic algorithm. Education and information technologies 23, 2 (2018), 819--836."},{"key":"e_1_3_2_1_10_1","volume-title":"Constructing a personalized learning path using genetic algorithms approach. arXiv preprint arXiv:2104.11276","author":"Elshani Lumbardh","year":"2021","unstructured":"Lumbardh Elshani and Krenare Pireva Nu\u00e7i . 2021. Constructing a personalized learning path using genetic algorithms approach. arXiv preprint arXiv:2104.11276 ( 2021 ). Lumbardh Elshani and Krenare Pireva Nu\u00e7i. 2021. Constructing a personalized learning path using genetic algorithms approach. arXiv preprint arXiv:2104.11276 (2021)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939754"},{"key":"e_1_3_2_1_12_1","volume-title":"International conference on machine learning. PMLR","author":"Haarnoja Tuomas","year":"2018","unstructured":"Tuomas Haarnoja , Aurick Zhou , Pieter Abbeel , and Sergey Levine . 2018 . Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor . In International conference on machine learning. PMLR , 1861--1870. Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning. PMLR, 1861--1870."},{"key":"e_1_3_2_1_13_1","volume-title":"Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939","author":"Hidasi Bal\u00e1zs","year":"2015","unstructured":"Bal\u00e1zs Hidasi , Alexandros Karatzoglou , Linas Baltrunas , and Domonkos Tikk . 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939 ( 2015 ). Bal\u00e1zs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and Domonkos Tikk. 2015. 
Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939 (2015)."},{"key":"e_1_3_2_1_14_1","volume-title":"Long short-term memory. Neural computation 9, 8","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long short-term memory. Neural computation 9, 8 ( 1997 ), 1735--1780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735--1780."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11277-020-07199-0"},{"key":"e_1_3_2_1_16_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_17_1","volume-title":"Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907","author":"Kipf Thomas N","year":"2016","unstructured":"Thomas N Kipf and Max Welling . 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 ( 2016 ). Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)."},{"key":"e_1_3_2_1_18_1","volume-title":"Actor-critic algorithms. Advances in neural information processing systems 12","author":"Konda Vijay","year":"1999","unstructured":"Vijay Konda and John Tsitsiklis . 1999. Actor-critic algorithms. Advances in neural information processing systems 12 ( 1999 ). Vijay Konda and John Tsitsiklis. 1999. Actor-critic algorithms. 
Advances in neural information processing systems 12 (1999)."},{"key":"e_1_3_2_1_19_1","volume-title":"RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions. arXiv preprint arXiv:2108.00268","author":"Kubotani Yoshiki","year":"2021","unstructured":"Yoshiki Kubotani , Yoshihiro Fukuhara , and Shigeo Morishima . 2021. RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions. arXiv preprint arXiv:2108.00268 ( 2021 ). Yoshiki Kubotani, Yoshihiro Fukuhara, and Shigeo Morishima. 2021. RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions. arXiv preprint arXiv:2108.00268 (2021)."},{"key":"e_1_3_2_1_20_1","volume-title":"Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation. Advances in neural information processing systems 29","author":"Kulkarni Tejas D","year":"2016","unstructured":"Tejas D Kulkarni , Karthik Narasimhan , Ardavan Saeedi , and Josh Tenenbaum . 2016. Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation. Advances in neural information processing systems 29 ( 2016 ). Tejas D Kulkarni, Karthik Narasimhan, Ardavan Saeedi, and Josh Tenenbaum. 2016. Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation. Advances in neural information processing systems 29 (2016)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107085"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330922"},{"key":"e_1_3_2_1_23_1","volume-title":"Jackie Chi Kit Cheung, and Doina Precup","author":"Long Teng","year":"2016","unstructured":"Teng Long , Ryan Lowe , Jackie Chi Kit Cheung, and Doina Precup . 2016 . Leveraging lexical resources for learning entity embeddings in multi-relational data. 
arXiv preprint arXiv:1605.05416 (2016). Teng Long, Ryan Lowe, Jackie Chi Kit Cheung, and Doina Precup. 2016. Leveraging lexical resources for learning entity embeddings in multi-relational data. arXiv preprint arXiv:1605.05416 (2016)."},{"key":"e_1_3_2_1_24_1","volume-title":"A theory of test scores. Psychometric monographs","author":"Lord Frederic","year":"1952","unstructured":"Frederic Lord . 1952. A theory of test scores. Psychometric monographs ( 1952 ). Frederic Lord. 1952. A theory of test scores. Psychometric monographs (1952)."},{"key":"e_1_3_2_1_25_1","volume-title":"Applications of item response theory to practical testing problems","author":"Lord Frederic M","unstructured":"Frederic M Lord . 2012. Applications of item response theory to practical testing problems . Routledge . Frederic M Lord. 2012. Applications of item response theory to practical testing problems. Routledge."},{"key":"e_1_3_2_1_26_1","volume-title":"Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602","author":"Mnih Volodymyr","year":"2013","unstructured":"Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Alex Graves , Ioannis Antonoglou , Daan Wierstra , and Martin Riedmiller . 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 ( 2013 ). Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3350546.3352513"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10639-020-10133-3"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2623330.2623732"},{"key":"e_1_3_2_1_30_1","volume-title":"Deep knowledge tracing. 
Advances in neural information processing systems 28","author":"Piech Chris","year":"2015","unstructured":"Chris Piech , Jonathan Bassen , Jonathan Huang , Surya Ganguli , Mehran Sahami , Leonidas J Guibas , and Jascha Sohl-Dickstein . 2015. Deep knowledge tracing. Advances in neural information processing systems 28 ( 2015 ). Chris Piech, Jonathan Bassen, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas J Guibas, and Jascha Sohl-Dickstein. 2015. Deep knowledge tracing. Advances in neural information processing systems 28 (2015)."},{"key":"e_1_3_2_1_31_1","volume-title":"Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 ( 2017 ). John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_32_1","volume-title":"Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial intelligence 112, 1--2","author":"Sutton Richard S","year":"1999","unstructured":"Richard S Sutton , Doina Precup , and Satinder Singh . 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial intelligence 112, 1--2 ( 1999 ), 181--211. Richard S Sutton, Doina Precup, and Satinder Singh. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. 
Artificial intelligence 112, 1--2 (1999), 181--211."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3580305.3599367"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v31i1.10952"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i5.16580"},{"key":"e_1_3_2_1_36_1","volume-title":"Automatic Learning Path Recommendation for Open Source Projects Using Deep Learning on Knowledge Graphs. In 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC)","author":"Yin Hang","unstructured":"Hang Yin , Zhiyu Sun , Yanchun Sun , and Gang Huang . 2021. Automatic Learning Path Recommendation for Open Source Projects Using Deep Learning on Knowledge Graphs. In 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC) . IEEE , 824--833. Hang Yin, Zhiyu Sun, Yanchun Sun, and Gang Huang. 2021. Automatic Learning Path Recommendation for Open Source Projects Using Deep Learning on Knowledge Graphs. In 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC). IEEE, 824--833."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.3301435"},{"key":"e_1_3_2_1_38_1","first-page":"76","article-title":"Recommender Systems in E-learning","volume":"1","author":"Zhang Qian","year":"2021","unstructured":"Qian Zhang , Jie Lu , and Guangquan Zhang . 2021 . Recommender Systems in E-learning . Journal of Smart Environments and Green Computing 1 , 2 (2021), 76 -- 89 . Qian Zhang, Jie Lu, and Guangquan Zhang. 2021. Recommender Systems in E-learning. 
Journal of Smart Environments and Green Computing 1, 2 (2021), 76--89.","journal-title":"Journal of Smart Environments and Green Computing"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401170"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377571.3377587"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2018.02.053"}],"event":{"name":"CIKM '23: The 32nd ACM International Conference on Information and Knowledge Management","location":"Birmingham United Kingdom","acronym":"CIKM '23","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGIR ACM Special Interest Group on Information Retrieval"]},"container-title":["Proceedings of the 32nd ACM International Conference on Information and Knowledge Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583780.3614897","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3583780.3614897","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:43Z","timestamp":1750178203000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583780.3614897"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,21]]},"references-count":41,"alternative-id":["10.1145\/3583780.3614897","10.1145\/3583780"],"URL":"https:\/\/doi.org\/10.1145\/3583780.3614897","relation":{},"subject":[],"published":{"date-parts":[[2023,10,21]]},"assertion":[{"value":"2023-10-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}