{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,24]],"date-time":"2026-07-24T19:51:06Z","timestamp":1784922666433,"version":"3.55.0"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T00:00:00Z","timestamp":1714089600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100011347","name":"State Key Laboratory of Software Development Environment","doi-asserted-by":"crossref","award":["SKLSDE-2020ZX-01"],"award-info":[{"award-number":["SKLSDE-2020ZX-01"]}],"id":[{"id":"10.13039\/501100011347","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Zhongguancun Laboratory"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2024,7,31]]},"abstract":"<jats:p>Knowledge tracing models based on deep learning can achieve impressive predictive performance by leveraging attention mechanisms. However, there still exist two challenges in attentive knowledge tracing (AKT): First, the mechanism of classical models of AKT demonstrates relatively low attention when processing exercise sequences with shifting knowledge concepts (KC), making it difficult to capture the comprehensive state of knowledge across sequences. Second, classical models do not consider stochastic behaviors, which negatively affects models of AKT in terms of capturing anomalous knowledge states. This article proposes a model of AKT, called Enhancing Locality for Attentive Knowledge Tracing (ELAKT), that is a variant of the deep KT model. The proposed model leverages the encoder module of the transformer to aggregate knowledge embedding generated by both exercises and responses over all timesteps. 
In addition, it uses causal convolutions to aggregate and smooth the states of local knowledge. The ELAKT model uses the states of comprehensive KCs to introduce a prediction correction module to forecast the future responses of students to deal with noise caused by stochastic behaviors. The results of experiments demonstrated that the ELAKT model consistently outperforms state-of-the-art baseline KT models.<\/jats:p>","DOI":"10.1145\/3652601","type":"journal-article","created":{"date-parts":[[2024,3,14]],"date-time":"2024-03-14T12:23:47Z","timestamp":1710419027000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["ELAKT: Enhancing Locality for Attentive Knowledge Tracing"],"prefix":"10.1145","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6154-1247","authenticated-orcid":false,"given":"Yanjun","family":"Pu","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Beijing, China and Zhongguancun Laboratory,  Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-6517-3959","authenticated-orcid":false,"given":"Fang","family":"Liu","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Beihang University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4298-9358","authenticated-orcid":false,"given":"Rongye","family":"Shi","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Beihang University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6721-065X","authenticated-orcid":false,"given":"Haitao","family":"Yuan","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Tsinghua University, Beijing, 
China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-9587-3798","authenticated-orcid":false,"given":"Ruibo","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Beihang University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9910-2298","authenticated-orcid":false,"given":"Tianhao","family":"Peng","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Beihang University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2998-8828","authenticated-orcid":false,"given":"Wenjun","family":"Wu","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Beihang University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2024,4,26]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3331184.3331195"},{"key":"e_1_3_3_3_2","doi-asserted-by":"crossref","DOI":"10.1109\/TKDE.2022.3206447","article-title":"Deep graph memory networks for forgetting-robust knowledge tracing","author":"Abdelrahman Ghodai","year":"2022","unstructured":"Ghodai Abdelrahman and Qing Wang. 2022. Deep graph memory networks for forgetting-robust knowledge tracing. IEEE Transactions on Knowledge and Data Engineering 35, 8 (2022), 7844\u20137855.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_3_4_2","doi-asserted-by":"crossref","unstructured":"Ghodai Abdelrahman, Qing Wang, and Bernardo Pereira Nunes. 2022. Knowledge tracing: A survey. ACM Computing Surveys 55, 11 (2022), 1\u201337.","DOI":"10.1145\/3569576"},{"key":"e_1_3_3_5_2","article-title":"Layer normalization","author":"Ba Jimmy Lei","year":"2016","unstructured":"Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016. 
Layer normalization. arXiv:1607.06450 . Retrieved from https:\/\/arxiv.org\/abs\/1607.06450","journal-title":"arXiv:1607.06450"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.3758\/BF03202726"},{"key":"e_1_3_3_7_2","article-title":"C3SASR: Cheap causal convolutions for self-attentive sequential recommendation","author":"Chen Jiayi","year":"2022","unstructured":"Jiayi Chen, Wen Wu, and Liang He. 2022. C3SASR: Cheap causal convolutions for self-attentive sequential recommendation. arXiv:2211.01297 . Retrieved from https:\/\/arxiv.org\/abs\/2211.01297","journal-title":"arXiv:2211.01297"},{"key":"e_1_3_3_8_2","first-page":"39","volume-title":"Proceedings of the 2018 IEEE International Conference on Data Mining","author":"Chen Penghe","year":"2018","unstructured":"Penghe Chen, Yu Lu, Vincent W. Zheng, and Yang Pian. 2018. Prerequisite-driven deep knowledge tracing. In Proceedings of the 2018 IEEE International Conference on Data Mining. IEEE, 39\u201348."},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3386527.3405945"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF01099821"},{"key":"e_1_3_3_11_2","article-title":"Fine-grained interaction modeling with multi-relational transformer for knowledge tracing","author":"Cui Jiajun","year":"2023","unstructured":"Jiajun Cui, Zeyuan Chen, Aimin Zhou, Jianyong Wang, and Wei Zhang. 2023. Fine-grained interaction modeling with multi-relational transformer for knowledge tracing. ACM Transactions on Information Systems 41, 4 (2023), 1\u201326.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403282"},{"key":"e_1_3_3_13_2","first-page":"84","volume-title":"Proceedings of the 7th International Conference on Educational Data Mining","author":"Gonz\u00e1lez-Brenes Jos\u00e9","year":"2014","unstructured":"Jos\u00e9 Gonz\u00e1lez-Brenes, Yun Huang, and Peter Brusilovsky. 2014. 
General features in knowledge tracing to model multiple subskills, temporal item response theory, and expert knowledge. In Proceedings of the 7th International Conference on Educational Data Mining. University of Pittsburgh, 84\u201391."},{"key":"e_1_3_3_14_2","article-title":"Dynamic cognitive tracing: Towards unified discovery of student and cognitive models.","author":"Gonz\u00e1lez-Brenes Jos\u00e9 P.","year":"2012","unstructured":"Jos\u00e9 P. Gonz\u00e1lez-Brenes and Jack Mostow. 2012. Dynamic cognitive tracing: Towards unified discovery of student and cognitive models. International Educational Data Mining Society (2012).","journal-title":"International Educational Data Mining Society"},{"key":"e_1_3_3_15_2","article-title":"Neural turing machines","author":"Graves Alex","year":"2014","unstructured":"Alex Graves, Greg Wayne, and Ivo Danihelka. 2014. Neural turing machines. arXiv:1410.5401 . Retrieved from https:\/\/arxiv.org\/abs\/1410.5401","journal-title":"arXiv:1410.5401"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-021-06007-5"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"issue":"1","key":"e_1_3_3_18_2","first-page":"1","article-title":"MAN: Memory-augmented attentive networks for deep learning-based knowledge tracing","volume":"42","author":"He Liangliang","year":"2023","unstructured":"Liangliang He, Xiao Li, Pancheng Wang, Jintao Tang, and Ting Wang. 2023. MAN: Memory-augmented attentive networks for deep learning-based knowledge tracing. 
ACM Transactions on Information Systems 42, 1 (2023), 1\u201322.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-022-01041-2"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482136"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_3_22_2","article-title":"How deep is knowledge tracing?","author":"Khajah Mohammad","year":"2016","unstructured":"Mohammad Khajah, Robert V. Lindsey, and Michael C. Mozer. 2016. How deep is knowledge tracing? arXiv:1604.02416 . Retrieved from https:\/\/arxiv.org\/abs\/1604.02416","journal-title":"arXiv:1604.02416"},{"key":"e_1_3_3_23_2","unstructured":"M. Khajah, R. V. Lindsey, and M. C. Mozer. 2016. How deep is knowledge tracing? Proceedings of EDM (2016), 94\u2013101."},{"key":"e_1_3_3_24_2","first-page":"421","volume-title":"Proceedings of the International Conference on Artificial Intelligence in Education","author":"Koedinger Kenneth R.","year":"2013","unstructured":"Kenneth R. Koedinger, John C. Stamper, Elizabeth A. McLaughlin, and Tristan Nixon. 2013. Using data-driven discovery of better student models to improve student learning. In Proceedings of the International Conference on Artificial Intelligence in Education. Springer, 421\u2013430."},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01576"},{"key":"e_1_3_3_26_2","unstructured":"Shiyang Li, Xiaoyong Jin, Yao Xuan, Xiyou Zhou, Wenhu Chen, Yu-Xiang Wang, and Xifeng Yan. 2019. Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. 
Proceedings of Advances in Neural Information Processing Systems (2019) 1598\u20131607."},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/3543507.3583436"},{"key":"e_1_3_3_28_2","article-title":"A survey of transformers","author":"Lin Tianyang","year":"2022","unstructured":"Tianyang Lin, Yuxin Wang, Xiangyang Liu, and Xipeng Qiu. 2022. A survey of transformers. AI Open (2022).","journal-title":"AI Open"},{"key":"e_1_3_3_29_2","first-page":"856","volume-title":"Proceedings of the Asian Conference on Machine Learning","author":"Liu Congjie","year":"2021","unstructured":"Congjie Liu and Xiaoguang Li. 2021. Multi-factor memory attentive model for knowledge tracing. In Proceedings of the Asian Conference on Machine Learning. PMLR, 856\u2013869."},{"key":"e_1_3_3_30_2","first-page":"64","article-title":"Recurrent neural networks","volume":"5","author":"Medsker Larry R.","year":"2001","unstructured":"Larry R. Medsker and LC Jain. 2001. Recurrent neural networks. Design and Applications 5 (2001), 64\u201367.","journal-title":"Design and Applications"},{"key":"e_1_3_3_31_2","article-title":"Wavenet: A generative model for raw audio","author":"Oord Aaron van den","year":"2016","unstructured":"Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, and Koray Kavukcuoglu. 2016. Wavenet: A generative model for raw audio. arXiv:1609.03499 . Retrieved from https:\/\/arxiv.org\/abs\/1609.03499","journal-title":"arXiv:1609.03499"},{"key":"e_1_3_3_32_2","article-title":"A self-attentive model for knowledge tracing","author":"Pandey Shalini","year":"2019","unstructured":"Shalini Pandey and George Karypis. 2019. A self-attentive model for knowledge tracing. arXiv:1907.06837 . 
Retrieved from https:\/\/arxiv.org\/abs\/1907.06837","journal-title":"arXiv:1907.06837"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3411994"},{"key":"e_1_3_3_34_2","article-title":"A decomposable attention model for natural language inference","author":"Parikh Ankur P.","year":"2016","unstructured":"Ankur P. Parikh, Oscar T\u00e4ckstr\u00f6m, Dipanjan Das, and Jakob Uszkoreit. 2016. A decomposable attention model for natural language inference. arXiv:1606.01933 . Retrieved from https:\/\/arxiv.org\/abs\/1606.01933","journal-title":"arXiv:1606.01933"},{"key":"e_1_3_3_35_2","first-page":"505","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Piech Chris","year":"2015","unstructured":"Chris Piech, Jonathan Bassen, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas J Guibas, and Jascha Sohl-Dickstein. 2015. Deep knowledge tracing. In Proceedings of the Advances in Neural Information Processing Systems. 505\u2013513."},{"key":"e_1_3_3_36_2","doi-asserted-by":"crossref","first-page":"490","DOI":"10.1145\/3448139.3448188","volume-title":"Proceedings of the LAK21: 11th International Learning Analytics and Knowledge Conference","author":"Shin Dongmin","year":"2021","unstructured":"Dongmin Shin, Yugeun Shim, Hangyeol Yu, Seewoo Lee, Byungsoo Kim, and Youngduck Choi. 2021. Saint+: Integrating temporal features for ednet correctness prediction. In Proceedings of the LAK21: 11th International Learning Analytics and Knowledge Conference. 490\u2013496."},{"key":"e_1_3_3_37_2","volume-title":"Proceedings of the Artificial Intelligence in Education - 15th International Conference, AIED 2011, Auckland, New Zealand, June 28 - July 2011","author":"Stamper John C.","year":"2011","unstructured":"John C. Stamper and Kenneth R. Koedinger. 2011. Human-machine student model discovery and improvement using datashop. 
In Proceedings of the Artificial Intelligence in Education - 15th International Conference, AIED 2011, Auckland, New Zealand, June 28 - July 2011."},{"key":"e_1_3_3_38_2","article-title":"Ensemble knowledge tracing: Modeling interactions in learning process","author":"Sun Jianwen","year":"2022","unstructured":"Jianwen Sun, Rui Zou, Ruxia Liang, Lu Gao, Sannyuya Liu, Qing Li, Kai Zhang, and Lulu Jiang. 2022. Ensemble knowledge tracing: Modeling interactions in learning process. Expert Systems with Applications 32 (2022), 1\u201312.","journal-title":"Expert Systems with Applications"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1745-3984.1983.tb00212.x"},{"key":"e_1_3_3_40_2","first-page":"5998","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems. 5998\u20136008."},{"issue":"1","key":"e_1_3_3_41_2","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.amjsurg.2002.12.005","article-title":"Cognitive task analysis for teaching technical skills in an inanimate surgical skills laboratory","volume":"187","author":"Velmahos George C.","year":"2004","unstructured":"George C. Velmahos, Konstantinos G. Toutouzas, Lelan F. Sillin, Linda Chan, Richard E. Clark, Demetrios Theodorou, and Fredric Maupin. 2004. Cognitive task analysis for teaching technical skills in an inanimate surgical skills laboratory. 
American Journal of Surgery 187, 1 (2004), 114\u2013119.","journal-title":"American Journal of Surgery"},{"key":"e_1_3_3_42_2","article-title":"Real-time target sound extraction","author":"Veluri Bandhav","year":"2022","unstructured":"Bandhav Veluri, Justin Chan, Malek Itani, Tuochao Chen, Takuya Yoshioka, and Shyamnath Gollakota. 2022. Real-time target sound extraction. arXiv:2211.02250 . Retrieved from https:\/\/arxiv.org\/abs\/2211.02250","journal-title":"arXiv:2211.02250"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441802"},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3231644.3231647"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41019-020-00151-z"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.14778\/3570690.3570691"},{"key":"e_1_3_3_47_2","first-page":"2135","volume-title":"Proceedings of the 2020 International Conference on Management of Data","author":"Yuan Haitao","year":"2020","unstructured":"Haitao Yuan, Guoliang Li, Zhifeng Bao, and Ling Feng. 2020. Effective travel time estimation: When historical trajectories over road networks matter. In Proceedings of the 2020 International Conference on Management of Data. ACM, 2135\u20132149."},{"key":"e_1_3_3_48_2","first-page":"348","volume-title":"Proceedings of the 37th IEEE International Conference on Data Engineering, ICDE 2021, Chania, Greece, April 19-22, 2021","author":"Yuan Haitao","year":"2021","unstructured":"Haitao Yuan, Guoliang Li, Zhifeng Bao, and Ling Feng. 2021. An effective joint prediction model for travel demands and traffic flows. In Proceedings of the 37th IEEE International Conference on Data Engineering, ICDE 2021, Chania, Greece, April 19-22, 2021. 
IEEE, 348\u2013359."},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052580"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2020.2995273"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3652601","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3652601","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:03:30Z","timestamp":1750291410000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3652601"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,26]]},"references-count":49,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,7,31]]}},"alternative-id":["10.1145\/3652601"],"URL":"https:\/\/doi.org\/10.1145\/3652601","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,26]]},"assertion":[{"value":"2023-06-07","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-03-06","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-04-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}