{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,4]],"date-time":"2026-03-04T19:30:09Z","timestamp":1772652609059,"version":"3.50.1"},"reference-count":100,"publisher":"Elsevier BV","issue":"4","license":[{"start":{"date-parts":[[2024,1,25]],"date-time":"2024-01-25T00:00:00Z","timestamp":1706140800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,1,25]],"date-time":"2024-01-25T00:00:00Z","timestamp":1706140800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Hessisches Ministerium f\u00fcr Digitale Strategie und Entwicklung"},{"DOI":"10.13039\/501100001664","name":"Leibniz-Gemeinschaft","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001664","id-type":"DOI","asserted-by":"publisher"}]},{"name":"DIPF | Leibniz-Institut f\u00fcr Bildungsforschung und Bildungsinformation"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Artif Intell Educ"],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Various studies empirically proved the value of highly informative feedback for enhancing learner success. However, digital educational technology has yet to catch up as automated feedback is often provided shallowly. This paper presents a case study on implementing a pipeline that provides German-speaking university students enrolled in an introductory-level educational psychology lecture with content-specific feedback for a lecture assignment. In the assignment, students have to discuss the usefulness and educational grounding (i.e., connection to working memory, metacognition or motivation) of ten learning tips presented in a video within essays. Through our system, students received feedback on the correctness of their solutions and content areas they needed to improve. For this purpose, we implemented a natural language processing pipeline with two steps: (1) segmenting the essays and (2) predicting codes from the resulting segments used to generate feedback texts. As training data for the model in each processing step, we used 689 manually labelled essays submitted by the previous student cohort. We then evaluated approaches based on\n                    <jats:italic>GBERT<\/jats:italic>\n                    ,\n                    <jats:italic>T5,<\/jats:italic>\n                    and bag-of-words baselines for scoring them. Both pipeline steps, especially the transformer-based models, demonstrated high performance. In the final step, we evaluated the feedback using a randomised controlled trial. The control group received\n                    <jats:italic>feedback as usual (essential feedback),<\/jats:italic>\n                    while the treatment group received\n                    <jats:italic>highly informative feedback<\/jats:italic>\n                    based on the natural language processing pipeline\n                    <jats:italic>.<\/jats:italic>\n                    We then used a six items long survey to test the perception of feedback. We conducted an ordinary least squares analysis to model these items as dependent variables, which showed that highly informative feedback had positive effects on helpfulness and reflection.\n                  <\/jats:p>","DOI":"10.1007\/s40593-023-00387-6","type":"journal-article","created":{"date-parts":[[2024,1,25]],"date-time":"2024-01-25T12:34:32Z","timestamp":1706186072000},"page":"1378-1416","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["From the Automated Assessment of Student Essay Content to Highly Informative Feedback: a Case Study"],"prefix":"10.1016","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5598-9547","authenticated-orcid":false,"given":"Sebastian","family":"Gombert","sequence":"first","affiliation":[]},{"given":"Aron","family":"Fink","sequence":"additional","affiliation":[]},{"given":"Tornike","family":"Giorgashvili","sequence":"additional","affiliation":[]},{"given":"Ioana","family":"Jivet","sequence":"additional","affiliation":[]},{"given":"Daniele","family":"Di Mitri","sequence":"additional","affiliation":[]},{"given":"Jane","family":"Yau","sequence":"additional","affiliation":[]},{"given":"Andreas","family":"Frey","sequence":"additional","affiliation":[]},{"given":"Hendrik","family":"Drachsler","sequence":"additional","affiliation":[]}],"member":"78","published-online":{"date-parts":[[2024,1,25]]},"reference":[{"key":"387_CR1","doi-asserted-by":"publisher","DOI":"10.1111\/jcal.12716","author":"A Ahmad","year":"2022","unstructured":"Ahmad, A., Schneider, J., Griffiths, D., Biedermann, D., Schiffner, D., Greller, W., & Drachsler, H. (2022). Connecting the dots \u2013 A literature review on learning analytics indicators from a learning design perspective. Journal of Computer Assisted Learning. https:\/\/doi.org\/10.1111\/jcal.12716","journal-title":"Journal of Computer Assisted Learning"},{"key":"387_CR2","unstructured":"Alobaidi, O. G., Crockett, K. A., O'Shea, J. D., & Jarad, T. M. (2013). Abdullah: An intelligent Arabic conversational tutoring system for modern Islamic education. In Proceedings of the World Congress on Engineering (Vol. 2)."},{"issue":"3","key":"387_CR3","doi-asserted-by":"publisher","first-page":"422","DOI":"10.3390\/psych3030030","volume":"3","author":"N Andersen","year":"2021","unstructured":"Andersen, N., & Zehner, F. (2021). shinyReCoR: A shiny application for automatically coding text responses using R. Psych, 3(3), 422\u2013446. https:\/\/doi.org\/10.3390\/psych3030030","journal-title":"Psych"},{"issue":"3","key":"387_CR4","doi-asserted-by":"publisher","first-page":"841","DOI":"10.1111\/jcal.12717","volume":"39","author":"N Andersen","year":"2022","unstructured":"Andersen, N., Zehner, F., & Goldhammer, F. (2022). Semi-automatic coding of open-ended text responses in large-scale assessments. Journal of Computer Assisted Learning, 39(3), 841\u2013854. https:\/\/doi.org\/10.1111\/jcal.12717","journal-title":"Journal of Computer Assisted Learning"},{"key":"387_CR5","doi-asserted-by":"publisher","DOI":"10.1007\/s40593-022-00323-0","author":"X Bai","year":"2022","unstructured":"Bai, X., & Stede, M. (2022). A survey of current machine learning approaches to student free-text evaluation for intelligent tutoring. International Journal of Artificial Intelligence in Education. https:\/\/doi.org\/10.1007\/s40593-022-00323-0","journal-title":"International Journal of Artificial Intelligence in Education"},{"key":"387_CR6","unstructured":"Barbaresi, A. (n.d.). German-NLP: Curated list of open-access\/open-source\/off-the-shelf resources and tools developed with a particular focus on German. GitHub. Retrieved September, 2023, from https:\/\/github.com\/adbar\/German-NLP"},{"key":"387_CR7","doi-asserted-by":"crossref","unstructured":"Beseiso, M., & Alzahrani, S. (2020). An empirical analysis of BERT embedding for automated essay scoring. International Journal of Advanced Computer Science and Applications, 11(10), 204-210.","DOI":"10.14569\/IJACSA.2020.0111027"},{"key":"387_CR8","doi-asserted-by":"publisher","unstructured":"Bexte, M., Horbach, A., & Zesch, T. (2022). Similarity-based content scoring - How to make S-BERT keep up with BERT. In Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2022) (pp. 118\u2013123). https:\/\/doi.org\/10.18653\/v1\/2022.bea-1.16","DOI":"10.18653\/v1\/2022.bea-1.16"},{"issue":"3","key":"387_CR9","doi-asserted-by":"publisher","first-page":"347","DOI":"10.1007\/BF00138871","volume":"32","author":"J Biggs","year":"1996","unstructured":"Biggs, J. (1996). Enhancing teaching through constructive alignment. Higher Education, 32(3), 347\u2013364.","journal-title":"Higher Education"},{"issue":"Jan","key":"387_CR10","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993\u20131022.","journal-title":"Journal of Machine Learning Research"},{"key":"387_CR11","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1007\/s40593-014-0026-8","volume":"25","author":"S Burrows","year":"2015","unstructured":"Burrows, S., Gurevych, I., & Stein, B. (2015). The eras and trends of automatic short answer grading. International Journal of Artificial Intelligence in Education, 25, 60\u2013117. https:\/\/doi.org\/10.1007\/s40593-014-0026-8","journal-title":"International Journal of Artificial Intelligence in Education"},{"issue":"1","key":"387_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1017\/S0022226798007294","volume":"35","author":"L Cahill","year":"1999","unstructured":"Cahill, L., & Gazdar, G. (1999). German noun inflection. Journal of Linguistics, 35(1), 1\u201342.","journal-title":"Journal of Linguistics"},{"key":"387_CR13","doi-asserted-by":"publisher","unstructured":"Camus, L., & Filighera, A. (2020). Investigating transformers for automatic short answer grading. In I. I. Bittencourt, M. Cukurova, K. Muldner, R. Luckin, & E. Mill\u00e1n (Eds.), Artificial intelligence in education (pp. 43\u201348). Springer International Publishing. https:\/\/doi.org\/10.1007\/978-3-030-52240-7_8","DOI":"10.1007\/978-3-030-52240-7_8"},{"key":"387_CR14","doi-asserted-by":"publisher","first-page":"100027","DOI":"10.1016\/j.caeai.2021.100027","volume":"2","author":"AP Cavalcanti","year":"2021","unstructured":"Cavalcanti, A. P., Barbosa, A., Carvalho, R., Freitas, F., Tsai, Y.-S., Ga\u0161evi\u0107, D., & Mello, R. F. (2021). Automatic feedback in online learning environments: A systematic literature review. Computers and Education: Artificial Intelligence, 2, 100027. https:\/\/doi.org\/10.1016\/j.caeai.2021.100027","journal-title":"Computers and Education: Artificial Intelligence"},{"key":"387_CR15","doi-asserted-by":"publisher","unstructured":"Chan, B., Schweter, S., & M\u00f6ller, T. (2020). German\u2019s next language model. In Proceedings of the 28th International Conference on Computational Linguistics (pp. 6811\u20136822). International Committee on Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/2020.coling-main.598","DOI":"10.18653\/v1\/2020.coling-main.598"},{"key":"387_CR16","doi-asserted-by":"crossref","unstructured":"Chen, H., & He, B. (2013). Automated essay scoring by maximizing human-machine agreement. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (pp. 1741\u20131752). Association for Computational Linguistics. https:\/\/aclanthology.org\/D13-1180\/","DOI":"10.18653\/v1\/D13-1180"},{"key":"387_CR17","unstructured":"Choi, F. (2000). Advances in domain independent linear text segmentation. In 1st Meeting of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics. https:\/\/aclanthology.org\/A00-2004\/"},{"key":"387_CR18","doi-asserted-by":"crossref","unstructured":"Clahsen, H., Marcus, G., Bartke, S., & Wiese, R. (1996). Compounding and inflection in German child language. Yearbook of morphology 1995, 115-142.","DOI":"10.1007\/978-94-017-3716-6_7"},{"key":"387_CR19","doi-asserted-by":"publisher","unstructured":"Cozma, M., Butnaru, A., & Ionescu, R. T. (2018). Automated essay scoring with string kernels\u00a0and word embeddings. In I. Gurevych & Y. Miyao (Eds.), Proceedings of the 56th Annual Meeting of the\u00a0Association for Computational Linguistics (Volume 2: Short Papers) (pp. 503\u2013509).\u00a0https:\/\/doi.org\/10.18653\/v1\/P18-2080","DOI":"10.18653\/v1\/P18-2080"},{"key":"387_CR20","doi-asserted-by":"publisher","unstructured":"Crossley, S. A., Roscoe, R. D., & McNamara, D. S. (2011). Predicting human scores of essay quality using computational indices of linguistic and textual features. In Proceedings of the 15th International Conference on Artificial Intelligence in Education, AIED 2011, Auckland, New Zealand, June 28 - July 2011 (pp. 438\u2013440). https:\/\/doi.org\/10.1007\/978-3-642-21869-9_62","DOI":"10.1007\/978-3-642-21869-9_62"},{"key":"387_CR21","doi-asserted-by":"publisher","unstructured":"Crossley, S. A., Allen, L. K., Snow, E. L., & McNamara, D. S. (2015). Pssst... textual features... there is more to automatic essay scoring than just you! In Proceedings of the Fifth International Conference on Learning Analytics and Knowledge (pp. 203\u2013207). https:\/\/doi.org\/10.1145\/2723576.2723595","DOI":"10.1145\/2723576.2723595"},{"key":"387_CR22","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1016\/j.compedu.2020.104094","volume":"162","author":"G Deeva","year":"2021","unstructured":"Deeva, G., Bogdanova, D., Serral, E., Snoeck, M., & De Weerdt, J. (2021). A review of automated feedback systems for learners: Classification framework, challenges and opportunities. Computers & Education, 162, 10. https:\/\/doi.org\/10.1016\/j.compedu.2020.104094","journal-title":"Computers & Education"},{"key":"387_CR23","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 4171\u20134186). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"387_CR24","unstructured":"Di Mitri, D., Gombert, S., & Karademir, O. (2022). Reflecting on the actionable components of a model for augmented feedback. In Proceedings of the Second International Workshop on Multimodal Immersive Learning Systems (MILeS 2022) at the Seventeenth European Conference on Technology Enhanced Learning (EC-TEL 2022) (pp. 45\u201350). CEUR Workshop Proceedings. http:\/\/ceur-ws.org\/Vol-3247\/paper8.pdf"},{"key":"387_CR25","doi-asserted-by":"crossref","unstructured":"Durrell, M. (2006). Germanic Languages. In Encyclopedia of Language & Linguistics (pp. 53\u201355). Elsevier.","DOI":"10.1016\/B0-08-044854-2\/02189-1"},{"key":"387_CR26","unstructured":"Dzikovska, M., Nielsen, R., Brew, C., Leacock, C., Giampiccolo, D., Bentivogli, L., Clark, P., Dagan, I., & Dang, H. (2013). SemEval-2013 Task 7: The joint student response analysis and 8th recognizing textual entailment challenge. In Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013) (pp. 263\u2013274). Association for Computational Linguistics."},{"key":"387_CR27","doi-asserted-by":"publisher","unstructured":"Filighera, A., Parihar, S., Steuer, T., Meuser, T., & Ochs, S. (2022). Your answer is incorrect... Would you like to know why? Introducing a bilingual short answer feedback dataset. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 8577\u20138591). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/2022.acl-long.587","DOI":"10.18653\/v1\/2022.acl-long.587"},{"issue":"1","key":"387_CR28","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1111\/emip.12537","volume":"42","author":"T Firoozi","year":"2023","unstructured":"Firoozi, T., Mohammadi, H., & Gierl, M. J. (2023). Using active learning methods to strategically select essays for automated scoring. Educational Measurement: Issues and Practice, 42(1), 34\u201343.","journal-title":"Educational Measurement: Issues and Practice"},{"issue":"1","key":"387_CR29","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1007\/s10648-018-9446-6","volume":"31","author":"CJ Fong","year":"2018","unstructured":"Fong, C. J., Patall, E. A., Vasquez, A. C., & Stautberg, S. (2018). A Meta-analysis of negative feedback on intrinsic motivation. Educational Psychology Review, 31(1), 121\u2013162. https:\/\/doi.org\/10.1007\/s10648-018-9446-6","journal-title":"Educational Psychology Review"},{"issue":"0","key":"387_CR30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/00461520.2022.2134135","volume":"0","author":"CJ Fong","year":"2023","unstructured":"Fong, C. J., & Schallert, D. L. (2023). \u201cFeedback to the future\u201d: Advancing motivational and emotional perspectives in feedback research. Educational Psychologist, 0(0), 1\u201316. https:\/\/doi.org\/10.1080\/00461520.2022.2134135","journal-title":"Educational Psychologist"},{"key":"387_CR31","doi-asserted-by":"crossref","unstructured":"Gold, A., & Hasselhorn, M. (2017). P\u00e4dagogische Psychologie (4th ed.) [PDF].\u00a0Kohlhammer.","DOI":"10.17433\/978-3-17-031977-6"},{"key":"387_CR32","doi-asserted-by":"publisher","unstructured":"Gombert, S., Di Mitri, D., Karademir, O., Kubsch, M., Kolbe, H., Tautz, S., \u2026, & Drachsler, H. (2022). Coding energy knowledge in constructed responses with explainable NLP models. Journal of Computer Assisted Learning, 39(3), 767\u2013786. https:\/\/doi.org\/10.1111\/jcal.12767","DOI":"10.1111\/jcal.12767"},{"key":"387_CR33","first-page":"131","volume":"2021","author":"S Gr\u00fcnewald","year":"2021","unstructured":"Gr\u00fcnewald, S., Friedrich, A., & Kuhn, J. (2021). Applying Occam\u2019s razor to transformer-based dependency parsing: What works, what doesn\u2019t, and what is really necessary. IWPT, 2021, 131\u2013140.","journal-title":"IWPT"},{"issue":"3","key":"387_CR34","doi-asserted-by":"publisher","first-page":"464","DOI":"10.1080\/01690960701702035","volume":"23","author":"P Gygax","year":"2008","unstructured":"Gygax, P., Gabriel, U., Sarrasin, O., Oakhill, J., & Garnham, A. (2008). Generically intended, but specifically interpreted: When beauticians, musicians, and mechanics are all men. Language and Cognitive Processes, 23(3), 464\u2013485.","journal-title":"Language and Cognitive Processes"},{"key":"387_CR35","unstructured":"Haiden, M. (1997). Verbal Inflection and the Structure of IP in German. GAGL: Groninger Arbeiten zur germanistischen Linguistik, (41), 77\u2013106."},{"key":"387_CR36","volume-title":"Visible learning: A synthesis of 800+ meta-analyses on achievement","author":"JA Hattie","year":"2009","unstructured":"Hattie, J. A. (2009). Visible learning: A synthesis of 800+ meta-analyses on achievement. Routledge."},{"key":"387_CR37","doi-asserted-by":"publisher","unstructured":"Hattie, J., & Timperley, H. (2007). The Power of Feedback. Review of Educational Research, 77(1), 81\u2013112. https:\/\/doi.org\/10.3102\/003465430298487","DOI":"10.3102\/003465430298487"},{"key":"387_CR38","unstructured":"Hearst, M. A. (1997). Text tiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics, 23(1), 33\u201364. https:\/\/aclanthology.org\/J97-1003.pdf"},{"key":"387_CR39","doi-asserted-by":"publisher","unstructured":"Ho, T. K. (1995). Random decision forests. In Proceedings of the 3rd International Conference on Document Analysis and Recognition (Vol. 1, pp. 278\u2013282). IEEE. https:\/\/doi.org\/10.1109\/ICDAR.1995.598994","DOI":"10.1109\/ICDAR.1995.598994"},{"key":"387_CR40","doi-asserted-by":"publisher","unstructured":"Horbach, A., Scholten-Akoun, D., Ding, Y., & Zesch, T. (2017). Fine-grained essay scoring of a complex writing task for native speakers. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/w17-5040","DOI":"10.18653\/v1\/w17-5040"},{"key":"387_CR41","doi-asserted-by":"crossref","unstructured":"Horbach, A., Laarmann-Quante, R., Liebenow, L., Jansen, T., Keller, S., Meyer, J., \u2026, & Fleckenstein, J. (2022). Bringing automatic scoring into the classroom - measuring the impact of automated analytic feedback on student writing performance. In Proceedings of the 11th Workshop on NLP for Computer Assisted Language Learning (pp. 72\u201383). https:\/\/aclanthology.org\/2022.nlp4call-1.8","DOI":"10.3384\/ecp190008"},{"key":"387_CR42","unstructured":"It\u2019s Leena. (2017). Zehn Lerntipps f\u00fcr die Schule [Video]. YouTube. Retrieved January, 2023, from https:\/\/www.youtube.com\/watch?v=wqCe5KQqhxs"},{"key":"387_CR43","doi-asserted-by":"publisher","unstructured":"Ke, Z., & Ng, V. (2019). Automated essay scoring: A survey of the state of the art. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (pp. 6300\u20136308). https:\/\/doi.org\/10.24963\/ijcai.2019\/879","DOI":"10.24963\/ijcai.2019\/879"},{"key":"387_CR44","unstructured":"Klie, J.-C., Bugert, M., Boullosa, B., Eckart de Castilho, R., & Gurevych, I. (2018). The INCEpTION platform: Machine-assisted and knowledge-oriented interactive annotation. In Proceedings of System Demonstrations of the 27th International Conference on Computational Linguistics (pp. 9\u201313). International Committee on Computational Linguistics. https:\/\/aclanthology.org\/C18-2002"},{"key":"387_CR45","doi-asserted-by":"publisher","unstructured":"Klie, J.-C., & Eckart de Castilho, R. (n.d.).DKPro Cassis - Reading and Writing UIMA CAS Files in Python. Retrieved August 2023, from https:\/\/doi.org\/10.5281\/zenodo.3994108","DOI":"10.5281\/zenodo.3994108"},{"key":"387_CR46","doi-asserted-by":"publisher","DOI":"10.36198\/9783838538952","volume-title":"Psychologie des Unterrichts","author":"M Kunter","year":"2013","unstructured":"Kunter, M., & Trautwein, U. (2013). Psychologie des Unterrichts. Stuttgart, Germany: UTB."},{"key":"387_CR47","unstructured":"Landauer, T. K., Laham, D., Foltz., P. (2003). Automated scoring and annotation of essays with the intelligent essay assessor. In Shermis, M. D., & Burstein, J. (Eds.): Automated essay scoring: A cross-disciplinary perspective. Routledge."},{"issue":"2\u20133","key":"387_CR48","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1080\/01638539809545028","volume":"25","author":"TK Landauer","year":"1998","unstructured":"Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 25(2\u20133), 259\u2013284. https:\/\/doi.org\/10.1080\/01638539809545028","journal-title":"Discourse Processes"},{"key":"387_CR49","doi-asserted-by":"publisher","unstructured":"Li, J., & Hovy, E. (2014). A model of coherence based on distributed sentence representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (pp. 2039\u20132048). Association for Computational Linguistics. https:\/\/doi.org\/10.3115\/v1\/D14-1218","DOI":"10.3115\/v1\/D14-1218"},{"key":"387_CR50","doi-asserted-by":"publisher","unstructured":"Li, J., & Jurafsky, D. (2017). Neural net models of open-domain discourse coherence. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (pp. 198\u2013209). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/D17-1019","DOI":"10.18653\/v1\/D17-1019"},{"issue":"3","key":"387_CR51","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1080\/02602938.2020.1782831","volume":"46","author":"LA Lim","year":"2021","unstructured":"Lim, L. A., Dawson, S., Ga\u0161evi\u0107, D., Joksimovi\u0107, S., Pardo, A., Fudge, A., & Gentili, S. (2021). Students\u2019 perceptions of, and emotional responses to, personalised learning analytics-based feedback: An exploratory study of four courses. Assessment & Evaluation in Higher Education, 46(3), 339\u2013359.","journal-title":"Assessment & Evaluation in Higher Education"},{"key":"387_CR52","unstructured":"Livingston, S. A. (2009). Constructed-response test questions: Why we use them; How we score them. In R&D Connections. Number 11. Educational Testing Service. https:\/\/eric.ed.gov\/?id=ED507802"},{"key":"387_CR53","unstructured":"Loshchilov, I., & Hutter, F. (2019). Decoupled weight decay regularization. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6\u20139, 2019. OpenReview.net."},{"key":"387_CR54","first-page":"512","volume-title":"Advances in neural information processing systems 12","author":"L Mason","year":"1999","unstructured":"Mason, L., Baxter, J., Bartlett, P., & Frean, M. (1999). Boosting algorithms as gradient descent. In S. A. Solla, T. K. Leen, & K.-R. M\u00fcller (Eds.), Advances in neural information processing systems 12 (pp. 512\u2013518). MIT Press."},{"key":"387_CR55","doi-asserted-by":"publisher","unstructured":"Mayfield, E., & Black, A. W. (2020). Should you fine-tune BERT for automated essay scoring? In Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications (pp. 155\u2013164). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/2020.bea-1.15","DOI":"10.18653\/v1\/2020.bea-1.15"},{"key":"387_CR56","doi-asserted-by":"crossref","unstructured":"McDonald, J., Knott, A., Stein, S., & Zeng, R. (2013). An empirically-based, tutorial dialogue system: Design, implementation and evaluation in a first year health sciences course. In ASCILITE-Australian Society for Computers in Learning in Tertiary Education Annual Conference (pp. 562\u2013572). Australasian Society for Computers in Learning in Tertiary Education.","DOI":"10.14742\/apubs.2013.1355"},{"issue":"3","key":"387_CR57","doi-asserted-by":"publisher","first-page":"276","DOI":"10.11613\/BM.2012.031","volume":"22","author":"ML McHugh","year":"2012","unstructured":"McHugh, M. L. (2012). Interrater reliability: The kappa statistic. Biochemia Medica, 22(3), 276\u2013282.","journal-title":"Biochemia Medica"},{"key":"387_CR58","doi-asserted-by":"publisher","unstructured":"McNamara, D. S., Crossley, S. A., Roscoe, R. D., Allen, L. K., & Dai, J. (2015). A hierarchical classification approach to automated essay scoring. Assessing Writing (Vol. 23, pp. 35\u201359). Elsevier BV. https:\/\/doi.org\/10.1016\/j.asw.2014.09.002","DOI":"10.1016\/j.asw.2014.09.002"},{"issue":"4","key":"387_CR59","doi-asserted-by":"publisher","first-page":"528","DOI":"10.1016\/j.ipm.2010.11.008","volume":"47","author":"H Misra","year":"2011","unstructured":"Misra, H., Yvon, F., Capp\u00e9, O., & Jose, J. (2011). Text segmentation: A topic modeling perspective. Information Processing & Management, 47(4), 528\u2013544. https:\/\/doi.org\/10.1016\/j.ipm.2010.11.008","journal-title":"Information Processing & Management"},{"issue":"1","key":"387_CR60","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1080\/10494820.2018.1558257","volume":"29","author":"E Mousavinasab","year":"2021","unstructured":"Mousavinasab, E., Zarifsanaiey, N., Niakan Kalhori, S. R., Rakhshan, M., Keikha, L., & Ghazi Saeedi, M. (2021). Intelligent tutoring systems: A systematic review of characteristics, applications, and evaluation methods. Interactive Learning Environments, 29(1), 142\u2013163. https:\/\/doi.org\/10.1080\/10494820.2018.1558257","journal-title":"Interactive Learning Environments"},{"key":"387_CR61","doi-asserted-by":"publisher","DOI":"10.1007\/s11251-020-09525-2","author":"V Nachtigall","year":"2020","unstructured":"Nachtigall, V., Serova, K., & Rummel, N. (2020). When failure fails to be productive: Probing the effectiveness of productive failure for learning beyond stem domains. Instructional Science. https:\/\/doi.org\/10.1007\/s11251-020-09525-2","journal-title":"Instructional Science"},{"key":"387_CR62","first-page":"125","volume-title":"Handbook of research on educational communications and technology","author":"S Narciss","year":"2008","unstructured":"Narciss, S. (2008). Feedback strategies for interactive learning tasks. In J. M. Spector, M. D. Merrill, J. van Merri\u00ebnboer, & D. M. Driscoll (Eds.), Handbook of research on educational communications and technology (pp. 125\u2013144). Routledge."},{"key":"387_CR63","first-page":"1","volume-title":"Metacognition. Knowing about knowing","author":"TO Nelson","year":"1994","unstructured":"Nelson, T. O., & Narens, L. (1994). Why investigate Metacognition. In J. Metcalfe & A. P. Shimamura (Eds.), Metacognition. Knowing about knowing (pp. 1\u201325). Cambridge University Press."},{"key":"387_CR64","doi-asserted-by":"publisher","unstructured":"Nguyen, H., & Litman, D. (2018). Argument mining for improving the automated scoring of persuasive essays. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence. https:\/\/doi.org\/10.1609\/aaai.v32i1.12046","DOI":"10.1609\/aaai.v32i1.12046"},{"issue":"2","key":"387_CR65","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1080\/03075070600572090","volume":"31","author":"DJ Nicol","year":"2006","unstructured":"Nicol, D. J., & Macfarlane-Dick, D. (2006). Formative assessment and self-regulated learning: A model and seven principles of good feedback practice. Studies in Higher Education, 31(2), 199\u2013218. https:\/\/doi.org\/10.1080\/03075070600572090","journal-title":"Studies in Higher Education"},{"issue":"3","key":"387_CR66","first-page":"44","volume":"34","author":"D N\u00fcbling","year":"2018","unstructured":"N\u00fcbling, D. (2018). Und ob das Genus mit dem Sexus. Genus verweist nicht nur auf Geschlecht, sondern auch auf die Geschlechterordnung. Sprachreport, 34(3), 44\u201350.","journal-title":"Sprachreport"},{"key":"387_CR67","unstructured":"Ortmann, K., Roussel, A., & Dipper, S. (2019). Evaluating Off-the-Shelf NLP Tools for German. Proceedings of the 15th Conference on Natural Language Processing, KONVENS 2019, Erlangen, Germany, October 9-11, 2019. https:\/\/corpora.linguistik.uni-erlangen.de\/data\/konvens\/proceedings\/papers\/KONVENS2019_paper_55.pdf"},{"key":"387_CR68","unstructured":"Page, E. B. (1967). Grading essays by computer: Progress report. In Proceedings of the Invitational Conference on Testing Problems (pp. 87\u2013100)."},{"key":"387_CR69","doi-asserted-by":"crossref","unstructured":"Pardo, A., Bartimote, K., Shum, S. B., Dawson, S., Gao, J., Ga\u0161evi\u0107, D., ..., & Vigentini, L. (2018). OnTask: Delivering data-informed, personalized learning support actions. Journal of Learning Analytics, 5(3), 235\u2013249.","DOI":"10.18608\/jla.2018.53.15"},{"key":"387_CR70","unstructured":"Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., \u2026, & Duchesnay, \u00c9. (2011). Scikit-learn: Machine learning in python. Journal of Machine Learning Research, 12, 2825\u20132830."},{"key":"387_CR71","doi-asserted-by":"publisher","unstructured":"Phandi, P., Chai, K. M. A., & Ng, H. T. (2015). Flexible Domain Adaptation for Automated Essay Scoring Using Correlated Linear Regression. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp. 446\u2013452). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/d15-1049","DOI":"10.18653\/v1\/d15-1049"},{"issue":"1","key":"387_CR72","first-page":"73","volume":"24","author":"B Pohlmann","year":"2010","unstructured":"Pohlmann, B., & M\u00f6ller, J. (2010). Fragebogen zur Erfassung der Motivation f\u00fcr die Wahl des Lehramtsstudiums (FEMOLA). Zeitschrift f\u00fcr p\u00e4dagogische Psychologie, 24(1), 73\u201384.","journal-title":"Zeitschrift f\u00fcr p\u00e4dagogische Psychologie"},{"key":"387_CR73","doi-asserted-by":"publisher","unstructured":"Proisl, T., & Uhrig, P. (2016). SoMaJo: State-of-the-art tokenization for German web and social media texts. In Proceedings of the 10th web as corpus workshop (pp. 75\u201378). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/w16-2607","DOI":"10.18653\/v1\/w16-2607"},{"key":"387_CR74","first-page":"6639","volume-title":"Advances in neural information processing systems 31","author":"L Prokhorenkova","year":"2018","unstructured":"Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: Unbiased boosting with categorical features. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, & R. Garnett (Eds.), Advances in neural information processing systems 31 (pp. 6639\u20136649). Curran Associates Inc."},{"key":"387_CR75","doi-asserted-by":"publisher","unstructured":"Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C. D. (2020). Stanza: A python natural language processing toolkit for many human languages. In Proceedings of the 58th annual meeting of the association for computational linguistics: System demonstrations (pp. 101\u2013108). https:\/\/doi.org\/10.18653\/v1\/2020.acl-demos.14","DOI":"10.18653\/v1\/2020.acl-demos.14"},{"key":"387_CR76","unstructured":"Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., \u2026, & Liu, P. J. (2020). Exploring the Limits of transfer learning with a unified text-to-text transformer. Journal of Machinge Learning Research, 21(1), 1-67."},{"key":"387_CR77","doi-asserted-by":"publisher","unstructured":"Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using siamese BERT-networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3982\u20133992. https:\/\/doi.org\/10.18653\/v1\/D19-1410","DOI":"10.18653\/v1\/D19-1410"},{"key":"387_CR78","unstructured":"Riedl, M., & Biemann, C. (2012). TopicTiling: A text segmentation algorithm based on LDA. In Proceedings of ACL 2012 Student Research Workshop (pp. 37\u201342). Association for Computational Linguistics. https:\/\/aclanthology.org\/W12-3307"},{"key":"387_CR79","doi-asserted-by":"publisher","unstructured":"Riedl, M., & Biemann, C. (2016). Unsupervised compound splitting with distributional semantics rivals supervised methods. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 617\u2013622. https:\/\/doi.org\/10.18653\/v1\/N16-1075","DOI":"10.18653\/v1\/N16-1075"},{"key":"387_CR80","doi-asserted-by":"publisher","unstructured":"Rodriguez, P. U., Jafari, A., & Ormerod, C. M. (2019). Language models and automated essay scoring. arXiv preprint arXiv:1909.09482. https:\/\/doi.org\/10.48550\/arXiv.1909.09482","DOI":"10.48550\/arXiv.1909.09482"},{"issue":"1","key":"387_CR81","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1037\/0003-066X.55.1.68","volume":"55","author":"RM Ryan","year":"2000","unstructured":"Ryan, R. M., & Deci, E. L. (2000). Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being. American Psychologist, 55(1), 68. https:\/\/doi.org\/10.1037\/0003-066X.55.1.68","journal-title":"American Psychologist"},{"key":"387_CR82","unstructured":"Sansone, C., & Harackiewicz, J. M. (Eds.). (2000). Intrinsic and extrinsic\u00a0motivation: The search for optimal motivation and performance. Academic Press."},{"issue":"2","key":"387_CR83","doi-asserted-by":"publisher","first-page":"265","DOI":"10.18608\/jla.2022.7643","volume":"9","author":"M Schmitz","year":"2022","unstructured":"Schmitz, M., Scheffel, M., Bemelmans, R., & Drachsler, H. (2022). FoLA2 \u2014 a method for co-creating learning analytics-supported learning design. Journal of Learning Analytics, 9(2), 265\u2013281. https:\/\/doi.org\/10.18608\/jla.2022.7643","journal-title":"Journal of Learning Analytics"},{"key":"387_CR84","doi-asserted-by":"crossref","unstructured":"Seabold, S., & Perktold, J. (2010). Statsmodels: Econometric and statistical modeling with python. In Proceedings of the 9th python in science conference (Vol. 57, No. 61, pp. 10\u201325080).","DOI":"10.25080\/Majora-92bf1922-011"},{"key":"387_CR85","unstructured":"Shazeer, N., & Stern, M. (2018). Adafactor: Adaptive learning rates with sublinear memory cost. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsm\u00e4ssan, Stockholm, Sweden, July 10\u201315, 2018 (pp. 4603\u20134611). PMLR."},{"key":"387_CR86","doi-asserted-by":"publisher","unstructured":"Song, X., Salcianu, A., Song, Y., Dopson, D., & Zhou, D. (2021). Fast WordPiece tokenization. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2089\u20132103. https:\/\/doi.org\/10.18653\/v1\/2021.emnlp-main.160","DOI":"10.18653\/v1\/2021.emnlp-main.160"},{"key":"387_CR87","doi-asserted-by":"publisher","unstructured":"Sung, C., Dhamecha, T., Saha, S., Ma, T., Reddy, V., & Arora, R. (2019). Pre-training BERT on domain resources for short answer grading. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 6071\u20136075). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/D19-1628","DOI":"10.18653\/v1\/D19-1628"},{"key":"387_CR88","doi-asserted-by":"publisher","unstructured":"Taghipour, K., & Ng, H. T. (2016). A neural approach to automated essay scoring. In Proceedings of the 2016 Conference on empirical methods in natural language processing (pp. 1882\u20131891). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/d16-1193","DOI":"10.18653\/v1\/d16-1193"},{"key":"387_CR89","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1007\/s10459-012-9401-0","volume":"18","author":"OTJ ten Cate","year":"2013","unstructured":"ten Cate, O. T. J. (2013). Why receiving feedback collides with self determination. Advances in Health Sciences Education, 18, 845\u2013849. https:\/\/doi.org\/10.1007\/s10459-012-9401-0","journal-title":"Advances in Health Sciences Education"},{"key":"387_CR90","doi-asserted-by":"publisher","unstructured":"Tenney, I., Das, D., & Pavlick, E. (2019). BERT rediscovers the classical NLP pipeline. In Proceedings of the 57th annual meeting of the association for computational linguistics (pp. 4593\u20134601). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/p19-1452","DOI":"10.18653\/v1\/p19-1452"},{"issue":"1","key":"387_CR91","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1007\/s40593-017-0142-3","volume":"28","author":"S Vajjala","year":"2018","unstructured":"Vajjala, S. (2018). Automated assessment of non-native learner essays: Investigating the role of linguistic features. International Journal of Artificial Intelligence in Education, 28(1), 79\u2013105. https:\/\/doi.org\/10.1007\/s40593-017-0142-3","journal-title":"International Journal of Artificial Intelligence in Education"},{"key":"387_CR92","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., \u2026, & Polosukhin, I. (2017). Attention is all you need. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, & R. Garnett (Eds.), Advances in Neural Information Processing Systems (Vol. 30) (pp. 1-11)."},{"key":"387_CR93","first-page":"297","volume-title":"Motivation and self-regulated learning: Theory, research, and applications","author":"PH Winne","year":"2008","unstructured":"Winne, P. H., & Hadwin, A. F. (2008). The weave of motivation and self-regulated learning. In D. H. Schunk & B. J. Zimmerman (Eds.), Motivation and self-regulated learning: Theory, research, and applications (pp. 297\u2013314). Lawrence Erlbaum Associates Publishers."},{"key":"387_CR94","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3389\/fpsyg.2019.03087","volume":"10","author":"B Wisniewski","year":"2020","unstructured":"Wisniewski, B., Zierer, K., & Hattie, J. (2020). The power of feedback revisited: A meta-analysis of educational feedback research. Frontiers in Psychology, 10, 1\u201314. https:\/\/doi.org\/10.3389\/fpsyg.2019.03087","journal-title":"Frontiers in Psychology"},{"key":"387_CR95","doi-asserted-by":"publisher","unstructured":"Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C., Moi, A., \u2026, & Rush, A. (2020). Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 conference on empirical methods in natural language processing: System demonstrations (pp. 38\u201345). Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/2020.emnlp-demos.6","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"387_CR96","unstructured":"Yannakoudakis, H., Briscoe, T., & Medlock, B. (2011). A new dataset and method for automatically grading ESOL texts. In Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies (pp. 180\u2013189). Association for Computational Linguistics. https:\/\/aclanthology.org\/P11-1019"},{"key":"387_CR97","unstructured":"Yannakoudakis, H., & Briscoe, T. (2012). Modeling coherence in ESOL learner texts. In Proceedings of the seventh workshop on building educational applications using NLP (pp. 33\u201343). Association for Computational Linguistics. https:\/\/aclanthology.org\/W12-2004"},{"key":"387_CR98","doi-asserted-by":"publisher","unstructured":"Zehe, A., Konle, L., D\u00fcmpelmann, L. K., Gius, E., Hotho, A., Jannidis, F., \u2026, & Wiedmer, N. (2021). Detecting scenes in fiction: A new segmentation task. In Proceedings of the 16th conference of the European chapter of the association for computational linguistics: Main volume (pp. 3167\u20133177). https:\/\/doi.org\/10.18653\/v1\/2021.eacl-main.276","DOI":"10.18653\/v1\/2021.eacl-main.276"},{"key":"387_CR99","doi-asserted-by":"publisher","unstructured":"Zesch, T., Wojatzki, M., & Scholten-Akoun, D. (2015). Task-independent features for automated essay grading. In Proceedings of the tenth workshop on innovative use of NLP for building educational applications (pp. 224\u2013232). Association for Computational Linguistics. https:\/\/doi.org\/10.3115\/v1\/w15-0626","DOI":"10.3115\/v1\/w15-0626"},{"key":"387_CR100","doi-asserted-by":"publisher","unstructured":"Zesch, T., Horbach, A., & Zehner, F. (2023). To score or not to score: Factors influencing performance and feasibility of automatic content scoring of text responses. Educational measurement: Issues and practice. Wiley. https:\/\/doi.org\/10.1111\/emip.12544","DOI":"10.1111\/emip.12544"}],"container-title":["International Journal of Artificial Intelligence in Education"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40593-023-00387-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40593-023-00387-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40593-023-00387-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,4]],"date-time":"2026-03-04T18:12:23Z","timestamp":1772647943000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40593-023-00387-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,25]]},"references-count":100,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["387"],"URL":"https:\/\/doi.org\/10.1007\/s40593-023-00387-6","relation":{},"ISSN":["1560-4292","1560-4306"],"issn-type":[{"value":"1560-4292","type":"print"},{"value":"1560-4306","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,25]]},"assertion":[{"value":"5 December 2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 January 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no relevant financial or non-financial interests to disclose.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicting Interests"}}]}}