{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T02:32:47Z","timestamp":1769913167135,"version":"3.49.0"},"reference-count":41,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2021,7,9]],"date-time":"2021-07-09T00:00:00Z","timestamp":1625788800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Nowadays, the importance of educational data mining and learning analytics in higher education institutions is being recognised. The analysis of university careers and of student dropout prediction is one of the most studied topics in the area of learning analytics. From the perspective of estimating the likelihood of a student dropping out, we propose an innovative statistical method that is a generalisation of mixed-effects trees for a response variable in the exponential family: generalised mixed-effects trees (GMET). We performed a simulation study in order to validate the performance of our proposed method and to compare GMET to classical models. In the case study, we applied GMET to model undergraduate student dropout in different courses at Politecnico di Milano. The model was able to identify discriminating student characteristics and estimate the effect of each degree-based course on the probability of student dropout.<\/jats:p>","DOI":"10.3390\/data6070074","type":"journal-article","created":{"date-parts":[[2021,7,12]],"date-time":"2021-07-12T00:23:36Z","timestamp":1626049416000},"page":"74","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Performing Learning Analytics via Generalised Mixed-Effects Trees"],"prefix":"10.3390","volume":"6","author":[{"given":"Luca","family":"Fontana","sequence":"first","affiliation":[{"name":"MOX\u2014 Laboratory for Modeling and Scientific Computing, Department of Mathematics, The Polytechnic University of Milan, 20133 Milan, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9208-3194","authenticated-orcid":false,"given":"Chiara","family":"Masci","sequence":"additional","affiliation":[{"name":"MOX\u2014 Laboratory for Modeling and Scientific Computing, Department of Mathematics, The Polytechnic University of Milan, 20133 Milan, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0165-1983","authenticated-orcid":false,"given":"Francesca","family":"Ieva","sequence":"additional","affiliation":[{"name":"MOX\u2014 Laboratory for Modeling and Scientific Computing, Department of Mathematics, The Polytechnic University of Milan, 20133 Milan, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8253-3630","authenticated-orcid":false,"given":"Anna Maria","family":"Paganoni","sequence":"additional","affiliation":[{"name":"MOX\u2014 Laboratory for Modeling and Scientific Computing, Department of Mathematics, The Polytechnic University of Milan, 20133 Milan, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2021,7,9]]},"reference":[{"key":"ref_1","unstructured":"SPEETproject (2020, May 05). SPEET, Proposal for Strategic Partnerships (Proposal Narrative). Available online: https:\/\/www.speet-project.com\/the-project."},{"key":"ref_2","unstructured":"Barbu, M., Vilanova, R., Lopez Vicario, J., Pereira, M.J., Alves, P., Podpdora, M., \u00c1ngel Prada, M., Mor\u00e1n, A., Torreburno, A., and Marin, S. (2017). Data mining tool for academic data exploitation: Literature review and first architecture proposal. Projecto SPEET-Student Profile for Enhancing Engineering Tutoring, IEEE Access."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1109\/TSMCC.2010.2053532","article-title":"Educational data mining: A review of the state of the art","volume":"40","author":"Romero","year":"2010","journal-title":"IEEE Trans. Syst. Man Cybern. Part C Appl. Rev."},{"key":"ref_4","unstructured":"Bock, R.D. (2014). Multilevel Analysis of Educational Data, Elsevier."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Goldstein, H. (2011). Multilevel Statistical Models, John Wiley & Sons.","DOI":"10.1002\/9780470973394"},{"key":"ref_6","unstructured":"Agresti, A. (2018). An Introduction to Categorical Data Analysis, Wiley."},{"key":"ref_7","unstructured":"Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. (1984). Classification and Regression Trees, The Wadsworth Statistics and Probability Series, Wadsworth International Group."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1007\/s10994-011-5258-3","article-title":"RE-EM trees: A data mining approach for longitudinal and clustered data","volume":"86","author":"Sela","year":"2012","journal-title":"Mach. Learn."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1016\/j.spl.2010.12.003","article-title":"Mixed effects regression trees for clustered data","volume":"81","author":"Hajjem","year":"2011","journal-title":"Stat. Probab. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.spl.2017.02.033","article-title":"Generalized mixed effects regression trees","volume":"126","author":"Hajjem","year":"2017","journal-title":"Stat. Probab. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2016","DOI":"10.3758\/s13428-017-0971-x","article-title":"Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees","volume":"50","author":"Fokkema","year":"2018","journal-title":"Behav. Res. Methods"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/03610918.2018.1490429","article-title":"BiMM tree: A decision tree method for modeling clustered and longitudinal binary outcomes","volume":"Volume 49","author":"Speiser","year":"2020","journal-title":"Communications in Statistics-Simulation and Computation"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"492","DOI":"10.1198\/106186008X319331","article-title":"Model-based recursive partitioning","volume":"17","author":"Zeileis","year":"2008","journal-title":"J. Comput. Graph. Stat."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1353\/rhe.1990.0020","article-title":"Exploring the effects of ability to pay on persistence in college","volume":"13","author":"Cabrera","year":"1990","journal-title":"Rev. High. Educ."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1007\/BF01730115","article-title":"The nexus between college choice and persistence","volume":"37","author":"John","year":"1996","journal-title":"Res. High. Educ."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1080\/00221546.1980.11780030","article-title":"Predicting freshman persistence and voluntary dropout decisions from a theoretical model","volume":"51","author":"Pascarella","year":"1980","journal-title":"J. High. Educ."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1007\/BF02214313","article-title":"Dropouts from higher education: An interdisciplinary review and synthesis","volume":"1","author":"Spady","year":"1970","journal-title":"Interchange"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"89","DOI":"10.3102\/00346543045001089","article-title":"Dropout from higher education: A theoretical synthesis of recent research","volume":"45","author":"Tinto","year":"1975","journal-title":"Rev. Educ. Res."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1080\/00313831.2018.1476407","article-title":"Identifying problematic study progression and \u201cat-risk\u201d students in higher education in Finland","volume":"63","author":"Korhonen","year":"2019","journal-title":"Scand. J. Educ. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"200","DOI":"10.1177\/0004944117712310","article-title":"Using predictive analytics to target and improve first year student attrition","volume":"61","author":"Seidel","year":"2017","journal-title":"Aust. J. Educ."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2096","DOI":"10.1080\/03075079.2018.1496408","article-title":"The determinants of academic performance: Evidence from a Cambodian university","volume":"44","author":"Sothan","year":"2019","journal-title":"Stud. High. Educ."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"567","DOI":"10.1007\/s10758-019-09408-7","article-title":"Factors affecting students\u2019 performance in higher education: A systematic review of predictive data mining techniques","volume":"24","author":"Saa","year":"2019","journal-title":"Technol. Knowl. Learn."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1195","DOI":"10.1007\/s10639-016-9485-x","article-title":"Educational data mining acceptance among undergraduate students","volume":"22","author":"Wook","year":"2017","journal-title":"Educ. Inf. Technol."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Tampakas, V., Livieris, I.E., Pintelas, E., Karacapilidis, N., and Pintelas, P. (2018, January 20\u201322). Prediction of students\u2019 graduation time using a two-level classification algorithm. Proceedings of the International Conference on Technology and Innovation in Learning, Teaching and Education, Thessaloniki, Greece.","DOI":"10.1007\/978-3-030-20954-4_42"},{"key":"ref_25","unstructured":"Sanyal, D., Bosch, N., and Paquette, L. (2020). Feature Selection Metrics: Similarities, Differences, and Characteristics of the Selected Models, International Educational Data Mining Society."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.17485\/ijst\/2016\/v9i4\/87032","article-title":"Predictive modeling of student dropout indicators in educational data mining using improved decision tree","volume":"9","author":"Sivakumar","year":"2016","journal-title":"Indian J. Sci. Technol."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1080\/01587919.2013.793642","article-title":"Application of the classification tree model in predicting learner dropout behaviour in open and distance learning","volume":"34","author":"Yasmin","year":"2013","journal-title":"Distance Educ."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Abu-Oda, G.S., and El-Halees, A.M. (2015). Data mining in higher education: University student dropout case study. Int. J. Data Min. Knowl. Manag. Process, 5.","DOI":"10.5121\/ijdkp.2015.5102"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Meedech, P., Iam-On, N., and Boongoen, T. (2016). Prediction of student dropout using personal profile and data mining approach. Intelligent and Evolutionary Systems, Springer.","DOI":"10.1007\/978-3-319-27000-5_12"},{"key":"ref_30","unstructured":"Team, R.C. (2014). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Searle, S.R., and McCulloch, C.E. (2001). Generalized, Linear, and Mixed Models, Wiley.","DOI":"10.1002\/9780470057339.vag009"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"McCullagh, P., and Nelder, J. (2019). Generalized Linear Models, Taylor & Francis Group.","DOI":"10.1201\/9780203753736"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Friedman, J., Hastie, T., and Tibshirani, R. (2001). The Elements of Statistical Learning, Springer.","DOI":"10.1007\/978-0-387-21606-5"},{"key":"ref_34","unstructured":"Therneau, T., Atkinson, B., and Ripley, B. (2016, April 20). Rpart: Recursive Partitioning and Regression Trees (R Package). Available online: cran.ma.ic.ac.uk\/web\/packages\/rpart\/rpart.pdf."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Bates, D., M\u00e4chler, M., Bolker, B., and Walker, S. (2014). Fitting linear mixed-effects models using lme4. arXiv.","DOI":"10.18637\/jss.v067.i01"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1177\/1471082X0100100302","article-title":"A multivariate generalized linear mixed model for joint modelling of clustered outcomes in the exponential family","volume":"1","author":"Gueorguieva","year":"2001","journal-title":"Stat. Model."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"020033","DOI":"10.1063\/1.4979449","article-title":"A comparative study of approximation methods for maximum likelihood estimation in generalized linear mixed models (GLMM)","volume":"Volume 1827","author":"Handayani","year":"2017","journal-title":"Proceedings of the AIP Conference"},{"key":"ref_38","unstructured":"Pinheiro, J., and Bates, D. (2006). Mixed-Effects Models in S and S-PLUS, Springer Science & Business Media."},{"key":"ref_39","first-page":"223","article-title":"Partitioning variation in multilevel models","volume":"1","author":"Goldstein","year":"2002","journal-title":"Underst. Stat. Stat. Issues Psychol. Educ. Soc. Sci."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1111\/j.1467-985X.2004.00365.x","article-title":"Variance partitioning in multilevel logistic models that exhibit overdispersion","volume":"168","author":"Browne","year":"2005","journal-title":"J. R. Stat. Soc. Ser. A Stat. Soc."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Pintelas, E., Livieris, I.E., and Pintelas, P. (2020). A grey-box ensemble model exploiting black-box accuracy and white-box intrinsic interpretability. Algorithms, 13.","DOI":"10.3390\/a13010017"}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/6\/7\/74\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:28:38Z","timestamp":1760164118000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/6\/7\/74"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,9]]},"references-count":41,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2021,7]]}},"alternative-id":["data6070074"],"URL":"https:\/\/doi.org\/10.3390\/data6070074","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,9]]}}}