{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,5]],"date-time":"2025-11-05T14:26:29Z","timestamp":1762352789088,"version":"build-2065373602"},"reference-count":43,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2019,5,8]],"date-time":"2019-05-08T00:00:00Z","timestamp":1557273600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Every year, academic institutions invest considerable effort and substantial resources to influence, predict and understand the decision-making choices of applicants who have been offered admission. In this study, we applied several supervised machine learning techniques to four years of data on 11,001 students, each with 35 associated features, admitted to a small liberal arts college in California to predict student college commitment decisions. By treating the question of whether a student offered admission will accept it as a binary classification problem, we implemented a number of different classifiers and then evaluated the performance of these algorithms using the metrics of accuracy, precision, recall, F-measure and area under the receiver operator curve. The results from this study indicate that the logistic regression classifier performed best in modeling the student college commitment decision problem, i.e., predicting whether a student will accept an admission offer, with an AUC score of 79.6%. The significance of this research is that it demonstrates that many institutions could use machine learning algorithms to improve the accuracy of their estimates of entering class sizes, thus allowing more optimal allocation of resources and better control over net tuition revenue.<\/jats:p>","DOI":"10.3390\/data4020065","type":"journal-article","created":{"date-parts":[[2019,5,13]],"date-time":"2019-05-13T11:00:57Z","timestamp":1557745257000},"page":"65","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":28,"title":["Predictive Models of Student College Commitment Decisions Using Machine Learning"],"prefix":"10.3390","volume":"4","author":[{"given":"Kanadpriya","family":"Basu","sequence":"first","affiliation":[{"name":"SnackNation, 3534 Hayden Avenue, Culver City, CA 90232, USA"}]},{"given":"Treena","family":"Basu","sequence":"additional","affiliation":[{"name":"Occidental College, 1600 Campus Road, Los Angeles, CA 90041, USA"}]},{"given":"Ron","family":"Buckmire","sequence":"additional","affiliation":[{"name":"Occidental College, 1600 Campus Road, Los Angeles, CA 90041, USA"}]},{"given":"Nishu","family":"Lal","sequence":"additional","affiliation":[{"name":"Occidental College, 1600 Campus Road, Los Angeles, CA 90041, USA"}]}],"member":"1968","published-online":{"date-parts":[[2019,5,8]]},"reference":[{"key":"ref_1","unstructured":"Lapovsky, L. (2018). The Changing Business Model For Colleges And Universities. Forbes, Available online: https:\/\/www.forbes.com\/sites\/lucielapovsky\/2018\/02\/06\/the-changing-business-model-for-colleges-and-universities\/#bbc03d45ed59."},{"key":"ref_2","unstructured":"(2018, December 15). The Higher Education Business Model, Innovation and Financial Sustainability. Available online: https:\/\/www.tiaa.org\/public\/pdf\/higher-education-business-model.pdf."},{"key":"ref_3","unstructured":"Occidental College (2018, December 15). Available online: https:\/\/www.oxy.edu."},{"key":"ref_4","unstructured":"Occidental College Office of Financial Aid (2018, December 15). Available online: https:\/\/www.oxy.edu\/admission-aid\/costs-financial-aid."},{"key":"ref_5","unstructured":"(2018, December 15). Tuition Discounting. Available online: https:\/\/www.agb.org\/briefs\/tuition-discounting."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1002\/he.283","article-title":"Fixing the net tuition revenue dilemma: The Dickinson College story","volume":"140","author":"Massa","year":"2007","journal-title":"New Dir. High. Educ."},{"key":"ref_7","unstructured":"Hossler, D., and Bean, J.P. (1990). The Strategic Management of College Enrollments, Jossey Bass. [1st ed.]."},{"key":"ref_8","unstructured":"G\u00e9ron, A. (2017). Hands-On Machine Learning with Scikit-Learn & Tensor Flow, O\u2019Reilly. [1st ed.]."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Hastie, T., Tibshirani, R., and Friedman, J. (2009). The Elements of Statistical Learning, Data Mining, Inference and Prediction, Springer. [2nd ed.].","DOI":"10.1007\/978-0-387-84858-7"},{"key":"ref_10","unstructured":"Grus, J. (2015). Data Science From Scratch First Principles with Python, O\u2019Reilly. [1st ed.]."},{"key":"ref_11","first-page":"249","article-title":"Supervised Machine Learning: A Review of Classification Techniques","volume":"4","author":"Kotsiantis","year":"2007","journal-title":"Informatica"},{"key":"ref_12","unstructured":"Alpaydin, E. (2010). Introduction to Machine Learning, MIT Press. [3rd ed.]."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Kuncheva, L.I. (2014). Combining Pattern Classifiers: Methods and Algorithms, John Wiley & Sons, Inc.. [2nd ed.].","DOI":"10.1002\/9781118914564"},{"key":"ref_14","unstructured":"Occidental College Office of Admissions (2018, December 15). Available online: https:\/\/www.oxy.edu\/admission-aid."},{"key":"ref_15","unstructured":"(2018, December 15). Journal of Educational Data Mining. Available online: http:\/\/jedm.educationaldatamining.org\/index.php\/JEDM."},{"key":"ref_16","unstructured":"(2018, December 15). Educational Data Mining Conference 2018. Available online: http:\/\/educationaldatamining.org\/EDM2018\/."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/j.eswa.2006.04.005","article-title":"Educational data mining: A survey from 1995 to 2005","volume":"33","author":"Romero","year":"2007","journal-title":"Expert Syst. Appl."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1432","DOI":"10.1016\/j.eswa.2013.08.042","article-title":"Educational data mining: A survey and a data mining-based analysis of recent works","volume":"41","year":"2014","journal-title":"Expert Syst. Appl."},{"key":"ref_19","unstructured":"Waters, A., and Miikkulainen, R. (2013, January 14\u201318). GRADE: Machine Learning Support for Graduate Admissions. Proceedings of the Twenty-Fifth Conference on Innovative Applications of Artificial Intelligence, Bellevue, WA, USA. Available online: http:\/\/www.cs.utexas.edu\/users\/ai-lab\/downloadPublication.php?filename=http:\/\/www.cs.utexas.edu\/users\/nn\/downloads\/papers\/waters.iaai13.pdf&pubid=127269."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"118","DOI":"10.2478\/eurodl-2014-0008","article-title":"Predicting Dropout Student: An Application of Data Mining Methods in an Online Education Program","volume":"17","author":"Yukselturk","year":"2014","journal-title":"Eur. J. Open Distance E-Learn."},{"key":"ref_21","unstructured":"Tampakas, V., Livieris, I.E., Pintelas, E., Karacapilidis, N., and Pintelas, P. (2018, January 20\u201322). Prediction of students\u2019 graduation time using a two-level classification algorithm. Proceedings of the 1st International Conference on Technology and Innovation in Learning, Teaching and Education (TECH-EDU 2018), Thessaloniki, Greece."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Livieris, I.E., Kotsilieris, T., Tampakas, V., and Pintelas, P. (2018). Improving the evaluation process of students\u2019 performance utilizing a decision support software. Neural Comput. Appl.","DOI":"10.1007\/s00521-018-3756-y"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/978-3-319-65172-9_6","article-title":"DSS-PSP-a decision support software for evaluating students\u2019 performance","volume":"744","author":"Livieris","year":"2017","journal-title":"Eng. Appl. Neural Netw. (EANN)"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Duzhin, F., and Gustafsson, A. (2018). Machine Learning-Based App for Self-Evaluation of Teacher-Specific Instructional Style and Tools. Educ. Sci., 8.","DOI":"10.3390\/educsci8010007"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1002\/ir.187","article-title":"Applying Data Mining to Predict College Admissions Yield: A Case Study","volume":"131","author":"Chang","year":"2006","journal-title":"New Dir. Institutional Res."},{"key":"ref_26","unstructured":"Powell, F. (2018, December 15). Universities, Colleges Where Students Are Eager to Enroll. Available online: https:\/\/www.usnews.com\/education\/best-colleges\/articles\/2018-01-23\/universities-colleges-where-students-are-eager-to-enroll."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10115-007-0114-2","article-title":"Top 10 algorithms in data mining","volume":"14","author":"Wu","year":"2008","journal-title":"Knowl. Inf. Syst."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"429","DOI":"10.3233\/IDA-2002-6504","article-title":"The class imbalance problem: A systematic study","volume":"6","author":"Japkowicz","year":"2002","journal-title":"Intell. Data Anal."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013). An Introduction to Statistical Learning with Applications in R, Springer. [7th ed.].","DOI":"10.1007\/978-1-4614-7138-7"},{"key":"ref_30","unstructured":"Brink, H., Richards, J., and Fetherolf, M. (2017). Real World Machine Learning, Manning. [1st ed.]. Available online: https:\/\/www.manning.com\/books\/real-world-machine-learning."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Rao, R.B., and Fung, G. (2008, January 24\u201326). On the Dangers of Cross-Validation: An Experimental Evaluation. Proceedings of the 2008 International Conference on Data Mining, Atlanta, GA, USA. Available online: https:\/\/doi.org\/10.1137\/1.9781611972788.54.","DOI":"10.1137\/1.9781611972788.54"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1111\/j.1600-0587.2012.07348.x","article-title":"Collinearity: A review of methods to deal with it and a simulation study evaluating their performance","volume":"36","author":"Dormann","year":"2012","journal-title":"Ecography"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1007\/s41237-017-0024-x","article-title":"A note on large-scale logistic prediction, using an approximate graphical model to deal with collinearity and missing data","volume":"44","author":"Maarsman","year":"2017","journal-title":"Behaviometrika"},{"key":"ref_34","unstructured":"Peck, R., Olsen, C., and Devore, J. (2016). Statistics and Data Analysis, Cengage. [5th ed.]."},{"key":"ref_35","unstructured":"(2018, December 15). Scikit-learn, Machine Learning in Python. Available online: http:\/\/scikit-learn.org\/stable\/."},{"key":"ref_36","unstructured":"(2018, December 15). Python 3.0. Available online: https:\/\/www.python.org."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Schapire, R.E. (2003). The Boosting Approach to Machine Learning: An Overview, Springer. [1st ed.].","DOI":"10.1007\/978-0-387-21579-2_9"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Godsey, B. (2017). Think Like a Data Scientist, Manning. [1st ed.].","DOI":"10.12968\/prtu.2017.71.3"},{"key":"ref_39","unstructured":"Cielen, D., Meysman, A., and Ali, M. (2016). Introducing Data Science, Manning. [1st ed.]. Available online: https:\/\/www.manning.com\/books\/introducing-data-science."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: Synthetic Minority Over-sampling Technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res."},{"key":"ref_41","unstructured":"(2018, December 15). IBM SPSS Software. Available online: https:\/\/www.ibm.com\/analytics\/spss-statistics-software."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Karimi, H.A., and Karimi, B. (2017). Geospatial Data Science Techniques and Applications, CRC Press. [1st ed.].","DOI":"10.1201\/b22052"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Millar, R.B. (2011). Maximum Likelihood Estimation and Inference: With Examples in R, SAS and ADMB, Wiley. [1st ed.].","DOI":"10.1002\/9780470094846"}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/4\/2\/65\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:50:02Z","timestamp":1760187002000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/4\/2\/65"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,5,8]]},"references-count":43,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2019,6]]}},"alternative-id":["data4020065"],"URL":"https:\/\/doi.org\/10.3390\/data4020065","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2019,5,8]]}}}