{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T23:46:08Z","timestamp":1776901568745,"version":"3.51.2"},"reference-count":22,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2022,5,16]],"date-time":"2022-05-16T00:00:00Z","timestamp":1652659200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001871","name":"Portuguese funding agency","doi-asserted-by":"publisher","award":["FCT DSAIPA\/DS\/0090\/2018"],"award-info":[{"award-number":["FCT DSAIPA\/DS\/0090\/2018"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computers"],"abstract":"<jats:p>Portugal has the sixth highest road fatality rate among European Union members. This is a problem of different dimensions with serious consequences in people\u2019s lives. This study analyses daily data from police and government authorities on road traffic accidents that occurred between 2016 and 2019 in a district of Portugal. This paper looks for the determinants that contribute to the existence of victims in road traffic accidents, as well as the determinants for fatalities and\/or serious injuries in accidents with victims. We use logistic regression models, and the results are compared to the machine-learning model results. For the severity model, where the response variable indicates whether only property damage or casualties resulted in the traffic accident, we used a large sample with a small imbalance. For the serious injuries model, where the response variable indicates whether or not there were victims with serious injuries and\/or fatalities in the traffic accident with victims, we used a small sample with very imbalanced data. Empirical analysis supports the conclusion that, with a small sample of imbalanced data, machine-learning models generally do not perform better than statistical models; however, they perform similarly when the sample is large and has a small imbalance.<\/jats:p>","DOI":"10.3390\/computers11050080","type":"journal-article","created":{"date-parts":[[2022,5,16]],"date-time":"2022-05-16T13:06:23Z","timestamp":1652706383000},"page":"80","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":43,"title":["Comparison of Statistical and Machine-Learning Models on Road Traffic Accident Severity Classification"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1644-9502","authenticated-orcid":false,"given":"Paulo","family":"Infante","sequence":"first","affiliation":[{"name":"CIMA, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Matematics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3292-2208","authenticated-orcid":false,"given":"Gon\u00e7alo","family":"Jacinto","sequence":"additional","affiliation":[{"name":"CIMA, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Matematics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5517-4855","authenticated-orcid":false,"given":"Anabela","family":"Afonso","sequence":"additional","affiliation":[{"name":"CIMA, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Matematics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2538-0671","authenticated-orcid":false,"given":"Leonor","family":"Rego","sequence":"additional","affiliation":[{"name":"Department of Matematics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0793-0003","authenticated-orcid":false,"given":"Vitor","family":"Nogueira","sequence":"additional","affiliation":[{"name":"Algoritmi Research Centre, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Informatics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5086-059X","authenticated-orcid":false,"given":"Paulo","family":"Quaresma","sequence":"additional","affiliation":[{"name":"Algoritmi Research Centre, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Informatics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3025-0687","authenticated-orcid":false,"given":"Jos\u00e9","family":"Saias","sequence":"additional","affiliation":[{"name":"Algoritmi Research Centre, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Informatics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9906-0358","authenticated-orcid":false,"given":"Daniel","family":"Santos","sequence":"additional","affiliation":[{"name":"Department of Informatics, ECT, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6162-0030","authenticated-orcid":false,"given":"Pedro","family":"Nogueira","sequence":"additional","affiliation":[{"name":"ICT, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Geosciences, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6702-5753","authenticated-orcid":false,"given":"Marcelo","family":"Silva","sequence":"additional","affiliation":[{"name":"ICT, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"},{"name":"Department of Geosciences, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4549-9012","authenticated-orcid":false,"given":"Rosalina Pisco","family":"Costa","sequence":"additional","affiliation":[{"name":"CICS.NOVA.UEVORA, IIFA, University of \u00c9vora, 7000-208 \u00c9vora, Portugal"},{"name":"Department of Sociology, ECS, University of \u00c9vora, 7000-803 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4162-1634","authenticated-orcid":false,"given":"Patr\u00edcia","family":"Gois","sequence":"additional","affiliation":[{"name":"Department of Visual Arts and Design, EA, University of \u00c9vora, 7000-208 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paulo Rebelo","family":"Manuel","sequence":"additional","affiliation":[{"name":"CIMA, IIFA, University of \u00c9vora, 7000-671 \u00c9vora, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,5,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.trpro.2020.10.007","article-title":"Determining passenger traffic as important factor in urban public transport system","volume":"50","author":"Belokurov","year":"2020","journal-title":"Transp. Res. Procedia"},{"key":"ref_2","unstructured":"World Health Organization (2022, January 25). Global Status Report on Road Safety 2018. Available online: https:\/\/apps.who.int\/iris\/bitstream\/handle\/10665\/276462\/9789241565684-eng.pdf?sequence=1&isAllowed=y."},{"key":"ref_3","unstructured":"Eurostat (2022, January 25). Road Accidents: Number of Fatalities Continues Falling. Available online: https:\/\/ec.europa.eu\/eurostat\/en\/web\/products-eurostat-news\/-\/ddn-20210624-1."},{"key":"ref_4","unstructured":"Lusa (2022, January 25). Sinistralidade Rodovi\u00e1ria tem Impacto Econ\u00f3mico e Social Negativo de 1.2% do PIB-Governo. Available online: https:\/\/www.rtp.pt\/noticias\/pais\/sinistralidade-rodoviaria-tem-impacto-economico-e-social-negativo-de-12-do-pib-governo_n1112193."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1666","DOI":"10.1016\/j.aap.2011.03.025","article-title":"The statistical analysis of highway crash-injury severities: A review and assessment of methodological alternatives","volume":"43","author":"Savolainen","year":"2011","journal-title":"Accid. Anal. Prev."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1016\/j.trpro.2014.10.107","article-title":"Prediction of road accident severity using the ordered probit model","volume":"3","author":"Garrido","year":"2014","journal-title":"Transp. Res. Procedia"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"60079","DOI":"10.1109\/ACCESS.2018.2874979","article-title":"Comparing prediction performance for crash injury severity among various machine learning and statistical methods","volume":"6","author":"Zhang","year":"2018","journal-title":"IEEE Access"},{"key":"ref_8","first-page":"775","article-title":"Machine learning applied to road safety modeling: A systematic literature review","volume":"7","author":"Silva","year":"2020","journal-title":"J. Traffic Transp. Eng. (Engl. Ed.)"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"408","DOI":"10.1080\/17457300.2021.1928233","article-title":"Injury severity prediction of traffic crashes with ensemble machine learning techniques: A comparative study","volume":"28","author":"Jamal","year":"2021","journal-title":"Int. J. Inj. Control Saf. Promot."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.aap.2017.08.008","article-title":"Comparison of four statistical and machine learning methods for crash severity prediction","volume":"108","author":"Iranitalab","year":"2017","journal-title":"Accid. Anal. Prev."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1016\/j.aap.2013.06.028","article-title":"Impact of pavement conditions on crash severity","volume":"59","author":"Li","year":"2013","journal-title":"Accid. Anal. Prev."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1016\/j.aap.2013.03.005","article-title":"Comparing single vehicle and multivehicle fatal road crashes: A joint analysis of road conditions, time variables and driver characteristics","volume":"60","author":"Martensen","year":"2013","journal-title":"Accid. Anal. Prev."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1016\/j.aap.2013.10.001","article-title":"Exploring the effects of roadway characteristics on the frequency and severity of head-on crashes: Case studies from Malaysian Federal Roads","volume":"62","author":"Hosseinpour","year":"2014","journal-title":"Accid. Anal. Prev."},{"key":"ref_14","first-page":"23","article-title":"A latent segmentation based generalized ordered logit model to examine factors influencing driver injury severity","volume":"1","author":"Yasmin","year":"2014","journal-title":"Anal. Methods Accid. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.jsr.2018.12.006","article-title":"Ordered logistic models of influencing factors on crash injury severity of single and multiple-vehicle downgrade crashes: A case study in Wyoming","volume":"68","author":"Rezapour","year":"2019","journal-title":"J. Saf. Res."},{"key":"ref_16","unstructured":"ANSR (2022, January 25). Manual de Prenchimento. Boletim Estat\u00edstico de Acidente de Via\u00e7\u00e3o. Available online: http:\/\/www.ansr.pt\/Estatisticas\/BEAV\/Documents\/MANUALPREENCHIMENTOBEAV.pdf."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Hosmer, D.W., Lemeshow, S., and Sturdivant, R.X. (2013). Applied Logistic Regression, John Wiley & Sons.","DOI":"10.1002\/9781118548387"},{"key":"ref_18","unstructured":"Quinlan, J.R. (1993). C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers."},{"key":"ref_19","unstructured":"Research, R. (2022, January 25). Is See5\/C5.0 Better Than C4.5?. Available online: https:\/\/rulequest.com\/see5-comparison.html."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","article-title":"Learning from Imbalanced Data","volume":"21","author":"He","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_21","unstructured":"R Core Team (2021). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Fiorentini, N., and Losa, M. (2020). Handling Imbalanced Data in Road Crash Severity Prediction by Machine Learning Algorithms. Infrastructures, 5.","DOI":"10.3390\/infrastructures5070061"}],"container-title":["Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-431X\/11\/5\/80\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:11:05Z","timestamp":1760137865000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-431X\/11\/5\/80"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,16]]},"references-count":22,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2022,5]]}},"alternative-id":["computers11050080"],"URL":"https:\/\/doi.org\/10.3390\/computers11050080","relation":{},"ISSN":["2073-431X"],"issn-type":[{"value":"2073-431X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,16]]}}}