{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T22:06:33Z","timestamp":1768341993168,"version":"3.49.0"},"reference-count":57,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2022,1,25]],"date-time":"2022-01-25T00:00:00Z","timestamp":1643068800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Conventional approaches to modelling driver risk have incorporated measures such as driver gender, age, place of residence, vehicle model, and annual miles driven. However, in the last decade, research has shown that assessing a driver\u2019s crash risk based on these variables does not go far enough\u2014especially as advanced technology changes today\u2019s vehicles, as well as the role and behavior of the driver. There is growing recognition that actual driver usage patterns and driving behavior, when it can be properly captured in modelling risk, offers higher accuracy and more individually tailored projections. However, several challenges make this difficult. These challenges include accessing the right types of data, dealing with high-dimensional data, and identifying the underlying structure of the variance in driving behavior. There is also the challenge of how to identify key variables for detecting and predicting risk, and how to combine them in predictive algorithms. This paper proposes a systematic feature extraction and selection framework for building Comprehensive Driver Profiles that serves as a foundation for driver behavior analysis and building whole driver profiles. Features are extracted from raw data using statistical feature extraction techniques, and a hybrid feature selection algorithm is used to select the best driver profile feature set based on outcomes of interest such as crash risk. It can give rise to individualized detection and prediction of risk, and can also be used to identify types of drivers who exhibit similar patterns of driving and vehicle\/technology usage. The developed framework is applied to a naturalistic driving dataset\u2014NEST, derived from the larger SHRP2 naturalistic driving study to illustrate the types of information about driver behavior that can be harnessed\u2014as well as some of the important applications that can be derived from it.<\/jats:p>","DOI":"10.3390\/info13020061","type":"journal-article","created":{"date-parts":[[2022,1,25]],"date-time":"2022-01-25T20:40:18Z","timestamp":1643143218000},"page":"61","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["A Framework for Building Comprehensive Driver Profiles"],"prefix":"10.3390","volume":"13","author":[{"given":"Rashmi P.","family":"Payyanadan","sequence":"first","affiliation":[{"name":"Touchstone Evaluations, Inc., Detroit, MI 48202, USA"}]},{"given":"Linda S.","family":"Angell","sequence":"additional","affiliation":[{"name":"Touchstone Evaluations, Inc., Detroit, MI 48202, USA"}]}],"member":"1968","published-online":{"date-parts":[[2022,1,25]]},"reference":[{"key":"ref_1","first-page":"528","article-title":"Modelling Driver Behaviour: A Rationale for Multivariate Statistics","volume":"3","author":"Happee","year":"2012","journal-title":"Theor. Issues Ergon. Sci."},{"key":"ref_2","unstructured":"Abdelrahman, A. (2019). Driver Behavior Modelling and Risk Profiling Using Large-Scale Naturalistic Driving Data. [Ph.D. Thesis, Queen\u2019s University]."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1248","DOI":"10.1177\/0018720815591313","article-title":"A Review of Research on Driving Styles and Road Safety","volume":"57","author":"Sagberg","year":"2015","journal-title":"Hum. Factors J. Hum. Factors Ergon. Soc."},{"key":"ref_4","first-page":"101","article-title":"Conform\u2014A Visualization Tool and Method to Classify Driving Styles in Context of Highly Automated Driving","volume":"30","author":"Griesche","year":"2014","journal-title":"VDI\/VW Gem. Fahr. Integr. Sich."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1016\/j.trc.2008.01.001","article-title":"In-vehicle data recorders for monitoring and feedback on drivers\u2019 behavior","volume":"16","author":"Toledo","year":"2008","journal-title":"Transp. Res. Part C Emerg. Technol."},{"key":"ref_6","unstructured":"Jun, J. (2006). Potential Crash Measures Based On Gps-Observed Driving Behavior Activity Metrics. [Ph.D. Thesis, Georgia Institute of Technology]."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1359","DOI":"10.1016\/j.aap.2011.02.009","article-title":"Jerky Driving\u2014An Indicator of Accident Proneness?","volume":"43","author":"Bagdadi","year":"2011","journal-title":"Accid. Anal. Prev."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.aap.2015.01.009","article-title":"Driver behaviour profiles for road safety analysis","volume":"76","author":"Ellison","year":"2015","journal-title":"Accid. Anal. Prev."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"i9","DOI":"10.1136\/ip.2006.011874","article-title":"Influences on youthful driving behavior and their potential for guiding interventions to reduce crashes","volume":"12","author":"Shope","year":"2006","journal-title":"Inj. Prev."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/j.ypmed.2004.04.013","article-title":"Antecedents and Outcomes of Marijuana Use Initiation during Adolescence","volume":"39","author":"Ellickson","year":"2004","journal-title":"Prev. Med."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"472","DOI":"10.1016\/j.aap.2005.11.003","article-title":"Risky driving and lifestyles in adolescence","volume":"38","author":"Bina","year":"2005","journal-title":"Accid. Anal. Prev."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1518\/hfes.46.4.640.56806","article-title":"Profiles in Driver Distraction: Effects of Cell Phone Conversations on Younger and Older Drivers","volume":"46","author":"Strayer","year":"2004","journal-title":"Hum. Factors J. Hum. Factors Ergon. Soc."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"J\u00fanior, J.F., Carvalho, E., Ferreira, B.V., de Souza, C., Suhara, Y., Pentland, A., and Pessin, G. (2017). Driver Behavior Profiling: An Investigation with Different Smartphone Sensors and Machine Learning. PLoS ONE, 12.","DOI":"10.1371\/journal.pone.0174959"},{"key":"ref_14","unstructured":"Chang, W. (2011). Research on Driving Intention Identification Based on Hidden Markov Model. [Master\u2019s Thesis, Jilin University]."},{"key":"ref_15","first-page":"3149","article-title":"Moving Object Detection Algorithm Based on Gaussian Mixture Model","volume":"33","author":"Mei","year":"2012","journal-title":"Comput. Eng. Des."},{"key":"ref_16","unstructured":"Tram\u00e8r, F., Zhang, F., Juels, A., Reiter, M.K., and Ristenpart, T. (2016). Stealing Machine Learning Models via Prediction Apis. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1109\/MITS.2014.2328673","article-title":"Driver Behavior Profiling Using Smartphones: A Low-Cost Platform for Driver Monitoring","volume":"7","author":"Castignani","year":"2015","journal-title":"IEEE Intell. Transp. Syst. Mag."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Owens, J.M., Angell, L., Hankey, J.M., Foley, J., and Ebe, K. (2015). Creation of the Naturalistic Engagement in Secondary Tasks (NEST) distracted driving dataset. J. Saf. Res., 54.","DOI":"10.1016\/j.jsr.2015.07.001"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Antin, J., Lee, S., Hankey, J., and Dingus, T. (2011). Design of the In-Vehicle Driving Behavior and Crash Risk Study. Support of the SHRP 2 Naturalistic Driving Study, Transportation Research Board. Strategic Highway Research Program Safety Focus Area.","DOI":"10.17226\/14494"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1073\/pnas.1513271113","article-title":"Replication Data for: Driver crash risk factors and prevalence evaluation using naturalistic driving data","volume":"113","author":"Dingus","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.sna.2007.02.012","article-title":"Principal component analysis and artificial neural network based approach to analysing optical fibre sensors signals","volume":"136","author":"Lewis","year":"2007","journal-title":"Sens. Actuators A Phys."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"106839","DOI":"10.1016\/j.csda.2019.106839","article-title":"Benchmark for filter methods for feature selection in high-dimensional classification data","volume":"143","author":"Bommert","year":"2019","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1023\/A:1010920819831","article-title":"A Simple Generalisation of the Area Under the Roc Curve for Multiple Class Classification Problems","volume":"45","author":"Hand","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Anukrishna, P.R., and Paul, V. (2017, January 19\u201320). A Review on Feature Selection for High Dimensional Data. Proceedings of the 2017 International Conference on Inventive Systems and Control (ICISC), Coimbatore, India.","DOI":"10.1109\/ICISC.2017.8068746"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). Xgboost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM Sigkdd International Conference On Knowledge Discovery and Data Mining, San Francisco, CA, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_26","first-page":"18","article-title":"Classification and Regression By Randomforest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_28","unstructured":"Breiman, L., and Cutler, A. (2021, February 06). Random Forest. Available online: https:\/\/www.Stat.Berkeley.Edu\/~Breiman\/Randomforests\/."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1080\/10618600.2014.983641","article-title":"Case-Specific Random Forests","volume":"25","author":"Xu","year":"2016","journal-title":"J. Comput. Graph. Stat."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Hastie, T.J. (2017). Generalized Additive Models, Routledge.","DOI":"10.1201\/9780203753781"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1111\/j.2044-8317.1992.tb00992.x","article-title":"Backward, forward and stepwise automated subset selection algorithms: Frequency of obtaining authentic and noise variables","volume":"45","author":"Derksen","year":"1992","journal-title":"Br. J. Math. Stat. Psychol."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1111\/j.0006-341X.2001.00120.x","article-title":"Akaike\u2019s Information Criterion in Generalized Estimating Equations","volume":"57","author":"Pan","year":"2001","journal-title":"Biometrics"},{"key":"ref_33","unstructured":"Weisstein, E.W. (2021, May 23). Bonferroni Correction. Available online: https:\/\/mathworld.wolfram.com\/."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Charrad, M., Ghazzali, N., Boiteau, V., and Niknafs, A. (2021, March 04). Nbclust: An R Package for Determining the Relevant Number of Clusters In A Data Set. Ournal of Statistical Software 2014. Available online: Http:\/\/Www.Jstatsoft.Org\/V61\/I06\/.","DOI":"10.18637\/jss.v061.i06"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1497","DOI":"10.1016\/j.aap.2011.02.030","article-title":"Speed maintenance under cognitive load\u2014Implications for theories of driver behaviour","volume":"43","author":"Brookhuis","year":"2011","journal-title":"Accid. Anal. Prev."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Abney, S. (2002, January 6\u201312). Bootstrapping. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadephia, PA, USA.","DOI":"10.3115\/1073083.1073143"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Angkititrakul, P., Ryuta, T., Wakita, T., Takeda, K., Miyajima, C., and Suzuki, T. (2009, January 11\u201312). Evaluation Of Driver-Behavior Models In Real-World Car-Following Task. Proceedings of the 2009 IEEE International Conference on Vehicular Electronics and Safety (ICVES), Pune, India.","DOI":"10.1109\/ICVES.2009.5400201"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1016\/S1361-9209(00)00003-1","article-title":"Variability in urban driving patterns","volume":"5","author":"Ericsson","year":"2000","journal-title":"Transp. Res. Part D Transp. Environ."},{"key":"ref_39","unstructured":"Dong, W., Li, J., Yao, R., Li, C., Yuan, T., and Wang, L. (2016). Characterizing Driving Styles with Deep Learning. arXiv."},{"key":"ref_40","first-page":"569109","article-title":"An Overview on Study of Identification of Driver Behavior Characteristics for Automotive Control","volume":"2014","author":"Lin","year":"2014","journal-title":"Math. Probl. Eng."},{"key":"ref_41","first-page":"204","article-title":"Improving The Explainability of Random Forest Classifier-User Centered Approach","volume":"23","author":"Petkovic","year":"2018","journal-title":"Pac. Symp. Biocomput."},{"key":"ref_42","unstructured":"Watsford, R. (2008, January 9\u201313). The Success of the \u2019pinkie\u2019 Campaign-Speeding. No One Thinks Big of You: A New Approach to Road Safety Marketing. Proceedings of the Australasian College of Road Safety Conference, Washington, DC, USA."},{"key":"ref_43","first-page":"27","article-title":"Road Safety Advertising And Social Marketing","volume":"22","author":"Faulks","year":"2011","journal-title":"J. Australas. Coll. Road Saf."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1016\/S0925-7535(03)00045-6","article-title":"Accident prevention. Presentation of a model placing emphasis on human, structural and cultural factors","volume":"42","author":"Lund","year":"2004","journal-title":"Saf. Sci."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1016\/j.aap.2011.10.001","article-title":"The Effects of On-Street Parking and Road Environment Visual Complexity on Travel Speed and Reaction Time","volume":"45","author":"Edquist","year":"2012","journal-title":"Accid. Anal. Prev."},{"key":"ref_46","unstructured":"World Health Organization (2015). Global Status Report on Road Safety, World Health Organization."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1016\/j.injury.2004.10.001","article-title":"Improvements In Passive Car Safety Led To Decreased Injury Severity\u2013A Comparison Between the 1970s and 1990s","volume":"36","author":"Richter","year":"2005","journal-title":"Injury"},{"key":"ref_48","unstructured":"Ellison, A.B. (2013). Evaluating Changes in Driver Behaviour for Road Safety Outcomes: A Risk Profiling Approach. [Ph.D. Thesis, The University of Sydney]."},{"key":"ref_49","unstructured":"SAE (2016). Surface Vehicle Recommended Practice, SAE."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1518\/hfes.46.1.50.30392","article-title":"Trust in Automation: Designing for Appropriate Reliance","volume":"46","author":"Lee","year":"2004","journal-title":"Hum. Factors"},{"key":"ref_51","unstructured":"Pyta, V., and Mctiernan, D. (September, January 31). Development of A Model for Improving Safety in School Zones. Proceedings of the Australasian Road Safety Research, Policing and Education Conference, Monash University, Melbourne, Australia."},{"key":"ref_52","unstructured":"Litman, T. (2007). Distance-Based Vehicle Insurance Feasibility, Costs and Benefits, Victoria Transport Policy Institute."},{"key":"ref_53","unstructured":"Ong, P.M., and Stoll, M.A. (2008). Why Do Inner City Residents Pay Higher Premiums? The Determinants of Automobile Insurance Premiums, School of Public Affairs."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1109\/MITS.2014.2343262","article-title":"Insurance Telematics: Opportunities and Challenges with the Smartphone Solution","volume":"6","author":"Handel","year":"2014","journal-title":"IEEE Intell. Transp. Syst. Mag."},{"key":"ref_55","unstructured":"Regan, M.A., Prabhakharan, P., Wallace, P., Cunningham, M.L., and Bennett, J.M. (2020). Education and Training for Drivers of Assisted and Automated Vehicles, ARRB Group Limited."},{"key":"ref_56","unstructured":"NHTSA (2016). Federal Automated Vehicles Policy\u2013Accelerating the Next Revolution in Roadway Safety."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Klauer, S.G., Dingus, T.A., Neale, V.L., Sudweeks, J.D., and Ramsey, D.J. (2009). Comparing Real-World Behaviors of Drivers With High versus Low Rates of Crashes and Near-Crashes.","DOI":"10.1037\/e733112011-001"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/2\/61\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:07:21Z","timestamp":1760134041000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/13\/2\/61"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,25]]},"references-count":57,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2022,2]]}},"alternative-id":["info13020061"],"URL":"https:\/\/doi.org\/10.3390\/info13020061","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,25]]}}}