{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T21:00:08Z","timestamp":1779224408651,"version":"3.51.4"},"reference-count":69,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2022,4,20]],"date-time":"2022-04-20T00:00:00Z","timestamp":1650412800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,4,20]],"date-time":"2022-04-20T00:00:00Z","timestamp":1650412800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["NCSML award"],"award-info":[{"award-number":["NCSML award"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["SN COMPUT. SCI."],"published-print":{"date-parts":[[2022,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>People diagnosed with Parkinson\u2019s (PwP) exhibit a diverse manifestation of heterogeneous symptoms which likely reflect different subtypes. However, there is no widely accepted consensus on the criteria for subtype membership assignment. We explored clustering PwP using a data-driven approach mining speech signals. We used data from the three English-speaking cohorts (Boston, Oxford, Toronto) in the Parkinson\u2019s Voice Initiative (PVI), where speech and basic demographic information were collected over the standard telephone network. We acoustically characterized 2097 sustained vowel \/a\/ recordings from 1138 PwP (Boston cohort) using 307 dysphonia measures. We applied unsupervised feature selection to select a concise subset of the dysphonia measures and hierarchical clustering combined with 2D-data projections using t-distributed stochastic neighbor embedding (t-SNE) to facilitate visual exploration of PwP groups. We assessed cluster validity and consistency using silhouette plots and the cophenetic correlation coefficient. We externally validated cluster findings on the Oxford and Toronto PVI cohorts (<jats:italic>n<\/jats:italic>\u2009=\u2009285 and 107 participants, respectively). We selected 21 dysphonia measures and found four main clusters which provide tentative insights into different dominating speech-associated characteristics (cophenetic coefficient\u2009=\u20090.72, silhouette score\u2009=\u20090.67). The cluster findings were consistent across the three PVI cohorts, strongly supporting the generalization of the presented methodology towards PwP subtype assignment, and were independently visually verified in 2D projections with t-SNE. The presented methodology with mining sustained vowels and clustering may provide an objective and streamlined approach towards informing PwP subtype assignment. This may have important implications towards developing more personalized clinical management of symptoms for PwP.<\/jats:p>","DOI":"10.1007\/s42979-022-01123-y","type":"journal-article","created":{"date-parts":[[2022,4,20]],"date-time":"2022-04-20T11:03:42Z","timestamp":1650452622000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Data-Driven Subtyping of Parkinson\u2019s Using Acoustic Analysis of Sustained Vowels and Cluster Analysis: Findings in the Parkinson\u2019s Voice Initiative Study"],"prefix":"10.1007","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0994-8100","authenticated-orcid":false,"given":"Athanasios","family":"Tsanas","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6499-6941","authenticated-orcid":false,"given":"Siddharth","family":"Arora","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,4,20]]},"reference":[{"issue":"11","key":"1123_CR1","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1016\/S1474-4422(18)30295-3","volume":"17","author":"ER Dorsey","year":"2018","unstructured":"Dorsey ER, et al. Global, regional, and national burden of Parkinson\u2019s disease, 1990\u20132016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 2018;17(11):939\u201353. https:\/\/doi.org\/10.1016\/S1474-4422(18)30295-3.","journal-title":"Lancet Neurol"},{"issue":"2","key":"1123_CR2","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1001\/jamaneurol.2020.4152","volume":"78","author":"VL Feigin","year":"2021","unstructured":"Feigin VL, et al. Burden of neurological disorders across the US from 1990\u20132017: a global burden of disease study. JAMA Neurol. 2021;78(2):165\u201376. https:\/\/doi.org\/10.1001\/jamaneurol.2020.4152.","journal-title":"JAMA Neurol"},{"key":"1123_CR3","doi-asserted-by":"publisher","first-page":"2284","DOI":"10.1016\/S0140-6736(21)00218-X","volume":"12","author":"BR Bloem","year":"2021","unstructured":"Bloem BR, Okun MS, Klein C. Parkinson\u2019s disease. Lancet. 2021;12:2284\u2013303. https:\/\/doi.org\/10.1016\/S0140-6736(21)00218-X.","journal-title":"Lancet"},{"issue":"5","key":"1123_CR4","doi-asserted-by":"publisher","first-page":"462","DOI":"10.1016\/S1474-4422(19)30397-7","volume":"19","author":"E Warmerdam","year":"2020","unstructured":"Warmerdam E, et al. Long-term unsupervised mobility assessment in movement disorders. Lancet Neurol. 2020;19(5):462\u201370. https:\/\/doi.org\/10.1016\/S1474-4422(19)30397-7.","journal-title":"Lancet Neurol"},{"key":"1123_CR5","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1186\/s12883-017-0904-4","volume":"17","author":"G McLean","year":"2017","unstructured":"McLean G, Hindle JV, Guthrie B, Mercer SW. Co-morbidity and polypharmacy in Parkinson\u2019s disease: Insights from a large Scottish primary care database. BMC Neurol. 2017;17:126. https:\/\/doi.org\/10.1186\/s12883-017-0904-4.","journal-title":"BMC Neurol"},{"issue":"8","key":"1123_CR6","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1001\/jamaneurol.2015.0703","volume":"72","author":"SM Fereshtehnejad","year":"2015","unstructured":"Fereshtehnejad SM, Romenets SR, Anang JBM, Latreille V, Gagnon JF, Postuma RB. New clinical subtypes of Parkinson disease and their longitudinal progression a prospective cohort comparison with other phenotypes. JAMA Neurol. 2015;72(8):863\u201373. https:\/\/doi.org\/10.1001\/jamaneurol.2015.0703.","journal-title":"JAMA Neurol"},{"issue":"4","key":"1123_CR7","doi-asserted-by":"publisher","first-page":"469","DOI":"10.1002\/ana.24261","volume":"76","author":"B-A Sieber","year":"2014","unstructured":"Sieber B-A, et al. Prioritized Research Recommendations from the National Institute of Neurological Disorders and Stroke Parkinson\u2019s Disease 2014 Conference. Ann Neurol. 2014;76(4):469\u201372.","journal-title":"Ann Neurol"},{"issue":"8","key":"1123_CR8","doi-asserted-by":"publisher","first-page":"969","DOI":"10.1002\/mds.23116","volume":"25","author":"SM van Rooden","year":"2010","unstructured":"van Rooden SM, Heiser WJ, Kok JN, Verbaan D, van Hilten JJ, Marinus J. The identification of Parkinson\u2019s disease subtypes using cluster analysis: a systematic review. Mov Disord. 2010;25(8):969\u201378. https:\/\/doi.org\/10.1002\/mds.23116.","journal-title":"Mov Disord"},{"issue":"4","key":"1123_CR9","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1136\/jnnp-2012-303455","volume":"84","author":"C Marras","year":"2013","unstructured":"Marras C, Lang A. Parkinson\u2019s disease subtypes: lost in translation? J Neurol Neurosurg Psychiatry. 2013;84(4):409\u201315. https:\/\/doi.org\/10.1136\/jnnp-2012-303455.","journal-title":"J Neurol Neurosurg Psychiatry"},{"issue":"11","key":"1123_CR10","doi-asserted-by":"publisher","first-page":"2947","DOI":"10.1093\/brain\/awp234","volume":"132","author":"M Selikhova","year":"2009","unstructured":"Selikhova M, Williams DR, Kempster PA, Holton JL, Revesz T, Lees AJ. A clinico-pathological study of subtypes in Parkinson\u2019s disease. Brain. 2009;132(11):2947\u201357. https:\/\/doi.org\/10.1093\/brain\/awp234.","journal-title":"Brain"},{"issue":"2","key":"1123_CR11","doi-asserted-by":"publisher","first-page":"269","DOI":"10.3233\/JPD-140523","volume":"5","author":"M Lawton","year":"2015","unstructured":"Lawton M, et al. Parkinson\u2019s disease subtypes in the Oxford Parkinson disease centre (OPDC) discovery cohort. J Parkinsons Dis. 2015;5(2):269\u201379. https:\/\/doi.org\/10.3233\/JPD-140523.","journal-title":"J Parkinsons Dis"},{"issue":"12","key":"1123_CR12","doi-asserted-by":"publisher","first-page":"1279","DOI":"10.1136\/jnnp-2018-318337","volume":"89","author":"M Lawton","year":"2018","unstructured":"Lawton M, et al. Developing and validating Parkinson\u2019s disease subtypes and their motor and cognitive progression. J Neurol Neurosurg Psychiatry. 2018;89(12):1279\u201387. https:\/\/doi.org\/10.1136\/jnnp-2018-318337.","journal-title":"J Neurol Neurosurg Psychiatry"},{"issue":"7","key":"1123_CR13","doi-asserted-by":"publisher","first-page":"1567","DOI":"10.14336\/AD.2021.0519","volume":"12","author":"RM Hendricks","year":"2021","unstructured":"Hendricks RM, Khasawneh MT. A systematic review of Parkinson\u2019s disease cluster analysis research. Aging Dis. 2021;12(7):1567\u201386. https:\/\/doi.org\/10.14336\/AD.2021.0519.","journal-title":"Aging Dis"},{"key":"1123_CR14","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1038\/s41531-021-00228-0","volume":"7","author":"M Brendel","year":"2021","unstructured":"Brendel M, Su C, Hou Y, Henchcliffe C, Wang F. \u201cComprehensive subtyping of Parkinson\u2019s disease patients with similarity fusion: a case study with BioFIND data. npj Park Dis. 2021;7:83. https:\/\/doi.org\/10.1038\/s41531-021-00228-0.","journal-title":"npj Park Dis"},{"key":"1123_CR15","doi-asserted-by":"publisher","first-page":"23645","DOI":"10.1038\/s41598-021-03118-w","volume":"11","author":"F Rodriguez-Sanchez","year":"2021","unstructured":"Rodriguez-Sanchez F, et al. Identifying Parkinson\u2019s disease subtypes with motor and non-motor symptoms via model-based multi-partition clustering. Sci Rep. 2021;11:23645. https:\/\/doi.org\/10.1038\/s41598-021-03118-w.","journal-title":"Sci Rep"},{"issue":"1","key":"1123_CR16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12984-019-0548-2","volume":"16","author":"A Nguyen","year":"2019","unstructured":"Nguyen A, et al. Development and clinical validation of inertial sensor-based gait-clustering methods in Parkinson\u2019s disease. J Neuroeng Rehabil. 2019;16(1):1\u201314. https:\/\/doi.org\/10.1186\/s12984-019-0548-2.","journal-title":"J Neuroeng Rehabil"},{"key":"1123_CR17","doi-asserted-by":"publisher","unstructured":"Tsanas A. New insights into Parkinson\u2019s disease through statistical analysis of standard clinical scales quantifying symptom severity. In: 41st IEEE Engineering in Medicine and Biology Conference, 2019;3412\u20133415, https:\/\/doi.org\/10.1109\/EMBC.2019.8856559.","DOI":"10.1109\/EMBC.2019.8856559"},{"key":"1123_CR18","unstructured":"Tsanas A. Accurate telemonitoring of Parkinson\u2019s disease using nonlinear speech signal processing and statistical machine learning. PhD thesis, University of Oxford; 2012."},{"issue":"5","key":"1123_CR19","doi-asserted-by":"publisher","first-page":"1264","DOI":"10.1109\/TBME.2012.2183367","volume":"59","author":"A Tsanas","year":"2012","unstructured":"Tsanas A, et al. Novel speech signal processing algorithms for high-accuracy classification of Parkinson\u2019s disease. IEEE Trans Biomed Eng. 2012;59(5):1264\u201371. https:\/\/doi.org\/10.1109\/TBME.2012.2183367.","journal-title":"IEEE Trans Biomed Eng"},{"issue":"59","key":"1123_CR20","doi-asserted-by":"publisher","first-page":"842","DOI":"10.1098\/rsif.2010.0456","volume":"8","author":"A Tsanas","year":"2011","unstructured":"Tsanas A, Little MA, McSharry PE, Ramig LO. Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson\u2019s disease symptom severity. J R Soc Interface. 2011;8(59):842\u201355. https:\/\/doi.org\/10.1098\/rsif.2010.0456.","journal-title":"J R Soc Interface"},{"key":"1123_CR21","doi-asserted-by":"publisher","first-page":"11024","DOI":"10.1109\/ACCESS.2021.3050524","volume":"9","author":"A Tsanas","year":"2021","unstructured":"Tsanas A, Little MA, Ramig LO. Remote assessment of Parkinson\u2019s disease symptom severity using the simulated cellular mobile telephone network. IEEE Access. 2021;9:11024\u201336. https:\/\/doi.org\/10.1109\/ACCESS.2021.3050524.","journal-title":"IEEE Access"},{"key":"1123_CR22","unstructured":"Tsanas A, Little MA, McSharry PE, Ramig LO. New nonlinear markers and insights into speech signal degradation for effective tracking of Parkinson\u2019s disease symptom severity. In: International symposium on nonlinear theory and its applications (NOLTA), 2010, September, pp. 457\u2013460."},{"issue":"1","key":"1123_CR23","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1109\/TNSRE.2013.2293575","volume":"22","author":"A Tsanas","year":"2014","unstructured":"Tsanas A, Little MA, Fox C, Ramig LO. Objective automatic assessment of rehabilitative speech treatment in Parkinson\u2019s disease. IEEE Trans Neural Syst Rehabil Eng. 2014;22(1):181\u201390. https:\/\/doi.org\/10.1109\/TNSRE.2013.2293575.","journal-title":"IEEE Trans Neural Syst Rehabil Eng"},{"issue":"4","key":"1123_CR24","doi-asserted-by":"publisher","first-page":"503","DOI":"10.3233\/JPD-181389","volume":"8","author":"S Arora","year":"2018","unstructured":"Arora S, et al. Investigating voice as a biomarker for leucine-rich repeat kinase 2-associated Parkinson\u2019s disease. J Parkinson\u2019s Dis. 2018;8(4):503\u201310. https:\/\/doi.org\/10.3233\/JPD-181389.","journal-title":"J Parkinson\u2019s Dis"},{"key":"1123_CR25","doi-asserted-by":"publisher","first-page":"44813","DOI":"10.1109\/ACCESS.2021.3057715","volume":"9","author":"S Arora","year":"2021","unstructured":"Arora S, Lo C, Hu M, Tsanas A. Smartphone speech testing for symptom assessment in rapid eye movement sleep behavior disorder and Parkinson\u2019s disease. IEEE Access. 2021;9:44813\u201324. https:\/\/doi.org\/10.1109\/ACCESS.2021.3057715.","journal-title":"IEEE Access"},{"key":"1123_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2021.102422","volume":"66","author":"A G\u00f3mez","year":"2021","unstructured":"G\u00f3mez A, Tsanas A, G\u00f3mez P, Palacios-Alonso D, Rodellar V, \u00c1lvarez A. Acoustic to kinematic projection in Parkinson\u2019s disease dysarthria. Biomed Signal Process Control. 2021;66: e102422. https:\/\/doi.org\/10.1016\/j.bspc.2021.102422.","journal-title":"Biomed Signal Process Control"},{"issue":"02","key":"1123_CR27","doi-asserted-by":"publisher","first-page":"1840007","DOI":"10.1142\/s2424922x18400077","volume":"10","author":"A Rueda","year":"2018","unstructured":"Rueda A, Krishnan S. Clustering Parkinson\u2019s and age-related voice impairment signal features for unsupervised learning. Adv Data Sci Adapt Anal. 2018;10(02):1840007. https:\/\/doi.org\/10.1142\/s2424922x18400077.","journal-title":"Adv Data Sci Adapt Anal"},{"key":"1123_CR28","doi-asserted-by":"publisher","unstructured":"Tsanas A, Arora S. Large-scale clustering of people diagnosed with Parkinson\u2019s disease using acoustic analysis of sustained vowels: Findings in the Parkinson\u2019s voice initiative study. In: BIOSIGNALS 2020\u201413th International Conference on Bio-Inspired Systems and Signal Processing, Proceedings; Part of 13th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2020, 2020, pp. 369\u2013376, https:\/\/doi.org\/10.5220\/0009361203690376.","DOI":"10.5220\/0009361203690376"},{"key":"1123_CR29","doi-asserted-by":"publisher","unstructured":"Tsanas A, Arora S. Assessing Parkinson\u2019s disease speech signal generalization of clustering results across three countries: findings in the Parkinson\u2019s voice initiative study. In: BIOSIGNALS 2021\u201414th International Conference on Bio-Inspired Systems and Signal Processing; Part of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2021, 2021, pp. 124\u2013131, https:\/\/doi.org\/10.5220\/0010383001240131.","DOI":"10.5220\/0010383001240131"},{"issue":"5","key":"1123_CR30","doi-asserted-by":"publisher","first-page":"2871","DOI":"10.1121\/1.5100272","volume":"145","author":"S Arora","year":"2019","unstructured":"Arora S, Baghai-Ravary L, Tsanas A. Developing a large scale population screening tool for the assessment of Parkinson \u2019 s disease using telephone-quality voice. J Acoust Soc Am. 2019;145(5):2871\u201384.","journal-title":"J Acoust Soc Am"},{"key":"1123_CR31","unstructured":"Tsanas A, Arora S. Biomedical speech signal insights from a large scale cohort across seven countries: the Parkinson\u2019s voice initiative study. In: Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), 2019, pp 45\u201348."},{"key":"1123_CR32","volume-title":"Principles of voice production","author":"IR Titze","year":"2000","unstructured":"Titze IR. Principles of voice production. Iowa City: National Center for Voice and Speech; 2000."},{"issue":"4","key":"1123_CR33","doi-asserted-by":"publisher","first-page":"884","DOI":"10.1109\/TBME.2009.2036000","volume":"57","author":"A Tsanas","year":"2010","unstructured":"Tsanas A, Little MA, McSharry PE, Ramig LO. Accurate telemonitoring of Parkinson\u2019s disease progression by noninvasive speech tests. IEEE Trans Biomed Eng. 2010;57(4):884\u201393. https:\/\/doi.org\/10.1109\/TBME.2009.2036000.","journal-title":"IEEE Trans Biomed Eng"},{"issue":"10","key":"1123_CR34","doi-asserted-by":"publisher","DOI":"10.3390\/diagnostics11101892","volume":"11","author":"S Arora","year":"2021","unstructured":"Arora S, Tsanas A. Assessing Parkinson\u2019s disease at scale using telephone-recorded speech: insights from the Parkinson\u2019s Voice Initiative. Diagnostics. 2021;11(10): e1892. https:\/\/doi.org\/10.3390\/diagnostics11101892.","journal-title":"Diagnostics"},{"key":"1123_CR35","doi-asserted-by":"publisher","unstructured":"Tsanas A, Little MA, McSharry PE, Ramig LO. Enhanced classical dysphonia measures and sparse regression for telemonitoring of Parkinson\u2019s disease progression. In: 2010 IEEE Int. Conf. Acoust. Speech Signal Process., no. March, pp. 594\u2013597, 2010, https:\/\/doi.org\/10.1109\/ICASSP.2010.5495554.","DOI":"10.1109\/ICASSP.2010.5495554"},{"key":"1123_CR36","unstructured":"Tsanas A. Acoustic analysis toolkit for biomedical speech signal processing: concepts and algorithms. In: 8th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), 2013, pp. 37\u201340."},{"key":"1123_CR37","unstructured":"Tsanas A, G\u00f3mez-Vilda P. Novel robust decision support tool assisting early diagnosis of pathological voices using acoustic analysis of sustained vowels. In: Multidisciplinary Conference of Users of Voice, Speech and Singing (JVHC 13), 2013, pp. 3\u201312."},{"key":"1123_CR38","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/j.forsciint.2016.11.020","volume":"270","author":"E San Segundo","year":"2017","unstructured":"San Segundo E, Tsanas A, Gomez-Vilda P. Euclidean Distances as measures of speaker similarity including identical twin pairs: a forensic investigation using source and filter voice characteristics. Forensic Sci Int. 2017;270:25\u201338. https:\/\/doi.org\/10.1016\/j.forsciint.2016.11.020.","journal-title":"Forensic Sci Int"},{"key":"1123_CR39","first-page":"495","volume-title":"Speech coding and synthesis","author":"D Talkin","year":"1995","unstructured":"Talkin D. A robust algorithm for pitch tracking (RAPT). In: Klejn WB, Paliwal KK, editors. Speech coding and synthesis. Elsevier Science B.V; 1995. p. 495\u2013518."},{"issue":"5","key":"1123_CR40","doi-asserted-by":"publisher","first-page":"2885","DOI":"10.1121\/1.4870484","volume":"135","author":"A Tsanas","year":"2014","unstructured":"Tsanas A, Za\u00f1artu M, Little MA, Fox C, Ramig LO, Clifford GD. Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering. J Acoust Soc Am. 2014;135(5):2885\u2013901. https:\/\/doi.org\/10.1121\/1.4870484.","journal-title":"J Acoust Soc Am"},{"issue":"3","key":"1123_CR41","doi-asserted-by":"publisher","first-page":"325","DOI":"10.1016\/j.jvoice.2005.12.009","volume":"20","author":"RM Roark","year":"2006","unstructured":"Roark RM. Frequency and voice: perspectives in the time domain. J Voice. 2006;20(3):325\u201354. https:\/\/doi.org\/10.1016\/j.jvoice.2005.12.009.","journal-title":"J Voice"},{"issue":"3","key":"1123_CR42","doi-asserted-by":"publisher","first-page":"1638","DOI":"10.1121\/1.2951592","volume":"124","author":"A Camacho","year":"2008","unstructured":"Camacho A, Harris JG. A sawtooth waveform inspired pitch estimator for speech and music. J Acoust Soc Am. 2008;124(3):1638\u201352. https:\/\/doi.org\/10.1121\/1.2951592.","journal-title":"J Acoust Soc Am"},{"key":"1123_CR43","volume-title":"Pattern recognition and machine learning","author":"C Bishop","year":"2006","unstructured":"Bishop C. Pattern recognition and machine learning. New York: Springer; 2006."},{"key":"1123_CR44","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-84858-7","volume-title":"Elements of statistical learning","author":"T Hastie","year":"2009","unstructured":"Hastie T, Tibshirani R, Friedman J. Elements of statistical learning. 2nd ed. Springer; 2009.","edition":"2"},{"key":"1123_CR45","first-page":"1157","volume":"3","author":"I Guyon","year":"2003","unstructured":"Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157\u201382.","journal-title":"J Mach Learn Res"},{"key":"1123_CR46","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1007\/978-1-4614-4998-0_7","volume-title":"Handbook of Systems and complexity in health","author":"A Tsanas","year":"2013","unstructured":"Tsanas A, Little MA, McSharry PE. A methodology for the analysis of medical data. In: Sturmberg JP, Martin CM, editors. Handbook of Systems and complexity in health. Springer; 2013. p. 113\u201325."},{"key":"1123_CR47","volume-title":"Feature extraction foundations and applications","year":"2006","unstructured":"Guyon I, Gunn S, Nikravesh M, Zadeh LA, editors. Feature extraction foundations and applications. Chennai: Springer; 2006."},{"issue":"120","key":"1123_CR48","doi-asserted-by":"publisher","first-page":"20160266","DOI":"10.1098\/rsif.2016.0266","volume":"13","author":"E Naydenova","year":"2016","unstructured":"Naydenova E, Tsanas A, Howie S, Casals-Pascual C, De Vos M. The power of data mining in diagnosis of childhood pneumonia. J R Soc Interface. 2016;13(120):20160266. https:\/\/doi.org\/10.1098\/rsif.2016.0266.","journal-title":"J R Soc Interface"},{"key":"1123_CR49","doi-asserted-by":"publisher","first-page":"100471","DOI":"10.1016\/j.patter.2022.100471","volume":"3","author":"A Tsanas","year":"2022","unstructured":"Tsanas A. Relevance, redundancy and complementarity trade-off (RRCT): a principled, generic, robust feature selection tool. Patterns 2022;3:100471. https:\/\/doi.org\/10.1016\/j.patter.2022.100471.","journal-title":"Patterns"},{"key":"1123_CR50","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1007\/springerreference_302701","volume":"5","author":"JG Dy","year":"2004","unstructured":"Dy JG, Brodley CE. Feature selection for unsupervised learning. J Mach Learn Res. 2004;5:845\u201389. https:\/\/doi.org\/10.1007\/springerreference_302701.","journal-title":"J Mach Learn Res"},{"key":"1123_CR51","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1016\/j.patrec.2014.11.006","volume":"53","author":"J Yao","year":"2015","unstructured":"Yao J, Mao Q, Goodison S, Mai V, Sun Y. Feature selection for unsupervised learning through local learning. Pattern Recognit Lett. 2015;53:100\u20137. https:\/\/doi.org\/10.1016\/j.patrec.2014.11.006.","journal-title":"Pattern Recognit Lett"},{"key":"1123_CR52","doi-asserted-by":"publisher","first-page":"173","DOI":"10.5665\/sleep.5342","volume":"39","author":"B Sheaves","year":"2016","unstructured":"Sheaves B, et al. Insomnia, nightmares, and chronotype as markers of risk for severe mental illness: results from a student population. Sleep. 2016;39:173\u201381.","journal-title":"Sleep"},{"key":"1123_CR53","first-page":"2579","volume":"9","author":"L van der Maaten","year":"2008","unstructured":"van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9:2579\u2013605.","journal-title":"J Mach Learn Res"},{"issue":"2","key":"1123_CR54","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1137\/18m1216134","volume":"1","author":"GC Linderman","year":"2019","unstructured":"Linderman GC, Steinerberger S. Clustering with t-SNE, provably. SIAM J Math Data Sci. 2019;1(2):313\u201332. https:\/\/doi.org\/10.1137\/18m1216134.","journal-title":"SIAM J Math Data Sci"},{"issue":"1","key":"1123_CR55","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-019-44221-3","volume":"9","author":"AP Vogel","year":"2019","unstructured":"Vogel AP, Tsanas A, Scattoni ML. Quantifying ultrasonic mouse vocalizations using acoustic analysis in a supervised statistical machine learning framework. Sci Rep. 2019;9(1): e8100. https:\/\/doi.org\/10.1038\/s41598-019-44221-3.","journal-title":"Sci Rep"},{"key":"1123_CR56","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","volume":"20","author":"PJ Rousseeuw","year":"1987","unstructured":"Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53\u201365. https:\/\/doi.org\/10.1016\/0377-0427(87)90125-7.","journal-title":"J Comput Appl Math"},{"key":"1123_CR57","doi-asserted-by":"publisher","first-page":"112","DOI":"10.3390\/diagnostics12010112","volume":"12","author":"SH Lee","year":"2022","unstructured":"Lee SH, et al. Parkinson\u2019s disease subtyping using clinical features and biomarkers: literature review and preliminary study of subtype clustering. Diagnostics. 2022;12:112. https:\/\/doi.org\/10.3390\/diagnostics12010112.","journal-title":"Diagnostics"},{"issue":"3","key":"1123_CR58","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1136\/jnnp.2003.033530","volume":"76","author":"SJG Lewis","year":"2005","unstructured":"Lewis SJG, Foltynie T, Blackwell AD, Bobbins TW, Owen AM, Barker RA. Heterogeneity of Parkinson\u2019s disease in the early clinical stages using a data driven approach. J Neurol Neurosurg Psychiatry. 2005;76(3):343\u20138. https:\/\/doi.org\/10.1136\/jnnp.2003.033530.","journal-title":"J Neurol Neurosurg Psychiatry"},{"issue":"1","key":"1123_CR59","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1002\/mds.23346","volume":"26","author":"SM Van Rooden","year":"2011","unstructured":"Van Rooden SM, et al. Clinical subtypes of Parkinson\u2019s disease. Mov Disord. 2011;26(1):51\u20138. https:\/\/doi.org\/10.1002\/mds.23346.","journal-title":"Mov Disord"},{"issue":"9","key":"1123_CR60","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3389\/fnagi.2017.00301","volume":"9","author":"J Mu","year":"2017","unstructured":"Mu J, Chaudhuri KR, Bielza C, de Pedro-Cuesta J, Larra\u00f1aga P, Martinez-Martin P. Parkinson\u2019s disease subtypes identified from cluster analysis of motor and non-motor symptoms. Front Aging Neurosci. 2017;9(9):1\u201310. https:\/\/doi.org\/10.3389\/fnagi.2017.00301.","journal-title":"Front Aging Neurosci."},{"issue":"5","key":"1123_CR61","doi-asserted-by":"publisher","DOI":"10.2196\/16452","volume":"8","author":"E Horne","year":"2020","unstructured":"Horne E, Tibble H, Sheikh A, Tsanas A. Challenges of clustering multimodal clinical data: review of applications in asthma subtyping. JMIR Med Inform. 2020;8(5): e16452. https:\/\/doi.org\/10.2196\/16452.","journal-title":"JMIR Med Inform"},{"key":"1123_CR62","unstructured":"Tsanas A, Little MA, McSharry PE, Ramig LO. Journal of the Royal Society Interface\u2014supplementary material. J R Soc Interface, 2011; pp 1\u201321."},{"key":"1123_CR63","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2020.3015018","author":"K Woodward","year":"2021","unstructured":"Woodward K, et al. Beyond mobile apps: a survey of technologies for mental well-being. IEEE Trans Affect Comput. 2021. https:\/\/doi.org\/10.1109\/TAFFC.2020.3015018.","journal-title":"IEEE Trans Affect Comput"},{"issue":"June","key":"1123_CR64","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1016\/j.neucom.2020.05.078","volume":"410","author":"JM G\u00f3rriz","year":"2020","unstructured":"G\u00f3rriz JM, et al. Artificial intelligence within the interplay between natural and artificial computation: advances in data science, trends and applications. Neurocomputing. 2020;410(June):237\u201370. https:\/\/doi.org\/10.1016\/j.neucom.2020.05.078.","journal-title":"Neurocomputing"},{"issue":"4","key":"1123_CR65","doi-asserted-by":"publisher","first-page":"e12286","DOI":"10.2196\/12286","volume":"21","author":"AK Triantafyllidis","year":"2019","unstructured":"Triantafyllidis AK, Tsanas A. Applications of machine learning in real-life digital health interventions: review of the literature. J Med Internet Res. 2019;21(4):e12286.","journal-title":"J Med Internet Res"},{"issue":"8","key":"1123_CR66","doi-asserted-by":"publisher","first-page":"1761","DOI":"10.1109\/TBME.2016.2611862","volume":"64","author":"N Palmius","year":"2017","unstructured":"Palmius N, et al. Detecting bipolar depression from geographic location data. IEEE Trans Biomed Eng. 2017;64(8):1761\u201371. https:\/\/doi.org\/10.1109\/TBME.2016.2611862.","journal-title":"IEEE Trans Biomed Eng"},{"issue":"4","key":"1123_CR67","doi-asserted-by":"publisher","DOI":"10.2196\/14306","volume":"8","author":"A Tsanas","year":"2020","unstructured":"Tsanas A, Woodward E, Ehlers A. Objective characterization of activity, sleep, and circadian rhythm patterns using a wrist-worn actigraphy sensor: insights into post-traumatic stress disorder. JMIR mHealth uHealth. 2020;8(4): e14306. https:\/\/doi.org\/10.2196\/14306.","journal-title":"JMIR mHealth uHealth"},{"key":"1123_CR68","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1016\/j.jad.2016.06.065","volume":"205","author":"A Tsanas","year":"2016","unstructured":"Tsanas A, et al. Daily longitudinal self-monitoring of mood variability in bipolar disorder and borderline personality disorder. J Affect Disord. 2016;205:225\u201333. https:\/\/doi.org\/10.1016\/j.jad.2016.06.065.","journal-title":"J Affect Disord"},{"issue":"2","key":"1123_CR69","doi-asserted-by":"publisher","DOI":"10.2196\/mental.6917","volume":"4","author":"A Tsanas","year":"2017","unstructured":"Tsanas A, Saunders KEAK, Bilderbeck AAC, Palmius N, Goodwin GM, De Vos M. Clinical insight into latent variables of psychiatric questionnaires for mood symptom self-assessment. JMIR Ment Heal. 2017;4(2): e15. https:\/\/doi.org\/10.2196\/mental.6917.","journal-title":"JMIR Ment Heal"}],"container-title":["SN Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-022-01123-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s42979-022-01123-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-022-01123-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,9]],"date-time":"2022-05-09T18:02:47Z","timestamp":1652119367000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s42979-022-01123-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,20]]},"references-count":69,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,5]]}},"alternative-id":["1123"],"URL":"https:\/\/doi.org\/10.1007\/s42979-022-01123-y","relation":{},"ISSN":["2662-995X","2661-8907"],"issn-type":[{"value":"2662-995X","type":"print"},{"value":"2661-8907","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,20]]},"assertion":[{"value":"18 October 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 March 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 April 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Both authors have no COI to declare.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of Interest"}},{"value":"This is secondary use of the data that have been previously used to publish work, e.g. in the prestigious <i>Journal of Acoustical Society of America<\/i> where we first reported on the Parkinson\u2019s Voice Initiative, and more recently in <i>Diagnostics<\/i>.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical Approval"}},{"value":"Participants provided oral consent by self-selecting to enroll in the study over the phone, after listening to the purpose of the study and having the option to listen in more detail Terms & Conditions.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent to Participate"}},{"value":"Participants provided consent for their data to be used for academic research (and hence publication).","order":5,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for Publication"}}],"article-number":"232"}}