{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T13:05:56Z","timestamp":1776776756957,"version":"3.51.2"},"reference-count":39,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2024,2,26]],"date-time":"2024-02-26T00:00:00Z","timestamp":1708905600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Parkinson\u2019s disease (PD) is a neurodegenerative disorder characterized by a range of motor and non-motor symptoms. One of the notable non-motor symptoms of PD is the presence of vocal disorders, attributed to the underlying pathophysiological changes in the neural control of the laryngeal and vocal tract musculature. From this perspective, the integration of machine learning (ML) techniques in the analysis of speech signals has significantly contributed to the detection and diagnosis of PD. Particularly, MEL Frequency Cepstral Coefficients (MFCCs) and Gammatone Frequency Cepstral Coefficients (GTCCs) are both feature extraction techniques commonly used in the field of speech and audio signal processing that could exhibit great potential for vocal disorder identification. This study presents a novel approach to the early detection of PD through ML applied to speech analysis, leveraging both MFCCs and GTCCs. The recordings contained in the Mobile Device Voice Recordings at King\u2019s College London (MDVR-KCL) dataset were used. These recordings were collected from healthy individuals and PD patients while they read a passage and during a spontaneous conversation on the phone. Particularly, the speech data regarding the spontaneous dialogue task were processed through speaker diarization, a technique that partitions an audio stream into homogeneous segments according to speaker identity. The ML applied to MFCCS and GTCCs allowed us to classify PD patients with a test accuracy of 92.3%. This research further demonstrates the potential to employ mobile phones as a non-invasive, cost-effective tool for the early detection of PD, significantly improving patient prognosis and quality of life.<\/jats:p>","DOI":"10.3390\/s24051499","type":"journal-article","created":{"date-parts":[[2024,2,26]],"date-time":"2024-02-26T03:34:04Z","timestamp":1708918444000},"page":"1499","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":45,"title":["Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson\u2019s Disease: A Study on Speaker Diarization and Classification Techniques"],"prefix":"10.3390","volume":"24","author":[{"given":"Michele Giuseppe","family":"Di Cesare","sequence":"first","affiliation":[{"name":"Department of Engineering and Geology, University G. D\u2019Annunzio of Chieti-Pescara, 65127 Pescara, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1903-0501","authenticated-orcid":false,"given":"David","family":"Perpetuini","sequence":"additional","affiliation":[{"name":"Department of Engineering and Geology, University G. D\u2019Annunzio of Chieti-Pescara, 65127 Pescara, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1506-1995","authenticated-orcid":false,"given":"Daniela","family":"Cardone","sequence":"additional","affiliation":[{"name":"Department of Engineering and Geology, University G. D\u2019Annunzio of Chieti-Pescara, 65127 Pescara, Italy"}]},{"given":"Arcangelo","family":"Merla","sequence":"additional","affiliation":[{"name":"Department of Engineering and Geology, University G. D\u2019Annunzio of Chieti-Pescara, 65127 Pescara, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2024,2,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1109\/MWC.001.2000345","article-title":"Edge Intelligence for Empowering IoT-Based Healthcare Systems","volume":"28","author":"Hayyolalam","year":"2021","journal-title":"IEEE Wirel. Commun."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1016\/j.future.2017.04.036","article-title":"Towards Fog-Driven IoT eHealth: Promises and Challenges of IoT in Medicine and Healthcare","volume":"78","author":"Farahani","year":"2018","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3391","DOI":"10.1007\/s00405-015-3708-4","article-title":"Exploring the Feasibility of Smart Phone Microphone for Measurement of Acoustic Voice Parameters and Voice Pathology Screening","volume":"272","author":"Uloza","year":"2015","journal-title":"Eur. Arch. Oto-Rhino-Laryngol."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ferreira-Cardoso, H., J\u00e1come, C., Silva, S., Amorim, A., Redondo, M.T., Fontoura-Matias, J., Vicente-Ferreira, M., Vieira-Marques, P., Valente, J., and Almeida, R. (2021). Lung Auscultation Using the Smartphone\u2014Feasibility Study in Real-World Clinical Practice. Sensors, 21.","DOI":"10.3390\/s21144931"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3991","DOI":"10.1044\/2020_JSLHR-20-00212","article-title":"Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions","volume":"63","author":"Wu","year":"2020","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1007\/s00405-022-07546-w","article-title":"An iOS-Based VoiceScreen Application: Feasibility for Use in Clinical Settings\u2014A Pilot Study","volume":"280","author":"Uloza","year":"2023","journal-title":"Eur. Arch. Oto-Rhino-Laryngol."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"EL327","DOI":"10.1121\/1.4964639","article-title":"Evaluation of Smartphone Sound Measurement Applications (Apps) Using External Microphones\u2014A Follow-up Study","volume":"140","author":"Kardous","year":"2016","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Maskeli\u016bnas, R., Dama\u0161evi\u010dius, R., Bla\u017eauskas, T., Pribui\u0161is, K., Ulozait\u0117-Stanien\u0117, N., and Uloza, V. (2023). Pareto-Optimized AVQI Assessment of Dysphonia: A Clinical Trial Using Various Smartphones. Appl. Sci., 13.","DOI":"10.3390\/app13095363"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Gutierrez, L.J., Rabbani, K., Ajayi, O.J., Gebresilassie, S.K., Rafferty, J., Castro, L.A., and Banos, O. (2021). Internet of Things for Mental Health: Open Issues in Data Acquisition, Self-Organization, Service Level Agreement, and Identity Management. Int. J. Environ. Res. Public Health, 18.","DOI":"10.3390\/ijerph18031327"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.patrec.2020.05.016","article-title":"Trends in IoT Based Solutions for Health Care: Moving AI to the Edge","volume":"135","author":"Greco","year":"2020","journal-title":"Pattern Recognit. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"e33944","DOI":"10.2196\/33944","article-title":"Use of Mobile Apps for Self-Care in People with Parkinson Disease: Systematic Review","volume":"10","author":"Lee","year":"2022","journal-title":"JMIR Mhealth Uhealth"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Gaggi, G., Di Credico, A., Izzicupo, P., Iannetti, G., Di Baldassarre, A., and Ghinassi, B. (2021). Chemical and Biological Molecules Involved in Differentiation, Maturation, and Survival of Dopaminergic Neurons in Health and Parkinson\u2019s Disease: Physiological Aspects and Clinical Implications. Biomedicines, 9.","DOI":"10.3390\/biomedicines9070754"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"4887","DOI":"10.1097\/MS9.0000000000001142","article-title":"Parkinson\u2019s Disease Updates: Addressing the Pathophysiology, Risk Factors, Genetics, Diagnosis, along with the Medical and Surgical Treatment","volume":"85","author":"Prajjwal","year":"2023","journal-title":"Ann. Med. Surg."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1330","DOI":"10.1044\/2014_JSLHR-S-13-0039","article-title":"Multiple Factors Are Involved in the Dysarthria Associated With Parkinson\u2019s Disease: A Review With Implications for Clinical Practice and Research","volume":"57","author":"Sapir","year":"2014","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1590\/2317-1782\/20152014083","article-title":"Dysarthria and Quality of Life in Neurologically Healthy Elderly and Patients with Parkinson\u2019s Disease","volume":"27","author":"Gobbi","year":"2015","journal-title":"CoDAS"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"652167","DOI":"10.1155\/S1110865704309030","article-title":"Using Mel-Frequency Cepstral Coefficients in Missing Data Technique","volume":"2004","author":"Jun","year":"2004","journal-title":"EURASIP J. Adv. Signal Process."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhao, X., and Wang, D. (2013, January 26\u201331). Analyzing Noise Robustness of MFCC and GFCC Features in Speaker Identification. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6639061"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Benba, A., Jilbab, A., Hammouch, A., and Sandabad, S. (2015, January 25\u201327). Voiceprints Analysis Using MFCC and SVM for Detecting Patients with Parkinson\u2019s Disease. Proceedings of the 2015 International Conference on Electrical and Information Technologies (ICEIT), Marrakech, Morocco.","DOI":"10.1109\/EITech.2015.7163000"},{"key":"ref_19","unstructured":"Jaeger, H., Trivedi, D., and Stadtschnitzer, M. (2019). Mobile Device Voice Recordings at King\u2019s College London (MDVR-KCL) from Both Early and Advanced Parkinson\u2019s Disease Patients and Healthy Controls. Zenodo."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Adiga, A., Magimai, M., and Seelamantula, C.S. (2013, January 22\u201325). Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition. Proceedings of the 2013 IEEE International Conference of IEEE Region 10 (TENCON 2013), Xi\u2019an, China.","DOI":"10.1109\/TENCON.2013.6718948"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"855","DOI":"10.2147\/JAA.S285742","article-title":"Application of Machine Learning Algorithms for Asthma Management with mHealth: A Clinical Review","volume":"15","author":"Tsang","year":"2022","journal-title":"J. Asthma Allergy"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"96162","DOI":"10.1109\/ACCESS.2020.2995737","article-title":"Detection of Speech Impairments Using Cepstrum, Auditory Spectrogram and Wavelet Time Scattering Domain Features","volume":"8","author":"Lauraitis","year":"2020","journal-title":"IEEE Access"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Tripathi, A., Singh, U., Bansal, G., Gupta, R., and Singh, A.K. (2020, January 21\u201323). A Review on Emotion Detection and Classification Using Speech 2020. Proceedings of the International Conference in innovative Computing and Communication (ICICC-2020), Vallodid, Spain.","DOI":"10.2139\/ssrn.3601803"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1186\/1687-6180-2011-141","article-title":"Transient Noise Reduction in Speech Signal with a Modified Long-Term Predictor","volume":"2011","author":"Choi","year":"2011","journal-title":"EURASIP J. Adv. Signal Process."},{"key":"ref_25","first-page":"297","article-title":"Detecting Patients with Parkinson\u2019s Disease Using Mel Frequency Cepstral Coefficients and Support Vector Machines","volume":"7","author":"Benba","year":"2015","journal-title":"Int. J. Electr. Eng. Inform."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.35784\/acs-2023-11","article-title":"CNN And LSTM For The Classification Of Parkinson\u2019s Disease Based On The GTCC And MFCC","volume":"19","author":"Boualoulou","year":"2023","journal-title":"Appl. Comput. Sci."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"578369","DOI":"10.3389\/fninf.2021.578369","article-title":"X-Vectors: New Quantitative Biomarkers for Early Parkinson\u2019s Disease Detection From Speech","volume":"15","author":"Jeancolas","year":"2021","journal-title":"Front. Neuroinform."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Khan, A., Javed, A., Malik, K.M., Raza, M.A., Ryan, J., Saudagar, A.K.J., and Malik, H. (2022). Toward Realigning Automatic Speaker Verification in the Era of COVID-19. Sensors, 22.","DOI":"10.3390\/s22072638"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1007\/s00530-002-0065-0","article-title":"Content-Based Audio Classification and Segmentation by Using Support Vector Machines","volume":"8","author":"Lu","year":"2003","journal-title":"Multimed. Syst."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Prasanna, S.R.M., Karpov, A., Samudravijaya, K., and Agrawal, S.S. (2022, January 14\u201316). Assessment of Speech Quality During Speech Rehabilitation Based on the Solution of the Classification Problem. Proceedings of the Speech and Computer, Gurugram, India.","DOI":"10.1007\/978-3-031-20980-2"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1109\/72.991427","article-title":"A Comparison of Methods for Multiclass Support Vector Machines","volume":"13","author":"Hsu","year":"2002","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1002\/cem.1225","article-title":"Repeated Double Cross Validation","volume":"23","author":"Filzmoser","year":"2009","journal-title":"J. Chemom."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1186\/1758-2946-6-10","article-title":"Cross-Validation Pitfalls When Selecting and Assessing Regression and Classification Models","volume":"6","author":"Krstajic","year":"2014","journal-title":"J. Cheminform."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Di Credico, A., Perpetuini, D., Chiacchiaretta, P., Cardone, D., Filippini, C., Gaggi, G., Merla, A., Ghinassi, B., Di Baldassarre, A., and Izzicupo, P. (2021). The Prediction of Running Velocity during the 30\u201315 Intermittent Fitness Test Using Accelerometry-Derived Metrics and Physiological Parameters: A Machine Learning Approach. Int. J. Environ. Res. Public Health, 18.","DOI":"10.3390\/ijerph182010854"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"01019","DOI":"10.1051\/itmconf\/20224301019","article-title":"Speech Analysis for the Detection of Parkinson\u2019s Disease by Combined Use of Empirical Mode Decomposition, Mel Frequency Cepstral Coefficients, and the K-Nearest Neighbor Classifier","volume":"43","author":"Boualoulou","year":"2022","journal-title":"ITM Web Conf."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Fahed, V.S., Doheny, E.P., Busse, M., Hoblyn, J., and Lowery, M.M. (J. Voice, 2022). Comparison of Acoustic Voice Features Derived from Mobile Devices and Studio Microphone Recordings, J. Voice, in press.","DOI":"10.1016\/j.jvoice.2022.10.006"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Awan, S.N., Shaikh, M.A., Awan, J.A., Abdalla, I., Lim, K.O., and Misono, S. (J. Voice, 2023). Smartphone Recordings Are Comparable to \u201cGold Standard\u201d Recordings for Acoustic Measurements of Voice, J. Voice, in press.","DOI":"10.1016\/j.jvoice.2023.01.031"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"22799036221102491","DOI":"10.1177\/22799036221102491","article-title":"The Ethical Dilemma of Mobile Phone Data Monitoring during COVID-19: The Case for South Korea and the United States","volume":"11","author":"Anom","year":"2022","journal-title":"J. Public Health Res."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"549","DOI":"10.3390\/biomedinformatics4010031","article-title":"Assessment of Voice Disorders Using Machine Learning and Vocal Analysis of Voice Samples Recorded through Smartphones","volume":"4","author":"Perpetuini","year":"2024","journal-title":"BioMedInformatics"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/5\/1499\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:04:49Z","timestamp":1760105089000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/5\/1499"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,26]]},"references-count":39,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2024,3]]}},"alternative-id":["s24051499"],"URL":"https:\/\/doi.org\/10.3390\/s24051499","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,26]]}}}