{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T01:14:19Z","timestamp":1774401259428,"version":"3.50.1"},"publisher-location":"Cham","reference-count":32,"publisher":"Springer Nature Switzerland","isbn-type":[{"value":"9783031530241","type":"print"},{"value":"9783031530258","type":"electronic"}],"license":[{"start":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T00:00:00Z","timestamp":1704067200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T00:00:00Z","timestamp":1706745600000},"content-version":"vor","delay-in-days":31,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Using acoustic analysis to classify and identify speech disorders non-invasively can reduce waiting times for patients and specialists while also increasing the accuracy of diagnoses. In order to identify models to use in a vocal disease diagnosis system, we want to know which models have higher success rates in distinguishing between healthy and pathological sounds. For this purpose, 708 diseased people spread throughout 19 pathologies, and 194 control people were used. There are nine sound files per subject, three vowels in three tones, for each subject. From each sound file, 13 parameters were extracted. For the classification of healthy\/pathological individuals, a variety of classifiers based on Machine Learning models were used, including decision trees, discriminant analyses, logistic regression classifiers, naive Bayes classifiers, support vector machines, classifiers of closely related variables, ensemble classifiers and artificial neural network classifiers. For each patient, 118 parameters were used initially. The first analysis aimed to find the best classifier, thus obtaining an accuracy of 81.3% for the Ensemble Sub-space Discriminant classifier. The second and third analyses aimed to improve ground accuracy using preprocessing methodologies. Therefore, in the second analysis, the PCA technique was used, with an accuracy of 80.2%. The third analysis combined several outlier treatment models with several data normalization models and, in general, accuracy improved, obtaining the best accuracy (82.9%) with the combination of the Greebs model for outliers treatment and the range model for the normalization of data procedure.<\/jats:p>","DOI":"10.1007\/978-3-031-53025-8_20","type":"book-chapter","created":{"date-parts":[[2024,1,31]],"date-time":"2024-01-31T20:02:12Z","timestamp":1706731332000},"page":"287-299","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Accuracy Optimization in Speech Pathology Diagnosis with Data Preprocessing Techniques"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0618-4627","authenticated-orcid":false,"given":"Joana Filipa Teixeira","family":"Fernandes","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4260-9677","authenticated-orcid":false,"given":"Diamantino Rui","family":"Freitas","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6679-5702","authenticated-orcid":false,"given":"Jo\u00e3o Paulo","family":"Teixeira","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,2,1]]},"reference":[{"issue":"2","key":"20_CR1","doi-asserted-by":"publisher","first-page":"1270","DOI":"10.1109\/TKDE.2021.3103571","volume":"35","author":"MB Toller","year":"2023","unstructured":"Toller, M.B., Geiger, B.C., Kern, R.: Cluster purging: efficient outlier detection based on rate-distortion theory. IEEE Trans. Knowl. Data Eng. 35(2), 1270\u20131282 (2023). https:\/\/doi.org\/10.1109\/TKDE.2021.3103571","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"20_CR2","doi-asserted-by":"publisher","first-page":"118904","DOI":"10.1016\/J.ESWA.2022.118904","volume":"213","author":"A Abhaya","year":"2023","unstructured":"Abhaya, A., Patra, B.K.: An efficient method for autoencoder based outlier detection. Exp. Syst. Appl. 213, 118904 (2023). https:\/\/doi.org\/10.1016\/J.ESWA.2022.118904","journal-title":"Exp. Syst. Appl."},{"key":"20_CR3","doi-asserted-by":"publisher","first-page":"678","DOI":"10.1016\/J.PROCS.2019.12.235","volume":"164","author":"L Silva","year":"2019","unstructured":"Silva, L., et al.: Outliers treatment to improve the recognition of voice pathologies. Procedia Comput. Sci. 164, 678\u2013685 (2019). https:\/\/doi.org\/10.1016\/J.PROCS.2019.12.235","journal-title":"Procedia Comput. Sci."},{"issue":"1","key":"20_CR4","doi-asserted-by":"publisher","first-page":"2408","DOI":"10.1038\/s41598-023-29549-1","volume":"13","author":"X Du","year":"2023","unstructured":"Du, X., Zuo, E., Chu, Z., He, Z., Yu, J.: Fluctuation-based outlier detection. Sci. Rep. 13(1), 2408 (2023). https:\/\/doi.org\/10.1038\/s41598-023-29549-1","journal-title":"Sci. Rep."},{"issue":"1","key":"20_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/00401706.1969.10490657","volume":"11","author":"FE Grubbs","year":"1969","unstructured":"Grubbs, F.E.: Procedures for detecting outlying observations in samples. Technometrics 11(1), 1\u201321 (1969). https:\/\/doi.org\/10.1080\/00401706.1969.10490657","journal-title":"Technometrics"},{"issue":"4","key":"20_CR6","doi-asserted-by":"publisher","first-page":"860","DOI":"10.2307\/2530182","volume":"37","author":"AC Atkinson","year":"1981","unstructured":"Atkinson, A.C., Hawkins, D.M.: Identification of outliers. Biometrics 37(4), 860 (1981). https:\/\/doi.org\/10.2307\/2530182","journal-title":"Biometrics"},{"key":"20_CR7","doi-asserted-by":"publisher","unstructured":"Yang, X., Latecki, L.J., Pokrajac, D.: Outlier detection with globally optimal exemplar-based GMM. In: 2009 9th SIAM International Conference on Data Mining. Proceedings in Applied Mathematics, vol. 1, pp. 144\u2013153. Society for Industrial and Applied Mathematics (2009). https:\/\/doi.org\/10.1137\/1.9781611972795.13","DOI":"10.1137\/1.9781611972795.13"},{"key":"20_CR8","unstructured":"Seo, S., Marsh, P.D.G.M.: A review and comparison of methods for detecting outliersin univariate data sets (2006). http:\/\/d-scholarship.pitt.edu\/7948\/"},{"issue":"2","key":"20_CR9","first-page":"17","volume":"61","author":"FA Pino","year":"2014","unstructured":"Pino, F.A.: A quest\u00e3o da n\u00e3o normalidade: uma revis\u00e3o. Rev. Econ. Agr\u00edcola 61(2), 17\u201333 (2014)","journal-title":"Rev. Econ. Agr\u00edcola"},{"key":"20_CR10","first-page":"1157","volume":"3","author":"I Guyon","year":"2003","unstructured":"Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157\u20131182 (2003)","journal-title":"J. Mach. Learn. Res."},{"key":"20_CR11","doi-asserted-by":"publisher","unstructured":"Rodrigues, P.M., Teixeira, J.P.: Classification of electroencephalogram signals using artificial neural networks. In: Proceedings of the 2010 3rd International Conference on Biomedical Engineering and Informatics, BMEI 2010, vol. 2, pp. 808\u2013812 (2010). https:\/\/doi.org\/10.1109\/BMEI.2010.5639941","DOI":"10.1109\/BMEI.2010.5639941"},{"key":"20_CR12","doi-asserted-by":"publisher","first-page":"948","DOI":"10.1016\/J.PROCS.2021.01.251","volume":"181","author":"L Silva","year":"2021","unstructured":"Silva, L., Bispo, B., Teixeira, J.P.: Features selection algorithms for classification of voice signals. Procedia Comput. Sci. 181, 948\u2013956 (2021). https:\/\/doi.org\/10.1016\/J.PROCS.2021.01.251","journal-title":"Procedia Comput. Sci."},{"key":"20_CR13","doi-asserted-by":"crossref","unstructured":"Teixeira, J.P., Freitas, D.: Segmental durations predicted with a neural network. In: International Conference on Spoken Language Processing, Proceedings of Eurospeech 2003, pp. 169\u2013172 (2003)","DOI":"10.21437\/Eurospeech.2003-91"},{"key":"20_CR14","doi-asserted-by":"crossref","unstructured":"Teixeira, J.P., Freitas, D., Braga, D., Barros, M.J., Latsch, V.: Phonetic events from the labeling the European Portuguese database for speech synthesis, FEUP\/IPB-DB. In: International Conference on Spoken Language Processing, Proceedings of Eurospeech 2001, pp. 1707\u20131710 (2001). 8790834100, 978-879083410-4","DOI":"10.21437\/Eurospeech.2001-400"},{"key":"20_CR15","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1016\/J.PROCS.2016.09.155","volume":"100","author":"JP Teixeira","year":"2016","unstructured":"Teixeira, J.P., Gon\u00e7alves, A.: Algorithm for jitter and shimmer measurement in pathologic voices. Procedia Comput. Sci. 100, 271\u2013279 (2016). https:\/\/doi.org\/10.1016\/J.PROCS.2016.09.155","journal-title":"Procedia Comput. Sci."},{"key":"20_CR16","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1016\/J.PROCS.2018.10.040","volume":"138","author":"J Fernandes","year":"2018","unstructured":"Fernandes, J., Teixeira, F., Guedes, V., Junior, A., Teixeira, J.P.: Harmonic to noise ratio measurement - selection of window and length. Procedia Comput. Sci. 138, 280\u2013285 (2018). https:\/\/doi.org\/10.1016\/J.PROCS.2018.10.040","journal-title":"Procedia Comput. Sci."},{"key":"20_CR17","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1007\/978-3-031-23236-7_29","volume-title":"Optimization, Learning Algorithms and Applications: Second International Conference, OL2A 2022, P\u00f3voa de Varzim, Portugal, October 24\u201325, 2022, Proceedings","author":"J Fernandes","year":"2022","unstructured":"Fernandes, J., Junior, A.C., Freitas, D., Teixeira, J.P.: Smart data driven system for\u00a0pathological voices classification. In: Pereira, A.I., Ko\u0161ir, A., Fernandes, F.P., Pacheco, M.F., Teixeira, J.P., Lopes, R.P. (eds.) Optimization, Learning Algorithms and Applications: Second International Conference, OL2A 2022, P\u00f3voa de Varzim, Portugal, October 24\u201325, 2022, Proceedings, pp. 419\u2013426. Springer, Cham (2022). https:\/\/doi.org\/10.1007\/978-3-031-23236-7_29"},{"key":"20_CR18","unstructured":"P\u00fctzer, M., Barry, W.J.: Saarbruecken Voice Database. Institute of Phonetics at the University of Saarland (2007). http:\/\/www.stimmdatenbank.coli.uni-saarland.de. Accessed 05 Nov 2021"},{"key":"20_CR19","doi-asserted-by":"publisher","first-page":"654","DOI":"10.1016\/J.PROCS.2019.12.232","volume":"164","author":"J Fernandes","year":"2019","unstructured":"Fernandes, J., Silva, L., Teixeira, F., Guedes, V., Santos, J., Teixeira, J.P.: Parameters for vocal acoustic analysis - cured database. Procedia Comput. Sci. 164, 654\u2013661 (2019). https:\/\/doi.org\/10.1016\/J.PROCS.2019.12.232","journal-title":"Procedia Comput. Sci."},{"key":"20_CR20","doi-asserted-by":"publisher","first-page":"1085","DOI":"10.3844\/jcssp.2020.1085.1099","volume":"16","author":"R Hamdi","year":"2020","unstructured":"Hamdi, R., Hajji, S., Cherif, A., Processing, S.: Recognition of pathological voices by human factor cepstral coefficients (HFCC). J. Comput. Sci. 16, 1085\u20131099 (2020). https:\/\/doi.org\/10.3844\/jcssp.2020.1085.1099","journal-title":"J. Comput. Sci."},{"issue":"4","key":"20_CR21","doi-asserted-by":"publisher","first-page":"2333","DOI":"10.3390\/app13042333","volume":"13","author":"JFT Fernandes","year":"2023","unstructured":"Fernandes, J.F.T., Freitas, D., Junior, A.C., Teixeira, J.P.: Determination of harmonic parameters in pathological voices\u2014efficient algorithm. Appl. Sci. 13(4), 2333 (2023). https:\/\/doi.org\/10.3390\/app13042333","journal-title":"Appl. Sci."},{"key":"20_CR22","doi-asserted-by":"publisher","first-page":"466","DOI":"10.1016\/J.PROCS.2015.08.544","volume":"64","author":"JP Teixeira","year":"2015","unstructured":"Teixeira, J.P., Fernandes, P.O.: Acoustic analysis of vocal dysphonia. Procedia Comput. Sci. 64, 466\u2013473 (2015). https:\/\/doi.org\/10.1016\/J.PROCS.2015.08.544","journal-title":"Procedia Comput. Sci."},{"key":"20_CR23","doi-asserted-by":"publisher","unstructured":"Teixeira, J.P., Fernandes, J., Teixeira, F., Fernandes, P.O.: Acoustic analysis of chronic laryngitis statistical analysis of sustained speech parameters. In: 11th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2018, vol. 4, pp. 168\u2013175 (2018). https:\/\/doi.org\/10.5220\/0006586301680175","DOI":"10.5220\/0006586301680175"},{"key":"20_CR24","unstructured":"Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In: IFA Proceedings 17, vol. 17, pp. 97\u2013110 (1993). http:\/\/www.fon.hum.uva.nl\/paul\/papers\/Proceedings_1993.pdf"},{"issue":"4","key":"20_CR25","first-page":"237","volume":"12","author":"P Boersma","year":"2004","unstructured":"Boersma, P.: Stemmen meten met Praat. Stem-, Spraak- en Taalpathologie 12(4), 237\u2013251 (2004)","journal-title":"Stem-, Spraak- en Taalpathologie"},{"issue":"4","key":"20_CR26","doi-asserted-by":"publisher","first-page":"141","DOI":"10.3390\/bioengineering9040141","volume":"9","author":"T Ara\u00fajo","year":"2022","unstructured":"Ara\u00fajo, T., Teixeira, J.P., Rodrigues, P.M.: Smart-data-driven system for alzheimer disease detection through electroencephalographic signals. Bioengineering 9(4), 141 (2022). https:\/\/doi.org\/10.3390\/bioengineering9040141","journal-title":"Bioengineering"},{"key":"20_CR27","unstructured":"NIST\/SEMATECH: e-Handbook of Statistical Methods. http:\/\/www.itl.nist.gov\/div898\/handbook\/. Accessed 14 Jun 2023"},{"key":"20_CR28","doi-asserted-by":"publisher","unstructured":"Unwin, A.: Exploratory data analysis, 3rd edn. In: International Encyclopedia of Education, pp. 156\u2013161. Elsevier, Amsterdam (2010). https:\/\/doi.org\/10.1016\/B978-0-08-044894-7.01327-0","DOI":"10.1016\/B978-0-08-044894-7.01327-0"},{"key":"20_CR29","unstructured":"Triola, M.F.: Introdu\u00e7\u00e3o \u00e0 estat\u00edstica, 12th edn. In: Elementary Statistics. Pearson Education INC, Rio de Janeiro (2017)"},{"key":"20_CR30","unstructured":"MathWorks: Normalize. https:\/\/www.mathworks.com\/help\/matlab\/ref\/double.normalize.html#d124e1046230. Accessed 14 Jun 2023"},{"issue":"1","key":"20_CR31","doi-asserted-by":"publisher","first-page":"37","DOI":"10.4018\/IJEHMC.2020010103","volume":"11","author":"JP Teixeira","year":"2020","unstructured":"Teixeira, J.P., Alves, N., Fernandes, P.O.: Vocal acoustic analysis: ANN Versos SVM in classification of dysphonic voices and vocal cords paralysis. Int. J. E-Health Med. Commun. 11(1), 37\u201351 (2020). https:\/\/doi.org\/10.4018\/IJEHMC.2020010103","journal-title":"Int. J. E-Health Med. Commun."},{"issue":"1","key":"20_CR32","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/s13755-018-0059-8","volume":"6","author":"AS Ashour","year":"2018","unstructured":"Ashour, A.S., Guo, Y., Hawas, A.R., Guan, Xu.: Ensemble of subspace discriminant classifiers for schistosomal liver fibrosis staging in mice microscopic images. Health Inf. Sci. Syst. 6(1), 21 (2018). https:\/\/doi.org\/10.1007\/s13755-018-0059-8","journal-title":"Health Inf. Sci. Syst."}],"container-title":["Communications in Computer and Information Science","Optimization, Learning Algorithms and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-53025-8_20","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,31]],"date-time":"2024-01-31T20:18:42Z","timestamp":1706732322000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-53025-8_20"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024]]},"ISBN":["9783031530241","9783031530258"],"references-count":32,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-53025-8_20","relation":{},"ISSN":["1865-0929","1865-0937"],"issn-type":[{"value":"1865-0929","type":"print"},{"value":"1865-0937","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024]]},"assertion":[{"value":"1 February 2024","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"OL2A","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Optimization, Learning Algorithms and Applications","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Ponta Delgada","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Portugal","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2023","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"27 September 2023","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"29 September 2023","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"3","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"ol2a2023","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/ol2a.ipb.pt\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Single-blind","order":1,"name":"type","label":"Type","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"EasyChair","order":2,"name":"conference_management_system","label":"Conference Management System","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"162","order":3,"name":"number_of_submissions_sent_for_review","label":"Number of Submissions Sent for Review","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"66","order":4,"name":"number_of_full_papers_accepted","label":"Number of Full Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"0","order":5,"name":"number_of_short_papers_accepted","label":"Number of Short Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"41% - The value is computed by the equation \"Number of Full Papers Accepted \/ Number of Submissions Sent for Review * 100\" and then rounded to a whole number.","order":6,"name":"acceptance_rate_of_full_papers","label":"Acceptance Rate of Full Papers","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":7,"name":"average_number_of_reviews_per_paper","label":"Average Number of Reviews per Paper","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"4","order":8,"name":"average_number_of_papers_per_reviewer","label":"Average Number of Papers per Reviewer","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"No","order":9,"name":"external_reviewers_involved","label":"External Reviewers Involved","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}}]}}