{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T11:12:55Z","timestamp":1781608375485,"version":"3.54.5"},"reference-count":58,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The presented paper introduces principal component analysis application for dimensionality reduction of variables describing speech signal and applicability of obtained results for the disturbed and fluent speech recognition process. A set of fluent speech signals and three speech disturbances\u2014blocks before words starting with plosives, syllable repetitions, and sound-initial prolongations\u2014was transformed using principal component analysis. The result was a model containing four principal components describing analysed utterances. Distances between standardised original variables and elements of the observation matrix in a new system of coordinates were calculated and then applied in the recognition process. As a classifying algorithm, the multilayer perceptron network was used. Achieved results were compared with outcomes from previous experiments where speech samples were parameterised with the Kohonen network application. The classifying network achieved overall accuracy at 76% (from 50% to 91%, depending on the dysfluency type).<\/jats:p>","DOI":"10.3390\/s22010321","type":"journal-article","created":{"date-parts":[[2022,1,9]],"date-time":"2022-01-09T23:08:26Z","timestamp":1641769706000},"page":"321","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1053-0948","authenticated-orcid":false,"given":"Izabela","family":"\u015awietlicka","sequence":"first","affiliation":[{"name":"Department of Biophysics, University of Life Sciences, Akademicka 13, 20-950 Lublin, Poland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wies\u0142awa","family":"Kuniszyk-J\u00f3\u017akowiak","sequence":"additional","affiliation":[{"name":"Faculty of Physical Education and Health in Bia\u0142a Podlaska, J\u00f3zef Pi\u0142sudski University of Physical Education in Warsaw, Akademicka 2, 21-500 Bia\u0142a Podlaska, Poland"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3625-542X","authenticated-orcid":false,"given":"Micha\u0142","family":"\u015awietlicki","sequence":"additional","affiliation":[{"name":"Department of Applied Physics, Faculty of Mechanical Engineering, Lublin University of Technology, Nadbystrzycka 36, 20-618 Lublin, Poland"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,1,1]]},"reference":[{"key":"ref_1","unstructured":"Howell, P., and Sackin, S. (1995, January 8\u201311). Automatic recognition of repetitions and prolongations in stuttered speech. Proceedings of the First World Congress on Fluency Disorders, Munich, Germany."},{"key":"ref_2","first-page":"1","article-title":"The syndrome of stuttering","volume":"Volume 17","author":"Andrews","year":"1964","journal-title":"Clinics in Developmental Medicine"},{"key":"ref_3","unstructured":"Bloodstein, O. (1995). A Handbook on Stuttering, Singular Publishing Group Inc."},{"key":"ref_4","unstructured":"Van-Riper, C. (1982). The Nature of Stuttering, Prentice Hall."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1016\/j.jfludis.2006.07.002","article-title":"Comparing judgments of stuttering made by students, clinicians, and highly experienced judges","volume":"31","author":"Brundage","year":"2006","journal-title":"J. Fluency Disord."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1121\/1.424585","article-title":"Utterance rate and linguistic properties as determinants of lexical dysfluencies in children who stutter","volume":"105","author":"Howell","year":"1999","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1044\/jslhr.4005.1073","article-title":"Development of a Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: I. Psychometric Procedures Appropriate for Selection of Training Material for Lexical Dysfluency Classifiers","volume":"40","author":"Howell","year":"1997","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1044\/jslhr.4005.1085","article-title":"Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers","volume":"40","author":"Howell","year":"1997","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1044\/1092-4388(2008\/063)","article-title":"Identification of Children\u2019s Stuttered and Nonstuttered Speech by Highly Experienced Judges: Binary Judgments and Comparisons with Disfluency-Types Definitions","volume":"51","author":"Bothe","year":"2008","journal-title":"J. Speech, Lang. Hear. Res."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Heeman, P.A., Lunsford, R., McMillin, A., and Yaruss, J.S. (2016, January 8\u201312). Using clinician annotations to improve automatic speech recognition of stuttered speech. Proceedings of the INTERSPEECH 2016, San Francisco, CA, USA.","DOI":"10.21437\/Interspeech.2016-1388"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.bspc.2016.01.005","article-title":"Speech rate estimation in disordered speech based on spectral landmark detection","volume":"27","author":"Huici","year":"2016","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Manjula, G., Shivakumar, M., and Geetha, Y.V. (2019, January 19\u201321). Adaptive optimization based neural network for classification of stuttered speech. Proceedings of the 3rd International Conference on Cryptography, Security and Privacy, Kuala Lumpur Malaysia.","DOI":"10.1145\/3309074.3309113"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Narasimhan, S., and Rao, R.R. (2019, January 11\u201312). Neural Network based speech assistance tool to enhance the fluency of adults who stutter. Proceedings of the 2019 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER), Manipal, India.","DOI":"10.1109\/DISCOVER47552.2019.9008034"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"101308","DOI":"10.1016\/j.csl.2021.101308","article-title":"Generative adversarial networks for speech processing: A review","volume":"72","author":"Wali","year":"2021","journal-title":"Comput. Speech Lang."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.inffus.2021.10.012","article-title":"Deep learning for depression recognition with audiovisual cues: A review","volume":"80","author":"He","year":"2021","journal-title":"Inf. Fusion"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2022","DOI":"10.1016\/j.engappai.2013.06.004","article-title":"Self-Adjustable Neural Network for speech recognition","volume":"26","author":"Ting","year":"2013","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Lei, X., Lin, H., and Heigold, G. (2013, January 26\u201331). Deep neural networks with auxiliary Gaussian mixture models for real-time speech recognition. Proceedings of the ICASSP 2013\u20142013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6639148"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"836","DOI":"10.1109\/TASLP.2014.2308398","article-title":"Robust Speaker Identification in Noisy and Reverberant Conditions","volume":"22","author":"Zhao","year":"2014","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Sarma, M., and Sarma, K.K. (2013, January 4\u20139). Speaker identification model for Assamese language using a neural framework. Proceedings of the International Joint Conference on Neural Networks, Dallas, TX, USA.","DOI":"10.1109\/IJCNN.2013.6707000"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1007\/s00500-006-0099-x","article-title":"Text-dependent speaker recognition using wavelets and neural networks","volume":"11","author":"Lim","year":"2007","journal-title":"Soft Comput."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"101317","DOI":"10.1016\/j.csl.2021.101317","article-title":"A review of speaker diarization: Recent advances with deep learning","volume":"72","author":"Park","year":"2022","journal-title":"Comput. Speech Lang."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1109\/LSP.2014.2303136","article-title":"Efficient One-Pass Decoding with NNLM for Speech Recognition","volume":"21","author":"Shi","year":"2014","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/j.neucom.2012.03.041","article-title":"Learning by abstraction: Hierarchical classification model using evidential theoretic approach and Bayesian ensemble model","volume":"130","author":"Naeini","year":"2014","journal-title":"Neurocomputing"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"6069","DOI":"10.1016\/j.eswa.2008.06.126","article-title":"Classification of audio signals using SVM and RBFNN","volume":"36","author":"Dhanalakshmi","year":"2009","journal-title":"Expert Syst. Appl."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1016\/j.advengsoft.2005.07.005","article-title":"A classification technique based on radial basis function neural networks","volume":"37","author":"Sarimveis","year":"2006","journal-title":"Adv. Eng. Softw."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1007\/s10772-012-9136-6","article-title":"Time\u2013domain non-linear feature parameter for consonant classification","volume":"15","author":"Thasleema","year":"2012","journal-title":"Int. J. Speech Technol."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1016\/j.csl.2013.02.003","article-title":"Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis","volume":"27","author":"Reddy","year":"2013","journal-title":"Comput. Speech Lang."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1007\/s10772-012-9169-x","article-title":"Phoneme recognition using zerocrossing interval distribution of speech patterns and ANN","volume":"16","author":"Kumar","year":"2013","journal-title":"Int. J. Speech Technol."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Jaitly, N., Nguyen, P., Senior, A., and Vanhoucke, V. (2012, January 9\u201313). Application of pretrained deep neural networks to large vocabulary speech recognition. Proceedings of the 13th Annual Conference of the International Speech Communication Association 2012 (INTERSPEECH 2012), Portland, OR, USA.","DOI":"10.21437\/Interspeech.2012-10"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"3650","DOI":"10.1007\/s00034-016-0476-3","article-title":"Parameterization of Excitation Signal for Improving the Quality of HMM-Based Speech Synthesis System","volume":"36","author":"Narendra","year":"2017","journal-title":"Circuits Syst. Signal Process."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1016\/j.csl.2012.05.003","article-title":"Hierarchical ANN system for stuttering identification","volume":"27","year":"2013","journal-title":"Comput. Speech Lang."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1007\/s00521-009-0261-3","article-title":"Speech nonfluency detection using Kohonen networks","volume":"18","author":"Szczurowska","year":"2009","journal-title":"Neural Comput. Appl."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1016\/S1350-4533(02)00064-4","article-title":"Pathological voice quality assessment using artificial neural networks","volume":"24","author":"Ritchings","year":"2002","journal-title":"Med. Eng. Phys."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/j.bspc.2009.01.007","article-title":"Automatic detection of voice impairments from text-dependent running speech","volume":"4","author":"Fraile","year":"2009","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Khara, S., Singh, S., and Vir, D. (2018, January 20\u201321). A comparative study of the techniques for feature extraction and classification in stuttering. Proceedings of the 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Ranganathan Engineering College, Coimbatore, India.","DOI":"10.1109\/ICICCT.2018.8473099"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"2986","DOI":"10.1109\/TASLP.2021.3110146","article-title":"FluentNet: End-to-End Detection of Stuttered Speech Disfluencies with Deep Learning","volume":"29","author":"Kourkounakis","year":"2021","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_37","unstructured":"Gupta, D., Bansal, P., and Choudhary, K. (2015, January 2\u20135). The state of the art of feature extraction techniques in speech recognition. Proceedings of the 50th Annual Convention of Computer Society of India, New Delhi, India."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1007\/s10844-019-00546-z","article-title":"Effect of speech segment samples selection in stutter block detection and remediation","volume":"53","author":"Arbajian","year":"2019","journal-title":"J. Intell. Inf. Syst."},{"key":"ref_39","first-page":"387","article-title":"Gaussian mixture model based classification of stuttering dysfluencies","volume":"25","author":"Mahesha","year":"2015","journal-title":"J. Intell. Syst."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.bspc.2015.08.006","article-title":"Automatic classification of speech dysfluencies in continuous speech based on similarity measures and morphological image processing tools","volume":"23","author":"Esmaili","year":"2016","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.specom.2019.04.003","article-title":"Dysarthric speech classification from coded telephone speech using glottal features","volume":"110","author":"Narendra","year":"2019","journal-title":"Speech Commun."},{"key":"ref_42","unstructured":"Momo, N., and Uddin, J. (2018, January 19\u201322). Speech recognition using feed forward neural network and principle component analysis. Proceedings of the 4th International Symposium on Signal Processing and Intelligent Recognition Systems, Bangalore, India."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Raitio, T., Suni, A., Vainio, M., and Alku, P. (2013, January 26\u201331). Comparing glottal-flow-excited statistical parametric speech synthesis methods. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada.","DOI":"10.1109\/ICASSP.2013.6639188"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Abolhassani, A.H., Selouani, S.-A., and O\u2019Shaughnessy, D. (2007, January 9\u201313). Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition. Proceedings of the 2007 IEEE Workshop on Automatic Speech Recognition & Understanding, Kyoto, Japan.","DOI":"10.1109\/ASRU.2007.4430077"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Chien, J.-T., and Ting, C.-W. (2004, January 4\u20138). Speaker identification using probabilistic PCA model selection. Proceedings of the 8th International Conference on Spoken Language Processing INTERSPEECH 2004, Jeju Island, Korea.","DOI":"10.21437\/Interspeech.2004-515"},{"key":"ref_46","unstructured":"Jolliffe, I.T. (2002). Principal Component Analysis, Springer. [2nd ed.]."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Jhawar, G., Nagraj, P., and Mahalakshmi, P. (2016, January 6\u20138). Speech disorder recognition using MFCC. Proceedings of the 2016 International Conference on Communication and Signal Processing (ICCSP), Melmaruvathur, India.","DOI":"10.1109\/ICCSP.2016.7754132"},{"key":"ref_48","first-page":"1","article-title":"Deep Learning Bidirectional LSTM based Detection of Prolongation and Repetition in Stuttered Speech using Weighted MFCC","volume":"11","author":"Gupta","year":"2020","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Bishop, C.M. (1995). Neural Networks for Pattern Recognition, Oxford University Press.","DOI":"10.1093\/oso\/9780198538493.001.0001"},{"key":"ref_50","unstructured":"Kobosko, J. (1999). Stuttering, PWN. (In Polish)."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"951","DOI":"10.1044\/jslhr.4304.951","article-title":"Individual and Consensus Judgments of Disfluency Types in the Speech of Persons Who Stutter","volume":"43","author":"Cordes","year":"2000","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Winursito, A., Hidayat, R., and Bejo, A. (2018, January 6\u20137). Improvement of MFCC feature extraction accuracy using PCA in Indonesian speech recognition. Proceedings of the 2018 International Conference on Information and Communications Technology (ICOIACT), Yogyakarta, Indonesia.","DOI":"10.1109\/ICOIACT.2018.8350748"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Rasheed, J., Hameed, A.A., Ajlouni, N., Jamil, A., \u00d6zyava\u015f, A., and Orman, Z. (2020, January 26\u201327). Application of Adaptive Back-Propagation Neural Networks for Parkinson\u2019s Disease Prediction. Proceedings of the 2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI), Sakheer, Bahrain.","DOI":"10.1109\/ICDABI51230.2020.9325709"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.eij.2019.10.002","article-title":"Employing PCA and t-statistical approach for feature extraction and classification of emotion from multichannel EEG signal","volume":"21","author":"Rahman","year":"2020","journal-title":"Egypt. Inform. J."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"101238","DOI":"10.1016\/j.aei.2020.101238","article-title":"Ambient acoustic event assistive framework for identification, detection, and recognition of unknown acoustic events of a residence","volume":"47","author":"Pandya","year":"2021","journal-title":"Adv. Eng. Inform."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Ghayvat, H., Pandya, S., and Patel, A. (2020, January 28\u201329). Deep learning model for acoustics signal based preventive healthcare monitoring and activity of daily living. Proceedings of the 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India.","DOI":"10.1109\/IDEA49133.2020.9170666"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Kourkounakis, T., Hajavi, A., and Etemad, A. (2020, January 4\u20138). Detecting multiple speech disfluencies using a deep residual network with bidirectional Long Short-Term Memory. Proceedings of the ICASSP 2020\u20142020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain.","DOI":"10.1109\/ICASSP40776.2020.9053893"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"1152","DOI":"10.35940\/ijitee.J9077.0981119","article-title":"Identifying stuttering using deep learning","volume":"8","author":"Tibrewal","year":"2019","journal-title":"Int. J. Innov. Technol. Explor. Eng."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/1\/321\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T14:12:27Z","timestamp":1760364747000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/1\/321"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,1]]},"references-count":58,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,1]]}},"alternative-id":["s22010321"],"URL":"https:\/\/doi.org\/10.3390\/s22010321","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,1]]}}}