{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:27:33Z","timestamp":1750307253959,"version":"3.41.0"},"reference-count":27,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2011,6,1]],"date-time":"2011-06-01T00:00:00Z","timestamp":1306886400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2011,6]]},"abstract":"<jats:p>This article presents a novel approach to speaker-adaptive recognition of speech from articulation-disordered speakers without a large amount of adaptation data. An unsupervised, incremental adaptation method is adopted for personalized model adaptation based on the recognized syllables with high recognition confidence from an automatic speech recognition (ASR) system. For articulation pattern discovery, the manually transcribed syllables and the corresponding recognized syllables are associated with each other using articulatory features. The Apriori algorithm is applied to discover the articulation patterns in the corpus, which are then used to construct a personalized pronunciation dictionary to improve the recognition accuracy of the ASR. The experimental results indicate that the proposed adaptation method achieves a syllable error rate reduction of 6.1%, outperforming the conventional adaptation methods that have a syllable error rate reduction of 3.8%. 
In addition, an average syllable error rate reduction of 5.04% is obtained for the ASR using the expanded pronunciation dictionary.<\/jats:p>","DOI":"10.1145\/1967293.1967294","type":"journal-article","created":{"date-parts":[[2011,6,28]],"date-time":"2011-06-28T17:31:10Z","timestamp":1309282270000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Articulation-Disordered Speech Recognition Using Speaker-Adaptive Acoustic Models and Personalized Articulation Patterns"],"prefix":"10.1145","volume":"10","author":[{"given":"Chung-Hsien","family":"Wu","sequence":"first","affiliation":[{"name":"National Cheng Kung University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hung-Yu","family":"Su","sequence":"additional","affiliation":[{"name":"National Cheng Kung University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Han-Ping","family":"Shen","sequence":"additional","affiliation":[{"name":"National Cheng Kung University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2011,6]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/170036.170072"},{"volume-title":"Proceedings of International Conference on Speech Communication and Technology (EUROSPEECH\u201995)","author":"Aubert X.","key":"e_1_2_1_2_1"},{"key":"e_1_2_1_3_1","unstructured":"Bernthal J. E. and Bankson W. B. 2004. Articulation and Phonological Disorders. 
Allyn and Bacon."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1080\/02699200050024001"},{"volume-title":"Proceedings of the European Conference on Speech Communication and Technology (INTERSPEECH\u201908)","author":"Carmichael J.","key":"e_1_2_1_5_1"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1080\/07434619112331275663"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(99)00034-5"},{"key":"e_1_2_1_8_1","first-page":"309","article-title":"Dysarthric speech, a comparison of computerized speech recognition and listener intelligibility","volume":"34","author":"Doyle P.","year":"1997","journal-title":"J. Rehab. Res. Dev."},{"volume-title":"Differential Diagnosis, and Management. Mosby: St.","author":"Duffy J. R.","key":"e_1_2_1_9_1"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1177\/026921558800200401"},{"volume-title":"Proceedings of International Conference on Speech Communication and Technology (EUROSPEECH\u201995)","author":"Green P.","key":"e_1_2_1_11_1"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.medengphy.2006.06.009"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2007.907429"},{"volume-title":"The MIT Encyclopedia of Communication Disorders","author":"Kent R. D.","key":"e_1_2_1_14_1"},{"key":"e_1_2_1_15_1","first-page":"1490","article-title":"Maximum likelihood linear regression for speaker adaptation of continuous density HMM","volume":"9","author":"Leggetter C.","year":"1995","journal-title":"Comput. Speech Lang."},{"key":"e_1_2_1_16_1","first-page":"33","article-title":"Evaluation of speech recognition by a person with articulation disorder in operation for home information applications","volume":"107","author":"Matsumasa H.","year":"2007","journal-title":"IEICE Welfare Inf. 
Technol."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1155\/2009\/629030"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1155\/2009\/308340"},{"edition":"3","volume-title":"Dictionary of Communication Disorders","author":"Morris D.","key":"e_1_2_1_19_1"},{"key":"e_1_2_1_20_1","first-page":"864","article-title":"Speech recognition for disabilities people","volume":"1","author":"Mosbah B. B.","year":"2006","journal-title":"Inf. Comm. Technol."},{"volume-title":"Proceedings of the Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition (ESCA\u201998)","author":"Nock H. J.","key":"e_1_2_1_21_1"},{"volume-title":"Proceedings of the 4th International Conference on Biomedical Engineering (BioMed\u201908)","author":"Rodriguez W. R.","key":"e_1_2_1_22_1"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1155\/2009\/540409"},{"volume-title":"CASALA: Computer aided speech and language analysis. Austral. Comm. Quart. 27--28.","year":"1997","author":"Serry T.","key":"e_1_2_1_24_1"},{"volume-title":"Proceedings of the 8th Annual Conference on Rehabilitation Technology (CT\u201985)","author":"Stevens G.","key":"e_1_2_1_25_1"},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201997)","author":"Torre D.","key":"e_1_2_1_26_1"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2006.876769"}],"container-title":["ACM Transactions on Asian Language Information 
Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1967293.1967294","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1967293.1967294","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T10:52:21Z","timestamp":1750243941000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1967293.1967294"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6]]},"references-count":27,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2011,6]]}},"alternative-id":["10.1145\/1967293.1967294"],"URL":"https:\/\/doi.org\/10.1145\/1967293.1967294","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"type":"print","value":"1530-0226"},{"type":"electronic","value":"1558-3430"}],"subject":[],"published":{"date-parts":[[2011,6]]},"assertion":[{"value":"2010-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-11-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}