{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:32:49Z","timestamp":1777703569041,"version":"3.51.4"},"reference-count":31,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2018,5,18]],"date-time":"2018-05-18T00:00:00Z","timestamp":1526601600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2018,5,24]]},"abstract":"<jats:p>This paper presents a system for improving the quality of pronunciation error detection and correction for Qur\u2019an recitation by Non-Arabic speakers. Most of the classical speech recognition systems are built using the Hidden Markov Model (HMM) with a Mixture of Gaussian Model (GMM). This paper attempts to enhance the GMM-HMM model\u2019s performance by using Deep Neural Networks (DNNs). The major part of the work done in this paper is involved in the collection and processing of speakers\u2019 data, and building and evaluation of baseline GMM system and the proposed DNN acoustic models for the Qur\u2019an recitation framework. With the aim of solving some pronunciation problems and enhancing the overall performance of such a speech recognition system, we replace the mixture of Gaussians with a DNN. The DNN-HMM model outperforms the GMM-HMM model by 1.02% based on HTK\u2019s word accuracy equation. By calculating the insertion results for both models, DNN-HMM showed progress by 2.59%. In addition, in substitution results, DNN-HMM shows progress with the confusion phonemes DAA by 15.09% and DHA by 17.28%. All experiments and results are presented and discussed in detail.<\/jats:p>","DOI":"10.3233\/jifs-169508","type":"journal-article","created":{"date-parts":[[2018,5,22]],"date-time":"2018-05-22T10:54:42Z","timestamp":1526986482000},"page":"3257-3271","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":7,"title":["Computer Aided Qur\u2019an Pronunciation using DNN"],"prefix":"10.1177","volume":"34","author":[{"given":"Mubarak","family":"Al-Marri","sequence":"first","affiliation":[{"name":"Computer Science Department, Kuwait University, Kuwait"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hazem","family":"Raafat","sequence":"additional","affiliation":[{"name":"Computer Science Department, Kuwait University, Kuwait"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mustafa","family":"Abdallah","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, Cairo University, Egypt"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sherif","family":"Abdou","sequence":"additional","affiliation":[{"name":"Faculty of Computers and Information, Cairo University, Egypt"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohsen","family":"Rashwan","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, Cairo University, Egypt"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2018,5,18]]},"reference":[{"key":"e_1_3_3_2_2","first-page":"249","article-title":"HMM modeling for speaker independent voice dialing in car environment","volume":"1","author":"Fissore L.","year":"1992","unstructured":"FissoreL., LafaceP. and RuscittiP., HMM modeling for speaker independent voice dialing in car environment, 1992 IEEE International Conference1 (1992), 249\u2013252.","journal-title":"1992 IEEE International Conference"},{"key":"e_1_3_3_3_2","first-page":"4660","article-title":"Improved models for Mandarin speech-to-text transcription","author":"Lamel L.","year":"2011","unstructured":"LamelL., GauvainJ., LeV., OparinI. and MengS., Improved models for Mandarin speech-to-text transcription, IEEE International Conference (2011), 4660\u20134663.","journal-title":"IEEE International Conference"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/9072.001.0001"},{"key":"e_1_3_3_5_2","author":"Ahmed M.","year":"2014","unstructured":"AhmedM., The Holy Quran \u2013 A Linguistic Miracle, 19 November [Online]. Available:. [Accessed 20 January], (2014) http:\/\/cisweb.lk\/the-miracle-of-the-quran-by-khalid-baig\/.","journal-title":"The Holy Quran \u2013 A Linguistic Miracle, 19 November [Online]. Available:. [Accessed 20 January]"},{"key":"e_1_3_3_6_2","volume-title":"Tajweed rules of the Qur\u2019an - Part One","author":"Czerepinski K.C.","year":"2003","unstructured":"CzerepinskiK.C., Tajweed rules of the Qur\u2019an - Part One, Syria - Damascus:Dar Al-Khair Islamic Books Publisher;, 2003."},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2006-287"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/5.18626"},{"key":"e_1_3_3_10_2","first-page":"356","article-title":"Porting concepts from DNNs back to GMMs","author":"Demuynck K.","year":"2013","unstructured":"DemuynckK. and TriefenbachF., Porting concepts from DNNs back to GMMs, IEEE Workshop (2013), 356\u2013361.","journal-title":"IEEE Workshop"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1985.1164727"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-04208-9_41"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2013.58"},{"key":"e_1_3_3_14_2","author":"Mohamed A.-R.","year":"2009","unstructured":"MohamedA.-R., DahlG. and HintonG., Deep Belief Networks for phone recognition, NIPS Workshop, Whistler, BC, Canada;, 2009.","journal-title":"Deep Belief Networks for phone recognition, NIPS Workshop, Whistler, BC, Canada;"},{"key":"e_1_3_3_15_2","article-title":"Comparison of syllable-based and phoneme-based DNN-HMM in Japanese speech recognition, in Indonesia;","author":"Seki H.","year":"2014","unstructured":"SekiH., YamamotoK. and NakagawaS., Comparison of syllable-based and phoneme-based DNN-HMM in Japanese speech recognition, in Indonesia;, (ICAICTA), 2014.","journal-title":"(ICAICTA)"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1013670312989"},{"key":"e_1_3_3_17_2","first-page":"858","article-title":"Chinese-English phone set construction for code-switching ASR using acoustic and DNN-extracted articulatory features","author":"Wu C.-H.","year":"2014","unstructured":"WuC.-H., ShenH.-P. and YangY.-T., Chinese-English phone set construction for code-switching ASR using acoustic and DNN-extracted articulatory features, IEEE Press Piscataway (2014), 858\u2013862.","journal-title":"IEEE Press Piscataway"},{"key":"e_1_3_3_18_2","article-title":"Hadj and M. Alkanhal, A manual system to segment and transcribe arabic speech, in Dubai;","author":"Alghamdi M.","year":"2007","unstructured":"AlghamdiM. and ElY.O.M., Hadj and M. Alkanhal, A manual system to segment and transcribe arabic speech, in Dubai;, Signal Processing and Communications2007.","journal-title":"Signal Processing and Communications"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.5120\/1462-1976"},{"key":"e_1_3_3_20_2","article-title":"Falou and B. Monla, Analysis and implementation of a \u201cQuranic\u201d verses delimitation system in audio files using speech recognition techniques, in Damascus, Syria;","author":"Tabbal H.","year":"2006","unstructured":"TabbalH. and ElW., Falou and B. Monla, Analysis and implementation of a \u201cQuranic\u201d verses delimitation system in audio files using speech recognition techniques, in Damascus, Syria;, Information and Communication Technologies2006.","journal-title":"Information and Communication Technologies"},{"key":"e_1_3_3_21_2","unstructured":"WalkerW. LamereP. KwokP. RajB. SinghR. GouveaE. WolfP. and WoelfelJ. Sphinx-4: A flexible open source framework for speech recognition Sun Microsystems lifornia; Menlo Park Ca 2004."},{"key":"e_1_3_3_22_2","article-title":"Voice Content Matching System for Quran Readers, in mexico;","author":"Muhammad W.M.","year":"2010","unstructured":"MuhammadW.M., MuhammadR., MuhammadA. and A.M.M.-E., Voice Content Matching System for Quran Readers, in mexico;, Ninth Mexican International Conference on Artificial Intelligence2010.","journal-title":"Ninth Mexican International Conference on Artificial Intelligence"},{"key":"e_1_3_3_23_2","unstructured":"ChelbaC. BikelD. ShugrinaM. NguyenP. and KumarS. Large Scale Language Modeling in Automatic Speech Recognition Reseach at Google; (2012) pp1\u20137."},{"key":"e_1_3_3_24_2","author":"Ar-Ra\u2019ee S.N.M.","year":"2009","unstructured":"Ar-Ra\u2019eeS.N.M., Noorani Qa\u2019idah, India: Darul Salaam; 2nd edition -01-01);, (1656), 2009.","journal-title":"Noorani Qa\u2019idah"},{"issue":"3","key":"e_1_3_3_25_2","first-page":"1","article-title":"Improving holy qur\u2019an recitation system using hybrid deep neural network-hidden markov model approach","volume":"4","author":"Abdallah M.","year":"2015","unstructured":"AbdallahM., Al-MarriM., AbdouS., RaafatH., RashwanM. and El-GamalM.A., Improving holy qur\u2019an recitation system using hybrid deep neural network-hidden markov model approach, International Journal on Islamic Applications in Computer Science And Technology4(3) (2015), 1\u20138.","journal-title":"International Journal on Islamic Applications in Computer Science And Technology"},{"key":"e_1_3_3_26_2","unstructured":"TevahR.T. GMM-HMM [Online]. Available: [Accessed 01 October] (2015). http:\/\/www.gta.ufrj.br\/grad\/09_1\/versao-final\/impvocal\/hmms_arquivos\/image002.jpg."},{"key":"e_1_3_3_27_2","article-title":"Improving the filter bank of a classic speech feature extraction algorithm, in Bangkok, Thailand;","author":"Skowronski M.","year":"2003","unstructured":"SkowronskiM. and HarrisJ., Improving the filter bank of a classic speech feature extraction algorithm, in Bangkok, Thailand;, ISCAS \u2019032003.","journal-title":"ISCAS \u201903"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(00)00048-0"},{"key":"e_1_3_3_29_2","article-title":"Baum-Welch training for segment-based speech recognition, in","author":"Shu H.","year":"2003","unstructured":"ShuH., HetheringtonL.L. and GlassJ., Baum-Welch training for segment-based speech recognition, in, Automatic Speech Recognition and Understanding, 2003. ASRU \u201903. 2003 IEEE Workshop on2003.","journal-title":"Automatic Speech Recognition and Understanding, 2003. ASRU \u201903. 2003 IEEE Workshop on"},{"key":"e_1_3_3_30_2","article-title":"Understanding how Deep Belief Networks perform acoustic modelling, in Kyoto;","author":"Mohamed A.-R.","year":"2012","unstructured":"MohamedA.-R., HintonG. and PennG., Understanding how Deep Belief Networks perform acoustic modelling, in Kyoto;, Speech and Signal Processing (ICASSP)2012.","journal-title":"Speech and Signal Processing (ICASSP)"},{"key":"e_1_3_3_31_2","unstructured":"Letters and Sounds Pronunciation [Online]. Available: http:\/\/41.media.tumblr.com\/tumblr_m3sxwzrke71qirjfeo1_500.jpg [Accessed 24 December] (2015)."},{"key":"e_1_3_3_32_2","first-page":"851","article-title":"Review On Error Detection and Error Correction Techniques in NLP","author":"Kaur B.","year":"2014","unstructured":"KaurB., Review On Error Detection and Error Correction Techniques in NLP, International Journal of Advanced Research in Computer Science and Software Engineering (2014), 851\u2013853.","journal-title":"International Journal of Advanced Research in Computer Science and Software Engineering"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169508","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-169508","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169508","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:39:00Z","timestamp":1777455540000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-169508"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,5,18]]},"references-count":31,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2018,5,24]]}},"alternative-id":["10.3233\/JIFS-169508"],"URL":"https:\/\/doi.org\/10.3233\/jifs-169508","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,5,18]]}}}