{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,5]],"date-time":"2022-04-05T21:25:43Z","timestamp":1649193943882},"reference-count":48,"publisher":"World Scientific Pub Co Pte Lt","issue":"02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Wavelets Multiresolut Inf. Process."],"published-print":{"date-parts":[[2021,3]]},"abstract":"<jats:p> Speech recognition is a rapidly emerging research area as the speech signal contains linguistic information and speaker information that can be used in applications including surveillance, authentication, and forensic field. The performance of speech recognition systems degrades expeditiously nowadays due to channel degradations, mismatches, and noise. To provide better performance of speech recognition, the Taylor-Deep Belief Network (Taylor-DBN) classifier is proposed, which is the modification of the Gradient Descent (GD) algorithm with Taylor series in the existing DBN classifier. Initially, the noise present in the speech signal is removed through the speech signal enhancement. The features, such as Holoentropy with the eXtended Linear Prediction using autocorrelation Snapshot (HXLPS), spectral kurtosis, and spectral skewness, are extracted from the enhanced speech signal, which is fed to the Taylor-DBN classifier that identifies the speech of the impaired persons. The experimentation is done using the TensorFlow speech recognition database, the real database, and the ESC-50 dataset. The accuracy, False Acceptance Rate (FAR), False Rejection Rate (FRR), and Mean Square Error (MSE) of the Taylor-DBN for TensorFlow speech recognition database are 96.95%, 3.04%, 3.04%, and 0.045, respectively, and for real database, the accuracy, FAR, FRR, and MSE are 96.67%, 3.32%, 3.32%, and 0.0499, respectively. Similarly, for the ESC-50 dataset, the accuracy, FAR, FRR, and MSE are 96.81%, 3.18%, 3.18%, and 0.047, respectively. The results imply that the Taylor-DBN provides better performance as compared to the existing conventional methods. <\/jats:p>","DOI":"10.1142\/s021969132050071x","type":"journal-article","created":{"date-parts":[[2020,10,17]],"date-time":"2020-10-17T15:50:46Z","timestamp":1602949846000},"page":"2050071","source":"Crossref","is-referenced-by-count":1,"title":["Taylor-DBN: A new framework for speech recognition systems"],"prefix":"10.1142","volume":"19","author":[{"given":"Arul Valiyavalappil","family":"Haridas","sequence":"first","affiliation":[{"name":"Department of Electronics and Communication Engineering, Sathyabama Institute of Science & Technology, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ramalatha","family":"Marimuthu","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communication Engineering, Kumaraguru College of Technology, Coimbatore 641049, Tamil Nadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"V. G.","family":"Sivakumar","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communication Engineering, Vidya Jyothi Institute of Technology, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Basabi","family":"Chakraborty","sequence":"additional","affiliation":[{"name":"Faculty of Software and Information Science, Iwate Prefectural University, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2020,12,11]]},"reference":[{"key":"S021969132050071XBIB001","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2339736"},{"key":"S021969132050071XBIB002","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-011-9108-2"},{"key":"S021969132050071XBIB003","doi-asserted-by":"publisher","DOI":"10.1109\/ICEEICT.2015.7307401"},{"issue":"1","key":"S021969132050071XBIB004","first-page":"41","volume":"89","author":"Mangai S. Alamelu","year":"2014","journal-title":"Int. J. Comput. Appl."},{"key":"S021969132050071XBIB005","volume-title":"Proc. IEEE SOUTHEASTCON \u201997: \u2018Engineering the New Century\u2019","author":"Alsaka Y. A.","year":"2002"},{"issue":"20","key":"S021969132050071XBIB006","first-page":"30","volume":"73","author":"Ananthi S.","year":"2013","journal-title":"Int. J. Comput. Appl."},{"key":"S021969132050071XBIB007","doi-asserted-by":"publisher","DOI":"10.1109\/PlatCon.2017.7883728"},{"key":"S021969132050071XBIB008","volume-title":"Proc. 3rd Int. Conf. Learning Representations","author":"Bahdanau D.","year":"2015"},{"key":"S021969132050071XBIB010","doi-asserted-by":"publisher","DOI":"10.1016\/j.biosystemseng.2006.06.012"},{"key":"S021969132050071XBIB011","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404837"},{"key":"S021969132050071XBIB012","doi-asserted-by":"publisher","DOI":"10.1002\/ima.22087"},{"key":"S021969132050071XBIB013","doi-asserted-by":"publisher","DOI":"10.1016\/j.trc.2017.06.001"},{"key":"S021969132050071XBIB014","doi-asserted-by":"publisher","DOI":"10.1016\/S0885-2308(03)00010-X"},{"key":"S021969132050071XBIB015","first-page":"68","author":"Bombatkar A.","year":"2014","journal-title":"Int. J. Eng. Res. Appl."},{"key":"S021969132050071XBIB016","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2008.4518755"},{"key":"S021969132050071XBIB017","doi-asserted-by":"publisher","DOI":"10.1109\/89.536929"},{"key":"S021969132050071XBIB018","volume-title":"Deep Learning","author":"Goodfellow I.","year":"2016"},{"key":"S021969132050071XBIB019","first-page":"2672","volume-title":"Proc. 27th Int. Conf. Neural Information Processing Systems","volume":"2","author":"Goodfellow I. J.","year":"2014"},{"key":"S021969132050071XBIB020","doi-asserted-by":"publisher","DOI":"10.3390\/e20090714"},{"key":"S021969132050071XBIB021","doi-asserted-by":"publisher","DOI":"10.3390\/e21030304"},{"key":"S021969132050071XBIB022","doi-asserted-by":"publisher","DOI":"10.1016\/j.aml.2011.02.018"},{"key":"S021969132050071XBIB023","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2017.07.004"},{"key":"S021969132050071XBIB024","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-70602-1"},{"key":"S021969132050071XBIB025","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404843"},{"key":"S021969132050071XBIB026","doi-asserted-by":"publisher","DOI":"10.1162\/0899766054322964"},{"key":"S021969132050071XBIB028","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2018.01.007"},{"key":"S021969132050071XBIB029","volume-title":"Proc. 3rd Int. Conf. Learning Representations","author":"Kingma D.","year":"2015"},{"key":"S021969132050071XBIB030","doi-asserted-by":"publisher","DOI":"10.1109\/WASPAA.2013.6701894"},{"key":"S021969132050071XBIB031","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2304637"},{"key":"S021969132050071XBIB032","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-016-9368-y"},{"key":"S021969132050071XBIB033","doi-asserted-by":"publisher","DOI":"10.1109\/ICSPCS.2010.5709761"},{"key":"S021969132050071XBIB035","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2007.4409052"},{"key":"S021969132050071XBIB036","author":"Rajpurohit V. S.","year":"2020","journal-title":"Int. J. Knowl.-Based Intell. Eng. Syst."},{"key":"S021969132050071XBIB037","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/bxz103"},{"key":"S021969132050071XBIB038","doi-asserted-by":"publisher","DOI":"10.1109\/TETCI.2017.2762739"},{"issue":"2","key":"S021969132050071XBIB039","first-page":"565","volume":"20","author":"Sanchis A.","year":"2012","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"S021969132050071XBIB040","doi-asserted-by":"publisher","DOI":"10.1038\/nature16961"},{"issue":"56","key":"S021969132050071XBIB041","first-page":"1929","volume":"15","author":"Srivastava N.","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"S021969132050071XBIB042","doi-asserted-by":"publisher","DOI":"10.1016\/j.aej.2016.12.009"},{"issue":"1","key":"S021969132050071XBIB043","first-page":"33","volume":"1","author":"Thomas R.","year":"2018","journal-title":"Multimed. Res."},{"key":"S021969132050071XBIB044","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2008.4518658"},{"key":"S021969132050071XBIB046","volume-title":"Proc. 6th IEEE Int. Workshop Nonlinear Signal and Image Processing (NSIP)","author":"Vrabie V.","year":"2003"},{"key":"S021969132050071XBIB047","volume-title":"Proc. Advances in Quantitative Laryngology, Voice and Speech Research","author":"Wielgat R.","year":"2006"},{"issue":"2","key":"S021969132050071XBIB048","first-page":"61","volume":"1","author":"Yadav K. S.","year":"2013","journal-title":"Int. J. Sci. Eng."},{"key":"S021969132050071XBIB049","volume-title":"Proc. Int. Joint Conf. Neural Networks","author":"Yang S.","year":"2001"},{"key":"S021969132050071XBIB050","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2017.08.007"},{"key":"S021969132050071XBIB051","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4471-5779-3"},{"key":"S021969132050071XBIB052","doi-asserted-by":"publisher","DOI":"10.1109\/EIConRus.2018.8317401"}],"container-title":["International Journal of Wavelets, Multiresolution and Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S021969132050071X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,15]],"date-time":"2021-03-15T08:28:43Z","timestamp":1615796923000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S021969132050071X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,11]]},"references-count":48,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2021,3]]}},"alternative-id":["10.1142\/S021969132050071X"],"URL":"https:\/\/doi.org\/10.1142\/s021969132050071x","relation":{},"ISSN":["0219-6913","1793-690X"],"issn-type":[{"value":"0219-6913","type":"print"},{"value":"1793-690X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12,11]]}}}