{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:30:00Z","timestamp":1777703400474,"version":"3.51.4"},"reference-count":42,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2019,6,18]],"date-time":"2019-06-18T00:00:00Z","timestamp":1560816000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,7,9]]},"abstract":"<jats:p>The rapid development in technology has led to a colossal surge in the use of biometric authentication system. Speaker identification biometric is one of the fields that is under progress and demands more and more precision. The objective of this research is to explore the issue of identifying a speaker from voice regardless of the content. Perceptual Wavelet Packet Transform (PWPT) and Artificial Neural Networks (ANN) approach are discussed in this paper for speaker identification. Perceptual Wavelet Packet Cepstral Coefficients (PWPCC) are used for transforming speech into spectral feature vectors, and the most germane aspects of the speech signal are selected from the energy and variance distribution characteristics. These selected attributes are presented to the Cascaded Feedforward Neural Network (CFNN) and trained with Levenberg-Marquardt Back Propagation (LMBP) algorithm for further classification. The performance of the network is determined by evaluating the Speaker Identification Rate (SIR). For comparison, five different gradient descent training algorithms are considered and it is found that the LMBP produces better performance. The proposed model is evaluated for clean as well as noisy speech at various SNR levels and is found to be competitive, and the experimental results show significant improvement in speaker identification rate compared with other classical methods.<\/jats:p>","DOI":"10.3233\/jifs-182599","type":"journal-article","created":{"date-parts":[[2019,6,21]],"date-time":"2019-06-21T11:49:46Z","timestamp":1561117786000},"page":"1141-1153","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":4,"title":["Cascaded Feedforward Neural Networks for speaker identification using Perceptual Wavelet based Cepstral Coefficients"],"prefix":"10.1177","volume":"37","author":[{"given":"G.","family":"Renisha","sequence":"first","affiliation":[{"name":"Department of Electronics and Communication Engineering, Government College of Engineering, Tirunelveli, Tamilnadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"T.","family":"Jayasree","sequence":"additional","affiliation":[{"name":"Department of Electronics and Communication Engineering, Government College of Engineering, Tirunelveli, Tamilnadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2019,6,18]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-015-1005-5"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/35.46670"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-1678-3_21"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jestch.2014.04.004"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2008.08.008"},{"key":"e_1_3_1_7_2","first-page":"2004","article-title":"ASR on Speech Reconstructed from Short-time Fourier Phase Spectra","author":"Alsteris L.D.","unstructured":"AlsterisL.D. and PaliwalK.K., ASR on Speech Reconstructed from Short-time Fourier Phase Spectra, Proc of Int Conference on Spoken Language Processing 2004.","journal-title":"Proc of Int Conference on Spoken Language Processing"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1049\/el.2015.1418"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-015-1019-z"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1155\/2016\/4908412"},{"key":"e_1_3_1_11_2","first-page":"2014","article-title":"Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter, Coimbatore, India","author":"Chauhan P.M.","unstructured":"ChauhanP.M. and DesaiN.P., Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter, Coimbatore, India, Proc IEEE Int Conference on Green Computing Communication and Electrical Engineering (ICGCCEE) 2014.","journal-title":"Proc IEEE Int Conference on Green Computing Communication and Electrical Engineering (ICGCCEE)"},{"key":"e_1_3_1_12_2","first-page":"2012","article-title":"Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Portland, Oregon","author":"Li W.","unstructured":"LiW. and BourlardH., Sub-Band Based Log-Energy and its Dynamic Range Stretching for Robust In-Car Speech Recognition, Portland, Oregon, Proc Int Conference Speech Communication Association 2012.","journal-title":"Proc Int Conference Speech Communication Association"},{"key":"e_1_3_1_13_2","first-page":"791","article-title":"Integration of Mel-frequency Cepstral Coefficients with Log Energy and Temporal Derivatives for Text-Independent Speaker Identification, Singapore, Springer, Volume 1, pp","author":"Dhonde S.B.","unstructured":"DhondeS.B., ChaudhariA. and JagadeS.M., Integration of Mel-frequency Cepstral Coefficients with Log Energy and Temporal Derivatives for Text-Independent Speaker Identification, Singapore, Springer, Volume 1, pp, Proc Int Conference on Data Engineering and Communication Technology: ICDECT 2016 791\u2013797.","journal-title":"Proc Int Conference on Data Engineering and Communication Technology: ICDECT 2016"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1049\/iet-bmt.2014.0011"},{"key":"e_1_3_1_15_2","doi-asserted-by":"crossref","unstructured":"BarbosaF.G. and SilvaW.L.S. Support Vector Machines Mel-Frequency Cepstral Coefficients and the Discrete Cosine Transform Applied on Voice Based Biometric Authentication Proc Int Conference SAI Intelligent Systems Conference London 2015.","DOI":"10.1109\/IntelliSys.2015.7361270"},{"key":"e_1_3_1_16_2","first-page":"1","article-title":"Text Independent Speaker Recognition Using Wavelet Cepstral Coefficient and Butter Filter, India, pp","author":"Rathor S.","year":"2017","unstructured":"RathorS. and JadonR.S., Text Independent Speaker Recognition Using Wavelet Cepstral Coefficient and Butter Filter, India, pp, Proc Of IEEE Int Conference on Computing Communication and Networking Technologies (ICCCNT) (2017), 1\u20135 Delhi Worth.","journal-title":"Proc Of IEEE Int Conference on Computing Communication and Networking Technologies (ICCCNT)"},{"issue":"05","key":"e_1_3_1_17_2","first-page":"367","article-title":"Emotion recognition using multilayer perceptron and generalized feed forward neural network","volume":"68","author":"Khanchandani K.B.","year":"2009","unstructured":"KhanchandaniK.B. and HussainM.A., Emotion recognition using multilayer perceptron and generalized feed forward neural network, Journal of Scientific and Industrial Research (JSIR) 68(05) (2009), 367\u2013371.","journal-title":"Journal of Scientific and Industrial Research (JSIR)"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2014.11.016"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2377591"},{"issue":"9","key":"e_1_3_1_20_2","first-page":"75","article-title":"Speaker identification system using wavelet transform and VQ modeling technique","volume":"112","author":"Yadav S.S.","year":"2015","unstructured":"YadavS.S. and BhalkeD.G., Speaker identification system using wavelet transform and VQ modeling technique, International Journal of Computer Applications 112(9) (2015), 75\u201388.","journal-title":"International Journal of Computer Applications"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asej.2014.07.006"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2012.02.002"},{"key":"e_1_3_1_23_2","unstructured":"Pradeep KumarP. RaoP. A Study of Frequency-Scale Warping for Speaker Recognition Proc of National Conference on Communications NCC 2004 IISc Bangalore 2004."},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1049\/iet-spr.2015.0488"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2011.05.035"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.3233\/JIFS-18425"},{"key":"e_1_3_1_27_2","doi-asserted-by":"publisher","DOI":"10.3233\/JIFS-17267"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/TFUZZ.2009.2029569"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2017.2661862"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-016-2263-2"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-016-2401-x"},{"key":"e_1_3_1_32_2","article-title":"Study of speaker recognition system based on Feed Forward deep neural networks exploring text-dependent mode","author":"Makrem B.J.","year":"2016","unstructured":"MakremB.J., ImenJ. and Ka\u00efsO., Study of speaker recognition system based on Feed Forward deep neural networks exploring text-dependent mode, Sciences of Electronics, Technologies of Information and Telecommunications SETIT 2016.","journal-title":"Sciences of Electronics, Technologies of Information and Telecommunications SETIT"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISCON.2002.1215947"},{"key":"e_1_3_1_34_2","doi-asserted-by":"crossref","unstructured":"Le-QingZ. Insect Sound Recognition Based on MFCC and PNN Pro International Conference on Multimedia and Signal Processing (CMSP)2011.","DOI":"10.1109\/CMSP.2011.100"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2009.11.008"},{"key":"e_1_3_1_36_2","doi-asserted-by":"crossref","unstructured":"MaazouziA.-E. AqiliN. RajiM. and HammouchA. A speaker recognition system using power spectrum density and similarity measurements IEEE Third World Conference on Complex Systems (WCCS) Marrakech Morocco 2015.","DOI":"10.1109\/ICoCS.2015.7483213"},{"key":"e_1_3_1_37_2","doi-asserted-by":"crossref","unstructured":"ShoumyN.J. YaakobS.N. EhkanP. AliMd.S. and KhatunS. Cascade-forward neural network performance study for bloodstain image analysis IEEE Int Conference Electronic Design (ICED) Thailand IEEE 2016.","DOI":"10.1109\/ICED.2016.7804646"},{"key":"e_1_3_1_38_2","unstructured":"KuaJ.M.K. ThiruvaranT. AmbikairajahM.N.E. and EppsJ. Investigation of Spectral Centroid Magnitude and Frequency for Speaker Recognition The Speaker and Language Recognition Workshop Brno Czech Republic 2010."},{"key":"e_1_3_1_39_2","doi-asserted-by":"publisher","DOI":"10.5614\/itbj.ict.res.appl.2013.7.2.1"},{"key":"e_1_3_1_40_2","unstructured":"FuruiS.J. Journal of VLSI Signal Processing Systems for Signal Image and Video Technology 2005."},{"key":"e_1_3_1_41_2","article-title":"Experiments on Acoustic Model supervised adaptation and evaluation by K-Fold Cross Validation technique","author":"Caon D.R.S.","year":"2010","unstructured":"CaonD.R.S., AmehrayeA., RazikJ., CholletG., AndreaoR.V. and MokbelC., Experiments on Acoustic Model supervised adaptation and evaluation by K-Fold Cross Validation technique, Proc International Symposium on I\/V Communications and Mobile Network, 2010.","journal-title":"Proc International Symposium on I\/V Communications and Mobile Network"},{"issue":"4","key":"e_1_3_1_42_2","article-title":"Classification of gender based on voice using support vector machine","volume":"3","author":"Renisha G.P.","year":"2016","unstructured":"RenishaG.P., KarpagavalliK.P. and Krishnaveni, Classification of gender based on voice using support vector machine, International Journal of Advanced Research Trends in Engineering and Technology (IJARTET) 3(4) (2016).","journal-title":"International Journal of Advanced Research Trends in Engineering and Technology (IJARTET)"},{"issue":"2","key":"e_1_3_1_43_2","article-title":"Person authentication using face and voice modalities","volume":"1","author":"Lalitha S.","year":"2013","unstructured":"LalithaS., AshwiniV., MadhusudhanK.N. and SachinB.S., Person authentication using face and voice modalities, International Journal of Advances in Science Engineering and Technology 1(2) (2013).","journal-title":"International Journal of Advances in Science Engineering and Technology"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-182599","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-182599","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-182599","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:38:38Z","timestamp":1777455518000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-182599"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,6,18]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,7,9]]}},"alternative-id":["10.3233\/JIFS-182599"],"URL":"https:\/\/doi.org\/10.3233\/jifs-182599","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,6,18]]}}}