{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T03:32:35Z","timestamp":1775878355220,"version":"3.50.1"},"reference-count":76,"publisher":"Wiley","license":[{"start":{"date-parts":[[2022,3,7]],"date-time":"2022-03-07T00:00:00Z","timestamp":1646611200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Applied Computational Intelligence and Soft Computing"],"published-print":{"date-parts":[[2022,3,7]]},"abstract":"<jats:p>The present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from voice processing systems. The proposed method minimizes the system\u2019s complexity by reducing search space during the testing process of speaker identification. The speaker is searched from the set of speakers of the identified dialect instead of all the speakers present in system training. The study is conducted on the Pashto language, and the voice data samples are collected from native Pashto speakers of specific regions of Pakistan and Afghanistan where Pashto is spoken with different dialectal variations. The task of speaker identification is achieved with the help of a novel hierarchical framework that works in two steps. In the first step, the speaker\u2019s dialect is identified. For automated dialect identification, the spectral and prosodic features have been used in conjunction with Gaussian mixture model (GMM). In the second step, the speaker is identified using a multilayer perceptron (MLP)-based speaker identification system, which gets aggregated input from the first step, i.e., dialect identification along with prosodic and spectral features. The robustness of the proposed SIS is compared with traditional state-of-the-art methods in the literature. The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker.<\/jats:p>","DOI":"10.1155\/2022\/4980920","type":"journal-article","created":{"date-parts":[[2022,3,7]],"date-time":"2022-03-07T23:35:08Z","timestamp":1646696108000},"page":"1-16","source":"Crossref","is-referenced-by-count":5,"title":["A Robust Approach for Speaker Identification Using Dialect Information"],"prefix":"10.1155","volume":"2022","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0953-4055","authenticated-orcid":true,"given":"Shahid Munir","family":"Shah","sequence":"first","affiliation":[{"name":"Faculty of IT, Salim Habib University, Karachi, Pakistan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4735-0692","authenticated-orcid":true,"given":"Muhammad","family":"Moinuddin","sequence":"additional","affiliation":[{"name":"Center of Excellence in Intelligent Engineering Systems, King Abdul Aziz University, Jeddah, Saudi Arabia"},{"name":"Electrical and Computer Engineering Department, King Abdul Aziz University, Jeddah, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0819-800X","authenticated-orcid":true,"given":"Rizwan Ahmed","family":"Khan","sequence":"additional","affiliation":[{"name":"Faculty of IT, Salim Habib University, Karachi, Pakistan"}]}],"member":"311","reference":[{"key":"1","first-page":"434","article-title":"Secure voice biometric authentication","volume":"16","author":"R. Roberts","year":"2019","journal-title":"US Patent App"},{"key":"2","volume-title":"Voxceleb: A Large-Scale Speaker Identification Dataset","author":"A. Nagrani","year":"2017"},{"key":"3","first-page":"247","article-title":"Secure smart home: a voiceprint and internet based authentication system for remote accessing","author":"H. Ren"},{"key":"4","doi-asserted-by":"crossref","DOI":"10.14722\/ndss.2019.23362","article-title":"Practical hidden voice attacks against speech and speaker recognition systems","author":"H. Abdullah","year":"2019"},{"key":"5","first-page":"109","article-title":"Voice-based user interface with dynamically switchable endpoints","volume":"15","author":"H. Huang","year":"2019","journal-title":"NoteUS Patent App"},{"key":"6","article-title":"Adversarial examples in the physical world","author":"A. Kurakin","year":"2016"},{"key":"7","first-page":"49","article-title":"A systematic approach for practical adversarial voice recognition","author":"X. Yuan"},{"key":"8","first-page":"1","article-title":"Audio adversarial examples: targeted attacks on speech-to-text","author":"N. Carlini"},{"key":"9","first-page":"6221","article-title":"Channel adversarial training for cross-channel text-independent speaker recognition","author":"X. Fang"},{"key":"10","first-page":"6331","article-title":"An improved uncertainty propagation method for robust i-vector based speaker recognition","author":"D. Ribas"},{"key":"11","first-page":"1106","volume-title":"Robust Speaker Recognition From Distant Speech Under Real Reverberant Environments Using Speaker Embeddings","author":"M. K. Nandwana"},{"key":"12","doi-asserted-by":"publisher","DOI":"10.1109\/msp.2015.2462851"},{"key":"13","first-page":"3698","volume-title":"Improving Robustness to Compressed Speech in Speaker Recognition","author":"M. McLaren"},{"key":"14","doi-asserted-by":"crossref","first-page":"1256","DOI":"10.1134\/S1064226919110184","article-title":"Speaker modeling using emotional speech for more robust speaker identification","volume":"64","author":"M. M. \u017d. Nedeljkovi\u0107","year":"2019","journal-title":"Journal of Communications Technology and Electronics"},{"key":"15","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-016-3350-1"},{"key":"16","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-015-9328-y"},{"key":"17","doi-asserted-by":"publisher","DOI":"10.1109\/access.2020.2974799"},{"key":"18","first-page":"2138","article-title":"A systematic strategy for robust automatic dialect identification","author":"G. A. Liu"},{"key":"19","article-title":"Automatic dialect and accent recognition and its application to speech recognition","author":"F. Biadsy","year":"2011"},{"key":"20","doi-asserted-by":"publisher","DOI":"10.1109\/icassp.2013.6639089"},{"key":"21","doi-asserted-by":"crossref","DOI":"10.21437\/Odyssey.2014-17","volume-title":"Swiss French Regional Accent Identification","author":"A. Lazaridis","year":"2014"},{"key":"22","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2013.07.008"},{"key":"23","doi-asserted-by":"publisher","DOI":"10.1142\/s0219691318500315"},{"key":"24","first-page":"1784","article-title":"Accent identification, proceeding of fourth international conference on spoken language processing","volume":"3","author":"T. Carlos","year":"1996","journal-title":"IEEE"},{"key":"25","first-page":"554","article-title":"Accent classification in human speech biometrics for native and non-native English speakers","author":"J. J. Bird"},{"key":"26","doi-asserted-by":"crossref","article-title":"Dialect recognition using a phone-gmm-supervector-based svm kernel","author":"F. Biadsy","DOI":"10.21437\/Interspeech.2010-277"},{"key":"27","first-page":"26","volume-title":"Detecting Nonnative Speech Using Speaker Recognition Approaches","author":"E. Shriberg","year":"2008"},{"key":"28","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2014.10.004"},{"key":"29","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1109\/TASLP.2015.2489558","article-title":"I-vector modeling of speech attributes for automatic foreign accent recognition","volume":"24","author":"H. Behravan","year":"2015","journal-title":"IEEE\/ACM Transactions on Audio, Speech, and Language Processing"},{"key":"30","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-016-9351-7"},{"key":"31","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/d15-1254"},{"key":"32","first-page":"35","article-title":"Arabic dialect identification using a parallel multidialectal corpus","author":"S. Malmasi"},{"key":"33","article-title":"Automatic dialect detection in arabic broadcast speech","author":"A. Ali","year":"2015"},{"key":"34","first-page":"512","article-title":"Voice-based liveness verification","volume":"10","author":"B. Pellom","year":"2019","journal-title":"US Patent"},{"key":"35","first-page":"132","volume-title":"Identification of British English Regional Accents Using Fusion of I-Vector and Multi-Accent Phonotactic Systems","author":"M. Najafian","year":"2016"},{"key":"36","doi-asserted-by":"publisher","DOI":"10.1109\/disa.2018.8490639"},{"key":"37","article-title":"Turkish regional dialect recognition using acoustic features of voiced segments","volume":"8","author":"B. Uslu","year":"2018","journal-title":"IJSPS"},{"key":"38","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-016-9347-6"},{"key":"39","article-title":"Automatic Arabic dialect classification","volume":"8887","author":"E. J. Harfash","year":"2017","journal-title":"International Journal of Computer Application"},{"key":"40","doi-asserted-by":"publisher","DOI":"10.1109\/intellisys.2015.7361259"},{"key":"41","doi-asserted-by":"publisher","DOI":"10.1109\/icaci.2016.7449852"},{"key":"42","doi-asserted-by":"publisher","DOI":"10.18178\/ijsps.4.3.235-238"},{"key":"43","article-title":"Classification of closely related sub-dialects of Arabic using support-vector machines","author":"S. Wray"},{"key":"44","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/w17-1222"},{"key":"45","doi-asserted-by":"crossref","first-page":"1534","DOI":"10.1016\/j.procs.2018.08.126","article-title":"I-vector extraction for speaker recognition based on dimensionality reduction","volume":"126","author":"I. Salwani","year":"2018","journal-title":"Procedia Computer Science"},{"key":"46","doi-asserted-by":"crossref","article-title":"Convolutional neural networks and language embeddings for end-to-end dialect recognition","author":"S. Shon","DOI":"10.21437\/Odyssey.2018-14"},{"key":"47","first-page":"5716","article-title":"A highly adaptive acoustic model for accurate multi-dialect speech recognition","author":"S. Yoo"},{"key":"48","doi-asserted-by":"publisher","DOI":"10.1017\/s1351324920000091"},{"key":"49","doi-asserted-by":"publisher","DOI":"10.21437\/interspeech.2009-77"},{"key":"50","article-title":"Two-stage training for chinese dialect recognition","author":"Z. Ren","year":"2019"},{"key":"51","doi-asserted-by":"publisher","DOI":"10.3389\/fcomm.2019.00064"},{"key":"52","first-page":"12","article-title":"Gender, age and dialect recognition using tweets in a deep learning framework-notebook for fire","author":"C. Suman"},{"key":"53","first-page":"6","article-title":"Deep neural network acoustic modeling for native and non-native Mandarin speech recognition","author":"X. Chen"},{"key":"54","doi-asserted-by":"publisher","DOI":"10.21437\/interspeech.2014-497"},{"key":"55","doi-asserted-by":"publisher","DOI":"10.21437\/interspeech.2015-718"},{"key":"56","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/j.specom.2003.09.003","article-title":"Generating non-native pronunciation variants for lexicon adaptation","volume":"42","author":"G. Silke","year":"2004","journal-title":"Speech Communication"},{"key":"57","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1016\/j.specom.2020.05.003","article-title":"Automatic accent identification as an analytical tool for accent robust automatic speech recognition","volume":"122","author":"N. Maryam","year":"2020","journal-title":"Speech Communication"},{"key":"58","doi-asserted-by":"crossref","first-page":"133","DOI":"10.54418\/ca-85.15","article-title":"The 19th and early 20th-century us women\u2019s rights struggle: implications for contemporary afghani and pakistani pashtun women","volume":"85","author":"I. Ali","year":"2019","journal-title":"Central Asia"},{"key":"59","article-title":"Archive for the \u201cpashtunistan\u201d category","author":"World Press"},{"key":"60","first-page":"2190","article-title":"Speaker recognition for Pashto speakers based on isolated digits recognition using accent and dialect approach","volume":"15","author":"S. M. Shah","year":"2020","journal-title":"Journal of Engineering Science & Technology"},{"key":"61","doi-asserted-by":"publisher","DOI":"10.1504\/ijapr.2017.089398"},{"key":"62","doi-asserted-by":"publisher","DOI":"10.1109\/icieect.2017.7916565"},{"key":"63","first-page":"1","article-title":"Pashto spoken digits database for the automatic speech recognition research","author":"A. W. Abbas"},{"key":"64","doi-asserted-by":"publisher","DOI":"10.1007\/s13369-017-2941-0"},{"key":"65","doi-asserted-by":"publisher","DOI":"10.1109\/icetst49965.2020.9080730"},{"key":"66","doi-asserted-by":"publisher","DOI":"10.1109\/ic3.2019.8844908"},{"key":"67","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-019-00928-3"},{"key":"68","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-018-09582-6"},{"key":"69","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-0372-6_20"},{"key":"70","doi-asserted-by":"publisher","DOI":"10.1121\/1.2916590"},{"key":"71","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"72","doi-asserted-by":"publisher","DOI":"10.1109\/msp.2012.2205597"},{"key":"73","doi-asserted-by":"publisher","DOI":"10.1109\/icassp.2013.6639344"},{"key":"74","first-page":"665","article-title":"Text-independent speaker identification using backpropagation mlp network classifier for a closed set of speakers","author":"A. Sharma"},{"key":"75","first-page":"19","article-title":"MFCC and its applications in speaker recognition","volume":"1","author":"V. Tiwari","year":"2010","journal-title":"International Journal on Emerging Technologies"},{"key":"76","doi-asserted-by":"publisher","DOI":"10.1109\/5.628714"}],"container-title":["Applied Computational Intelligence and Soft Computing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/acisc\/2022\/4980920.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/acisc\/2022\/4980920.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/acisc\/2022\/4980920.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,28]],"date-time":"2023-01-28T15:27:25Z","timestamp":1674919645000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/acisc\/2022\/4980920\/"}},"subtitle":[],"editor":[{"given":"Upaka","family":"Rathnayake","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,7]]},"references-count":76,"alternative-id":["4980920","4980920"],"URL":"https:\/\/doi.org\/10.1155\/2022\/4980920","relation":{},"ISSN":["1687-9732","1687-9724"],"issn-type":[{"value":"1687-9732","type":"electronic"},{"value":"1687-9724","type":"print"}],"subject":[],"published":{"date-parts":[[2022,3,7]]}}}