{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T19:28:01Z","timestamp":1774639681695,"version":"3.50.1"},"reference-count":32,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Info. Know. Mgmt."],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:p> Spoken language identification is the process of recognising language in an audio segment and is the precursor for several technologies such as automatic call routing, language recognition, multilingual conversation, language parsing, and sentimental analysis. Language identification has become a challenging task for low-resource languages like Kashmiri and Ladakhi spoken in the UT\u2019s of Jammu and Kashmir (JK) and Ladakh, India. This is mainly due to speaker variations like duration, moderator, and ambiance particularly when training and testing are done on different datasets whilst analysing the accuracy of language identification system in actual implementation, thus producing low accuracy results. In order to tackle this problem, we propose a hybrid convolutional bi-directional gated recurrent unit (Bi-GRU) utilising the effects of both static and dynamic behaviour of the audio signal in order to achieve better results as compared to state-of-the-art models. The audio signals are first converted into two-dimensional structures called Mel-spectrograms to represent the frequency distribution over time. To investigate the spectral behaviour of audio signals, we employ a convolutional neural network (CNN) that perceives Mel-spectrograms in multiple dimensions. The CNN-learned feature vector serves as input to the Bi-GRU that maintains the dynamic behaviour of the audio signal. Experiments are done on six spoken languages, i.e. Ladakhi, Kashmiri, Hindi, Urdu, English, and Dogri. The data corpora used for experimentation are the International Institute of Information Technology Hyderabad-Indian Language Speech Corpus (IIITH-ILSC) and the self-created data corpus for the Ladakhi language. The model is tested on two datasets, i.e. speaker-dependent and speaker-independent. Results show that when validating the efficiency of our proposed model on both speaker-dependent and speaker-independent datasets, we achieve optimal accuracies of 99% and 91%, respectively, thus achieving promising results in comparison to the state-of-the-art models available. <\/jats:p>","DOI":"10.1142\/s0219649223500284","type":"journal-article","created":{"date-parts":[[2023,7,19]],"date-time":"2023-07-19T10:21:46Z","timestamp":1689762106000},"source":"Crossref","is-referenced-by-count":3,"title":["A Hybrid Convolutional Bi-Directional Gated Recurrent Unit System for Spoken Languages of JK and Ladakhi"],"prefix":"10.1142","volume":"22","author":[{"given":"Irshad Ahmad","family":"Thukroo","sequence":"first","affiliation":[{"name":"Department of Computer Science, Islamic University of Science & Technology, 1-University Avenue, Awantipora, Pulwama 192122, Jammu and Kashmir, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6656-005X","authenticated-orcid":false,"given":"Rumaan","family":"Bashir","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Islamic University of Science & Technology, 1-University Avenue, Awantipora, Pulwama 192122, Jammu and Kashmir, India"}]},{"given":"Kaiser J.","family":"Giri","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Islamic University of Science & Technology, 1-University Avenue, Awantipora, Pulwama 192122, Jammu and Kashmir, India"}]}],"member":"219","published-online":{"date-parts":[[2023,7,18]]},"reference":[{"issue":"8","key":"S0219649223500284BIB001","doi-asserted-by":"crossref","first-page":"3589","DOI":"10.1007\/s00034-017-0724-1","volume":"37","author":"Adeeba F","year":"2018","journal-title":"Circuits, Systems, and Signal Processing"},{"key":"S0219649223500284BIB002","doi-asserted-by":"crossref","first-page":"4596","DOI":"10.1007\/s00034-020-01388-9","volume":"39","author":"Albadr M","year":"2020","journal-title":"Circuits, Systems, and Signal Processing"},{"issue":"2","key":"S0219649223500284BIB003","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/MCAS.2011.941081","volume":"11","author":"Ambikairajah E","year":"2011","journal-title":"IEEE Circuits and Systems Magazine"},{"key":"S0219649223500284BIB004","doi-asserted-by":"crossref","first-page":"880","DOI":"10.1007\/978-3-319-70136-3_93","volume-title":"International Conference on Neural Information Processing","author":"Bartz C","year":"2017"},{"key":"S0219649223500284BIB005","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1109\/ICIIP.2013.6707658","volume-title":"2013 IEEE Second International Conference on Image Information Processing (ICIIP-2013)","author":"Bashir R","year":"2013"},{"key":"S0219649223500284BIB006","doi-asserted-by":"crossref","first-page":"5991","DOI":"10.1109\/ICASSP.2019.8682386","volume-title":"ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Cai W","year":"2019"},{"key":"S0219649223500284BIB007","doi-asserted-by":"crossref","first-page":"181432","DOI":"10.1109\/ACCESS.2020.3028241","volume":"8","author":"Das A","year":"2020","journal-title":"IEEE Access"},{"issue":"4","key":"S0219649223500284BIB008","doi-asserted-by":"crossref","first-page":"3425","DOI":"10.1007\/s13369-020-04430-9","volume":"45","author":"Das HS","year":"2020","journal-title":"Arabian Journal for Science and Engineering"},{"key":"S0219649223500284BIB009","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1007\/s10772-018-9526-5","volume":"21","author":"Firooz G","year":"2018","journal-title":"International Journal of Speech Technology"},{"key":"S0219649223500284BIB010","volume-title":"Fifteenth Annual Conference of the International Speech Communication Association","author":"Ganapathy S","year":"2014"},{"key":"S0219649223500284BIB011","doi-asserted-by":"crossref","first-page":"114416","DOI":"10.1016\/j.eswa.2020.114416","volume":"168","author":"Garain A","year":"2021","journal-title":"Expert Systems with Applications"},{"key":"S0219649223500284BIB012","doi-asserted-by":"crossref","first-page":"182868","DOI":"10.1109\/ACCESS.2020.3028121","volume":"8","author":"Guha S","year":"2020","journal-title":"IEEE Access"},{"key":"S0219649223500284BIB013","first-page":"448","volume-title":"International Conference on Machine Learning","author":"Ioffe S","year":"2015"},{"issue":"3","key":"S0219649223500284BIB014","doi-asserted-by":"crossref","first-page":"544","DOI":"10.1016\/j.dsp.2011.11.008","volume":"22","author":"Jothilakshmi S","year":"2012","journal-title":"Digital Signal Processing"},{"key":"S0219649223500284BIB015","first-page":"1","volume-title":"2021 IEEE International Conference on Smart Information Systems and Technologies (SIST)","author":"Kaiyr A","year":"2021"},{"issue":"1","key":"S0219649223500284BIB016","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1017\/S0261143021000192","volume":"40","author":"Keeken AV","year":"2021","journal-title":"Popular Music"},{"issue":"4","key":"S0219649223500284BIB017","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1007\/s10772-017-9466-5","volume":"20","author":"Koolagudi SG","year":"2017","journal-title":"International Journal of Speech Technology"},{"key":"S0219649223500284BIB018","doi-asserted-by":"crossref","first-page":"3391","DOI":"10.1016\/j.proeng.2012.06.392","volume":"38","author":"Koolagudi SG","year":"2012","journal-title":"Procedia Engineering"},{"key":"S0219649223500284BIB019","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1007\/978-981-16-7996-4_7","volume-title":"Machine Learning and Autonomous Systems","author":"Kulkarni R","year":"2022"},{"issue":"5","key":"S0219649223500284BIB020","doi-asserted-by":"crossref","first-page":"1136","DOI":"10.1109\/JPROC.2012.2237151","volume":"101","author":"Li H","year":"2013","journal-title":"Proceedings of the IEEE"},{"key":"S0219649223500284BIB021","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.csl.2017.01.006","volume":"44","author":"Lu X","year":"2017","journal-title":"Computer Speech & Language"},{"issue":"1","key":"S0219649223500284BIB022","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1109\/TVT.2018.2879361","volume":"68","author":"Ma Z","year":"2019","journal-title":"IEEE Transactions on Vehicular Technology"},{"key":"S0219649223500284BIB023","first-page":"1","volume-title":"2012 National Conference on Communications (NCC)","author":"Maity S","year":"2012"},{"key":"S0219649223500284BIB024","volume-title":"Second International Conference on Spoken Language Processing","author":"Muthusamy YK","year":"1992"},{"key":"S0219649223500284BIB025","first-page":"1","volume-title":"2013 International Conference Oriental COCOSDA Held Jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA\/CASLRE)","author":"Nandi D","year":"2013"},{"issue":"4","key":"S0219649223500284BIB026","doi-asserted-by":"crossref","first-page":"97","DOI":"10.23919\/SAIEE.2009.8531857","volume":"100","author":"Pech\u00e9 M","year":"2009","journal-title":"SAIEE Africa Research Journal"},{"issue":"4","key":"S0219649223500284BIB027","doi-asserted-by":"crossref","first-page":"489","DOI":"10.1007\/s10772-013-9198-0","volume":"16","author":"Reddy VR","year":"2013","journal-title":"International Journal of Speech Technology"},{"key":"S0219649223500284BIB028","doi-asserted-by":"crossref","first-page":"107020","DOI":"10.1016\/j.apacoust.2019.107020","volume":"158","author":"Sharma G","year":"2020","journal-title":"Applied Acoustics"},{"issue":"11","key":"S0219649223500284BIB029","doi-asserted-by":"crossref","first-page":"5018","DOI":"10.1007\/s00034-019-01100-6","volume":"38","author":"Srinivas NS","year":"2019","journal-title":"Circuits, Systems, and Signal Processing"},{"key":"S0219649223500284BIB030","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1109\/ICSC53193.2021.9673212","volume-title":"2021 7th International Conference on Signal Processing and Communication (ICSC)","author":"Thukroo IA","year":"2021"},{"key":"S0219649223500284BIB031","doi-asserted-by":"crossref","first-page":"56","DOI":"10.21437\/SLTU.2018-12","volume-title":"6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018)","author":"Vuddagiri RK","year":"2018"},{"key":"S0219649223500284BIB032","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1016\/j.neucom.2020.08.069","volume":"453","author":"Zhang Z","year":"2021","journal-title":"Neurocomputing"}],"container-title":["Journal of Information &amp; Knowledge Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219649223500284","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,5]],"date-time":"2023-09-05T05:57:42Z","timestamp":1693893462000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0219649223500284"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,18]]},"references-count":32,"journal-issue":{"issue":"04","published-print":{"date-parts":[[2023,8]]}},"alternative-id":["10.1142\/S0219649223500284"],"URL":"https:\/\/doi.org\/10.1142\/s0219649223500284","relation":{},"ISSN":["0219-6492","1793-6926"],"issn-type":[{"value":"0219-6492","type":"print"},{"value":"1793-6926","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,18]]},"article-number":"2350028"}}