{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:27:23Z","timestamp":1754155643199,"version":"3.41.2"},"reference-count":19,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2013,6,7]],"date-time":"2013-06-07T00:00:00Z","timestamp":1370563200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,6,7]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-heading\">Purpose<\/jats:title><jats:p>The purpose of this paper is to introduce VoxGrid, a mobile voice verification system intended for improving the security of the username\u2010password authentication scheme.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title><jats:p>The system incorporates text\u2010dependant speaker verification via mobile devices that provides for a three\u2010factor authentication scheme for granting authorised access to certain websites or applications. The same speech recognition engine used by Google Voice Search is utilised to provide voice\u2010to\u2010text feature. All verification tasks are performed on a centralised server to minimise computing requirements on mobile platforms where feature extractions is executed using Mel Frequency Cepstral Coefficients. The resulting features are transmitted to the server instead of raw voice data to reduce network load. Actual voice verification takes place in the central server using Vector Quantisation.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Findings<\/jats:title><jats:p>The initial results have indicated that VoxGrid is capable of providing an additional level of security on user authentications at a low cost and without using extra security tokens other than one's voice with a good enough performance given the limited resources available during testing.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title><jats:p>Past speaker verification experiments have been conducted but we see that this is the first time it is done on mobile devices with a client\u2010server architecture using K\u2010Means Clustering and Vector Quantisation. Future improvements on performance and testing could result in a more secure mobile computing environment.<\/jats:p><\/jats:sec>","DOI":"10.1108\/imcs-09-2012-0048","type":"journal-article","created":{"date-parts":[[2013,7,25]],"date-time":"2013-07-25T14:11:53Z","timestamp":1374761513000},"page":"110-120","source":"Crossref","is-referenced-by-count":0,"title":["VoxGrid: a mobile voice verification system"],"prefix":"10.1108","volume":"21","author":[{"given":"Mariah","family":"Strella P. Indrinal","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ranyel","family":"Bryan L. Maliwanag","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marynyriene I.","family":"Silvestre","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"doi-asserted-by":"crossref","unstructured":"Besacier, L., Ariyaeeinia, A.M., Mason, J.S., Bonastre, J.F., Mayorga, P., Fredouille, C., Meignier, S., Siau, J., Evans, N.W.D., Auckenthaler, R. and Stapert, R. (2004), \u201cVoice biometrics over the internet in the framework of COST action 275\u201d, EURASIP Journal on Applied Signal Processing, Vol. 2004, pp. 466\u2010479.","key":"key2022012120195292700_b2","DOI":"10.1155\/S1110865704310012"},{"doi-asserted-by":"crossref","unstructured":"Bimbot, F., Bonastre, J.\u2010F., Fredouille, C., Gravier, G., Magrin\u2010Chagnolleau, I., Meignier, S., Merlin, T., Ortega\u2010Garcia, J., Petrovska\u2010Delacretaz, D. and Reynolds, D.A. (2004), \u201cA tutorial on text\u2010independent speaker verification\u201d, Eurasip Journal on Applied Signal Processing, Vol. 2004, pp. 430\u2010451.","key":"key2022012120195292700_b12","DOI":"10.1155\/S1110865704310024"},{"doi-asserted-by":"crossref","unstructured":"Daemen, J. and Rijmen, V. (2002), The Design of Rijndael: AES \u2013 The Advanced Encryption Standard, Springer, Berlin.","key":"key2022012120195292700_b3","DOI":"10.1007\/978-3-662-04722-4_1"},{"unstructured":"Federal Information Processing Standards (2001), Specification for the Advanced Encryption Standard (AES), available at: http:\/\/csrc.nist.gov\/publications\/fips\/fips197\/fips\u2010197.pdf (accessed 1 January 2012).","key":"key2022012120195292700_b1"},{"unstructured":"Hasan, M.R., Jamil, M., Rabbani, M.G. and Rahman, M.S. (2004), \u201cSpeaker identification using mel frequency cepstral coefficients\u201d, 3rd International Conference on Electrical & Computer Engineering in Dhaka, Bangladesh, pp. 28\u201030.","key":"key2022012120195292700_b13"},{"doi-asserted-by":"crossref","unstructured":"Kaur, J., Kaur, R. and Gill, M.K. (2010), \u201cVector quantization based speaker identification\u201d, International Journal of Computer Applications, Vol. 4 No. 2, pp. 1\u20104.","key":"key2022012120195292700_b4","DOI":"10.5120\/806-1146"},{"doi-asserted-by":"crossref","unstructured":"Kinnunen, T. and Li, H. (2009), An Overview of Text\u2010Independent Speaker Recognition: From Features to Supervectors, Elsevier, Amsterdam.","key":"key2022012120195292700_b16","DOI":"10.1016\/j.specom.2009.08.009"},{"unstructured":"McEnnis, D. (2011), JAudio Package, Version 1.0.4, Sourceforge, Mountain View, CA.","key":"key2022012120195292700_b19"},{"unstructured":"Markov, K. (2012), Lecture 2, ITA14: Introduction to Automatic Speech Recognition, University of Aizu, Aizuwakamatsu, First quarter.","key":"key2022012120195292700_b17"},{"doi-asserted-by":"crossref","unstructured":"Mehendale, A. and Dixit, M.R. (2011), \u201cSpeaker identification\u201d, Signal & Image Processing: An International Journal (SIPIJ\u2009), Vol. 2 No. 2, pp. 62\u201069.","key":"key2022012120195292700_b14","DOI":"10.5121\/sipij.2011.2206"},{"unstructured":"Modi, S.K. (2011), Biometrics in Identity Management: Concepts to Applications, Artech House, London.","key":"key2022012120195292700_b15"},{"unstructured":"Muda, M.B.L. and Elamvazuthi, I. (2010), \u201cVoice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques\u201d, Journal of Computing, Vol. 2 No. 3, pp. 138\u2010143.","key":"key2022012120195292700_b5"},{"doi-asserted-by":"crossref","unstructured":"Munteanu, D.P. and Toma, S.A. (2010), \u201cAutomatic speaker verification experiments using HMM\u201d, IEEE 8th International Conference on Communications (COMM) in Bucharest, Romania, pp. 107\u2010110.","key":"key2022012120195292700_b6","DOI":"10.1109\/ICCOMM.2010.5509021"},{"unstructured":"Nilsson, M. (2001), \u201cSpeaker verification in Java\u201d, Master's thesis, Griffith University, Queensland, October.","key":"key2022012120195292700_b7"},{"doi-asserted-by":"crossref","unstructured":"Picone, J.W. (1993), \u201cSignal modeling techniques in speech recognition\u201d, Proceedings of the IEEE, pp. 1215\u20101247.","key":"key2022012120195292700_b8","DOI":"10.1109\/5.237532"},{"unstructured":"Reynolds, D.A. (1992), \u201cA gaussian mixture modeling approach to text\u2010independent speaker identification\u201d, PhD thesis, Georgia Institute of Technology, Atlanta, GA, September.","key":"key2022012120195292700_b9"},{"doi-asserted-by":"crossref","unstructured":"Roberts, W.J.J. and Willmore, J.P. (1999), \u201cAutomatic speaker recognition using gaussian mixture models\u201d, in Evans, R., White, L., McMichael, D. and Sciacca, L. (Eds), Proceedings of Information Decision and Control 99 in Adelaide, Australia, IEEE, New York, NY, pp. 465\u2010470.","key":"key2022012120195292700_b10","DOI":"10.1109\/IDC.1999.754201"},{"unstructured":"Wriggers, W. (2003), \u201cVector quantization and reduced models\u201d, paper presented at Situs Modeling Workshop in San Diego, California, USA.","key":"key2022012120195292700_b18"},{"unstructured":"Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V. and Woodland, P. (2000), The HTK Book Version 3.0, Cambridge University, Cambridge.","key":"key2022012120195292700_b11"}],"container-title":["Information Management &amp; Computer Security"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/IMCS-09-2012-0048","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IMCS-09-2012-0048\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/IMCS-09-2012-0048\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T21:50:47Z","timestamp":1753393847000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ics\/article\/21\/2\/110-120\/180527"}},"subtitle":[],"editor":[{"given":"Veniamin","family":"Ginodman","sequence":"first","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2013,6,7]]},"references-count":19,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2013,6,7]]}},"alternative-id":["10.1108\/IMCS-09-2012-0048"],"URL":"https:\/\/doi.org\/10.1108\/imcs-09-2012-0048","relation":{},"ISSN":["0968-5227"],"issn-type":[{"type":"print","value":"0968-5227"}],"subject":[],"published":{"date-parts":[[2013,6,7]]}}}