{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T20:27:00Z","timestamp":1772915220725,"version":"3.50.1"},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2021,8,2]],"date-time":"2021-08-02T00:00:00Z","timestamp":1627862400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,8,2]],"date-time":"2021-08-02T00:00:00Z","timestamp":1627862400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003093","name":"Ministry of Higher Education, Malaysia","doi-asserted-by":"publisher","award":["FRGS19-051-0659"],"award-info":[{"award-number":["FRGS19-051-0659"]}],"id":[{"id":"10.13039\/501100003093","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Speech Technol"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>One of the most challenging techniques for speech analysis applications in mobile phones is acoustic feature extraction. The adverse environment noises, diversity of microphone specifications, and various recording software have a significant effect on the values of the extracted acoustic features. In this study, we investigate the robustness of different types of acoustic features related to time-based, frequency-based, and sustained vowel using 11 different mobile recording devices. 49 recordings of subjects reciting the Rainbow Passage and 25 recordings of sustained vowel \/a\/ were collected. By way of synchronous recording, we analyzed and compared the extracted 253-dimensional acoustic feature vectors in order to examine how consistent the data values between the different recording devices. The variability of data values was measured using the method of coefficient of variance. Data values with low variability were identified to be from features such as the transition parameters, amplitude modulation, contrast, Chroma, mean fundamental frequency and formants. These groups of features turn out to be more reliable than others in their dependency on the recording device specifications.<\/jats:p>","DOI":"10.1007\/s10772-021-09877-1","type":"journal-article","created":{"date-parts":[[2021,8,2]],"date-time":"2021-08-02T09:10:18Z","timestamp":1627895418000},"page":"1089-1100","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Mobile microphone robust acoustic feature identification using coefficient of variance"],"prefix":"10.1007","volume":"24","author":[{"given":"Nik Nur Wahidah Nik","family":"Hashim","sequence":"first","affiliation":[]},{"given":"Mugahed Al-Ezzi Ahmed","family":"Ezzi","sequence":"additional","affiliation":[]},{"given":"Mitchell D.","family":"Wilkes","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,8,2]]},"reference":[{"key":"9877_CR1","unstructured":"Boersma, P. (1993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound."},{"key":"9877_CR2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvoice.2018.10.016","author":"P Bottalico","year":"2018","unstructured":"Bottalico, P., et al. (2018). Reproducibility of voice parameters: The effect of room acoustics and microphones. Journal of Voice. https:\/\/doi.org\/10.1016\/j.jvoice.2018.10.016","journal-title":"Journal of Voice"},{"key":"9877_CR3","doi-asserted-by":"publisher","DOI":"10.1121\/1.4950019","author":"WW Clark","year":"2016","unstructured":"Clark, W. W., & Saunders, S. (2016). Assessment of noise exposures for pre-term infants during air transport to neonatal intensive care units using iPhone sound meter apps. Journal of the Acoustical Society of America. https:\/\/doi.org\/10.1121\/1.4950019","journal-title":"Journal of the Acoustical Society of America"},{"issue":"2","key":"9877_CR4","doi-asserted-by":"publisher","first-page":"167","DOI":"10.2307\/843871","volume":"42","author":"R Cohn","year":"1998","unstructured":"Cohn, R. (1998). Introduction to Neo-Riemannian to theory: A survey and a historical perspective. Journal of Music Theory, 42(2), 167\u2013180.","journal-title":"Journal of Music Theory"},{"issue":"4","key":"9877_CR5","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1109\/TASSP.1980.1163420","volume":"28","author":"S Davis","year":"1980","unstructured":"Davis, S., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(4), 357\u2013366.","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing"},{"key":"9877_CR6","doi-asserted-by":"publisher","DOI":"10.1159\/000093184","author":"DD Deliyski","year":"2006","unstructured":"Deliyski, D. D., Shaw, H. S., Evans, M. K., & Vesselinov, R. (2006). Regression tree approach to studying factors influencing acoustic voice analysis. Folia Phoniatrica Et Logopedica. https:\/\/doi.org\/10.1159\/000093184","journal-title":"Folia Phoniatrica Et Logopedica"},{"key":"9877_CR7","doi-asserted-by":"publisher","DOI":"10.1121\/1.4950018","author":"M Dickerson","year":"2016","unstructured":"Dickerson, M. (2016). Investigating the feasibility of using mobile devices for remote noise monitoring and data acquisition. Journal of the Acoustical Society of America. https:\/\/doi.org\/10.1121\/1.4950018","journal-title":"Journal of the Acoustical Society of America"},{"key":"9877_CR8","unstructured":"Ellis, D. P. W. (2007). Classifying music audio with timbral and chroma features."},{"key":"9877_CR9","unstructured":"Faber, B. M. (2017). Acoustical measurements with smartphones\u202f: Possibilities and limitations. Acoutics Today."},{"key":"9877_CR10","doi-asserted-by":"publisher","unstructured":"Ghosal, D., & Kolekar, M. H. (2018). Music genre recognition using deep neural networks and transfer learning. In Proc. annu. conf. int. speech commun. assoc. INTERSPEECH, vol. 2018-Septe, no. September, pp. 2087\u20132091. https:\/\/doi.org\/10.21437\/Interspeech.2018-2045.","DOI":"10.21437\/Interspeech.2018-2045"},{"key":"9877_CR11","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvoice.2016.06.006","author":"NW Hashim","year":"2017","unstructured":"Hashim, N. W., Wilkes, M., Salomon, R., Meggs, J., & France, D. J. (2017). Evaluation of voice acoustics as predictors of clinical depression scores. Journal of Voice. https:\/\/doi.org\/10.1016\/j.jvoice.2016.06.006","journal-title":"Journal of Voice"},{"key":"9877_CR12","unstructured":"Jiang, D.-N., Lu, L., Zhang, H.-J., Tao, J.-H., & Cai, L.-H. (2002). Music type classification by spectral contrast feature. In IEEE Int. Conf. Multimed. Expo, Vol. 1, 113\u2013116."},{"key":"9877_CR13","doi-asserted-by":"publisher","DOI":"10.1044\/jshr.3404.781","author":"MP Karnell","year":"1991","unstructured":"Karnell, M. P., Scherer, R. S., & Fischer, L. B. (1991). Comparison of acoustic voice perturbation measures among three independent voice laboratories. Journal of Speech and Hearing Research. https:\/\/doi.org\/10.1044\/jshr.3404.781","journal-title":"Journal of Speech and Hearing Research"},{"key":"9877_CR14","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvoice.2014.11.006","author":"JS Kisenwether","year":"2015","unstructured":"Kisenwether, J. S., & Sataloff, R. T. (2015). The effect of microphone type on acoustical measures of synthesized vowels. Journal of Voice. https:\/\/doi.org\/10.1016\/j.jvoice.2014.11.006","journal-title":"Journal of Voice"},{"key":"9877_CR15","doi-asserted-by":"publisher","DOI":"10.1590\/2317-6431-2018-2113","author":"VM Krik","year":"2019","unstructured":"Krik, V. M., Ribeiro, V. V., Siqueira, L. T. D., Rosa, M. D. O., & Leite, A. P. D. (2019). An\u00e1lise ac\u00fastica da voz: compara\u00e7\u00e3o entre dois tipos de microfones. Audiology Communication Research. https:\/\/doi.org\/10.1590\/2317-6431-2018-2113","journal-title":"Audiology Communication Research"},{"key":"9877_CR16","unstructured":"Mcfee, B., et al. (2015). Librosa\u2014audio processing Python library. In Proc. 14th python sci. conf."},{"key":"9877_CR17","doi-asserted-by":"publisher","unstructured":"M\u00fcller, M., Ewert, S., & Kreuzer, S. (2009). Making chroma features more robust to timbre changes. https:\/\/doi.org\/10.1109\/ICASSP.2009.4959974.","DOI":"10.1109\/ICASSP.2009.4959974"},{"key":"9877_CR18","doi-asserted-by":"crossref","unstructured":"Pan, Y., & Waibel, A. (2000). The effects of room acoustics on MFCC speech parameter.","DOI":"10.21437\/ICSLP.2000-768"},{"key":"9877_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/S0892-1997(01)00035-2","author":"V Parsa","year":"2001","unstructured":"Parsa, V., Jamieson, D. G., & Pretty, B. R. (2001). Effects of microphone type on acoustic measures of voice. Journal of Voice. https:\/\/doi.org\/10.1016\/S0892-1997(01)00035-2","journal-title":"Journal of Voice"},{"key":"9877_CR20","doi-asserted-by":"publisher","DOI":"10.1121\/1.4950021","author":"S Sinha","year":"2016","unstructured":"Sinha, S., et al. (2016). Real-time sound measurements of exercise classes with mobile app demonstrate excessive noise exposure. Journal of the Acoustical Society of America. https:\/\/doi.org\/10.1121\/1.4950021","journal-title":"Journal of the Acoustical Society of America"},{"key":"9877_CR21","unstructured":"Slaney, M. (1993). Auditory toolbox. Apple Comput. Co. Apple Tech. Rep."},{"key":"9877_CR22","unstructured":"Styler, W. (2013). Using Praat for linguistic research. Savevowels."},{"key":"9877_CR23","doi-asserted-by":"publisher","DOI":"10.1016\/j.apacoust.2019.107050","author":"Y Su","year":"2020","unstructured":"Su, Y., Zhang, K., Wang, J., Zhou, D., & Madani, K. (2020). Performance analysis of multiple aggregated acoustic features for environment sound classification. Applied Acoustics. https:\/\/doi.org\/10.1016\/j.apacoust.2019.107050","journal-title":"Applied Acoustics"},{"key":"9877_CR24","doi-asserted-by":"publisher","DOI":"10.1044\/1058-0360(2010\/09-0091)","author":"JG \u0160vec","year":"2010","unstructured":"\u0160vec, J. G., & Granqvist, S. (2010). Guidelines for selecting microphones for human voice production research. American Journal of Speech-Language Pathology. https:\/\/doi.org\/10.1044\/1058-0360(2010\/09-0091)","journal-title":"American Journal of Speech-Language Pathology."},{"key":"9877_CR25","doi-asserted-by":"publisher","DOI":"10.1044\/jshr.3606.1177","author":"R Titze","year":"1993","unstructured":"Titze, R., & Winholtz, W. S. (1993). Effect of microphone type and placement on voice perturbation measurements. Journal of Speech and Hearing Research. https:\/\/doi.org\/10.1044\/jshr.3606.1177","journal-title":"Journal of Speech and Hearing Research"}],"container-title":["International Journal of Speech Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10772-021-09877-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10772-021-09877-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10772-021-09877-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,6]],"date-time":"2023-01-06T04:13:15Z","timestamp":1672978395000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10772-021-09877-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,2]]},"references-count":25,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["9877"],"URL":"https:\/\/doi.org\/10.1007\/s10772-021-09877-1","relation":{},"ISSN":["1381-2416","1572-8110"],"issn-type":[{"value":"1381-2416","type":"print"},{"value":"1572-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,2]]},"assertion":[{"value":"23 December 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 July 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 August 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interest"}},{"value":"All procedures performed in studies involving human participants were in accordance with the ethical standards and has been approved by the IIUM Research Ethical Committee (IREC 2019-006). Subject Informed Consent (SIC) was also obtained from all individual participants included in the study.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval"}}]}}