{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,3,3]],"date-time":"2024-03-03T14:51:31Z","timestamp":1709477491361},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int J Speech Technol"],"published-print":{"date-parts":[[2012,9]]},"DOI":"10.1007\/s10772-012-9165-1","type":"journal-article","created":{"date-parts":[[2012,6,28]],"date-time":"2012-06-28T09:01:47Z","timestamp":1340874107000},"page":"295-311","source":"Crossref","is-referenced-by-count":5,"title":["TEO-based speaker stress assessment using hybrid classification and tracking schemes"],"prefix":"10.1007","volume":"15","author":[{"given":"John H. L.","family":"Hansen","sequence":"first","affiliation":[]},{"given":"Evan","family":"Ruzanski","sequence":"additional","affiliation":[]},{"given":"Hynek","family":"Bo\u0159il","sequence":"additional","affiliation":[]},{"given":"James","family":"Meyerhoff","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,6,29]]},"reference":[{"key":"9165_CR1","first-page":"97","volume":"17","author":"P. Boersma","year":"1993","unstructured":"Boersma, P. (1993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. Proceedings - Instituut Voor Fonetische Wetenschappen, Universiteit Van Amsterdam, 17, 97\u2013110.","journal-title":"Proceedings - Instituut Voor Fonetische Wetenschappen, Universiteit Van Amsterdam"},{"key":"9165_CR2","unstructured":"Bo\u0159il, H. (2008). Robust speech recognition: Analysis and equalization of Lombard effect in Czech corpora. Ph.D. thesis, Czech Technical University in Prague, Czech Republic. http:\/\/www.utdallas.edu\/~hxb076000 ."},{"key":"9165_CR3","doi-asserted-by":"crossref","first-page":"1379","DOI":"10.1109\/TASL.2009.2034770","volume":"18","author":"H. 
Bo\u0159il","year":"2010","unstructured":"Bo\u0159il, H., & Hansen, J. H. L. (2010). Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments. IEEE Transactions on Audio, Speech, and Language Processing, 18, 1379\u20131393.","journal-title":"IEEE Transactions on Audio, Speech, and Language Processing"},{"key":"9165_CR4","doi-asserted-by":"crossref","first-page":"502","DOI":"10.21437\/Interspeech.2010-208","volume-title":"Interspeech\u201910","author":"H. Bo\u0159il","year":"2010","unstructured":"Bo\u0159il, H., Sadjadi, O., Kleinschmidt, T., & Hansen, J. H. L. (2010). Analysis and detection of cognitive load and frustration in drivers\u2019 speech. In Interspeech\u201910, Makuhari, Chiba, Japan (pp. 502\u2013505)."},{"key":"9165_CR5","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/978-1-4419-9607-7_1","volume-title":"Digital signal processing for in-vehicle systems and safety","author":"H. Bo\u0159il","year":"2012","unstructured":"Bo\u0159il, H., Boyraz, P., & Hansen, J. H. L. (2012). Towards multi-modal driver\u2019s stress detection. In J. Hansen, P. Boyraz, K. Takeda & H. Abut (Eds.), Digital signal processing for in-vehicle systems and safety (pp. 3\u201320). New York: Springer."},{"key":"9165_CR6","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1109\/89.668815","volume":"6","author":"S. Bou-Ghazale","year":"1998","unstructured":"Bou-Ghazale, S., & Hansen, J. (1998). HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress. IEEE Transactions on Speech and Audio Processing, 6, 201\u2013216.","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"key":"9165_CR7","doi-asserted-by":"crossref","DOI":"10.1007\/b97391","volume-title":"Introduction to time series and forecasting","author":"P. Brockwell","year":"2002","unstructured":"Brockwell, P., & Davis, R. (2002). Introduction to time series and forecasting. 
New York: Springer."},{"key":"9165_CR8","doi-asserted-by":"crossref","first-page":"3392","DOI":"10.1121\/1.410601","volume":"96","author":"D. A. Cairns","year":"1994","unstructured":"Cairns, D. A., & Hansen, J. H. L. (1994). Nonlinear analysis and classification of speech under stressed conditions. The Journal of the Acoustical Society of America, 96, 3392\u20133400.","journal-title":"The Journal of the Acoustical Society of America"},{"key":"9165_CR9","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1109\/TASSP.1980.1163420","volume":"28","author":"S. B. Davis","year":"1980","unstructured":"Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28, 357\u2013366.","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing"},{"key":"9165_CR10","unstructured":"Hansen, J. H. L. (1988). Analysis and compensation of stressed and noisy speech with application to robust automatic recognition. Ph.D. thesis, Dept. of Elect. Eng., Georgia Institute of Technology, Atlanta, GA."},{"key":"9165_CR11","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/S0167-6393(96)00050-7","volume":"20","author":"J. H. L. Hansen","year":"1996","unstructured":"Hansen, J. H. L. (1996). Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Communication, 20, 151\u2013173.","journal-title":"Speech Communication"},{"key":"9165_CR12","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1109\/TASL.2008.2009019","volume":"17","author":"J. Hansen","year":"2009","unstructured":"Hansen, J., & Varadarajan, V. (2009). Analysis and compensation of Lombard speech across noise type and levels with application to in-set\/out-of-set speaker recognition. 
IEEE Transactions on Audio, Speech, and Language Processing, 17, 366\u2013378.","journal-title":"IEEE Transactions on Audio, Speech, and Language Processing"},{"key":"9165_CR13","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1007\/978-1-4614-0263-3_5","volume-title":"Forensic speaker recognition: law enforcement and counter-terrorism","author":"J. Hansen","year":"2012","unstructured":"Hansen, J., Sangwan, A., & Kim, W. (2012). Speech under stress and Lombard effect: impact and solutions for forensic speaker recognition. In H. Patil & A. Neustein (Eds.), Forensic speaker recognition: law enforcement and counter-terrorism (pp. 103\u2013123). New York: Springer."},{"key":"9165_CR14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/AERO.2007.352975","volume-title":"IEEE aerospace conference","author":"A. Ikeno","year":"2007","unstructured":"Ikeno, A., Varadarajan, V., Patil, S., & Hansen, J. (2007). UT-Scope: speech under Lombard effect and cognitive stress. In IEEE aerospace conference (pp. 1\u20137)."},{"key":"9165_CR15","first-page":"381","volume-title":"Proceedings of IEEE international conference on acoustics, speech, and signal processing 1990 (ICASSP\u201990)","author":"J. Kaiser","year":"1990","unstructured":"Kaiser, J. (1990a). On a simple algorithm to calculate the \u2018energy\u2019 of a signal. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 1990 (ICASSP\u201990), Albuquerque, NM (Vol.\u00a01, pp. 381\u2013384)."},{"key":"9165_CR16","volume-title":"Proc. 4th IEEE digital signal processing workshop","author":"J. Kaiser","year":"1990","unstructured":"Kaiser, J. (1990b). On Teager\u2019s energy algorithm and its generalization to continuous signals. In Proc. 4th IEEE digital signal processing workshop, Mohonk, NY."},{"key":"9165_CR17","doi-asserted-by":"crossref","first-page":"1475","DOI":"10.1016\/0024-3205(96)00118-X","volume":"58","author":"C. 
Kirschbaum","year":"1996","unstructured":"Kirschbaum, C., Wolf, O. T., May, M., Wippich, W., & Hellhammer, D. H. (1996). Stress- and treatment-induced elevations of cortisol levels associated with impaired declarative memory in healthy adults. Life Sciences, 58, 1475\u20131483.","journal-title":"Life Sciences"},{"key":"9165_CR18","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1097\/00006842-198805000-00007","volume":"50","author":"J. L. Meyerhoff","year":"1988","unstructured":"Meyerhoff, J. L., Oleshansky, M. A., & Mougey, E. H. (1988). Psychological stress increases plasma levels of prolactin, cortisol and POMC-derived peptides in man. Psychosomatic Medicine, 50, 295\u2013303.","journal-title":"Psychosomatic Medicine"},{"key":"9165_CR19","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1196\/annals.1314.031","volume":"1032","author":"J. L. Meyerhoff","year":"2004","unstructured":"Meyerhoff, J. L., Norris, W., Saviolakis, G., Wollert, T., Burge, B., Atkins, V., & Spielberger, C. (2004). Evaluating performance of federal law enforcement personnel during a stressful training scenario. Annals of the New York Academy of Sciences, 1032, 250\u2013253.","journal-title":"Annals of the New York Academy of Sciences"},{"key":"9165_CR20","volume-title":"Digital signal processing: a computer-based approach","author":"S. K. Mitra","year":"2001","unstructured":"Mitra, S. K. (2001). Digital signal processing: a computer-based approach (2nd ed.). New York: McGraw Hill.","edition":"2"},{"key":"9165_CR21","doi-asserted-by":"crossref","unstructured":"Patil, S. A., & Hansen, J. H. (2010). The physiological microphone (PMIC): a competitive alternative for speaker assessment in stress detection and speaker verification. Speech Communication, 327\u2013340.","DOI":"10.1016\/j.specom.2009.11.006"},{"key":"9165_CR22","first-page":"2021","volume-title":"ICSLP\u201302","author":"M. 
Rahurkar","year":"2002","unstructured":"Rahurkar, M., Hansen, J., Oleshansky, M., Meyerhoff, J., & Koenig, M. (2002). Frequency band analysis for stress detection using a Teager energy operator-based feature. In ICSLP\u201302, Denver, CO (pp. 2021\u20132024)."},{"key":"9165_CR23","first-page":"733","volume-title":"Proc. of ICASSP\u201886","author":"P. Rajasekaran","year":"1986","unstructured":"Rajasekaran, P., Doddington, G., & Picone, J. (1986). Recognition of speech under stress and in noise. In Proc. of ICASSP\u201886, Tokyo, Japan (Vol.\u00a011, pp. 733\u2013736)."},{"key":"9165_CR24","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1006\/dspr.1999.0361","volume":"10","author":"D. A. Reynolds","year":"2000","unstructured":"Reynolds, D. A., Quatieri, T. F., & Dunn, R. B. (2000). Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 10, 19\u201341.","journal-title":"Digital Signal Processing"},{"key":"9165_CR25","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1109\/ICASSP.2005.1415124","volume-title":"Proceedings of IEEE international conference on acoustics, speech, and signal processing 2005 (ICASSP\u201905)","author":"E. Ruzanski","year":"2005","unstructured":"Ruzanski, E., Hansen, J., Meyerhoff, J., Saviolakis, G., & Koenig, M. (2005). Effects of phoneme characteristics on teo feature-based automatic stress detection in speech. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 2005 (ICASSP\u201905), Philadelphia, PA (Vol.\u00a01, pp. 357\u2013360)."},{"key":"9165_CR26","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1037\/1076-8998.1.2.170","volume":"1","author":"T. Saunders","year":"1996","unstructured":"Saunders, T., Driskell, J., Johnston, J., & Salas, E. (1996). The effect of stress inoculation training on anxiety and performance. 
Journal of Occupational Health Psychology, 1, 170\u2013186.","journal-title":"Journal of Occupational Health Psychology"},{"key":"9165_CR27","volume-title":"Foundation of modern auditory theory","author":"B. Scharf","year":"1970","unstructured":"Scharf, B. (1970). Critical bands. In V. Tobias (Ed.), Foundation of modern auditory theory. New York: Academic Press."},{"key":"9165_CR28","unstructured":"Schwarz, P. (2009). Phoneme recognition based on long temporal context. Ph.D. thesis, Brno University of Technology, Czech Republic."},{"key":"9165_CR29","first-page":"333","volume-title":"Proceedings of IEEE international conference on acoustics, speech, and signal processing 2002 (ICASSP\u201902)","author":"X. Sun","year":"2002","unstructured":"Sun, X. (2002). Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 2002 (ICASSP\u201902), Orlando, FL (Vol.\u00a01, pp. 333\u2013336)."},{"key":"9165_CR30","first-page":"495","volume-title":"Speech coding and synthesis","author":"D. Talkin","year":"1995","unstructured":"Talkin, D. (1995). A robust algorithm for pitch tracking (RAPT). In W.\u00a0B. Kleijn & K.\u00a0K. Paliwal (Eds.), Speech coding and synthesis (pp. 495\u2013518). Amsterdam: Elsevier."},{"key":"9165_CR31","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1109\/TASSP.1980.1163453","volume":"28","author":"H. M. Teager","year":"1980","unstructured":"Teager, H. M. (1980). Some observations on oral air flow during phonation. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28, 599\u2013601.","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing"},{"key":"9165_CR32","first-page":"241","volume":"55","author":"H. Teager","year":"1989","unstructured":"Teager, H., & Teager, S. (1989). Evidence for nonlinear production mechanisms in the vocal tract. 
Speech Production and Speech Modelling, 55, 241\u2013261.","journal-title":"Speech Production and Speech Modelling"},{"key":"9165_CR33","volume-title":"Pattern recognition","author":"S. Theodoridis","year":"2003","unstructured":"Theodoridis, S., & Koutroumbas, K. (2003). Pattern recognition (2nd ed.). Amsterdam: Elsevier.","edition":"2"},{"key":"9165_CR34","doi-asserted-by":"crossref","first-page":"668","DOI":"10.1109\/89.799692","volume":"7","author":"B. Womack","year":"1999","unstructured":"Womack, B., & Hansen, J. (1999). N-channel hidden Markov models for combined stress speech classification and recognition. IEEE Transactions on Speech and Audio Processing, 7, 668\u2013677.","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"key":"9165_CR35","volume-title":"Fundamentals of hearing","author":"W. Yost","year":"1994","unstructured":"Yost, W. (1994). Fundamentals of hearing. New York: Academic Press."},{"key":"9165_CR36","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1109\/89.905995","volume":"9","author":"G. Zhou","year":"2001","unstructured":"Zhou, G., Hansen, J., & Kaiser, J. (2001). Nonlinear feature-based classification of speech under stress. 
IEEE Transactions on Speech and Audio Processing, 9, 201\u2013216.","journal-title":"IEEE Transactions on Speech and Audio Processing"}],"container-title":["International Journal of Speech Technology"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.springerlink.com\/index\/pdf\/10.1007\/s10772-012-9165-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,1,19]],"date-time":"2022-01-19T20:09:14Z","timestamp":1642622954000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10772-012-9165-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,6,29]]},"references-count":36,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2012,9]]}},"alternative-id":["9165"],"URL":"https:\/\/doi.org\/10.1007\/s10772-012-9165-1","relation":{},"ISSN":["1381-2416","1572-8110"],"issn-type":[{"value":"1381-2416","type":"print"},{"value":"1572-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,6,29]]}}}