{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T08:27:35Z","timestamp":1759134455170,"version":"3.38.0"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2011,8,25]],"date-time":"2011-08-25T00:00:00Z","timestamp":1314230400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computing"],"published-print":{"date-parts":[[2012,1]]},"DOI":"10.1007\/s00607-011-0152-1","type":"journal-article","created":{"date-parts":[[2011,8,24]],"date-time":"2011-08-24T05:56:25Z","timestamp":1314165385000},"page":"1-20","source":"Crossref","is-referenced-by-count":2,"title":["Two-stage model-based feature compensation for robust speech recognition"],"prefix":"10.1007","volume":"94","author":[{"given":"Haifeng","family":"Shen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gang","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jun","family":"Guo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2011,8,25]]},"reference":[{"key":"152_CR1","doi-asserted-by":"crossref","first-page":"1738","DOI":"10.1121\/1.399423","volume":"87","author":"H Hermansky","year":"1990","unstructured":"Hermansky H (1990) Perceptual linear predictive (PLP) analysis for speech. J Acoust Soc Am 87: 1738\u20131752","journal-title":"J Acoust Soc Am"},{"key":"152_CR2","doi-asserted-by":"crossref","unstructured":"Hermansky H, Morgan N, Bayya A, Kohn P (1991) Rasta-PLP speech analysis","DOI":"10.1109\/ICASSP.1992.225957"},{"key":"152_CR3","first-page":"S535","volume":"66","author":"MJ Hunt","year":"1979","unstructured":"Hunt MJ (1979) A statistical approach to metrics for word and syllable recognition. J Acoust Soc Am 66: S535\u2013S536","journal-title":"J Acoust Soc Am"},{"key":"152_CR4","unstructured":"Stern RM, Raj B, Moreno PJ (1997) Compensation for environmental degradation in automatic speech recognition. In: Proceedings of ESCA-NATO Tutorial Research Workshop Robust Speech Recognition for Unknown Communication Channels, pp 33\u201342"},{"key":"152_CR5","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1016\/S0167-6393(98)00025-9","volume":"24","author":"PJ Moreno","year":"1998","unstructured":"Moreno PJ, Raj B, Stern RM (1998) Data-driven environmental compensation for speech recognition: a unified approach. Speech Commun 24: 267\u2013285","journal-title":"Speech Commun"},{"key":"152_CR6","doi-asserted-by":"crossref","unstructured":"Moreno PJ, Raj B, Stern RM (1996) A vector Taylor series approach for environment-independent speech recognition. In: Proceedings of ICASSP-96, pp 733\u2013736","DOI":"10.1109\/ICASSP.1996.543225"},{"key":"152_CR7","unstructured":"Moreno PJ (1996) Speech recognition in noisy environments. Ph.D. Dissertation, ECE Department, CMU"},{"key":"152_CR8","doi-asserted-by":"crossref","unstructured":"Raj B, Gouvea EB, Moreno PJ, Stern RM (1996) Cepstral compensation by polynomial approximation for environment-independent speech recognition. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, pp 2340\u20132343","DOI":"10.1109\/ICSLP.1996.607277"},{"issue":"1","key":"152_CR9","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/97.654866","volume":"5","author":"NS Kim","year":"1998","unstructured":"Kim NS (1998) Statistical linear approximation for environment compensation. IEEE Signal Process Lett 5(1): 8\u201310","journal-title":"IEEE Signal Process Lett"},{"key":"152_CR10","unstructured":"Han ZB, Zhang SW, Zhang HY, Xu B (2003) A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR. In: Proceedings of ICASSP-2003, pp 117\u2013120"},{"issue":"3","key":"152_CR11","first-page":"8","volume":"5","author":"NS Kim","year":"1998","unstructured":"Kim NS (1998) Nonstationary environment compensation based on sequential estimation. IEEE Signal Process Lett 5(3): 8\u201310","journal-title":"IEEE Signal Process Lett"},{"key":"152_CR12","doi-asserted-by":"crossref","unstructured":"Deng L, Droppo J, Acero A (2001) Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition. In: Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp 81\u201384","DOI":"10.1109\/ASRU.2001.1034594"},{"key":"152_CR13","doi-asserted-by":"crossref","unstructured":"Shen HF, Liu G, Guo J, Li QX (2005) Two-domain feature compensation for robust speech recognition. In: Proceedings of the ISNN-2005, Lecture Notes in Computer Science, vol 3497, Advance in Neural Network-ISNN 2005, Springer, pp 351\u2013356","DOI":"10.1007\/11427445_57"},{"key":"152_CR14","doi-asserted-by":"crossref","unstructured":"Shen HF, Liu G, Guo J, Huang PM, Li QX (2005) Environment compensation based on maximum a posteriori estimation for improved speech recognition. In: Proceedings of the MICAI-2005, Lecture Notes in Artificial Intelligence, vol 3789, MICAI 2005. Advances in Artificial Intelligence, Springer, pp 854\u2013862 (2005)","DOI":"10.1007\/11579427_87"},{"key":"152_CR15","doi-asserted-by":"crossref","unstructured":"Couvreur C, Hamme HV (2000) Model-based feature enhancement for noisy speech recognition. In: Proceedings of the ICASSP-2000, vol 3, pp 1719\u20131722","DOI":"10.1109\/ICASSP.2000.862083"},{"key":"152_CR16","doi-asserted-by":"crossref","unstructured":"Stouten V, Hamme HV, Demuynck K, Wambacq P (2003) Robust speech recognition using model-based feature enhancement. In: Proceedings of the Eurospeech 2003, pp 17\u201320","DOI":"10.21437\/Eurospeech.2003-5"},{"key":"152_CR17","doi-asserted-by":"crossref","unstructured":"Normandin Y, Cardin R, Mori RD (1994) High-performance connected digit recognition using maximum mutual information estimation. IEEE Trans. Speech Audio Process 2(2):299\u2013311","DOI":"10.1109\/89.279279"},{"issue":"9","key":"152_CR18","doi-asserted-by":"crossref","first-page":"1432","DOI":"10.1109\/29.90371","volume":"36","author":"A Nadas","year":"1998","unstructured":"Nadas A, Nahamoo D, Picheny MA (1998) On a model-robust training method for speech recognition. IEEE Trans Acoust Speech Signal Process 36(9): 1432\u20131436","journal-title":"IEEE Trans Acoust Speech Signal Process"},{"issue":"5","key":"152_CR19","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1109\/18.42209","volume":"35","author":"Y Ephraim","year":"1989","unstructured":"Ephraim Y, Dembo A, Rabiner L (1989) A Minimum discrimination information approach for hidden markov modeling. IEEE Trans Inf Theory 35(5): 1001\u20131013","journal-title":"IEEE Trans Inf Theory"},{"issue":"2","key":"152_CR20","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1109\/89.279278","volume":"2","author":"JL Gauvain","year":"1994","unstructured":"Gauvain JL, Lee CH (1994) Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains. IEEE Trans Speech Audio Process 2(2): 291\u2013298","journal-title":"IEEE Trans Speech Audio Process"},{"issue":"2","key":"152_CR21","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1109\/89.554778","volume":"5","author":"Q Huo","year":"1997","unstructured":"Huo Q, Lee CH (1997) On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate. IEEE Trans Speech Audio Process 5(2): 161\u2013172","journal-title":"IEEE Trans Speech Audio Process"},{"key":"152_CR22","doi-asserted-by":"crossref","unstructured":"Huo Q, Chan C, Lee CH (1995) Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition. IEEE Trans Speech Audio Process 3(5):334\u20133 (1995)","DOI":"10.1109\/89.466661"},{"key":"152_CR23","unstructured":"Huo Q, Chan C, Lee CH (1994) Bayesian learning of the SCHMM parameters for speech recognition. In: Proceedings of the ICASSP-94, vol 1, pp 221\u2013224"},{"key":"152_CR24","unstructured":"Huo Q, Lee CH (1995) A study of on-line quasi-Bayes adaptation for CDHMM-based speech recognition. In: Proceedings of the ICASSP-96, vol 2, pp 705\u2013708"},{"key":"152_CR25","doi-asserted-by":"crossref","unstructured":"Acero A, Deng L, Kristjansson K, Zhang J (2000) HMM adaptation using vector Taylor series for noisy speech recognition. In: Proceedings of the ICSLP 2000","DOI":"10.21437\/ICSLP.2000-672"},{"key":"152_CR26","doi-asserted-by":"crossref","unstructured":"SagaYama S, Yamaguchi Y, Takahashi S, Takahashi J (1997) Jacobian approach to fast acoustic model adaptation. In: Proceedings of the ICASSP-97, Munich, pp 835\u2013838","DOI":"10.1109\/ICASSP.1997.596063"},{"key":"152_CR27","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.specom.2003.08.003","volume":"42","author":"C Cerisara","year":"2004","unstructured":"Cerisara C, Rigazio L, JunQua JC (2004) \u03b1-Jacobian environmental adaptation. Speech Commun 42: 25\u201341","journal-title":"Speech Commun"},{"key":"152_CR28","unstructured":"Gales MJF (1995) Model-based techniques for noise robust speech recognition. Ph.D. thesis, University of Cambridge"},{"key":"152_CR29","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/0167-6393(93)90093-Z","volume":"12","author":"MJF Gales","year":"1993","unstructured":"Gales MJF, Young SJ (1993) Cepstral parameter compensation for HMM recognition in noise. Speech Commun 12: 231\u2013239","journal-title":"Speech Commun"},{"key":"152_CR30","unstructured":"Gales MJF, Young SJ (1995) A fast and flexible implementation of parallel model combination. In: Proceedings of the ICASSP-95, Detroit, pp 133\u2013136"},{"key":"152_CR31","doi-asserted-by":"crossref","unstructured":"Komori Y, Kosaka T, Yamamoto, H, Yamada M (1997) Fast parallel model combination noise adaptation processing. In: Proceedings of the Eurospeech-97, Rhodes, Greece, pp 1523\u20131526","DOI":"10.21437\/Eurospeech.1997-439"},{"key":"152_CR32","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/S0167-6393(98)00029-6","volume":"25","author":"MJF Gales","year":"1998","unstructured":"Gales MJF (1998) Predictive model-based compensation schemes for robust speech recognition. Speech Commun 25: 49\u201374","journal-title":"Speech Commun"},{"key":"152_CR33","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1006\/csla.2000.0137","volume":"14","author":"TH Hwang","year":"2000","unstructured":"Hwang TH, Wang HC (2000) A fast algorithm for parallel model combination for noisy speech recognition. Comput Speech Lang 14: 81\u2013100","journal-title":"Comput Speech Lang"},{"key":"152_CR34","doi-asserted-by":"crossref","unstructured":"Shen HF, Li QX, Guo J, Liu G (2005) HMM parameter adaptation using the truncated first-order VTS and EM algorithm for robust speech recognition. In: Proceedings of the CIS-2005, Lecture Notes in Artificial Intelligence, vol 3801, Springer, pp 979\u2013984","DOI":"10.1007\/11596448_145"},{"issue":"6","key":"152_CR35","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1109\/LSP.2005.847862","volume":"12","author":"NS Kim","year":"2005","unstructured":"Kim NS, Lim W, Stern RM (2005) Feature compensation based on switching linear dynamic model. IEEE Signal Process Lett 12(6): 473\u2013476","journal-title":"IEEE Signal Process Lett"},{"key":"152_CR36","doi-asserted-by":"crossref","unstructured":"Droppo J, Acero A (2004) Noise robust speech recognition with a switching linear dynamic model. In: Proceedings of the ICASSP-2004, Montreal, QC, Canada, pp 953\u2013956","DOI":"10.1109\/ICASSP.2004.1326145"},{"issue":"1","key":"152_CR37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"AP Dempster","year":"1977","unstructured":"Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39(1): 1\u201338","journal-title":"J R Stat Soc B"},{"key":"152_CR38","unstructured":"Zu YQ Issues in the scientific design of the continuous speech database. Available at: http:\/\/www.cass.net.cn\/chinese\/s18_yys\/yuyin\/report\/report_1998.htm"},{"key":"152_CR39","unstructured":"Varga A, Steenneken HJM, Tomilson M, Jones D (1992) The NOISEX\u201392 study on the effect of additive noise on automatic speech recognition. Documentation on the NOISEX-92 CD-ROMs"},{"key":"152_CR40","unstructured":"ETSI standard document. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithm. ETSI document ES 202 050 v1.1.3 (2003-11), Nov. 2003"}],"container-title":["Computing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00607-011-0152-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s00607-011-0152-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s00607-011-0152-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,9]],"date-time":"2025-03-09T06:01:25Z","timestamp":1741500085000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s00607-011-0152-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,8,25]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,1]]}},"alternative-id":["152"],"URL":"https:\/\/doi.org\/10.1007\/s00607-011-0152-1","relation":{},"ISSN":["0010-485X","1436-5057"],"issn-type":[{"type":"print","value":"0010-485X"},{"type":"electronic","value":"1436-5057"}],"subject":[],"published":{"date-parts":[[2011,8,25]]}}}