{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,30]],"date-time":"2025-05-30T04:12:12Z","timestamp":1748578332688,"version":"3.41.0"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2015,8,20]],"date-time":"2015-08-20T00:00:00Z","timestamp":1440028800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EURASIP J. Adv. Signal Process."],"published-print":{"date-parts":[[2015,12]]},"DOI":"10.1186\/s13634-015-0259-1","type":"journal-article","created":{"date-parts":[[2015,8,19]],"date-time":"2015-08-19T11:29:30Z","timestamp":1439983770000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization"],"prefix":"10.1186","volume":"2015","author":[{"given":"Sami","family":"Keronen","sequence":"first","affiliation":[]},{"given":"Heikki","family":"Kallasjoki","sequence":"additional","affiliation":[]},{"given":"Kalle J.","family":"Palom\u00e4ki","sequence":"additional","affiliation":[]},{"given":"Guy J.","family":"Brown","sequence":"additional","affiliation":[]},{"given":"Jort F.","family":"Gemmeke","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2015,8,20]]},"reference":[{"issue":"6","key":"259_CR1","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1109\/MSP.2012.2205597","volume":"29","author":"G Hinton","year":"2012","unstructured":"G Hinton, L Deng, D Yu, G Dahl, A Mohamed, N Jaitly, A Senior, V Vanhoucke, T Sainath, B Kingsbury, Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Proc. Mag. 29(6), 82\u201397 (2012).","journal-title":"IEEE Signal Proc. Mag."},{"key":"259_CR2","doi-asserted-by":"crossref","unstructured":"JT Geiger, JF Gemmeke, B Schuller, G Rigoll, in Proc. INTERSPEECH. Investigating NMF speech enhancement for neural network based acoustic models (IEEE Singapore, Singapore, 2014).","DOI":"10.21437\/Interspeech.2014-229"},{"key":"259_CR3","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1109\/LSP.2008.2002708","volume":"15","author":"S Thomas","year":"2008","unstructured":"S Thomas, S Ganapathy, H Hermansky, Recognition of reverberant speech using frequency domain linear prediction. IEEE Signal Proc. Let. 15, 681\u2013684 (2008).","journal-title":"IEEE Signal Proc. Let."},{"key":"259_CR4","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1016\/S0167-6393(98)00032-6","volume":"25","author":"B Kingsbury","year":"1998","unstructured":"B Kingsbury, N Morgan, S Greenberg, Robust speech recognition using the modulation spectrogram. Speech Commun. 25, 117\u2013132 (1998).","journal-title":"Speech Commun."},{"issue":"1\u20132","key":"259_CR5","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1016\/j.specom.2004.02.005","volume":"43","author":"KJ Palom\u00e4ki","year":"2004","unstructured":"KJ Palom\u00e4ki, GJ Brown, JP Barker, Techniques for handling convolutional distortion with \u2018missing data\u2019 automatic speech recognition. Speech Commun. 43(1\u20132), 123\u2013142 (2004).","journal-title":"Speech Commun."},{"key":"259_CR6","unstructured":"F Weninger, S Watanabe, J Le Roux, JR Hershey, Y Tachioka, J Geiger, B Schuller, G Rigoll, in Proc. REVERB Workshop (REVERB\u201914). The MERL\/MELCO\/TUM system for the REVERB Challenge using deep recurrent neural network feature enhancement (Florence, Italy, 2014)."},{"key":"259_CR7","unstructured":"JT Geiger, E Marchi, B Schuller, G Rigoll, in Proc. REVERB Workshop (REVERB\u201914). The TUM system for the REVERB Challenge: recognition of reverberated speech using multi-channel correlation shaping dereverberation and BLSTM recurrent neural networks (Florence, Italy, 2014)."},{"issue":"7","key":"259_CR8","doi-asserted-by":"publisher","first-page":"1676","DOI":"10.1109\/TASL.2010.2050511","volume":"18","author":"A Sehr","year":"2010","unstructured":"A Sehr, R Maas, W Kellermann, Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition. IEEE Trans. Audio, Speech, Language Process. 18(7), 1676\u20131691 (2010).","journal-title":"IEEE Trans. Audio, Speech, Language Process."},{"key":"259_CR9","unstructured":"DD Lee, HS Seung, in Adv. Neur. In. 13, ed. by TK Leen, TG Dietterich, and V Tresp. Algorithms for non-negative matrix factorization (MIT PressCambridge, 2001), pp. 556\u2013562."},{"issue":"3","key":"259_CR10","doi-asserted-by":"publisher","first-page":"1066","DOI":"10.1109\/TASL.2006.885253","volume":"15","author":"T Virtanen","year":"2007","unstructured":"T Virtanen, Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE T. Audio Speech. 15(3), 1066\u20131074 (2007).","journal-title":"IEEE T. Audio Speech"},{"key":"259_CR11","doi-asserted-by":"crossref","unstructured":"P Smaragdis, JC Brown, in IEEE Workshop Applicat. Signal Process. Audio and Acoust. Non-negative matrix factorization for polyphonic music transcription (IEEENew Paltz, NY, USA, 2003), pp. 177\u2013180.","DOI":"10.1109\/ASPAA.2003.1285860"},{"key":"259_CR12","doi-asserted-by":"crossref","unstructured":"KW Wilson, B Raj, P Smaragdis, A Divakaran, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Speech denoising using nonnegative matrix factorization with priors (IEEELas Vegas, NV, USA, 2008), pp. 4029\u20134032.","DOI":"10.1109\/ICASSP.2008.4518538"},{"issue":"7","key":"259_CR13","doi-asserted-by":"publisher","first-page":"2067","DOI":"10.1109\/TASL.2011.2112350","volume":"19","author":"JF Gemmeke","year":"2011","unstructured":"JF Gemmeke, T Virtanen, A Hurmalainen, Exemplar-based sparse representations for noise robust automatic speech recognition. IEEE T. Audio Speech. 19(7), 2067\u20132080 (2011).","journal-title":"IEEE T. Audio Speech"},{"key":"259_CR14","doi-asserted-by":"crossref","unstructured":"P Smaragdis, in Independent Component Analysis and Blind Signal Separation. Lecture Notes in Computer Science, 3195, ed. by CG Puntonet, A Prieto. Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs (SpringerBerlin Heidelberg, 2004), pp. 494\u2013499.","DOI":"10.1007\/978-3-540-30110-3_63"},{"key":"259_CR15","doi-asserted-by":"crossref","unstructured":"H Kameoka, T Nakatani, T Yoshioka, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms (IEEETaipei, Taiwan, 2009), pp. 45\u201348.","DOI":"10.1109\/ICASSP.2009.4959516"},{"key":"259_CR16","doi-asserted-by":"crossref","unstructured":"K Kumar, R Singh, B Raj, R Stern, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Gammatone sub-band magnitude-domain dereverberation for ASR (IEEEPrague, Czech Republic, 2011), pp. 4604\u20134607.","DOI":"10.1109\/ICASSP.2011.5947380"},{"key":"259_CR17","unstructured":"H Kallasjoki, JF Gemmeke, KJ Palom\u00e4ki, AV Beeston, GJ Brown, in Proc. REVERB Workshop (REVERB\u201914). Recognition of reverberant speech by missing data imputation and NMF feature enhancement (Florence, Italy, 2014)."},{"key":"259_CR18","unstructured":"K Palom\u00e4ki, H Kallasjoki, in Proc. REVERB Workshop (REVERB\u201914). Reverberation robust speech recognition by matching distributions of spectrally and temporally decorrelated features (Florence, Italy, 2014)."},{"key":"259_CR19","doi-asserted-by":"crossref","unstructured":"U Remes, in Proc. INTERSPEECH. Bounded conditional mean imputation with an approximate posterior (ISCALyon, France, 2013), pp. 3007\u20133011.","DOI":"10.21437\/Interspeech.2013-279"},{"key":"259_CR20","unstructured":"AV Beeston, GJ Brown, in UK Speech Conf. Modelling reverberation compensation effects in time-forward and time-reversed rooms (Cambridge, UK, 2013)."},{"key":"259_CR21","doi-asserted-by":"crossref","unstructured":"S Dharanipragada, M Padmanabhan, in Proc. Int. Conf. Spoken Lang. Process. (ICSLP). A non-linear unsupervised adaptation technique for speech recognition (ISCABeijing, 2000).","DOI":"10.21437\/ICSLP.2000-872"},{"key":"259_CR22","doi-asserted-by":"crossref","unstructured":"K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, E Habets, R Haeb-Umbach, V Leutnant, A Sehr, W Kellermann, R Maas, S Gannot, B Raj, in Proc. IEEE Workshop Applicat. Signal Process. Audio and Acoust. (WASPAA). The REVERB challenge: a common evaluation framework for dereverberation and recognition of reverberant speech (IEEENew Paltz, NY, USA, 2013).","DOI":"10.1109\/WASPAA.2013.6701894"},{"key":"259_CR23","unstructured":"D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, M Hannemann, P Motlicek, Y Qian, P Schwarz, J Silovsky, G Stemmer, K Vesely, in IEEE Automat. Speech Recognition and Understanding Workshop. The Kaldi speech recognition toolkit (IEEEWaikoloa, HI, USA, 2011)."},{"issue":"3","key":"259_CR24","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1016\/S0734-189X(87)80186-X","volume":"39","author":"SM Pizer","year":"1987","unstructured":"SM Pizer, EP Amburn, JD Austin, R Cromartie, A Geselowitz, T Greer, JB Zimmerman, K Zuiderveld, Adaptive histogram equalization and its variations. Comput. Vision Graph. 39(3), 355\u2013368 (1987).","journal-title":"Comput. Vision Graph."},{"key":"259_CR25","doi-asserted-by":"crossref","unstructured":"G Saon, S Dharanipragada, D Povey, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP), 1. Feature space Gaussianization (IEEEMontreal, Canada, 2004), pp. 329\u2013332.","DOI":"10.1109\/ICASSP.2004.1325989"},{"key":"259_CR26","unstructured":"CB Moler, Numerical Computing with MATLAB, Revised Reprint Paperback (Society of Industrial and Applied Mathematics, Philadelphia, Pennsylvania, 2008)."},{"key":"259_CR27","unstructured":"KJ Palom\u00e4ki, GJ Brown, JP Barker, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Recognition of reverberant speech using full cepstral features and spectral missing data (IEEEToulouse, France, 2006)."},{"key":"259_CR28","unstructured":"T Robinson, J Fransen, D Pye, J Foote, S Renals, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). WSJCAM0: a British English speech corpus for large vocabulary continuous speech recognition (IEEEDetroit, MI, USA, 1995)."},{"key":"259_CR29","doi-asserted-by":"crossref","unstructured":"M Lincoln, I McCowan, J Vepa, HK Maganti, in IEEE Automat. Speech Recognition and Understanding Workshop. The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): Specification and initial experiments (IEEECanc\u00fan, Mexico, 2005).","DOI":"10.1109\/ASRU.2005.1566470"},{"key":"259_CR30","unstructured":"Y Tachioka, T Narita, F Weninger, S Watanabe, in Proc. REVERB Workshop (REVERB\u201914). Dual system combination approach for various reverberant environments with dereverberation techniques (Florence, Italy, 2014)."},{"key":"259_CR31","doi-asserted-by":"crossref","unstructured":"D Povey, K Yao, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). A basis method for robust estimation of constrained MLLR (IEEEPrague, Czech Republic, 2011), pp. 4460\u20134463.","DOI":"10.1109\/ICASSP.2011.5947344"},{"key":"259_CR32","doi-asserted-by":"crossref","unstructured":"D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, K Visweswariah, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Boosted MMI for model and feature-space discriminative training (IEEELas Vegas, NV, USA, 2008), pp. 4057\u20134060.","DOI":"10.1109\/ICASSP.2008.4518545"},{"issue":"4","key":"259_CR33","doi-asserted-by":"publisher","first-page":"802","DOI":"10.1016\/j.csl.2011.03.001","volume":"25","author":"H Xu","year":"2011","unstructured":"H Xu, D Povey, L Mangu, J Zhu, Minimum Bayes risk decoding and system combination based on a recursion for edit distance. Comput. Speech Lang. 25(4), 802\u2013828 (2011).","journal-title":"Comput. Speech Lang."},{"key":"259_CR34","doi-asserted-by":"crossref","unstructured":"X Zhang, J Trmal, D Povey, S Khudanpur, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Improving deep neural network acoustic models using generalized maxout networks (IEEEFlorence, Italy, 2014).","DOI":"10.1109\/ICASSP.2014.6853589"},{"key":"259_CR35","doi-asserted-by":"crossref","unstructured":"B Kingsbury, in Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP). Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling (IEEETaipei, Taiwan, 2009), pp. 3761\u20133764.","DOI":"10.1109\/ICASSP.2009.4960445"},{"key":"259_CR36","unstructured":"MF Font, Multi-microphone signal processing for automatic speech recognition in meeting rooms. Master\u2019s thesis, Universitat Polit\u00e8cnica de Catalunya, Spain, 2005."},{"issue":"4","key":"259_CR37","doi-asserted-by":"publisher","first-page":"320","DOI":"10.1109\/TASSP.1976.1162830","volume":"24","author":"CH Knapp","year":"1976","unstructured":"CH Knapp, GC Carter, The generalized correlation method for estimation of time delay. IEEE T. Acoust. Speech. 24(4), 320\u2013327 (1976).","journal-title":"IEEE T. Acoust. Speech"},{"key":"259_CR38","unstructured":"M Delcoix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, M Espi, T Hori, T Nakatani, A Nakamura, in Proc. REVERB Workshop (REVERB\u201914). Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB Challenge (Florence, Italy, 2014)."}],"container-title":["EURASIP Journal on Advances in Signal Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13634-015-0259-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13634-015-0259-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13634-015-0259-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13634-015-0259-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,30]],"date-time":"2025-05-30T02:15:59Z","timestamp":1748571359000},"score":1,"resource":{"primary":{"URL":"https:\/\/asp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13634-015-0259-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,8,20]]},"references-count":38,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,12]]}},"alternative-id":["259"],"URL":"https:\/\/doi.org\/10.1186\/s13634-015-0259-1","relation":{},"ISSN":["1687-6180"],"issn-type":[{"type":"electronic","value":"1687-6180"}],"subject":[],"published":{"date-parts":[[2015,8,20]]},"article-number":"76"}}