{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,31]],"date-time":"2025-12-31T00:51:16Z","timestamp":1767142276012,"version":"build-2238731810"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2020,9,17]],"date-time":"2020-09-17T00:00:00Z","timestamp":1600300800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2020,9,17]],"date-time":"2020-09-17T00:00:00Z","timestamp":1600300800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Speech Technol"],"published-print":{"date-parts":[[2020,12]]},"DOI":"10.1007\/s10772-020-09751-6","type":"journal-article","created":{"date-parts":[[2020,9,17]],"date-time":"2020-09-17T11:03:59Z","timestamp":1600340639000},"page":"917-937","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Performance analysis of neural network, NMF and statistical approaches for speech enhancement"],"prefix":"10.1007","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9860-3912","authenticated-orcid":false,"given":"Ravi Kumar","family":"Kandagatla","sequence":"first","affiliation":[]},{"given":"Venkata Subbaiah","family":"Potluri","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,9,17]]},"reference":[{"key":"9751_CR1","unstructured":"Bryan, N. J. & Mysore, G. J. (2013) An efficient posterior regularized latent variable model for interactive sound source separation. In International Conference of Machine Learning (ICML), pp. 208\u2013216"},{"issue":"6","key":"9751_CR2","doi-asserted-by":"publisher","first-page":"1109","DOI":"10.1109\/TASSP.1984.1164453","volume":"32","author":"Y Ephraim","year":"1984","unstructured":"Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 32(6), 1109\u20131121.","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing"},{"issue":"2","key":"9751_CR3","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1109\/TASSP.1985.1164550","volume":"33","author":"Y Ephraim","year":"1985","unstructured":"Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. Proceedings of IEEE Transactions on ASS, 33(2), 443\u2013445.","journal-title":"Proceedings of IEEE Transactions on ASS"},{"key":"9751_CR4","doi-asserted-by":"crossref","unstructured":"Fodor B, Fingscheidt T (2012) MMSE speech enhancement under speech presence uncertainty assuming (generalized) gamma speech prioris throughout. In IEEE International conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4033\u20134036, 25\u201330","DOI":"10.1109\/ICASSP.2012.6288803"},{"issue":"16","key":"9751_CR5","doi-asserted-by":"publisher","first-page":"4199","DOI":"10.1109\/TSP.2014.2336615","volume":"62","author":"T Gerkman","year":"2014","unstructured":"Gerkman, T. (2014). Bayesian estimation of clean speech spectral coefficients given a priori knowledge of the phase. IEEE Transactions on Signal Processing, 62(16), 4199\u20134208.","journal-title":"IEEE Transactions on Signal Processing"},{"issue":"2","key":"9751_CR6","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1109\/LSP.2012.2233470","volume":"20","author":"T Gerkmann","year":"2013","unstructured":"Gerkmann, T., & Krawczyk, M. (2013). MMSE-optimal spectral amplitude estimation given the STFT-phase. IEEE Signal Processing Letters, 20(2), 129\u2013132.","journal-title":"IEEE Signal Processing Letters"},{"issue":"2","key":"9751_CR7","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1109\/MSP.2014.2369251","volume":"32","author":"T Gerkmann","year":"2015","unstructured":"Gerkmann, T., Krawczyk-Becker, M., & Le Roux, J. (2015). Phase processing for single channel speech enhancement. IEEE Signal Processing Magazine, 32(2), 55\u201366.","journal-title":"IEEE Signal Processing Magazine"},{"key":"9751_CR300","unstructured":"Hu, G. (2004). 100 nonspeech environmental sounds, 2004 [Online]. Available: http:\/\/www.cse.ohio-state.edu\/pnl\/corpus\/HuCorpus.html."},{"key":"9751_CR301","doi-asserted-by":"crossref","unstructured":"Hu, G., & Wang D. L., (2010). A tandem algorithm for pitch estimation and voiced speech segregation. IEEE Transactions on Audio, Speech, and Language Processing, 18, 2067\u20132079.","DOI":"10.1109\/TASL.2010.2041110"},{"issue":"4","key":"9751_CR8","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1109\/LSP.2014.2362556","volume":"22","author":"K Kisoo","year":"2015","unstructured":"Kisoo, K., Shin, J. W., & Kim, N. S. (2015). NMF-based speech enhancement using bases update. IEEE Signal Processing Letters, 22(4), 450\u2013454.","journal-title":"IEEE Signal Processing Letters"},{"key":"9751_CR9","unstructured":"Lee, D. D. & Seung, H. S. (2001). Algorithms for non-negative matrix factorization. In Proceedings of Advances in Neural Information Processing Systems (NIPS), pp. 55\u2013562"},{"issue":"5","key":"9751_CR10","doi-asserted-by":"publisher","first-page":"857","DOI":"10.1109\/TSA.2005.851929","volume":"13","author":"PC Loizou","year":"2005","unstructured":"Loizou, P. C., & Member, S. (2005). Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum. SIEEE Transactions Speech and Audio Process., 13(5), 857\u2013869.","journal-title":"SIEEE Transactions Speech and Audio Process."},{"issue":"7","key":"9751_CR11","doi-asserted-by":"publisher","first-page":"1110","DOI":"10.1155\/ASP.2005.1110","volume":"2005","author":"T Lotter","year":"2005","unstructured":"Lotter, T., & Vary, P. (2005). Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model. EURASIP Journal on Advances in Signal Processing, 2005(7), 1110\u20131126.","journal-title":"EURASIP Journal on Advances in Signal Processing"},{"issue":"5","key":"9751_CR12","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1109\/TSA.2005.851927","volume":"13","author":"R Martin","year":"2005","unstructured":"Martin, R. (2005). Speech enhancement based on minimum mean-square error estimation and super-Gaussian priors. IEEE Transactions on Speech and Audio Processing, 13(5), 845\u2013856.","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"key":"9751_CR13","unstructured":"NOIZEUS\u00a0Database: https:\/\/ecs.utdallas.edu\/loizou\/speech\/noizeus\/"},{"key":"9751_CR15","doi-asserted-by":"crossref","unstructured":"Ravi Kumar, K. & Subbaiah, P. V. (2016). Enhancement of noisy speech using sub-band harmonic regeneration and speech presence uncertainty estimator. In IEEE International Conference on Recent Trends in Electronics Information Communication Technology, pp. 456\u2013460","DOI":"10.1109\/RTEICT.2016.7807862"},{"issue":"2","key":"9751_CR25","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1007\/s10772-017-9406-4","volume":"20","author":"K Ravi Kumar","year":"2017","unstructured":"Ravi Kumar, K., & Subbaiah, P. V. (2017). Speech enhancement using MMSE estimation under phase uncertainty. International Journal of Speech Technology, 20(2), 373\u2013385.","journal-title":"International Journal of Speech Technology"},{"key":"9751_CR16","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1016\/j.specom.2017.11.001","volume":"96","author":"K RaviKumar","year":"2018","unstructured":"RaviKumar, K., & Subbaiah, P. V. (2018). Speech enhancement using MMSE estimation of amplitude and complex speech spectral coefficients under phase uncertainty. Speech Communication Journal, 96, 10\u201327.","journal-title":"Speech Communication Journal"},{"issue":"5S","key":"9751_CR14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.35940\/ijitee.J9861.0981119","volume":"8","author":"K Ravi Kumar","year":"2019","unstructured":"Ravi Kumar, K., & Subbaiah, P. V. (2019). Posteriori Regularization based Non-Negative Matrix Factorization approach for Speech Enhancement. International Journal of Innovative Technology and Exploring Engineering, 8(5S), 1\u20136.","journal-title":"International Journal of Innovative Technology and Exploring Engineering"},{"key":"9751_CR17","doi-asserted-by":"publisher","unstructured":"Rehr, R. & Timo, G. (2019). An analysis of noise-aware features in combination with the size and diversity of training data for DNN-based speech enhancement. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) https:\/\/doi.org\/10.1109\/ICASSP.2019.8682991","DOI":"10.1109\/ICASSP.2019.8682991"},{"issue":"12","key":"9751_CR18","doi-asserted-by":"publisher","first-page":"69","DOI":"10.5815\/ijisa.2014.12.10","volume":"6","author":"V Sunnydayal","year":"2014","unstructured":"Sunnydayal, V., Sivaprasad, N., & KishoreKumar, T. (2014). A survey on statistical based single channel speech enhancement techniques. IJISA, 6(12), 69\u201385.","journal-title":"IJISA"},{"key":"9751_CR19","unstructured":"Tashev, I. & Acero, A. (2010). Statistical Modelling of the Speech Signal. In International Workshop on Acoustic, Echo, and Noise Control (IWAENC)"},{"issue":"1","key":"9751_CR20","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1109\/LSP.2013.2291240","volume":"21","author":"Y Xu","year":"2014","unstructured":"Xu, Y., Du, J., Dai, L. R., & Lee, C. H. (2014). An experimental study on speech Enhancement based on deep neural networks. IEEE Signal Processing Letters, 21(1), 65\u201368.","journal-title":"IEEE Signal Processing Letters"},{"issue":"1","key":"9751_CR21","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1109\/TASLP.2014.2364452","volume":"23","author":"Y Xu","year":"2015","unstructured":"Xu, Y., Du, J., Dai, L., & Lee, C. (2015). A regression approach to speech enhancement based on deep neural networks. IEEE\/ACM Transactions on Audio, Speech, and Language Processing, 23(1), 7\u201319.","journal-title":"IEEE\/ACM Transactions on Audio, Speech, and Language Processing"},{"issue":"4","key":"9751_CR22","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1109\/TSA.2005.848883","volume":"13","author":"CH You","year":"2005","unstructured":"You, C. H., Koh, S. N., & Rahardja, S. (2005). \u03b2-order MMSE spectral amplitude estimation for speech enhancement. IEEE Transactions on Speech and Audio Processing, 13(4), 475\u2013486.","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"key":"9751_CR23","doi-asserted-by":"publisher","first-page":"15647","DOI":"10.1007\/s11042-018-6990-5","volume":"78","author":"W Zhou","year":"2018","unstructured":"Zhou, W., Zhu, Z., & Liang, P. (2018). Speech denoising using Bayesian NMF with online base update. Multimedia Tools and Applications, 78, 15647\u201315664.","journal-title":"Multimedia Tools and Applications"}],"updated-by":[{"DOI":"10.1007\/s10772-020-09763-2","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2020,11,4]],"date-time":"2020-11-04T00:00:00Z","timestamp":1604448000000}}],"container-title":["International Journal of Speech Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10772-020-09751-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10772-020-09751-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10772-020-09751-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,7]],"date-time":"2023-10-07T09:16:43Z","timestamp":1696670203000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10772-020-09751-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,17]]},"references-count":26,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["9751"],"URL":"https:\/\/doi.org\/10.1007\/s10772-020-09751-6","relation":{},"ISSN":["1381-2416","1572-8110"],"issn-type":[{"value":"1381-2416","type":"print"},{"value":"1572-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,17]]},"assertion":[{"value":"3 May 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 September 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 September 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 November 2020","order":4,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Correction","order":5,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Two entries were missing in the reference list of the original publication. Hu, G. &amp; Wang D. L. (2010). A tandem algorithm for pitch estimation and voiced speech segregation. <Emphasis Type=\"Italic\">IEEE Transactions on Audio, Speech, and Language Processing,<\/Emphasis> <Emphasis Type=\"Italic\">18<\/Emphasis>, 2067\u20132079 Hu, G. (2004). 100 nonspeech environmental sounds, Available: <ExternalRef><RefSource>http:\/\/www.cse.ohio-state.edu\/pnl\/corpus\/HuCorpus.html<\/RefSource><RefTarget Address=\"http:\/\/www.cse.ohio-state.edu\/pnl\/corpus\/HuCorpus.html\" TargetType=\"URL\"\/><\/ExternalRef> he original article has been corrected.","order":6,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}}]}}