{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,13]],"date-time":"2024-10-13T04:11:21Z","timestamp":1728792681591},"reference-count":40,"publisher":"Institute of Electronics, Information and Communications Engineers (IEICE)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IEICE Trans. Fundamentals"],"published-print":{"date-parts":[[2023,7,1]]},"DOI":"10.1587\/transfun.2022eap1098","type":"journal-article","created":{"date-parts":[[2023,1,18]],"date-time":"2023-01-18T22:09:56Z","timestamp":1674079796000},"page":"962-975","source":"Crossref","is-referenced-by-count":0,"title":["Deep Multiplicative Update Algorithm for Nonnegative Matrix Factorization and Its Application to Audio Signals"],"prefix":"10.1587","volume":"E106.A","author":[{"given":"Hiroki","family":"TANJI","sequence":"first","affiliation":[{"name":"Department of Electronics and Bioinformatics, Meiji University"}]},{"given":"Takahiro","family":"MURAKAMI","sequence":"additional","affiliation":[{"name":"Department of Electronics and Bioinformatics, Meiji University"}]}],"member":"532","reference":[{"key":"1","doi-asserted-by":"crossref","unstructured":"[1] D.D. Lee and H.S. Seung, \u201cLearning the parts of objects with nonnegative matrix factorization,\u201d Nature, vol.401, no.6755, pp.788-791, Oct. 1999. 10.1038\/44565","DOI":"10.1038\/44565"},{"key":"2","doi-asserted-by":"crossref","unstructured":"[2] P. Smaragdis and J.C. Brown, \u201cNon-negative matrix factorization for polyphonic music transcription,\u201d Proc. 2003 IEEE International Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, pp.177-180, Oct. 2003. 10.1109\/aspaa.2003.1285860","DOI":"10.1109\/ASPAA.2003.1285860"},{"key":"3","doi-asserted-by":"publisher","unstructured":"[3] P. 
Smaragdis, \u201cConvolutive speech bases and their application to supervised speech separation,\u201d IEEE Trans. Audio, Speech, Language Process., vol.15, no.1, pp.1-12, Jan. 2007. 10.1109\/tasl.2006.876726","DOI":"10.1109\/TASL.2006.876726"},{"key":"4","doi-asserted-by":"crossref","unstructured":"[4] K.W. Wilson, B. Raj, P. Smaragdis, and A. Divakaran, \u201cSpeech denoising using nonnegative matrix factorization with priors,\u201d Proc. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Las Vegas, USA, pp.4029-4032, March 2008. 10.1109\/icassp.2008.4518538","DOI":"10.1109\/ICASSP.2008.4518538"},{"key":"5","doi-asserted-by":"crossref","unstructured":"[5] D. FitzGerald, M. Cranitch, and E. Coyle, \u201cOn the use of the beta divergence for musical source separation,\u201d Proc. IET Irish Signals and Systems Conference 2009 (ISSC), Dublin, Ireland, June 2009. 10.1049\/cp.2009.1711","DOI":"10.1049\/cp.2009.1711"},{"key":"6","doi-asserted-by":"crossref","unstructured":"[6] N. Lyubimov and M. Kotov, \u201cNon-negative matrix factorization with linear constraints for single-channel speech enhancement,\u201d Proc. 14th Annual Conference of the International Speech Communication Association (INTERSPEECH), Lyon, France, pp.446-450, Aug. 2013. 10.21437\/interspeech.2013-132","DOI":"10.21437\/Interspeech.2013-132"},{"key":"7","doi-asserted-by":"publisher","unstructured":"[7] N. Mohammadiha, P. Smaragdis, and A. Leijon, \u201cSupervised and unsupervised speech enhancement using nonnegative matrix factorization,\u201d IEEE Trans. Audio, Speech, Language Process., vol.21, no.10, pp.2140-2151, Oct. 2013. 10.1109\/tasl.2013.2270369","DOI":"10.1109\/TASL.2013.2270369"},{"key":"8","doi-asserted-by":"crossref","unstructured":"[8] F. Weninger, J. Le Roux, J.R. Hershey, and S. Watanabe, \u201cDiscriminative NMF and its application to single-channel source separation,\u201d Proc. 
15th Annual Conference of the International Speech Communication Association (INTERSPEECH), Singapore, pp.865-869, Sept. 2014. 10.21437\/interspeech.2014-218","DOI":"10.21437\/Interspeech.2014-218"},{"key":"9","doi-asserted-by":"publisher","unstructured":"[9] F.J. Canadas-Quesada, P. Vera-Candeas, N. Ruiz-Reyes, J. Carabias-Orti, and P. Cabanas-Molero, \u201cPercussive\/harmonic sound separation by non-negative matrix factorization with smoothness\/sparseness constraints,\u201d EURASIP Journal on Audio, Speech, and Music Processing, vol.2014, no.1, pp.26-42, July 2014. 10.1186\/s13636-014-0026-5","DOI":"10.1186\/s13636-014-0026-5"},{"key":"10","doi-asserted-by":"crossref","unstructured":"[10] D. Fagot, H. Wendt, C. Fevotte, and P. Smaragdis, \u201cMajorization-minimization algorithms for convolutive NMF with the beta-divergence,\u201d Proc. 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, pp.8202-8206, May 2019. 10.1109\/icassp.2019.8683837","DOI":"10.1109\/ICASSP.2019.8683837"},{"key":"11","unstructured":"[11] I. Dhillon and S. Sra, \u201cGeneralized nonnegative matrix approximations with Bregman divergences,\u201d Proc. 2005 Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, Canada, pp.283-290, MIT Press, Dec. 2005."},{"key":"12","doi-asserted-by":"publisher","unstructured":"[12] C. Fevotte and J. Idier, \u201cAlgorithms for nonnegative matrix factorization with the beta-divergence,\u201d Neural Computation, vol.23, no.9, pp.2421-2456, Sept. 2011. 10.1162\/neco_a_00168","DOI":"10.1162\/NECO_a_00168"},{"key":"13","doi-asserted-by":"crossref","unstructured":"[13] M. Nakano, H. Kameoka, J. Le Roux, Y. Kitano, N. Ono, and S. Sagayama, \u201cConvergence-guaranteed multiplicative algorithms for nonnegative matrix factorization with \u03b2-divergence,\u201d Proc. 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Kittila, Finland, pp.283-288, Aug. 2010. 
10.1109\/mlsp.2010.5589233","DOI":"10.1109\/MLSP.2010.5589233"},{"key":"14","doi-asserted-by":"crossref","unstructured":"[14] A. Liutkus, D. FitzGerald, and R. Badeau, \u201cCauchy nonnegative matrix factorization,\u201d Proc. 2015 IEEE International Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, Oct. 2015. 10.1109\/waspaa.2015.7336900","DOI":"10.1109\/WASPAA.2015.7336900"},{"key":"15","doi-asserted-by":"publisher","unstructured":"[15] A. Cichocki, S. Cruces, and S. Amari, \u201cGeneralized alpha-beta divergences and their application to robust nonnegative matrix factorization,\u201d Entropy, vol.13, no.1, pp.134-170, Jan. 2011. 10.3390\/e13010134","DOI":"10.3390\/e13010134"},{"key":"16","doi-asserted-by":"publisher","unstructured":"[16] R. Kompass, \u201cA generalized divergence measure for nonnegative matrix factorization,\u201d Neural Computation, vol.19, no.3, pp.780-791, March 2007. 10.1162\/neco.2007.19.3.780","DOI":"10.1162\/neco.2007.19.3.780"},{"key":"17","doi-asserted-by":"publisher","unstructured":"[17] U. Simsekli, A. Liutkus, and A.T. Cemgil, \u201cAlpha-stable matrix factorization,\u201d IEEE Signal Process. Lett., vol.22, no.12, pp.2289-2293, Dec. 2015. 10.1109\/lsp.2015.2477535","DOI":"10.1109\/LSP.2015.2477535"},{"key":"18","doi-asserted-by":"crossref","unstructured":"[18] K. Yoshii, K. Itoyama, and M. Goto, \u201cStudent's t nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation,\u201d Proc. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, pp.51-55, March 2016. 10.1109\/icassp.2016.7471635","DOI":"10.1109\/ICASSP.2016.7471635"},{"key":"19","doi-asserted-by":"publisher","unstructured":"[19] D. Kitamura, \u201cNonnegative matrix factorization based on complex generative model,\u201d Acoustical Science and Technology, vol.40, no.3, pp.155-161, May 2019. 
10.1250\/ast.40.155","DOI":"10.1250\/ast.40.155"},{"key":"20","doi-asserted-by":"crossref","unstructured":"[20] H. Tanji, T. Murakami, and H. Kamata, \u201cA generalization of Laplace nonnegative matrix factorization and its multichannel extension,\u201d Proc. 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Lanzhou, China, pp.1694-1699, Nov. 2019. 10.1109\/apsipaasc47483.2019.9023125","DOI":"10.1109\/APSIPAASC47483.2019.9023125"},{"key":"21","doi-asserted-by":"crossref","unstructured":"[21] C. Fevotte, N. Bertin, and J.L. Durrieu, \u201cNonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis,\u201d Neural Computation, vol.21, no.3, pp.793-830, Sept. 2008. 10.1162\/neco.2008.04-08-771","DOI":"10.1162\/neco.2008.04-08-771"},{"key":"22","doi-asserted-by":"crossref","unstructured":"[22] H. Tanji, T. Murakami, and H. Kamata, \u201cLaplace nonnegative matrix factorization with application to semi-supervised audio denoising,\u201d Proc. 27th European Signal Processing Conference (EUSIPCO), A Coruna, Spain, Sept. 2019. 10.23919\/eusipco.2019.8903074","DOI":"10.23919\/EUSIPCO.2019.8903074"},{"key":"23","unstructured":"[23] H. Tanji and T. Murakami, \u201cLearning the statistical model of the NMF using the deep multiplicative update algorithm with applications,\u201d Proc. 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Tokyo, Japan, pp.205-211, Dec. 2021."},{"key":"24","doi-asserted-by":"publisher","unstructured":"[25] R. Hennequin, B. David, and R. Badeau, \u201cBeta-divergence as a subclass of Bregman divergence,\u201d IEEE Signal Process. Lett., vol.18, no.2, pp.83-86, Feb. 2011. 10.1109\/lsp.2010.2096211","DOI":"10.1109\/LSP.2010.2096211"},{"key":"25","doi-asserted-by":"crossref","unstructured":"[26] J. Le Roux, S. Wisdom, H. Erdogan, and J.R. Hershey, \u201cSDR-Half-baked or well done?,\u201d Proc. 
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, pp.626-630, May 2019. 10.1109\/icassp.2019.8683855","DOI":"10.1109\/ICASSP.2019.8683855"},{"key":"26","doi-asserted-by":"publisher","unstructured":"[27] V. Monga, Y. Li, and Y.C. Eldar, \u201cAlgorithm unrolling: Interpretable, efficient deep learning for signal and image processing,\u201d IEEE Signal Process. Mag., vol.38, no.2, pp.18-44, March 2021. 10.1109\/msp.2020.3016905","DOI":"10.1109\/MSP.2020.3016905"},{"key":"27","doi-asserted-by":"crossref","unstructured":"[28] T. Meinhardt, M. Moeller, C. Hazirbas, and D. Cremers, \u201cLearning proximal operators: Using denoising networks for regularizing inverse imaging problems,\u201d Proc. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, pp.1799-1808, Oct. 2017. 10.1109\/iccv.2017.198","DOI":"10.1109\/ICCV.2017.198"},{"key":"28","doi-asserted-by":"publisher","unstructured":"[29] Y. Yang, J. Sun, H. Li, and Z. Xu, \u201cADMM-CSNet: A deep learning approach for image compressive sensing,\u201d IEEE Trans. Pattern Anal. Mach. Intell., vol.42, no.3, pp.521-538, March 2020. 10.1109\/tpami.2018.2883941","DOI":"10.1109\/TPAMI.2018.2883941"},{"key":"29","doi-asserted-by":"publisher","unstructured":"[30] O. Solomon, R. Cohen, Y. Zhang, Y. Yang, Q. He, J. Luo, R.J.G. van Sloun, and Y.C. Eldar, \u201cDeep unfolded robust PCA with application to clutter suppression in ultrasound,\u201d IEEE Trans. Med. Imag., vol.39, no.4, pp.1051-1063, April 2020. 10.1109\/tmi.2019.2941271","DOI":"10.1109\/TMI.2019.2941271"},{"key":"30","doi-asserted-by":"crossref","unstructured":"[31] J. Le Roux, J.R. Hershey, and F. Weninger, \u201cDeep NMF for speech separation,\u201d Proc. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Australia, pp.66-70, April 2015. 
10.1109\/icassp.2015.7177933","DOI":"10.1109\/ICASSP.2015.7177933"},{"key":"31","doi-asserted-by":"crossref","unstructured":"[32] S. Wisdom, T. Powers, J. Pitton, and L. Atlas, \u201cDeep recurrent NMF for speech separation by unfolding iterative thresholding,\u201d Proc. 2017 IEEE International Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, pp.254-258, Oct. 2017. 10.1109\/waspaa.2017.8170034","DOI":"10.1109\/WASPAA.2017.8170034"},{"key":"32","doi-asserted-by":"publisher","unstructured":"[33] Y. Luo and N. Mesgarani, \u201cConv-TasNet: Surpassing ideal time-frequency magnitude masking for speech separation,\u201d IEEE\/ACM Trans. Audio, Speech, Language Process., vol.27, no.8, pp.1256-1266, Aug. 2019. 10.1109\/taslp.2019.2915167","DOI":"10.1109\/TASLP.2019.2915167"},{"key":"33","doi-asserted-by":"publisher","unstructured":"[34] Q. Zhang, A. Nicolson, M. Wang, K.K. Paliwal, and C. Wang, \u201cDeepMMSE: A deep learning approach to MMSE-based noise power spectral density estimation,\u201d IEEE\/ACM Trans. Audio, Speech, Language Process., vol.28, pp.1404-1415, April 2020. 10.1109\/taslp.2020.2987441","DOI":"10.1109\/TASLP.2020.2987441"},{"key":"34","doi-asserted-by":"crossref","unstructured":"[35] P. Magron, R. Badeau, and A. Liutkus, \u201cLevy NMF for robust nonnegative source separation,\u201d Proc. 2017 IEEE International Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, USA, pp.259-263, Oct. 2017. 10.1109\/waspaa.2017.8170035","DOI":"10.1109\/WASPAA.2017.8170035"},{"key":"35","unstructured":"[36] C. Kehling, J. Abesser, C. Dittmar, and G. Schuller, \u201cAutomatic tablature transcription of electric guitar recordings by estimation of score- and instrument-related parameters,\u201d Proc. 17th International Conference on Digital Audio Effects (DAFx), Erlangen, Germany, Sept. 2014."},{"key":"36","unstructured":"[37] D.P. Kingma and J. 
Ba, \u201cAdam: a method for stochastic optimization,\u201d Proc. 3rd International Conference on Learning Representations (ICLR), San Diego, USA, Dec. 2015."},{"key":"37","unstructured":"[38] R. Pascanu, T. Mikolov, and Y. Bengio, \u201cOn the difficulty of training recurrent neural networks,\u201d Proc. 30th International Conference on Machine Learning (ICML), Atlanta, USA, pp.1310-1318, June 2013."},{"key":"38","doi-asserted-by":"crossref","unstructured":"[39] V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, \u201cLibrispeech: An ASR corpus based on public domain audio books,\u201d Proc. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Australia, pp.5206-5210, April 2015. 10.1109\/icassp.2015.7178964","DOI":"10.1109\/ICASSP.2015.7178964"},{"key":"39","doi-asserted-by":"crossref","unstructured":"[40] B. King, C. Fevotte, and P. Smaragdis, \u201cOptimal cost function and magnitude power for NMF-based speech separation and music interpolation,\u201d Proc. 2012 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Santander, Spain, Sept. 2012. 10.1109\/mlsp.2012.6349726","DOI":"10.1109\/MLSP.2012.6349726"},{"key":"40","unstructured":"[41] Y.N. Dauphin, A. Fan, M. Auli, and D. Grangier, \u201cLanguage modeling with gated convolutional networks,\u201d Proc. 34th International Conference on Machine Learning (ICML), Sydney, Australia, pp.933-941, Aug. 
2017."}],"container-title":["IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transfun\/E106.A\/7\/E106.A_2022EAP1098\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,12]],"date-time":"2024-10-12T13:11:33Z","timestamp":1728738693000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transfun\/E106.A\/7\/E106.A_2022EAP1098\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,1]]},"references-count":40,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2023]]}},"URL":"https:\/\/doi.org\/10.1587\/transfun.2022eap1098","relation":{},"ISSN":["0916-8508","1745-1337"],"issn-type":[{"type":"print","value":"0916-8508"},{"type":"electronic","value":"1745-1337"}],"subject":[],"published":{"date-parts":[[2023,7,1]]},"article-number":"2022EAP1098"}}