{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T13:17:28Z","timestamp":1740143848351,"version":"3.37.3"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,7,31]],"date-time":"2020-07-31T00:00:00Z","timestamp":1596153600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,7,31]],"date-time":"2020-07-31T00:00:00Z","timestamp":1596153600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Ego-noise, i.e., the noise a robot causes by its own motions, significantly corrupts the microphone signal and severely impairs the robot\u2019s capability to interact seamlessly with its environment. Therefore, suitable ego-noise suppression techniques are required. For this, it is intuitive to use also motor data collected by proprioceptors mounted to the joints of the robot since it describes the physical state of the robot and provides additional information about the ego-noise sources. In this paper, we use a dictionary-based approach for ego-noise suppression in a semi-supervised manner: first, an ego-noise dictionary is learned and subsequently used to estimate the ego-noise components of a mixture by computing a weighted sum of dictionary entries. The estimation of the weights is very sensitive against other signals beside ego-noise contained in the mixture. For increased robustness, we therefore propose to incorporate knowledge about the physical state of the robot to the estimation of the weights. This is achieved by introducing a motor data-based regularization term to the estimation problem which promotes similar weights for similar physical states. The regularization is derived by representing the motor data as a graph and imprints the intrinsic structure of the motor data space onto the dictionary model. We analyze the proposed method and evaluate its ego-noise suppression performance for a large variety of different movements and demonstrate the superiority of the proposed method compared to an approach without using motor data.<\/jats:p>","DOI":"10.1186\/s13636-020-00178-0","type":"journal-article","created":{"date-parts":[[2020,7,31]],"date-time":"2020-07-31T09:03:44Z","timestamp":1596186224000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Motor data-regularized nonnegative matrix factorization for ego-noise suppression"],"prefix":"10.1186","volume":"2020","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3013-3192","authenticated-orcid":false,"given":"Alexander","family":"Schmidt","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Brendel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Haubner","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Walter","family":"Kellermann","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,7,31]]},"reference":[{"key":"178_CR1","first-page":"832","volume-title":"Proc. 17th Nat. Conf. Artificial Intell. (AAAI)","author":"K. Nakadai","year":"2000","unstructured":"K. Nakadai, T. Lourens, H. G. Okuno, H. Kitano, in Proc. 17th Nat. Conf. Artificial Intell. (AAAI). Active audition for humanoid (AAAIAustin, TX, 2000), pp. 832\u2013839."},{"key":"178_CR2","first-page":"5610","volume-title":"Proc. IEEE Int. Conf. Acoust., Speech and Signal Process. (ICASSP)","author":"H. G. Okuno","year":"2015","unstructured":"H. G. Okuno, K. Nakadai, in Proc. IEEE Int. Conf. Acoust., Speech and Signal Process. (ICASSP). Robot audition: its rise and perspectives (IEEESouth Brisbane, QL, Australia, 2015), pp. 5610\u20135614."},{"key":"178_CR3","first-page":"658","volume-title":"Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS)","author":"J. Even","year":"2009","unstructured":"J. Even, H. Saruwatari, K. Shikano, T. Takatani, in Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS). Semi-blind suppression of internal noise for hands-free robot spoken dialog system (IEEESt. Louis, MO, 2009), pp. 658\u2013663."},{"issue":"11","key":"178_CR4","doi-asserted-by":"publisher","first-page":"4311","DOI":"10.1109\/TSP.2006.881199","volume":"54","author":"M. Aharon","year":"2006","unstructured":"M. Aharon, M. Elad, A. Bruckstein, K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process.54(11), 4311\u20134322 (2006).","journal-title":"IEEE Trans. Signal Process."},{"key":"178_CR5","first-page":"355","volume-title":"Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP)","author":"A. Deleforge","year":"2015","unstructured":"A. Deleforge, W. Kellermann, in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP). Phase-optimized K-SVD for signal extraction from underdetermined multichannel sparse mixtures (IEEESouth Brisbane, QL, Australia, 2015), pp. 355\u2013359."},{"issue":"6755","key":"178_CR6","doi-asserted-by":"publisher","first-page":"788","DOI":"10.1038\/44565","volume":"401","author":"D. D. Lee","year":"1999","unstructured":"D. D. Lee, H. S. Seung, Learning the parts of objects by non-negative matrix factorization. Nature. 401(6755), 788\u2013791 (1999).","journal-title":"Nature"},{"key":"178_CR7","first-page":"535","volume-title":"Proc. 13th Int. Conf. Neural Inform. Process. Syst. (NIPS)","author":"D. D. Lee","year":"2000","unstructured":"D. D. Lee, H. S. Seung, in Proc. 13th Int. Conf. Neural Inform. Process. Syst. (NIPS). Algorithms for non-negative matrix factorization (NeurlPSDenver, CO, 2000), pp. 535\u2013541."},{"issue":"9","key":"178_CR8","doi-asserted-by":"publisher","first-page":"2421","DOI":"10.1162\/NECO_a_00168","volume":"23","author":"C. F\u00e9votte","year":"2011","unstructured":"C. F\u00e9votte, J. Idier, Algorithms for non-negative matrix factorization with the \u03b2-divergence. Neural Comput.23(9), 2421\u20132456 (2011).","journal-title":"Neural Comput."},{"key":"178_CR9","first-page":"6293","volume-title":"Proc. IEEE Int, Conf. Robotics and Automation (ICRA)","author":"T. Tezuka","year":"2014","unstructured":"T. Tezuka, T. Yoshida, K. Nakadai, in Proc. IEEE Int, Conf. Robotics and Automation (ICRA). Ego-motion noise suppression for robots based on semi-blind infinite non-negative matrix factorization (IEEEFlorence, Italy, 2014), pp. 6293\u20136298."},{"issue":"5","key":"178_CR10","doi-asserted-by":"publisher","first-page":"971","DOI":"10.1109\/TASL.2013.2239990","volume":"21","author":"H. Sawada","year":"2013","unstructured":"H. Sawada, H. Kameoka, S. Araki, N. Ueda, Multichannel extensions of non-negative matrix factorization with complex-valued data. IEEE\/ACM Trans. Audio, Speech, Language Process.21(5), 971\u2013982 (2013).","journal-title":"IEEE\/ACM Trans. Audio, Speech, Language Process."},{"key":"178_CR11","first-page":"136","volume-title":"Proc. ITG Fachtagung Sprachkommunikation","author":"T. Haubner","year":"2018","unstructured":"T. Haubner, A. Schmidt, W. Kellermann, in Proc. ITG Fachtagung Sprachkommunikation. Multichannel nonnegative matrix factorization for ego-noise suppression (VDE-VerlagOldenburg, Germany, 2018), pp. 136\u2013140."},{"key":"178_CR12","unstructured":"Clean PNG, NAO, der humanoide Roboter.https:\/\/de.cleanpng.com\/png-m5r7ur\/ Accessed 20 May 2020."},{"key":"178_CR13","first-page":"2685","volume-title":"Proc. European Conf. Speech Communication and Technology (INTERSPEECH - Eurospeech)","author":"A. Ito","year":"2005","unstructured":"A. Ito, T. Kanayama, M. Suzuki, S. Makino, in Proc. European Conf. Speech Communication and Technology (INTERSPEECH - Eurospeech). Internal noise suppression for speech recognition by small robots (ISCALisbon, Portugal, 2005), pp. 2685\u20132688."},{"key":"178_CR14","first-page":"116","volume-title":"Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP)","author":"A. Schmidt","year":"2019","unstructured":"A. Schmidt, W. Kellermann, in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Process. (ICASSP). Informed ego-noise suppression using motor data-driven dictionaries (IEEEBrighton, UK, 2019), pp. 116\u2013120."},{"key":"178_CR15","first-page":"26","volume-title":"Proc. IEEE\/ RAS Int, Conf. Humanoid Robots (Humanoids)","author":"Y. Nishimura","year":"2006","unstructured":"Y. Nishimura, M. Ishizuka, K. Nakadai, M. Nakano, H. Tsujino, in Proc. IEEE\/ RAS Int, Conf. Humanoid Robots (Humanoids). Speech recognition for a humanoid with motor noise utilizing missing feature theory (IEEECancun, Mexico, 2006), pp. 26\u201333."},{"key":"178_CR16","first-page":"199","volume-title":"Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS)","author":"G. Ince","year":"2009","unstructured":"G. Ince, K. Nakadai, T. Rodemann, Y. Hasegawa, H. Tsujino, J. Imura, in Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS). Ego-noise suppression of a robot using template subtraction (IEEESt. Louis, MO, 2009), pp. 199\u2013204."},{"key":"178_CR17","first-page":"3623","volume-title":"Proc. IEEE Int, Conf. Robotics and Automation (ICRA)","author":"G. Ince","year":"2010","unstructured":"G. Ince, K. Nakadai, T. Rodemann, Y. Hasegawa, H. Tsujino, in Proc. IEEE Int, Conf. Robotics and Automation (ICRA). Imura: A hybrid framework for ego noise cancellation of a robot (IEEEAnchorage, AK, 2010), pp. 3623\u20133628."},{"key":"178_CR18","first-page":"1281","volume-title":"Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS)","author":"A. Schmidt","year":"2016","unstructured":"A. Schmidt, A. Deleforge, W. Kellermann, in Proc. IEEE\/RSJ Int. Conf. Intelligent Robots and Systems (IROS). Ego-noise reduction using a motor data-guided multichannel dictionary (IEEEDaejon, South Korea, 2016), pp. 1281\u20131286."},{"key":"178_CR19","first-page":"63","volume-title":"Proc. 8th IEEE Int, Conf. on Data Mining","author":"D. Cai","year":"2008","unstructured":"D. Cai, X. He, X. Wu, J. Han, in Proc. 8th IEEE Int, Conf. on Data Mining. Non-negative matrix factorization on manifold (IEEEPisa, Italy, 2008), pp. 63\u201372."},{"issue":"8","key":"178_CR20","doi-asserted-by":"publisher","first-page":"1548","DOI":"10.1109\/TPAMI.2010.231","volume":"33","author":"D. Cai","year":"2011","unstructured":"D. Cai, X. He, J. Han, T. S. Huang, Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. and Mach. Intell.33(8), 1548\u20131560 (2011).","journal-title":"IEEE Trans. Pattern Anal. and Mach. Intell."},{"key":"178_CR21","first-page":"431","volume-title":"Proc. IEEE Workshop Mach. Learning Signal Process","author":"M. N. Schmidt","year":"2007","unstructured":"M. N. Schmidt, J. Larsen, F. -T. Hsiao, in Proc. IEEE Workshop Mach. Learning Signal Process. Wind noise reduction using non-negative sparse coding (IEEEThessaloniki, Greece, 2007), pp. 431\u2013436."},{"issue":"4","key":"178_CR22","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","volume":"17","author":"U. von Luxburg","year":"2007","unstructured":"U. von Luxburg, A tutorial on spectral clustering. Statistics and Computing. 17(4), 395\u2013416 (2007).","journal-title":"Statistics and Computing"},{"key":"178_CR23","volume-title":"Spectral graph theory","author":"F. R. K. Chung","year":"1997","unstructured":"F. R. K. Chung, Spectral graph theory, 1st edn, vol. 1 (American Mathematical Soc., Providence, RI, 1997)."},{"key":"178_CR24","first-page":"2399","volume":"7","author":"M. Belkin","year":"2006","unstructured":"M. Belkin, P. Niyogi, V. Sindhwani, Manifold regularization: a geometric framework for learning from labeled and uUnlabeled examples. J. Mach. Learn. Research. 7:, 2399\u20132434 (2006).","journal-title":"J. Mach. Learn. Research"},{"key":"178_CR25","volume-title":"Problems of learning on manifolds. PhD Thesis","author":"M. Belkin","year":"2003","unstructured":"M. Belkin, Problems of learning on manifolds. PhD Thesis (The University of Chicago, Chicago, 2003)."},{"key":"178_CR26","unstructured":"Seventh Framework Programme, \u2018Embodied Audition for RobotS\u2019 (EARS).https:\/\/robot-ears.eu\/. Accessed 25 Sept 2018."},{"issue":"5","key":"178_CR27","doi-asserted-by":"publisher","first-page":"2421","DOI":"10.1121\/1.2229005","volume":"120","author":"M. Cooke","year":"2006","unstructured":"M. Cooke, J. Barker, An audio-visual corpus for speech perception and automatic speech recognition. J. Acoustical Society of America. 120(5), 2421\u20132424 (2006).","journal-title":"J. Acoustical Society of America"},{"key":"178_CR28","volume-title":"Technical Report 1706","author":"C. F\u00e9votte","year":"2005","unstructured":"C. F\u00e9votte, R. Griboval, E. Vincent, in Technical Report 1706. BSS EVAL toolbox user guide (IRISARennes, France, 2005). Software available at http:\/\/www.irisa.fr\/metiss\/bsseval\/."},{"key":"178_CR29","unstructured":"ITU-T Recommendation P.862.2: Wideband extension to recommendation P.862 for the assessment of wideband telephone networks and speech codecs. Recommendation, ITU (November 2007)."}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-020-00178-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13636-020-00178-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-020-00178-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,31]],"date-time":"2021-07-31T00:44:52Z","timestamp":1627692292000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13636-020-00178-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,31]]},"references-count":29,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["178"],"URL":"https:\/\/doi.org\/10.1186\/s13636-020-00178-0","relation":{},"ISSN":["1687-4722"],"issn-type":[{"type":"electronic","value":"1687-4722"}],"subject":[],"published":{"date-parts":[[2020,7,31]]},"assertion":[{"value":"8 January 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 July 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"11"}}