{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,18]],"date-time":"2025-10-18T10:59:59Z","timestamp":1760785199744,"version":"3.37.3"},"reference-count":82,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T00:00:00Z","timestamp":1726185600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T00:00:00Z","timestamp":1726185600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["773268"],"award-info":[{"award-number":["773268"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Room impulse responses (RIRs) are used in several applications, such as augmented reality and virtual reality. These applications require a large number of RIRs to be convolved with audio, under strict latency constraints. In this paper, we consider the compression of RIRs, in conjunction with fast time-domain convolution. We consider three different methods of RIR approximation for the purpose of RIR compression and compare them to state-of-the-art compression. The methods are evaluated using several standard objective quality measures, both channel-based and signal-based. We also propose a novel low-rank-based algorithm for fast time-domain convolution and show how the convolution can be carried out without the need to decompress the RIR. Numerical simulations are performed using RIRs of different lengths, recorded in three different rooms. It is shown that compression using low-rank approximation is a very compelling option to the state-of-the-art Opus compression, as it performs as well or better than on all but one considered measure, with the added benefit of being amenable to fast time-domain convolution.<\/jats:p>","DOI":"10.1186\/s13636-024-00363-5","type":"journal-article","created":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T16:25:25Z","timestamp":1726244725000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Compression of room impulse responses for compact storage and fast low-latency convolution"],"prefix":"10.1186","volume":"2024","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1578-080X","authenticated-orcid":false,"given":"Martin","family":"J\u00e4lmby","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Filip","family":"Elvander","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Toon","family":"van Waterschoot","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,9,13]]},"reference":[{"key":"363_CR1","doi-asserted-by":"publisher","first-page":"1620","DOI":"10.1109\/TASLP.2020.2990485","volume":"28","author":"C Evers","year":"2020","unstructured":"C. Evers, H.W. L\u00f6llmann, H. Mellmann, A. Schmidt, H. Barfuss, P.A. Naylor, W. Kellermann, The LOCATA challenge: acoustic source localization and tracking. IEEE\/ACM Trans. Audio Speech Lang. Process. 28, 1620\u20131643 (2020). https:\/\/doi.org\/10.1109\/TASLP.2020.2990485","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-84996-056-4","volume-title":"Speech Dereverberation","author":"PA Naylor","year":"2010","unstructured":"P.A. Naylor, N.D. Gaubitch, Speech Dereverberation (Springer, London, 2010)"},{"key":"363_CR3","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-51202-6","volume-title":"Auralization: Fundamentals of Acoustics, Modelling, Simulation, Algorithms and Acoustic Virtual Reality","author":"M Vorl\u00e4nder","year":"2020","unstructured":"M. Vorl\u00e4nder, Auralization: Fundamentals of Acoustics, Modelling, Simulation, Algorithms and Acoustic Virtual Reality (Springer Nature, Switzerland, 2020)"},{"issue":"4","key":"363_CR4","doi-asserted-by":"publisher","first-page":"2746","DOI":"10.1121\/1.5096178","volume":"145","author":"F Brinkmann","year":"2019","unstructured":"F. Brinkmann, L. Asp\u00f6ck, D. Ackermann, S. Lepa, M. Vorl\u00e4nder, S. Weinzierl, A round robin on room acoustical simulation and auralization. J. Acoust. Soc. Am. 145(4), 2746\u20132760 (2019)","journal-title":"J. Acoust. Soc. Am."},{"issue":"4","key":"363_CR5","doi-asserted-by":"publisher","first-page":"692","DOI":"10.1109\/TASLP.2016.2647702","volume":"25","author":"S Gannot","year":"2017","unstructured":"S. Gannot, E. Vincent, S. Markovich-Golan, A. Ozerov, A consolidated perspective on multimicrophone speech enhancement and source separation. IEEE\/ACM Trans. Audio Speech Lang. Process. 25(4), 692\u2013730 (2017). https:\/\/doi.org\/10.1109\/TASLP.2016.2647702","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR6","doi-asserted-by":"publisher","unstructured":"S.\u00a0Goetze, E.\u00a0Albertin, M.\u00a0Kallinger, A.\u00a0Mertins, K.D. Kammeyer, in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Quality assessment for listening-room compensation algorithms (2010), pp. 2450\u20132453. https:\/\/doi.org\/10.1109\/ICASSP.2010.5496301","DOI":"10.1109\/ICASSP.2010.5496301"},{"key":"363_CR7","unstructured":"G.W. Elko, E.\u00a0Diethorn, T.\u00a0Gaensler, Room impulse response variation due to thermal fluctuation and its impact on acoustic echo cancellation (Kyoto, 2003)"},{"issue":"2","key":"363_CR8","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1016\/0022-460X(91)90764-B","volume":"146","author":"J Mourjopoulos","year":"1991","unstructured":"J. Mourjopoulos, M. Paraskevas, Pole and zero modeling of room transfer functions. J. Sound Vib. 146(2), 281\u2013302 (1991)","journal-title":"J. Sound Vib."},{"issue":"7","key":"363_CR9","doi-asserted-by":"publisher","first-page":"1547","DOI":"10.1109\/TASLP.2017.2700940","volume":"25","author":"G Vairetti","year":"2017","unstructured":"G. Vairetti, E. De Sena, M. Catrysse, S.H. Jensen, M. Moonen, T. van Waterschoot, A scalable algorithm for physically motivated and sparse approximation of room impulse responses with orthonormal basis functions. IEEE\/ACM Trans. Audio Speech Lang. Process. 25(7), 1547\u20131561 (2017)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR10","doi-asserted-by":"crossref","unstructured":"O.\u00a0Das, P.\u00a0Calamia, S.V. Amengual\u00a0Gari, in 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Room impulse response interpolation from a sparse set of measurements using a modal architecture (Toronto, 2021), pp. 960\u2013964","DOI":"10.1109\/ICASSP39728.2021.9414399"},{"key":"363_CR11","unstructured":"J.S. Abel, S.\u00a0Coffin, K.\u00a0Spratt, A modal architecture for artificial reverberation with application to room acoustics modeling (Los Angeles, 2014). AES Preprint 9208"},{"key":"363_CR12","doi-asserted-by":"publisher","first-page":"3617","DOI":"10.1121\/1.2934828","volume":"123","author":"C Huszty","year":"2008","unstructured":"C. Huszty, N. Bukuli, \u00c1. Torma, F. Augusztinovicz, Effects of filtering of room impulse responses on room acoustics parameters by using different filter structures. J. Acoust. Soc. Amer. 123, 3617 (2008)","journal-title":"J. Acoust. Soc. Amer."},{"key":"363_CR13","unstructured":"G.\u00a0Vairetti, Efficient parametric modeling, identification and equalization of room acoustics (Ph.D. thesis, KU Leuven, 2018)"},{"issue":"3","key":"363_CR14","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1109\/TSA.2003.811536","volume":"11","author":"LSH Ngia","year":"2003","unstructured":"L.S.H. Ngia, Recursive identification of acoustic echo systems using orthonormal basis functions. IEEE Trans. Speech Audio Process. 11(3), 278\u2013293 (2003)","journal-title":"IEEE Trans. Speech Audio Process."},{"key":"363_CR15","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4939-0755-7","volume-title":"Springer Handbook of Acoustics","author":"T Rossing","year":"2014","unstructured":"T. Rossing, Springer Handbook of Acoustics (Springer, New York, 2014)"},{"key":"363_CR16","doi-asserted-by":"crossref","unstructured":"K.\u00a0Shi, X.\u00a0Ma, G.\u00a0Tong Zhou, An efficient acoustic echo cancellation design for systems with long room impulses and nonlinear loudspeakers. Sign. Process. 89(2), 121\u2013132 (2009)","DOI":"10.1016\/j.sigpro.2008.07.009"},{"key":"363_CR17","doi-asserted-by":"publisher","unstructured":"L.\u00a0Krishnan, P.D. Teal, T.\u00a0Betlehem, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), A robust sparse approach to acoustic impulse response shaping (2015), pp. 738\u2013742. https:\/\/doi.org\/10.1109\/ICASSP.2015.7178067","DOI":"10.1109\/ICASSP.2015.7178067"},{"issue":"3","key":"363_CR18","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1109\/MSP.2017.2666081","volume":"34","author":"H Hacihabiboglu","year":"2017","unstructured":"H. Hacihabiboglu, E. De Sena, Z. Cvetkovic, J. Johnston, J.O. Smith III., Perceptual spatial audio recording, simulation, and rendering: an overview of spatial-audio techniques based on psychoacoustics. IEEE Signal Proc. Mag. 34(3), 36\u201354 (2017). https:\/\/doi.org\/10.1109\/MSP.2017.2666081","journal-title":"IEEE Signal Proc. Mag."},{"key":"363_CR19","doi-asserted-by":"crossref","unstructured":"B.F.G. Katz, D. Murphy, A. Farina, in Augmented Reality, Virtual Reality, and Computer Graphics, ed. by L.T. De Paolis, P. Bourdot. The past has ears (PHE): XR explorations of acoustic spaces as cultural heritage (Springer International Publishing, Cham, 2020), pp.91\u201398","DOI":"10.1007\/978-3-030-58468-9_7"},{"issue":"10","key":"363_CR20","doi-asserted-by":"publisher","first-page":"3790","DOI":"10.1109\/TSP.2006.879280","volume":"54","author":"T Ajdler","year":"2006","unstructured":"T. Ajdler, L. Sbaiz, M. Vetterli, The plenacoustic function and its sampling. IEEE Trans. Signal Process. 54(10), 3790\u20133804 (2006). https:\/\/doi.org\/10.1109\/TSP.2006.879280","journal-title":"IEEE Trans. Signal Process."},{"key":"363_CR21","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1051\/aacus\/2022040","volume":"6","author":"B Rafaely","year":"2022","unstructured":"B. Rafaely, V. Tourbabin, E. Habets, Z. Ben-Hur, H. Lee, H. Gamper, L. Arbel, L. Birnie, T. Abhayapala, P. Samarasinghe, Spatial audio signal processing for binaural reproduction of recorded acoustic scenes - review and challenges. Acta Acust. 6, 47 (2022)","journal-title":"Acta Acust."},{"issue":"3","key":"363_CR22","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1109\/MSP.2021.3110108","volume":"39","author":"R Gupta","year":"2022","unstructured":"R. Gupta, J. He, R. Ranjan, W.S. Gan, F. Klein, C. Schneiderwind, A. Neidhardt, K. Brandenburg, V. V\u00e4lim\u00e4ki, Augmented\/mixed reality audio for hearables: sensing, control, and rendering. IEEE Signal Proc. Mag. 39(3), 63\u201389 (2022). https:\/\/doi.org\/10.1109\/MSP.2021.3110108","journal-title":"IEEE Signal Proc. Mag."},{"key":"363_CR23","doi-asserted-by":"publisher","unstructured":"C. Schissler, P. Stirling, R. Mehra, in 2017 IEEE Virtual Reality (VR), Efficient construction of the spatial room impulse response (2017), pp. 122\u2013130 https:\/\/doi.org\/10.1109\/VR.2017.7892239","DOI":"10.1109\/VR.2017.7892239"},{"key":"363_CR24","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1109\/TASLP.2019.2951995","volume":"28","author":"MB M\u00f8ller","year":"2020","unstructured":"M.B. M\u00f8ller, J. \u00d8stergaard, A moving horizon framework for sound zones. IEEE\/ACM Trans. Audio Speech Lang. Process. 28, 256\u2013265 (2020). https:\/\/doi.org\/10.1109\/TASLP.2019.2951995","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR25","doi-asserted-by":"publisher","unstructured":"J.\u00a0Brunnstr\u00f6m, M.\u00a0J\u00e4lmby, T.\u00a0van Waterschoot, M.\u00a0Moonen, in Proceedings of the Fast low-rank filtered-x least mean squares for multichannel active noise control. 1085\u20131089 (2023). https:\/\/doi.org\/10.1109\/IEEECONF59524.2023.10477017","DOI":"10.1109\/IEEECONF59524.2023.10477017"},{"key":"363_CR26","unstructured":"A.\u00a0Car\u00f4t, C.\u00a0Werner, in Proceedings of the \u201cMusic in the Global Village\u201d-Conference, Budapest, Hungary, vol. 162, Network music performance-problems, approaches and perspectives (2007), pp. 10\u201323.\u00a0https:\/\/www.carot.de\/Docs\/MITGV_AC_CW.pdf"},{"issue":"5","key":"363_CR27","doi-asserted-by":"publisher","first-page":"1421","DOI":"10.1109\/TASL.2012.2189567","volume":"20","author":"V V\u00e4limaki","year":"2012","unstructured":"V. V\u00e4limaki, J.D. Parker, L. Savioja, J.O. Smith, J.S. Abel, Fifty years of artificial reverberation. IEEE\/ACM Trans. Audio Speech Lang. Process. 20(5), 1421\u20131448 (2012). https:\/\/doi.org\/10.1109\/TASL.2012.2189567","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR28","doi-asserted-by":"publisher","first-page":"297","DOI":"10.1090\/S0025-5718-1965-0178586-1","volume":"19","author":"JW Cooley","year":"1965","unstructured":"J.W. Cooley, J.W. Tukey, An algorithm for the machine calculation of complex fourier series. Math. Comput. 19, 297\u2013301 (1965)","journal-title":"Math. Comput."},{"key":"363_CR29","doi-asserted-by":"publisher","unstructured":"T.G. Stockham, in Proceedings of the April 26-28, 1966, Spring Joint Computer Conference, AFIPS \u201966 (Spring), High-speed convolution and correlation (Association for Computing Machinery, New York, 1966), pp. 229\u2013233. https:\/\/doi.org\/10.1145\/1464182.1464209","DOI":"10.1145\/1464182.1464209"},{"key":"363_CR30","volume-title":"Partitioned Convolution Algorithms for Real-Time Auralization","author":"F Wefers","year":"2015","unstructured":"F. Wefers, Partitioned Convolution Algorithms for Real-Time Auralization (Logos Verlag, DEU, 2015)"},{"issue":"5","key":"363_CR31","doi-asserted-by":"publisher","first-page":"985","DOI":"10.1007\/s11760-012-0387-0","volume":"8","author":"A Primavera","year":"2014","unstructured":"A. Primavera, S. Cecchi, L. Romoli, P. Peretti, F. Piazza, A low latency implementation of a non-uniform partitioned convolution algorithm for room acoustic simulation. SIViP. 8(5), 985\u2013994 (2014)","journal-title":"SIViP."},{"issue":"1","key":"363_CR32","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1080\/19401493.2014.888594","volume":"8","author":"M Vorl\u00e4nder","year":"2015","unstructured":"M. Vorl\u00e4nder, D. Schr\u00f6der, S. Pelzer, F. Wefers, Virtual reality for architectural acoustics. J. Build. Perform. Simul. 8(1), 15\u201325 (2015)","journal-title":"J. Build. Perform. Simul."},{"key":"363_CR33","unstructured":"W.C. Lee, C.M. Liu, C.H. Yang, J.I. Guo, in 6th International Conference on Digital Audio Effects (DAFx-03), Fast perceptual convolution for room reverberation (London, 2003)"},{"key":"363_CR34","doi-asserted-by":"crossref","unstructured":"N. Jillings, J.D. Reiss, R. Stables, in Proc. 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Zero-delay large signal convolution using multiple processor architectures (2017), pp. 339\u2013343","DOI":"10.1109\/WASPAA.2017.8170051"},{"key":"363_CR35","unstructured":"B.\u00a0Holm-Rasmussen, H.\u00a0Lehtonen, V.\u00a0V\u00e4lim\u00e4ki, in Proc. 16th Int. Conf. Digital Audio Effects (DAFx-13), A new reverberator based on variable sparsity convolution (Maynooth, 2013)"},{"key":"363_CR36","unstructured":"T.\u00a0Carpentier, M.\u00a0Noisternig, O.\u00a0Warusfel, in 17th International Conference on Digital Audio Effects - DAFx-14, Hybrid reverberation processor with perceptual control (Erlangen, 2014), pp. 93 \u2013 100"},{"key":"363_CR37","doi-asserted-by":"publisher","unstructured":"M.\u00a0J\u00e4lmby, F.\u00a0Elvander, T.\u00a0van Waterschoot, in 2021 29th European Signal Processing Conference (EUSIPCO), Low-rank tensor modeling of room impulse responses (2021), pp. 111\u2013115. https:\/\/doi.org\/10.23919\/EUSIPCO54536.2021.9616075","DOI":"10.23919\/EUSIPCO54536.2021.9616075"},{"key":"363_CR38","doi-asserted-by":"publisher","first-page":"957","DOI":"10.1109\/TASLP.2023.3240650","volume":"31","author":"M J\u00e4lmby","year":"2023","unstructured":"M. J\u00e4lmby, F. Elvander, T. van Waterschoot, Low-rank room impulse response estimation. IEEE\/ACM Trans. Audio Speech Lang. Process. 31, 957\u2013969 (2023). https:\/\/doi.org\/10.1109\/TASLP.2023.3240650","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR39","doi-asserted-by":"publisher","unstructured":"M.\u00a0J\u00e4lmby, F.\u00a0Elvander, T.\u00a0van Waterschoot. Multi-channel low-rank convolution of jointly compressed room impulse responses, IEEE Open Journal of Signal Processing. 5, 850-857 (2025). https:\/\/doi.org\/10.1109\/OJSP.2024.3410089","DOI":"10.1109\/OJSP.2024.3410089"},{"key":"363_CR40","doi-asserted-by":"publisher","unstructured":"J.\u00a0Atkins, A.\u00a0Strauss, C.\u00a0Zhang, in Proc. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Approximate convolution using partitioned truncated singular value decomposition filtering (2013), pp. 176\u2013180. https:\/\/doi.org\/10.1109\/ICASSP.2013.6637632","DOI":"10.1109\/ICASSP.2013.6637632"},{"key":"363_CR41","doi-asserted-by":"publisher","unstructured":"M.\u00a0J\u00e4lmby, F.\u00a0Elvander, T.\u00a0van Waterschoot, in 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Fast low-latency convolution by low-rank tensor approximation (2023), pp. 1\u20135. https:\/\/doi.org\/10.1109\/ICASSP49357.2023.10095908","DOI":"10.1109\/ICASSP49357.2023.10095908"},{"key":"363_CR42","doi-asserted-by":"crossref","unstructured":"M.\u00a0Jaderberg, A.\u00a0Vedaldi, A.\u00a0Zisserman. Speeding up convolutional neural networks with low rank expansions (2014). arXiv:1405.3866","DOI":"10.5244\/C.28.88"},{"key":"363_CR43","doi-asserted-by":"publisher","unstructured":"L. Sorber, M. Van Barel, L. De Lathauwer, Optimization-based algorithms for tensor decompositions: canonical polyadic decomposition, decomposition in rank-(Lr, Lr, 1) terms, and a new generalization. SIAM J. Optim. 23(2), 695\u2013720 (2013). https:\/\/doi.org\/10.1137\/120868323","DOI":"10.1137\/120868323"},{"key":"363_CR44","unstructured":"N.\u00a0Vervliet, O.\u00a0Debals, L.\u00a0Sorber, M.\u00a0Van\u00a0Barel, L.\u00a0De\u00a0Lathauwer. Tensorlab 3.0 (2016). https:\/\/www.tensorlab.net. Accessed 15 Aug 2024"},{"key":"363_CR45","unstructured":"J.M. Valin, K.\u00a0Vos, T.\u00a0Terriberry. Definition of the opus audio codec (2012). https:\/\/www.rfc-editor.org\/rfc\/rfc6716. Accessed 15 Aug 2024"},{"key":"363_CR46","unstructured":"J.M. Valin, G.\u00a0Maxwell, T.B. Terriberry, K.\u00a0Vos, in Proc. 135th AES Convention, High-quality, low-delay music coding in the Opus codec (New York, 2012)"},{"key":"363_CR47","unstructured":"K.\u00a0Vos, S.\u00a0Jensen, K.\u00a0Soerensen. Silk speech codec (2010). https:\/\/datatracker.ietf.org\/doc\/html\/draft-vos-silk-02. Accessed 15 Aug 2024"},{"key":"363_CR48","unstructured":"J.M.M. Valin, T.B. Terriberry, G. Maxwell, in 2009 17th European Signal Processing Conference (EUSIPCO), A full-bandwidth audio codec with low complexity and very low delay (2009), pp. 1254\u20131258"},{"issue":"1","key":"363_CR49","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1109\/TASL.2009.2023186","volume":"18","author":"JM Valin","year":"2010","unstructured":"J.M. Valin, T.B. Terriberry, C. Montgomery, G. Maxwell, A high-quality speech and audio codec with less than 10-ms delay. IEEE Trans. Audio Speech Lang. Process. 18(1), 58\u201367 (2010). https:\/\/doi.org\/10.1109\/TASL.2009.2023186","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"363_CR50","doi-asserted-by":"publisher","unstructured":"H.\u00a0Ren, C.\u00a0Ritz, J.\u00a0Zhao, D.\u00a0Jang, in 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Impact of compression on the performance of the room impulse response interpolation approach to spatial audio synthesis (2022), pp. 442\u2013448. https:\/\/doi.org\/10.23919\/APSIPAASC55919.2022.9980324","DOI":"10.23919\/APSIPAASC55919.2022.9980324"},{"key":"363_CR51","volume-title":"Room Acoustics","author":"H Kuttruff","year":"2009","unstructured":"H. Kuttruff, Room Acoustics (Spon Press, London, 2009)"},{"issue":"2","key":"363_CR52","doi-asserted-by":"publisher","first-page":"320","DOI":"10.1109\/89.279281","volume":"2","author":"Y Haneda","year":"1994","unstructured":"Y. Haneda, S. Makino, Y. Kaneda, Common acoustical pole and zero modeling of room transfer functions. IEEE Trans. Speech Audio Process. 2(2), 320\u2013328 (1994). https:\/\/doi.org\/10.1109\/89.279281","journal-title":"IEEE Trans. Speech Audio Process."},{"key":"363_CR53","first-page":"1012","volume":"50","author":"M Karjalainen","year":"2002","unstructured":"M. Karjalainen, P.A. Esquef, P. Antsalo, A. M\u00e4kivirta, V. V\u00e4lim\u00e4ki, Frequency-zooming ARMA modeling of resonant and reverberant systems. J. Audio Eng. Soc. 50, 1012\u20131029 (2002)","journal-title":"J. Audio Eng. Soc."},{"key":"363_CR54","doi-asserted-by":"crossref","unstructured":"J.K. Nielsen, J.R. Jensen, S.H. Jensen, M.G. Christensen, The single- and multichannel audio recordings database (SMARD) (Antibes, 2014)","DOI":"10.1109\/IWAENC.2014.6953334"},{"key":"#cr-split#-363_CR55.1","unstructured":"A. Hines, J. Skoglund, A. Kokaram, N. Harte, in IWAENC 2012"},{"key":"#cr-split#-363_CR55.2","unstructured":"International Workshop on Acoustic Signal Enhancement, ViSQOL: the virtual speech quality objective listener (Aachen, 2012), pp.1-4"},{"issue":"1","key":"363_CR56","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1186\/s13636-015-0054-9","volume":"2015","author":"A Hines","year":"2015","unstructured":"A. Hines, J. Skoglund, A.C. Kokaram, N. Harte, Visqol: an objective speech quality model. EURASIP J. Audio Speech Music Process. 2015(1), 13 (2015)","journal-title":"EURASIP J. Audio Speech Music Process."},{"key":"363_CR57","doi-asserted-by":"crossref","unstructured":"A.\u00a0Hines, E.\u00a0Gillen, D.\u00a0Kelly, J.\u00a0Skoglund, A.\u00a0Kokaram, N.\u00a0Harte, ViSQOLaudio: an objective audio quality metric for low bitrate codecs. J. Acoust. Soc. Am. 137 (6), EL449\u2013EL455 (2015)","DOI":"10.1121\/1.4921674"},{"issue":"3","key":"363_CR58","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1121\/1.1909343","volume":"37","author":"MR Schroeder","year":"1965","unstructured":"M.R. Schroeder, New method of measuring reverberation time. J. Acoust. Soc. Am. 37(3), 409\u2013412 (1965)","journal-title":"J. Acoust. Soc. Am."},{"key":"363_CR59","volume-title":"Spatial Audio","author":"F Rumsey","year":"2001","unstructured":"F. Rumsey, Spatial Audio (Focal Press, Oxford, 2001)"},{"key":"363_CR60","unstructured":"J.\u00a0Abel, P.\u00a0Huang, in Audio Engineering Society Convention 121, A simple, robust measure of reverberation echo density (Audio Engineering Society, 2006)"},{"issue":"9","key":"363_CR61","doi-asserted-by":"publisher","first-page":"1478","DOI":"10.1109\/TASLP.2015.2438547","volume":"23","author":"E De Sena","year":"2015","unstructured":"E. De Sena, H. Hacihabiboglu, Z. Cvetkovic, J.O. Smith, Efficient synthesis of room acoustics via scattering delay networks. IEEE\/ACM Trans. Audio Speech Lang. Process. 23(9), 1478\u20131492 (2015). https:\/\/doi.org\/10.1109\/TASLP.2015.2438547","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"363_CR62","unstructured":"M.\u00a0Karjalainen, H.\u00a0J\u00e4rvelainen, in Proceedings of the 111th Audio Engineering Society Convention, More about this reverberation science: perceptually good late reverberation (New York, 2011)"},{"key":"363_CR63","doi-asserted-by":"publisher","unstructured":"K.\u00a0MacWilliam, F.\u00a0Elvander, T.\u00a0van Waterschoot, in 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Simultaneous acoustic echo sorting and 3-D room geometry inference (2023), pp. 1\u20135. https:\/\/doi.org\/10.1109\/ICASSP49357.2023.10096005","DOI":"10.1109\/ICASSP49357.2023.10096005"},{"key":"363_CR64","doi-asserted-by":"publisher","unstructured":"H. Rosseel, T. van Waterschoot, in 2021 Immersive and 3D Audio: from Architecture to Automotive (I3DA), Improved acoustic source localization by time delay estimation with subsample accuracy (2021), pp. 1\u20138 https:\/\/doi.org\/10.1109\/I3DA48870.2021.9610902","DOI":"10.1109\/I3DA48870.2021.9610902"},{"key":"363_CR65","doi-asserted-by":"publisher","unstructured":"M.\u00a0Cartwright, B.\u00a0Pardo, G.J. Mysore, M.\u00a0Hoffman, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Fast and easy crowdsourced perceptual audio evaluation (2016), pp. 619\u2013623. https:\/\/doi.org\/10.1109\/ICASSP.2016.7471749","DOI":"10.1109\/ICASSP.2016.7471749"},{"key":"363_CR66","unstructured":"A. Spriet, K. Eneman, M. Moonen, J. Wouters, in 2008 16th European Signal Processing Conference (EUSIPCO), Objective measures for real-time evaluation of adaptive feedback cancellation algorithms in hearing aids (Lausanne, 2008), pp. 1\u20135"},{"issue":"2","key":"363_CR67","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1016\/j.specom.2011.09.004","volume":"54","author":"A Hines","year":"2012","unstructured":"A. Hines, N. Harte, Speech intelligibility prediction using a neurogram similarity index measure. Speech Commun. 54(2), 306\u2013320 (2012). https:\/\/doi.org\/10.1016\/j.specom.2011.09.004","journal-title":"Speech Commun."},{"key":"363_CR68","unstructured":"Rec.ITU-R.BS.1534-1:, Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA) (International Telecommunication Union, Geneva 2003)"},{"key":"363_CR69","doi-asserted-by":"publisher","unstructured":"M.\u00a0Narbutt, A.\u00a0Allen, J.\u00a0Skoglund, M.\u00a0Chinen, A.\u00a0Hines, in 2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX), Ambiqual - a full reference objective quality metric for ambisonic spatial audio (2018), pp. 1\u20136. https:\/\/doi.org\/10.1109\/QoMEX.2018.8463408","DOI":"10.1109\/QoMEX.2018.8463408"},{"key":"363_CR70","doi-asserted-by":"publisher","unstructured":"M.\u00a0Narbutt, J.\u00a0Skoglund, A.\u00a0Allen, M.\u00a0Chinen, D.\u00a0Barry, A.\u00a0Hines, Ambiqual: Towards a quality metric for headphone rendered compressed ambisonic spatial audio. Appl. Sci. 10(9) (2020). https:\/\/doi.org\/10.3390\/app10093188","DOI":"10.3390\/app10093188"},{"key":"363_CR71","doi-asserted-by":"publisher","unstructured":"A.\u00a0Rix, J.\u00a0Beerends, M.\u00a0Hollier, A.\u00a0Hekstra, in 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol.\u00a02, Perceptual evaluation of speech quality (PESQ) - a new method for speech quality assessment of telephone networks and codecs (2001), pp. 749\u2013752. https:\/\/doi.org\/10.1109\/ICASSP.2001.941023","DOI":"10.1109\/ICASSP.2001.941023"},{"issue":"6","key":"363_CR72","first-page":"366","volume":"61","author":"J Beerends","year":"2013","unstructured":"J. Beerends, C. Schmidmer, J. Berger, M. Obermann, R. Ullmann, J. Pomy, M. Keyhl, Perceptual objective listening quality assessment (POLQA), the third generation ITU-T standard for end-to-end speech quality measurement part I \u2013 temporal alignment. J. Audio Eng. Soc. 61(6), 366\u2013384 (2013)","journal-title":"J. Audio Eng. Soc."},{"issue":"1","key":"363_CR73","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/s13636-023-00284-9","volume":"2023","author":"T Dietzen","year":"2023","unstructured":"T. Dietzen, R. Ali, M. Taseska, T. van Waterschoot, MYRiAD: a multi-array room acoustic database. EURASIP J. Audio Speech Music Process. 2023(1), 17 (2023). https:\/\/doi.org\/10.1186\/s13636-023-00284-9","journal-title":"EURASIP J. Audio Speech Music Process."},{"key":"363_CR74","unstructured":"D.\u00a0Thery, B.F. Katz, in Intl Cong on Acoustics (ICA), Anechoic audio and 3D-video content database of small ensemble performances for virtual concerts (Aachen, 2019). https:\/\/hal.science\/hal-02354814"},{"key":"363_CR75","doi-asserted-by":"crossref","unstructured":"J.P. Paulo, C.R. Martins, J.\u00a0Bento Coelho, A hybrid MLS technique for room impulse response estimation. Appl. Acoust. 70(4), 556\u2013562 (2009)","DOI":"10.1016\/j.apacoust.2008.07.007"},{"key":"363_CR76","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.apacoust.2017.11.018","volume":"132","author":"DG \u0106iri\u0107","year":"2018","unstructured":"D.G. \u0106iri\u0107, M. Jankovi\u0107, Correction of room impulse response truncation based on a nonlinear decay model. Appl. Acoust. 132, 210\u2013222 (2018)","journal-title":"Appl. Acoust."},{"key":"363_CR77","doi-asserted-by":"publisher","unstructured":"M.\u00a0Crocco, A.\u00a0Del\u00a0Bue, in 2015 23rd European Signal Processing Conference (EUSIPCO), Room impulse response estimation by iterative weighted L1-norm (2015), pp. 1895\u20131899. https:\/\/doi.org\/10.1109\/EUSIPCO.2015.7362713","DOI":"10.1109\/EUSIPCO.2015.7362713"},{"key":"363_CR78","doi-asserted-by":"publisher","unstructured":"G. Huang, J. Benesty, J. Chen, C. Paleologu, S. Ciochin\u0103, W. Kellermann, I. Cohen, in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Acoustic system identification with partially time-varying models based on tensor decompositions (2022), pp. 1\u20135 https:\/\/doi.org\/10.1109\/IWAENC53105.2022.9914787","DOI":"10.1109\/IWAENC53105.2022.9914787"},{"key":"363_CR79","doi-asserted-by":"publisher","unstructured":"M.\u00a0Chen, C.M. Lee, The optimal determination of the truncation time of non-exponential sound decays. Buildings. 12(5) (2022). https:\/\/doi.org\/10.3390\/buildings12050697","DOI":"10.3390\/buildings12050697"},{"key":"#cr-split#-363_CR80.1","unstructured":"N.D. Gaubitch, H.W. Loellmann, M. Jeub, T.H. Falk, P.A. Naylor, P. Vary, M. Brookes, in IWAENC 2012"},{"key":"#cr-split#-363_CR80.2","unstructured":"International Workshop on Acoustic Signal Enhancement, Performance comparison of algorithms for blind reverberation time estimation from speech (Aachen, 2012), pp. 1-4"}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-024-00363-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13636-024-00363-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-024-00363-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,13]],"date-time":"2024-09-13T16:29:18Z","timestamp":1726244958000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13636-024-00363-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,13]]},"references-count":82,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["363"],"URL":"https:\/\/doi.org\/10.1186\/s13636-024-00363-5","relation":{},"ISSN":["1687-4722"],"issn-type":[{"type":"electronic","value":"1687-4722"}],"subject":[],"published":{"date-parts":[[2024,9,13]]},"assertion":[{"value":"3 December 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 July 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 September 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"45"}}