{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T15:33:02Z","timestamp":1773761582537,"version":"3.50.1"},"reference-count":68,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,3,28]],"date-time":"2024-03-28T00:00:00Z","timestamp":1711584000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,3,28]],"date-time":"2024-03-28T00:00:00Z","timestamp":1711584000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Most soundfield synthesis approaches deal with extensive and regular loudspeaker arrays, which are often not suitable for home audio systems, due to physical space constraints. In this article, we propose a technique for soundfield synthesis through more easily deployable irregular loudspeaker arrays, i.e., where the spacing between loudspeakers is not constant, based on deep learning. The input are the driving signals obtained through a plane wave decomposition-based technique. While the considered driving signals are able to correctly reproduce the soundfield with a regular array, they show degraded performances when using irregular setups. Through a complex-valued convolutional neural network (CNN), we modify the driving signals in order to compensate the errors in the reproduction of the desired soundfield. Since no ground truth driving signals are available for the compensated ones, we train the model by calculating the loss between the desired soundfield at a number of control points and the one obtained through the driving signals estimated by the network. The proposed model must be retrained for each irregular loudspeaker array configuration. Numerical results show better reproduction accuracy with respect to the plane wave decomposition-based technique, pressure-matching approach, and linear optimizers for driving signal compensation.<\/jats:p>","DOI":"10.1186\/s13636-024-00337-7","type":"journal-article","created":{"date-parts":[[2024,3,28]],"date-time":"2024-03-28T02:01:31Z","timestamp":1711591291000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks"],"prefix":"10.1186","volume":"2024","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4167-5173","authenticated-orcid":false,"given":"Luca","family":"Comanducci","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fabio","family":"Antonacci","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Augusto","family":"Sarti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,3,28]]},"reference":[{"issue":"5","key":"337_CR1","doi-asserted-by":"publisher","first-page":"2764","DOI":"10.1121\/1.405852","volume":"93","author":"AJ Berkhout","year":"1993","unstructured":"A.J. Berkhout, D. de Vries, P. Vogel, Acoustic control by wave field synthesis. J. Acoust. Soc. Am. 93(5), 2764\u20132778 (1993)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR2","unstructured":"S. Spors, R. Rabenstein, J. Ahrens, in 124th AES convention. The theory of wave field synthesis revisited (Audio Engineering Society (AES),\u00a0New York, 2008), pp. 17\u201320"},{"issue":"1","key":"337_CR3","first-page":"2","volume":"21","author":"MA Gerzon","year":"1973","unstructured":"M.A. Gerzon, Periphony: With-height sound reproduction. J. Audio Eng. Soc. 21(1), 2\u201310 (1973)","journal-title":"J. Audio Eng. Soc."},{"issue":"6","key":"337_CR4","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1109\/89.943347","volume":"9","author":"DB Ward","year":"2001","unstructured":"D.B. Ward, T.D. Abhayapala, Reproduction of a plane-wave sound field using an array of loudspeakers. IEEE Trans. Speech Audio Process. 9(6), 697\u2013707 (2001)","journal-title":"IEEE Trans. Speech Audio Process."},{"issue":"11","key":"337_CR5","first-page":"1004","volume":"53","author":"MA Poletti","year":"2005","unstructured":"M.A. Poletti, Three-dimensional surround sound systems based on spherical harmonics. J. Audio Eng. Soc. 53(11), 1004\u20131025 (2005)","journal-title":"J. Audio Eng. Soc."},{"issue":"6","key":"337_CR6","doi-asserted-by":"publisher","first-page":"3590","DOI":"10.1121\/1.3409486","volume":"127","author":"M Poletti","year":"2010","unstructured":"M. Poletti, F. Fazi, P. Nelson, Sound-field reproduction systems using fixed-directivity loudspeakers. J. Acoust. Soc. Am. 127(6), 3590\u20133601 (2010)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR7","doi-asserted-by":"crossref","unstructured":"M. Kentgens, A. Behler, P. Jax, in 2020 IEEE international conference on acoustics, speech and signal processing (ICASSP). Translation of a higher order ambisonics sound scene based on parametric decomposition (IEEE,\u00a0Piscataway, 2020), pp. 151\u2013155","DOI":"10.1109\/ICASSP40776.2020.9054414"},{"issue":"8","key":"337_CR8","doi-asserted-by":"publisher","first-page":"2038","DOI":"10.1109\/TASL.2010.2041106","volume":"18","author":"J Ahrens","year":"2010","unstructured":"J. Ahrens, S. Spors, Sound field reproduction using planar and linear arrays of loudspeakers. IEEE Trans. Audio Speech Lang. Process. 18(8), 2038\u20132050 (2010)","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"337_CR9","doi-asserted-by":"crossref","unstructured":"P. Chen, et al., in 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP). 3D exterior soundfield reproduction using a planar loudspeaker array (IEEE,\u00a0Piscataway, 2018), pp. 471\u2013475","DOI":"10.1109\/ICASSP.2018.8461331"},{"key":"337_CR10","unstructured":"J. Trevino, T. Okamoto, Y. Iwaya, Y. Suzuki, High order Ambisonic decoding method for irregular loudspeaker arrays. In Proceedings of 20th International Congress on Acoustics (pp. 23\u201327)"},{"key":"337_CR11","unstructured":"F. Zotter, M. Frank, H. Pomberger, Comparison of energy-preserving and all-round ambisonic decoders. Fortschritte der Akustik, AIA-DAGA, (Meran) (2013)"},{"key":"337_CR12","doi-asserted-by":"crossref","unstructured":"T. Qu, Z. Huang, Y. Qiao, X. Wu, in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Matching projection decoding method for ambisonics system (IEEE,\u00a0Piscataway, 2018), pp. 561\u2013565","DOI":"10.1109\/ICASSP.2018.8461515"},{"key":"337_CR13","doi-asserted-by":"publisher","first-page":"1411","DOI":"10.1109\/TASLP.2021.3068002","volume":"29","author":"Z Ge","year":"2021","unstructured":"Z. Ge, L. Li, T. Qu, Partially matching projection decoding method evaluation under different playback conditions. IEEE\/ACM Trans. Audio Speech Lang. Process. 29, 1411\u20131423 (2021)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"issue":"10","key":"337_CR14","first-page":"807","volume":"60","author":"F Zotter","year":"2012","unstructured":"F. Zotter, M. Frank, All-round ambisonic panning and decoding. J. Audio Eng. Soc. 60(10), 807\u2013820 (2012)","journal-title":"J. Audio Eng. Soc."},{"issue":"4","key":"337_CR15","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1006\/jsvi.1994.1446","volume":"177","author":"PA Nelson","year":"1994","unstructured":"P.A. Nelson, Active control of acoustic fields and the reproduction of sound. J. Sound Vib. 177(4), 447\u2013477 (1994)","journal-title":"J. Sound Vib."},{"issue":"2","key":"337_CR16","doi-asserted-by":"publisher","first-page":"662","DOI":"10.1121\/1.1850032","volume":"117","author":"PA Gauthier","year":"2005","unstructured":"P.A. Gauthier, A. Berry, W. Woszczyk, Sound-field reproduction in-room using optimal control techniques: Simulations in the frequency domain. J. Acoust. Soc. Am. 117(2), 662\u2013678 (2005)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR17","doi-asserted-by":"crossref","unstructured":"P.N. Samarasinghe, M.A. Poletti, S.A. Salehin, T.D. Abhayapala, F.M. Fazi, in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. 3d soundfield reproduction using higher order loudspeakers (IEEE,\u00a0Piscataway, 2013), pp. 306\u2013310","DOI":"10.1109\/ICASSP.2013.6637658"},{"issue":"4","key":"337_CR18","doi-asserted-by":"publisher","first-page":"2100","DOI":"10.1121\/1.1863032","volume":"117","author":"T Betlehem","year":"2005","unstructured":"T. Betlehem, T.D. Abhayapala, Theory and design of sound field reproduction in reverberant rooms. J Acoust. Soc. Am. 117(4), 2100\u20132111 (2005)","journal-title":"J Acoust. Soc. Am."},{"issue":"12","key":"337_CR19","doi-asserted-by":"publisher","first-page":"1852","DOI":"10.1109\/TASLP.2019.2934834","volume":"27","author":"N Ueno","year":"2019","unstructured":"N. Ueno, S. Koyama, H. Saruwatari, Three-dimensional sound field reproduction based on weighted mode-matching method. IEEE\/ACM Trans. Audio Speech Lang. Process. 27(12), 1852\u20131867 (2019)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"337_CR20","doi-asserted-by":"crossref","unstructured":"N. Ueno, S. Koyama, H. Saruwatari, in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Sound field reproduction with exterior cancellation using analytical weighting of harmonic coefficients (IEEE,\u00a0Piscataway, 2018), pp. 466\u2013470","DOI":"10.1109\/ICASSP.2018.8462084"},{"key":"337_CR21","doi-asserted-by":"publisher","first-page":"1356","DOI":"10.1109\/TASLP.2020.2987748","volume":"28","author":"H Zuo","year":"2020","unstructured":"H. Zuo, P.N. Samarasinghe, T.D. Abhayapala, Intensity based spatial soundfield reproduction using an irregular loudspeaker array. IEEE\/ACM Trans. Audio Speech Lang. Process. 28, 1356\u20131369 (2020)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"337_CR22","doi-asserted-by":"crossref","unstructured":"H. Zuo, T.D. Abhayapala, P.N. Samarasinghe, in 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 3d multizone soundfield reproduction in a reverberant environment using intensity matching method (IEEE,\u00a0Piscataway, 2021), pp. 416\u2013420","DOI":"10.1109\/ICASSP39728.2021.9414077"},{"issue":"5","key":"337_CR23","doi-asserted-by":"publisher","first-page":"3590","DOI":"10.1121\/1.5133944","volume":"146","author":"MJ Bianco","year":"2019","unstructured":"M.J. Bianco, P. Gerstoft, J. Traer, E. Ozanich, M.A. Roch, S. Gannot, C.A. Deledalle, Machine learning in acoustics: Theory and applications. J. Acoust. Soc. Am. 146(5), 3590\u20133628 (2019)","journal-title":"J. Acoust. Soc. Am."},{"issue":"1","key":"337_CR24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13636-021-00231-6","volume":"2022","author":"M Cobos","year":"2022","unstructured":"M. Cobos, J. Ahrens, K. Kowalczyk, A. Politis, An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction. EURASIP J. Audio Speech Music Process. 2022(1), 1\u201321 (2022)","journal-title":"EURASIP J. Audio Speech Music Process."},{"key":"337_CR25","doi-asserted-by":"crossref","unstructured":"F. Lluis, P. Martinez-Nuevo, M. Bo M\u00f8ller, S. Ewan Shepstone, Sound field reconstruction in rooms: Inpainting meets super-resolution. J. Acoust. Soc. Am. 148(2), 649\u2013659 (2020)","DOI":"10.1121\/10.0001687"},{"key":"337_CR26","unstructured":"M.S. Kristoffersen, M.B. M\u00f8ller, P. Mart\u00ednez-Nuevo, J. \u00d8stergaard, Deep sound field reconstruction in real rooms: Introducing the isobel sound field dataset. (2021).\u00a0arXiv\u00a0preprint\u00a0arXiv:2102.06455"},{"key":"337_CR27","unstructured":"P. Morgado et al., in Proceedings of the 32nd Int. Conf. on Neural Information Processing Systems.\u00a0Self-supervised generation of spatial audio for 360$$^{\\circ }$$video. (Curran Associates Inc., New York, 2018), pp. 360\u2013370"},{"key":"337_CR28","unstructured":"G. Routray, S. Basu, P. Baldev, R.M. Hegde, in EAA Spatial Audio Signal Processing Symposium. Deep-sound field analysis for upscaling ambisonic signals (2019), pp. 1\u20136"},{"key":"337_CR29","doi-asserted-by":"crossref","unstructured":"S. Gao, J. Lin, W. Xihong, T. Qu, Sparse DNN model for frequency expanding of higher order ambisonics encoding process. IEEE\/ACM Trans. Audio Speech Lang. Process. (2022)","DOI":"10.1109\/TASLP.2022.3153266"},{"issue":"4","key":"337_CR30","doi-asserted-by":"publisher","first-page":"6187","DOI":"10.1007\/s11042-020-09979-z","volume":"80","author":"L Zhang","year":"2021","unstructured":"L. Zhang, X. Wang, R. Hu, D. Li, W. Tu, Estimation of spherical harmonic coefficients in sound field recording using feed-forward neural networks. Multimedia Tools Appl. 80(4), 6187\u20136202 (2021)","journal-title":"Multimedia Tools Appl."},{"key":"337_CR31","doi-asserted-by":"publisher","unstructured":"H. Chen, T. Abhayapala, in Proceedings of the 23rd International Congress on Acoustics : integrating 4th EAA Euroregio 2019 : 9-13 September 2019 in Aachen, Germany. Spatial sound field reproduction using deep neural networks (2019). https:\/\/doi.org\/10.18154\/RWTH-CONV-239844","DOI":"10.18154\/RWTH-CONV-239844"},{"key":"337_CR32","doi-asserted-by":"crossref","unstructured":"L. Comanducci, F. Antonacci, A. Sarti, in 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). A deep learning-based pressure matching approach to soundfield synthesis (IEEE,\u00a0Piscataway, 2022), pp. 1\u20135","DOI":"10.1109\/IWAENC53105.2022.9914712"},{"issue":"5","key":"337_CR33","doi-asserted-by":"publisher","first-page":"3055","DOI":"10.1121\/10.0019575","volume":"153","author":"X Hong","year":"2023","unstructured":"X. Hong, B. Du, S. Yang, M. Lei, X. Zeng, End-to-end sound field reproduction based on deep learning. J. Acoust. Soc. Am. 153(5), 3055\u20133055 (2023)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR34","doi-asserted-by":"publisher","first-page":"696","DOI":"10.1109\/TASLP.2020.2964958","volume":"28","author":"S Koyama","year":"2020","unstructured":"S. Koyama, G. Chardon, L. Daudet, Optimizing source and sensor placement for sound field control: An overview. IEEE\/ACM Trans. Audio Speech Lang. Process. 28, 696\u2013714 (2020)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"issue":"8","key":"337_CR35","doi-asserted-by":"publisher","first-page":"1406","DOI":"10.1109\/JAS.2022.105743","volume":"9","author":"C Lee","year":"2022","unstructured":"C. Lee, H. Hasegawa, S. Gao, Complex-valued neural networks: A comprehensive survey. IEEE\/CAA J. Autom. Sin. 9(8), 1406\u20131426 (2022)","journal-title":"IEEE\/CAA J. Autom. Sin."},{"key":"337_CR36","unstructured":"J. Bassey, L. Qian, X. Li, A survey of complex-valued neural networks. (2021).\u00a0arXiv\u00a0preprint\u00a0arXiv:2101.12249"},{"key":"337_CR37","unstructured":"C. Trabelsi, O. Bilaniuk, Y. Zhang, D. Serdyuk, S. Subramanian, J.F. Santos, S. Mehri, N. Rostamzadeh, Y. Bengio, C.J. Pal, in International Conference on Learning Representations. Deep complex networks (2018). https:\/\/openreview.net\/forum?id=H1T2hmZAb"},{"key":"337_CR38","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-27632-3","volume-title":"Complex-valued neural networks","author":"A Hirose","year":"2012","unstructured":"A. Hirose, Complex-valued neural networks (Springer Science & Business Media, Berlin\/Heidelberg, 2012)"},{"key":"337_CR39","doi-asserted-by":"crossref","unstructured":"M. Yang, M.Q. Ma, D. Li, Y.H.H. Tsai, R. Salakhutdinov, in 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Complex transformer: A framework for modeling complex-valued sequence (IEEE, 2020), pp. 4232\u20134236","DOI":"10.1109\/ICASSP40776.2020.9054008"},{"key":"337_CR40","doi-asserted-by":"publisher","unstructured":"H. Tsuzuki, M. Kugler, S. Kuroyanagi, A. Iwata, An approach for sound source localization by complex-valued neural network. IEICE Trans. Inf. Syst. E96.D(10), 2257\u20132265 (2013). https:\/\/doi.org\/10.1587\/transinf.E96.D.2257","DOI":"10.1587\/transinf.E96.D.2257"},{"key":"337_CR41","doi-asserted-by":"crossref","unstructured":"Y.S. Lee, C.Y. Wang, S.F. Wang, J.C. Wang, C.H. Wu, in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Fully complex deep neural network for phase-incorporating monaural source separation (IEEE,\u00a0Piscataway, 2017), pp. 281\u2013285","DOI":"10.1109\/ICASSP.2017.7952162"},{"key":"337_CR42","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1016\/j.apacoust.2015.10.010","volume":"104","author":"L Bianchi","year":"2016","unstructured":"L. Bianchi, F. Antonacci, A. Sarti, S. Tubaro, Model-based acoustic rendering based on plane wave decomposition. Appl. Acoust. 104, 127\u2013134 (2016)","journal-title":"Appl. Acoust."},{"issue":"5","key":"337_CR43","doi-asserted-by":"publisher","first-page":"2721","DOI":"10.1121\/1.2186514","volume":"119","author":"PA Gauthier","year":"2006","unstructured":"P.A. Gauthier, A. Berry, Adaptive wave field synthesis with independent radiation mode control for active sound field reproduction: Theory. J. Acoust. Soc. Am. 119(5), 2721\u20132737 (2006)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR44","doi-asserted-by":"crossref","unstructured":"P.A. Gauthier, A. Berry, in Audio Engineering Society Convention 123. Adaptive wave field synthesis for sound field reproduction: Theory, experiments, and future perspectives (Audio Engineering Society,\u00a0New York, 2007)","DOI":"10.1121\/1.2875844"},{"issue":"4","key":"337_CR45","doi-asserted-by":"publisher","first-page":"2003","DOI":"10.1121\/1.2875269","volume":"123","author":"PA Gauthier","year":"2008","unstructured":"P.A. Gauthier, A. Berry, Adaptive wave field synthesis for broadband active sound field reproduction: Signal processing. J. Acoust. Soc. Am. 123(4), 2003\u20132016 (2008)","journal-title":"J. Acoust. Soc. Am."},{"issue":"4","key":"337_CR46","doi-asserted-by":"publisher","first-page":"1991","DOI":"10.1121\/1.2875844","volume":"123","author":"PA Gauthier","year":"2008","unstructured":"P.A. Gauthier, A. Berry, Adaptive wave field synthesis for active sound field reproduction: Experimental results. J. Acoust. Soc. Am. 123(4), 1991\u20132002 (2008)","journal-title":"J. Acoust. Soc. Am."},{"key":"337_CR47","volume-title":"Fourier acoustics: sound radiation and nearfield acoustical holography","author":"EG Williams","year":"1999","unstructured":"E.G. Williams, Fourier acoustics: Sound radiation and nearfield acoustical holography (Academic press, Cambridge, 1999)"},{"issue":"4","key":"337_CR48","doi-asserted-by":"publisher","first-page":"561","DOI":"10.1137\/1034115","volume":"34","author":"PC Hansen","year":"1992","unstructured":"P.C. Hansen, Analysis of discrete ill-posed problems by means of the l-curve. SIAM Rev. 34(4), 561\u2013580 (1992)","journal-title":"SIAM Rev."},{"key":"337_CR49","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-662-03537-5","volume-title":"Inverse acoustic and electromagnetic scattering theory","author":"DL Colton","year":"1998","unstructured":"D.L. Colton, R. Kress, R. Kress, Inverse acoustic and electromagnetic scattering theory, vol. 93 (Springer, New York, 1998)"},{"issue":"1","key":"337_CR50","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1109\/TASL.2009.2022000","volume":"18","author":"DN Zotkin","year":"2009","unstructured":"D.N. Zotkin, R. Duraiswami, N.A. Gumerov, Plane-wave decomposition of acoustical scenes via spherical and cylindrical microphone arrays. IEEE Trans. Audio Speech Lang. Process. 18(1), 2\u201316 (2009)","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"issue":"3","key":"337_CR51","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1007\/BF01444290","volume":"57","author":"ET Whittaker","year":"1903","unstructured":"E.T. Whittaker, On the partial differential equations of mathematical physics. Math. Ann. 57(3), 333\u2013355 (1903)","journal-title":"Math. Ann."},{"key":"337_CR52","unstructured":"E. Verheijen, Sound field reproduction by wave field synthesis. Ph. D. dissertation, Delft University of Technology (1997)"},{"key":"337_CR53","volume-title":"Active control of sound","author":"PA Nelson","year":"1991","unstructured":"P.A. Nelson, S.J. Elliott, Active control of sound (Academic press, Cambridge, 1991)"},{"issue":"7553","key":"337_CR54","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"Y. LeCun, Y. Bengio, G. Hinton, Deep learning. Nature 521(7553), 436\u2013444 (2015)","journal-title":"Nature"},{"key":"337_CR55","unstructured":"K. Simonyan, A. Zisserman, in International Conference on Learning Representations. Very deep convolutional networks for large-scale image recognition (2015)"},{"key":"337_CR56","doi-asserted-by":"publisher","first-page":"2475","DOI":"10.1109\/TASLP.2022.3190723","volume":"30","author":"K SongGong","year":"2022","unstructured":"K. SongGong, W. Wang, H. Chen, Acoustic source localization in the circular harmonic domain using deep learning architecture. IEEE\/ACM Trans. Audio Speech Lang. Process. 30, 2475\u20132491 (2022)","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"337_CR57","doi-asserted-by":"crossref","unstructured":"A. Pandey, D. Wang, in 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Exploring deep complex networks for complex spectrogram enhancement (IEEE,\u00a0Piscataway, 2019), pp. 6885\u20136889","DOI":"10.1109\/ICASSP.2019.8682169"},{"key":"337_CR58","doi-asserted-by":"crossref","unstructured":"Y. Kuroe, M. Yoshid, T. Mori, in Artificial Neural Networks and Neural Information Processing-ICANN\/ICONIP 2003 Istanbul, Turkey, June 26\u201329, 2003 Proceedings. On activation functions for complex-valued neural networks-existence of energy functions- (Springer,\u00a0New York, 2003), pp. 985\u2013992","DOI":"10.1007\/3-540-44989-2_117"},{"key":"337_CR59","doi-asserted-by":"crossref","unstructured":"K. He, X. Zhang, S. Ren, J. Sun, in Proceedings of the IEEE international conference on computer vision. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification (IEEE, 2015), pp. 1026\u20131034","DOI":"10.1109\/ICCV.2015.123"},{"key":"337_CR60","doi-asserted-by":"publisher","unstructured":"K. He, X. Zhang, S. Ren, J. Sun, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Deep residual learning for image recognition (2016), pp. 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"issue":"1","key":"337_CR61","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929\u20131958 (2014)","journal-title":"J. Mach. Learn. Res."},{"key":"337_CR62","doi-asserted-by":"publisher","unstructured":"J.A. Barrachina. Negu93\/cvnn: Complex-valued neural networks (2022). https:\/\/doi.org\/10.5281\/zenodo.7303587","DOI":"10.5281\/zenodo.7303587"},{"key":"337_CR63","doi-asserted-by":"crossref","unstructured":"S. Koyama, K. Kimura, N. Ueno, in 2021 Immersive and 3D Audio: from Architecture to Automotive (I3DA). Sound field reproduction with weighted mode matching and infinite-dimensional harmonic analysis: An experimental evaluation (IEEE,\u00a0Piscataway, 2021), pp. 1\u20136","DOI":"10.1109\/I3DA48870.2021.9610874"},{"key":"337_CR64","unstructured":"H. Wierstorf, S. Spors, in Audio Engineering Society Convention 132, Sound field synthesis toolbox (Audio Engineering Society, 2012). https:\/\/github.com\/sfstoolbox\/sfs-python\/releases\/tag\/0.6.2"},{"key":"337_CR65","unstructured":"D.P. Kingma, J. Ba, in 3rd Intl. Conf. on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings. Adam: A method for stochastic optimization (2015). http:\/\/arxiv.org\/abs\/1412.6980"},{"issue":"4","key":"337_CR66","doi-asserted-by":"publisher","first-page":"600","DOI":"10.1109\/TIP.2003.819861","volume":"13","author":"Z Wang","year":"2004","unstructured":"Z. Wang, A.C. Bovik, H.R. Sheikh, E.P. Simoncelli, Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600\u2013612 (2004)","journal-title":"IEEE Trans. Image Process."},{"key":"337_CR67","doi-asserted-by":"publisher","unstructured":"S. Zhao, Q. Zhu, E. Cheng, I.S. Burnett, A room impulse response database for multizone sound field reproduction (L). J. Acoust. Soc. Am. 152(4), 2505\u20132512 (2022). https:\/\/doi.org\/10.1121\/10.0014958. https:\/\/pubs.aip.org\/asa\/jasa\/article-pdf\/152\/4\/2505\/16657353\/2505_1_online.pdf","DOI":"10.1121\/10.0014958"},{"key":"337_CR68","doi-asserted-by":"crossref","unstructured":"R. Scheibler, E. Bezzam, I. Dokmani\u0107, in 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP). Pyroomacoustics: A python package for audio room simulation and array processing algorithms (IEEE,\u00a0Piscataway, 2018), pp. 351\u2013355","DOI":"10.1109\/ICASSP.2018.8461310"}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-024-00337-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13636-024-00337-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13636-024-00337-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,28]],"date-time":"2024-03-28T02:11:27Z","timestamp":1711591887000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13636-024-00337-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,28]]},"references-count":68,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["337"],"URL":"https:\/\/doi.org\/10.1186\/s13636-024-00337-7","relation":{},"ISSN":["1687-4722"],"issn-type":[{"value":"1687-4722","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,28]]},"assertion":[{"value":"11 September 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 February 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 March 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors approve and consent to participate.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"The authors consent for publication.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"17"}}