{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:01:47Z","timestamp":1768899707073,"version":"3.49.0"},"reference-count":53,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2012,10,15]],"date-time":"2012-10-15T00:00:00Z","timestamp":1350259200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/3.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>This paper presents a novel approach for indoor acoustic source localization using sensor arrays. The proposed solution starts by defining a generative model, designed to explain the acoustic power maps obtained by Steered Response Power (SRP) strategies. An optimization approach is then proposed to fit the model to real input SRP data and estimate the position of the acoustic source. Adequately fitting the model to real SRP data, where noise and other unmodelled effects distort the ideal signal, is the core contribution of the paper. Two basic strategies in the optimization are proposed. First, sparse constraints in the parameters of the model are included, enforcing the number of simultaneous active sources to be limited. Second, subspace analysis is used to filter out portions of the input signal that cannot be explained by the model. Experimental results on a realistic speech database show statistically significant localization error reductions of up to 30% when compared with the SRP-PHAT strategies.<\/jats:p>","DOI":"10.3390\/s121013781","type":"journal-article","created":{"date-parts":[[2012,10,16]],"date-time":"2012-10-16T03:33:50Z","timestamp":1350358430000},"page":"13781-13812","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":30,"title":["Source Localization with Acoustic Sensor Arrays Using Generative Model Based Fitting with Sparse Constraints"],"prefix":"10.3390","volume":"12","author":[{"given":"Jose","family":"Velasco","sequence":"first","affiliation":[{"name":"Department of Electronics, University of Alcal\u00e1, Campus Universitario s\/n, 28805, Alcal\u00e1 de Henares,Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Pizarro","sequence":"additional","affiliation":[{"name":"Department of Electronics, University of Alcal\u00e1, Campus Universitario s\/n, 28805, Alcal\u00e1 de Henares,Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3303-3963","authenticated-orcid":false,"given":"Javier","family":"Macias-Guarasa","sequence":"additional","affiliation":[{"name":"Department of Electronics, University of Alcal\u00e1, Campus Universitario s\/n, 28805, Alcal\u00e1 de Henares,Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2012,10,15]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1145\/159544.159617","article-title":"Some computer science issues in ubiquitous computing","volume":"36","author":"Weiser","year":"1993","journal-title":"Commun. ACM"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1038\/scientificamerican0496-68","article-title":"Smart rooms","volume":"274","author":"Pentland","year":"1996","journal-title":"Sci. Am."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1163\/156855302760121936","article-title":"Intelligent space concept and contents","volume":"16","author":"Lee","year":"2002","journal-title":"Adv. Robot."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1109\/TPAMI.2007.1174","article-title":"Multicamera people tracking with a probabilistic occupancy map","volume":"30","author":"Fleuret","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"8865","DOI":"10.3390\/s101008865","article-title":"Stereo vision tracking of multiple objects in complex indoor environments","volume":"10","author":"Garcia","year":"2010","journal-title":"Sensors"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"3655","DOI":"10.3390\/s100403655","article-title":"Localization of mobile robots using odometry and an external vision sensor","volume":"10","author":"Pizarro","year":"2010","journal-title":"Sensors"},{"key":"ref_7","unstructured":"Lowe, D. (September, January 20\u2013). Object recognition from local scale-invariant features. Kerkyra, Greece."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1006\/csla.1996.0024","article-title":"A practical methodology for speech source localization with microphone arrays","volume":"11","author":"Brandstein","year":"1997","journal-title":"Comput. Speech Lang."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Brandstein, M.S., and Ward, D.B. (2001). Microphone Arrays: Signal Processing Techniques and Applications, Springer-Verlag. [1st ed.].","DOI":"10.1007\/978-3-662-04619-7"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Waibel, A., and Stiefelhagen, R. (2009). Computers in the Human Interaction Loop, Springer. [2nd ed.].","DOI":"10.1007\/978-1-84882-054-8"},{"key":"ref_11","unstructured":"DiBiase, J. (2000). A high-accuracy, low-latency technique for talker localization in reverberant environments using microphone arrays. [Ph.D. Thesis, Brown University]."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/LSP.2007.910324","article-title":"A Linear Closed-Form Algorithm for Source Localization from Time-Differences of Arrival","volume":"15","author":"Gillette","year":"2008","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1109\/TASSP.1976.1162830","article-title":"The generalized correlation method for estimation of time delay","volume":"24","author":"Knapp","year":"1976","journal-title":"IEEE Trans. Acoust. Speech Signal Process"},{"key":"ref_14","unstructured":"Zhang, C., Florencio, D., and Zhang, Z. (April, January 31). Why does PHAT work well in low noise, reverberative environments?. Las Vegas, NV, USA."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1007\/978-3-642-11130-3_12","article-title":"Steered Beamforming Approaches for Acoustic Source Localization","volume":"3","author":"Cohen","year":"2010","journal-title":"Speech Processing in Modern Communication"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1109\/89.568735","article-title":"Use of The Cross-Power-Spectrum Phase in Acoustic Event Location","volume":"5","author":"Omologo","year":"1993","journal-title":"IEEE Trans. Speech Audio Process"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2510","DOI":"10.1109\/TASL.2007.906694","article-title":"A Generalized Steered Response Power Method for Computationally Viable Source Localization","volume":"15","author":"Dmochowski","year":"2007","journal-title":"IEEE Trans. Speech Audio Process"},{"key":"ref_18","unstructured":"Badali, A., Valin, J.M., Michaud, F., and Aarabi, P. (October, January 10\u2013). Evaluating real-time audio localization algorithms for artificial audition in robotics. St. Louis, MO, USA."},{"key":"ref_19","unstructured":"Do, H., and Silverman, H. (March, January 14\u2013). SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data. Dallas, TX, USA."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1109\/LSP.2010.2091502","article-title":"A modified SRP-PHAT functional for robust real-time sound source localization with scalable spatial sampling","volume":"18","author":"Cobos","year":"2011","journal-title":"IEEE Signal Proc. Lett."},{"key":"ref_21","unstructured":"Butko, T., Gonzalez Pla, R., Segura Perales, C., Nadeu Camprub\u00ed, C., and Hernando Peric\u00e1s, F.J. (September, January 29). Two-source acoustic event detection and localization: Online implementation in a smart-room. Barcelona, Spain."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Cohen, I., Benesty, J., and Gannot, S. (2010). Speech Processing in Modern Communication, Springer. Volume 3.","DOI":"10.1007\/978-3-642-11130-3"},{"key":"ref_23","unstructured":"Zhang, C., Zhang, Z., and Florencio, D. (April, January 15\u2013). Maximum Likelihood Sound Source Localization for Multiple Directional Microphones. Honolulu, HI, USA. Volume 1."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1109\/TMM.2008.917406","article-title":"Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings","volume":"10","author":"Zhang","year":"2008","journal-title":"IEEE Trans. Multimed."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1109\/TAP.1986.1143830","article-title":"Multiple emitter location and signal parameter estimation","volume":"34","author":"Schmidt","year":"1986","journal-title":"IEEE Trans. Antenn. Propag."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1109\/MSP.2007.4286571","article-title":"Compressive sensing [lecture notes]","volume":"24","author":"Baraniuk","year":"2007","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1016\/j.crma.2008.03.014","article-title":"The restricted isometry property and its implications for compressed sensing","volume":"346","author":"Candes","year":"2008","journal-title":"C R. Math."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1109\/78.738251","article-title":"An affine scaling methodology for best basis selection","volume":"47","author":"Rao","year":"1999","journal-title":"IEEE Trans. Signal Process"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1007\/BF02678430","article-title":"Adaptive greedy approximations","volume":"13","author":"Davis","year":"1997","journal-title":"Constr. Approx."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1007\/s102080010029","article-title":"Nonlinear methods of approximation","volume":"3","author":"Temlyakov","year":"2003","journal-title":"Found. Comput. Math."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"2231","DOI":"10.1109\/TIT.2004.834793","article-title":"Greed is good: Algorithmic results for sparse approximation","volume":"50","author":"Tropp","year":"2004","journal-title":"IEEE Trans. Inform. Theor."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1137\/S003614450037906X","article-title":"Atomic decomposition by basis pursuit","volume":"43","author":"Chen","year":"2001","journal-title":"SIAM Rev."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.1109\/TIT.2005.864420","article-title":"Just relax: Convex programming methods for identifying sparse signals in noise","volume":"52","author":"Tropp","year":"2006","journal-title":"IEEE Trans. Inform. Theor."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1190\/1.1440378","article-title":"Robust modeling with erratic data","volume":"38","author":"Claerbout","year":"1973","journal-title":"Geophysics"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"3010","DOI":"10.1109\/TSP.2005.850882","article-title":"A sparse signal reconstruction perspective for source localization with sensor arrays","volume":"53","author":"Malioutov","year":"2005","journal-title":"IEEE Trans. Signal Process"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"4780","DOI":"10.3390\/s110504780","article-title":"Adaptive Sparse Representation for Source Localization with Gain\/Phase Errors","volume":"11","author":"Sun","year":"2011","journal-title":"Sensors"},{"key":"ref_37","unstructured":"Ba, D., Ribeiro, K., Zhang, C., and Flore\u0302ncio, D. (March, January 14\u2013). L1 regularized room modeling with compact microphone arrays. Dallas, TX, USA."},{"key":"ref_38","unstructured":"Ribeiro, R., Ba, D., Zhang, C., and Floe\u0302ncio, D. (July, January 19\u2013). Turning enemies into friends: Using reflections to improve sound source localization. Singapore."},{"key":"ref_39","unstructured":"Chardon, G., and Daudet, L. (March, January 25\u2013). Narrowband source localization in an unknown reverberant environment using wavefield sparse decomposition. Kyoto, Japan."},{"key":"ref_40","unstructured":"Meuse, P., and Silverman, H. (April, January 19\u2013). Characterization of talker radiation pattern using a microphone array. Adelaide, Australia."},{"key":"ref_41","unstructured":"Chu, W., and Warnock, A. (2002). Detailed Directivity of Sound Fields Around Human Talkers, Institute for Research in Construction. Research Report."},{"key":"ref_42","unstructured":"Wabnitz, A., Epain, N., Jin, C.T., and van Schaik, A. (2010, January 29\u201331). Room acoustics simulation for multichannel microphone arrays. Melbourne, Australia."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1109\/TASL.2010.2045179","article-title":"Room acoustics simulation using 3-D compact explicit FDTD schemes","volume":"19","author":"Kowalczyk","year":"2011","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"key":"ref_44","unstructured":"Ziomek, L.J. (1995). Fundamentals of Acoustic Field Theory and Space-Time Signal Processing, CRC Press."},{"key":"ref_45","unstructured":"Tikhonov, A., Arsenin, V., and John, F. (1977). Solutions of Ill-Posed Problems, Vh Winston."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1016\/j.sigpro.2005.05.031","article-title":"Algorithms for simultaneous sparse approximation. Part II: Convex relaxation","volume":"86","author":"Tropp","year":"2006","journal-title":"Signal Process"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Statist. Soc. B."},{"key":"ref_48","unstructured":"Koh, K., Kim, S., and Boyd, S. 11_1s: A Matlab Solver for Large-Scale 11-Regularized Least Squares Problems. Available online: http:\/\/www.stanford.edu\/boyd\/11_1s\/ (accessed on 11 October 2012)."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1109\/JSTSP.2007.910971","article-title":"An interior-point method for large-scale 11-regularized least squares","volume":"1","author":"Kim","year":"2007","journal-title":"IEEE J. Sel. Top. Signal Process"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Lathoud, G., Odobez, J.M., and Gatica-Perez, D. (2004, January 21\u201323). AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking. Martigny, Switzerland.","DOI":"10.1007\/978-3-540-30568-2_16"},{"key":"ref_51","unstructured":"Moore, D.C. (2004). The IDIAP Smart Meeting Room, Technical Report; IDIAP Research Institute."},{"key":"ref_52","unstructured":"Lathoud, G. AV16.3 Dataset. Available online: http:\/\/www.idiap.ch\/dataset\/avl6-3\/ (accessed on 11 October 2012)."},{"key":"ref_53","unstructured":"Mostefa, D., Garcia, M., Bernardin, K., Stiefelhagen, R., McDonough, J., Voit, M., Omologo, M., Marques, F., Ekenel, H., and Pnevmatikakis, A. Available online: http:\/\/www.clear-evaluation.org\/clear06\/downloads\/chil-clear-v1.l-2006-02-21.pdf (accessed on 11 October 2012)."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/12\/10\/13781\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T21:52:49Z","timestamp":1760219569000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/12\/10\/13781"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,15]]},"references-count":53,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2012,10]]}},"alternative-id":["s121013781"],"URL":"https:\/\/doi.org\/10.3390\/s121013781","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,10,15]]}}}