{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T15:23:05Z","timestamp":1781104985535,"version":"3.54.1"},"reference-count":20,"publisher":"IGI Global Scientific Publishing","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,7,1]]},"abstract":"<p>This paper presents a system of speaker localization for a purpose of speaker tracking by camera. The authors use the information given by the two microphones, placed in opposition, to determine the position of the active speaker in trying to supervise the audio-visual recording. To achieve the speaker localization task, the authors have proposed and employed two methods, which are called respectively: the filtered correlation method and the energy differential method. The principle of the first method is based on the calculation of the correlation between the two signals collected by the two microphones and a special filtering. The second is based on the computation of the logarithmic energy differential between these two signals. However, when different methods are used simultaneously to make a decision, it is often interesting to use a fusion technique combining those estimations or decisions in order to enhance the system performances. For that purpose, this paper proposes two fusion techniques operating at the decision level which are used to fuse the two estimations into one that should be more precise.<\/p>","DOI":"10.4018\/jmcmc.2010070102","type":"journal-article","created":{"date-parts":[[2011,2,15]],"date-time":"2011-02-15T15:11:04Z","timestamp":1297782664000},"page":"15-33","source":"Crossref","is-referenced-by-count":0,"title":["Automatic Speaker Localization and Tracking"],"prefix":"10.4018","volume":"2","author":[{"given":"Siham","family":"Ouamour","sequence":"first","affiliation":[{"name":"USTHB University, Algeria"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Halim","family":"Sayoud","sequence":"additional","affiliation":[{"name":"USTHB University, Algeria"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Salah","family":"Khennouf","sequence":"additional","affiliation":[{"name":"USTHB University, Algeria"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"2432","reference":[{"key":"jmcmc.2010070102-0","unstructured":"Bjor, O. H., Enger, J., & Winsvold, B. (2001). Sound intensity for identification of aircraft noise. In Proceedings of the Inter-noise Proceedings, International Congress and Exhibition on Noise Control Engineering."},{"key":"jmcmc.2010070102-1","year":"2003","journal-title":"Weatherproof microphone unit type 4184 (Product Data)"},{"key":"jmcmc.2010070102-2","author":"H.Cox","year":"1987","journal-title":"Robust adaptive beamforming"},{"key":"jmcmc.2010070102-3","author":"B. V.Dasarathy","year":"1994","journal-title":"Decision Fusion"},{"key":"jmcmc.2010070102-4","unstructured":"Jacobsen, F. (2002). Sound Intensity and its Measurement and Applications (Tech. Rep. No. 2216). Lyngby, Denmark: Technical University of Denmark."},{"key":"jmcmc.2010070102-5","doi-asserted-by":"crossref","unstructured":"Jain, A. K., Ross, A., & Prabhakar, S. (2004, January 4-20). An Introduction to Biometric Recognition. IEEE Transactions on Circuits and Systems for Video Technology Journal, 14(1).","DOI":"10.1109\/TCSVT.2003.818349"},{"key":"jmcmc.2010070102-6","unstructured":"Kirkwood, B. C. (2003, August 4). Acoustic Source Localization Using Time-Delay Estimation. Unpublished master\u2019s thesis, Technical University of Denmark, Denmark."},{"key":"jmcmc.2010070102-7","unstructured":"Lathoud, G. (2006). Spatio-temporal analysis of spontaneous speech with microphone arrays. Unpublished doctoral dissertation, EPFL University, Switzerland."},{"key":"jmcmc.2010070102-8","doi-asserted-by":"publisher","DOI":"10.3397\/1.2827638"},{"key":"jmcmc.2010070102-9","doi-asserted-by":"crossref","unstructured":"Liu, Q., Rui, Y., Gupta, A., & Cadiz, J. J. (2000, September). Automating Camera Management for Lecture Room Environments (Tech. Rep. No. MSR-TR-2000-90). Microsoft Research.","DOI":"10.1145\/365024.365310"},{"key":"jmcmc.2010070102-10","doi-asserted-by":"crossref","unstructured":"Maganti, H. K., & Perez, D. G. (2006, November 2-4). The Effects of Accuracy on Overlapping Speech. In Proceedings of 13ICMI\u201906. Banff, Canada: Speaker Localization for Microphone ArrayBased ASR.","DOI":"10.1145\/1180995.1181004"},{"key":"jmcmc.2010070102-11","doi-asserted-by":"crossref","unstructured":"Mennen, I., Schaeffler, F., & Docherty, G. (2008, May 6-9). A methodological study into the linguistic dimensions of pitch range differences between German and English. In Proceedings of the Speech Prosody Conference, Campinas, Brazil (pp. 527-530).","DOI":"10.21437\/SpeechProsody.2008-118"},{"key":"jmcmc.2010070102-12","doi-asserted-by":"publisher","DOI":"10.1016\/S0921-8890(02)00325-1"},{"key":"jmcmc.2010070102-13","doi-asserted-by":"publisher","DOI":"10.1007\/s00779-007-0172-1"},{"key":"jmcmc.2010070102-14","unstructured":"Ouamour, S., & Sayoud, H. (2009, July). Speaker Discrimination on Broadcast News and Telephonic Calls Using a Fusion of Neural and Statistical Classifiers. The Mediterranean Journal of Computers and Networks (MedJCN), 5(3), 104-113. ISSN: 1744-2400"},{"key":"jmcmc.2010070102-15","author":"D. R.Raichel","year":"2000","journal-title":"The Science and Applications of Acoustics. AIP Series in Modern Acoustics and Signal Processing"},{"key":"jmcmc.2010070102-16","unstructured":"Rasmussen, K. (1997). Calculation methods for the physical properties of air used in the calibration of microphones (Tech. Rep. No. PL-11b). Lyngby, Denmark: Technical University of Denmark, Department of Acoustic Technology."},{"key":"jmcmc.2010070102-17","unstructured":"Stylianou, Y., Pantazis, Y., Calderero, F., Larroy, P., Severin, F., Schimke, S., et al. (2005, July 18-August 12). GMM- Based Multimodal Biometric Verification. In Proceedings of Enterface\u201905. Mons, Belgium: Valsamakis. A."},{"key":"jmcmc.2010070102-18","unstructured":"Ui-Hyun, K., Jinsung, K., Doik, K., Hyogon, K., & Bum-Jae, Y. (2008). Speaker Localization on a Humanoid Robot's Head using the TDOA-based Feature Matrix. In Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN)."},{"key":"jmcmc.2010070102-19","unstructured":"Verlinde, P. (1999, September 17). Contribution \u00e0 la v\u00e9rification multi-modale de l'identit\u00e9 en utilisant la fusion de d\u00e9cisions. Th\u00e8se de doctorat, Ecole Nationale Sup\u00e9rieure des T\u00e9l\u00e9communications, MA, Bruxelles."}],"container-title":["International Journal of Mobile Computing and Multimedia Communications"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=46121","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,4]],"date-time":"2024-04-04T00:45:16Z","timestamp":1712191516000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/jmcmc.2010070102"}},"subtitle":["Using a Fusion of the Filtered Correlation with the Energy Differential"],"short-title":[],"issued":{"date-parts":[[2010,7,1]]},"references-count":20,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2010,7]]}},"URL":"https:\/\/doi.org\/10.4018\/jmcmc.2010070102","relation":{},"ISSN":["1937-9412","1937-9404"],"issn-type":[{"value":"1937-9412","type":"print"},{"value":"1937-9404","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,7,1]]}}}