{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T08:59:23Z","timestamp":1760345963249,"version":"3.41.0"},"reference-count":26,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2007,5,1]],"date-time":"2007-05-01T00:00:00Z","timestamp":1177977600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2007,5]]},"abstract":"<jats:p>We present a method for foreground\/background separation of audio using a background modelling technique. The technique models the background in an online, unsupervised, and adaptive fashion, and is designed for application to long term surveillance and monitoring problems. The background is determined using a statistical method to model the states of the audio over time. In addition, three methods are used to increase the accuracy of background modelling in complex audio environments. Such environments can cause the failure of the statistical model to accurately capture the background states. An entropy-based approach is used to unify background representations fragmented over multiple states of the statistical model. The approach successfully unifies such background states, resulting in a more robust background model. We adaptively adjust the number of states considered background according to background complexity, resulting in the more accurate classification of background models. Finally, we use an auxiliary model cache to retain potential background states in the system. This prevents the deletion of such states due to a rapid influx of observed states that can occur for highly dynamic sections of the audio signal. The separation algorithm was successfully applied to a number of audio environments representing monitoring applications.<\/jats:p>","DOI":"10.1145\/1230812.1230814","type":"journal-article","created":{"date-parts":[[2007,6,6]],"date-time":"2007-06-06T14:37:11Z","timestamp":1181140631000},"page":"8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Online audio background determination for complex audio environments"],"prefix":"10.1145","volume":"3","author":[{"given":"Simon","family":"Moncrieff","sequence":"first","affiliation":[{"name":"Curtin University of Technology, Perth, W. Australia"}]},{"given":"Svetha","family":"Venkatesh","sequence":"additional","affiliation":[{"name":"Curtin University of Technology, Perth, W. Australia"}]},{"given":"Geoff","family":"West","sequence":"additional","affiliation":[{"name":"Curtin University of Technology, Perth, W. Australia"}]}],"member":"320","published-online":{"date-parts":[[2007,5]]},"reference":[{"volume-title":"Proceedings of the ICOST'2005: 3rd. International Conference on Smart Homes and Health Telematics (July) Magog, Canada.","author":"Azlan M.","key":"e_1_2_1_1_1","unstructured":"Azlan , M. , Cartwright , I. , Jones , N. , Quirk , T. , and West , G . 2005. Multimodal monitoring of the aged in their own homes . In Proceedings of the ICOST'2005: 3rd. International Conference on Smart Homes and Health Telematics (July) Magog, Canada. Azlan, M., Cartwright, I., Jones, N., Quirk, T., and West, G. 2005. Multimodal monitoring of the aged in their own homes. In Proceedings of the ICOST'2005: 3rd. International Conference on Smart Homes and Health Telematics (July) Magog, Canada."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/11428572_4"},{"key":"e_1_2_1_3_1","volume-title":"IEEE International Symposium on Circuits and Systems (ISCAS 05)","volume":"2","author":"Chen J.","unstructured":"Chen , J. , Zhang , J. , Kam , A. , and Shue , L . 2005b. An automatic acoustic bathroom monitoring system . In IEEE International Symposium on Circuits and Systems (ISCAS 05) . vol. 2 , 1750--1753. Chen, J., Zhang, J., Kam, A., and Shue, L. 2005b. An automatic acoustic bathroom monitoring system. In IEEE International Symposium on Circuits and Systems (ISCAS 05). vol. 2, 1750--1753."},{"volume-title":"Workshop on Perceptual User Interfaces","author":"Clarkson B.","key":"e_1_2_1_4_1","unstructured":"Clarkson , B. , Sawhney , N. , and Pentland , A . 1998. Auditory context awareness in wearable computing . In Workshop on Perceptual User Interfaces . San Francisco, U.S.A., 47--61. Clarkson, B., Sawhney, N., and Pentland, A. 1998. Auditory context awareness in wearable computing. In Workshop on Perceptual User Interfaces. San Francisco, U.S.A., 47--61."},{"key":"e_1_2_1_5_1","volume-title":"IEEE International Conference on Multimedia and Expo (ICME","author":"Clavel C.","year":"2005","unstructured":"Clavel , C. , Ehrette , T. , and Richard , G . 2005. Events detection for an audio-based surveillance system . In IEEE International Conference on Multimedia and Expo (ICME 2005 ). Amsterdam, Netherlands. Clavel, C., Ehrette, T., and Richard, G. 2005. Events detection for an audio-based surveillance system. In IEEE International Conference on Multimedia and Expo (ICME 2005). Amsterdam, Netherlands."},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Cover T. and Thomas J. 1991. Elements of Information Theory. John Wiley and Sons.   Cover T. and Thomas J. 1991. Elements of Information Theory. John Wiley and Sons.","DOI":"10.1002\/0471200611"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8655(03)00147-8"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2004.653"},{"volume-title":"Ten Lectures on Wavelets","author":"Daubechies I.","key":"e_1_2_1_9_1","unstructured":"Daubechies , I. 1992. Ten Lectures on Wavelets . Society for Industrial and Applied Mathematics , Philadelphia, Pennsylvania . Daubechies, I. 1992. Ten Lectures on Wavelets. Society for Industrial and Applied Mathematics, Philadelphia, Pennsylvania."},{"key":"e_1_2_1_10_1","unstructured":"Deller J. R. Proakis J. G. and Hansen J. H. 1993. Discrete-Time Processing of Speech Signals. Maxwell Macmillan International.   Deller J. R. Proakis J. G. and Hansen J. H. 1993. Discrete-Time Processing of Speech Signals. Maxwell Macmillan International."},{"volume-title":"Proceedings of the 6th European Conference on Computer Vision-Part II. Springer-Verlag","author":"Elgammal A.","key":"e_1_2_1_11_1","unstructured":"Elgammal , A. , Duraiswami , R. , Harwood , D. , and Davis , L. S . 2000. Non-parametric model for background subtraction . In Proceedings of the 6th European Conference on Computer Vision-Part II. Springer-Verlag , Dublin, Ireland, 751--767. Elgammal, A., Duraiswami, R., Harwood, D., and Davis, L. S. 2000. Non-parametric model for background subtraction. In Proceedings of the 6th European Conference on Computer Vision-Part II. Springer-Verlag, Dublin, Ireland, 751--767."},{"key":"e_1_2_1_12_1","unstructured":"Ellis D. P. W. 2001. Detecting alarm sounds. In Consistent and Reliable Acoustic Cues for Sound Analysis. Aalborg Denmark.  Ellis D. P. W. 2001. Detecting alarm sounds. In Consistent and Reliable Acoustic Cues for Sound Analysis. Aalborg Denmark."},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Foote J. T. and Cooper M. L. 2003. Media segmentation using self-similarity decomposition. In SPIE Storage and Retrieval for Multimedia Databases. vol. 5021. 167--175.  Foote J. T. and Cooper M. L. 2003. Media segmentation using self-similarity decomposition. In SPIE Storage and Retrieval for Multimedia Databases. vol. 5021. 167--175.","DOI":"10.1117\/12.476302"},{"key":"e_1_2_1_14_1","volume-title":"IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98)","volume":"6","author":"Gaunard P.","unstructured":"Gaunard , P. , Mubikangiey , C. , Couvreur , C. , and Fontaine , V . 1998. Automatic classification of environmental noise events by hidden markov models . In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98) . vol. 6 , 3609--3612. Gaunard, P., Mubikangiey, C., Couvreur, C., and Fontaine, V. 1998. Automatic classification of environmental noise events by hidden markov models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98). vol. 6, 3609--3612."},{"key":"e_1_2_1_15_1","volume-title":"IEEE International Conference on Multimedia and Expo (ICME","author":"H\u00e4rm\u00e4 A.","year":"2005","unstructured":"H\u00e4rm\u00e4 , A. , McKinney , M. , and Skowronek , J . 2005. Automatic surveillance of the acoustic activity in our living environment . In IEEE International Conference on Multimedia and Expo (ICME 2005 ). Amsterdam, Netherlands. H\u00e4rm\u00e4, A., McKinney, M., and Skowronek, J. 2005. Automatic surveillance of the acoustic activity in our living environment. In IEEE International Conference on Multimedia and Expo (ICME 2005). Amsterdam, Netherlands."},{"volume-title":"IEEE International Conference on Image Processing (ICIP)","author":"Kim K.","key":"e_1_2_1_16_1","unstructured":"Kim , K. , Chalidabhongse , T. H. , Harwood , D. , and Davis , L . 2004. Background modeling and subtraction by codebook construction . In IEEE International Conference on Image Processing (ICIP) . Singapore. Kim, K., Chalidabhongse, T. H., Harwood, D., and Davis, L. 2004. Background modeling and subtraction by codebook construction. In IEEE International Conference on Image Processing (ICIP). Singapore."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.3115\/1034678.1034693"},{"key":"e_1_2_1_18_1","volume-title":"IEEE International Conference on Multimedia and Expo (ICME","author":"Moncrieff S.","year":"2005","unstructured":"Moncrieff , S. , Venkatesh , S. , and West , G . 2005. Persistent audio modelling for background determination . In IEEE International Conference on Multimedia and Expo (ICME 2005 ). Amsterdam, Netherlands. Moncrieff, S., Venkatesh, S., and West, G. 2005. Persistent audio modelling for background determination. In IEEE International Conference on Multimedia and Expo (ICME 2005). Amsterdam, Netherlands."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2006.1141"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1026711.1026738"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/946249.946913"},{"key":"e_1_2_1_22_1","volume-title":"IEEE Computer Society Conference on Computer Vision and Pattern Recognition","volume":"2","author":"Stauffer C.","year":"1999","unstructured":"Stauffer , C. and Grimson , W . 1999. Adaptive background mixture models for real-time tracking . In IEEE Computer Society Conference on Computer Vision and Pattern Recognition ( 1999 ). vol. 2 . Fort Collins, CO USA, 246--252. Stauffer, C. and Grimson, W. 1999. Adaptive background mixture models for real-time tracking. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (1999). vol. 2. Fort Collins, CO USA, 246--252."},{"volume-title":"2nd Conference on Biomedical Engineering. ACTA Press, Ed","author":"Vacher M.","key":"e_1_2_1_23_1","unstructured":"Vacher , M. , Istrate , D. , Besacier , L. , Serignat , J. F. , and Castelli , E . 2004. Sound detection and classification for medical telesurvey . In 2nd Conference on Biomedical Engineering. ACTA Press, Ed . Innsbruck, Austria, 395--398. Vacher, M., Istrate, D., Besacier, L., Serignat, J. F., and Castelli, E. 2004. Sound detection and classification for medical telesurvey. In 2nd Conference on Biomedical Engineering. ACTA Press, Ed. Innsbruck, Austria, 395--398."},{"key":"e_1_2_1_24_1","volume-title":"Data Mining: Practical Machine Learning Tools with Java Implementations. Morgan Kaufmann.","author":"Witten I. H.","year":"2000","unstructured":"Witten , I. H. and Frank , E . 2000 . Data Mining: Practical Machine Learning Tools with Java Implementations. Morgan Kaufmann. Witten, I. H. and Frank, E. 2000. Data Mining: Practical Machine Learning Tools with Java Implementations. Morgan Kaufmann."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.598236"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1999.757472"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1230812.1230814","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1230812.1230814","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:47:28Z","timestamp":1750258048000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1230812.1230814"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,5]]},"references-count":26,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2007,5]]}},"alternative-id":["10.1145\/1230812.1230814"],"URL":"https:\/\/doi.org\/10.1145\/1230812.1230814","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2007,5]]},"assertion":[{"value":"2007-05-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}