{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:19:56Z","timestamp":1750306796330,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2013,11,5]],"date-time":"2013-11-05T00:00:00Z","timestamp":1383609600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2013,11,5]]},"DOI":"10.1145\/2526188.2526202","type":"proceedings-article","created":{"date-parts":[[2013,11,12]],"date-time":"2013-11-12T15:29:36Z","timestamp":1384270176000},"page":"15-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Multimodal late fusion bag of features applied to scene detection"],"prefix":"10.1145","author":[{"given":"Bruno Loren\u00e7o","family":"Lopes","sequence":"first","affiliation":[{"name":"Universidade de S\u00e3o Paulo (USP), S\u00e3o Carlos, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rudinei","family":"Goularte","sequence":"additional","affiliation":[{"name":"Universidade de S\u00e3o Paulo (USP), S\u00e3o Carlos, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,11,5]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.99"},{"key":"e_1_3_2_1_2_1","volume-title":"Multimodal fusion for multimedia analysis: a survey","author":"Atrey P. K.","year":"2010","unstructured":"P. K. Atrey , M. A. Hossain , A. E. Saddik , and M. S. Kankanhalli . Multimodal fusion for multimedia analysis: a survey , 2010 . P. K. Atrey, M. A. Hossain, A. E. Saddik, and M. S. Kankanhalli. Multimodal fusion for multimedia analysis: a survey, 2010."},{"key":"e_1_3_2_1_3_1","volume-title":"Separation of voiced and unvoiced using zero crossing rate and energy of the speech signal","author":"Bachu R. G.","year":"2008","unstructured":"R. G. Bachu and S. Kopparthi . Separation of voiced and unvoiced using zero crossing rate and energy of the speech signal . 2008 . R. G. Bachu and S. Kopparthi. Separation of voiced and unvoiced using zero crossing rate and energy of the speech signal. 2008."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2324796.2324816"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2072298.2071933"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2002.1035721"},{"key":"e_1_3_2_1_7_1","first-page":"687","volume-title":"Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on","volume":"2","author":"Chang S.-F.","year":"2000","unstructured":"S.-F. Chang and H. Sundaram . Structural and semantic analysis of video . In Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on , volume 2 , pages 687 -- 690 vol.2, 2000 . S.-F. Chang and H. Sundaram. Structural and semantic analysis of video. In Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on, volume 2, pages 687--690 vol.2, 2000."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1646396.1646439"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-87536-9_87"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2072298.2072030"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1413918.1413922"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00357-010-9049-5"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/1776814.1776844"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1117\/12.290336","volume-title":"MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II","author":"Foote J. T.","year":"1997","unstructured":"J. T. Foote . Content-based retrieval of music and audio . In MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II , PROC. OF SPIE , pages 138 -- 147 , 1997 . J. T. Foote. Content-based retrieval of music and audio. In MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, PROC. OF SPIE, pages 138--147, 1997."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12842-4_13"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.767124"},{"key":"e_1_3_2_1_17_1","volume-title":"I. Laptev, and C. Schmid. Will person detection help bag-of-features action recognition?","author":"Klaser A.","year":"2010","unstructured":"A. Klaser , M. Marsza lek , I. Laptev, and C. Schmid. Will person detection help bag-of-features action recognition? , 2010 . A. Klaser, M. Marsza lek, I. Laptev, and C. Schmid. Will person detection help bag-of-features action recognition?, 2010."},{"key":"e_1_3_2_1_18_1","volume-title":"C. Schmid, and A. Zisserman. Human focused action localization in video","author":"Klaser A.","year":"2010","unstructured":"A. Klaser , M. Marsza lek , C. Schmid, and A. Zisserman. Human focused action localization in video , 2010 . A. Klaser, M. Marsza lek, C. Schmid, and A. Zisserman. Human focused action localization in video, 2010."},{"key":"e_1_3_2_1_19_1","first-page":"970","volume-title":"IICAI","author":"Kumar N.","year":"2011","unstructured":"N. Kumar , P. Rai , C. Pulla , and C. V. Jawahar . Video scene segmentation with a semantic similarity. In B. Prasad, P. Lingras, and R. Nevatia, editors , IICAI , pages 970 -- 981 . IICAI, 2011 . N. Kumar, P. Rai, C. Pulla, and C. V. Jawahar. Video scene segmentation with a semantic similarity. In B. Prasad, P. Lingras, and R. Nevatia, editors, IICAI, pages 970--981. IICAI, 2011."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.5555\/1314498.1314576"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2039331.2039333"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/1816041.1816057"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1386352.1386444"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_2_1_25_1","first-page":"281","volume-title":"Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability","volume":"1","author":"MacQueen J. B.","year":"1967","unstructured":"J. B. MacQueen . Some methods for classification and analysis of multivariate observations. In L. M. L. Cam and J. Neyman, editors , Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability , volume 1 , pages 281 -- 297 . University of California Press , 1967 . J. B. MacQueen. Some methods for classification and analysis of multivariate observations. In L. M. L. Cam and J. Neyman, editors, Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, volume 1, pages 281--297. University of California Press, 1967."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2004.02.004"},{"key":"e_1_3_2_1_27_1","first-page":"565","volume-title":"3rd International Conference on Electrical & Computer Engineer, number 2004","author":"Rashidul Hasan M. J. Md.","year":"2004","unstructured":"M. J. Md. Rashidul Hasan . Speaker identification using mel frequency cepstral coefficients . In 3rd International Conference on Electrical & Computer Engineer, number 2004 , pages 565 -- 568 , Dhaka- Bangladesh , 2004 . ICECE. M. J. Md. Rashidul Hasan. Speaker identification using mel frequency cepstral coefficients. In 3rd International Conference on Electrical & Computer Engineer, number 2004, pages 565--568, Dhaka- Bangladesh, 2004. ICECE."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICAPR.2009.35"},{"key":"e_1_3_2_1_29_1","first-page":"849","volume-title":"ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS","author":"Ng A. Y.","year":"2001","unstructured":"A. Y. Ng , M. I. Jordan , and Y. Weiss . On spectral clustering: Analysis and an algorithm . In ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS , pages 849 -- 856 . MIT Press , 2001 . A. Y. Ng, M. I. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS, pages 849--856. MIT Press, 2001."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.2298\/CSIS100618036P"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2003.1211489"},{"key":"e_1_3_2_1_32_1","volume-title":"USA","author":"Rijsbergen C. J. V.","year":"1979","unstructured":"C. J. V. Rijsbergen . Information Retrieval . Butterworth-Heinemann, Newton, MA , USA , 2 nd edition, 1979 . C. J. V. Rijsbergen. Information Retrieval. Butterworth-Heinemann, Newton, MA, USA, 2nd edition, 1979.","edition":"2"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPA.2007.4383752"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2009.03.011"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.895972"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2000.871563"},{"key":"e_1_3_2_1_37_1","first-page":"193","volume-title":"Proceedings of the Twenty-eighth Australasian conference on Computer Science -","volume":"38","author":"Tahaghoghi S. M. M.","year":"2005","unstructured":"S. M. M. Tahaghoghi , H. E. Williams , J. A. Thom , and T. Volkmer . Video cut detection using frame windows . In Proceedings of the Twenty-eighth Australasian conference on Computer Science - Volume 38 , ACSC '05, pages 193 -- 199 , Darlinghurst, Australia, Australia , 2005 . Australian Computer Society, Inc. S. M. M. Tahaghoghi, H. E. Williams, J. A. Thom, and T. Volkmer. Video cut detection using frame windows. In Proceedings of the Twenty-eighth Australasian conference on Computer Science - Volume 38, ACSC '05, pages 193--199, Darlinghurst, Australia, Australia, 2005. Australian Computer Society, Inc."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2004.1261105"},{"key":"e_1_3_2_1_39_1","first-page":"2006","volume-title":"Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on","volume":"2","author":"Wang J.","unstructured":"J. Wang , L. Duan , H. Lu , J. Jin , and C. Xu . A mid-level scene change representation via audiovisual alignment. In Acoustics , Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on , volume 2 , page II, may 2006 . J. Wang, L. Duan, H. Lu, J. Jin, and C. Xu. A mid-level scene change representation via audiovisual alignment. In Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, volume 2, page II, may 2006."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1290082.1290111"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1006\/cviu.1997.0628"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2005.6"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2006.876299"}],"event":{"name":"WebMedia '13: 19th Brazilian Symposium on Multimedia and the Web","sponsor":["SBC Brazilian Computer Society","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGMM ACM Special Interest Group on Multimedia"],"location":"Salvador Brazil","acronym":"WebMedia '13"},"container-title":["Proceedings of the 19th Brazilian symposium on Multimedia and the web"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2526188.2526202","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2526188.2526202","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:34:36Z","timestamp":1750232076000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2526188.2526202"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,11,5]]},"references-count":43,"alternative-id":["10.1145\/2526188.2526202","10.1145\/2526188"],"URL":"https:\/\/doi.org\/10.1145\/2526188.2526202","relation":{},"subject":[],"published":{"date-parts":[[2013,11,5]]},"assertion":[{"value":"2013-11-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}