{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:31:19Z","timestamp":1750221079524,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":36,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,16]],"date-time":"2018-10-16T00:00:00Z","timestamp":1539648000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,16]]},"DOI":"10.1145\/3243082.3243108","type":"proceedings-article","created":{"date-parts":[[2018,9,19]],"date-time":"2018-09-19T12:16:51Z","timestamp":1537359411000},"page":"205-212","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["A new multimodal deep-learning model to video scene segmentation"],"prefix":"10.1145","author":[{"given":"Tiago H.","family":"Trojahn","sequence":"first","affiliation":[{"name":"University of S\u00e3o Paulo, Federal Institute of S\u00e3o Paulo, S\u00e3o Carlos - SP - Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rodrigo M.","family":"Kishi","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, Federal University of Mato Grosso do Sul, S\u00e3o Carlos - SP - Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rudinei","family":"Goularte","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, S\u00e3o Carlos - SP - Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,10,16]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.99"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-010-0182-0"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806316"},{"key":"e_1_3_2_1_4_1","series-title":"Lecture Notes in Computer Science","volume-title":"Measuring Scene Detection Performance","author":"Baraldi Lorenzo","unstructured":"Lorenzo Baraldi , Costantino Grana , and Rita Cucchiara . 2015. Measuring Scene Detection Performance . In Pattern Recognition and Image Analysis, Roberto Paredes, Jaime S. Cardoso, and Xos\u00e9 M. Pardo (Eds.). Lecture Notes in Computer Science , Vol. 9117 . Springer International Publishing , Cham , 395--403. Lorenzo Baraldi, Costantino Grana, and Rita Cucchiara. 2015. Measuring Scene Detection Performance. In Pattern Recognition and Image Analysis, Roberto Paredes, Jaime S. Cardoso, and Xos\u00e9 M. Pardo (Eds.). Lecture Notes in Computer Science, Vol. 9117. Springer International Publishing, Cham, 395--403."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2644872"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2695664.2695841"},{"volume-title":"Practical Recommendations for Gradient-Based Training of Deep Architectures (2 ed.)","author":"Bengio Yoshua","key":"e_1_3_2_1_7_1","unstructured":"Yoshua Bengio . 2012. Practical Recommendations for Gradient-Based Training of Deep Architectures (2 ed.) . Springer Berlin Heidelberg , Berlin, Heidelberg , 437--478. Yoshua Bengio. 2012. Practical Recommendations for Gradient-Based Training of Deep Architectures (2 ed.). Springer Berlin Heidelberg, Berlin, Heidelberg, 437--478."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.422.0233"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2008.07.003"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1646396.1646439"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2008.2008924"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_3_2_1_13_1","volume-title":"NIPS 2014 Deep Learning and Representation Learning Workshop. Curran Associates Inc.","author":"Chung Junyoung","year":"2014","unstructured":"Junyoung Chung , Caglar Gulcehre , Kyunghyun Cho , and Yoshua Bengio . 2014 . Empirical evaluation of gated recurrent neural networks on sequence modeling . In NIPS 2014 Deep Learning and Representation Learning Workshop. Curran Associates Inc. , Red Hook, NY, USA. Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS 2014 Deep Learning and Representation Learning Workshop. Curran Associates Inc., Red Hook, NY, USA."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-013-0306-4"},{"volume-title":"Personalized Digital Television: Targeting Programs to individual Viewers","author":"Dimitrova Nevenka","key":"e_1_3_2_1_15_1","unstructured":"Nevenka Dimitrova , John Zimmerman , Angel Janevski , Lalitha Agnihotri , Norman Haas , Dongge Li , Ruud Bolle , Senem Velipasalar , Thomas Mcgeeand , and Lira Nikolovska . 2004. Personalized Digital Television: Targeting Programs to individual Viewers . Springer Netherlands , Dordrecht, NL , Chapter Media Augmentation and Personalization Through Multimedia Processing and Information Extraction, 203--233. Nevenka Dimitrova, John Zimmerman, Angel Janevski, Lalitha Agnihotri, Norman Haas, Dongge Li, Ruud Bolle, Senem Velipasalar, Thomas Mcgeeand, and Lira Nikolovska. 2004. Personalized Digital Television: Targeting Programs to individual Viewers. Springer Netherlands, Dordrecht, NL, Chapter Media Augmentation and Personalization Through Multimedia Processing and Information Extraction, 203--233."},{"volume-title":"2013 International Conference on Intelligent Systems and Signal Processing (ISSP' 13)","author":"Gupta A.","key":"e_1_3_2_1_16_1","unstructured":"A. Gupta and H. Gupta . 2013. Applications of MFCC and Vector Quantization in speaker recognition . In 2013 International Conference on Intelligent Systems and Signal Processing (ISSP' 13) . IEEE, Washington, DC, USA, 170--173. A. Gupta and H. Gupta. 2013. Applications of MFCC and Vector Quantization in speaker recognition. In 2013 International Conference on Intelligent Systems and Signal Processing (ISSP' 13). IEEE, Washington, DC, USA, 170--173."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/76.767124"},{"volume-title":"Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 770--778","author":"He K.","key":"e_1_3_2_1_18_1","unstructured":"K. He , X. Zhang , S. Ren , and J. Sun . 2016 . Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 770--778 . K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 770--778."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2976796.2988174"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 25th International Conference on Neural Information Processing System (NIPS'12)","volume":"1","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E Hinton . 2012 . ImageNet Classification with Deep Convolutional Neural Networks . In Proceedings of the 25th International Conference on Neural Information Processing System (NIPS'12) , Vol. 1 . Curran Associates Inc., Red Hook, NY, USA, 1097--1105. http:\/\/dl.acm.org\/citation.cfm?id=2999134.2999257 Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the 25th International Conference on Neural Information Processing System (NIPS'12), Vol. 1. Curran Associates Inc., Red Hook, NY, USA, 1097--1105. http:\/\/dl.acm.org\/citation.cfm?id=2999134.2999257"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0621-0"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2005.188"},{"volume-title":"Advanced Data Mining and Applications","author":"Qin Zengchang","key":"e_1_3_2_1_24_1","unstructured":"Zengchang Qin , Wei Liu , and Tao Wan . 2013. A Bag-of-Tones Model with MFCC Features for Musical Genre Classification . In Advanced Data Mining and Applications , Hiroshi Motoda, Zhaohui Wu, Longbing Cao, Osmar Zaiane, Min Yao, and Wei Wang (Eds.). Springer Berlin Heidelberg, Berlin , Heidelberg , 564--575. https:\/\/link.springer.com\/chapter\/10.1007\/978-3-642-53914-5_48 Zengchang Qin, Wei Liu, and Tao Wan. 2013. A Bag-of-Tones Model with MFCC Features for Musical Genre Classification. In Advanced Data Mining and Applications, Hiroshi Motoda, Zhaohui Wu, Longbing Cao, Osmar Zaiane, Min Yao, and Wei Wang (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 564--575. https:\/\/link.springer.com\/chapter\/10.1007\/978-3-642-53914-5_48"},{"volume-title":"Information Retrieval (2 ed.)","author":"Rijsbergen C. G.","key":"e_1_3_2_1_25_1","unstructured":"C. G. Rijsbergen . 1979. Information Retrieval (2 ed.) . Butterworths , London . 224 pages. C. G. Rijsbergen. 1979. Information Retrieval (2 ed.). Butterworths, London. 224 pages."},{"volume-title":"IEEE Conference on Computer Vision and Pattern Recognition. IEEE","author":"Schroff F.","key":"e_1_3_2_1_26_1","unstructured":"F. Schroff , D. Kalenichenko , and J. Philbin . 2015. FaceNet: A unified embedding for face recognition and clustering . In IEEE Conference on Computer Vision and Pattern Recognition. IEEE , Washington, DC, USA, 815--823. F. Schroff, D. Kalenichenko, and J. Philbin. 2015. FaceNet: A unified embedding for face recognition and clustering. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Washington, DC, USA, 815--823."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2011.2181231"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2011.2138830"},{"key":"e_1_3_2_1_29_1","volume-title":"Very Deep Convolutional Networks for Large-Scale Image Recognition. ArXiv e-prints abs\/1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. ArXiv e-prints abs\/1409.1556 ( 2014 ). arXiv:1409.1556 http:\/\/arxiv.org\/abs\/1409.1556 Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. ArXiv e-prints abs\/1409.1556 (2014). arXiv:1409.1556 http:\/\/arxiv.org\/abs\/1409.1556"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.895972"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1101149.1101236"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2002.802021"},{"key":"e_1_3_2_1_33_1","first-page":"3","article-title":"A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction","volume":"64","author":"Wiatowski Thomas","year":"2018","unstructured":"Thomas Wiatowski and Helmut Bolcskei . 2018 . A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction . IEEE Transactions on Information Theory 64 , 3 (mar 2018), 1845--1866. Thomas Wiatowski and Helmut Bolcskei. 2018. A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction. IEEE Transactions on Information Theory 64, 3 (mar 2018), 1845--1866.","journal-title":"IEEE Transactions on Information Theory"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.12785\/amis\/090142"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2964328"},{"key":"e_1_3_2_1_36_1","volume-title":"A Critical Review of Recurrent Neural Networks for Sequence Learning. arXiv ePrint abs\/1506.00019","author":"Zachary Chase Lipton Charles Elkan","year":"2015","unstructured":"Charles Elkan Zachary Chase Lipton , John Berkowitz . 2015. A Critical Review of Recurrent Neural Networks for Sequence Learning. arXiv ePrint abs\/1506.00019 ( 2015 ). http:\/\/arxiv.org\/abs\/1506.00019 Charles Elkan Zachary Chase Lipton, John Berkowitz. 2015. A Critical Review of Recurrent Neural Networks for Sequence Learning. arXiv ePrint abs\/1506.00019 (2015). http:\/\/arxiv.org\/abs\/1506.00019"}],"event":{"name":"WebMedia '18: Brazilian Symposium on Multimedia and the Web","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SBC Brazilian Computer Society","SIGMM ACM Special Interest Group on Multimedia","CNPq Conselho Nacional de Desenvolvimento Cientifico e Tecn","CGIBR Comite Gestor da Internet no Brazil","CAPES Brazilian Higher Education Funding Council"],"location":"Salvador BA Brazil","acronym":"WebMedia '18"},"container-title":["Proceedings of the 24th Brazilian Symposium on Multimedia and the Web"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3243082.3243108","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3243082.3243108","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:57:38Z","timestamp":1750208258000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3243082.3243108"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,16]]},"references-count":36,"alternative-id":["10.1145\/3243082.3243108","10.1145\/3243082"],"URL":"https:\/\/doi.org\/10.1145\/3243082.3243108","relation":{},"subject":[],"published":{"date-parts":[[2018,10,16]]},"assertion":[{"value":"2018-10-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}