{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T23:12:01Z","timestamp":1780701121010,"version":"3.54.1"},"reference-count":72,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2018,5,1]],"date-time":"2018-05-01T00:00:00Z","timestamp":1525132800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2018,5,31]]},"abstract":"<jats:p>In this article, we address the problem of recognizing an event from a single related picture. Given the large number of event classes and the limited information contained in a single shot, the problem is known to be particularly hard. To achieve a reliable detection, we propose a combination of multiple classifiers, and we compare three alternative strategies to fuse the results of each classifier, namely: (i) induced order weighted averaging operators, (ii) genetic algorithms, and (iii) particle swarm optimization. Each method is aimed at determining the optimal weights to be assigned to the decision scores yielded by different deep models, according to the relevant optimization strategy. Experimental tests have been performed on three event recognition datasets, evaluating the performance of various deep models, both alone and selectively combined. Experimental results demonstrate that the proposed approach outperforms traditional multiple classifier solutions based on uniform weighting, and outperforms recent state-of-the-art approaches.<\/jats:p>","DOI":"10.1145\/3199668","type":"journal-article","created":{"date-parts":[[2018,5,1]],"date-time":"2018-05-01T12:00:39Z","timestamp":1525176039000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":32,"title":["Ensemble of Deep Models for Event Recognition"],"prefix":"10.1145","volume":"14","author":[{"given":"Kashif","family":"Ahmad","sequence":"first","affiliation":[{"name":"University of Trento, Italy, Via Sommarive, Trento (Italy)"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mohamed Lamine","family":"Mekhalfi","sequence":"additional","affiliation":[{"name":"University of Trento, Italy, Via Sommarive, Trento (Italy)"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nicola","family":"Conci","sequence":"additional","affiliation":[{"name":"University of Trento, Italy, Via Sommarive, Trento (Italy)"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Farid","family":"Melgani","sequence":"additional","affiliation":[{"name":"University of Trento, Italy, Via Sommarive, Trento (Italy)"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Francesco De","family":"Natale","sequence":"additional","affiliation":[{"name":"University of Trento, Italy, Via Sommarive, Trento (Italy)"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,5]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2910017.2910624"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2017.09.009"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/GlobalSIP.2016.7906036"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the MediaEval 2017 Workshop (Sept. 13--15","author":"Ahmad Sheharyar","year":"2017"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-010-0182-0"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11047-007-9050-z"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TGRS.2006.880628"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.151"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2324796.2324823"},{"key":"e_1_2_1_10_1","unstructured":"Hyeran Byun and Seong-Whan Lee. 2002. Applications of support vector machines for pattern recognition: A survey. Pattern Recognit. Support Vector Mach. (2002) 571--591.   Hyeran Byun and Seong-Whan Lee. 2002. Applications of support vector machines for pattern recognition: A survey. Pattern Recognit. Support Vector Mach. (2002) 571--591."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.76"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201905)","volume":"5","author":"Chang Shih-Fu","year":"2005"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2013.01.013"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1646021"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-012-1153-6"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the International Conference on Evolutionary Programming. Springer, 611--616","author":"Russell"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2015.40"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871465"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2568--2577","author":"Gan Chuang"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CEC.2012.6256608"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP\u201915)","author":"Guo Cong","year":"2015"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.554205"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2011.2109710"},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the International Conference on Machine Learning. 448--456","author":"Ioffe Sergey","year":"2015"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2003.1200085"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0823-z"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2006.10.019"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13735-012-0024-2"},{"key":"e_1_2_1_32_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Adv. Neural Inform. Process. Syst. 1097--1105.   Alex Krizhevsky Ilya Sutskever and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Adv. Neural Inform. Process. Syst. 1097--1105."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-27355-1_18"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2007.4408872"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2015.44"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2461466.2461493"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/11526346_10"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-013-1426-8"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2006.63"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354988"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3092831"},{"key":"e_1_2_1_42_1","unstructured":"Symeon Papadopoulos Raphael Troncy Vasileios Mezaris Benoit Huet and Ioannis Kompatsiaris. 2011. Social event detection at mediaeval 2011: Challenges dataset and evaluation. In MediaEval.  Symeon Papadopoulos Raphael Troncy Vasileios Mezaris Benoit Huet and Ioannis Kompatsiaris. 2011. Social event detection at mediaeval 2011: Challenges dataset and evaluation. In MediaEval."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2010.68"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2015.7301335"},{"key":"e_1_2_1_45_1","volume-title":"Proc. ACM ICMR 2014 Workshop on Social Events in Web Multimedia (SEWM\u201914)","author":"Petkos Georgios","year":"2014"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2003.817150"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the Korea-Japan Joint Workshop on Frontiers of Computer Vision. 85--90","author":"Rachmadi Reza Fuad","year":"2016"},{"key":"e_1_2_1_48_1","unstructured":"Reza Fuad Rachmadi Keiichi Uchimura and Gou Koutaki. 2016. Spatial pyramid convolutional neural network for social event detection in static image. arXiv:1612.04062 (2016).  Reza Fuad Rachmadi Keiichi Uchimura and Gou Koutaki. 2016. Spatial pyramid convolutional neural network for social event detection in static image. arXiv:1612.04062 (2016)."},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop.","author":"Reuter Timo","year":"2013"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2441003"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2015.7301334"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2321392"},{"key":"e_1_2_1_53_1","volume-title":"Unsupervised Learning Algorithms","author":"Scrucca Luca"},{"key":"e_1_2_1_54_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014).  Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014)."},{"key":"e_1_2_1_55_1","doi-asserted-by":"crossref","unstructured":"Alan F. Smeaton. 1998. Independence of contributing retrieval strategies in data fusion for effective information retrieval. In BCS-IRSG Annual Colloquium on IR Research.   Alan F. Smeaton. 1998. Independence of contributing retrieval strategies in data fusion for effective information retrieval. In BCS-IRSG Annual Colloquium on IR Research.","DOI":"10.14236\/ewic\/IRSG1998.12"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1101149.1101236"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/1839707.1839759"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2016.05.005"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/215206.215357"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2015.7301333"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2015.46"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1043-5"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/79.888862"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2393347.2396332"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2007.23"},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1600--1609","author":"Xiong Yuanjun","year":"2015"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/21.155943"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/3477.752789"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/954339.954342"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.319"},{"key":"e_1_2_1_72_1","unstructured":"Bolei Zhou Agata Lapedriza Jianxiong Xiao Antonio Torralba and Aude Oliva. 2014. Learning deep features for scene recognition using places database. In Advances in Neural Information Processing Systems. 487--495.   Bolei Zhou Agata Lapedriza Jianxiong Xiao Antonio Torralba and Aude Oliva. 2014. Learning deep features for scene recognition using places database. In Advances in Neural Information Processing Systems. 487--495."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3199668","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3199668","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T19:07:18Z","timestamp":1750273638000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3199668"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,5]]},"references-count":72,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,5,31]]}},"alternative-id":["10.1145\/3199668"],"URL":"https:\/\/doi.org\/10.1145\/3199668","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,5]]},"assertion":[{"value":"2017-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-05-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}