{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T08:03:00Z","timestamp":1779350580462,"version":"3.51.4"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2013,5,1]],"date-time":"2013-05-01T00:00:00Z","timestamp":1367366400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2013,5]]},"abstract":"<jats:p>Despite the fact that performance improvements have been reported in the last years, semantic concept detection in video remains a challenging problem. Existing concept detection techniques, with ontology rules, exploit the static correlations among primitive concepts but not the dynamic spatiotemporal correlations. The proposed method rewards (or punishes) detected primitive concepts using dynamic spatiotemporal correlations of the given ontology rules and updates these ontology rules based on the accuracy of detection. 
Adaptively learned ontology rules significantly help in improving the overall accuracy of concept detection as shown in the experimental result.<\/jats:p>","DOI":"10.1145\/2457450.2457452","type":"journal-article","created":{"date-parts":[[2013,5,14]],"date-time":"2013-05-14T12:15:20Z","timestamp":1368533720000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["A reward-and-punishment-based approach for concept detection using adaptive ontology rules"],"prefix":"10.1145","volume":"9","author":[{"given":"Chidansh A.","family":"Bhatt","sequence":"first","affiliation":[{"name":"National University of Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pradeep K.","family":"Atrey","sequence":"additional","affiliation":[{"name":"University of Winnipeg, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mohan S.","family":"Kankanhalli","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,5,10]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Proceedings of the TREC Video Retrieval Evaluation (NIST TRECVID'03)","author":"Amir A.","unstructured":"Amir , A. , Berg , M. , Chang , S.-F. , Iyengar , G. , Lin , C.-Y. , Natsev , A. , Neti , C. , Nock , H. , Naphade , M. , Hsu , W. , Smith , J. R. , Tseng , B. , Wu , Y. , Zhang , D. , and Watson , I. T. J. 2003. Ibm research trecvid-2003 video retrieval system . In Proceedings of the TREC Video Retrieval Evaluation (NIST TRECVID'03) . Amir, A., Berg, M., Chang, S.-F., Iyengar, G., Lin, C.-Y., Natsev, A., Neti, C., Nock, H., Naphade, M., Hsu, W., Smith, J. R., Tseng, B., Wu, Y., Zhang, D., and Watson, I. T. J. 2003. Ibm research trecvid-2003 video retrieval system. 
In Proceedings of the TREC Video Retrieval Evaluation (NIST TRECVID'03)."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2003.06.004"},{"key":"e_1_2_2_3_1","doi-asserted-by":"crossref","unstructured":"Bai L. Lao S. Zhang W. Jones G. J. and \n      Smeaton A. F\n  . \n  2007\n  . Video semantic content analysis based on ontology combinedmpeg-7. In Adaptive Multimedial Retrieval: Retrieval User and Semantics Lecture Notes in Computer Science vol. \n  4918 Springer 237--250.  Bai L. Lao S. Zhang W. Jones G. J. and Smeaton A. F. 2007. Video semantic content analysis based on ontology combinedmpeg-7. In Adaptive Multimedial Retrieval: Retrieval User and Semantics Lecture Notes in Computer Science vol. 4918 Springer 237--250.","DOI":"10.1007\/978-3-540-79860-6_19"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0643-7"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2010.4"},{"key":"e_1_2_2_6_1","volume-title":"Decision Theory: An Introduction to Dynamic Programming and Sequential Decisions","author":"Bather J.","year":"2000","unstructured":"Bather , J. 2000 . Decision Theory: An Introduction to Dynamic Programming and Sequential Decisions . John Wiley & Sons . Bather, J. 2000. Decision Theory: An Introduction to Dynamic Programming and Sequential Decisions. John Wiley & Sons."},{"key":"e_1_2_2_7_1","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo.","author":"Bertini M.","unstructured":"Bertini , M. , Cucchiara , R. , del Bimbo , A. , and Torniai , C . 2005. Video annotation with pictorially enriched ontologies . In Proceedings of the IEEE International Conference on Multimedia and Expo. Bertini, M., Cucchiara, R., del Bimbo, A., and Torniai, C. 2005. Video annotation with pictorially enriched ontologies. 
In Proceedings of the IEEE International Conference on Multimedia and Expo."},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0645-5"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.868685"},{"key":"e_1_2_2_10_1","volume-title":"Proceedings of the International Workshop on Ontology Dynamics.","author":"Castano S.","unstructured":"Castano , S. , Espinosa , S. , Ferrara , A. , Karkaletsis , V. , Kaya , A. , Melzer , S. , Moller , R. , Montanelli , S. , and Petasis , G . 2007. Ontology dynamics with multimedia information: The boemie evolution methodology . In Proceedings of the International Workshop on Ontology Dynamics. Castano, S., Espinosa, S., Ferrara, A., Karkaletsis, V., Kaya, A., Melzer, S., Moller, R., Montanelli, S., and Petasis, G. 2007. Ontology dynamics with multimedia information: The boemie evolution methodology. In Proceedings of the International Workshop on Ontology Dynamics."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1093\/logcom\/exn049"},{"key":"e_1_2_2_12_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.","author":"Chao C.-Y.","unstructured":"Chao , C.-Y. , Shih , H.-C. , and Huang , C . -L. 2005. Semantics-Based highlight extraction of soccer program using dbn . In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Chao, C.-Y., Shih, H.-C., and Huang, C.-L. 2005. Semantics-Based highlight extraction of soccer program using dbn. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-009-0393-6"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2005.854238"},{"key":"e_1_2_2_15_1","unstructured":"Everingham M. van Gool L. Williams C. K. I. Winn J. and Zisserman A. 2011. The PASCAL visual object classes challenge 2011 (VOC2011) results. 
http:\/\/www.pascal-network.org\/challenges\/VOC\/voc2011\/workshop\/index.html.  Everingham M. van Gool L. Williams C. K. I. Winn J. and Zisserman A. 2011. The PASCAL visual object classes challenge 2011 (VOC2011) results. http:\/\/www.pascal-network.org\/challenges\/VOC\/voc2011\/workshop\/index.html."},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1155\/2009\/924287"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1282280.1282311"},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2009.2014507"},{"key":"e_1_2_2_19_1","volume-title":"Proceedings of the 9th International Conference on Artificial Neural Networks.","volume":"1","author":"Kohlmorgen J.","unstructured":"Kohlmorgen , J. , Lemm , S. , Muller , K. , Liehr , S. , and Pawelzik , K . 1999. Fast change point detection in switching dynamics using a hidden Markov model of prediction experts . In Proceedings of the 9th International Conference on Artificial Neural Networks. Vol. 1 . 204--209. Kohlmorgen, J., Lemm, S., Muller, K., Liehr, S., and Pawelzik, K. 1999. Fast change point detection in switching dynamics using a hidden Markov model of prediction experts. In Proceedings of the 9th International Conference on Artificial Neural Networks. Vol. 1. 204--209."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920893"},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2010.02.006"},{"key":"e_1_2_2_22_1","volume-title":"Proceedings of the TREC Video Retrieval Evaluation (TRECVID'11)","author":"Over P.","unstructured":"Over , P. , Awad , G. , Michel , M. , Fiscus , J. , Kraaij , W. , Smeaton , A. F. , and Quenot , G . 2011. Trecvid 2011 -- An overview of the goals, tasks, data, evaluation mechanisms and metrics . In Proceedings of the TREC Video Retrieval Evaluation (TRECVID'11) Workshop. Over, P., Awad, G., Michel, M., Fiscus, J., Kraaij, W., Smeaton, A. F., and Quenot, G. 2011. 
Trecvid 2011 -- An overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of the TREC Video Retrieval Evaluation (TRECVID'11) Workshop."},{"key":"e_1_2_2_23_1","doi-asserted-by":"crossref","unstructured":"Petridis S. and Perantonis S. J. 2011. Semantics extraction from multimedia data: An ontology-based machine learning approach. In Perception-Action Cycle Series in Cognitive and Neural Systems Springer.  Petridis S. and Perantonis S. J. 2011. Semantics extraction from multimedia data: An ontology-based machine learning approach. In Perception-Action Cycle Series in Cognitive and Neural Systems Springer.","DOI":"10.1007\/978-1-4419-1452-1_12"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1291233.1291245"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2005.854237"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2007.911830"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1178677.1178722"},{"key":"e_1_2_2_28_1","volume-title":"Proceedings of the IEEE International Conference on Multimedia and Expo. 445--448","author":"Smith J. R.","unstructured":"Smith , J. R. , Naphade , M. , and Natsev , A . 2003. Multimedia semantic indexing using model vectors . In Proceedings of the IEEE International Conference on Multimedia and Expo. 445--448 . Smith, J. R., Naphade, M., and Natsev, A. 2003. Multimedia semantic indexing using model vectors. In Proceedings of the IEEE International Conference on Multimedia and Expo. 445--448."},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2009.2017400"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2009.2012919"},{"key":"e_1_2_2_31_1","volume-title":"Proceedings of the International Conference on Multimedia and Expo.","author":"Wu Y.","unstructured":"Wu , Y. , Tseng , B. L. , and Smith , J. R . 2004. 
Ontology-Based multi-classification learning for video concept detection . In Proceedings of the International Conference on Multimedia and Expo. Wu, Y., Tseng, B. L., and Smith, J. R. 2004. Ontology-Based multi-classification learning for video concept detection. In Proceedings of the International Conference on Multimedia and Expo."},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.129"},{"key":"e_1_2_2_33_1","volume-title":"Proceedings of the International Conference on Multimedia and Expo.","author":"Xu P.","unstructured":"Xu , P. , Xie , L. , Chang , S.-F. , Divakaran , A. , Vetro , A. , and Sun , H . 2001. Algorithms and system for segmentation and structure analysis in soccer video . In Proceedings of the International Conference on Multimedia and Expo. Xu, P., Xie, L., Chang, S.-F., Divakaran, A., Vetro, A., and Sun, H. 2001. Algorithms and system for segmentation and structure analysis in soccer video. In Proceedings of the International Conference on Multimedia and Expo."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281281"},{"key":"e_1_2_2_35_1","unstructured":"Yanagawa A. Chang S.-F. Kennedy L. and Hsu W. 2007. Columbia university's baseline detectors for 374 lscom semantic visual concepts. Tech. rep. 222-2006 Columbia University.  Yanagawa A. Chang S.-F. Kennedy L. and Hsu W. 2007. Columbia university's baseline detectors for 374 lscom semantic visual concepts. Tech. rep. 
222-2006 Columbia University."},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1032604.1032616"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1290082.1290114"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1459359.1459391"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2457450.2457452","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2457450.2457452","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:19:13Z","timestamp":1750234753000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2457450.2457452"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,5]]},"references-count":38,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2013,5]]}},"alternative-id":["10.1145\/2457450.2457452"],"URL":"https:\/\/doi.org\/10.1145\/2457450.2457452","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,5]]},"assertion":[{"value":"2011-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-05-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}