{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T02:09:02Z","timestamp":1767924542801,"version":"3.49.0"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2019,2,13]],"date-time":"2019-02-13T00:00:00Z","timestamp":1550016000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001711","name":"Swiss National Science Foundation","doi-asserted-by":"crossref","award":["20CH21_151571"],"award-info":[{"award-number":["20CH21_151571"]}],"id":[{"id":"10.13039\/501100001711","id-type":"DOI","asserted-by":"crossref"}]},{"name":"European Regional Development Fund and the Carinthian Economic Promotion Fund","award":["KWF 20214 u. 3520\/26336\/38165"],"award-info":[{"award-number":["KWF 20214 u. 3520\/26336\/38165"]}]},{"name":"Universit\u00e4t Klagenfurt and Lakeside Labs GmbH, Klagenfurt, Austria"},{"name":"Council of the Hong Kong Special Administrative Region, China","award":["CityU 11250716"],"award-info":[{"award-number":["CityU 11250716"]}]},{"DOI":"10.13039\/501100001824","name":"Czech Science Foundation","doi-asserted-by":"crossref","award":["17-22224S"],"award-info":[{"award-number":["17-22224S"]}],"id":[{"id":"10.13039\/501100001824","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Horizon 2020 Research and Innovation Programme V4Design","award":["779962"],"award-info":[{"award-number":["779962"]}]},{"name":"CHIST-ERA project IMOTION"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2019,2,28]]},"abstract":"<jats:p>This work summarizes the findings of the 7th iteration of the Video Browser Showdown (VBS) competition organized as a workshop at the 24th International Conference on Multimedia Modeling in Bangkok. The competition focuses on video retrieval scenarios in which the searched scenes were either previously observed or described by another person (i.e., an example shot is not available). During the event, nine teams competed with their video retrieval tools in providing access to a shared video collection with 600 hours of video content. Evaluation objectives, rules, scoring, tasks, and all participating tools are described in the article. In addition, we provide some insights into how the different teams interacted with their video browsers, which was made possible by a novel interaction logging mechanism introduced for this iteration of the VBS. The results collected at the VBS evaluation server confirm that searching for one particular scene in the collection when given a limited time is still a challenging task for many of the approaches that were showcased during the event. Given only a short textual description, finding the correct scene is even harder. In ad hoc search with multiple relevant scenes, the tools were mostly able to find at least one scene, whereas recall was the issue for many teams. The logs also reveal that even though recent exciting advances in machine learning narrow the classical semantic gap problem, user-centric interfaces are still required to mediate access to specific content. Finally, open challenges and lessons learned are presented for future VBS events.<\/jats:p>","DOI":"10.1145\/3295663","type":"journal-article","created":{"date-parts":[[2019,2,14]],"date-time":"2019-02-14T19:36:17Z","timestamp":1550172977000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":43,"title":["Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3558-4144","authenticated-orcid":false,"given":"Jakub","family":"Loko\u010d","sequence":"first","affiliation":[{"name":"Charles University, Prague, Czech Republic"}]},{"given":"Gregor","family":"Koval\u010d\u00edk","sequence":"additional","affiliation":[{"name":"Charles University, Prague, Czech Republic"}]},{"given":"Bernd","family":"M\u00fcnzer","sequence":"additional","affiliation":[{"name":"Klagenfurt University, Austria"}]},{"given":"Klaus","family":"Sch\u00f6ffmann","sequence":"additional","affiliation":[{"name":"Klagenfurt University, Austria"}]},{"given":"Werner","family":"Bailer","sequence":"additional","affiliation":[{"name":"JOANNEUM RESEARCH, Steyrergasse, Graz, Austria"}]},{"given":"Ralph","family":"Gasser","sequence":"additional","affiliation":[{"name":"University of Basel, Basel, Switzerland"}]},{"given":"Stefanos","family":"Vrochidis","sequence":"additional","affiliation":[{"name":"Centre for Research and Technology Hellas, Thessaloniki, Greece"}]},{"given":"Phuong Anh","family":"Nguyen","sequence":"additional","affiliation":[{"name":"City University of Hong Kong, Tatchee Ave, Kowloon Tong, Hong Kong"}]},{"given":"Sitapa","family":"Rujikietgumjorn","sequence":"additional","affiliation":[{"name":"National Electronics and Computer Technology Center, Thailand"}]},{"given":"Kai Uwe","family":"Barthel","sequence":"additional","affiliation":[{"name":"HTW Berlin, Visual Computing Group, Berlin, Germany"}]}],"member":"320","published-online":{"date-parts":[[2019,2,13]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Home Page. Retrieved","author":"Elasticsearch","year":"2018","unstructured":"Elasticsearch : RESTful, Distributed Search 8 Analytics . Home Page. Retrieved March 30, 2018 , from https:\/\/www.elastic.co\/products\/elasticsearch. Elasticsearch: RESTful, Distributed Search 8 Analytics. Home Page. Retrieved March 30, 2018, from https:\/\/www.elastic.co\/products\/elasticsearch."},{"key":"e_1_2_1_2_1","unstructured":"NearPy. Home Page. Retrieved March 30 2018 from https:\/\/github.com\/pixelogik\/NearPy.  NearPy. Home Page. Retrieved March 30 2018 from https:\/\/github.com\/pixelogik\/NearPy."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3095713.3095740"},{"key":"e_1_2_1_4_1","volume-title":"TRECVID 2017: Evaluating ad-hoc and instance video search, events detection, video captioning and hyperlinking. In Proceedings of the 17th AnnualTREC Video Retrieval Evaluation (TRECVID\u201917)","author":"Awad George","year":"2017","unstructured":"George Awad , Asad Butt , Jonathan Fiscus , Martial Michel , David Joy , Wessel Kraaij , 2017 . TRECVID 2017: Evaluating ad-hoc and instance video search, events detection, video captioning and hyperlinking. In Proceedings of the 17th AnnualTREC Video Retrieval Evaluation (TRECVID\u201917) . George Awad, Asad Butt, Jonathan Fiscus, Martial Michel, David Joy, Wessel Kraaij, et al. 2017. TRECVID 2017: Evaluating ad-hoc and instance video search, events detection, video captioning and hyperlinking. In Proceedings of the 17th AnnualTREC Video Retrieval Evaluation (TRECVID\u201917)."},{"key":"e_1_2_1_5_1","volume-title":"Big Data Analytics for Large-Scale Multimedia Search","author":"Barthel Kai Uwe","unstructured":"Kai Uwe Barthel and Nico Hezel . 2018. Visually exploring millions of images using image maps and graphs . In Big Data Analytics for Large-Scale Multimedia Search , B. Huet, S. Vrochidis, and E. Chang (Eds.). John Wiley 8 Sons, New Jersey, 251--275. Kai Uwe Barthel and Nico Hezel. 2018. Visually exploring millions of images using image maps and graphs. In Big Data Analytics for Large-Scale Multimedia Search, B. Huet, S. Vrochidis, and E. Chang (Eds.). John Wiley 8 Sons, New Jersey, 251--275."},{"key":"e_1_2_1_6_1","volume-title":"MultiMedia Modeling, X. He, S. Luo, D. Tao","author":"Barthel Kai Uwe","unstructured":"Kai Uwe Barthel , Nico Hezel , and Radek Mackowiak . 2015. Graph-based browsing for large video collections . In MultiMedia Modeling, X. He, S. Luo, D. Tao , C. Xu, J. Yang, and M. A. Hasan (Eds.). Springer International Publishing , Cham, Switzerland , 237--242. Kai Uwe Barthel, Nico Hezel, and Radek Mackowiak. 2015. Graph-based browsing for large video collections. In MultiMedia Modeling, X. He, S. Luo, D. Tao, C. Xu, J. Yang, and M. A. Hasan (Eds.). Springer International Publishing, Cham, Switzerland, 237--242."},{"key":"e_1_2_1_7_1","volume-title":"MultiMedia Modeling, Q. Tian, N. Sebe, G.-J. Qi","author":"Barthel Kai Uwe","unstructured":"Kai Uwe Barthel , Nico Hezel , and Radek Mackowiak . 2016. Navigating a graph of scenes for exploring large video collections . In MultiMedia Modeling, Q. Tian, N. Sebe, G.-J. Qi , B. Huet, R. Hong, and X. Liu (Eds.). Springer International Publishing , Cham, Switzerland , 418--423. Kai Uwe Barthel, Nico Hezel, and Radek Mackowiak. 2016. Navigating a graph of scenes for exploring large video collections. In MultiMedia Modeling, Q. Tian, N. Sebe, G.-J. Qi, B. Huet, R. Hong, and X. Liu (Eds.). Springer International Publishing, Cham, Switzerland, 418--423."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-016-3661-2"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13222-015-0209-y"},{"key":"e_1_2_1_11_1","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Identity mappings in deep residual networks. arXiv:1603.05027. http:\/\/arxiv.org\/abs\/1603.05027.  Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Identity mappings in deep residual networks. arXiv:1603.05027. http:\/\/arxiv.org\/abs\/1603.05027."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the International Conference on Machine Learning. 448--456","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy . 2015 . Batch normalization: Accelerating deep network training by reducing internal covariate shift . In Proceedings of the International Conference on Machine Learning. 448--456 . Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the International Conference on Machine Learning. 448--456."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/503112.503114"},{"key":"e_1_2_1_14_1","unstructured":"Justin Johnson Andrej Karpathy and Fei-Fei Li. 2015. DenseCap: Fully convolutional localization networks for dense captioning. arXiv:1511.07571. http:\/\/arxiv.org\/abs\/1511.07571.  Justin Johnson Andrej Karpathy and Fei-Fei Li. 2015. DenseCap: Fully convolutional localization networks for dense captioning. arXiv:1511.07571. http:\/\/arxiv.org\/abs\/1511.07571."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0925-2312(98)00030-7"},{"key":"e_1_2_1_16_1","volume-title":"Hinton","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E . Hinton . 2012 . Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems . 1097--1105. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. 1097--1105."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2017.9"},{"key":"e_1_2_1_18_1","volume-title":"MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho","author":"Leibetseder Andreas","unstructured":"Andreas Leibetseder , Sabrina Kletz , and Klaus Schoeffmann . 2018. Sketch-based similarity search for collaborative feature maps . In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho , et al. (Eds.). Springer International Publishing , Cham, Switzerland , 425--430. Andreas Leibetseder, Sabrina Kletz, and Klaus Schoeffmann. 2018. Sketch-based similarity search for collaborative feature maps. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing, Cham, Switzerland, 425--430."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1126004.1126005"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916)","author":"Liu N.","unstructured":"N. Liu and J. Han . 2016. DHSNet: Deep hierarchical saliency network for salient object detection . In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916) . 678--686. N. Liu and J. Han. 2016. DHSNet: Deep hierarchical saliency network for salient object detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916). 678--686."},{"key":"e_1_2_1_21_1","first-page":"2015","article-title":"On influential trends in interactive video retrieval","author":"Loko\u010d J.","year":"2018","unstructured":"J. Loko\u010d , W. Bailer , K. Schoeffmann , B. Muenzer , and G. Awad . 2018 . On influential trends in interactive video retrieval : Video Browser Showdown 2015 - 2017 . IEEE Transactions on Multimedia 20, 12, 3361--3376. J. Loko\u010d, W. Bailer, K. Schoeffmann, B. Muenzer, and G. Awad. 2018. On influential trends in interactive video retrieval: Video Browser Showdown 2015-2017. IEEE Transactions on Multimedia 20, 12, 3361--3376.","journal-title":"Video Browser Showdown"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 24th International Conference on Multimedia Modeling (MMM\u201918)","author":"Loko\u010d Jakub","year":"2018","unstructured":"Jakub Loko\u010d , Gregor Koval\u010d\u00edk , and Tom\u00e1\u0161 Sou\u010dek . 2018 . Revisiting SIRET video retrieval tool . In Proceedings of the 24th International Conference on Multimedia Modeling (MMM\u201918) , Part II. 419--424. Jakub Loko\u010d, Gregor Koval\u010d\u00edk, and Tom\u00e1\u0161 Sou\u010dek. 2018. Revisiting SIRET video retrieval tool. In Proceedings of the 24th International Conference on Multimedia Modeling (MMM\u201918), Part II. 419--424."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3210539.3210543"},{"key":"e_1_2_1_24_1","volume-title":"Hao Zhang, and Chong-Wah Ngo.","author":"Lu Yi-Jie","year":"2017","unstructured":"Yi-Jie Lu , Phuong Anh Nguyen , Hao Zhang, and Chong-Wah Ngo. 2017 . Concept-based interactive search system. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N.E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing , Cham, Switzerland, 463--468. Yi-Jie Lu, Phuong Anh Nguyen, Hao Zhang, and Chong-Wah Ngo. 2017. Concept-based interactive search system. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N.E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing, Cham, Switzerland, 463--468."},{"key":"e_1_2_1_25_1","volume-title":"Lucene in Action","author":"McCandless Michael","unstructured":"Michael McCandless , Erik Hatcher , and Otis Gospodnetic . 2010. Lucene in Action , Second Edition : Covers Apache Lucene 3.0. Manning Publications, Greenwich, CT. Michael McCandless, Erik Hatcher, and Otis Gospodnetic. 2010. Lucene in Action, Second Edition: Covers Apache Lucene 3.0. Manning Publications, Greenwich, CT."},{"key":"e_1_2_1_26_1","volume-title":"Enhanced VIREO KIS at VBS","author":"Nguyen Phuong Anh","year":"2018","unstructured":"Phuong Anh Nguyen , Yi-Jie Lu , Hao Zhang , and Chong-Wah Ngo . 2018. Enhanced VIREO KIS at VBS 2018 . In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing , Cham, Switzerland, 407--412. Phuong Anh Nguyen, Yi-Jie Lu, Hao Zhang, and Chong-Wah Ngo. 2018. Enhanced VIREO KIS at VBS 2018. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing, Cham, Switzerland, 407--412."},{"key":"e_1_2_1_27_1","volume-title":"The ITEC collaborative video search system at the Video Browser Showdown","author":"Primus Manfred J\u00fcrgen","year":"2018","unstructured":"Manfred J\u00fcrgen Primus , Bernd M\u00fcnzer , Andreas Leibetseder , and Klaus Schoeffmann . 2018. The ITEC collaborative video search system at the Video Browser Showdown 2018 . In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing , Cham, Switzerland, 438--443. Manfred J\u00fcrgen Primus, Bernd M\u00fcnzer, Andreas Leibetseder, and Klaus Schoeffmann. 2018. The ITEC collaborative video search system at the Video Browser Showdown 2018. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing, Cham, Switzerland, 438--443."},{"key":"e_1_2_1_28_1","unstructured":"Marek Rogozinski Rafal Kuc. 2013. Mastering ElasticSearch. Packt Publishing.   Marek Rogozinski Rafal Kuc. 2013. Mastering ElasticSearch. Packt Publishing."},{"key":"e_1_2_1_29_1","unstructured":"Joseph Redmon and Ali Farhadi. 2016. YOLO9000: Better faster stronger. arXiv:1612.08242. http:\/\/arxiv.org\/abs\/1612.08242  Joseph Redmon and Ali Farhadi. 2016. YOLO9000: Better faster stronger. arXiv:1612.08242. http:\/\/arxiv.org\/abs\/1612.08242"},{"key":"e_1_2_1_30_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Neural Information Processing Systems (NIPS).   Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Neural Information Processing Systems (NIPS)."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-73600-6_41"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISM.2014.38"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2973797"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-73600-6_46"},{"key":"e_1_2_1_35_1","first-page":"2012","article-title":"A user-centric media retrieval competition","author":"Schoeffmann Klaus","year":"2014","unstructured":"Klaus Schoeffmann . 2014 . A user-centric media retrieval competition : The Video Browser Showdown 2012 - 2014 . IEEE MultiMedia 21, 4, 8--13. Klaus Schoeffmann. 2014. A user-centric media retrieval competition: The Video Browser Showdown 2012-2014. IEEE MultiMedia 21, 4, 8--13.","journal-title":"The Video Browser Showdown"},{"key":"e_1_2_1_36_1","first-page":"1","article-title":"Video browsing interfaces and applications: A review","volume":"1","author":"Schoeffmann Klaus","year":"2010","unstructured":"Klaus Schoeffmann , Frank Hopfgartner , Oge Marques , Laszlo Boeszoermenyi , and Joemon M. Jose . 2010 . Video browsing interfaces and applications: A review . SPIE Reviews 1 , 1 , 018004. Klaus Schoeffmann, Frank Hopfgartner, Oge Marques, Laszlo Boeszoermenyi, and Joemon M. Jose. 2010. Video browsing interfaces and applications: A review. SPIE Reviews 1, 1, 018004.","journal-title":"SPIE Reviews"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2808796"},{"key":"e_1_2_1_38_1","volume-title":"Bernd Muenzer, Stefan Petscharnig, Christof Karisch, Qing Xu, et al.","author":"Schoeffmann Klaus","year":"2017","unstructured":"Klaus Schoeffmann , Manfred J\u00fcrgen Primus , Bernd Muenzer, Stefan Petscharnig, Christof Karisch, Qing Xu, et al. 2017 . Collaborative Feature Maps for Interactive Video Search. Springer International Publishing , Cham, Switzerland, 457--462. Klaus Schoeffmann, Manfred J\u00fcrgen Primus, Bernd Muenzer, Stefan Petscharnig, Christof Karisch, Qing Xu, et al. 2017. Collaborative Feature Maps for Interactive Video Search. Springer International Publishing, Cham, Switzerland, 457--462."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2007.911830"},{"key":"e_1_2_1_40_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arxiv:1409.1556. http:\/\/arxiv.org\/abs\/1409.1556.  Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arxiv:1409.1556. http:\/\/arxiv.org\/abs\/1409.1556."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/1304596.1304846"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2005.850966"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_1_44_1","volume-title":"Thanh Duc Ngo, et al","author":"Truong Thanh-Dat","year":"2018","unstructured":"Thanh-Dat Truong , Vinh-Tiep Nguyen , Minh-Triet Tran , Trang-Vinh Trieu , Tien Do , Thanh Duc Ngo, et al . 2018 . Video search based on semantic extraction and locally regional object proposal. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing , Cham, Switzerland, 451--456. Thanh-Dat Truong, Vinh-Tiep Nguyen, Minh-Triet Tran, Trang-Vinh Trieu, Tien Do, Thanh Duc Ngo, et al. 2018. Video search based on semantic extraction and locally regional object proposal. In MultiMedia Modeling, K. Schoeffmann, T. H. Chalidabhongse, C. W. Ngo, S. Aramvith, N. E. O\u2019Connor, Y.-S. Ho, et al. (Eds.). Springer International Publishing, Cham, Switzerland, 451--456."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2587640"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2012.53"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2011.2174782"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2723009"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.5555\/2968826.2968881"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3295663","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3295663","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:12:50Z","timestamp":1750201970000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3295663"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,13]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,2,28]]}},"alternative-id":["10.1145\/3295663"],"URL":"https:\/\/doi.org\/10.1145\/3295663","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,2,13]]},"assertion":[{"value":"2018-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-11-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-02-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}