{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:24:43Z","timestamp":1768901083561,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":50,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,8]],"date-time":"2020-06-08T00:00:00Z","timestamp":1591574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Marie Sk?odowska-Curie Grant","award":["765140"],"award-info":[{"award-number":["765140"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,8]]},"DOI":"10.1145\/3372278.3390695","type":"proceedings-article","created":{"date-parts":[[2020,6,2]],"date-time":"2020-06-02T04:35:27Z","timestamp":1591072527000},"page":"242-250","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":45,"title":["Query-controllable Video Summarization"],"prefix":"10.1145","author":[{"given":"Jia-Hong","family":"Huang","sequence":"first","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]},{"given":"Marcel","family":"Worring","sequence":"additional","affiliation":[{"name":"University of Amsterdam, Amsterdam, Netherlands"}]}],"member":"320","published-online":{"date-parts":[[2020,6,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.279"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.285"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298981"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2010.08.004"},{"key":"e_1_3_2_1_5_1","volume-title":"Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach.","author":"Fukui Akira","year":"2016","unstructured":"Akira Fukui , Dong Huk Park , Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach. 2016 . Multimodal compact bilinear pooling for visual question answering and visual grounding. arXiv preprint arXiv:1606.01847 (2016). Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach. 2016. Multimodal compact bilinear pooling for visual question answering and visual grounding. arXiv preprint arXiv:1606.01847 (2016)."},{"key":"e_1_3_2_1_6_1","unstructured":"Boqing Gong Wei-Lun Chao Kristen Grauman and Fei Sha. 2014. Diverse sequential subset selection for supervised video summarization. In Advances in Neural Information Processing Systems. 2069--2077.  Boqing Gong Wei-Lun Chao Kristen Grauman and Fei Sha. 2014. Diverse sequential subset selection for supervised video summarization. In Advances in Neural Information Processing Systems. 2069--2077."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10584-0_33"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298928"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00517"},{"key":"e_1_3_2_1_11_1","volume-title":"Robustness Analysis of Visual Question Answering Models by Basic Questions","author":"Huang Jia-Hong","year":"2017","unstructured":"Jia-Hong Huang . 2017. Robustness Analysis of Visual Question Answering Models by Basic Questions . King Abdullah University of Science and Technology MS thesis ( 2017 ). Jia-Hong Huang. 2017. Robustness Analysis of Visual Question Answering Models by Basic Questions. King Abdullah University of Science and Technology MS thesis (2017)."},{"key":"e_1_3_2_1_12_1","volume-title":"CVPR VQA Challenge Workshop","author":"Huang Jia-Hong","year":"2017","unstructured":"Jia-Hong Huang , Modar Alfadly , and Bernard Ghanem . 2017 . Vqabq: Visual question answering by basic questions . CVPR VQA Challenge Workshop (2017). Jia-Hong Huang, Modar Alfadly, and Bernard Ghanem. 2017. Vqabq: Visual question answering by basic questions. CVPR VQA Challenge Workshop (2017)."},{"key":"e_1_3_2_1_13_1","volume-title":"2019 a. Assessing the Robustness of Visual Question Answering. arXiv preprint arXiv:1912.01452","author":"Huang Jia-Hong","year":"2019","unstructured":"Jia-Hong Huang , Modar Alfadly , Bernard Ghanem , and Marcel Worring . 2019 a. Assessing the Robustness of Visual Question Answering. arXiv preprint arXiv:1912.01452 ( 2019 ). Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, and Marcel Worring. 2019 a. Assessing the Robustness of Visual Question Answering. arXiv preprint arXiv:1912.01452 (2019)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33018449"},{"key":"e_1_3_2_1_15_1","volume-title":"CVPR VQA Challenge and Visual Dialog Workshop","author":"Huang Jia-Hong","year":"2018","unstructured":"Jia-Hong Huang , Cuong Duc Dao , Modar Alfadly , C Huck Yang , and Bernard Ghanem . 2018 . Robustness analysis of visual qa models by basic questions . CVPR VQA Challenge and Visual Dialog Workshop (2018). Jia-Hong Huang, Cuong Duc Dao, Modar Alfadly, C Huck Yang, and Bernard Ghanem. 2018. Robustness analysis of visual qa models by basic questions. CVPR VQA Challenge and Visual Dialog Workshop (2018)."},{"key":"e_1_3_2_1_16_1","volume-title":"Dataset and Evaluation. MediaEval","volume":"1263","author":"Ionescu Bogdan","year":"2014","unstructured":"Bogdan Ionescu , Alexandru-Lucian Ginsca , Bogdan Boteanu , Adrian Popescu , Mihai Lupu , and Henning M\u00fcller . 2014 . Retrieving Diverse Social Images at MediaEval 2014: Challenge , Dataset and Evaluation. MediaEval , Vol. 1263 (2014). Bogdan Ionescu, Alexandru-Lucian Ginsca, Bogdan Boteanu, Adrian Popescu, Mihai Lupu, and Henning M\u00fcller. 2014. Retrieving Diverse Social Images at MediaEval 2014: Challenge, Dataset and Evaluation. MediaEval, Vol. 1263 (2014)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.284"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.348"},{"key":"e_1_3_2_1_19_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_20_1","volume-title":"2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1346--1353","author":"Lee Yong Jae","year":"2012","unstructured":"Yong Jae Lee , Joydeep Ghosh , and Kristen Grauman . 2012 . Discovering important people and objects for egocentric video summarization . In 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1346--1353 . Yong Jae Lee, Joydeep Ghosh, and Kristen Grauman. 2012. Discovering important people and objects for egocentric video summarization. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1346--1353."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01237-3_10"},{"key":"e_1_3_2_1_22_1","volume-title":"Optimization algorithms for the selection of key frame sequences of variable length","author":"Liu Tiecheng","unstructured":"Tiecheng Liu and John R Kender . 2002. Optimization algorithms for the selection of key frame sequences of variable length . In ECCV. Springer , 403--417. Tiecheng Liu and John R Kender. 2002. Optimization algorithms for the selection of key frame sequences of variable length. In ECCV. Springer, 403--417."},{"key":"e_1_3_2_1_23_1","volume-title":"Jesper Tegner, and Yi-Chang James Tsai.","author":"Liu Yi-Chieh","year":"2019","unstructured":"Yi-Chieh Liu , Yung-An Hsieh , Chao-Han Huck Chen , Jesper Tegner, and Yi-Chang James Tsai. 2019 . Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding . arXiv preprint arXiv:1911.02172 (2019). Yi-Chieh Liu, Yung-An Hsieh, Chao-Han Huck Chen, Jesper Tegner, and Yi-Chang James Tsai. 2019. Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding. arXiv preprint arXiv:1911.02172 (2019)."},{"key":"e_1_3_2_1_24_1","volume-title":"Asian Conference on Computer Vision. Springer, 235--250","author":"Liu Yi-Chieh","year":"2018","unstructured":"Yi-Chieh Liu , Hao-Hsiang Yang , C-H Huck Yang , Jia-Hong Huang , Meng Tian , Hiromasa Morikawa , Yi-Chang James Tsai , and Jesper Tegner . 2018 . Synthesizing New Retinal Symptom Images by Multiple Generative Models . In Asian Conference on Computer Vision. Springer, 235--250 . Yi-Chieh Liu, Hao-Hsiang Yang, C-H Huck Yang, Jia-Hong Huang, Meng Tian, Hiromasa Morikawa, Yi-Chang James Tsai, and Jesper Tegner. 2018. Synthesizing New Retinal Symptom Images by Multiple Generative Models. In Asian Conference on Computer Vision. Springer, 235--250."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.350"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/641007.641116"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.318"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/946247.946650"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1463563.1463564"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.455"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.118"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10599-4_35"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00809"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01258-8_22"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01219-9_32"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.229"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.599"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1178677.1178722"},{"key":"e_1_3_2_1_39_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 5179--5187","author":"Song Yale","year":"2015","unstructured":"Yale Song , Jordi Vallmitjana , Amanda Stent , and Alejandro Jaimes . 2015 . Tvsum: Summarizing web videos using titles . In Proceedings of the IEEE conference on computer vision and pattern recognition. 5179--5187 . Yale Song, Jordi Vallmitjana, Amanda Stent, and Alejandro Jaimes. 2015. Tvsum: Summarizing web videos using titles. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5179--5187."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123297"},{"key":"e_1_3_2_1_41_1","volume-title":"Joint ICML and IJCAI Workshop on Computational Biology","author":"Huck Yang C-H","year":"2018","unstructured":"C-H Huck Yang , Jia-Hong Huang , Fangyu Liu , Fang-Yi Chiu , Mengya Gao , Weifeng Lyu , Jesper Tegner , 2018 a. A novel hybrid machine learning model for auto-classification of retinal diseases . Joint ICML and IJCAI Workshop on Computational Biology (2018). C-H Huck Yang, Jia-Hong Huang, Fangyu Liu, Fang-Yi Chiu, Mengya Gao, Weifeng Lyu, Jesper Tegner, et al. 2018a. A novel hybrid machine learning model for auto-classification of retinal diseases. Joint ICML and IJCAI Workshop on Computational Biology (2018)."},{"key":"e_1_3_2_1_42_1","volume-title":"Asian Conference on Computer Vision. Springer, 323--338","author":"Huck Yang C-H","year":"2018","unstructured":"C-H Huck Yang , Fangyu Liu , Jia-Hong Huang , Meng Tian , MD I- Hung Lin , Yi Chieh Liu , Hiromasa Morikawa , Hao-Hsiang Yang , and Jesper Tegner . 2018 b. Auto-classification of retinal diseases in the limit of sparse data using a two-streams machine learning model . In Asian Conference on Computer Vision. Springer, 323--338 . C-H Huck Yang, Fangyu Liu, Jia-Hong Huang, Meng Tian, MD I-Hung Lin, Yi Chieh Liu, Hiromasa Morikawa, Hao-Hsiang Yang, and Jesper Tegner. 2018b. Auto-classification of retinal diseases in the limit of sparse data using a two-streams machine learning model. In Asian Conference on Computer Vision. Springer, 323--338."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2019.8803554"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.120"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46478-7_47"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01237-3_24"},{"key":"e_1_3_2_1_47_1","volume-title":"Query-conditioned three-player adversarial network for video summarization. arXiv preprint arXiv:1807.06677","author":"Zhang Yujia","year":"2018","unstructured":"Yujia Zhang , Michael Kampffmeyer , Xiaodan Liang , Min Tan , and Eric P Xing . 2018b. Query-conditioned three-player adversarial network for video summarization. arXiv preprint arXiv:1807.06677 ( 2018 ). Yujia Zhang, Michael Kampffmeyer, Xiaodan Liang, Min Tan, and Eric P Xing. 2018b. Query-conditioned three-player adversarial network for video summarization. arXiv preprint arXiv:1807.06677 (2018)."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3321408.3322622"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.322"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.12255"}],"event":{"name":"ICMR '20: International Conference on Multimedia Retrieval","location":"Dublin Ireland","acronym":"ICMR '20","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2020 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390695","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390695","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:10Z","timestamp":1750195930000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390695"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,8]]},"references-count":50,"alternative-id":["10.1145\/3372278.3390695","10.1145\/3372278"],"URL":"https:\/\/doi.org\/10.1145\/3372278.3390695","relation":{},"subject":[],"published":{"date-parts":[[2020,6,8]]},"assertion":[{"value":"2020-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}