{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T19:03:23Z","timestamp":1777921403509,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":21,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,10,17]],"date-time":"2021-10-17T00:00:00Z","timestamp":1634428800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,10,17]]},"DOI":"10.1145\/3474085.3479206","type":"proceedings-article","created":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T10:23:20Z","timestamp":1634552600000},"page":"4730-4734","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Better Learning Shot Boundary Detection via Multi-task"],"prefix":"10.1145","author":[{"given":"Haoxin","family":"Zhang","sequence":"first","affiliation":[{"name":"Tencent Data Platform, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhimin","family":"Li","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wuhan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qinglin","family":"Lu","sequence":"additional","affiliation":[{"name":"Tencent Data Platform, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,10,17]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"George Awad Asad Butt Jonathan Fiscus David Joy Andrew Delgado Willie Mcclinton Martial Michel Alan Smeaton Yvette Graham Wessel Kraaij etal 2017. Trecvid 2017: evaluating ad-hoc and instance video search events detection video captioning and hyperlinking. In TREC Video Retrieval Evaluation (TRECVID).  George Awad Asad Butt Jonathan Fiscus David Joy Andrew Delgado Willie Mcclinton Martial Michel Alan Smeaton Yvette Graham Wessel Kraaij et al. 2017. Trecvid 2017: evaluating ad-hoc and instance video search events detection video captioning and hyperlinking. In TREC Video Retrieval Evaluation (TRECVID)."},{"key":"e_1_3_2_2_2_1","volume-title":"Shot Contrastive Self-Supervised Learning for Scene Boundary Detection. CoRR","author":"Chen Shixing","year":"2021","unstructured":"Shixing Chen , Xiaohan Nie , David Fan , Dongqing Zhang , Vimal Bhat , and Raffay Hamid . 2021. Shot Contrastive Self-Supervised Learning for Scene Boundary Detection. CoRR , Vol. abs\/ 2104 .13537 ( 2021 ). Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, and Raffay Hamid. 2021. Shot Contrastive Self-Supervised Learning for Scene Boundary Detection. CoRR, Vol. abs\/2104.13537 (2021)."},{"key":"e_1_3_2_2_3_1","volume-title":"Supervised Video Summarization Via Multiple Feature Sets with Parallel Attention. In 2021 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6s.","author":"Ghauri Junaid Ahmed","year":"2021","unstructured":"Junaid Ahmed Ghauri , Sherzod Hakimov , and Ralph Ewerth . 2021 . Supervised Video Summarization Via Multiple Feature Sets with Parallel Attention. In 2021 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6s. Junaid Ahmed Ghauri, Sherzod Hakimov, and Ralph Ewerth. 2021. Supervised Video Summarization Via Multiple Feature Sets with Parallel Attention. In 2021 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 1--6s."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/APEIE.2014.7040826"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CBMI.2018.8516556"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298928"},{"key":"e_1_3_2_2_7_1","volume-title":"fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. arXiv preprint arXiv:1705.03281","author":"Hassanien Ahmed","year":"2017","unstructured":"Ahmed Hassanien , Mohamed Elgharib , Ahmed Selim , Sung-Ho Bae , Mohamed Hefeeda , and Wojciech Matusik . 2017. Large-scale , fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. arXiv preprint arXiv:1705.03281 ( 2017 ). Ahmed Hassanien, Mohamed Elgharib, Ahmed Selim, Sung-Ho Bae, Mohamed Hefeeda, and Wojciech Matusik. 2017. Large-scale, fast and accurate shot boundary detection through spatio-temporal convolutional neural networks. arXiv preprint arXiv:1705.03281 (2017)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-30754-7_14"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCC.2011.2109710"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMIP.2016.24"},{"key":"e_1_3_2_2_11_1","volume-title":"Focal Loss for Dense Object Detection. CoRR","author":"Lin Tsung-Yi","year":"2002","unstructured":"Tsung-Yi Lin , Priya Goyal , Ross B. Girshick , Kaiming He , and Piotr Doll\u00e1 r. 2017. Focal Loss for Dense Object Detection. CoRR , Vol. abs\/ 1708 .0 2002 (2017). arxiv: 1708.02002 http:\/\/arxiv.org\/abs\/1708.02002 Tsung-Yi Lin, Priya Goyal, Ross B. Girshick, Kaiming He, and Piotr Doll\u00e1 r. 2017. Focal Loss for Dense Object Detection. CoRR, Vol. abs\/1708.02002 (2017). arxiv: 1708.02002 http:\/\/arxiv.org\/abs\/1708.02002"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01016"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCT.2015.44"},{"key":"e_1_3_2_2_14_1","volume-title":"Dalton Meitei Thounaojam, and Saptarshi Chakraborty","author":"Singh Alok","year":"2019","unstructured":"Alok Singh , Dalton Meitei Thounaojam, and Saptarshi Chakraborty . 2019 . A novel automatic shot boundary detection algorithm: robust to illumination and motion effect. Signal, Image and Video Processing ( 2019), 1--9. Alok Singh, Dalton Meitei Thounaojam, and Saptarshi Chakraborty. 2019. A novel automatic shot boundary detection algorithm: robust to illumination and motion effect. Signal, Image and Video Processing (2019), 1--9."},{"key":"e_1_3_2_2_15_1","volume-title":"TransNet V2: An effective deep network architecture for fast shot transition detection. CoRR","author":"Soucek Tom\u00e1s","year":"2020","unstructured":"Tom\u00e1s Soucek and Jakub Lokoc . 2020. TransNet V2: An effective deep network architecture for fast shot transition detection. CoRR , Vol. abs\/ 2008 .04838 ( 2020 ). Tom\u00e1s Soucek and Jakub Lokoc. 2020. TransNet V2: An effective deep network architecture for fast shot transition detection. CoRR, Vol. abs\/2008.04838 (2020)."},{"key":"e_1_3_2_2_16_1","volume-title":"TransNet: A deep network for fast detection of common shot transitions. CoRR","author":"Soucek Tom\u00e1s","year":"2019","unstructured":"Tom\u00e1s Soucek , Jaroslav Moravec , and Jakub Lokoc . 2019. TransNet: A deep network for fast detection of common shot transitions. CoRR , Vol. abs\/ 1906 .03363 ( 2019 ). Tom\u00e1s Soucek, Jaroslav Moravec, and Jakub Lokoc. 2019. TransNet: A deep network for fast detection of common shot transitions. CoRR, Vol. abs\/1906.03363 (2019)."},{"key":"e_1_3_2_2_17_1","volume-title":"Asian Conference on Computer Vision. Springer, 577--592","author":"Tang Shitao","year":"2018","unstructured":"Shitao Tang , Litong Feng , Zhanghui Kuang , Yimin Chen , and Wei Zhang . 2018 . Fast video shot transition localization with deep structured models . In Asian Conference on Computer Vision. Springer, 577--592 . Shitao Tang, Litong Feng, Zhanghui Kuang, Yimin Chen, and Wei Zhang. 2018. Fast video shot transition localization with deep structured models. In Asian Conference on Computer Vision. Springer, 577--592."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2717998"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3350992"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_19"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2006.888023"}],"event":{"name":"MM '21: ACM Multimedia Conference","location":"Virtual Event China","acronym":"MM '21","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 29th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3479206","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3474085.3479206","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:48Z","timestamp":1750193328000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3474085.3479206"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,17]]},"references-count":21,"alternative-id":["10.1145\/3474085.3479206","10.1145\/3474085"],"URL":"https:\/\/doi.org\/10.1145\/3474085.3479206","relation":{},"subject":[],"published":{"date-parts":[[2021,10,17]]},"assertion":[{"value":"2021-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}