{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:08:52Z","timestamp":1750219732479,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":53,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,29]],"date-time":"2023-10-29T00:00:00Z","timestamp":1698537600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Research Foundation, Singapore","award":["Strategic Capability Research Centres Funding Initiative"],"award-info":[{"award-number":["Strategic Capability Research Centres Funding Initiative"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,29]]},"DOI":"10.1145\/3607540.3617142","type":"proceedings-article","created":{"date-parts":[[2023,10,30]],"date-time":"2023-10-30T01:00:50Z","timestamp":1698627650000},"page":"31-40","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Narrative Graph for Narrative Generation from Long Videos"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0964-1464","authenticated-orcid":false,"given":"Rishabh","family":"Sheoran","sequence":"first","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1239-4428","authenticated-orcid":false,"given":"Yongkang","family":"Wong","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4303-9020","authenticated-orcid":false,"given":"Jianquan","family":"Liu","sequence":"additional","affiliation":[{"name":"NEC Corporation, Tokyo, 
Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4846-2015","authenticated-orcid":false,"given":"Mohan","family":"Kankanhalli","sequence":"additional","affiliation":[{"name":"National University of Singapore, Singapore, Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,10,29]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics -","volume":"1","author":"Baker Collin F.","unstructured":"Collin F. Baker , Charles J. Fillmore , and John B. Lowe . 1998. The Berkeley FrameNet Project . In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1 (ACL '98\/COLING '98). Association for Computational Linguistics, 86--90. Collin F. Baker, Charles J. Fillmore, and John B. Lowe. 1998. The Berkeley FrameNet Project. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1 (ACL '98\/COLING '98). Association for Computational Linguistics, 86--90."},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, LAW-ID@ACL 2013","author":"Banarescu Laura","year":"2013","unstructured":"Laura Banarescu , Claire Bonial , Shu Cai , Madalina Georgescu , Kira Griffitt , Ulf Hermjakob , Kevin Knight , Philipp Koehn , Martha Palmer , and Nathan Schneider . 2013 . Abstract Meaning Representation for Sembanking . In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, LAW-ID@ACL 2013 , August 8 --9 , 2013, Sofia, Bulgaria. The Association for Computer Linguistics, 178--186. 
Laura Banarescu, Claire Bonial, Shu Cai, Madalina Georgescu, Kira Griffitt, Ulf Hermjakob, Kevin Knight, Philipp Koehn, Martha Palmer, and Nathan Schneider. 2013. Abstract Meaning Representation for Sembanking. In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, LAW-ID@ACL 2013, August 8--9, 2013, Sofia, Bulgaria. The Association for Computer Linguistics, 178--186."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization. Association for Computational Linguistics","author":"Banerjee Satanjeev","year":"2005","unstructured":"Satanjeev Banerjee and Alon Lavie . 2005 . METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments . In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization. Association for Computational Linguistics , Ann Arbor, Michigan, 65--72. Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization. Association for Computational Linguistics, Ann Arbor, Michigan, 65--72."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2022\/561"},{"key":"e_1_3_2_1_5_1","volume-title":"Yuanzhi Li, Scott Lundberg, et al.","author":"Bubeck S\u00e9bastien","year":"2023","unstructured":"S\u00e9bastien Bubeck , Varun Chandrasekaran , Ronen Eldan , Johannes Gehrke , Eric Horvitz , Ece Kamar , Peter Lee , Yin Tat Lee , Yuanzhi Li, Scott Lundberg, et al. 2023 . Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712 (2023). 
S\u00e9bastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, et al. 2023. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712 (2023)."},{"key":"e_1_3_2_1_6_1","first-page":"17","article-title":"Knowledge Graphs: Introduction, History and","volume":"43","author":"Chaudhri Vinay","year":"2022","unstructured":"Vinay Chaudhri , Chaitanya Baru , Naren Chittar , Xin Dong , Michael Genesereth , James Hendler , Aditya Kalyanpur , Douglas Lenat , Juan Sequeda , Denny Vrande\u010di\u0107, and Kuansan Wang . 2022 . Knowledge Graphs: Introduction, History and , Perspectives. , Vol. 43 (2022), 17 -- 29 . Vinay Chaudhri, Chaitanya Baru, Naren Chittar, Xin Dong, Michael Genesereth, James Hendler, Aditya Kalyanpur, Douglas Lenat, Juan Sequeda, Denny Vrande\u010di\u0107, and Kuansan Wang. 2022. Knowledge Graphs: Introduction, History and, Perspectives. , Vol. 43 (2022), 17--29.","journal-title":"Perspectives."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2019.00220"},{"volume-title":"The distinction of fiction","author":"Cohn Dorrit","key":"e_1_3_2_1_8_1","unstructured":"Dorrit Cohn . 2000. The distinction of fiction . JHU Press . Dorrit Cohn. 2000. The distinction of fiction. JHU Press."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11168-006-6327-9"},{"key":"e_1_3_2_1_10_1","unstructured":"Harper Eric Majumdar Somshubra Kuchaiev Oleksii Jason Li Zhang Yang Bakhturina Evelina Noroozi Vahid Subramanian Sandeep Nithin Koluguri Jocelyn Huang Jia Fei Balam Jagadeesh Yang Xuesong Livne Micha Dong Yi Naren Sean and Ginsburg Boris. 2022. NeMo: a toolkit for Conversational AI and Large Language Models. 
https:\/\/nvidia.github.io\/NeMo\/  Harper Eric Majumdar Somshubra Kuchaiev Oleksii Jason Li Zhang Yang Bakhturina Evelina Noroozi Vahid Subramanian Sandeep Nithin Koluguri Jocelyn Huang Jia Fei Balam Jagadeesh Yang Xuesong Livne Micha Dong Yi Naren Sean and Ginsburg Boris. 2022. NeMo: a toolkit for Conversational AI and Large Language Models. https:\/\/nvidia.github.io\/NeMo\/"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1093\/ijl\/16.3.235"},{"key":"e_1_3_2_1_12_1","volume-title":"Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022","author":"Han Mingfei","year":"2022","unstructured":"Mingfei Han , David Junhao Zhang , Yali Wang , Rui Yan , Lina Yao , Xiaojun Chang , and Yu Qiao . 2022 . Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 , New Orleans, LA, USA, June 18--24 , 2022. IEEE, 2980--2989. Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, and Yu Qiao. 2022. Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18--24, 2022. IEEE, 2980--2989."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1044"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298698"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00300"},{"key":"e_1_3_2_1_16_1","volume-title":"Martin","author":"Jurafsky Daniel","year":"2023","unstructured":"Daniel Jurafsky and James H . Martin . 2023 . Speech and Language Processing (3rd Edition Draft) . https:\/\/web.stanford.edu\/ jurafsky\/slp3\/ed3book_jan72023.pdf Daniel Jurafsky and James H. Martin. 2023. Speech and Language Processing (3rd Edition Draft). 
https:\/\/web.stanford.edu\/ jurafsky\/slp3\/ed3book_jan72023.pdf"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2018.2837153"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Pavan Kapanipathi Ibrahim Abdelaziz Srinivas Ravishankar Salim Roukos Alexander Gray Ramon Astudillo Maria Chang Cristina Cornelio Saswati Dana Achille Fokoue etal 2020. Leveraging abstract meaning representation for knowledge base question answering. arXiv preprint arXiv:2012.01707 (2020).  Pavan Kapanipathi Ibrahim Abdelaziz Srinivas Ravishankar Salim Roukos Alexander Gray Ramon Astudillo Maria Chang Cristina Cornelio Saswati Dana Achille Fokoue et al. 2020. Leveraging abstract meaning representation for knowledge base question answering. arXiv preprint arXiv:2012.01707 (2020).","DOI":"10.18653\/v1\/2021.findings-acl.339"},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the Asian Conference on Computer Vision (ACCV).","author":"Kim Insoo","year":"2020","unstructured":"Insoo Kim , Seungju Han , Seong-Jin Park , Ji-Won Baek , Jinwoo Shin , Jae-Joon Han , and Changkyu Choi . 2020 . DiscFace: Minimum Discrepancy Learning for Deep Face Recognition . In Proceedings of the Asian Conference on Computer Vision (ACCV). Insoo Kim, Seungju Han, Seong-Jin Park, Ji-Won Baek, Jinwoo Shin, Jae-Joon Han, and Changkyu Choi. 2020. DiscFace: Minimum Discrepancy Learning for Deep Face Recognition. In Proceedings of the Asian Conference on Computer Vision (ACCV)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01819"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of Treebanks and lexical Theories","volume":"3","author":"Kingsbury Paul","year":"2003","unstructured":"Paul Kingsbury and Martha Palmer . 2003 . Propbank: the next level of treebank . In Proceedings of Treebanks and lexical Theories , Vol. 3 . Citeseer. Paul Kingsbury and Martha Palmer. 2003. Propbank: the next level of treebank. 
In Proceedings of Treebanks and lexical Theories, Vol. 3. Citeseer."},{"key":"e_1_3_2_1_22_1","volume-title":"Narrative theory","author":"Garcia Landa Jose Angel","year":"2005","unstructured":"Jose Angel Garcia Landa . 2005. Narrative theory . University of Zaragoza . On Line Edition ( 2005 ). Jose Angel Garcia Landa. 2005. Narrative theory. University of Zaragoza. On Line Edition (2005)."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.dlg4nlp-1.2"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01353"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00527"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.167"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1159"},{"key":"e_1_3_2_1_29_1","unstructured":"Gardner Matt Grus Joel Neumann Mark Tafjord Oyvind Dasigi Pradeep Liu Nelson Peters Matthew Schmitz Michael and Zettlemoyer Luke. [n. d.]. AllenNLP: A Deep Semantic Natural Language Processing Platform. https:\/\/github.com\/allenai\/allennlp  Gardner Matt Grus Joel Neumann Mark Tafjord Oyvind Dasigi Pradeep Liu Nelson Peters Matthew Schmitz Michael and Zettlemoyer Luke. [n. d.]. AllenNLP: A Deep Semantic Natural Language Processing Platform. https:\/\/github.com\/allenai\/allennlp"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.148"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Tim Meinhardt Alexander Kirillov Laura Leal-Taix\u00e9 and Christoph Feichtenhofer. 2022. TrackFormer: Multi-Object Tracking with Transformers. In CVPR. 8844--8854.  Tim Meinhardt Alexander Kirillov Laura Leal-Taix\u00e9 and Christoph Feichtenhofer. 2022. TrackFormer: Multi-Object Tracking with Transformers. In CVPR. 
8844--8854.","DOI":"10.1109\/CVPR52688.2022.00864"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-6319"},{"key":"e_1_3_2_1_33_1","article-title":"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel , Noam Shazeer , Adam Roberts , Katherine Lee , Sharan Narang , Michael Matena , Yanqi Zhou , Wei Li , and Peter J. Liu . 2020 . Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer . J. Mach. Learn. Res. , Vol. 21 (2020), 140:1--140:67. Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. J. Mach. Learn. Res. , Vol. 21 (2020), 140:1--140:67.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/1870658.1870706"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.nlp4convai-1.20"},{"key":"e_1_3_2_1_37_1","first-page":"168","article-title":"Recent concepts of narrative and the narratives of narrative theory","volume":"34","author":"Richardson Brian","year":"2000","unstructured":"Brian Richardson . 2000 . Recent concepts of narrative and the narratives of narrative theory . Style , Vol. 34 , 2 (2000), 168 -- 175 . Brian Richardson. 2000. Recent concepts of narrative and the narratives of narrative theory. Style, Vol. 34, 2 (2000), 168--175.","journal-title":"Style"},{"key":"e_1_3_2_1_38_1","unstructured":"Josef Ruppenhofer Michael Ellsworth Miriam R. L. Petruck Christopher R. Johnson and Jan Scheffczyk. 2006. FrameNet II: Extended theory and practice.  Josef Ruppenhofer Michael Ellsworth Miriam R. L. Petruck Christopher R. Johnson and Jan Scheffczyk. 2006. 
FrameNet II: Extended theory and practice."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Xindi Shang Zehuan Yuan Anran Wang and Changhu Wang. 2021. Multimodal Video Summarization via Time-Aware Transformers. In ACM Multimedia. 1756--1765.  Xindi Shang Zehuan Yuan Anran Wang and Changhu Wang. 2021. Multimodal Video Summarization via Time-Aware Transformers. In ACM Multimedia. 1756--1765.","DOI":"10.1145\/3474085.3475321"},{"key":"e_1_3_2_1_40_1","volume-title":"Lin","author":"Shi Peng","year":"2019","unstructured":"Peng Shi and Jimmy J . Lin . 2019 . Simple BERT Models for Relation Extraction and Semantic Role Labeling. ArXiv , Vol. abs\/ 1904 .05255 (2019). Peng Shi and Jimmy J. Lin. 2019. Simple BERT Models for Relation Extraction and Semantic Role Labeling. ArXiv , Vol. abs\/1904.05255 (2019)."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.525"},{"key":"e_1_3_2_1_42_1","volume-title":"BLEURT: Learning Robust Metrics for Text Generation. In ACL.","author":"Thibault Sellam","year":"2020","unstructured":"Sellam Thibault , Das Dipanjan , and Parikh Ankur . 2020 . BLEURT: Learning Robust Metrics for Text Generation. In ACL. Sellam Thibault, Das Dipanjan, and Parikh Ankur. 2020. BLEURT: Learning Robust Metrics for Text Generation. In ACL."},{"key":"e_1_3_2_1_43_1","volume-title":"a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR","author":"Victor Sanh","year":"2019","unstructured":"Sanh Victor , Debut Lysandre , Chaumond Julien , and Wolf Thomas . 2019. DistilBERT , a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR , Vol. abs\/ 1910 .01108 ( 2019 ). Sanh Victor, Debut Lysandre, Chaumond Julien, and Wolf Thomas. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR , Vol. 
abs\/1910.01108 (2019)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_2_1_45_1","volume-title":"MM '22: The 30th ACM International Conference on Multimedia","author":"Wong Yongkang","year":"2022","unstructured":"Yongkang Wong , Shaojing Fan , Yangyang Guo , Ziwei Xu , Karen Stephen , Rishabh Sheoran , Anusha Bhamidipati , Vivek Barsopia , Jianquan Liu , and Mohan S. Kankanhalli . 2022. Compute to Tell the Tale: Goal-Driven Narrative Generation . In MM '22: The 30th ACM International Conference on Multimedia , Lisboa, Portugal, October 10 - 14 , 2022 . ACM, 6875--6882. Yongkang Wong, Shaojing Fan, Yangyang Guo, Ziwei Xu, Karen Stephen, Rishabh Sheoran, Anusha Bhamidipati, Vivek Barsopia, Jianquan Liu, and Mohan S. Kankanhalli. 2022. Compute to Tell the Tale: Goal-Driven Narrative Generation. In MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022. ACM, 6875--6882."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.507"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11518-023-5561-0"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2020.104091"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00335"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Ting Yao Tao Mei and Yong Rui. 2016. Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization. In CVPR. 982--990.  Ting Yao Tao Mei and Yong Rui. 2016. Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization. In CVPR. 982--990.","DOI":"10.1109\/CVPR.2016.112"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.284"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"crossref","unstructured":"Bin Zhao Xuelong Li and Xiaoqiang Lu. 2018. 
HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization. In CVPR. 7405--7414.  Bin Zhao Xuelong Li and Xiaoqiang Lu. 2018. HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization. In CVPR. 7405--7414.","DOI":"10.1109\/CVPR.2018.00773"},{"key":"e_1_3_2_1_53_1","volume-title":"Towards Automatic Learning of Procedures from Web Instructional Videos. arXiv preprint arXiv:1703.09788","author":"Zhou Luowei","year":"2017","unstructured":"Luowei Zhou , Chenliang Xu , and Jason J Corso . 2017. Towards Automatic Learning of Procedures from Web Instructional Videos. arXiv preprint arXiv:1703.09788 ( 2017 ). io Luowei Zhou, Chenliang Xu, and Jason J Corso. 2017. Towards Automatic Learning of Procedures from Web Instructional Videos. arXiv preprint arXiv:1703.09788 (2017). io"}],"event":{"name":"MM '23: The 31st ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Ottawa ON Canada","acronym":"MM '23"},"container-title":["Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long 
Videos"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607540.3617142","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3607540.3617142","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:28Z","timestamp":1750178188000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3607540.3617142"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,29]]},"references-count":53,"alternative-id":["10.1145\/3607540.3617142","10.1145\/3607540"],"URL":"https:\/\/doi.org\/10.1145\/3607540.3617142","relation":{},"subject":[],"published":{"date-parts":[[2023,10,29]]},"assertion":[{"value":"2023-10-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}