{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T16:41:06Z","timestamp":1774456866711,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":19,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,23]],"date-time":"2023-10-23T00:00:00Z","timestamp":1698019200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,23]]},"DOI":"10.1145\/3617023.3617042","type":"proceedings-article","created":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T04:11:19Z","timestamp":1696392679000},"page":"137-143","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Summarization of Educational Videos with Transformers Networks"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0097-5161","authenticated-orcid":false,"given":"Leandro Massetti Ribeiro","family":"Oliveira","sequence":"first","affiliation":[{"name":"TeleM\u00eddia@MA Lab \/ PPGCC, Universidade Federal do Maranh\u00e3o, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9192-6471","authenticated-orcid":false,"given":"Li Chang","family":"Shuen","sequence":"additional","affiliation":[{"name":"TeleM\u00eddia@MA Lab \/ DCCMAPI, Universidade Federal do Maranh\u00e3o, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2631-2032","authenticated-orcid":false,"given":"Allan K\u00e1ssio Beckman Soares","family":"da Cruz","sequence":"additional","affiliation":[{"name":"TeleM\u00eddia@MA Lab \/ DCCMAPI, Universidade Federal do Maranh\u00e3o, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6800-1881","authenticated-orcid":false,"given":"Carlos de Salles","family":"Soares","sequence":"additional","affiliation":[{"name":"TeleM\u00eddia@MA Lab \/ DCCMAPI \/ PPGCC, Universidade Federal do Maranh\u00e3o, Brazil"}]}],"member":"320","published-online":{"date-parts":[[2023,10,23]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"European Conference on Computer Vision (pp. 540-555)","author":"Potapov D.","year":"2014","unstructured":"[ 1 ] Potapov , D. , Douze , M. , Harchaoui , Z. , & Schmid , C. ( 2014 ). Category-specific video summarization. In Springer (Ed.) , European Conference on Computer Vision (pp. 540-555) . [S.l.]. [1] Potapov, D., Douze, M., Harchaoui, Z., & Schmid, C. (2014). Category-specific video summarization. In Springer (Ed.), European Conference on Computer Vision (pp. 540-555). [S.l.]."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME51207.2021.9428318"},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5179-5187)","author":"Song Y.","year":"2015","unstructured":"[ 3 ] Song , Y. , Vallmitjana , J. , Stent , A. , & Jaimes , A. ( 2015 ). Tvsum: Summarizing web videos using titles . In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5179-5187) . [3] Song, Y., Vallmitjana, J., Stent, A., & Jaimes, A. (2015). Tvsum: Summarizing web videos using titles. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5179-5187)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10639-020-10273-6"},{"key":"e_1_3_2_1_5_1","volume-title":"Classification of important segments in educational videos using multimodal features. arXiv preprint arXiv:2010.13626","author":"Ghauri J. A.","year":"2020","unstructured":"[ 5 ] Ghauri , J. A. , Hakimov , S. , & Ewerth , R. ( 2020 ). Classification of important segments in educational videos using multimodal features. arXiv preprint arXiv:2010.13626 . [5] Ghauri, J. A., Hakimov, S., & Ewerth, R. (2020). Classification of important segments in educational videos using multimodal features. arXiv preprint arXiv:2010.13626."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5753\/sbie.2021.217360"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.32604\/cmc.2022.021780"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-47560-4_7"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3539637.3556998"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3428658.3430964"},{"key":"e_1_3_2_1_11_1","volume-title":"Shaping the Video Conferences of Tomorrow With AI. In Anais Estendidos do XXVI Simp\u00f3sio Brasileiro de Sistemas Multim\u00eddia e Web (pp. 165-168)","author":"Mendes P. R. C.","year":"2020","unstructured":"[ 11 ] Mendes , P. R. C. , Vieira , E. S. , de Freitas , P. V. A. , Busson , A. J. G. , Guedes , \u00c1. L. V. , Neto , C. D. S. S. , & Colcher , S. ( 2020 , November) . Shaping the Video Conferences of Tomorrow With AI. In Anais Estendidos do XXVI Simp\u00f3sio Brasileiro de Sistemas Multim\u00eddia e Web (pp. 165-168) . SBC. [11] Mendes, P. R. C., Vieira, E. S., de Freitas, P. V. A., Busson, A. J. G., Guedes, \u00c1. L. V., Neto, C. D. S. S., & Colcher, S. (2020, November). Shaping the Video Conferences of Tomorrow With AI. In Anais Estendidos do XXVI Simp\u00f3sio Brasileiro de Sistemas Multim\u00eddia e Web (pp. 165-168). SBC."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"[\n  12\n  ]  Soares E. R. & Barr\u00e9re E. (2018 October). A framework for automatic topic segmentation in video lectures. In Anais Estendidos do XXIV Simp\u00f3sio Brasileiro de Sistemas Multim\u00eddia e Web (pp. 31-36). SBC.  [12] Soares E. R. & Barr\u00e9re E. (2018 October). A framework for automatic topic segmentation in video lectures. In Anais Estendidos do XXIV Simp\u00f3sio Brasileiro de Sistemas Multim\u00eddia e Web (pp. 31-36). SBC.","DOI":"10.5753\/webmedia.2018.4558"},{"key":"e_1_3_2_1_13_1","first-page":"13988","article-title":"Clip-it! language-guided video summarization","volume":"34","author":"Narasimhan M.","year":"2021","unstructured":"[ 13 ] Narasimhan , M. , Rohrbach , A. , & Darrell , T. ( 2021 ). Clip-it! language-guided video summarization . Advances in Neural Information Processing Systems , 34 , 13988 - 14000 . [13] Narasimhan, M., Rohrbach, A., & Darrell, T. (2021). Clip-it! language-guided video summarization. Advances in Neural Information Processing Systems, 34, 13988-14000.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3460426.3463662"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the 29th ACM International Conference on Multimedia (pp. 1756-1765)","author":"Shang X.","year":"2021","unstructured":"[ 15 ] Shang , X. , Yuan , Z. , Wang , A. , & Wang , C. ( 2021 , October). Multimodal video summarization via time-aware transformers . In Proceedings of the 29th ACM International Conference on Multimedia (pp. 1756-1765) . [15] Shang, X., Yuan, Z., Wang, A., & Wang, C. (2021, October). Multimodal video summarization via time-aware transformers. In Proceedings of the 29th ACM International Conference on Multimedia (pp. 1756-1765)."},{"key":"e_1_3_2_1_16_1","volume-title":"Sentence-BERT: Sentence embeddings using Siamese BERT-networks. arXiv preprint arXiv:1908.10084","author":"Reimers N.","year":"2019","unstructured":"[ 16 ] Reimers , N. , & Gurevych , I. ( 2019 ). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. arXiv preprint arXiv:1908.10084 . [16] Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. arXiv preprint arXiv:1908.10084."},{"key":"e_1_3_2_1_17_1","volume-title":"CoCo@ NIPs. [S.l.: s.n.].","author":"Nguyen T.","year":"2016","unstructured":"[ 17 ] Nguyen , T. , Rosenberg , M. , Song , X. , Gao , J. , Tiwary , S. , Majumder , R. , & Deng , L. ( 2016 ). Ms Marco: A human generated machine reading comprehension dataset . In CoCo@ NIPs. [S.l.: s.n.]. [17] Nguyen, T., Rosenberg, M., Song, X., Gao, J., Tiwary, S., Majumder, R., & Deng, L. (2016). Ms Marco: A human generated machine reading comprehension dataset. In CoCo@ NIPs. [S.l.: s.n.]."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3323503.3360625"},{"key":"e_1_3_2_1_20_1","volume-title":"Multilabel Active Learning for User Context Recognition In-the-Wild","author":"Balraj B.","year":"2021","unstructured":"[ 20 ] Balraj , B. ( 2021 ). Multilabel Active Learning for User Context Recognition In-the-Wild . North Carolina State University . [20] Balraj, B. (2021). Multilabel Active Learning for User Context Recognition In-the-Wild. North Carolina State University."}],"event":{"name":"WebMedia '23: Brazilian Symposium on Multimedia and the Web","location":"Ribeir\u00e3o Preto Brazil","acronym":"WebMedia '23"},"container-title":["Proceedings of the 29th Brazilian Symposium on Multimedia and the Web"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617023.3617042","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3617023.3617042","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:06Z","timestamp":1750178166000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617023.3617042"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,23]]},"references-count":19,"alternative-id":["10.1145\/3617023.3617042","10.1145\/3617023"],"URL":"https:\/\/doi.org\/10.1145\/3617023.3617042","relation":{},"subject":[],"published":{"date-parts":[[2023,10,23]]},"assertion":[{"value":"2023-10-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}