{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T02:19:53Z","timestamp":1778033993888,"version":"3.51.4"},"reference-count":138,"publisher":"Association for Computing Machinery (ACM)","issue":"8","license":[{"start":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T00:00:00Z","timestamp":1671753600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2023,8,31]]},"abstract":"<jats:p>Long documents such as academic articles and business reports have been the standard format to detail out important issues and complicated subjects that require extra attention. An automatic summarization system that can effectively condense long documents into short and concise texts to encapsulate the most important information would thus be significant in aiding the reader\u2019s comprehension. Recently, with the advent of neural architectures, significant research efforts have been made to advance automatic text summarization systems, and numerous studies on the challenges of extending these systems to the long document domain have emerged. In this survey, we provide a comprehensive overview of the research on long document summarization and a systematic evaluation across the three principal components of its research setting: benchmark datasets, summarization models, and evaluation metrics. For each component, we organize the literature within the context of long document summarization and conduct an empirical analysis to broaden the perspective on current research progress. The empirical analysis includes a study on the intrinsic characteristics of benchmark datasets, a multi-dimensional analysis of summarization models, and a review of the summarization evaluation metrics. Based on the overall findings, we conclude by proposing possible directions for future exploration in this rapidly growing field.<\/jats:p>","DOI":"10.1145\/3545176","type":"journal-article","created":{"date-parts":[[2022,6,29]],"date-time":"2022-06-29T12:31:39Z","timestamp":1656505899000},"page":"1-35","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":81,"title":["An Empirical Survey on Long Document Summarization: Datasets, Models, and Metrics"],"prefix":"10.1145","volume":"55","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0488-2616","authenticated-orcid":false,"given":"Huan Yee","family":"Koh","sequence":"first","affiliation":[{"name":"Monash University, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3503-5708","authenticated-orcid":false,"given":"Jiaxin","family":"Ju","sequence":"additional","affiliation":[{"name":"Monash University, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2160-6111","authenticated-orcid":false,"given":"Ming","family":"Liu","sequence":"additional","affiliation":[{"name":"Deakin University, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0794-527X","authenticated-orcid":false,"given":"Shirui","family":"Pan","sequence":"additional","affiliation":[{"name":"Monash University, Australia"}]}],"member":"320","published-online":{"date-parts":[[2022,12,23]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1371"},{"key":"e_1_3_3_3_2","unstructured":"Iz Beltagy Matthew E. Peters and Arman Cohan. 2020. Longformer: The long-document transformer. arXiv:2004.05150. Retrieved from https:\/\/arxiv.org\/abs\/2004.05150."},{"key":"e_1_3_3_4_2","doi-asserted-by":"crossref","unstructured":"Manik Bhandari Pranav Gour Atabak Ashfaq Pengfei Liu and Graham Neubig. 2020. Re-evaluating evaluation in text summarization. arXiv:2010.07100. Retrieved from https:\/\/arxiv.org\/abs\/2010.07100.","DOI":"10.18653\/v1\/2020.emnlp-main.751"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.649"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIRCA48905.2020.9183355"},{"key":"e_1_3_3_7_2","first-page":"298","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Boudin Florian","year":"2013","unstructured":"Florian Boudin and Emmanuel Morin. 2013. Keyphrase extraction for n-best reranking in multi-sentence compression. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 298\u2013305."},{"key":"e_1_3_3_8_2","unstructured":"Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et\u00a0al. 2020. Language models are few-shot learners. arXiv:2005.14165. Retrieved from https:\/\/arxiv.org\/abs\/2005.14165."},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11912"},{"key":"e_1_3_3_10_2","first-page":"28","volume-title":"Proceedings of the International Workshop on Machine Learning for Multimodal Interaction","author":"Carletta Jean","year":"2005","unstructured":"Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, et\u00a0al. 2005. The AMI meeting corpus: A pre-announcement. In Proceedings of the International Workshop on Machine Learning for Multimodal Interaction. Springer, 28\u201339."},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1150"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1060"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01818-3_23"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1223"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.34"},{"key":"e_1_3_3_16_2","doi-asserted-by":"crossref","unstructured":"Jianpeng Cheng and Mirella Lapata. 2016. Neural summarization by extracting sentences and words. arXiv:1603.07252. Retrieved from https:\/\/arxiv.org\/abs\/1603.07252.","DOI":"10.18653\/v1\/P16-1046"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.nlpmc-1.9"},{"key":"e_1_3_3_18_2","first-page":"1163","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Christensen Janara","year":"2013","unstructured":"Janara Christensen, Stephen Soderland, Oren Etzioni, et\u00a0al. 2013. Towards coherent multi-document summarization. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 1163\u20131173."},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1264"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.519"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-2097"},{"key":"e_1_3_3_22_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.470"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.468"},{"key":"e_1_3_3_24_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171\u20134186."},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.594"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.eacl-main.93"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.384"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.454"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2020.113679"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.5555\/1622487.1622501"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.117"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.42"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-016-9475-9"},{"key":"e_1_3_3_34_2","unstructured":"Kavita Ganesan. 2018. ROUGE 2.0: Updated and improved measures for evaluation of summarization tasks. arXiv:1803.01937. Retrieved from https:\/\/arxiv.org\/abs\/1803.01937."},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.295"},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.124"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1443"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2020.3037401"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/383952.383955"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330955"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.322"},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1013"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1620"},{"key":"e_1_3_3_44_2","doi-asserted-by":"crossref","unstructured":"Max Grusky Mor Naaman and Yoav Artzi. 2018. Newsroom: A dataset of 1.3 million summaries with diverse extractive strategies. arXiv:1804.11283. Retrieved from https:\/\/arxiv.org\/abs\/1804.11283.","DOI":"10.18653\/v1\/N18-1065"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1169"},{"key":"e_1_3_3_46_2","unstructured":"Junxian He Wojciech Kry\u015bci\u0144ski Bryan McCann Nazneen Rajani and Caiming Xiong. 2020. Ctrlsum: Towards generic controllable text summarization. arXiv:2012.04281. Retrieved from https:\/\/arxiv.org\/abs\/2012.04281."},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.595"},{"key":"e_1_3_3_48_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1013"},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.33"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.112"},{"key":"e_1_3_3_51_2","first-page":"93","volume-title":"Proceedings of the Joint Workshop on Bibliometric-Enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL)","author":"Jaidka Kokil","year":"2016","unstructured":"Kokil Jaidka, Muthu Kumar Chandrasekaran, Sajal Rustagi, and Min-Yen Kan. 2016. Overview of the CL-Scisumm 2016 shared task. In Proceedings of the Joint Workshop on Bibliometric-Enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL). 93\u2013102."},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2003.1198793"},{"key":"e_1_3_3_53_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1002"},{"key":"e_1_3_3_54_2","article-title":"A statistical interpretation of term specificity and its application in retrieval","author":"Jones Karen Sparck","year":"1972","unstructured":"Karen Sparck Jones. 1972. A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation (1972).","journal-title":"Journal of Documentation"},{"key":"e_1_3_3_55_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.sdp-1.37"},{"key":"e_1_3_3_56_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-emnlp.345"},{"key":"e_1_3_3_57_2","first-page":"2519","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Kim Byeongchang","year":"2019","unstructured":"Byeongchang Kim, Hyunwoo Kim, and Gunhee Kim. 2019. Abstractive summarization of reddit posts with multi-level memory networks. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2519\u20132531."},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(02)00222-9"},{"key":"e_1_3_3_59_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1232"},{"key":"e_1_3_3_60_2","first-page":"48","article-title":"BillSum: A corpus for automatic summarization of US legislation","author":"Kornilova Anastassia","year":"2019","unstructured":"Anastassia Kornilova and Vlad Eidelman. 2019. BillSum: A corpus for automatic summarization of US legislation. EMNLP-IJCNLP (2019), 48.","journal-title":"EMNLP-IJCNLP"},{"key":"e_1_3_3_61_2","unstructured":"Mahnaz Koupaee and William Yang Wang. 2018. WikiHow: A large scale text summarization dataset. arXiv:1810.09305. Retrieved from https:\/\/arxiv.org\/abs\/1810.09305."},{"key":"e_1_3_3_62_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1051"},{"key":"e_1_3_3_63_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.750"},{"key":"e_1_3_3_64_2","doi-asserted-by":"crossref","unstructured":"Wojciech Kry\u015bci\u0144ski Nazneen Rajani Divyansh Agarwal Caiming Xiong and Dragomir Radev. 2021. BookSum: A collection of datasets for long-form narrative summarization. arXiv:2105.08209. Retrieved from https:\/\/arxiv.org\/abs\/2105.08209.","DOI":"10.18653\/v1\/2022.findings-emnlp.488"},{"key":"e_1_3_3_65_2","doi-asserted-by":"publisher","DOI":"10.1145\/215206.215333"},{"key":"e_1_3_3_66_2","first-page":"957","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Kusner Matt","year":"2015","unstructured":"Matt Kusner, Yu Sun, Nicholas Kolkin, and Kilian Weinberger. 2015. From word embeddings to document distances. In Proceedings of the International Conference on Machine Learning. PMLR, 957\u2013966."},{"key":"e_1_3_3_67_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"e_1_3_3_68_2","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Lewis Patrick","year":"2020","unstructured":"Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich K\u00fcttler, Mike Lewis, Wen-tau Yih, Tim Rockt\u00e4schel, et\u00a0al. 2020. Retrieval-augmented generation for knowledge-intensive nlp tasks. In Proceedings of the Advances in Neural Information Processing Systems."},{"key":"e_1_3_3_69_2","doi-asserted-by":"publisher","DOI":"10.1145\/3340531.3412879"},{"key":"e_1_3_3_70_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.147"},{"key":"e_1_3_3_71_2","first-page":"74","volume-title":"Proceedings of the Text Summarization Branches Out","author":"Lin Chin-Yew","year":"2004","unstructured":"Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Proceedings of the Text Summarization Branches Out. 74\u201381."},{"key":"e_1_3_3_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/3292500.3330683"},{"key":"e_1_3_3_73_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-short.9"},{"key":"e_1_3_3_74_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Liu Peter J.","year":"2018","unstructured":"Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, and Noam Shazeer. 2018. Generating wikipedia by summarizing long sequences. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_75_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1387"},{"key":"e_1_3_3_76_2","doi-asserted-by":"crossref","unstructured":"Lefteris Loukas Manos Fergadiotis Ion Androutsopoulos and Prodromos Malakasiotis. 2021. EDGAR-CORPUS: Billions of tokens make the world go round. arXiv:2109.14394. Retrieved from https:\/\/arxiv.org\/abs\/2109.14394.","DOI":"10.18653\/v1\/2021.econlp-1.2"},{"key":"e_1_3_3_77_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.340"},{"key":"e_1_3_3_78_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.470"},{"key":"e_1_3_3_79_2","first-page":"821","volume-title":"Proceedings of the AAAI\/IAAI","author":"Mani Inderjeet","year":"1998","unstructured":"Inderjeet Mani and Eric Bloedorn. 1998. Machine learning of generic and user-focused summarization. In Proceedings of the AAAI\/IAAI. 821\u2013826."},{"key":"e_1_3_3_80_2","doi-asserted-by":"publisher","DOI":"10.1515\/text.1.1988.8.3.243"},{"key":"e_1_3_3_81_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.445"},{"key":"e_1_3_3_82_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.173"},{"key":"e_1_3_3_83_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-short.137"},{"key":"e_1_3_3_84_2","first-page":"404","volume-title":"Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing","author":"Mihalcea Rada","year":"2004","unstructured":"Rada Mihalcea and Paul Tarau. 2004. Textrank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. 404\u2013411."},{"key":"e_1_3_3_85_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K16-1028"},{"key":"e_1_3_3_86_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.536"},{"key":"e_1_3_3_87_2","volume-title":"Proceedings of the 35th Conference on Neural Information Processing Systems","author":"Narasimhan Medhini","year":"2021","unstructured":"Medhini Narasimhan, Anna Rohrbach, and Trevor Darrell. 2021. CLIP-It! language-guided video summarization. In Proceedings of the 35th Conference on Neural Information Processing Systems."},{"key":"e_1_3_3_88_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1206"},{"key":"e_1_3_3_89_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1222"},{"key":"e_1_3_3_90_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.383"},{"key":"e_1_3_3_91_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Paulus Romain","year":"2018","unstructured":"Romain Paulus, Caiming Xiong, and Richard Socher. 2018. A deep reinforced model for abstractive summarization. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_92_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.200"},{"key":"e_1_3_3_93_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1101"},{"key":"e_1_3_3_94_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1502"},{"key":"e_1_3_3_95_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.748"},{"key":"e_1_3_3_96_2","doi-asserted-by":"publisher","DOI":"10.5555\/1599081.1599168"},{"key":"e_1_3_3_97_2","first-page":"1","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel Colin","year":"2020","unstructured":"Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research 21 (2020), 1\u201367.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_3_98_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.459"},{"key":"e_1_3_3_99_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_3_3_100_2","unstructured":"Tobias Rohde Xiaoxia Wu and Yinhan Liu. 2021. Hierarchical learning for generation with long source sequences. arXiv:2104.07545. Retrieved from https:\/\/arxiv.org\/abs\/2104.07545."},{"key":"e_1_3_3_101_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.12"},{"key":"e_1_3_3_102_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1044"},{"issue":"12","key":"e_1_3_3_103_2","first-page":"e26752","article-title":"The new york times annotated corpus","volume":"6","author":"Sandhaus Evan","year":"2008","unstructured":"Evan Sandhaus. 2008. The new york times annotated corpus. Linguistic Data Consortium, Philadelphia 6, 12 (2008), e26752.","journal-title":"Linguistic Data Consortium, Philadelphia"},{"key":"e_1_3_3_104_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1099"},{"key":"e_1_3_3_105_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1212"},{"key":"e_1_3_3_106_2","doi-asserted-by":"publisher","DOI":"10.1145\/3419106"},{"issue":"12","key":"e_1_3_3_107_2","first-page":"28873","article-title":"Text summarization using clustering technique and SVM technique","volume":"10","author":"Shivakumar K.","year":"2015","unstructured":"K. Shivakumar and Rab Soumya. 2015. Text summarization using clustering technique and SVM technique. International Journal of Applied Engineering Research 10, 12 (2015), 28873\u201328881.","journal-title":"International Journal of Applied Engineering Research"},{"key":"e_1_3_3_108_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.111"},{"key":"e_1_3_3_109_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00342"},{"key":"e_1_3_3_110_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Tay Yi","year":"2020","unstructured":"Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, and Donald Metzler. 2020. Long range arena: A benchmark for efficient transformers. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_111_2","article-title":"Efficient transformers: A survey","author":"Tay Yi","year":"2022","unstructured":"Yi Tay, Mostafa Dehghani, Dara Bahri, and Donald Metzler. 2022. Efficient transformers: A survey. ACM Computing Surveys (2022). Just Accepted.","journal-title":"ACM Computing Surveys"},{"key":"e_1_3_3_112_2","doi-asserted-by":"crossref","unstructured":"Priyam Tejaswin Dhruv Naik and Pengfei Liu. 2021. How well do you know your summarization datasets? arXiv:2106.11388. Retrieved from https:\/\/arxiv.org\/abs\/2106.11388.","DOI":"10.18653\/v1\/2021.findings-acl.303"},{"key":"e_1_3_3_113_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2007.01.023"},{"key":"e_1_3_3_114_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"},{"key":"e_1_3_3_115_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.450"},{"key":"e_1_3_3_116_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.298"},{"key":"e_1_3_3_117_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.findings-acl.454"},{"key":"e_1_3_3_118_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.294"},{"key":"e_1_3_3_119_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1298"},{"key":"e_1_3_3_120_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.508"},{"key":"e_1_3_3_121_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.451"},{"key":"e_1_3_3_122_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33017386"},{"key":"e_1_3_3_123_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K17-1045"},{"key":"e_1_3_3_124_2","first-page":"2318","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Yatskar Mark","year":"2019","unstructured":"Mark Yatskar. 2019. A qualitative comparison of CoQA, SQuAD 2.0 and QuAC. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2318\u20132323."},{"key":"e_1_3_3_125_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1228"},{"key":"e_1_3_3_126_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.326"},{"key":"e_1_3_3_127_2","unstructured":"Weizhe Yuan Pengfei Liu and Graham Neubig. 2021. Can we automate scientific reviewing? arXiv:2102.00176. Retrieved from https:\/\/arxiv.org\/abs\/2102.00176."},{"key":"e_1_3_3_128_2","volume-title":"Proceedings of the NeurIPS","author":"Zaheer Manzil","year":"2020","unstructured":"Manzil Zaheer, Guru Guruganesh, Kumar Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, et\u00a0al. 2020. Big bird: Transformers for longer sequences. In Proceedings of the NeurIPS."},{"key":"e_1_3_3_129_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1089"},{"key":"e_1_3_3_130_2","first-page":"11328","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zhang Jingqing","year":"2020","unstructured":"Jingqing Zhang, Yao Zhao, Mohammad Saleh, and Peter Liu. 2020. Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In Proceedings of the International Conference on Machine Learning. PMLR, 11328\u201311339."},{"key":"e_1_3_3_131_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Zhang Tianyi","year":"2019","unstructured":"Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2019. BERTScore: Evaluating text generation with BERT. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_3_132_2","article-title":"An exploratory study on long dialogue summarization: What works and what\u2019s next","author":"Zhang Yusen","year":"2021","unstructured":"Yusen Zhang, Ansong Ni, Tao Yu, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah, and Dragomir Radev. 2021. An exploratory study on long dialogue summarization: What works and what\u2019s next. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings. (2021).","journal-title":"In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings."},{"key":"e_1_3_3_133_2","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401327"},{"key":"e_1_3_3_134_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1053"},{"key":"e_1_3_3_135_2","unstructured":"Yao Zhao Mohammad Saleh and Peter J. Liu. 2020. Seal: Segment-wise extractive-abstractive long-form text summarization. arXiv:2006.10213. Retrieved from https:\/\/arxiv.org\/abs\/2006.10213."},{"key":"e_1_3_3_136_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1628"},{"key":"e_1_3_3_137_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.472"},{"key":"e_1_3_3_138_2","doi-asserted-by":"publisher","DOI":"10.3115\/1220835.1220892"},{"key":"e_1_3_3_139_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.474"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3545176","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3545176","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:44Z","timestamp":1750186964000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3545176"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,23]]},"references-count":138,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2023,8,31]]}},"alternative-id":["10.1145\/3545176"],"URL":"https:\/\/doi.org\/10.1145\/3545176","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,23]]},"assertion":[{"value":"2022-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-14","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-12-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}