{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T03:31:57Z","timestamp":1775100717576,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":55,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,5,23]],"date-time":"2022-05-23T00:00:00Z","timestamp":1653264000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,5,23]]},"DOI":"10.1145\/3524842.3528487","type":"proceedings-article","created":{"date-parts":[[2022,10,18]],"date-time":"2022-10-18T00:08:36Z","timestamp":1666051716000},"page":"247-251","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["SOSum"],"prefix":"10.1145","author":[{"given":"Bonan","family":"Kou","sequence":"first","affiliation":[{"name":"Purdue University"}]},{"given":"Yifeng","family":"Di","sequence":"additional","affiliation":[{"name":"Purdue University"}]},{"given":"Muhao","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Southern California"}]},{"given":"Tianyi","family":"Zhang","sequence":"additional","affiliation":[{"name":"Purdue University"}]}],"member":"320","published-online":{"date-parts":[[2022,10,17]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2022. Document Summarization on CNN\/Daily Mail. https:\/\/paperswithcode.com\/sota\/document-summarization-on-cnn-daily-mail. Accessed: 2022-3-29.  2022. Document Summarization on CNN\/Daily Mail. https:\/\/paperswithcode.com\/sota\/document-summarization-on-cnn-daily-mail. Accessed: 2022-3-29."},{"key":"e_1_3_2_1_2_1","unstructured":"2022. The GitHub repository of SOSum and SO post labeling tools. https:\/\/github.com\/BonanKou\/SOSum-A-Dataset-of-Extractive-Summaries-of-Stack-Overflow-Posts-and-labeling-tools.  2022. The GitHub repository of SOSum and SO post labeling tools. https:\/\/github.com\/BonanKou\/SOSum-A-Dataset-of-Extractive-Summaries-of-Stack-Overflow-Posts-and-labeling-tools."},{"key":"e_1_3_2_1_3_1","unstructured":"2022. Stack Exchange Data Dump. https:\/\/archive.org\/details\/stackexchange.  2022. Stack Exchange Data Dump. https:\/\/archive.org\/details\/stackexchange."},{"key":"e_1_3_2_1_4_1","unstructured":"2022. Text Summarization on GigaWord. https:\/\/paperswithcode.com\/sota\/text-summarization-on-gigaword. Accessed: 2022-3-29.  2022. Text Summarization on GigaWord. https:\/\/paperswithcode.com\/sota\/text-summarization-on-gigaword. Accessed: 2022-3-29."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MS.2017.31"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-019-09788-5"},{"key":"e_1_3_2_1_7_1","unstructured":"Steven Bird Ewan Klein etal 2009. Natural language processing with Python: analyzing text with the natural language toolkit. \" O'Reilly Media Inc.\".  Steven Bird Ewan Klein et al. 2009. Natural language processing with Python : analyzing text with the natural language toolkit. \" O'Reilly Media Inc.\"."},{"key":"e_1_3_2_1_8_1","volume-title":"Saad Benjelloun, Ikram Chairi, and Ismail Berrada.","author":"Boujou ElMehdi","year":"2021","unstructured":"ElMehdi Boujou , Hamza Chataoui , Abdellah El Mekki , Saad Benjelloun, Ikram Chairi, and Ismail Berrada. 2021 . An open access NLP dataset for Arabic dialects: Data collection, labeling, and model construction. arXiv preprint arXiv:2102.11000 (2021). ElMehdi Boujou, Hamza Chataoui, Abdellah El Mekki, Saad Benjelloun, Ikram Chairi, and Ismail Berrada. 2021. An open access NLP dataset for Arabic dialects: Data collection, labeling, and model construction. arXiv preprint arXiv:2102.11000 (2021)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1518701.1518944"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE43902.2021.00116"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2019.110454"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597008.2597146"},{"key":"e_1_3_2_1_13_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, et al. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_1_14_1","volume-title":"Banditsum: Extractive summarization as a contextual bandit. arXiv preprint arXiv:1809.09672","author":"Dong Yue","year":"2018","unstructured":"Yue Dong , Yikang Shen , 2018 . Banditsum: Extractive summarization as a contextual bandit. arXiv preprint arXiv:1809.09672 (2018). Yue Dong, Yikang Shen, et al. 2018. Banditsum: Extractive summarization as a contextual bandit. arXiv preprint arXiv:1809.09672 (2018)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE.2011.6100069"},{"key":"e_1_3_2_1_16_1","unstructured":"Karl Moritz Hermann Tomas Kocisky etal 2015. Teaching machines to read and comprehend. Advances in neural information processing systems 28 (2015) 1693--1701.  Karl Moritz Hermann Tomas Kocisky et al. 2015. Teaching machines to read and comprehend. Advances in neural information processing systems 28 (2015) 1693--1701."},{"key":"e_1_3_2_1_17_1","volume-title":"Fusionnet: Fusing via fully-aware attention with application to machine comprehension. arXiv preprint arXiv:1711.07341","author":"Huang Hsin-Yuan","year":"2017","unstructured":"Hsin-Yuan Huang , Chenguang Zhu , 2017 . Fusionnet: Fusing via fully-aware attention with application to machine comprehension. arXiv preprint arXiv:1711.07341 (2017). Hsin-Yuan Huang, Chenguang Zhu, et al. 2017. Fusionnet: Fusing via fully-aware attention with application to machine comprehension. arXiv preprint arXiv:1711.07341 (2017)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3238147.3238191"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00300"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"crossref","unstructured":"D. Khashabi S. Min etal 2020. UnifiedQA: Crossing Format Boundaries With a Single QA System. EMNLP - findings (2020).  D. Khashabi S. Min et al. 2020. UnifiedQA: Crossing Format Boundaries With a Single QA System. EMNLP - findings (2020).","DOI":"10.18653\/v1\/2020.findings-emnlp.171"},{"key":"e_1_3_2_1_21_1","volume-title":"Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461","author":"Lewis Mike","year":"2019","unstructured":"Mike Lewis , Yinhan Liu , 2019 . Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461 (2019). Mike Lewis, Yinhan Liu, et al. 2019. Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461 (2019)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSME.2018.00028"},{"key":"e_1_3_2_1_23_1","volume-title":"Unsupervised Deep Bug Report Summarization. In 2018 IEEE\/ACM 26th International Conference on Program Comprehension (ICPC). 144--14411","author":"Li Xiaochen","year":"2018","unstructured":"Xiaochen Li , He Jiang , 2018 . Unsupervised Deep Bug Report Summarization. In 2018 IEEE\/ACM 26th International Conference on Program Comprehension (ICPC). 144--14411 . Xiaochen Li, He Jiang, et al. 2018. Unsupervised Deep Bug Report Summarization. In 2018 IEEE\/ACM 26th International Conference on Program Comprehension (ICPC). 144--14411."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2019.00066"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3387904.3389272"},{"key":"e_1_3_2_1_26_1","unstructured":"Jiakun Liu Sebastian Baltes etal 2021. Characterizing Search Activities on Stack Overflow. (2021).  Jiakun Liu Sebastian Baltes et al. 2021. Characterizing Search Activities on Stack Overflow. (2021)."},{"key":"e_1_3_2_1_27_1","volume-title":"Fine-tune BERT for extractive summarization. arXiv preprint arXiv:1903.10318","author":"Liu Yang","year":"2019","unstructured":"Yang Liu . 2019. Fine-tune BERT for extractive summarization. arXiv preprint arXiv:1903.10318 ( 2019 ). Yang Liu. 2019. Fine-tune BERT for extractive summarization. arXiv preprint arXiv:1903.10318 (2019)."},{"key":"e_1_3_2_1_28_1","volume-title":"Interrater reliability: the kappa statistic. Biochemia medica 22, 3","author":"McHugh Mary L","year":"2012","unstructured":"Mary L McHugh . 2012. Interrater reliability: the kappa statistic. Biochemia medica 22, 3 ( 2012 ), 276--282. Mary L McHugh. 2012. Interrater reliability: the kappa statistic. Biochemia medica 22, 3 (2012), 276--282."},{"key":"e_1_3_2_1_29_1","volume-title":"Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:1906.04165","author":"Miller Derek","year":"2019","unstructured":"Derek Miller . 2019. Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:1906.04165 ( 2019 ). Derek Miller. 2019. Leveraging BERT for extractive text summarization on lectures. arXiv preprint arXiv:1906.04165 (2019)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/SANER48275.2020.9054828"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Shashi Narayan Shay B Cohen etal 2018. Don't Give Me the Details Just the Summary! Topic-aware Convolutional Neural Networks for Extreme Summarization. In (2018).  Shashi Narayan Shay B Cohen et al. 2018. Don't Give Me the Details Just the Summary! Topic-aware Convolutional Neural Networks for Extreme Summarization. In (2018).","DOI":"10.18653\/v1\/D18-1206"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Shashi Narayan Shay B Cohen etal 2018. Ranking sentences for extractive summarization with reinforcement learning. arXiv preprint arXiv:1802.08636 (2018).  Shashi Narayan Shay B Cohen et al. 2018. Ranking sentences for extractive summarization with reinforcement learning. arXiv preprint arXiv:1802.08636 (2018).","DOI":"10.18653\/v1\/N18-1158"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2013.6606701"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597073.2597077"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSME.2018.00057"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2013.2297712"},{"key":"e_1_3_2_1_37_1","volume-title":"Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084","author":"Reimers Nils","year":"2019","unstructured":"Nils Reimers and Iryna Gurevych . 2019 . Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019). Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084 (2019)."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236024.3264585"},{"key":"e_1_3_2_1_39_1","volume-title":"Sequence labeling with multiple annotators. Machine learning 95, 2","author":"Rodrigues Filipe","year":"2014","unstructured":"Filipe Rodrigues , Francisco Pereira , and Bernardete Ribeiro . 2014. Sequence labeling with multiple annotators. Machine learning 95, 2 ( 2014 ), 165--181. Filipe Rodrigues, Francisco Pereira, and Bernardete Ribeiro. 2014. Sequence labeling with multiple annotators. Machine learning 95, 2 (2014), 165--181."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPC.2019.00054"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2568225.2568313"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1985793.1985907"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884800"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE.2017.8115715"},{"key":"e_1_3_2_1_45_1","volume-title":"Extractive summarization using deep learning. arXiv preprint arXiv:1708.04439","author":"Verma Sukriti","year":"2017","unstructured":"Sukriti Verma and Vagisha Nidhi . 2017. Extractive summarization using deep learning. arXiv preprint arXiv:1708.04439 ( 2017 ). Sukriti Verma and Vagisha Nidhi. 2017. Extractive summarization using deep learning. arXiv preprint arXiv:1708.04439 (2017)."},{"key":"e_1_3_2_1_46_1","volume-title":"Automatic Solution Summarization for Crash Bugs. In 2021 IEEE\/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 1286--1297","author":"Wang Haoye","year":"2021","unstructured":"Haoye Wang , Xin Xia , 2021 . Automatic Solution Summarization for Crash Bugs. In 2021 IEEE\/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 1286--1297 . Haoye Wang, Xin Xia, et al. 2021. Automatic Solution Summarization for Crash Bugs. In 2021 IEEE\/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 1286--1297."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1018"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-018-9634-5"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-017-9514-4"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE.2017.8115681"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.133"},{"key":"e_1_3_2_1_52_1","unstructured":"Neng Zhang Qiao Huang etal 2020. Chatbot4qr: Interactive query refinement for technical question retrieval. IEEE Transactions on Software Engineering (2020).  Neng Zhang Qiao Huang et al. 2020. Chatbot4qr: Interactive query refinement for technical question retrieval. IEEE Transactions on Software Engineering (2020)."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3180155.3180260"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2950290.2950298"},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE.2015.24"}],"event":{"name":"MSR '22: 19th International Conference on Mining Software Repositories","location":"Pittsburgh Pennsylvania","acronym":"MSR '22","sponsor":["SIGSOFT ACM Special Interest Group on Software Engineering","IEEE CS"]},"container-title":["Proceedings of the 19th International Conference on Mining Software Repositories"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524842.3528487","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3524842.3528487","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:35Z","timestamp":1750183775000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3524842.3528487"}},"subtitle":["a dataset of stack overflow post summaries"],"short-title":[],"issued":{"date-parts":[[2022,5,23]]},"references-count":55,"alternative-id":["10.1145\/3524842.3528487","10.1145\/3524842"],"URL":"https:\/\/doi.org\/10.1145\/3524842.3528487","relation":{},"subject":[],"published":{"date-parts":[[2022,5,23]]},"assertion":[{"value":"2022-10-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}