{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:14:31Z","timestamp":1750220071992,"version":"3.41.0"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2022,11,25]],"date-time":"2022-11-25T00:00:00Z","timestamp":1669334400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100003621","name":"Ministry of Science, ICT and Future Planning","doi-asserted-by":"crossref","award":["PAJ000001-2017-101"],"award-info":[{"award-number":["PAJ000001-2017-101"]}],"id":[{"id":"10.13039\/501100003621","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Ministry of Trade, Industry & Energy","award":["10076583"],"award-info":[{"award-number":["10076583"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2023,1,31]]},"abstract":"<jats:p>Phonetic features are indispensable in understanding the spoken language. Especially in Korean, which is wh-in-situ and head-final, the addressee of spoken language sometimes finds it hard to discern the speaker\u2019s original intention if not provided with the sentence prosody. However, acoustic information may not be guaranteed for all spoken language processing, due to the difficulty of managing and computing speech data. This article suggests a corpus that aims to distinguish utterances with ambiguous intention from clear-cut ones, utilizing the prosodic ambiguity of the text input. In detail, the resulting classification system decides whether the given text input is one of fragment, statement, question, command, rhetorical question\/command, or indecisive, taking into account the intonation-dependency of the text. Based on an intuitive understanding of the Korean language engaged in the data annotation, we construct a corpus with seven intention categories, train classification systems, and validate the utility of our dataset with quantitative and qualitative analyses.<\/jats:p>","DOI":"10.1145\/3529648","type":"journal-article","created":{"date-parts":[[2022,4,20]],"date-time":"2022-04-20T12:00:42Z","timestamp":1650456042000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Text Implicates Prosodic Ambiguity: A Corpus for Intention Identification of the Korean Spoken Language"],"prefix":"10.1145","volume":"22","author":[{"given":"Won Ik","family":"Cho","sequence":"first","affiliation":[{"name":"Seoul National University, Dept. ECE and INMC"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nam Soo","family":"Kim","sequence":"additional","affiliation":[{"name":"Seoul National University, Dept. ECE and INMC"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,11,25]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"1","article-title":"Is that a real question? final rises, final falls, and discourse function in yes-no question intonation","volume":"35","author":"Banuazizi Atissa","year":"1999","unstructured":"Atissa Banuazizi and Cassandre Creswell. 1999. Is that a real question? final rises, final falls, and discourse function in yes-no question intonation. Clin. Lab. Sci. J. 35 (1999), 1\u201314.","journal-title":"Clin. Lab. Sci. J."},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00051"},{"key":"e_1_3_2_4_2","volume-title":"Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC\u201910)","author":"Bunt Harry","year":"2010","unstructured":"Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, et\u00a0al. 2010. Towards an ISO standard for dialogue act annotation. In Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC\u201910)."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/3461778.3462078"},{"key":"e_1_3_2_6_2","volume-title":"Proceedings of the 42th Annual Meeting of the Cognitive Science Society - Developing a Mind: Learning in Humans, Animals, and Machines (CogSci\u201920)","author":"Cho Won Ik","year":"2020","unstructured":"Won Ik Cho, Jeonghwa Cho, Woo Hyun Kang, and Nam Soo Kim. 2020. Text matters but speech influences: A computational analysis of syntactic ambiguity resolution. In Proceedings of the 42th Annual Meeting of the Cognitive Science Society - Developing a Mind: Learning in Humans, Animals, and Machines (CogSci\u201920), Stephanie Denison, Michael Mack, Yang Xu, and Blair C. Armstrong (Eds.)."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2020-1246"},{"key":"e_1_3_2_8_2","article-title":"Keras","author":"Chollet Fran\u00e7ois","year":"2015","unstructured":"Fran\u00e7ois Chollet et\u00a0al. 2015. Keras. Retrieved from https:\/\/github.com\/fchollet\/keras.","journal-title":"https:\/\/github.com\/fchollet\/keras"},{"key":"e_1_3_2_9_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Clark Kevin","year":"2019","unstructured":"Kevin Clark, Minh-Thang Luong, Quoc V. Le, and Christopher D. Manning. 2019. ELECTRA: Pre-training text encoders as discriminators rather than generators. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_3_2_10_2","first-page":"4171","volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171\u20134186."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1037\/h0031619"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1166"},{"key":"e_1_3_2_13_2","unstructured":"Dafydd Gibbon. 2010. The ambiguity of \u2018ambiguity\u2019: Beauty power and understanding. In Ambiguity and the Search for Meaning: English and American Studies at the Beginning of the 21st Century (Volume 2: Language and Culture) . Jagiellonian University Press 33\u201352."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-57351-9_30"},{"key":"e_1_3_2_15_2","first-page":"124","volume-title":"Semantics and Linguistic Theory","author":"Gunlogson Christine","year":"2002","unstructured":"Christine Gunlogson. 2002. Declarative questions. In Semantics and Linguistic Theory, Vol. 12. 124\u2013143."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/SLT.2018.8639043"},{"key":"e_1_3_2_17_2","volume-title":"The Structure and Interpretation of Imperatives: Mood and Force in Universal Grammar","author":"Han Chung-hye","year":"2000","unstructured":"Chung-hye Han. 2000. The Structure and Interpretation of Imperatives: Mood and Force in Universal Grammar. Psychology Press."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.3390\/app112311472"},{"key":"e_1_3_2_19_2","article-title":"Scaling laws for neural language models","author":"Kaplan Jared","year":"2020","unstructured":"Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, and Dario Amodei. 2020. Scaling laws for neural language models. arXiv:2001.08361. Retrieved from https:\/\/arxiv.org\/abs\/2001.08361.","journal-title":"arXiv:2001.08361"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1093\/logcom\/exw009"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1177\/1461445605048768"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_2_23_2","volume-title":"Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915)","author":"Kingma Diederik P.","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915), Yoshua Bengio and Yann LeCun (Eds.)."},{"key":"e_1_3_2_24_2","first-page":"1097","volume-title":"Advances in Neural Information Processing Systems","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. 1097\u20131105."},{"key":"e_1_3_2_25_2","first-page":"437","volume-title":"Proceedings of the 32nd Annual Conference on Human and Cognitive Language Technology","author":"Lee Junbum","year":"2020","unstructured":"Junbum Lee. 2020. KcBERT: Korean comments BERT. In Proceedings of the 32nd Annual Conference on Human and Cognitive Language Technology. 437\u2013440."},{"key":"e_1_3_2_26_2","article-title":"KcELECTRA: Korean Comments ELECTRA","author":"Lee Junbum","year":"2021","unstructured":"Junbum Lee. 2021. KcELECTRA: Korean Comments ELECTRA. Retrieved from https:\/\/github.com\/Beomi\/KcELECTRA.","journal-title":"https:\/\/github.com\/Beomi\/KcELECTRA"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.5555\/519369"},{"key":"e_1_3_2_28_2","first-page":"986","volume-title":"Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Li Yanran","year":"2017","unstructured":"Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. 2017. DailyDialog: A manually labelled multi-turn dialogue dataset. In Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 986\u2013995."},{"key":"e_1_3_2_29_2","unstructured":"Zhouhan Lin Minwei Feng Cicero Nogueira dos Santos Mo Yu Bing Xiang Bowen Zhou and Yoshua Bengio. 2017. A structured self-attentive sentence embedding. In 5th International Conference on Learning Representations ICLR 2017 Toulon France April 24-26 2017 Conference Track Proceedings . OpenReview.net. https:\/\/openreview.net\/forum?id=BJC_jUqxe."},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2016-1352"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-2396"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10988-005-7378-3"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1075\/sl.38.1.05nam"},{"key":"e_1_3_2_34_2","unstructured":"NIKL National Institute of Korean Languages. 2020. NIKL CORPORA 2020 (v.1.0). Retrieved from https:\/\/corpus.korean.go.kr."},{"key":"e_1_3_2_35_2","first-page":"295","article-title":"Jussive clauses and agreement of sentence final particles in Korean","volume":"14","author":"Pak Miok","year":"2006","unstructured":"Miok Pak. 2006. Jussive clauses and agreement of sentence final particles in Korean. Jpn\/Kor. Ling. 14 (2006), 295\u2013306.","journal-title":"Jpn\/Kor. Ling."},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1075\/kl.14.06mdp"},{"key":"e_1_3_2_37_2","article-title":"KoELECTRA: Pretrained ELECTRA Model for Korean","author":"Park Jangwon","year":"2020","unstructured":"Jangwon Park. 2020. KoELECTRA: Pretrained ELECTRA Model for Korean. Retrieved from https:\/\/github.com\/monologg\/KoELECTRA.","journal-title":"https:\/\/github.com\/monologg\/KoELECTRA"},{"key":"e_1_3_2_38_2","unstructured":"Sungjoon Park Jihyung Moon Sungdong Kim Won Ik Cho Jiyoon Han Jangwon Park Chisung Song Junseong Kim Yongsook Song Taehwan Oh Joohong Lee Juhyun Oh Sungwon Lyu Younghoon Jeong Inkwon Lee Sangwoo Seo Dongjun Lee Hyunwoo Kim Myeonghwa Lee Seongbo Jang Seungwon Do Sunkyoung Kim Kyungtae Lim Jongwon Lee Kyumin Park Jamin Shin Seonghyun Kim Lucy Park Alice Oh Jung-Woo Ha and Kyunghyun Cho. 2021. KLUE: Korean language understanding evaluation. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2) . https:\/\/openreview.net\/forum?id=q-8h8-LZiUm."},{"key":"e_1_3_2_39_2","first-page":"235","volume-title":"Semantics and Linguistic Theory","author":"Portner Paul","year":"2004","unstructured":"Paul Portner. 2004. The semantics of imperatives within a theory of clause types. In Semantics and Linguistic Theory, Vol. 14. 235\u2013252."},{"key":"e_1_3_2_40_2","unstructured":"Hannah Rohde. 2006. Rhetorical questions as redundant interrogatives. In San Diego Linguistics Papers . Department of Linguistics UCSD 134\u2013168."},{"key":"e_1_3_2_41_2","first-page":"155","article-title":"Speech act distinctions in syntax","volume":"1","author":"Sadock Jerrold M.","year":"1985","unstructured":"Jerrold M. Sadock and Arnold M. Zwicky. 1985. Speech act distinctions in syntax. Lang. Typol. Syntact. Descript. 1 (1985), 155\u2013196.","journal-title":"Lang. Typol. Syntact. Descript."},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1017\/S0047404500006837"},{"key":"e_1_3_2_44_2","volume-title":"The Syntax of Jussives: Speaker and Hearer at the Syntax-Discourse Interface","author":"Seo Saetbyol","year":"2017","unstructured":"Saetbyol Seo. 2017. The Syntax of Jussives: Speaker and Hearer at the Syntax-Discourse Interface. Ph.D. Dissertation. Seoul National University."},{"key":"e_1_3_2_45_2","first-page":"255","volume-title":"Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING\u201907)","author":"Shimada Kazutaka","year":"2007","unstructured":"Kazutaka Shimada, Kaoru Iwashita, and Tsutomu Endo. 2007. A case study of comparison of several methods for corpus-based speech intention identification. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics (PACLING\u201907). Citeseer, 255\u2013262."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1162\/089120100561737"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.124"},{"key":"e_1_3_2_48_2","article-title":"Korean BERT Pre-trained Cased (KoBERT)","author":"TBrain SK","year":"2019","unstructured":"SK TBrain. 2019. Korean BERT Pre-trained Cased (KoBERT). Retrieved from https:\/\/github.com\/SKTBrain\/KoBERT.","journal-title":"https:\/\/github.com\/SKTBrain\/KoBERT"},{"key":"e_1_3_2_49_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998\u20136008."},{"key":"e_1_3_2_50_2","volume-title":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"10","author":"Vosoughi Soroush","year":"2016","unstructured":"Soroush Vosoughi and Deb Roy. 2016. Tweet acts: A speech act classifier for twitter. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 10."},{"key":"e_1_3_2_51_2","first-page":"4003","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"Wenzek Guillaume","year":"2020","unstructured":"Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzm\u00e1n, Armand Joulin, and Edouard Grave. 2020. CCNet: Extracting high quality monolingual datasets from web crawl data. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 4003\u20134012."},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_3_2_53_2","article-title":"Transformer-based Korean pretrained language models: A survey on three years of progress","author":"Yang Kichang","year":"2021","unstructured":"Kichang Yang. 2021. Transformer-based Korean pretrained language models: A survey on three years of progress. arXiv:2112.03014. Retrieved from https:\/\/arxiv.org\/abs\/2112.03014.","journal-title":"arXiv:2112.03014"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3529648","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3529648","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:09:13Z","timestamp":1750183753000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3529648"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,25]]},"references-count":52,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,1,31]]}},"alternative-id":["10.1145\/3529648"],"URL":"https:\/\/doi.org\/10.1145\/3529648","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2022,11,25]]},"assertion":[{"value":"2019-12-31","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-03-30","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}