{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,17]],"date-time":"2025-12-17T18:16:36Z","timestamp":1765995396566,"version":"3.40.3"},"publisher-location":"Cham","reference-count":34,"publisher":"Springer Nature Switzerland","isbn-type":[{"type":"print","value":"9783031635359"},{"type":"electronic","value":"9783031635366"}],"license":[{"start":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T00:00:00Z","timestamp":1704067200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,7,17]],"date-time":"2024-07-17T00:00:00Z","timestamp":1721174400000},"content-version":"vor","delay-in-days":198,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We are concerned with extracting argumentative fragments from social media, exemplified with a case study on a large corpus of English tweets about the UK Brexit referendum in 2016. Our overall approach is to parse the corpus using dedicated corpus queries that fill designated slots in predefined logical patterns. We present an inventory of logical patterns and corresponding queries, which have been carefully designed and refined. While a gold standard of substantial size is difficult to obtain by manual annotation, our queries can retrieve hundreds of thousands of examples with high precision. We show how queries can be combined to extract complex nested statements relevant to argumentation. We also show how to proceed for applications needing higher recall: high-precision query matches can be used as training data for an LLM classifier, and the trade-off between precision and recall can be freely adjusted with its cutoff threshold.<\/jats:p>","DOI":"10.1007\/978-3-031-63536-6_10","type":"book-chapter","created":{"date-parts":[[2024,7,16]],"date-time":"2024-07-16T05:01:50Z","timestamp":1721106110000},"page":"163-181","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Finding Argument Fragments on\u00a0Social Media with\u00a0Corpus Queries and\u00a0LLMs"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4650-1455","authenticated-orcid":false,"given":"Nathan","family":"Dykes","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4192-2437","authenticated-orcid":false,"given":"Stephanie","family":"Evert","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4785-9205","authenticated-orcid":false,"given":"Philipp","family":"Heinrich","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2251-8519","authenticated-orcid":false,"given":"Merlin","family":"Humml","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3146-5906","authenticated-orcid":false,"given":"Lutz","family":"Schr\u00f6der","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,7,17]]},"reference":[{"issue":"7","key":"10_CR1","doi-asserted-by":"publisher","first-page":"2147","DOI":"10.1007\/s00500-018-3380-x","volume":"23","author":"T Alsinet","year":"2019","unstructured":"Alsinet, T., Argelich, J., B\u00e9jar, R., Cemeli, J.: A distributed argumentation algorithm for mining consistent opinions in weighted Twitter discussions. Soft. Comput. 23(7), 2147\u20132166 (2019). https:\/\/doi.org\/10.1007\/s00500-018-3380-x","journal-title":"Soft. Comput."},{"key":"10_CR2","doi-asserted-by":"crossref","unstructured":"Beck, T., Lee, J.U., Viehmann, C., Maurer, M., Quiring, O. and Gurevych, I.: Investigating label suggestions for opinion mining in german covid-19 social media (2021)","DOI":"10.18653\/v1\/2021.acl-long.1"},{"key":"10_CR3","doi-asserted-by":"crossref","unstructured":"Bhatti, M.M.A., Ahmad, A.S., Park, J.: Argument Mining on Twitter: a case study on the planned parenthood debate. In: Proceedings of the 8th Workshop on Argument Mining, pp. 1\u201311. Association for Computational Linguistics, Punta Cana, Dominican Republic (2021)https:\/\/doi.org\/10.18653\/v1\/2021.argmining-1.1","DOI":"10.18653\/v1\/2021.argmining-1.1"},{"key":"10_CR4","unstructured":"Bosc, T., Cabrio, E., Villata, S.: Tweeties squabbling: positive and negative results in applying argument mining on social media. In: Computational Models of Argument, COMMA 2016. Frontiers Artificial Intelligence Applications, pp. 21-32. IOS Press (2016)"},{"key":"10_CR5","doi-asserted-by":"crossref","unstructured":"Cabrio, E., Villata, S.: Five years of argument mining: a Data\u2013driven Analysis. In: International Joint Conference on Artificial Intelligence, IJCAI 2018, pp. 5427- 5433. ijcai.org (2018)","DOI":"10.24963\/ijcai.2018\/766"},{"key":"10_CR6","doi-asserted-by":"publisher","unstructured":"Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: an Architecture for Development of Robust HLT applications. In: Annual Meeting of the Association for Computational Linguistics, ACL 2002, pp. 168-175 (2002). https:\/\/doi.org\/10.3115\/1073083.1073112","DOI":"10.3115\/1073083.1073112"},{"key":"10_CR7","doi-asserted-by":"crossref","unstructured":"Dusmanu, M., Cabrio, E., Villata, S.: Argument mining on Twitter: arguments, facts and sources. In: Empirical Methods in Natural Language Processing, EMNLP 2017, pp. 2317-2322. ACL (2017)","DOI":"10.18653\/v1\/D17-1245"},{"issue":"1","key":"10_CR8","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1515\/itit-2020-0051","volume":"63","author":"N Dykes","year":"2021","unstructured":"Dykes, N., Evert, S., G\u00f6ttlinger, M., Heinrich, P., Schr\u00f6der, L.: Argument parsing via corpus queries. Inf. Technol. 63(1), 31\u201344 (2021). https:\/\/doi.org\/10.1515\/itit-2020-0051","journal-title":"Inf. Technol."},{"key":"10_CR9","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1007\/s13222-020-00342-y","volume":"20","author":"N Dykes","year":"2020","unstructured":"Dykes, N., Evert, S., G\u00f6ttlinger, M., Heinrich, P., Schr\u00f6der, L.: Reconstructing arguments from noisy text: introduction to the RANT project. Datenbank- Spektrum 20, 123\u2013129 (2020)","journal-title":"Datenbank- Spektrum"},{"key":"10_CR10","unstructured":"Evert, S., Hardie, A.: Twenty-first century Corpus Workbench: updating a query architecture for the new millennium. In: Corpus Linguistics, CL 2011. University of Birmingham (2011)"},{"key":"10_CR11","unstructured":"Evert, S.: The CWB development team: the IMS Open Corpus Workbench (CWB) CQP Interface and Query Language Tutorial. CWB Version 3.5. 2022. https:\/\/cwb.sourceforge.io\/documentation.php"},{"key":"10_CR12","unstructured":"Feger, M., Dietze, S.: TACO\u2013Twitter Arguments from COnversations. (2024)"},{"key":"10_CR13","doi-asserted-by":"publisher","unstructured":"Feng, S.Y., et al.: A Survey of data augmentation approaches for NLP. In: Zong, C., Xia, F., Li, W., Navigli, R. (eds.) Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pp. 968-988. Association for Computational Linguistics, Online (2021). https:\/\/doi.org\/10.18653\/v1\/2021.findings-acl.84","DOI":"10.18653\/v1\/2021.findings-acl.84"},{"key":"10_CR14","doi-asserted-by":"crossref","unstructured":"Goudas, T., Louizos, C., Petasis, G., Karkaletsis, V.: Argument extraction from News, blogs, and Social Media. In: Artificial Intelligence: Methods and Applications, SETN 2014, pp. 287-299. Springer (2014)","DOI":"10.1007\/978-3-319-07064-3_23"},{"issue":"50","key":"10_CR15","first-page":"46","volume":"15","author":"K Grosse","year":"2012","unstructured":"Grosse, K., Ches\u00f1evar, C., Maguitman, A., Estevez, E.: Empowering an eGovernment platform through Twitter-based arguments. Inteligencia Artif. 15(50), 46\u201356 (2012)","journal-title":"Inteligencia Artif."},{"issue":"3","key":"10_CR16","doi-asserted-by":"publisher","first-page":"380","DOI":"10.1075\/ijcl.17.3.04har","volume":"17","author":"A Hardie","year":"2012","unstructured":"Hardie, A.: CQPweb - combining power, flexibility and usability in a corpus analysis tool. Int. J. Corpus Ling. 17(3), 380\u2013409 (2012)","journal-title":"Int. J. Corpus Ling."},{"key":"10_CR17","doi-asserted-by":"publisher","unstructured":"Humml, M., Schr\u00f6der, L.: Common Knowledge of abstract groups. In: AAAI Conference on Artificial Intelligence (AAAI 2023), pp. 6434-6441 (2023). https:\/\/doi.org\/10.1609\/aaai.v37i5.25791","DOI":"10.1609\/aaai.v37i5.25791"},{"key":"10_CR18","doi-asserted-by":"publisher","unstructured":"Liu, H., et al.: Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning. arXiv preprint arXiv:2205.05638 (2022). https:\/\/doi.org\/10.48550\/arXiv.2205.05638","DOI":"10.48550\/arXiv.2205.05638"},{"issue":"6","key":"10_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102055","volume":"56","author":"A Lytos","year":"2019","unstructured":"Lytos, A., Lagkas, T., Sarigiannidis, P., Bontcheva, K.: The evolution of argumentation mining: from models to social media and emerging tools. Inf. Process. Manage. 56(6), 102055 (2019). https:\/\/doi.org\/10.1016\/j.ipm.2019.102055","journal-title":"Inf. Process. Manage."},{"key":"10_CR20","doi-asserted-by":"publisher","first-page":"102446","DOI":"10.1016\/J.SIMPAT.2021.102446","volume":"115","author":"A Lytos","year":"2022","unstructured":"Lytos, A., Lagkas, T., Sarigiannidis, P.G., Argyriou, V., Eleftherakis, G.: Modelling argumentation in short text: a case of social media debate. Simul. Model. Pract. Theory 115, 102446 (2022). https:\/\/doi.org\/10.1016\/J.SIMPAT.2021.102446","journal-title":"Simul. Model. Pract. Theory"},{"issue":"3","key":"10_CR21","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1017\/S1351324901002728","volume":"7","author":"G Minnen","year":"2001","unstructured":"Minnen, G., Carroll, J., Pearce, D.: Applied morphological processing of English. Nat. Lang. Eng. 7(3), 207\u2013223 (2001)","journal-title":"Nat. Lang. Eng."},{"key":"10_CR22","unstructured":"Owoputi, O., O\u2019Connor, B., Dyer, C., Gimpel, K., Schneider, N., Smith, N.: Improved part-of-speech tagging for online conversational text with word clusters. In: Human Language Technologies, HLT-NAACL 2013, pp. 380-390. ACL (2013)"},{"key":"10_CR23","doi-asserted-by":"crossref","unstructured":"Pantel, P., Pennacchiotti, M.: Espresso: leveraging generic patterns for automatically harvesting semantic relations. In: Computational Linguistics \/ Annual Meeting of the Association for Computational Linguistics, ACL 2006. ACL (2006)","DOI":"10.3115\/1220175.1220190"},{"key":"10_CR24","doi-asserted-by":"publisher","unstructured":"Proisl, T., Uhrig, P.: SoMaJo: state-of-the-art tokenization for German web and social media texts. In: Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task, pp. 57-62. Association for Computational Linguistics, Berlin (2016). https:\/\/doi.org\/10.18653\/v1\/W16-2607","DOI":"10.18653\/v1\/W16-2607"},{"key":"10_CR25","doi-asserted-by":"publisher","first-page":"200308","DOI":"10.1016\/j.iswa.2023.200308","volume":"21","author":"Y Qiu","year":"2024","unstructured":"Qiu, Y., Jin, Y.: ChatGPT and finetuned BERT: a comparative study for developing intelligent design support systems. Intell. Syst. Appl. 21, 200308 (2024). https:\/\/doi.org\/10.1016\/j.iswa.2023.200308","journal-title":"Intell. Syst. Appl."},{"key":"10_CR26","unstructured":"Rahman, A.M.M., Yin, W., Wang, G.: Data augmentation for text classification with EASE. In: Abbas, M., Freihat, A.A. (eds.) Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), pp. 324-332. Association for Computational Linguistics, Online (2023)"},{"key":"10_CR27","doi-asserted-by":"crossref","unstructured":"Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using siamese BERT-Networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2019)","DOI":"10.18653\/v1\/D19-1410"},{"key":"10_CR28","doi-asserted-by":"crossref","unstructured":"Ritter, A., Mausam, Etzioni, O., Clark, S.: Open domain event extraction from twitter. In: Knowledge Discovery and Data Mining, KDD 2012, pp. 1104- 1112. ACM (2012)","DOI":"10.1145\/2339530.2339704"},{"key":"10_CR29","unstructured":"Sanh, V., Debut, L., Chaumond, J., Wolf, T.: DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR abs\/1910.01108 (2019)"},{"key":"10_CR30","doi-asserted-by":"publisher","unstructured":"Schaefer, R., Stede, M.: Argument mining on Twitter: a survey. Inf. Technol. 63(1), 45\u201358 (2021). https:\/\/doi.org\/10.1515\/itit-2020-0053","DOI":"10.1515\/itit-2020-0053"},{"key":"10_CR31","doi-asserted-by":"publisher","unstructured":"Shnarch, E.,et al.: Will it Blend? blending weak and strong labeled data in a neural network for argumentation mining. In: Gurevych, I., Miyao, Y. (eds.) Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 599-605. Association for Computational Linguistics, Melbourne, Australia (2018). https:\/\/doi.org\/10.18653\/v1\/P18-2095","DOI":"10.18653\/v1\/P18-2095"},{"key":"10_CR32","doi-asserted-by":"publisher","unstructured":"Son, Y., et al.: Recognizing counterfactual thinking in social media texts. In: Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (2017). https:\/\/doi.org\/10.18653\/v1\/p17-2103","DOI":"10.18653\/v1\/p17-2103"},{"key":"10_CR33","doi-asserted-by":"publisher","unstructured":"Tunstall, L., et al.: Efficient few-shot learning without prompts. arXiv preprint arXiv:2209.11055 (2022). https:\/\/doi.org\/10.48550\/ARXIV.2209.11055","DOI":"10.48550\/ARXIV.2209.11055"},{"key":"10_CR34","doi-asserted-by":"publisher","unstructured":"Walton, D., Reed, C., Macagno, F.: Argumentation Schemes. Cambridge University Press (2008). https:\/\/doi.org\/10.1017\/CBO9780511802034","DOI":"10.1017\/CBO9780511802034"}],"container-title":["Lecture Notes in Computer Science","Robust Argumentation Machines"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-63536-6_10","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,16]],"date-time":"2024-07-16T05:03:58Z","timestamp":1721106238000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-63536-6_10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024]]},"ISBN":["9783031635359","9783031635366"],"references-count":34,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-63536-6_10","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2024]]},"assertion":[{"value":"17 July 2024","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"RATIO","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Conference on Advances in Robust Argumentation Machines","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Bielefeld","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Germany","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2024","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"5 June 2024","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"7 June 2024","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"1","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"ratio2024","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"http:\/\/ratio.sc.cit-ec.uni-bielefeld.de\/de\/home\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}