{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,1]],"date-time":"2026-03-01T11:04:49Z","timestamp":1772363089497,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":56,"publisher":"ACM","funder":[{"name":"Australian Research Council","award":["CE200100005"],"award-info":[{"award-number":["CE200100005"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,12,7]]},"DOI":"10.1145\/3767695.3769508","type":"proceedings-article","created":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T17:14:58Z","timestamp":1764782098000},"page":"426-436","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Can We Hide Machines in the Crowd? Quantifying Equivalence in LLM-in-the-loop Annotation Tasks"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-2817-7675","authenticated-orcid":false,"given":"Jiaman","family":"He","sequence":"first","affiliation":[{"name":"RMIT University, Naarm\/Melbourne, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6789-4780","authenticated-orcid":false,"given":"Zikang","family":"Leng","sequence":"additional","affiliation":[{"name":"Georgia Institute of Technology, Atlanta, GA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7522-1842","authenticated-orcid":false,"given":"Dana","family":"McKay","sequence":"additional","affiliation":[{"name":"RMIT University, Naarm\/Melbourne, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9913-433X","authenticated-orcid":false,"given":"Damiano","family":"Spina","sequence":"additional","affiliation":[{"name":"RMIT University, Naarm\/Melbourne, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7801-0239","authenticated-orcid":false,"given":"Johanne R","family":"Trippas","sequence":"additional","affiliation":[{"name":"RMIT University, Naarm\/Melbourne, Australia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,12,6]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3673791.3698431"},{"key":"e_1_3_2_1_2_1","volume-title":"Using crowdsourcing for TREC relevance assessment. Information processing & management","author":"Alonso Omar","year":"2012","unstructured":"Omar Alonso and Stefano Mizzaro. 2012. Using crowdsourcing for TREC relevance assessment. Information processing & management, Vol. 48, 6 (2012), 1053-1066."},{"key":"e_1_3_2_1_3_1","first-page":"1377","volume-title":"Proceedings of the Human Factors and Ergonomics Society Annual Meeting","volume":"64","author":"Armstrong Miriam E","year":"2020","unstructured":"Miriam E Armstrong, McKenna K Tornblad, and Keith S Jones. 2020. The accuracy of interrater reliability estimates found using a subset of the total data sample: A bootstrap analysis. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Vol. 64. SAGE Publications Sage CA: Los Angeles, CA, 1377-1382."},{"key":"e_1_3_2_1_4_1","volume-title":"Inter-coder agreement for computational linguistics. Computational linguistics","author":"Artstein Ron","year":"2008","unstructured":"Ron Artstein and Massimo Poesio. 2008. Inter-coder agreement for computational linguistics. Computational linguistics, Vol. 34, 4 (2008), 555-596."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390334.1390447"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3726302.3730348"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1016\/0030-5073(80)90073-2","article-title":"Cognitive skills in judgment: Subjects' ability to use information about weights, function forms, and organizing principles","volume":"26","author":"Brehmer Berndt","year":"1980","unstructured":"Berndt Brehmer, Roger Hagafors, and Roger Johansson. 1980. Cognitive skills in judgment: Subjects' ability to use information about weights, function forms, and organizing principles. Organizational Behavior and Human Performance, Vol. 26, 3 (1980), 373-385.","journal-title":"Organizational Behavior and Human Performance"},{"key":"e_1_3_2_1_8_1","first-page":"1877","volume-title":"Lin (Eds.)","volume":"33","author":"Brown Tom","year":"2020","unstructured":"Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 1877-1901."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3706101"},{"key":"e_1_3_2_1_10_1","volume-title":"A coefficient of agreement for nominal scales. Educational and psychological measurement","author":"Cohen Jacob","year":"1960","unstructured":"Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement, Vol. 20, 1 (1960), 37-46."},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of AAAI spring symposium: exploring attitude and affect in text","volume":"102","author":"Craggs Richard","year":"2004","unstructured":"Richard Craggs and M Wood. 2004. A two dimensional annotation scheme for emotion in dialogue. In Proceedings of AAAI spring symposium: exploring attitude and affect in text, Vol. 102."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2005.03.007"},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '25)","author":"Palma Dario Di","year":"2025","unstructured":"Dario Di Palma, Felice Antonio Merra, Maurizio Sfilio, Vito Walter Anelli, Fedelucio Narducci, and Tommaso Di Noia. 2025. Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M. In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '25). 2582\u20132586."},{"key":"e_1_3_2_1_14_1","volume-title":"Proc. ICTIR.","author":"Dietz Laura","year":"2025","unstructured":"Laura Dietz, Oleg Zendel, Peter Bailey, Charles Clarke, Ellese Cotterill, Jeff Dalton, Faegheh Hasibi, Mark Sanderson, and Nick Craswell. 2025. Principles and Guidelines for the Use of LLM Judges. In Proc. ICTIR."},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1107-1128","author":"Dong Qingxiu","year":"2024","unstructured":"Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, and Zhifang Sui. 2024. A Survey on In-context Learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1107-1128."},{"key":"e_1_3_2_1_16_1","volume-title":"Use of nonlinear, noncompensatory models as a function of task and amount of information. Organizational behavior and human performance","author":"Einhorn Hillel J","year":"1971","unstructured":"Hillel J Einhorn. 1971. Use of nonlinear, noncompensatory models as a function of task and amount of information. Organizational behavior and human performance, Vol. 6, 1 (1971), 1-27."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3578337.3605136"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1037\/0022-3514.65.2.221","article-title":"You can't not believe everything you read","volume":"65","author":"Gilbert Daniel T","year":"1993","unstructured":"Daniel T Gilbert, Romin W Tafarodi, and Patrick S Malone. 1993. You can't not believe everything you read. Journal of personality and social psychology, Vol. 65, 2 (1993), 221.","journal-title":"Journal of personality and social psychology"},{"key":"e_1_3_2_1_19_1","volume-title":"Probabilistic functioning and the clinical method. Psychological review","author":"Hammond Kenneth R","year":"1955","unstructured":"Kenneth R Hammond. 1955. Probabilistic functioning and the clinical method. Psychological review, Vol. 62, 4 (1955), 255."},{"key":"e_1_3_2_1_20_1","article-title":"The MovieLens Datasets: History and Context","volume":"5","author":"Maxwell Harper F.","year":"2015","unstructured":"F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Transactions on Interactive Intelligent Systems (TiiS), Vol. 5, 4, Article 19 (Dec. 2015), 19:1-19:19 pages.","journal-title":"ACM Transactions on Interactive Intelligent Systems (TiiS)"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2602-2606","author":"He Jiaman","year":"2025","unstructured":"Jiaman He, Zikang Leng, Dana McKay, Johanne R Trippas, and Damiano Spina. 2025. Characterising Topic Familiarity and Query Specificity Using Eye-Tracking Data. In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2602-2606."},{"key":"e_1_3_2_1_22_1","volume-title":"International Conference on Artificial Intelligence and Statistics. PMLR, 5549-5581","author":"Hegselmann Stefan","year":"2023","unstructured":"Stefan Hegselmann, Alejandro Buendia, Hunter Lang, Monica Agrawal, Xiaoyi Jiang, and David Sontag. 2023. Tabllm: Few-shot classification of tabular data with large language models. In International Conference on Artificial Intelligence and Statistics. PMLR, 5549-5581."},{"key":"e_1_3_2_1_23_1","volume-title":"The paramorphic representation of clinical judgment. Psychological bulletin","author":"Hoffman Paul J","year":"1960","unstructured":"Paul J Hoffman. 1960. The paramorphic representation of clinical judgment. Psychological bulletin, Vol. 57, 2 (1960), 116."},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the 1st Workshop on NLP for COVID-19 at ACL","author":"Kenneth Huang Ting-Hao","year":"2020","unstructured":"Ting-Hao Kenneth Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, and C. Lee Giles. 2020. CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000 Abstracts in the COVID-19 Open Research Dataset. In Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020, Karin Verspoor, Kevin Bretonnel Cohen, Mark Dredze, Emilio Ferrara, Jonathan May, Robert Munro, Cecile Paris, and Byron Wallace (Eds.). Association for Computational Linguistics, Online."},{"key":"e_1_3_2_1_25_1","volume-title":"The International Encyclopedia of Communication Theory and Philosophy, 4","author":"Jensen Klaus Bruhn","unstructured":"Klaus Bruhn Jensen, Robert T Craig, Jefferson D Pooley, and Eric W Rothenbuhler. 2016. The International Encyclopedia of Communication Theory and Philosophy, 4 Volume Set. John Wiley & Sons."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/30.1-2.81"},{"key":"e_1_3_2_1_27_1","unstructured":"Klaus Krippendorff. 2011. Computing Krippendorff's alpha-reliability."},{"key":"e_1_3_2_1_28_1","volume-title":"Content Analysis: An Introduction to Its Methodology","author":"Krippendorff Klaus","year":"2018","unstructured":"Klaus Krippendorff. 2018. Content Analysis: An Introduction to Its Methodology (4th ed.). SAGE Publications, Thousand Oaks, CA.","edition":"4"},{"key":"e_1_3_2_1_29_1","volume-title":"The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 805-814","author":"Kutlu Mucahid","year":"2018","unstructured":"Mucahid Kutlu, Tyler McDonnell, Yassmine Barkallah, Tamer Elsayed, and Matthew Lease. 2018a. Crowd vs. expert: What can relevance judgment rationales teach us about assessor disagreement?. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 805-814."},{"key":"e_1_3_2_1_30_1","volume-title":"The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18)","author":"Kutlu Mucahid","year":"2018","unstructured":"Mucahid Kutlu, Tyler McDonnell, Yassmine Barkallah, Tamer Elsayed, and Matthew Lease. 2018b. Crowd vs. Expert: What Can Relevance Judgment Rationales Teach Us About Assessor Disagreement?. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18). 805\u2013814."},{"key":"e_1_3_2_1_31_1","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Lin Stephanie","unstructured":"Stephanie Lin, Jacob Hilton, and Owain Evans. 2022. TruthfulQA: Measuring How Models Mimic Human Falsehoods. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Smaranda Muresan, Preslav Nakov, and Aline Villavicencio (Eds.). Association for Computational Linguistics, 3214-3252."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3592032"},{"key":"e_1_3_2_1_33_1","volume-title":"Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 3045-3049","author":"Felipe Angel","unstructured":"Angel Felipe Magnoss ao de Paula, J Shane Culpepper, Alistair Moffat, Sachin Pathiyan Cherumanal, Falk Scholer, and Johanne Trippas. 2025. The Effects of Demographic Instructions on LLM Personas. In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 3045-3049."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i17.17745"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/262192.262203"},{"key":"e_1_3_2_1_36_1","first-page":"18511","volume-title":"Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen (Eds.). Association for Computational Linguistics","author":"Davani Aida Mostafazadeh","year":"2024","unstructured":"Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, and Vinodkumar Prabhakaran. 2024. D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen (Eds.). Association for Computational Linguistics, Miami, Florida, USA, 18511-18526."},{"key":"e_1_3_2_1_37_1","first-page":"9048","volume-title":"Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen (Eds.). Association for Computational Linguistics","author":"Movva Rajiv","year":"2024","unstructured":"Rajiv Movva, Pang Wei Koh, and Emma Pierson. 2024. Annotation alignment: Comparing LLM and human annotations of conversational safety. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen (Eds.). Association for Computational Linguistics, Miami, Florida, USA, 9048-9062."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2024.3402809"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00293"},{"key":"e_1_3_2_1_40_1","volume-title":"Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII), Jakob Prange and Annemarie Friedrich (Eds.). Association for Computational Linguistics, 252-265","author":"Pei Jiaxin","year":"2023","unstructured":"Jiaxin Pei and David Jurgens. 2023. When Do Annotator Demographics Matter? Measuring the Influence of Annotator Demographics with the POPQUORN Dataset. In Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII), Jakob Prange and Annemarie Friedrich (Eds.). Association for Computational Linguistics, 252-265."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00266"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1037\/0033-2909.113.3.553","article-title":"Using Significance Tests, to Evaluate Equivalence Between Two Experimental Groups","volume":"113","author":"Rogers James L.","year":"1993","unstructured":"James L. Rogers, Kenneth I. Howard, and John T. Vessey. 1993. Using Significance Tests, to Evaluate Equivalence Between Two Experimental Groups. Psychological Bulletin, Vol. 113, 3 (1993), 553-565.","journal-title":"Psychological Bulletin"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3597201","article-title":"How many crowd workers do I need? On statistical power when crowdsourcing relevance judgments","volume":"42","author":"Roitero Kevin","year":"2023","unstructured":"Kevin Roitero, David La Barbera, Michael Soprano, Gianluca Demartini, Stefano Mizzaro, and Tetsuya Sakai. 2023. How many crowd workers do I need? On statistical power when crowdsourcing relevance judgments. ACM Transactions on Information Systems, Vol. 42, 1 (2023), 1-26.","journal-title":"ACM Transactions on Information Systems"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2021.102688"},{"key":"e_1_3_2_1_45_1","volume-title":"Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR).","author":"Roitero Kevin","year":"2020","unstructured":"Kevin Roitero, Michael Soprano, Shaoyang Fan, Damiano Spina, Stefano Mizzaro, and Gianluca Demartini. 2020. Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3726302.3730345"},{"key":"e_1_3_2_1_47_1","volume-title":"16th IEEE International Conference on Tools with Artificial Intelligence. 576-584","author":"Salvador S.","unstructured":"S. Salvador and P. Chan. 2004. Determining the number of clusters\/segments in hierarchical clustering\/segmentation algorithms. In 16th IEEE International Conference on Tools with Artificial Intelligence. 576-584."},{"key":"e_1_3_2_1_48_1","first-page":"1288","volume-title":"Companion Proceedings of the ACM on Web Conference","author":"Schnabel Julian A","year":"2025","unstructured":"Julian A Schnabel, Johanne R Trippas, Falk Scholer, and Danula Hettiachchi. 2025. Multi-stage large language model pipelines can outperform gpt-4o in relevance assessment. In Companion Proceedings of the ACM on Web Conference 2025. 1288-1292."},{"key":"e_1_3_2_1_49_1","volume-title":"Marco La Cascia, et al","author":"Siino Marco","year":"2025","unstructured":"Marco Siino, Ilenia Tinnirello, Marco La Cascia, et al., 2025. From Foundations to GPT in Text Classification: A Comprehensive Survey on Current Approaches and Future Trends. Foundations and Trends\u00ae in Information Retrieval, Vol. 19, 5 (2025), 557-711."},{"key":"e_1_3_2_1_50_1","first-page":"1","article-title":"Noninferiority Trials","volume":"1","author":"Snapinn Steven M","year":"2000","unstructured":"Steven M Snapinn. 2000. Noninferiority Trials. Trials, Vol. 1, 1 (July 2000).","journal-title":"Trials"},{"key":"e_1_3_2_1_51_1","unstructured":"T. Spinde L. Rudnitckaia K. Sinha F. Hamborg B. Gipp and K. Donnay. 2021. MBIC - A Media Bias Annotation Dataset Including Annotator Characteristics. arXiv:2105.11910 [cs.CL] https:\/\/arxiv.org\/abs\/2105.11910"},{"key":"e_1_3_2_1_52_1","volume-title":"An introduction to the bootstrap. Monographs on statistics and applied probability","author":"Tibshirani Robert J","year":"1993","unstructured":"Robert J Tibshirani and Bradley Efron. 1993. An introduction to the bootstrap. Monographs on statistics and applied probability, Vol. 57, 1 (1993), 1-436."},{"key":"e_1_3_2_1_53_1","volume-title":"Computing machinery and intelligence","author":"Turing Alan M","unstructured":"Alan M Turing. 2009. Computing machinery and intelligence. Springer."},{"key":"e_1_3_2_1_54_1","first-page":"422","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Regina Barzilay and Min-Yen Kan (Eds.). Association for Computational Linguistics","author":"Wang William Yang","year":"2017","unstructured":"William Yang Wang. 2017. ''Liar, Liar Pants on Fire'': A New Benchmark Dataset for Fake News Detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Regina Barzilay and Min-Yen Kan (Eds.). Association for Computational Linguistics, Vancouver, Canada, 422-426."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3627508.3638322"},{"key":"e_1_3_2_1_56_1","first-page":"3881","volume-title":"Sentiment Analysis in the Era of Large Language Models: A Reality Check. In Findings of the Association for Computational Linguistics: NAACL","author":"Zhang Wenxuan","year":"2024","unstructured":"Wenxuan Zhang, Yue Deng, Bing Liu, Sinno Pan, and Lidong Bing. 2024. Sentiment Analysis in the Era of Large Language Models: A Reality Check. In Findings of the Association for Computational Linguistics: NAACL 2024. Association for Computational Linguistics, Mexico City, Mexico, 3881-3906."}],"event":{"name":"SIGIR-AP 2025:Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region","location":"Xi'an China","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"]},"container-title":["Proceedings of the 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region"],"original-title":[],"deposited":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T17:21:00Z","timestamp":1764782460000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3767695.3769508"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,6]]},"references-count":56,"alternative-id":["10.1145\/3767695.3769508","10.1145\/3767695"],"URL":"https:\/\/doi.org\/10.1145\/3767695.3769508","relation":{},"subject":[],"published":{"date-parts":[[2025,12,6]]},"assertion":[{"value":"2025-12-06","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}