{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:37:55Z","timestamp":1777653475105,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":61,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,3,8]],"date-time":"2021-03-08T00:00:00Z","timestamp":1615161600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,8]]},"DOI":"10.1145\/3437963.3441814","type":"proceedings-article","created":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T04:36:17Z","timestamp":1615005377000},"page":"40-48","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["Semi-Supervised Text Classification via Self-Pretraining"],"prefix":"10.1145","author":[{"given":"Payam","family":"Karisani","sequence":"first","affiliation":[{"name":"Emory University, Atlanta, GA, USA"}]},{"given":"Negin","family":"Karisani","sequence":"additional","affiliation":[{"name":"Purdue University, West Lafayette, IN, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,3,8]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Semisupervised Learning for Computational Linguistics","author":"Abney Steven","unstructured":"Steven Abney. 2007. Semisupervised Learning for Computational Linguistics (1st ed.). Chapman & Hall\/CRC.","edition":"1"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1099"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Thayer Alshaabi, David R Dewhurst, et al. 2020. 
The growing echo chamber of social media: Measuring temporal and social contagion dynamics for over 150 languages on Twitter for 2009--2020. arXiv preprint arXiv:2003.03667 (2020).","DOI":"10.1140\/epjds\/s13688-021-00271-0"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1003"},{"key":"e_1_3_2_1_5_1","volume-title":"Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning. In 2020 International Joint Conference on Neural Networks, IJCNN, July 19--24","author":"Arazo Eric","year":"2020","unstructured":"Eric Arazo, Diego Ortego, Paul Albert, et al. 2020. Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning. In 2020 International Joint Conference on Neural Networks, IJCNN, July 19--24, 2020. IEEE, 1--8."},{"key":"e_1_3_2_1_6_1","volume-title":"Smith","author":"Bamman David","year":"2015","unstructured":"David Bamman and Noah A. Smith. 2015. Contextualized Sarcasm Detection on Twitter. In Proceedings of the Ninth ICWSM. 574--577."},{"key":"e_1_3_2_1_7_1","volume-title":"Curriculum Learning. In Proceedings of the 26th ICML (Montreal","author":"Bengio Yoshua","year":"2009","unstructured":"Yoshua Bengio, J\u00e9r\u00f4me Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum Learning. In Proceedings of the 26th ICML (Montreal, Quebec, Canada) (ICML '09). 
Association for Computing Machinery, New York, NY, USA, 41--48."},{"key":"e_1_3_2_1_8_1","first-page":"8","article-title":"MixMatch: A Holistic Approach to Semi-Supervised Learning","volume":"2019","author":"Berthelot David","year":"2019","unstructured":"David Berthelot, Nicholas Carlini, Ian J. Goodfellow, Nicolas Papernot, Avital Oliver, and Colin Raffel. 2019. MixMatch: A Holistic Approach to Semi-Supervised Learning. In NeurIPS 2019, 8--14 Vancouver, BC, Canada. 5050--5060.","journal-title":"NeurIPS"},{"key":"e_1_3_2_1_9_1","volume-title":"Combining Labeled and Unlabeled Data with Co-Training. In Proceedings of the Eleventh COLT, 1998","author":"Blum Avrim","year":"1998","unstructured":"Avrim Blum and Tom M. Mitchell. 1998. Combining Labeled and Unlabeled Data with Co-Training. In Proceedings of the Eleventh COLT, 1998, Madison, Wisconsin, USA, July 24--26, 1998. 92--100."},{"key":"e_1_3_2_1_10_1","unstructured":"Tom B Brown, Benjamin Mann, et al. 2020. Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)."},{"key":"e_1_3_2_1_11_1","volume-title":"Model Compression. 
In Proceedings of the 12th ACM SIGKDD","author":"Bucilu\u01ce Cristian","year":"2006","unstructured":"Cristian Bucilu\u01ce, Rich Caruana, and Alexandru Niculescu-Mizil. 2006. Model Compression. In Proceedings of the 12th ACM SIGKDD (Philadelphia, PA, USA) (KDD '06). 535--541."},{"key":"e_1_3_2_1_12_1","volume-title":"Mitchell","author":"Carlson Andrew","year":"2010","unstructured":"Andrew Carlson, Justin Betteridge, Bryan Kisiel, Burr Settles, Estevam R. Hruschka, and Tom M. Mitchell. 2010. Toward an Architecture for Never-Ending Language Learning. In Proceedings of the Twenty-Fourth AAAI. 1306--1313."},{"key":"e_1_3_2_1_13_1","volume-title":"Curriculum Labeling: Self-paced Pseudo-Labeling for Semi-Supervised Learning. arXiv preprint arXiv:2001.06001","author":"Cascante-Bonilla Paola","year":"2020","unstructured":"Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, and Vicente Ordonez. 2020. Curriculum Labeling: Self-paced Pseudo-Labeling for Semi-Supervised Learning. arXiv preprint arXiv:2001.06001 (2020)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Olivier Chapelle, Bernhard Sch\u00f6lkopf, and Alexander Zien (Eds.). 2006. Semi-Supervised Learning. 
The MIT Press.","DOI":"10.7551\/mitpress\/9780262033589.001.0001"},{"key":"e_1_3_2_1_15_1","volume-title":"A simple framework for contrastive learning of visual representations. arXiv preprint arXiv:2002.05709","author":"Chen Ting","year":"2020","unstructured":"Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. arXiv preprint arXiv:2002.05709 (2020)."},{"key":"e_1_3_2_1_16_1","volume-title":"Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029","author":"Chen Ting","year":"2020","unstructured":"Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey Hinton. 2020. Big self-supervised models are strong semi-supervised learners. arXiv preprint arXiv:2006.10029 (2020)."},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics","volume":"6","author":"Curran James R","year":"2007","unstructured":"James R Curran, Tara Murphy, and Bernhard Scholz. 2007. Minimising semantic drift with mutual exclusion bootstrapping. In Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics, Vol. 6. Bali, 172--180."},{"key":"e_1_3_2_1_18_1","volume-title":"Proc of the 2019 NAACL. 
4171--4186","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proc of the 2019 NAACL. 4171--4186."},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the 35th ICML","volume":"80","author":"Furlanello Tommaso","year":"2018","unstructured":"Tommaso Furlanello, Zachary Chase Lipton, Michael Tschannen, Laurent Itti, and Anima Anandkumar. 2018. Born-Again Neural Networks. In Proceedings of the 35th ICML, Stockholm, Sweden, July 10--15, 2018, Vol. 80. 1602--1611."},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the 49th ACL","author":"Gonz\u00e1lez-Ib\u00e1\u00f1ez Roberto","year":"2011","unstructured":"Roberto Gonz\u00e1lez-Ib\u00e1\u00f1ez, Smaranda Muresan, and Nina Wacholder. 2011. Identifying Sarcasm in Twitter: A Closer Look. In Proceedings of the 49th ACL (Portland, Oregon) (HLT '11). 581--586."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 57th ACL","author":"Gururangan Suchin","unstructured":"Suchin Gururangan, Tam Dang, Dallas Card, and Noah A. Smith. 2019. 
Variational Pretraining for Semi-supervised Text Classification. In Proceedings of the 57th ACL. Florence, Italy, 5880--5894."},{"key":"e_1_3_2_1_22_1","volume-title":"Smith","author":"Gururangan Suchin","year":"2020","unstructured":"Suchin Gururangan, Ana Marasovi\u0107, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, and Noah A. Smith. 2020. Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. In Proceedings of ACL."},{"key":"e_1_3_2_1_23_1","volume-title":"Revisiting Self-Training for Neural Sequence Generation. In 8th International Conference on Learning Representations, ICLR 2020","author":"He Junxian","year":"2020","unstructured":"Junxian He, Jiatao Gu, Jiajun Shen, and Marc'Aurelio Ranzato. 2020. Revisiting Self-Training for Neural Sequence Generation. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26--30, 2020. OpenReview.net. https:\/\/openreview.net\/forum?id=SJgdnAVKDH"},{"key":"e_1_3_2_1_24_1","first-page":"2712","article-title":"Using Pre-Training Can Improve Model Robustness and Uncertainty. In Proceedings of the 36th ICML","volume":"97","author":"Hendrycks Dan","year":"2019","unstructured":"Dan Hendrycks, Kimin Lee, and Mantas Mazeika. 2019. 
Using Pre-Training Can Improve Model Robustness and Uncertainty. In Proceedings of the 36th ICML, California, USA, Vol. 97. 2712--2721.","journal-title":"California, USA"},{"key":"e_1_3_2_1_25_1","volume-title":"Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531","author":"Hinton Geoffrey","year":"2015","unstructured":"Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015)."},{"key":"e_1_3_2_1_26_1","volume-title":"A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering. arXiv preprint arXiv:2005.05257","author":"Holzenberger Nils","year":"2020","unstructured":"Nils Holzenberger, Andrew Blair-Stanek, and Benjamin Van Durme. 2020. A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering. arXiv preprint arXiv:2005.05257 (2020)."},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the 56th ACL. 328--339","author":"Howard Jeremy","year":"2018","unstructured":"Jeremy Howard and Sebastian Ruder. 2018. Universal Language Model Finetuning for Text Classification. In Proceedings of the 56th ACL. 328--339."},{"key":"e_1_3_2_1_28_1","volume-title":"Workshops at the 31st AAAI.","author":"Huang Xiaolei","year":"2017","unstructured":"Xiaolei Huang, Michael C Smith, Michael J Paul, Dmytro Ryzhkov, Sandra C Quinn, David A Broniatowski, and Mark Dredze. 2017. Examining patterns of influenza vaccination in social media. 
In Workshops at the 31st AAAI."},{"key":"e_1_3_2_1_29_1","volume-title":"Proceedings of the 2018 World Wide Web Conference","author":"Karisani Payam","year":"2018","unstructured":"Payam Karisani and Eugene Agichtein. 2018. Did You Really Just Have a Heart Attack? Towards Robust Detection of Personal Health Mentions in Social Media. In Proceedings of the 2018 World Wide Web Conference (Lyon, France). 137--146."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380304"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1611835114"},{"key":"e_1_3_2_1_32_1","volume-title":"Temporal Ensembling for Semi-Supervised Learning. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Conference Track Proceedings.","author":"Laine Samuli","year":"2017","unstructured":"Samuli Laine and Timo Aila. 2017. Temporal Ensembling for Semi-Supervised Learning. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24--26, 2017, Conference Track Proceedings."},{"key":"e_1_3_2_1_33_1","volume-title":"Workshop on challenges in representation learning, ICML","volume":"3","author":"Lee Dong-Hyun","year":"2013","unstructured":"Dong-Hyun Lee. 2013. 
Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In Workshop on challenges in representation learning, ICML, Vol. 3."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Jinhyuk Lee, Wonjin Yoon, et al. 2019. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (09 2019), 1234--1240.","DOI":"10.1093\/bioinformatics\/btz682"},{"key":"e_1_3_2_1_35_1","unstructured":"Kimin Lee, Kibok Lee, Honglak Lee, and Jinwoo Shin. 2018. A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks. In Advances in Neural Information Processing Systems 31. 7167--7177."},{"key":"e_1_3_2_1_36_1","volume-title":"Cohen","author":"McCloskey Michael","year":"1989","unstructured":"Michael McCloskey and Neal J. Cohen. 1989. Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem. Psychology of Learning and Motivation, Vol. 24. 
Academic Press, 109--165."},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 16th ISCRAM","author":"McCreadie Richard","year":"2019","unstructured":"Richard McCreadie, Cody Buntain, and Ian Soboroff. 2019. TREC Incident Streams: Finding Actionable Information on Social Media. In Proceedings of the 16th ISCRAM, 2019."},{"key":"e_1_3_2_1_38_1","first-page":"870","article-title":"Machine learning. 1997","volume":"45","author":"Mitchell Tom M","year":"1997","unstructured":"Tom M Mitchell et al. 1997. Machine learning. 1997. Burr Ridge, IL: McGraw Hill 45, 37 (1997), 870--877.","journal-title":"Burr Ridge, IL: McGraw Hill"},{"key":"e_1_3_2_1_39_1","unstructured":"Subhabrata Mukherjee and Ahmed Hassan Awadallah. 2020. Uncertainty-aware Self-training for Text Classification with Few Labels. arXiv:2006.15315 [cs.CL]"},{"key":"e_1_3_2_1_40_1","volume-title":"5th ICLR 2017","author":"Pereyra Gabriel","year":"2017","unstructured":"Gabriel Pereyra, George Tucker, Jan Chorowski, Lukasz Kaiser, and Geoffrey E. Hinton. 2017. Regularizing Neural Networks by Penalizing Confident Output Distributions. 
In 5th ICLR 2017, Toulon, France, April 24--26, 2017."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1202"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_9"},{"key":"e_1_3_2_1_43_1","unstructured":"Colin Raffel et al. 2019. Exploring the limits of transfer learning with a unified text-to-text transformer. arXiv preprint arXiv:1910.10683 (2019)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1096"},{"key":"e_1_3_2_1_45_1","volume-title":"Proceedings of the 34th ICML","author":"Saito Kuniaki","year":"2017","unstructured":"Kuniaki Saito, Yoshitaka Ushiku, and Tatsuya Harada. 2017. Asymmetric Tri-Training for Unsupervised Domain Adaptation. In Proceedings of the 34th ICML (Sydney, NSW, Australia) (ICML'17). 2988--2997."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"crossref","unstructured":"M. Sajjadi, M. Javanmardi, and T. Tasdizen. 2016. Mutual exclusivity loss for semi-supervised deep learning. In 2016 IEEE (ICIP). 
1908--1912.","DOI":"10.1109\/ICIP.2016.7532690"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1013947519741"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1965.1053799"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01228-1_19"},{"key":"e_1_3_2_1_50_1","volume-title":"Proceedings of the ACL 2010","author":"S\u00f8gaard Anders","year":"2010","unstructured":"Anders S\u00f8gaard. 2010. Simple Semi-Supervised Training of Part-of-Speech Taggers. In Proceedings of the ACL 2010 (Uppsala, Sweden). USA, 205--208."},{"key":"e_1_3_2_1_51_1","volume-title":"Proceedings of the Thirtieth AAAI, February 12--17","author":"Sun Baochen","year":"2016","unstructured":"Baochen Sun, Jiashi Feng, and Kate Saenko. 2016. Return of Frustratingly Easy Domain Adaptation. In Proceedings of the Thirtieth AAAI, February 12--17, 2016, Phoenix, Arizona, USA. 2058--2065."},{"key":"e_1_3_2_1_52_1","unstructured":"Antti Tarvainen and Harri Valpola. 2017. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems 30. 1195--1204."},{"key":"e_1_3_2_1_53_1","volume-title":"Proceedings of the Fourth Social Media Mining for Health Applications (#SMM4H) Workshop & Shared Task. 
Association for Computational Linguistics","author":"Weissenbacher Davy","year":"2019","unstructured":"Davy Weissenbacher and Graciela Gonzalez-Hernandez (Eds.). 2019. Proceedings of the Fourth Social Media Mining for Health Applications (#SMM4H) Workshop & Shared Task. Association for Computational Linguistics, Florence, Italy."},{"key":"e_1_3_2_1_54_1","unstructured":"Thomas Wolf, Lysandre Debut, et al. 2019. HuggingFace's Transformers: State-of-the-art Natural Language Processing. ArXiv abs\/1910.03771 (2019)."},{"key":"e_1_3_2_1_55_1","volume-title":"Reinforced Co-Training. In Proceedings of the 2018 NAACL","author":"Wu Jiawei","year":"2018","unstructured":"Jiawei Wu, Lei Li, and William Yang Wang. 2018. Reinforced Co-Training. In Proceedings of the 2018 NAACL. New Orleans, Louisiana, 1252--1262."},{"key":"e_1_3_2_1_56_1","volume-title":"Unsupervised Data Augmentation for Consistency Training. arXiv preprint arXiv:1904.12848","author":"Xie Qizhe","year":"2019","unstructured":"Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, and Quoc V Le. 2019. Unsupervised Data Augmentation for Consistency Training. 
arXiv preprint arXiv:1904.12848 (2019)."},{"key":"e_1_3_2_1_57_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Xie Qizhe","unstructured":"Qizhe Xie, Minh-Thang Luong, Eduard Hovy, and Quoc V. Le. 2020. Self-Training With Noisy Student Improves ImageNet Classification. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_58_1","volume-title":"33rd","author":"Yarowsky David","unstructured":"David Yarowsky. 1995. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods. In 33rd ACL. Cambridge, Massachusetts, USA, 189--196."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.102107"},{"key":"e_1_3_2_1_60_1","volume-title":"6th ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings.","author":"Zhang Hongyi","year":"2018","unstructured":"Hongyi Zhang, Moustapha Ciss\u00e9, Yann N. Dauphin, and David Lopez-Paz. 2018. mixup: Beyond Empirical Risk Minimization. In 6th ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings."},{"key":"e_1_3_2_1_61_1","volume-title":"Deep Mutual Learning. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Zhang Ying","year":"2018","unstructured":"Ying Zhang, Tao Xiang, Timothy M. 
Hospedales, and Huchuan Lu. 2018. Deep Mutual Learning. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."}],"event":{"name":"WSDM '21: The Fourteenth ACM International Conference on Web Search and Data Mining","location":"Virtual Event Israel","acronym":"WSDM '21","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data","SIGIR ACM Special Interest Group on Information Retrieval"]},"container-title":["Proceedings of the 14th ACM International Conference on Web Search and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3437963.3441814","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3437963.3441814","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:47:36Z","timestamp":1750193256000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3437963.3441814"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,8]]},"references-count":61,"alternative-id":["10.1145\/3437963.3441814","10.1145\/3437963"],"URL":"https:\/\/doi.org\/10.1145\/3437963.3441814","relation":{},"subject":[],"published":{"date-parts":[[2021,3,8]]},"assertion":[{"value":"2021-03-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}