{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T08:11:42Z","timestamp":1759133502778,"version":"3.41.0"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2017,10,25]],"date-time":"2017-10-25T00:00:00Z","timestamp":1508889600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2018,3,31]]},"abstract":"<jats:p>\n            High-quality, labeled data is essential for successfully applying machine learning methods to real-world text classification problems. However, in many cases, the amount of labeled data is very small compared to that of the unlabeled, and labeling additional samples could be expensive and time consuming. Co-training algorithms, which make use of unlabeled data to improve classification, have proven to be very effective in such cases. Generally, co-training algorithms work by using two classifiers, trained on two different views of the data, to label large amounts of unlabeled data. Doing so can help minimize the human effort required for labeling new data, as well as improve classification performance. In this article, we propose an ensemble-based co-training approach that uses an ensemble of classifiers from different training iterations to improve labeling accuracy. This approach, which we call\n            <jats:italic>vertical ensemble<\/jats:italic>\n            , incurs almost no additional computational cost. Experiments conducted on six textual datasets show a significant improvement of over 45% in AUC compared with the original co-training algorithm.\n          <\/jats:p>","DOI":"10.1145\/3137114","type":"journal-article","created":{"date-parts":[[2017,10,26]],"date-time":"2017-10-26T14:19:33Z","timestamp":1509027573000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Vertical Ensemble Co-Training for Text Classification"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9478-7550","authenticated-orcid":false,"given":"Gilad","family":"Katz","sequence":"first","affiliation":[{"name":"Ben-Gurion University of the Negev, Beer Sheve, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cornelia","family":"Caragea","sequence":"additional","affiliation":[{"name":"University of North Texas, Denton, TX"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Asaf","family":"Shabtai","sequence":"additional","affiliation":[{"name":"Ben-Gurion University of the Negev, Beer Sheve, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,10,25]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Maria-Florina Balcan Avrim Blum and Ke Yang. 2004. Co-training and expansion: Towards bridging theory and practice. In Advances in Neural Information Processing Systems. 89--96.  Maria-Florina Balcan Avrim Blum and Ke Yang. 2004. Co-training and expansion: Towards bridging theory and practice. In Advances in Neural Information Processing Systems. 89--96."},{"key":"e_1_2_1_2_1","unstructured":"Maria F. Balcan Avrim Blum and Ke Yang. 2005. Co-training and expansion: Towards bridging theory and practice. In Advances in Neural Information Processing Systems.  Maria F. Balcan Avrim Blum and Ke Yang. 2005. Co-training and expansion: Towards bridging theory and practice. In Advances in Neural Information Processing Systems."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/279943.279962"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015330.1015350"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00058655"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01889584"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 28th International Conference on Machine Learning (ICML\u201911)","author":"Chen Minmin","year":"2011","unstructured":"Minmin Chen , Kilian Weinberger , and Yixin Chen . 2011 . Automatic feature decomposition for single view co-training . In Proceedings of the 28th International Conference on Machine Learning (ICML\u201911) . 953--960. Minmin Chen, Kilian Weinberger, and Yixin Chen. 2011. Automatic feature decomposition for single view co-training. In Proceedings of the 28th International Conference on Machine Learning (ICML\u201911). 953--960."},{"key":"e_1_2_1_9_1","first-page":"88","volume-title":"Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI\u201908)","author":"Christoudias C. M.","unstructured":"C. M. Christoudias , R. Urtasun , and T. Darrell . 2008. Multi-view learning in the presence of view disagreement . In Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI\u201908) . 88 - 96 . C. M. Christoudias, R. Urtasun, and T. Darrell. 2008. Multi-view learning in the presence of view disagreement. In Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI\u201908). 88-96."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/1248547.1248548"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the ICML 2003 Workshop: The Continuum from Labeled to Unlabeled Data. 80--87","author":"Denis Francois","year":"2003","unstructured":"Francois Denis , Anne Laurent , Razmi Gilleron , and Marc Tommasi . 2003 . Text classification and co-training from positive and unlabeled examples . In Proceedings of the ICML 2003 Workshop: The Continuum from Labeled to Unlabeled Data. 80--87 . Francois Denis, Anne Laurent, Razmi Gilleron, and Marc Tommasi. 2003. Text classification and co-training from positive and unlabeled examples. In Proceedings of the ICML 2003 Workshop: The Continuum from Labeled to Unlabeled Data. 80--87."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45014-9_1"},{"key":"e_1_2_1_13_1","volume-title":"The Handbook of Brain Theory and Neural Networks","author":"Dietterich Thomas G.","unstructured":"Thomas G. Dietterich . 2002. Ensemble learning . In The Handbook of Brain Theory and Neural Networks ( 2 nd ed.). MIT Press , Cambridge, MA , 110--125. Thomas G. Dietterich. 2002. Ensemble learning. In The Handbook of Brain Theory and Neural Networks (2nd ed.). MIT Press, Cambridge, MA, 110--125.","edition":"2"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2347736.2347755"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2010.158"},{"key":"e_1_2_1_16_1","unstructured":"Matthias Feurer Aaron Klein Katharina Eggensperger Jost Springenberg Manuel Blum and Frank Hutter. 2015. Efficient and robust automated machine learning. In Advances in Neural Information Processing Systems. 2944--2952.  Matthias Feurer Aaron Klein Katharina Eggensperger Jost Springenberg Manuel Blum and Frank Hutter. 2015. Efficient and robust automated machine learning. In Advances in Neural Information Processing Systems. 2944--2952."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the 2nd European Conference on Computational Learning Theory (EuroCOLT\u201995)","author":"Freund Yoav","year":"2093","unstructured":"Yoav Freund and Robert E. Schapire . 1995. A decision-theoretic generalization of on-line learning and an application to boosting . In Proceedings of the 2nd European Conference on Computational Learning Theory (EuroCOLT\u201995) . 23--37. http:\/\/dl.acm.org\/citation.cfm?id&equals;646943.71 2093 . Yoav Freund and Robert E. Schapire. 1995. A decision-theoretic generalization of on-line learning and an application to boosting. In Proceedings of the 2nd European Conference on Computational Learning Theory (EuroCOLT\u201995). 23--37. http:\/\/dl.acm.org\/citation.cfm?id&equals;646943.712093."},{"key":"e_1_2_1_18_1","first-page":"771","article-title":"A short introduction to boosting","volume":"14","author":"Freund Yoav","year":"1999","unstructured":"Yoav Freund and Robert E. Schapire . 1999 . A short introduction to boosting . Journal of the Japanese Society for Artificial Intelligence 14 , 5, 771 -- 780 . Yoav Freund and Robert E. Schapire. 1999. A short introduction to boosting. Journal of the Japanese Society for Artificial Intelligence 14, 5, 771--780.","journal-title":"Journal of the Japanese Society for Artificial Intelligence"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009778005914"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the 19th International Conference on Machine Learning (ICML\u201902)","author":"Ghani Rayid","year":"2002","unstructured":"Rayid Ghani . 2002 . Combining labeled and unlabeled data for multiclass text categorization . In Proceedings of the 19th International Conference on Machine Learning (ICML\u201902) . 187--194. Rayid Ghani. 2002. Combining labeled and unlabeled data for multiclass text categorization. In Proceedings of the 19th International Conference on Machine Learning (ICML\u201902). 187--194."},{"volume-title":"Proceedings of the 22nd International Conference on World Wide Web (WWW\u201913)","author":"Gollapalli Sujatha Das","key":"e_1_2_1_21_1","unstructured":"Sujatha Das Gollapalli , Cornelia Caragea , Prasenjit Mitra , and C. Lee Giles . 2013. Researcher homepage classification using unlabeled data . In Proceedings of the 22nd International Conference on World Wide Web (WWW\u201913) . 471--482. http:\/\/dl.acm.org\/citation.cfm?id&equals;2488388.2488430. Sujatha Das Gollapalli, Cornelia Caragea, Prasenjit Mitra, and C. Lee Giles. 2013. Researcher homepage classification using unlabeled data. In Proceedings of the 22nd International Conference on World Wide Web (WWW\u201913). 471--482. http:\/\/dl.acm.org\/citation.cfm?id&equals;2488388.2488430."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2767135"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1021\/ci0342472"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 16th International Conference on Machine Learning(ICML\u201999)","author":"Joachims Thorsten","year":"1999","unstructured":"Thorsten Joachims . 1999 . Transductive inference for text classification using support vector machines . In Proceedings of the 16th International Conference on Machine Learning(ICML\u201999) . 200--209. Thorsten Joachims. 1999. Transductive inference for text classification using support vector machines. In Proceedings of the 16th International Conference on Machine Learning(ICML\u201999). 200--209."},{"volume-title":"Transductive support vector machines","author":"Joachims Thorsten","key":"e_1_2_1_26_1","unstructured":"Thorsten Joachims . 2006. Transductive support vector machines . In Semi-Supervised Learning, O. Chapelle, B. Scholkopf, and A. Zieneds (Eds.). MIT Press , Cambridge, MA , 105--118. Thorsten Joachims. 2006. Transductive support vector machines. In Semi-Supervised Learning, O. Chapelle, B. Scholkopf, and A. Zieneds (Eds.). MIT Press, Cambridge, MA, 105--118."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.04.009"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-43968-5_5"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/782096.782104"},{"key":"e_1_2_1_30_1","unstructured":"Anders Krogh and Jesper Vedelsby. 1995. Neural network ensembles cross validation and active learning. In Advances in Neural Information Processing Systems. 231--238.  Anders Krogh and Jesper Vedelsby. 1995. Neural network ensembles cross validation and active learning. In Advances in Neural Information Processing Systems. 231--238."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2003.1238406"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972801.21"},{"key":"e_1_2_1_33_1","volume-title":"Hyperband: A novel bandit-based approach to hyperparameter optimization. arXiv:1603.06560.","author":"Li Lisha","year":"2016","unstructured":"Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , and Ameet Talwalkar . 2016 . Hyperband: A novel bandit-based approach to hyperparameter optimization. arXiv:1603.06560. Lisha Li, Kevin Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, and Ameet Talwalkar. 2016. Hyperband: A novel bandit-based approach to hyperparameter optimization. arXiv:1603.06560."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 2009 IEEE 12th International Conference on Computer Visiong (ICCV\u201909)","author":"Liu Rong","year":"2009","unstructured":"Rong Liu , Jian Cheng , and Hanqing Lu . 2009 . A robust boosting tracker with minimum error bound in a co-training framework . In Proceedings of the 2009 IEEE 12th International Conference on Computer Visiong (ICCV\u201909) . 1459--1466. Rong Liu, Jian Cheng, and Hanqing Lu. 2009. A robust boosting tracker with minimum error bound in a co-training framework. In Proceedings of the 2009 IEEE 12th International Conference on Computer Visiong (ICCV\u201909). 1459--1466."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972788.74"},{"key":"e_1_2_1_36_1","first-page":"11","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 142--150","author":"Maas Andrew L.","year":"2011","unstructured":"Andrew L. Maas , Raymond E. Daly , Peter T. Pham , Dan Huang , Andrew Y. Ng , and Christopher Potts . 2011 . Learning word vectors for sentiment analysis . In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 142--150 . http:\/\/www.aclweb.org\/anthology\/P 11 - 1015 . Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. 2011. Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 142--150. http:\/\/www.aclweb.org\/anthology\/P11-1015."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1529282.1529735"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2009.08.025"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the HLT-NAACL 2004 Workshop: 8th Conference on Computational Natural Language Learning (CoNLL\u201904)","author":"Mihalcea Rada","year":"2004","unstructured":"Rada Mihalcea . 2004 . Co-training and self-training for word sense disambiguation . In Proceedings of the HLT-NAACL 2004 Workshop: 8th Conference on Computational Natural Language Learning (CoNLL\u201904) . 33--40. Rada Mihalcea. 2004. Co-training and self-training for word sense disambiguation. In Proceedings of the HLT-NAACL 2004 Workshop: 8th Conference on Computational Natural Language Learning (CoNLL\u201904). 33--40."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073142"},{"volume-title":"Proceedings of the 19th International Conference on Machine Learning (ICML\u201902)","author":"Muslea Ion","key":"e_1_2_1_41_1","unstructured":"Ion Muslea , Steven Minton , and Craig A. Knoblock . 2002. Active + semi-supervised learning &equals; robust multi-view learning . In Proceedings of the 19th International Conference on Machine Learning (ICML\u201902) . 435--442. http:\/\/dl.acm.org\/citation.cfm?id&equals;645531.655845. Ion Muslea, Steven Minton, and Craig A. Knoblock. 2002. Active + semi-supervised learning &equals; robust multi-view learning. In Proceedings of the 19th International Conference on Machine Learning (ICML\u201902). 435--442. http:\/\/dl.acm.org\/citation.cfm?id&equals;645531.655845."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/354756.354805"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007692713085"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118693.1118704"},{"key":"e_1_2_1_45_1","first-page":"61","article-title":"Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods","volume":"10","author":"Platt John","year":"1999","unstructured":"John Platt . 1999 . Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods . Advances in Large Margin Classifiers 10 , 3, 61 -- 74 . John Platt. 1999. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in Large Margin Classifiers 10, 3, 61--74.","journal-title":"Advances in Large Margin Classifiers"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073336.1073359"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2008.4562952"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of the ICML Workshop on Learning with Multiple Views.","author":"Sindhwani Vikas","year":"2005","unstructured":"Vikas Sindhwani , Partha Niyogi , and Mikhail Belkin . 2005 . A co-regularization approach to semi-supervised learning with multiple views . In Proceedings of the ICML Workshop on Learning with Multiple Views. Vikas Sindhwani, Partha Niyogi, and Mikhail Belkin. 2005. A co-regularization approach to semi-supervised learning with multiple views. In Proceedings of the ICML Workshop on Learning with Multiple Views."},{"key":"e_1_2_1_49_1","unstructured":"Bradly C. Stadie Sergey Levine and Pieter Abbeel. 2015. Incentivizing exploration in reinforcement learning with deep predictive models. arXiv:1507.00814.  Bradly C. Stadie Sergey Levine and Pieter Abbeel. 2015. Incentivizing exploration in reinforcement learning with deep predictive models. arXiv:1507.00814."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2487629"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.3115\/1690219.1690291"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1162\/153244302760185243"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.5555\/1687878.1687913"},{"key":"e_1_2_1_54_1","volume-title":"Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP\u201911)","author":"Wang William Yang","year":"2011","unstructured":"William Yang Wang , Kapil Thadani , and Kathleen McKeown . 2011 . Identifying event descriptions using co-training with online news summaries . In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP\u201911) . William Yang Wang, Kapil Thadani, and Kathleen McKeown. 2011. Identifying event descriptions using co-training with online news summaries. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP\u201911)."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2011.2157998"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-02326-2_53"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.186"},{"key":"e_1_2_1_58_1","doi-asserted-by":"crossref","unstructured":"X. Zhu and A. B. Goldberg. 2009. Introduction to Semi-Supervised Learning. Morgan 8 Claypool.  X. Zhu and A. B. Goldberg. 2009. Introduction to Semi-Supervised Learning. Morgan 8 Claypool.","DOI":"10.1007\/978-3-031-01548-9"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3137114","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3137114","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:11:10Z","timestamp":1750212670000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3137114"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,25]]},"references-count":58,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,3,31]]}},"alternative-id":["10.1145\/3137114"],"URL":"https:\/\/doi.org\/10.1145\/3137114","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"type":"print","value":"2157-6904"},{"type":"electronic","value":"2157-6912"}],"subject":[],"published":{"date-parts":[[2017,10,25]]},"assertion":[{"value":"2017-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-10-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}