{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,2]],"date-time":"2026-01-02T07:44:42Z","timestamp":1767339882980,"version":"3.37.3"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2021,2,27]],"date-time":"2021-02-27T00:00:00Z","timestamp":1614384000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,6,16]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Cervical cancer is one of the most common cancers among women in the world. As at the earlier stage, cervical cancer has fewer symptoms. Cancer research is vital as the prognosis of cancer enables clinical applications for patients. In this study, we demonstrate a new approach that applies an ensemble approach to machine learning models for the automatic diagnosis of cervical cancer. The dataset used in the study is the cervical cancer dataset available at the University of California Irvine database repository. Initially, missing values are imputed (k-nearest neighbors) and then the data are balanced (oversampled). Two feature selection approaches are used to extract the most significant features. The proposed stacking architecture, applied for the first time on the cervical cancer dataset, used time elapse of 5.6\u00a0s and achieved an area under the curve score of 99.7% performing better than the methods used in previous works. The objective of the study is to propose a computational model that can predict the diagnosis of cervical cancer efficiently. Further, the proposed learning architecture is gauged with several ensemble approaches like random forest, gradient boosting, voting ensemble and weighted voting ensemble to perceive the enhancement.<\/jats:p>","DOI":"10.1093\/comjnl\/bxaa198","type":"journal-article","created":{"date-parts":[[2020,12,29]],"date-time":"2020-12-29T20:11:00Z","timestamp":1609272660000},"page":"1527-1539","source":"Crossref","is-referenced-by-count":50,"title":["Computational Prediction of Cervical Cancer Diagnosis Using Ensemble-Based Classification Algorithm"],"prefix":"10.1093","volume":"65","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6299-4109","authenticated-orcid":false,"given":"Surbhi","family":"Gupta","sequence":"first","affiliation":[{"name":"School of Computer Science & Engineering , , Katra, Jammu and Kashmir, 182320, India"},{"name":"Shri Mata Vaishno Devi University , , Katra, Jammu and Kashmir, 182320, India"}]},{"given":"Manoj K","family":"Gupta","sequence":"additional","affiliation":[{"name":"School of Computer Science & Engineering , , Katra, Jammu and Kashmir, 182320, India"},{"name":"Shri Mata Vaishno Devi University , , Katra, Jammu and Kashmir, 182320, India"}]}],"member":"286","published-online":{"date-parts":[[2021,2,27]]},"reference":[{"key":"2022061614514916000_ref1","doi-asserted-by":"crossref","first-page":"31","DOI":"10.3322\/caac.21440","article-title":"Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States","volume":"68","author":"Islami","year":"2018","journal-title":"CA Cancer J. Clin."},{"key":"2022061614514916000_ref2","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.cmpb.2018.05.034","article-title":"A review of image analysis and machine learning techniques for automated cervical cancer screening from pap-smear images","volume":"164","author":"William","year":"2018","journal-title":"Comput. Methods Prog. Biomed."},{"year":"2020","author":"World Health Organization","key":"2022061614514916000_ref3"},{"key":"2022061614514916000_ref4","doi-asserted-by":"crossref","first-page":"781","DOI":"10.1016\/S0140-6736(01)05965-7","article-title":"Survival and recurrence after concomitant chemotherapy and radiotherapy for cancer of the uterine cervix: a systematic review and meta-analysis","volume":"358","author":"Green","year":"2001","journal-title":"Lancet"},{"key":"2022061614514916000_ref5","first-page":"1","article-title":"Machine learning in oncology: a review","volume":"16","author":"Nardini","year":"2020","journal-title":"Ecancermedicalscience"},{"key":"2022061614514916000_ref6","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.imu.2017.12.006","article-title":"Type 2 diabetes mellitus prediction model based on data mining","volume":"10","author":"Wu","year":"2018","journal-title":"Inform. Med. Unlocked"},{"key":"2022061614514916000_ref7","first-page":"267","article-title":"Machine learning algorithms for diagnosis of leukemia","volume":"9","author":"Maria","year":"2020","journal-title":"IJSTR"},{"key":"2022061614514916000_ref8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12885-017-3877-1","article-title":"Using resistin, glucose, age and BMI to predict the presence of breast cancer","volume":"18","author":"Patr\u00edcio","year":"2018","journal-title":"BMC Cancer"},{"key":"2022061614514916000_ref9","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.3390\/cancers11091235","article-title":"Cancer diagnosis using deep learning: a bibliographic review","volume":"11","author":"Munir","year":"2019","journal-title":"Cancers"},{"key":"2022061614514916000_ref10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.cmpb.2017.09.005","article-title":"A deep learning-based multi-model ensemble method for cancer prediction","volume":"153","author":"Xiao","year":"2018","journal-title":"Comput. Methods Prog. Biomed."},{"key":"2022061614514916000_ref11","doi-asserted-by":"crossref","DOI":"10.1109\/CISP-BMEI.2017.8302240","article-title":"A classification model for the prostate cancer based on deep learning","volume-title":"BioMedical Engineering and Informatics (CISP-BMEI), 14\u201316 Oct 2017, Shanghai, China","author":"Liu","year":"2017"},{"key":"2022061614514916000_ref12","first-page":"110","article-title":"An approach based on neural learning for diagnosis of prostate cancer","volume":"21","author":"Gupta","year":"2020","journal-title":"J. Nat. Remedies"},{"key":"2022061614514916000_ref13","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1016\/j.csbj.2014.11.005","article-title":"Machine learning applications in cancer prognosis and prediction","volume":"13","author":"Kourou","year":"2015","journal-title":"Comput. Struct. Biotechnol. J."},{"first-page":"243","year":"2017","author":"Fernandes","key":"2022061614514916000_ref14"},{"key":"2022061614514916000_ref15","first-page":"732","article-title":"Comparison of balancing techniques for unbalanced datasets","volume":"16","author":"Dal Pozzolo","year":"2010","journal-title":"Mach. Learn. Group Univ. Libre Bruxelles Belgium"},{"key":"2022061614514916000_ref16","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1145\/1007730.1007735","article-title":"A study of the behavior of several methods for balancing machine learning training data","volume":"6","author":"Batista","year":"2004","journal-title":"ACM SIGKDD Explor."},{"key":"2022061614514916000_ref17","doi-asserted-by":"crossref","first-page":"1623","DOI":"10.1016\/j.patcog.2014.11.014","article-title":"A novel ensemble method for classifying imbalanced data","volume":"48","author":"Sun","year":"2015","journal-title":"Pattern Recognit."},{"key":"2022061614514916000_ref18","first-page":"1","article-title":"Ten quick tips for machine learning in computational biology","volume":"35","author":"Chicco","year":"2017","journal-title":"BioData Min."},{"key":"2022061614514916000_ref19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0208737","article-title":"Computational prediction of diagnosis and feature selection on mesothelioma patient health records","volume":"14","author":"Chicco","year":"2019","journal-title":"PLoS One"},{"key":"2022061614514916000_ref20","doi-asserted-by":"crossref","first-page":"886","DOI":"10.3844\/jcssp.2019.886.929","article-title":"A wide scale classification of class imbalance problem and its solutions: a systematic literature review","volume":"15","author":"Rekha","year":"2019","journal-title":"J. Comput. Sci."},{"key":"2022061614514916000_ref21","doi-asserted-by":"crossref","first-page":"103089","DOI":"10.1016\/j.jbi.2018.12.003","article-title":"A comprehensive data level analysis for cancer diagnosis on imbalanced data","volume":"90","author":"Fotouhi","year":"2019","journal-title":"J. Biomed. Inform."},{"key":"2022061614514916000_ref22","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-006-6226-1","article-title":"Extremely randomized trees","volume":"63","author":"Geurts","year":"2006","journal-title":"Mach. Learn."},{"key":"2022061614514916000_ref23","first-page":"1","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Otras Caracteristicas"},{"key":"2022061614514916000_ref24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0184370","article-title":"Application of unsupervised analysis techniques to lung cancer patient data","volume":"12","author":"Lynch","year":"2017","journal-title":"PLoS One"},{"key":"2022061614514916000_ref25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0179805","article-title":"Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: the Henry Ford ExercIse Testing (FIT) project","volume":"12","author":"Alghamdi","year":"2017","journal-title":"PLoS One"},{"key":"2022061614514916000_ref26","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1016\/S0034-4257(97)00049-7","article-title":"Decision tree classification of land cover from remotely sensed data: remote sensing of environment","volume":"61","author":"Friedl","year":"1997","journal-title":"Remote Sens. Environ."},{"key":"2022061614514916000_ref27","doi-asserted-by":"crossref","first-page":"250","DOI":"10.2307\/2981538","article-title":"Bayes's Bayesian inference","volume":"145","author":"Thomas","year":"1982","journal-title":"J. Royal Stat. Soc."},{"key":"2022061614514916000_ref28","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"2022061614514916000_ref29","first-page":"115","article-title":"Enhanced classification model for cervical cancer dataset based on cost sensitive classifier","volume":"4","author":"Fatlawi","year":"2017","journal-title":"Int. J. Comput. Techniques"},{"key":"2022061614514916000_ref30","doi-asserted-by":"crossref","first-page":"232","DOI":"10.18201\/ijisae.2017533896","article-title":"Comparison of multi-label classification methods for prediagnosis of cervical cancer","volume":"5","author":"Ceylan","year":"2017","journal-title":"Intell. Syst. Appl. Eng."},{"key":"2022061614514916000_ref31","doi-asserted-by":"crossref","first-page":"25189","DOI":"10.1109\/ACCESS.2017.2763984","article-title":"Data-driven diagnosis of cervical cancer with support vector machine-based approaches","volume":"5","author":"Wu","year":"2017","journal-title":"IEEE Access"},{"key":"2022061614514916000_ref32","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1016\/j.eswa.2018.08.050","article-title":"Classification and diagnosis of cervical cancer with stacked autoencoder and softmax classification","volume":"115","author":"Adem","year":"2019","journal-title":"Expert Syst. Appl."},{"key":"2022061614514916000_ref33","first-page":"1","article-title":"Supervised deep learning embeddings for the prediction of cervical cancer diagnosis","volume":"4","author":"Fernandes","year":"2018","journal-title":"PeerJ"},{"key":"2022061614514916000_ref34","first-page":"149","article-title":"Cervical cancer risk classification based on deep convolutional neural network","volume-title":"2018 Int. Conf. Applied Information Technology and Innovation (ICAITI)","author":"Zahras","year":"2018"},{"key":"2022061614514916000_ref35","doi-asserted-by":"crossref","first-page":"59475","DOI":"10.1109\/ACCESS.2018.2874063","article-title":"Cervical cancer diagnosis using random forest classifier with SMOTE and feature reduction techniques","volume":"6","author":"Abdoh","year":"2018","journal-title":"IEEE Access"},{"key":"2022061614514916000_ref36","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/1471-2164-13-S4-S2","article-title":"How to evaluate performance of prediction methods? Measures and their interpretation in variation effect analysis","volume":"13","author":"Vihinen","year":"2012","journal-title":"BMC Genomics"},{"key":"2022061614514916000_ref37","doi-asserted-by":"crossref","first-page":"76516","DOI":"10.1109\/ACCESS.2020.2989857","article-title":"Diabetes prediction using ensembling of different machine learning classifiers","volume":"8","author":"Das","year":"2020","journal-title":"IEEE Access"},{"key":"2022061614514916000_ref38","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1016\/j.procs.2016.04.016","article-title":"Performance analysis of data mining classification techniques to predict diabetes","volume":"82","author":"Perveen","year":"2016","journal-title":"Procedia Comput. Sci."},{"key":"2022061614514916000_ref39","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1007\/978-1-62703-059-5_22","article-title":"Principal components analysis","volume":"930","author":"Groth","year":"2013","journal-title":"Methods Mol. Biol."},{"key":"2022061614514916000_ref40","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1109\/5254.671091","article-title":"Feature subset selection using genetic algorithm","volume":"13","author":"Yang","year":"1998","journal-title":"IEEE Intell. Syst. Appl."},{"key":"2022061614514916000_ref41","first-page":"313","article-title":"Robust feature selection using ensemble feature selection techniques","volume-title":"Machine Learning and Knowledge Discovery in Databases, European Conference, ECML\/PKDD 2008, Antwerp, Belgium, September 15\u201319, 2008","author":"Saeys","year":"2008"},{"key":"2022061614514916000_ref42","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1037\/h0042519","article-title":"The perceptron: a probabilistic model for information storage and organization in the brain","volume":"65","author":"Rosenblatt","year":"1958","journal-title":"Psychol. Rev."},{"key":"2022061614514916000_ref43","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1021\/ci0500379","article-title":"Boosting: an ensemble learning tool for compound classification and QSAR modeling","volume":"45","author":"Svetnik","year":"2005","journal-title":"J. Chem. Inf. Model."},{"key":"2022061614514916000_ref44","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1080\/00031305.1992.10475879","article-title":"An introduction to kernel and nearest-neighbor nonparametric regression","volume":"46","author":"Altman","year":"1992","journal-title":"Am. Stat."},{"key":"2022061614514916000_ref45","first-page":"512","volume-title":"Advances in Neural Information Processing Systems 12","author":"Mason","year":"1999"},{"key":"2022061614514916000_ref46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10462-009-9124-7","article-title":"Ensemble-based classifiers","volume":"33","author":"Rokach","year":"2010","journal-title":"Artif. Intell. Rev."},{"key":"2022061614514916000_ref47","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1006\/inco.1994.1009","article-title":"The weighted majority algorithm","volume":"108","author":"Littlestone","year":"1994","journal-title":"Inf. Comput."},{"key":"2022061614514916000_ref48","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/S0893-6080(05)80023-1","article-title":"Stacked generalization","volume":"5","author":"Wolpert","year":"1992","journal-title":"Neural Netw."},{"key":"2022061614514916000_ref49","first-page":"265138","article-title":"Breast cancer detection with reduced feature set","volume":"2015","author":"Mert","year":"2014","journal-title":"Comput. Math. Methods Med."},{"key":"2022061614514916000_ref50","first-page":"1322","article-title":"ADASYN: Adaptive synthetic sampling approach for imbalanced learning","volume-title":"2008 IEEE Int. Joint Conf. Neural Networks (IEEE World Congress on Computational Intelligence)","author":"He","year":"2008"},{"key":"2022061614514916000_ref51","doi-asserted-by":"crossref","first-page":"1127","DOI":"10.1109\/IECON.2015.7392251","article-title":"Kernel-based SMOTE for SVM classification of imbalanced datasets","volume-title":"IECON 2015\u201441st Annual Conf. IEEE Industrial Electronics Society","author":"Mathew","year":"2015"},{"key":"2022061614514916000_ref52","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1080\/10556789208805504","article-title":"Robust linear programming discrimination of two linearly inseparable sets","volume":"1","author":"Bennett","year":"1992","journal-title":"Optim. Methods Softw."},{"key":"2022061614514916000_ref53","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.compeleceng.2011.09.001","article-title":"An approach based on probabilistic neural network for diagnosis of Mesothelioma\u2019s disease","volume":"38","author":"Er","year":"2012","journal-title":"Comput. Electr. Eng."}],"container-title":["The Computer Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/comjnl\/article-pdf\/65\/6\/1527\/44080846\/bxaa198.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/comjnl\/article-pdf\/65\/6\/1527\/44080846\/bxaa198.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,20]],"date-time":"2024-08-20T10:02:22Z","timestamp":1724148142000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/comjnl\/article\/65\/6\/1527\/6153484"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,27]]},"references-count":53,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2021,2,27]]},"published-print":{"date-parts":[[2022,6,16]]}},"URL":"https:\/\/doi.org\/10.1093\/comjnl\/bxaa198","relation":{},"ISSN":["0010-4620","1460-2067"],"issn-type":[{"type":"print","value":"0010-4620"},{"type":"electronic","value":"1460-2067"}],"subject":[],"published-other":{"date-parts":[[2022,6]]},"published":{"date-parts":[[2021,2,27]]}}}