{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,3]],"date-time":"2026-01-03T22:54:30Z","timestamp":1767480870818,"version":"3.37.3"},"reference-count":54,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2021,4,9]],"date-time":"2021-04-09T00:00:00Z","timestamp":1617926400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Research Grants Council"},{"name":"Hong Kong Special Administrative Region","award":["CityU 11200218"],"award-info":[{"award-number":["CityU 11200218"]}]},{"DOI":"10.13039\/501100005847","name":"Health and Medical Research Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100005847","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005407","name":"Food and Health Bureau","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100005407","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Government of the Hong Kong Special Administrative Region","award":["07181426"],"award-info":[{"award-number":["07181426"]}]},{"DOI":"10.13039\/100017449","name":"Hong Kong Institute for Data Science","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100017449","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100007567","name":"City University of Hong Kong","doi-asserted-by":"publisher","award":["CityU 11202219","CityU 11203520"],"award-info":[{"award-number":["CityU 11202219","CityU 11203520"]}],"id":[{"id":"10.13039\/100007567","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["32000464"],"award-info":[{"award-number":["32000464"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,10,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Early cancer detection is significant for patient mortality rate reduction. Although machine learning has been widely employed in that context, there are still deficiencies. In this work, we studied different machine learning algorithms for early cancer detection and proposed an Adaptive Support Vector Machine (ASVM) method by synergizing Shuffled Frog Leaping Algorithm and Support Vector Machine (SVM) in this study.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Since ASVM regulates SVM for parameter adaption based on data characteristics, the experimental results reflected the robust generalization capability of ASVM on different datasets under different settings; for instance, ASVM can enhance the sensitivity by over 10% for early cancer detection compared with SVM. Besides, our proposed ASVM outperformed Grid Search + SVM and Random Search + SVM by significant margins in terms of the area under the ROC curve (AUC) (0.938 versus 0.922 versus 0.921).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The proposed algorithm and dataset are available at https:\/\/github.com\/ElaineLIU-920\/ASVM-for-Early-Cancer-Detection.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab236","type":"journal-article","created":{"date-parts":[[2021,4,8]],"date-time":"2021-04-08T22:06:57Z","timestamp":1617919617000},"page":"3099-3105","source":"Crossref","is-referenced-by-count":15,"title":["Early cancer detection from genome-wide cell-free DNA fragmentation via shuffled frog leaping algorithm and support vector machine"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2366-4593","authenticated-orcid":false,"given":"Linjing","family":"Liu","sequence":"first","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9639-1584","authenticated-orcid":false,"given":"Xingjian","family":"Chen","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6062-733X","authenticated-orcid":false,"given":"Ka-Chun","family":"Wong","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong, China"},{"name":"Hong Kong Institute for Data Science, City University of Hong Kong , Hong Kong, China"}]}],"member":"286","published-online":{"date-parts":[[2021,4,9]]},"reference":[{"key":"2023051608274793500_btab236-B1","doi-asserted-by":"crossref","first-page":"1313","DOI":"10.1109\/TMI.2016.2528120","article-title":"AggNet: deep learning from crowds for mitosis detection in breast cancer histology images","volume":"35","author":"Albarqouni","year":"2016","journal-title":"IEEE Trans. Med. Imag"},{"key":"2023051608274793500_btab236-B2","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"key":"2023051608274793500_btab236-B3","doi-asserted-by":"crossref","first-page":"565","DOI":"10.2165\/11631380-000000000-00000","article-title":"Circulating cell-free DNA","volume":"42","author":"Breitbach","year":"2012","journal-title":"Sports Med"},{"key":"2023051608274793500_btab236-B4","first-page":"477","article-title":"Boosting algorithms: regularization, prediction and model fitting","volume":"22","author":"B\u00fchlmann","year":"2007","journal-title":"Stat. Sci"},{"key":"2023051608274793500_btab236-B5","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1186\/s12920-015-0107-z","article-title":"High-resolution characterization of sequence signatures due to non-random cleavage of cell-free dna","volume":"8","author":"Chandrananda","year":"2015","journal-title":"BMC Med. Genomics"},{"key":"2023051608274793500_btab236-B6","first-page":"1","article-title":"Non-invasive early detection of cancer four years before conventional diagnosis using a blood test","volume":"11","author":"Chen","year":"2020","journal-title":"Nat. Commun"},{"year":"2014","author":"Claesen","key":"2023051608274793500_btab236-B7"},{"key":"2023051608274793500_btab236-B8","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1056\/NEJMra1706174","article-title":"Application of cell-free DNA analysis to cancer treatment","volume":"379","author":"Corcoran","year":"2018","journal-title":"N. Engl. J. Med"},{"key":"2023051608274793500_btab236-B9","doi-asserted-by":"crossref","first-page":"1559","DOI":"10.1038\/s41591-018-0177-5","article-title":"Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning","volume":"24","author":"Coudray","year":"2018","journal-title":"Nat. Med"},{"key":"2023051608274793500_btab236-B10","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","author":"Cover","year":"1967","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023051608274793500_btab236-B11","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1038\/s41586-019-1272-6","article-title":"Genome-wide cell-free DNA fragmentation in patients with cancer","volume":"570","author":"Cristiano","year":"2019","journal-title":"Nature"},{"key":"2023051608274793500_btab236-B12","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/nrclinonc.2013.80","article-title":"Circulating tumour cells and cell-free DNA as tools for managing breast cancer","volume":"10","author":"De Mattos-Arruda","year":"2013","journal-title":"Nat. Rev. Clin. Oncol"},{"key":"2023051608274793500_btab236-B13","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1200\/JCO.2012.45.2011","article-title":"Liquid biopsies: genotyping circulating tumor DNA","volume":"32","author":"Diaz","year":"2014","journal-title":"J. Clin. Oncol"},{"key":"2023051608274793500_btab236-B14","doi-asserted-by":"crossref","first-page":"16368","DOI":"10.1073\/pnas.0507904102","article-title":"Detection and quantification of mutations in the plasma of patients with colorectal tumors","volume":"102","author":"Diehl","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"year":"2017","author":"Dua","key":"2023051608274793500_btab236-B15"},{"key":"2023051608274793500_btab236-B16","first-page":"92","article-title":"Prediction of recurrent events in breast cancer using the naive Bayesian classification","volume":"36","author":"Dumitru","year":"2009","journal-title":"Ann. Univ. Craiova Math. Comput. Sci. Ser"},{"key":"2023051608274793500_btab236-B17","doi-asserted-by":"crossref","first-page":"18925","DOI":"10.3390\/ijms140918925","article-title":"The clinical utilization of circulating cell free DNA (ccfDNA) in blood of cancer patients","volume":"14","author":"Elshimali","year":"2013","journal-title":"Int. J. Mol. Sci"},{"key":"2023051608274793500_btab236-B18","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1080\/03052150500384759","article-title":"Shuffled frog-leaping algorithm: a memetic meta-heuristic for discrete optimization","volume":"38","author":"Eusuff","year":"2006","journal-title":"Eng. Optim"},{"key":"2023051608274793500_btab236-B19","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1061\/(ASCE)0733-9496(2003)129:3(210)","article-title":"Optimization of water distribution network design using the shuffled frog leaping algorithm","volume":"129","author":"Eusuff","year":"2003","journal-title":"J. Water Resources Plan. Manag"},{"first-page":"3","year":"2018","author":"Fong","key":"2023051608274793500_btab236-B20"},{"key":"2023051608274793500_btab236-B21","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","article-title":"Stochastic gradient boosting","volume":"38","author":"Friedman","year":"2002","journal-title":"Comput. Stat. Data Anal"},{"key":"2023051608274793500_btab236-B22","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","article-title":"Support vector machine classification and validation of cancer tissue samples using microarray expression data","volume":"16","author":"Furey","year":"2000","journal-title":"Bioinformatics"},{"key":"2023051608274793500_btab236-B23","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1038\/nrg3729","article-title":"Mechanisms underlying mutational signatures in human cancers","volume":"15","author":"Helleday","year":"2014","journal-title":"Nat. Rev. Genet"},{"key":"2023051608274793500_btab236-B24","doi-asserted-by":"crossref","first-page":"675","DOI":"10.1198\/106186005X59630","article-title":"The design and analysis of benchmark experiments","volume":"14","author":"Hothorn","year":"2005","journal-title":"J. Comput. Graph. Stat"},{"key":"2023051608274793500_btab236-B25","first-page":"41","article-title":"Applications of support vector machine (SVM) learning in cancer genomics","volume":"15","author":"Huang","year":"2018","journal-title":"Cancer Genomics Proteomics"},{"key":"2023051608274793500_btab236-B26","first-page":"54","volume-title":"Workshops on Applications of Evolutionary Computation","author":"Jong","year":"2003"},{"first-page":"606","volume-title":"IEEE journal of selected topics in signal processing","year":"2007","key":"2023051608274793500_btab236-B54"},{"first-page":"228","year":"2008","author":"Luo","key":"2023051608274793500_btab236-B27"},{"key":"2023051608274793500_btab236-B28","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00993877","article-title":"Using neural networks to diagnose cancer","volume":"15","author":"Maclin","year":"1991","journal-title":"J. Med. Syst"},{"key":"2023051608274793500_btab236-B29","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1214\/aoms\/1177730491","article-title":"On a test of whether one of two random variables is stochastically larger than the other","volume":"18","author":"Mann","year":"1947","journal-title":"Ann. Math. Stat"},{"year":"2017","key":"2023051608274793500_btab236-B30"},{"key":"2023051608274793500_btab236-B31","doi-asserted-by":"crossref","first-page":"939","DOI":"10.1080\/14737140.2017.1359544","article-title":"Recent advances in circulating tumor cells and cell-free DNA in metastatic prostate cancer: a review","volume":"17","author":"Parimi","year":"2017","journal-title":"Exp. Rev. Anticancer Ther"},{"key":"2023051608274793500_btab236-B32","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.compmedimag.2016.12.002","article-title":"Feature selection for outcome prediction in oesophageal cancer using genetic algorithm and random forest classifier","volume":"60","author":"Paul","year":"2017","journal-title":"Comput. Med. Imaging Graph"},{"key":"2023051608274793500_btab236-B33","doi-asserted-by":"crossref","first-page":"694","DOI":"10.1016\/j.dsp.2006.10.008","article-title":"Breast cancer diagnosis using least square support vector machine","volume":"17","author":"Polat","year":"2007","journal-title":"Digital Signal Process"},{"key":"2023051608274793500_btab236-B34","doi-asserted-by":"crossref","first-page":"88","DOI":"10.18201\/ijisae.2019252786","article-title":"Performance analysis of Ann and naive Bayes classification algorithm for data classification","volume":"7","author":"Saritas","year":"2019","journal-title":"Int. J. Intell. Syst. Appl. Eng"},{"key":"2023051608274793500_btab236-B35","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1101\/gr.123497.111","article-title":"Genomic analysis of circulating cell-free DNA infers breast cancer dormancy","volume":"22","author":"Shaw","year":"2012","journal-title":"Genome Res"},{"key":"2023051608274793500_btab236-B36","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/0021-9681(85)90090-6","article-title":"Treatment selection for cancer patients: application of statistical decision theory to the treatment of advanced ovarian cancer","volume":"38","author":"Simes","year":"1985","journal-title":"J. Chronic Dis"},{"key":"2023051608274793500_btab236-B37","doi-asserted-by":"crossref","first-page":"1196","DOI":"10.1109\/TMI.2016.2525803","article-title":"Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images","volume":"35","author":"Sirinukunwattana","year":"2016","journal-title":"IEEE Trans. Med. Imaging"},{"key":"2023051608274793500_btab236-B38","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.cell.2015.11.050","article-title":"Cell-free DNA comprises an in vivo nucleosome footprint that informs its tissues-of-origin","volume":"164","author":"Snyder","year":"2016","journal-title":"Cell"},{"key":"2023051608274793500_btab236-B39","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1186\/1471-2105-9-319","article-title":"A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification","volume":"9","author":"Statnikov","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023051608274793500_btab236-B40","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1002\/path.5048","article-title":"The value of cell-free DNA for molecular pathology","volume":"244","author":"Stewart","year":"2018","journal-title":"J. Pathol"},{"key":"2023051608274793500_btab236-B41","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1159\/000226740","article-title":"Neoplastic characteristics of the DNA found in the plasma of cancer patients","volume":"46","author":"Stroun","year":"1989","journal-title":"Oncology"},{"key":"2023051608274793500_btab236-B42","doi-asserted-by":"crossref","first-page":"2789","DOI":"10.1162\/NECO_a_00336","article-title":"An extension of the receiver operating characteristic curve and AUC-optimal classification","volume":"24","author":"Takenouchi","year":"2012","journal-title":"Neural Comput"},{"key":"2023051608274793500_btab236-B43","doi-asserted-by":"crossref","first-page":"e1006162","DOI":"10.1371\/journal.pgen.1006162","article-title":"Fragment length of circulating tumor DNA","volume":"12","author":"Underhill","year":"2016","journal-title":"PLoS Genet"},{"key":"2023051608274793500_btab236-B44","doi-asserted-by":"crossref","first-page":"898","DOI":"10.1158\/1541-7786.MCR-16-0044","article-title":"Cell-free DNA (cfDNA): clinical significance and utility in cancer shaped by emerging technologies","volume":"14","author":"Volik","year":"2016","journal-title":"Mol. Cancer Res"},{"key":"2023051608274793500_btab236-B45","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1186\/1471-2458-9-223","article-title":"Rationales, design and recruitment of the Taizhou longitudinal study","volume":"9","author":"Wang","year":"2009","journal-title":"BMC Public Health"},{"key":"2023051608274793500_btab236-B46","doi-asserted-by":"crossref","first-page":"80","DOI":"10.2307\/3001968","article-title":"Individual comparisons by ranking methods","volume":"1","author":"Wilcoxon","year":"1945","journal-title":"Biometrics"},{"volume-title":"World Cancer Report: Cancer Research for Cancer Prevention","year":"2020","author":"Wild","key":"2023051608274793500_btab236-B47"},{"volume-title":"Gaussian Processes for Machine Learning","year":"2006","author":"Williams","key":"2023051608274793500_btab236-B48"},{"key":"2023051608274793500_btab236-B49","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1016\/j.neucom.2020.07.061","article-title":"On hyperparameter optimization of machine learning algorithms: theory and practice","volume":"415","author":"Yang","year":"2020","journal-title":"Neurocomputing"},{"year":"2004","author":"Zhang","key":"2023051608274793500_btab236-B50"},{"first-page":"6197","year":"2008","author":"Zhang","key":"2023051608274793500_btab236-B51"},{"key":"2023051608274793500_btab236-B52","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1373\/clinchem.2011.169318","article-title":"Nonhematopoietically derived DNA is shorter than hematopoietically derived DNA in plasma: a transplantation model","volume":"58","author":"Zheng","year":"2012","journal-title":"Clin. Chem"},{"first-page":"301","volume-title":"Journal of the royal statistical society: series B (statistical methodology)","year":"2005","key":"2023051608274793500_btab236-B55"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab236\/37767400\/btab236.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3099\/50338463\/btab236.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3099\/50338463\/btab236.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T08:43:27Z","timestamp":1684226607000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/19\/3099\/6218770"}},"subtitle":[],"editor":[{"given":"Jinbo","family":"Xu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,4,9]]},"references-count":54,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2021,10,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab236","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2021,10,1]]},"published":{"date-parts":[[2021,4,9]]}}}