{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T12:02:11Z","timestamp":1771416131017,"version":"3.50.1"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T00:00:00Z","timestamp":1771372800000},"content-version":"vor","delay-in-days":109,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>High-throughput omics data present challenges for binary classification due to platform variability, batch effects, missing values, and high dimensionality. This study presents a novel Rank-Based Learning (RBL) method that leverages relative feature rankings to improve robustness and generalizability. We evaluated RBL against established methods like Logistic Regression (LR) and Random Forest (RF) using simulated data and two real-world plasma proteomics datasets: early-stage small cell lung cancer (SCLC) and duodenopancreatic neuroendocrine tumors (dpNET) in patients with Multiple Endocrine Neoplasia type 1 (MEN1). In simulation experiments, RBL outperformed LR under conditions involving batch effects, missing data, and varying numbers of true differential features. In SCLC, RBL yielded a test AUC of 0.76 (95% CI: 0.42\u20131.00), surpassing LR with Lasso (0.65 [95% CI: 0.47\u20130.84]) and RF with feature importance (0.59 [95% CI: 0.33\u20130.87]). In dpNET, RBL achieved an AUC of 0.83 (95% CI: 0.67\u20130.97) on the development set and 0.80 (95% CI: 0.54\u20130.98) on the test set, outperforming LR with Lasso (0.57 [95% CI: 0.40\u20130.77]) and RF with feature importance (0.53 [95% CI: 0.29\u20130.77]). By emphasizing feature ranking rather than absolute expression levels, RBL effectively mitigates the impact of non-biological variation. Overall, RBL improves the predictive accuracy of diagnostic models for complex diseases and provides a promising framework for developing more reliable and generalizable diagnostic tools from omics data, moving them closer to clinical application.<\/jats:p>","DOI":"10.1093\/bib\/bbaf666","type":"journal-article","created":{"date-parts":[[2025,11,26]],"date-time":"2025-11-26T13:12:10Z","timestamp":1764162730000},"source":"Crossref","is-referenced-by-count":0,"title":["Rank-based learning: a novel high-throughput algorithm resilient to missing data and effective for datasets with small sample size"],"prefix":"10.1093","volume":"26","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-9210-9527","authenticated-orcid":false,"given":"Lulu","family":"Song","sequence":"first","affiliation":[{"name":"Department of Biostatistics, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hamid Khoshfekr","family":"Rudsari","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5088-0198","authenticated-orcid":false,"given":"Johannes F","family":"Fahrmann","sequence":"additional","affiliation":[{"name":"Department of Cancer Prevention, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jody","family":"Vykoukal","sequence":"additional","affiliation":[{"name":"Department of Cancer Prevention, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sam","family":"Hanash","sequence":"additional","affiliation":[{"name":"Department of Cancer Prevention, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James P","family":"Long","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kim-Anh","family":"Do","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7510-4849","authenticated-orcid":false,"given":"Ehsan","family":"Irajizad","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of Texas MD Anderson Cancer Center , Houston, TX ,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2025,12,12]]},"reference":[{"key":"2026021806063788100_ref1","doi-asserted-by":"publisher","first-page":"25","DOI":"10.3390\/proteomes12030025","article-title":"Transforming clinical research: The power of high-throughput omics integration","volume":"12","author":"Vitorino","year":"2024","journal-title":"Proteomes."},{"key":"2026021806063788100_ref2","doi-asserted-by":"publisher","first-page":"911861","DOI":"10.3389\/fmed.2022.911861","article-title":"Advances and trends in omics technology development","volume":"9","author":"Dai","year":"2022","journal-title":"Front Med"},{"key":"2026021806063788100_ref3","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1186\/s12711-016-0217-x","article-title":"Multi-omic data integration and analysis using systems genomics approaches: Methods and applications in animal production, health and welfare","volume":"48","author":"Suravajhala","year":"2016","journal-title":"Genet Sel Evol"},{"key":"2026021806063788100_ref4","doi-asserted-by":"publisher","first-page":"848","DOI":"10.3390\/biology13110848","article-title":"Integrating molecular perspectives: Strategies for comprehensive multi-omics integrative data analysis and machine learning applications in transcriptomics, proteomics, and metabolomics","volume":"13","author":"Sanches","year":"2024","journal-title":"Biology (Basel)"},{"key":"2026021806063788100_ref5","doi-asserted-by":"publisher","first-page":"3972","DOI":"10.3390\/cancers13163972","article-title":"Plasma-based protein signatures associated with small cell lung cancer","volume":"13","author":"Fahrmann","year":"2021","journal-title":"Cancers (Basel)"},{"key":"2026021806063788100_ref6","doi-asserted-by":"publisher","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van \u2019t Veer","year":"2002","journal-title":"Nature."},{"key":"2026021806063788100_ref7","doi-asserted-by":"publisher","first-page":"1209","DOI":"10.1186\/s12864-024-11112-5","article-title":"DeepMoIC: Multi-omics data integration via deep graph convolutional networks for cancer subtype classification","volume":"25","author":"Wu","year":"2024","journal-title":"BMC Genomics"},{"key":"2026021806063788100_ref8","doi-asserted-by":"publisher","first-page":"364","DOI":"10.1186\/s12859-024-05989-y","article-title":"MoAGL-SA: A multi-omics adaptive integration method with graph learning and self-attention for cancer subtype classification","volume":"25","author":"Cheng","year":"2024","journal-title":"BMC Bioinformatics"},{"key":"2026021806063788100_ref9","doi-asserted-by":"publisher","first-page":"bbad304","DOI":"10.1093\/bib\/bbad304","article-title":"A denoised multi-omics integration framework for cancer subtype classification and survival prediction","volume":"24","author":"Pang","year":"2023","journal-title":"Brief Bioinform"},{"key":"2026021806063788100_ref10","doi-asserted-by":"publisher","first-page":"105","DOI":"10.3389\/fgene.2011.00105","article-title":"When one and one gives more than two: Challenges and opportunities of integrative omics","volume":"2","author":"Choi","year":"2012","journal-title":"Front Genet"},{"key":"2026021806063788100_ref11","doi-asserted-by":"publisher","first-page":"I1","DOI":"10.1186\/1752-0509-8-S2-I1","article-title":"Data integration in the era of omics: Current and future challenges","volume":"8","author":"Gomez-Cabrero","year":"2014","journal-title":"BMC Syst Biol"},{"key":"2026021806063788100_ref12","doi-asserted-by":"publisher","first-page":"1551","DOI":"10.3390\/genes15121551","article-title":"From omics to multi-omics: A review of advantages and tradeoffs","volume":"15","author":"Hayes","year":"2024","journal-title":"Genes (Basel)"},{"key":"2026021806063788100_ref13","doi-asserted-by":"publisher","first-page":"1238","DOI":"10.3390\/biom12091238","article-title":"Three grand challenges in high throughput omics technologies","volume":"12","author":"Suravajhala","year":"2022","journal-title":"Biomolecules."},{"key":"2026021806063788100_ref14","doi-asserted-by":"publisher","first-page":"1293","DOI":"10.1016\/j.cell.2012.02.009","article-title":"Personal omics profiling reveals dynamic molecular and medical phenotypes","volume":"148","author":"Chen","year":"2012","journal-title":"Cell."},{"key":"2026021806063788100_ref15","doi-asserted-by":"publisher","first-page":"e1900335","DOI":"10.1002\/pmic.201900335","article-title":"Deep learning in proteomics","volume":"20","author":"Wen","year":"2020","journal-title":"Proteomics."},{"key":"2026021806063788100_ref16","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1186\/s13059-024-03401-9","article-title":"Assessing and mitigating batch effects in large-scale omics studies","volume":"25","author":"Yu","year":"2024","journal-title":"Genome Biol"},{"key":"2026021806063788100_ref17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12911-020-01225-8","article-title":"Deep learning-based feature-level integration of multi-omics data for breast cancer patients survival analysis","volume":"20","author":"Tong","year":"2020","journal-title":"BMC Med Inform Decis Mak"},{"key":"2026021806063788100_ref18","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","author":"Hastie","year":"2009","edition":"2nd ed."},{"key":"2026021806063788100_ref19","doi-asserted-by":"publisher","first-page":"847","DOI":"10.1007\/s12041-016-0705-3","article-title":"An efficient method to handle the \u2018large p, small n\u2019 problem for genomewide association studies using Haseman\u2013Elston regression","volume":"95","author":"Mei","year":"2016","journal-title":"J Genet"},{"key":"2026021806063788100_ref20","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1186\/s12918-016-0358-0","article-title":"High-dimensional omics data analysis using a variable screening protocol with prior knowledge integration (SKI)","volume":"10","author":"Liu","year":"2016","journal-title":"BMC Syst Biol"},{"key":"2026021806063788100_ref21","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1186\/s12859-024-05749-y","article-title":"Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning","volume":"25","author":"Huang","year":"2024","journal-title":"BMC Bioinformatics."},{"key":"2026021806063788100_ref22","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1109\/JBHI.2015.2489158","article-title":"Models for predicting stage in head and neck squamous cell carcinoma using proteomic and transcriptomic data","volume":"21","author":"Kaddi","year":"2015","journal-title":"IEEE J Biomed Health Inform"},{"key":"2026021806063788100_ref23","doi-asserted-by":"publisher","first-page":"e0184203","DOI":"10.1371\/journal.pone.0184203","article-title":"A novel feature ranking method for prediction of cancer stages using proteomics data","volume":"12","author":"Saghapour","year":"2017","journal-title":"PloS One"},{"key":"2026021806063788100_ref24","doi-asserted-by":"publisher","first-page":"100083","DOI":"10.1016\/j.mcpro.2021.100083","article-title":"Feature selection methods for protein biomarker discovery from proteomics or multiomics data","volume":"20","author":"Shi","year":"2021","journal-title":"Mol Cell Proteomics"},{"key":"2026021806063788100_ref25","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1186\/s12859-022-04678-y","article-title":"Comparison of five supervised feature selection algorithms leading to top features and gene signatures from multi-omics data in cancer","volume":"23","author":"Bhadra","year":"2022","journal-title":"BMC Bioinformatics."},{"key":"2026021806063788100_ref26","doi-asserted-by":"publisher","first-page":"e0189875","DOI":"10.1371\/journal.pone.0189875","article-title":"Accurate and fast feature selection workflow for high-dimensional omics data","volume":"12","author":"Perez-Riverol","year":"2017","journal-title":"PloS One"},{"key":"2026021806063788100_ref27","doi-asserted-by":"publisher","first-page":"412","DOI":"10.1186\/s12859-022-04962-x","article-title":"Benchmark study of feature selection strategies for multi-omics data","volume":"23","author":"Li","year":"2022","journal-title":"BMC Bioinformatics."},{"key":"2026021806063788100_ref28","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbae662","article-title":"scRGCL: A cell type annotation method for single-cell RNA-seq data using residual graph convolutional neural network with contrastive learning","volume":"26","author":"Yuan","year":"2024","journal-title":"Brief Bioinform"},{"key":"2026021806063788100_ref29","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbae526","article-title":"scMGATGRN: A multiview graph attention network\u2013based method for inferring gene regulatory networks from single-cell transcriptomic data","volume":"25","author":"Yuan","year":"2024","journal-title":"Brief Bioinform"},{"issue":"1","key":"2026021806063788100_ref30","doi-asserted-by":"publisher","first-page":"350","DOI":"10.1186\/s12864-025-11511-2","article-title":"scAMZI: Attention-based deep autoencoder with zero-inflated layer for clustering scRNA-seq data","volume":"26","author":"Yuan","year":"2025","journal-title":"BMC Genomics"},{"key":"2026021806063788100_ref31","doi-asserted-by":"publisher","first-page":"1277121","DOI":"10.3389\/fmicb.2023.1277121","article-title":"Oral_voting_transfer: classification of oral microorganisms\u2019 function proteins with voting transfer model","volume":"14","author":"Bao","year":"2024","journal-title":"Front Microbiol"},{"key":"2026021806063788100_ref32","doi-asserted-by":"publisher","first-page":"179","DOI":"10.2174\/0115748936330499240909082529","article-title":"CLPr_in_ML: Cleft lip and palate reconstructed features with machine learning","volume":"20","author":"Chen","year":"2025","journal-title":"Curr Bioinform"},{"key":"2026021806063788100_ref33","doi-asserted-by":"publisher","first-page":"3686","DOI":"10.1038\/s41388-018-0187-2","article-title":"Antitumor activity of CD56-chimeric antigen receptor T cells in neuroblastoma and SCLC models","volume":"37","author":"Crossland","year":"2018","journal-title":"Oncogene."},{"key":"2026021806063788100_ref34","doi-asserted-by":"publisher","first-page":"916","DOI":"10.1093\/glycob\/cwl022","article-title":"Genetic mechanisms for the synthesis of fucosyl GM1 in small cell lung cancer cell lines","volume":"16","author":"Tokuda","year":"2006","journal-title":"Glycobiology."},{"key":"2026021806063788100_ref35","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1158\/1541-7786.MCR-24-0546","article-title":"KSR2 promotes self-renewal and clonogenicity of small cell lung carcinoma","volume":"23","author":"Huisman","year":"2025","journal-title":"Mol Cancer Res"},{"key":"2026021806063788100_ref36","doi-asserted-by":"publisher","first-page":"1715","DOI":"10.3390\/pharmaceutics14081715","article-title":"Identification of transferrin receptor 1 (TfR1) overexpressed in lung cancer cells, and internalization of magnetic Au-CoFe2O4 core-shell nanoparticles functionalized with its ligand in a cellular model of small cell lung cancer (SCLC)","volume":"14","author":"Villalobos-Manzo","year":"2022","journal-title":"Pharmaceutics."},{"key":"2026021806063788100_ref37","doi-asserted-by":"publisher","first-page":"research0034","DOI":"10.1186\/gb-2002-3-7-research0034","article-title":"Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes","volume":"3","author":"Vandesompele","year":"2002","journal-title":"Genome Biol"},{"key":"2026021806063788100_ref38","doi-asserted-by":"publisher","first-page":"djw266","DOI":"10.1093\/jnci\/djw266","article-title":"Sequential validation of blood-based protein biomarker candidates for early-stage pancreatic cancer","volume":"109","author":"Capello","year":"2017","journal-title":"J Natl Cancer Inst"},{"key":"2026021806063788100_ref39","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1158\/1055-9965.EPI-19-0672","article-title":"Prognostic value of combined detection of serum IL6, YKL-40, and C-reactive protein in patients with unresectable pancreatic cancer","volume":"29","author":"Chen","year":"2020","journal-title":"Cancer Epidemiol Biomarkers Prev"},{"key":"2026021806063788100_ref40","doi-asserted-by":"publisher","first-page":"1116","DOI":"10.1021\/acs.jproteome.5b00981","article-title":"Accounting for the multiple natures of missing values in label-free quantitative proteom-ics data sets to compare imputation strategies","volume":"15","author":"Lazar","year":"2016","journal-title":"J Proteome Res"},{"key":"2026021806063788100_ref41","doi-asserted-by":"publisher","first-page":"1313","DOI":"10.3390\/biology12101313","article-title":"Deep learning methods for omics data imputation","volume":"12","author":"Huang","year":"2023","journal-title":"Biology."},{"key":"2026021806063788100_ref42","doi-asserted-by":"publisher","first-page":"663","DOI":"10.1038\/s41598-017-19120-0","article-title":"Missing value imputation approach for mass spectrometry-based metabolomics data","volume":"8","author":"Wei","year":"2018","journal-title":"Sci Rep"},{"key":"2026021806063788100_ref43","doi-asserted-by":"publisher","first-page":"1087","DOI":"10.1063\/1.1699114","article-title":"Equation of state calculations by fast computing machines","volume":"21","author":"Metropolis","year":"1953","journal-title":"J Chem Phys"},{"key":"2026021806063788100_ref44","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1016\/j.febslet.2004.07.055","article-title":"Rank products: A simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments","volume":"573","author":"Breitling","year":"2004","journal-title":"FEBS Lett"},{"key":"2026021806063788100_ref45","doi-asserted-by":"publisher","first-page":"3260","DOI":"10.1210\/clinem\/dgad315","article-title":"Blood-based proteomic signatures associated with MEN1-related duodenopancreatic neuroendocrine tumor progression","volume":"108","author":"Fahrmann","year":"2023","journal-title":"J Clin Endocrinol Metab"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/6\/bbaf666\/65855723\/bbaf666.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/6\/bbaf666\/65855723\/bbaf666.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T11:06:45Z","timestamp":1771412805000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf666\/8378203"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,1]]},"references-count":45,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf666","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,11]]},"published":{"date-parts":[[2025,11,1]]},"article-number":"bbaf666"}}