{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T12:35:20Z","timestamp":1763037320561,"version":"3.37.3"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2019,7,11]],"date-time":"2019-07-11T00:00:00Z","timestamp":1562803200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61702361","61701340"],"award-info":[{"award-number":["61702361","61701340"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006606","name":"Natural Science Foundation of Tianjin","doi-asserted-by":"publisher","award":["18JCQNJC00800","18JCQNJC00500"],"award-info":[{"award-number":["18JCQNJC00800","18JCQNJC00500"]}],"id":[{"id":"10.13039\/501100006606","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Non-small-cell lung carcinoma (NSCLC) mainly consists of two subtypes: lung squamous cell carcinoma (LUSC) and lung adenocarcinoma (LUAD). It has been reported that the genetic and epigenetic profiles vary strikingly between LUAD and LUSC in the process of tumorigenesis and development. Efficient and precise treatment can be made if subtypes can be identified correctly. Identification of discriminative expression signatures has been explored recently to aid the classification of NSCLC subtypes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we designed a classification model integrating both mRNA and long non-coding RNA (lncRNA) expression data to effectively classify the subtypes of NSCLC. A gene selection algorithm, named WGRFE, was proposed to identify the most discriminative gene signatures within the recursive feature elimination (RFE) framework. GeneRank scores considering both expression level and correlation, together with the importance generated by classifiers were all taken into account to improve the selection performance. Moreover, a module-based initial filtering of the genes was performed to reduce the computation cost of RFE. We validated the proposed algorithm on The Cancer Genome Atlas (TCGA) dataset. The results demonstrate that the developed approach identified a small number of expression signatures for accurate subtype classification and particularly, we here for the first time show the potential role of LncRNA in building computational NSCLC subtype classification models.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The R implementation for the proposed approach is available at https:\/\/github.com\/RanSuLab\/NSCLC-subtype-classification.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz557","type":"journal-article","created":{"date-parts":[[2019,7,9]],"date-time":"2019-07-09T11:11:18Z","timestamp":1562670678000},"page":"339-346","source":"Crossref","is-referenced-by-count":34,"title":["Identification of expression signatures for non-small-cell lung carcinoma subtype classification"],"prefix":"10.1093","volume":"36","author":[{"given":"Ran","family":"Su","sequence":"first","affiliation":[{"name":"School of Computer Software, College of Intelligence and Computing , Tianjin University"}]},{"given":"Jiahang","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer Software, College of Intelligence and Computing , Tianjin University"}]},{"given":"Xiaofeng","family":"Liu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Breast Cancer Prevention and Therapy , Ministry of Education, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center of Cancer, Key Laboratory of Cancer Prevention and Therapy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1444-190X","authenticated-orcid":false,"given":"Leyi","family":"Wei","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology , College of Intelligence and Computing, Tianjin University, Tianjin, China"}]}],"member":"286","published-online":{"date-parts":[[2019,7,11]]},"reference":[{"key":"2023013112065839400_btz557-B1","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. the gene ontology consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023013112065839400_btz557-B2","doi-asserted-by":"crossref","first-page":"969","DOI":"10.3390\/cancers4040969","article-title":"Estrogen signaling in lung cancer: an opportunity for novel therapy","volume":"4","author":"Baik","year":"2012","journal-title":"Cancers"},{"key":"2023013112065839400_btz557-B3","doi-asserted-by":"crossref","first-page":"2242","DOI":"10.1126\/science.1103388","article-title":"Global identification of human transcribed sequences with genome tiling arrays","volume":"306","author":"Bertone","year":"2004","journal-title":"Science"},{"key":"2023013112065839400_btz557-B4","doi-asserted-by":"crossref","first-page":"394","DOI":"10.3322\/caac.21492","article-title":"Global cancer statistics 2018: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries","volume":"68","author":"Bray","year":"2018","journal-title":"CA Cancer J. Clin"},{"key":"2023013112065839400_btz557-B5","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"first-page":"107","year":"1998","author":"Brin","key":"2023013112065839400_btz557-B6"},{"key":"2023013112065839400_btz557-B7","doi-asserted-by":"crossref","first-page":"1732.","DOI":"10.3390\/molecules22101732","article-title":"Prolango: protein function prediction using neural machine translation based on a recurrent neural network","volume":"22","author":"Cao","year":"2017","journal-title":"Molecules"},{"key":"2023013112065839400_btz557-B8","doi-asserted-by":"crossref","first-page":"1529","DOI":"10.1126\/science.1116800","article-title":"Fewer genes, more noncoding RNA","volume":"309","author":"Claverie","year":"2005","journal-title":"Science"},{"key":"2023013112065839400_btz557-B9","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn"},{"key":"2023013112065839400_btz557-B10","doi-asserted-by":"crossref","first-page":"54082","DOI":"10.18632\/oncotarget.10213","article-title":"Cancer subtypes classification using long non-coding RNA","volume":"7","author":"Flippot","year":"2016","journal-title":"Oncotarget"},{"key":"2023013112065839400_btz557-B11","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1111\/j.1445-5994.2009.02141.x","article-title":"The current status of targeted therapy for non-small cell lung cancer","volume":"40","author":"Francis","year":"2010","journal-title":"Intern. Med. J"},{"key":"2023013112065839400_btz557-B12","doi-asserted-by":"crossref","first-page":"4880","DOI":"10.1158\/1078-0432.CCR-15-2900","article-title":"An expression signature as an aid to the histologic classification of non-small cell lung cancer","volume":"22","author":"Girard","year":"2016","journal-title":"Clinical Cancer Res"},{"key":"2023013112065839400_btz557-B13","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1002\/gcc.22214","article-title":"Cytogenetic alterations and their molecular genetic correlates in head and neck squamous cell carcinoma: a next generation window to the biology of disease","volume":"53","author":"Gollin","year":"2014","journal-title":"Genes Chromosomes Cancer"},{"key":"2023013112065839400_btz557-B14","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using support vector machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn"},{"key":"2023013112065839400_btz557-B15","doi-asserted-by":"crossref","first-page":"6890","DOI":"10.1158\/0008-5472.CAN-14-0686","article-title":"Long noncoding RNA gaplinc regulates cd44-dependent cell invasiveness and associates with poor prognosis of gastric cancer","volume":"74","author":"Hu","year":"2014","journal-title":"Cancer Res"},{"key":"2023013112065839400_btz557-B16","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1038\/nm.3981","article-title":"The emerging role of lncRNAs in cancer","volume":"21","author":"Huarte","year":"2015","journal-title":"Nat. Med"},{"key":"2023013112065839400_btz557-B17","doi-asserted-by":"crossref","first-page":"2136","DOI":"10.1093\/bioinformatics\/btq345","article-title":"Integration of pathway knowledge into a reweighted recursive feature elimination approach for risk stratification of cancer patients","volume":"26","author":"Johannes","year":"2010","journal-title":"Bioinformatics"},{"key":"2023013112065839400_btz557-B18","doi-asserted-by":"crossref","first-page":"204","DOI":"10.3389\/fonc.2014.00204","article-title":"Biomarkers that currently affect clinical practice in lung cancer: egfr, alk, met, ros-1, and kras","volume":"4","author":"Korpanty","year":"2014","journal-title":"Front. Oncol"},{"key":"2023013112065839400_btz557-B19","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1186\/1471-2105-9-559","article-title":"WGCNA: an R package for weighted correlation network analysis","volume":"9","author":"Langfelder","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023013112065839400_btz557-B20","doi-asserted-by":"crossref","first-page":"e1000459.","DOI":"10.1371\/journal.pgen.1000459","article-title":"The genetic signatures of noncoding RNAs","volume":"5","author":"Mattick","year":"2009","journal-title":"PLOS Genet"},{"key":"2023013112065839400_btz557-B21","doi-asserted-by":"crossref","first-page":"584","DOI":"10.1016\/S0025-6196(11)60735-0","article-title":"Non-small cell lung cancer: epidemiology, risk factors, treatment, and survivorship","volume":"83","author":"Molina","year":"2008","journal-title":"Mayo Clin. Proc"},{"key":"2023013112065839400_btz557-B22","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1186\/1471-2105-6-233","article-title":"Generank: using search engine technology for the analysis of microarray experiments","volume":"6","author":"Morrison","year":"2005","journal-title":"BMC Bioinformatics"},{"first-page":"144","year":"2008","author":"Mundra","key":"2023013112065839400_btz557-B23"},{"key":"2023013112065839400_btz557-B24","doi-asserted-by":"crossref","first-page":"7587","DOI":"10.1007\/s13277-014-2009-7","article-title":"Long non-coding RNA MVIH indicates a poor prognosis for non-small cell lung cancer and promotes cell proliferation and invasion","volume":"35","author":"Nie","year":"2014","journal-title":"Tumor Biol"},{"key":"2023013112065839400_btz557-B25","doi-asserted-by":"crossref","first-page":"179","DOI":"10.4046\/trd.2017.80.2.179","article-title":"Keratinization of lung squamous cell carcinoma is associated with poor clinical outcome","volume":"80","author":"Park","year":"2017","journal-title":"Tuberc. Respir. Dis"},{"first-page":"455","year":"2000","author":"Raychaudhuri","key":"2023013112065839400_btz557-B26"},{"key":"2023013112065839400_btz557-B27","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.neucom.2015.02.045","article-title":"Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context","volume":"161","author":"Reyes","year":"2015","journal-title":"Neurocomputing"},{"key":"2023013112065839400_btz557-B28","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edger: a bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023013112065839400_btz557-B29","doi-asserted-by":"crossref","first-page":"3529","DOI":"10.1158\/1078-0432.CCR-14-2464","article-title":"Glycodelin: a new biomarker with immunomodulatory functions in nonsmall cell lung cancer","volume":"21","author":"Schneider","year":"2015","journal-title":"Clin. Cancer Res"},{"key":"2023013112065839400_btz557-B30","article-title":"Developing a multi-dose computational model for drug-induced hepatotoxicity prediction based on toxicogenomics data","author":"Su","year":"2018","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinf"},{"key":"2023013112065839400_btz557-B31","article-title":"Meta-GDBP: a high-level stacked regression model to improve anti-cancer drug response prediction","author":"Su","year":"2019","journal-title":"Brief. Bioinf"},{"key":"2023013112065839400_btz557-B32","article-title":"MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy-defined energy","author":"Su","year":"2019","journal-title":"Brief. Bioinf"},{"key":"2023013112065839400_btz557-B33","doi-asserted-by":"crossref","first-page":"760","DOI":"10.1186\/s12885-016-2792-1","article-title":"Novel biomarkers that assist in accurate discrimination of squamous cell carcinoma from adenocarcinoma of the lung","volume":"16","author":"Takamochi","year":"2016","journal-title":"BMC Cancer"},{"key":"2023013112065839400_btz557-B34","doi-asserted-by":"crossref","first-page":"1750050.","DOI":"10.1142\/S1793524517500504","article-title":"A two-step discriminated method to identify thermophilic proteins","volume":"10","author":"Tang","year":"2017","journal-title":"Int. J. Biomath"},{"key":"2023013112065839400_btz557-B35","doi-asserted-by":"crossref","first-page":"2013","DOI":"10.1162\/089976600300015042","article-title":"Bounds on error expectation for support vector machines","volume":"12","author":"Vapnik","year":"2000","journal-title":"Neural Comput"},{"key":"2023013112065839400_btz557-B36","doi-asserted-by":"crossref","first-page":"CIN.S3794","DOI":"10.4137\/CIN.S3794","article-title":"A robust gene selection method for microarray-based cancer classification","volume":"9","author":"Wang","year":"2010","journal-title":"Cancer Inf"},{"key":"2023013112065839400_btz557-B37","doi-asserted-by":"crossref","first-page":"e1002511.","DOI":"10.1371\/journal.pcbi.1002511","article-title":"Google goes cancer: improving outcome prediction for cancer patients by network-based ranking of marker genes","volume":"8","author":"Winter","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023013112065839400_btz557-B38","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1089\/omi.2011.0118","article-title":"clusterProfiler: an r package for comparing biological themes among gene clusters","volume":"16","author":"Yu","year":"2012","journal-title":"OMICS J. Integr. Biol"},{"key":"2023013112065839400_btz557-B39","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1093\/bioinformatics\/btu684","article-title":"Dose: an r\/bioconductor package for disease ontology semantic and enrichment analysis","volume":"31","author":"Yu","year":"2015","journal-title":"Bioinformatics"},{"key":"2023013112065839400_btz557-B40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2202\/1544-6115.1128","article-title":"A general framework for weighted gene co-expression network analysis","volume":"4","author":"Zhang","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023013112065839400_btz557-B41","doi-asserted-by":"crossref","first-page":"1641","DOI":"10.1007\/s00432-012-1240-0","article-title":"miRNAs expression profiling to distinguish lung squamous-cell carcinoma from adenocarcinoma subtypes","volume":"138","author":"Zhang","year":"2012","journal-title":"J. Cancer Res. Clin. Oncol"},{"key":"2023013112065839400_btz557-B42","doi-asserted-by":"crossref","first-page":"144","DOI":"10.3389\/fnins.2019.00144","article-title":"Supervised brain tumor segmentation based on gradient and context-sensitive features","volume":"13","author":"Zhao","year":"2019","journal-title":"Front. Neurosci"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz557\/29097455\/btz557.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/339\/48991040\/btz557.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/339\/48991040\/btz557.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:24:37Z","timestamp":1675200277000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/2\/339\/5531132"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,7,11]]},"references-count":42,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz557","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,1,15]]},"published":{"date-parts":[[2019,7,11]]}}}