{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T12:46:12Z","timestamp":1767962772925,"version":"3.49.0"},"reference-count":81,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,2,24]],"date-time":"2016-02-24T00:00:00Z","timestamp":1456272000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,2,24]],"date-time":"2016-02-24T00:00:00Z","timestamp":1456272000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"the RGC of Hong Kong SAR"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Predicting protein subcellular localization is indispensable for inferring protein functions. Recent studies have been focusing on predicting not only single-location proteins, but also multi-location proteins. Almost all of the high performing predictors proposed recently use gene ontology (GO) terms to construct feature vectors for classification. Despite their high performance, their prediction decisions are difficult to interpret because of the large number of GO terms involved.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>This paper proposes using sparse regressions to exploit GO information for both predicting and interpreting subcellular localization of single- and multi-location proteins. Specifically, we compared two multi-label sparse regression algorithms, namely multi-label LASSO (mLASSO) and multi-label elastic net (mEN), for large-scale predictions of protein subcellular localization. Both algorithms can yield sparse and interpretable solutions. By using the one-vs-rest strategy, mLASSO and mEN identified 87 and 429 out of more than 8,000 GO terms, respectively, which play essential roles in determining subcellular localization. More interestingly, many of the GO terms selected by mEN are from the biological process and molecular function categories, suggesting that the GO terms of these categories also play vital roles in the prediction. With these essential GO terms, not only where a protein locates can be decided, but also why it resides there can be revealed.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Experimental results show that the output of both mEN and mLASSO are interpretable and they perform significantly better than existing state-of-the-art predictors. Moreover, mEN selects more features and performs better than mLASSO on a stringent human benchmark dataset. For readers\u2019 convenience, an online server called SpaPredictor for both mLASSO and mEN is available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/bioinfo.eie.polyu.edu.hk\/SpaPredictorServer\/\">http:\/\/bioinfo.eie.polyu.edu.hk\/SpaPredictorServer\/<\/jats:ext-link>.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-0940-x","type":"journal-article","created":{"date-parts":[[2016,2,24]],"date-time":"2016-02-24T05:13:50Z","timestamp":1456290830000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins"],"prefix":"10.1186","volume":"17","author":[{"given":"Shibiao","family":"Wan","sequence":"first","affiliation":[]},{"given":"Man-Wai","family":"Mak","sequence":"additional","affiliation":[]},{"given":"Sun-Yuan","family":"Kung","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,2,24]]},"reference":[{"issue":"Pt 20","key":"940_CR1","doi-asserted-by":"publisher","first-page":"3381","DOI":"10.1242\/jcs.089110","volume":"124","author":"MC Hung","year":"2011","unstructured":"Hung MC, Link W. Protein localization in disease and therapy. J Cell Sci. 2011; 124(Pt 20):3381\u201392.","journal-title":"J Cell Sci"},{"issue":"10","key":"940_CR2","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1007\/s00418-003-0535-2","volume":"119","author":"A Hayama","year":"2003","unstructured":"Hayama A, Rai T, Sasaki S, Uchida S. Molecular mechanisms of Bartter syndrome caused by mutations in the BSND gene. Histochem Cell Biol. 2003; 119(10):485\u201393.","journal-title":"Histochem Cell Biol"},{"key":"940_CR3","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1002\/ijc.2910560116","volume":"56","author":"V Krutovskikh","year":"1994","unstructured":"Krutovskikh V, Mazzoleni G, Mironov N, Omori Y, Aguelon AM, Mesnil M, Berger F, Partensky C, Yamasaki H. Altered homologous and heterologous gap-junctional intercellular communication in primary human liver tumors associated with aberrant protein localization but not gene mutation of connexin 32. Int J Cancer. 1994; 56:87\u201394.","journal-title":"Int J Cancer"},{"key":"940_CR4","doi-asserted-by":"publisher","first-page":"37507","DOI":"10.1074\/jbc.274.53.37507","volume":"274","author":"MD Kaytor","year":"1999","unstructured":"Kaytor MD, Warren ST. Aberrant Protein Deposition and Neurological Disease. J Biol Chem. 1999; 274:37507\u201310.","journal-title":"J Biol Chem"},{"key":"940_CR5","doi-asserted-by":"publisher","first-page":"789","DOI":"10.1126\/science.270.5237.789","volume":"270","author":"Y Chen","year":"1995","unstructured":"Chen Y, Chen CF, Riley DJ, Allred DC, Chen PL, Hoff DV, Osborne CK, Lee WH. Aberrant Subcellular Localization of BRCA1 in Breast Cancer. Science. 1995; 270:789\u201391.","journal-title":"Science"},{"key":"940_CR6","doi-asserted-by":"publisher","first-page":"808","DOI":"10.1053\/plac.2001.0722","volume":"22","author":"X Lee","year":"2001","unstructured":"Lee X, Keith JCJ, Stumm N, Moutsatsos I, McCoy JM, Crum CP, Genest D, Chin D, Ehrenfels C, Pijnenborg R, Assche FAV, Mi S. Downregulation of placental syncytin expression and abnormal protein localization in pre-eclampsia. Placenta. 2001; 22:808\u201312.","journal-title":"Placenta"},{"issue":"10","key":"940_CR7","doi-asserted-by":"publisher","first-page":"905","DOI":"10.1017\/S0022215100106784","volume":"102","author":"JB Campbell","year":"1988","unstructured":"Campbell JB, Crocker J, Shenoi PM. S-100 protein localization in minor salivary gland tumours: an aid to diagnosis. J Laryngol Otol. 1988; 102(10):905\u20138.","journal-title":"J Laryngol Otol"},{"key":"940_CR8","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.pneurobio.2005.10.001","volume":"77","author":"G Lubec","year":"2005","unstructured":"Lubec G, Afjehi-Sadat L, Yang JW, John JP. Searching for hypothetical proteins: Theory and practice based upon original data and literature. Prog. Neurobiol. 2005; 77:90\u2013127.","journal-title":"Prog. Neurobiol"},{"issue":"2","key":"940_CR9","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1002\/prot.340110203","volume":"11","author":"K Nakai","year":"1991","unstructured":"Nakai K, Kanehisa M. Expert system for predicting protein localization sites in gram-negative bacteria. Proteins Struct Funct Genet. 1991; 11(2):95\u2013110.","journal-title":"Proteins Struct Funct Genet"},{"issue":"4","key":"940_CR10","doi-asserted-by":"publisher","first-page":"1005","DOI":"10.1006\/jmbi.2000.3903","volume":"300","author":"O Emanuelsson","year":"2000","unstructured":"Emanuelsson O, Nielsen H, Brunak S, von Heijne G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol. 2000; 300(4):1005\u201316.","journal-title":"J Mol Biol"},{"key":"940_CR11","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1142\/S0129065797000537","volume":"8","author":"H Nielsen","year":"1997","unstructured":"Nielsen H, Engelbrecht J, Brunak S, von Heijne G. A neural network method for identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Int J Neural Sys. 1997; 8:581\u201399.","journal-title":"Int J Neural Sys"},{"key":"940_CR12","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1002\/prot.10251","volume":"50","author":"GP Zhou","year":"2003","unstructured":"Zhou GP, Doctor K. Subcellular location prediction of apoptosis proteins. Proteins Struct Funct Genet. 2003; 50:44\u20138.","journal-title":"Proteins Struct Funct Genet"},{"key":"940_CR13","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1016\/j.jtbi.2012.03.017","volume":"304","author":"GL Fan","year":"2012","unstructured":"Fan GL, Li QZ. Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou\u2019s pseudo amino acid composition. J Theor Biol. 2012; 304:88\u201395.","journal-title":"J Theor Biol"},{"key":"940_CR14","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1002\/prot.1035","volume":"43","author":"KC Chou","year":"2001","unstructured":"Chou KC. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins Struct Funct Genet. 2001; 43:246\u201355.","journal-title":"Proteins Struct Funct Genet"},{"key":"940_CR15","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1006\/jmbi.1994.1267","volume":"238","author":"H Nakashima","year":"1994","unstructured":"Nakashima H, Nishikawa K. Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies. J Mol Biol. 1994; 238:54\u201361.","journal-title":"J Mol Biol"},{"issue":"3","key":"940_CR16","doi-asserted-by":"publisher","first-page":"416","DOI":"10.1109\/TCBB.2007.70256","volume":"5","author":"MW Mak","year":"2008","unstructured":"Mak MW, Guo J, Kung SY. PairProSVM: Protein Subcellular Localization Based on Local Pairwise Profile Alignment and SVM. IEEE\/ACM Trans Comput Biol Bioinformatics. 2008; 5(3):416\u201322.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinformatics"},{"issue":"8","key":"940_CR17","doi-asserted-by":"publisher","first-page":"1168","DOI":"10.1101\/gr.96802","volume":"12","author":"R Mott","year":"2002","unstructured":"Mott R, Schultz J, Bork P, Ponting C. Predicting protein cellular localization using a domain projection method. Genome Res. 2002; 12(8):1168\u201374.","journal-title":"Genome Res"},{"key":"940_CR18","unstructured":"Wan S, Mak MW. Machine learning for protein subcellular localization prediction, De Gruyter. Germany; 2015, p. 192. ISBN 978-1-5015-0150-0."},{"issue":"4","key":"940_CR19","doi-asserted-by":"publisher","first-page":"634","DOI":"10.1039\/c3mb25466f","volume":"9","author":"WZ Lin","year":"2013","unstructured":"Lin WZ, Fang JA, Xiao X, Chou KC. iLoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins. Mol BioSyst. 2013; 9(4):634\u201344.","journal-title":"Mol BioSyst"},{"key":"940_CR20","volume-title":"2011 IEEE International Workshop on Machine Learning for Signal Processing (MLSP\u201911)","author":"S Wan","year":"2011","unstructured":"Wan S, Mak MW, Kung SY. Protein subcellular localization prediction based on profile alignment and Gene Ontology. In: 2011 IEEE International Workshop on Machine Learning for Signal Processing (MLSP\u201911). New Jersey, USA: IEEE: 2011. p. 1\u20136."},{"issue":"6","key":"940_CR21","doi-asserted-by":"publisher","first-page":"e37716","DOI":"10.1371\/journal.pone.0037716","volume":"7","author":"S Mei","year":"2012","unstructured":"Mei S. Multi-label multi-kernel transfer learning for human protein subcellular localization. PLoS ONE. 2012; 7(6):e37716.","journal-title":"PLoS ONE"},{"key":"940_CR22","volume-title":"2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201913)","author":"S Wan","year":"2013","unstructured":"Wan S, Mak MW, Kung SY. Adaptive thresholding for multi-label SVM classification with application to protein subcellular localization prediction. In: 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201913). New Jersey, USA: IEEE: 2013. p. 3547\u2013551."},{"key":"940_CR23","doi-asserted-by":"publisher","first-page":"1888","DOI":"10.1021\/pr060167c","volume":"5","author":"KC Chou","year":"2006","unstructured":"Chou KC, Shen HB. Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers. J. Proteome Res. 2006; 5:1888\u201397.","journal-title":"J. Proteome Res"},{"key":"940_CR24","doi-asserted-by":"publisher","first-page":"68","DOI":"10.4236\/eng.2013.510B014","volume":"5","author":"S Wan","year":"2013","unstructured":"Wan S, Mak MW, Kung SY. Semantic similarity over gene ontology for multi-label protein subcellular localization. Engineering. 2013; 5:68\u201372.","journal-title":"Engineering"},{"key":"940_CR25","doi-asserted-by":"publisher","first-page":"1236","DOI":"10.1016\/j.bbrc.2004.06.073","volume":"320","author":"KC Chou","year":"2004","unstructured":"Chou KC, Cai YD. Prediction of protein subcellular locations by GO-FunD-PseAA predictor. Biochem Biophys Res Commun. 2004; 320:1236\u20139.","journal-title":"Biochem Biophys Res Commun"},{"key":"940_CR26","volume-title":"2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201912)","author":"S Wan","year":"2012","unstructured":"Wan S, Mak MW, Kung SY. GOASVM: Protein subcellular localization prediction based on gene ontology annotation and SVM. In: 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201912). New Jersey, USA: IEEE: 2012. p. 2229\u201332."},{"key":"940_CR27","doi-asserted-by":"publisher","first-page":"629","DOI":"10.1039\/C1MB05420A","volume":"8","author":"KC Chou","year":"2012","unstructured":"Chou KC, Wu ZC, Xiao X. iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites. Mol BioSyst. 2012; 8:629\u201341.","journal-title":"Mol BioSyst"},{"key":"940_CR28","doi-asserted-by":"crossref","unstructured":"Wan S, Mak MW, Zhang B, Wang Y, Kung SY. Ensemble random projection for multi-label classification with application to protein subcellular localization. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201914). New Jersey, USA: 2014. p 5999\u20136003.","DOI":"10.1109\/ICASSP.2014.6854755"},{"key":"940_CR29","doi-asserted-by":"publisher","first-page":"2512","DOI":"10.1093\/bioinformatics\/btn463","volume":"24","author":"A Fyshe","year":"2008","unstructured":"Fyshe A, Liu Y, Szafron D, Greiner R, Lu P. Improving subcellular localization prediction using text classification and the gene ontology. Bioinformatics. 2008; 24:2512\u20137.","journal-title":"Bioinformatics"},{"key":"940_CR30","volume-title":"Pac. Symp. Biocomput","author":"S Brady","year":"2008","unstructured":"Brady S, Shatkay H. EpiLoc: a (working) text-based system for predicting protein subcellular location. In: Pac. Symp. Biocomput. California, USA: Stanford University: 2008. p. 604\u201315."},{"key":"940_CR31","doi-asserted-by":"publisher","first-page":"2836","DOI":"10.1110\/ps.0207402","volume":"11","author":"R Nair","year":"2002","unstructured":"Nair R, Rost B. Sequence conserved for subcellular localization. Protein Sci. 2002; 11:2836\u201347.","journal-title":"Protein Sci"},{"issue":"4","key":"940_CR32","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1093\/bioinformatics\/btg447","volume":"20","author":"Z Lu","year":"2004","unstructured":"Lu Z, Szafron D, Greiner R, Lu P, Wishart DS, Poulin B, Anvik J, Macdonell C, Eisner R. Predicting subcellular localization of proteins using machine-learned classifiers. Bioinformatics. 2004; 20(4):547\u201356.","journal-title":"Bioinformatics"},{"issue":"7","key":"940_CR33","doi-asserted-by":"publisher","first-page":"1028","DOI":"10.1093\/bioinformatics\/bts062","volume":"28","author":"SM Chi","year":"2012","unstructured":"Chi SM, Nam D. WegoLoc: accurate prediction of protein subcellular localization using weighted Gene Ontology terms. Bioinformatics. 2012; 28(7):1028\u20131030. http:\/\/bioinformatics.oxfordjournals.org\/content\/28\/7\/1028.short.","journal-title":"Bioinformatics"},{"key":"940_CR34","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1016\/j.jtbi.2013.01.012","volume":"323","author":"S Wan","year":"2013","unstructured":"Wan S, Mak MW, Kung SY. GOASVM: A subcellular location predictor by incorporating term-frequency gene ontology into the general form of Chou\u2019s pseudo-amino acid composition. J Theor Biol. 2013; 323:40\u20138.","journal-title":"J Theor Biol"},{"key":"940_CR35","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1186\/1471-2105-9-80","volume":"9","author":"WL Huang","year":"2008","unstructured":"Huang WL, Tung CW, Ho SW, Hwang SF, Ho SY. ProLoc-GO: Utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization. BMC Bioinformatics. 2008; 9:80.","journal-title":"BMC Bioinformatics"},{"key":"940_CR36","first-page":"686","volume":"7","author":"RF Murphy","year":"2010","unstructured":"Murphy RF. communicating subcellular distributions. 77. 2010; 7:686\u201392.","journal-title":"77"},{"issue":"6","key":"940_CR37","doi-asserted-by":"publisher","first-page":"1625","DOI":"10.1105\/tpc.109.066019","volume":"21","author":"AH Millar","year":"2009","unstructured":"Millar AH, Carrie C, Pogson B, Whelan J. Exploring the function-location nexus: using multiple lines of evidence in defining the subcellular location of plant proteins. Plant Cell. 2009; 21(6):1625\u201331.","journal-title":"Plant Cell"},{"key":"940_CR38","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1186\/1471-2105-9-127","volume":"9","author":"S Zhang","year":"2008","unstructured":"Zhang S, Xia XF, Shen JC, Zhou Y, Sun Z. DBMLoc: A database of proteins with multiple subcellular localizations. BMC Bioinformatics. 2008; 9:127.","journal-title":"BMC Bioinformatics"},{"key":"940_CR39","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1016\/j.cell.2006.03.022","volume":"125","author":"LJ Foster","year":"2006","unstructured":"Foster LJ, Hoog CLD, Zhang Y, Zhang Y, Xie X, Mootha VK, Mann M. A mammalian organelle map by protein correlation profiling. Cell. 2006; 125:187\u201399.","journal-title":"Cell"},{"key":"940_CR40","doi-asserted-by":"publisher","first-page":"1667","DOI":"10.2337\/diab.46.11.1667","volume":"46","author":"S Rea","year":"1997","unstructured":"Rea S, James D. Moving GLUT4: the biogenesis and trafficking of GLUT4 storage vesicles. Diabetes. 1997; 46:1667\u201377.","journal-title":"Diabetes"},{"key":"940_CR41","first-page":"H643","volume":"277","author":"R Russell","year":"1997","unstructured":"Russell R, Bergeron R, Shulman G, Young H. Translocation of myocardial GLUT-4 and increased glucose uptake through activation of AMPK by AICAR. Am J Physiol. 1997; 277:H643\u20139.","journal-title":"Am J Physiol"},{"key":"940_CR42","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1016\/j.mito.2004.02.002","volume":"3","author":"JC Mueller","year":"2004","unstructured":"Mueller JC, Andreoli C, Prokisch H, Meitinger T. Mechanisms for multiple intracellular localization of human mitochondrial proteins. Mitochondrion. 2004; 3:315\u201325.","journal-title":"Mitochondrion"},{"issue":"2","key":"940_CR43","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1016\/j.ab.2009.07.046","volume":"394","author":"HB Shen","year":"2009","unstructured":"Shen HB, Chou KC. A top-down approach to enhance the power of predicting human protein subcellular localization: Hum-mPLoc 2.0. Anal Biochem. 2009; 394(2):269\u201374.","journal-title":"Anal Biochem"},{"key":"940_CR44","doi-asserted-by":"publisher","first-page":"290","DOI":"10.1186\/1471-2105-13-290","volume":"13","author":"S Wan","year":"2012","unstructured":"Wan S, Mak MW, Kung SY. mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines. BMC Bioinformatics. 2012; 13:290.","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"940_CR45","doi-asserted-by":"publisher","first-page":"e89545","DOI":"10.1371\/journal.pone.0089545","volume":"9","author":"S Wan","year":"2014","unstructured":"Wan S, Mak MW, Kung SY. HybridGO-Loc: Mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins. PLoS ONE. 2014; 9(3):e89545.","journal-title":"PLoS ONE"},{"key":"940_CR46","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1016\/j.jtbi.2014.06.031","volume":"360","author":"S Wan","year":"2014","unstructured":"Wan S, Mak MW, Kung SY. R3P-Loc: A compact multi-label predictor using ridge regression and random projection for protein subcellular localization. J Theor Biol. 2014; 360:34\u201345.","journal-title":"J Theor Biol"},{"key":"940_CR47","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1016\/j.ab.2014.10.014","volume":"473","author":"S Wan","year":"2015","unstructured":"Wan S, Mak MW, Kung SY. mPLR-Loc: An adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction. Anal Biochem. 2015; 473:14\u201327.","journal-title":"Anal Biochem"},{"issue":"6","key":"940_CR48","doi-asserted-by":"publisher","first-page":"e37155","DOI":"10.1371\/journal.pone.0037155","volume":"7","author":"J He","year":"2011","unstructured":"He J, Gu H, Liu W. Imbalanced multi-modal multi-label learning for subcellular localization prediction of human proteins with both single and multiple sites. PLoS ONE. 2011; 7(6):e37155.","journal-title":"PLoS ONE"},{"key":"940_CR49","doi-asserted-by":"publisher","first-page":"e31057","DOI":"10.1371\/journal.pone.0031057","volume":"7","author":"LQ Li","year":"2012","unstructured":"Li LQ, Zhang Y, Zou LY, Li CQ, Yu B, Zheng XQ, Zhou Y. An ensemble classifier for eukaryotic protein subcellular location prediction using Gene Ontology categories and amino acid hydrophobicity. PLoS ONE. 2012; 7:e31057.","journal-title":"PLoS ONE"},{"key":"940_CR50","volume-title":"2013 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"S Wan","year":"2013","unstructured":"Wan S, Mak MW, Zhang B, Wang Y, Kung SY. An ensemble classifier with random projection for predicting multi-label protein subcellular localization. In: 2013 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). New Jersey, USA: IEEE: 2013. p. 35\u201342."},{"issue":"Suppl 2","key":"940_CR51","doi-asserted-by":"publisher","first-page":"W497","DOI":"10.1093\/nar\/gkq477","volume":"38","author":"S Briesemeister","year":"2010","unstructured":"Briesemeister S, Rahnenf\u00fchrer J, Kohlbacher O. YLoc\u2014an interpretable web server for predicting subcellular localization. Nucleic Acids Res. 2010; 38(Suppl 2):W497\u2013502.","journal-title":"Nucleic Acids Res"},{"key":"940_CR52","volume-title":"Proc. of Pac. Symp. Biocomput (PSB\u201905)","author":"Z Lu","year":"2005","unstructured":"Lu Z, Hunter L. GO molecular function terms are predictive of subcellular localization. In: Proc. of Pac. Symp. Biocomput (PSB\u201905). California, USA: Stanford University: 2005. p. 151\u201361."},{"key":"940_CR53","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997; 25:3389\u2013402.","journal-title":"Nucleic Acids Res"},{"key":"940_CR54","doi-asserted-by":"publisher","first-page":"5363","DOI":"10.1021\/pr900665y","volume":"8","author":"S Briesemeister","year":"2009","unstructured":"Briesemeister S, Blum T, Brady S, Lam Y, Kohlbacher O, Shatkay H. SherLoc2: A high-accuracy hybrid method for predicting subcellular localization of proteins. J Proteome Res. 2009; 8:5363\u20136.","journal-title":"J Proteome Res"},{"key":"940_CR55","doi-asserted-by":"publisher","first-page":"1092","DOI":"10.1039\/c3mb25555g","volume":"9","author":"KC Chou","year":"2013","unstructured":"Chou KC. Some remarks on predicting multi-label attributes in molecular biosystems. Mol BioSyst. 2013; 9:1092\u2013100.","journal-title":"Mol BioSyst"},{"issue":"5","key":"940_CR56","doi-asserted-by":"publisher","first-page":"e36317","DOI":"10.1371\/journal.pone.0036317","volume":"7","author":"X Wang","year":"2012","unstructured":"Wang X, Li GZ. A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins. PLoS ONE. 2012; 7(5):e36317.","journal-title":"PLoS ONE"},{"key":"940_CR57","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1038\/nprot.2007.494","volume":"3","author":"KC Chou","year":"2008","unstructured":"Chou KC, Shen HB. Cell-PLoc: A package of web-servers for predicting subcellular localization of proteins in various organisms. Nat Protoc. 2008; 3:153\u201362.","journal-title":"Nat Protoc"},{"key":"940_CR58","doi-asserted-by":"publisher","first-page":"3287","DOI":"10.1039\/c1mb05232b","volume":"7","author":"ZC Wu","year":"2011","unstructured":"Wu ZC, Xiao X, Chou KC. iLoc-Plant: A multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites. Mol BioSyst. 2011; 7:3287\u201397.","journal-title":"Mol BioSyst"},{"issue":"1-2","key":"940_CR59","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1007\/s10994-012-5285-8","volume":"88","author":"K Dembczynski","year":"2012","unstructured":"Dembczynski K, Waegeman W, Cheng W, Hullermeier E. On label dependence and loss minimization in multi-label classification. Mach Learn. 2012; 88(1-2):5\u201345.","journal-title":"Mach Learn"},{"key":"940_CR60","volume-title":"Proceedings of the 24th Annual Conference on Learning Theory","author":"W Gao","year":"2011","unstructured":"Gao W, Zhou ZH. On the consistency of multi-label learning. In: Proceedings of the 24th Annual Conference on Learning Theory. Massachusetts, USA: Microtome Publishing: 2011. p. 341\u201358."},{"issue":"13","key":"940_CR61","doi-asserted-by":"publisher","first-page":"i53","DOI":"10.1093\/bioinformatics\/btt228","volume":"29","author":"WT Clark","year":"2013","unstructured":"Clark WT, Radivojac P. Information-theoretic evaluation of predicted ontological annotations. Bioinformatics. 2013; 29(13):i53\u201361.","journal-title":"Bioinformatics"},{"issue":"6","key":"940_CR62","doi-asserted-by":"publisher","first-page":"1544","DOI":"10.1110\/ps.062184006","volume":"15","author":"K Verspoor","year":"2006","unstructured":"Verspoor K, Cohn J, Mniszewski S, Joslyn C. A categorization approach to automated ontological function annotation. Protein Sci. 2006; 15(6):1544\u20139.","journal-title":"Protein Sci"},{"issue":"7","key":"940_CR63","doi-asserted-by":"publisher","first-page":"1079","DOI":"10.1109\/TKDE.2010.164","volume":"23","author":"G Tsoumakas","year":"2011","unstructured":"Tsoumakas G, Katakis I, Vlahavas I. Random k-labelsets for multilabel classification. IEEE Trans Knowl Data Eng. 2011; 23(7):1079\u201389.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"940_CR64","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511809071","volume-title":"Introduction to information retrieval Volume 1.","author":"CD Manning","year":"2008","unstructured":"Manning CD, Raghavan P, Sch\u00fctze H. Introduction to information retrieval Volume 1.Cambridge: Cambridge university press; 2008."},{"key":"940_CR65","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1016\/j.jtbi.2011.06.005","volume":"284","author":"X Xiao","year":"2011","unstructured":"Xiao X, Wu ZC, Chou KC. iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites. J Theor Biol. 2011; 284:42\u201351.","journal-title":"J Theor Biol"},{"key":"940_CR66","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-21606-5","volume-title":"The element of statistical learning","author":"T Hastie","year":"2001","unstructured":"Hastie T, Tibshirani R, Friedman J. The element of statistical learning. Berlin, Germany: Springer-Verlag; 2001."},{"issue":"2","key":"940_CR67","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol. 2005; 67(2):301\u201320.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"issue":"3","key":"940_CR68","doi-asserted-by":"publisher","first-page":"e18258","DOI":"10.1371\/journal.pone.0018258","volume":"6","author":"KC Chou","year":"2011","unstructured":"Chou KC, Wu ZC, Xiao X. iLoc-Euk: A multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PLoS ONE. 2011; 6(3):e18258.","journal-title":"PLoS ONE"},{"key":"940_CR69","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1016\/j.jtbi.2015.06.042","volume":"382","author":"S Wan","year":"2015","unstructured":"Wan S, Mak MW, Kung SY. mLASSO-Hum: A LASSO-based interpretable human-protein subcellular localization predictor. J Theor Biol. 2015; 382:223\u201334.","journal-title":"J Theor Biol"},{"key":"940_CR70","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1016\/S0065-3233(00)54009-1","volume":"54","author":"K Nakai","year":"2000","unstructured":"Nakai K. Protein sorting signals and prediction of subcellular localization. Adv Protein Chem. 2000; 54:277\u2013344.","journal-title":"Adv Protein Chem"},{"key":"940_CR71","doi-asserted-by":"crossref","unstructured":"Wan S, Mak MW, Kung SY. Mem-mEN: Predicting multi-functional types of membrane proteins by interpretable elastic nets. IEEE\/ACM Trans Comput Biol Bioinformatics. 2015. doi:http:\/\/dx.doi.org\/10.1109\/TCBB.2015.2474407.","DOI":"10.1109\/TCBB.2015.2474407"},{"key":"940_CR72","doi-asserted-by":"crossref","unstructured":"Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B Methodol. 1996:267\u201388.","DOI":"10.1111\/j.2517-6161.1996.tb02080.x"},{"issue":"4","key":"940_CR73","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1093\/bioinformatics\/btn660","volume":"25","author":"B Zhang","year":"2009","unstructured":"Zhang B, Li H, Riggins RB, Zhan M, Xuan J, Zhang Z, Hoffman EP, Clarke R, Wang Y. Differential dependency network analysis to identify condition-specific topological changes in biological networks. Bioinformatics. 2009; 25(4):526\u201332.","journal-title":"Bioinformatics"},{"issue":"17","key":"940_CR74","doi-asserted-by":"publisher","first-page":"2406","DOI":"10.1093\/bioinformatics\/btr410","volume":"27","author":"Y Lu","year":"2011","unstructured":"Lu Y, Zhou Y, Qu W, Deng M, Zhang C. A Lasso regression model for the construction of microRNA-target regulatory networks. Bioinformatics. 2011; 27(17):2406\u201313.","journal-title":"Bioinformatics"},{"issue":"12","key":"940_CR75","doi-asserted-by":"publisher","first-page":"e83745","DOI":"10.1371\/journal.pone.0083745","volume":"8","author":"EL de Maturana","year":"2013","unstructured":"de Maturana EL, Ye Y, Calle ML, Rothman N, Urrea V, et al. Application of multi-SNP approaches Bayesian LASSO and AUC-RF to detect main effects of inflammatory-gene variants associated with bladder cancer risk. PLoS ONE. 2013; 8(12):e83745.","journal-title":"PLoS ONE"},{"issue":"10","key":"940_CR76","doi-asserted-by":"publisher","first-page":"2566","DOI":"10.1039\/c2mb25096a","volume":"8","author":"A Vasilevski","year":"2012","unstructured":"Vasilevski A, Giorgi FM, Bertinetti L, Usadel B. LASSO modeling of the Arabidopsis thaliana seed\/seedling transcriptome: a model case for detection of novel mucilage and pectin metabolism genes. Mol BioSyst. 2012; 8(10):2566\u201374.","journal-title":"Mol BioSyst"},{"issue":"Suppl 1","key":"940_CR77","doi-asserted-by":"publisher","first-page":"S10","DOI":"10.1186\/1471-2105-16-S1-S10","volume":"16","author":"D He","year":"2015","unstructured":"He D, Wang Z, Parida L. Data-driven encoding for quantitative genetic trait prediction. BMC Bioinformatics. 2015; 16(Suppl 1):S10.","journal-title":"BMC Bioinformatics"},{"key":"940_CR78","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1016\/j.jbi.2015.02.003","volume":"54","author":"BJ Marafino","year":"2015","unstructured":"Marafino BJ, Boscardin WJ, Dudley RA. Efficient and sparse feature selection for biomedical text classification via the elastic net: Application to ICU risk stratification from nursing notes. J Biomed Inf. 2015; 54:114\u201320.","journal-title":"J Biomed Inf"},{"issue":"8","key":"940_CR79","doi-asserted-by":"publisher","first-page":"879","DOI":"10.1002\/gepi.20543","volume":"34","author":"KL Ayers","year":"2010","unstructured":"Ayers KL, Cordell HJ. SNP Selection in genome-wide and candidate gene studies via penalized logistic regression. Genet Epidemiol. 2010; 34(8):879\u201391.","journal-title":"Genet Epidemiol"},{"issue":"2","key":"940_CR80","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1214\/009053604000000067","volume":"32","author":"B Efron","year":"2004","unstructured":"Efron B, Hastie T, Johnstone I, Tibshirani R. Least angle regression. Ann Stat. 2004; 32(2):407\u201399.","journal-title":"Ann Stat"},{"key":"940_CR81","unstructured":"Sj\u00f6strand K, Clemmensen LH, Larsen R, Ersb\u00f8ll B. SpaSM: A Matlab Toolbox for Sparse Statistical Modeling. J Stat Softw. Accepted for publication. 2004. http:\/\/www.imm.dtu.dk\/projects\/spasm\/references\/spasm.pdf."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0940-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-0940-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0940-x","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-0940-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:21:54Z","timestamp":1706811714000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-0940-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,2,24]]},"references-count":81,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["940"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-0940-x","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,2,24]]},"assertion":[{"value":"17 April 2015","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 January 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 February 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"97"}}