{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T11:30:43Z","timestamp":1774265443214,"version":"3.50.1"},"reference-count":106,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2018,7,13]],"date-time":"2018-07-13T00:00:00Z","timestamp":1531440000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31560316"],"award-info":[{"award-number":["31560316"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61261027"],"award-info":[{"award-number":["61261027"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61262038"],"award-info":[{"award-number":["61262038"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61202313"],"award-info":[{"award-number":["61202313"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Province National Natural Science Foundation of JiangXi","award":["20171ACB20023"],"award-info":[{"award-number":["20171ACB20023"]}]},{"DOI":"10.13039\/501100009102","name":"Department of Education of JiangXi Province","doi-asserted-by":"crossref","award":["GJJ170793"],"award-info":[{"award-number":["GJJ170793"]}],"id":[{"id":"10.13039\/501100009102","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>A cell contains numerous protein molecules. One of the fundamental goals in cell biology is to determine their subcellular locations, which can provide useful clues about their functions. Knowledge of protein subcellular localization is also indispensable for prioritizing and selecting the right targets for drug development. With the avalanche of protein sequences emerging in the post-genomic age, it is highly desired to develop computational tools for timely and effectively identifying their subcellular localization based on the sequence information alone. Recently, a predictor called \u2018pLoc-mAnimal\u2019 was developed for identifying the subcellular localization of animal proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with the multi-label systems in which some proteins, called \u2018multiplex proteins\u2019, may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mAnimal was trained by an extremely skewed dataset in which some subset (subcellular location) was about 128 times the size of the other subsets. Accordingly, such an uneven training dataset will inevitably cause a biased consequence.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To alleviate such biased consequence, we have developed a new and bias-reducing predictor called pLoc_bal-mAnimal by quasi-balancing the training dataset. Cross-validation tests on exactly the same experiment-confirmed dataset have indicated that the proposed new predictor is remarkably superior to pLoc-mAnimal, the existing state-of-the-art predictor, in identifying the subcellular localization of animal proteins.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>To maximize the convenience for the vast majority of experimental scientists, a user-friendly web-server for the new predictor has been established at http:\/\/www.jci-bioinfo.cn\/pLoc_bal-mAnimal\/, by which users can easily get their desired results without the need to go through the complicated mathematics.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty628","type":"journal-article","created":{"date-parts":[[2018,7,13]],"date-time":"2018-07-13T11:31:22Z","timestamp":1531481482000},"page":"398-406","source":"Crossref","is-referenced-by-count":80,"title":["pLoc_bal-mAnimal: predict subcellular localization of animal proteins by balancing training dataset and PseAAC"],"prefix":"10.1093","volume":"35","author":[{"given":"Xiang","family":"Cheng","sequence":"first","affiliation":[{"name":"Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China"},{"name":"Computational Biology, Gordon Life Science Institute, Boston, MA, USA"}]},{"given":"Wei-Zhong","family":"Lin","sequence":"additional","affiliation":[{"name":"Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China"}]},{"given":"Xuan","family":"Xiao","sequence":"additional","affiliation":[{"name":"Computer Science, Jingdezhen Ceramic Institute, Jingdezhen, China"},{"name":"Computational Biology, Gordon Life Science Institute, Boston, MA, USA"}]},{"given":"Kuo-Chen","family":"Chou","sequence":"additional","affiliation":[{"name":"Computational Biology, Gordon Life Science Institute, Boston, MA, USA"},{"name":"Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China"}]}],"member":"286","published-online":{"date-parts":[[2018,7,13]]},"reference":[{"key":"2023013107242623100_bty628-B1","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1007\/s00232-015-9868-8","article-title":"Prediction of protein submitochondrial locations by incorporating dipeptide composition into Chou\u2019s general pseudo amino acid composition","volume":"249","author":"Ahmad","year":"2016","journal-title":"J. Membr. Biol."},{"key":"2023013107242623100_bty628-B2","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.jtbi.2015.07.034","article-title":"Classification of membrane protein types using Voting Feature Interval in combination with Chou\u2019s Pseudo Amino Acid Composition","volume":"384","author":"Ali","year":"2015","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B3","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.jtbi.2018.01.008","article-title":"iMem-2LSAAC: a two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into Chou\u2019s pseudo amino acid composition","volume":"442","author":"Arif","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B4","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1016\/j.jtbi.2005.05.034","article-title":"Using LogitBoost classifier to predict protein structural classes","volume":"238","author":"Cai","year":"2006","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B5","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1002\/jcb.10030","article-title":"Support vector machines for prediction of protein subcellular location by incorporating quasi-sequence-order effect","volume":"84","author":"Cai","year":"2002","journal-title":"J. Cell. Biochem."},{"key":"2023013107242623100_bty628-B6","doi-asserted-by":"crossref","first-page":"1163","DOI":"10.2174\/092986612803216999","article-title":"Predicting viral protein subcellular localization with Chou\u2019s pseudo amino acid composition and imbalance-weighted multi-label K-nearest neighbor algorithm","volume":"19","author":"Cao","year":"2012","journal-title":"Protein Pept. Lett."},{"key":"2023013107242623100_bty628-B7","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1006\/jmbi.1996.0804","article-title":"Relation between amino acid composition and cellular location of proteins","volume":"266","author":"Cedano","year":"1997","journal-title":"J. Mol. Biol."},{"key":"2023013107242623100_bty628-B8","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: synthetic minority over-sampling technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res."},{"key":"2023013107242623100_bty628-B9","doi-asserted-by":"crossref","first-page":"4208","DOI":"10.18632\/oncotarget.13758","article-title":"iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences","volume":"8","author":"Chen","year":"2017","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B10","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1016\/j.omtn.2018.03.012","article-title":"iRNA-3typeA: identifying 3-types of modification at RNA\u2019s adenosine sites","volume":"11","author":"Chen","year":"2018","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023013107242623100_bty628-B11","doi-asserted-by":"crossref","first-page":"e68","DOI":"10.1093\/nar\/gks1450","article-title":"iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition","volume":"41","author":"Chen","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023013107242623100_bty628-B12","first-page":"e332","article-title":"iRNA-PseU: identifying RNA pseudouridine sites","volume":"5","author":"Chen","year":"2016","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023013107242623100_bty628-B13","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1093\/bioinformatics\/bty140","article-title":"iFeature: a python package and web server for features extraction and selection from protein and peptide sequences","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B14","first-page":"30102","article-title":"pLoc-mGneg: predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC","volume":"7543","author":"Cheng","year":"2017","journal-title":"Genomics"},{"key":"2023013107242623100_bty628-B15","doi-asserted-by":"crossref","first-page":"1722","DOI":"10.1039\/C7MB00267J","article-title":"pLoc-mPlant: predict subcellular localization of multi-location plant proteins via incorporating the optimal GO information into general PseAAC","volume":"13","author":"Cheng","year":"2017","journal-title":"Mol. BioSyst."},{"key":"2023013107242623100_bty628-B16","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.ygeno.2017.08.005","article-title":"pLoc-mEuk: predict subcellular localization of multi-label eukaryotic proteins by extracting the key GO information into general PseAAC","volume":"110","author":"Cheng","year":"2018","journal-title":"Genomics"},{"key":"2023013107242623100_bty628-B17","doi-asserted-by":"crossref","first-page":"1448","DOI":"10.1093\/bioinformatics\/btx711","article-title":"pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information","volume":"34","author":"Cheng","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B18","doi-asserted-by":"crossref","first-page":"3524","DOI":"10.1093\/bioinformatics\/btx476","article-title":"pLoc-mAnimal: predict subcellular localization of animal proteins with both single and multiple sites","volume":"33","author":"Cheng","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B19","doi-asserted-by":"crossref","first-page":"58494","DOI":"10.18632\/oncotarget.17028","article-title":"iATC-mHyb: a hybrid multi-label classifier for predicting the classification of anatomical therapeutic chemicals","volume":"8","author":"Cheng","year":"2017","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B20","doi-asserted-by":"crossref","first-page":"2610","DOI":"10.1093\/bioinformatics\/btx387","article-title":"iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals","volume":"33","author":"Cheng","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B21","first-page":"195","article-title":"A vectorized sequence-coupling model for predicting HIV protease cleavage sites in proteins","volume":"16","author":"Chou","year":"1993","journal-title":"J. Biol. Chem."},{"key":"2023013107242623100_bty628-B22","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1002\/prot.1072","article-title":"Prediction of protein cellular attributes using pseudo amino acid composition","volume":"44","author":"Chou","year":"2001","journal-title":"Proteins"},{"key":"2023013107242623100_bty628-B23","doi-asserted-by":"crossref","first-page":"262","DOI":"10.2174\/157016409789973707","article-title":"Pseudo amino acid composition and its applications in bioinformatics, proteomics and system biology","volume":"6","author":"Chou","year":"2009","journal-title":"Curr. Proteomics"},{"key":"2023013107242623100_bty628-B24","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1016\/j.jtbi.2010.12.024","article-title":"Some remarks on protein attribute prediction and pseudo amino acid composition (50th Anniversary Year Review)","volume":"273","author":"Chou","year":"2011","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B25","doi-asserted-by":"crossref","first-page":"1092","DOI":"10.1039\/c3mb25555g","article-title":"Some remarks on predicting multi-label attributes in molecular biosystems","volume":"9","author":"Chou","year":"2013","journal-title":"Mol. Biosyst."},{"key":"2023013107242623100_bty628-B26","doi-asserted-by":"crossref","first-page":"218","DOI":"10.2174\/1573406411666141229162834","article-title":"Impacts of bioinformatics to medicinal chemistry","volume":"11","author":"Chou","year":"2015","journal-title":"Med. Chem."},{"key":"2023013107242623100_bty628-B27","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.2174\/1568026617666170414145508","article-title":"An unprecedented revolution in medicinal chemistry driven by the progress of biological science","volume":"17","author":"Chou","year":"2017","journal-title":"Curr. Top. Med. Chem."},{"key":"2023013107242623100_bty628-B28","doi-asserted-by":"crossref","first-page":"743","DOI":"10.1016\/j.bbrc.2003.10.062","article-title":"A new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology","volume":"311","author":"Chou","year":"2003","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023013107242623100_bty628-B29","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1016\/j.bbrc.2004.06.073","article-title":"Prediction of protein subcellular locations by GO-FunD-PseAA predicor","volume":"320","author":"Chou","year":"2004","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023013107242623100_bty628-B30","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1006\/bbrc.1998.9498","article-title":"Using discriminant function for prediction of subcellular location of prokaryotic proteins","volume":"252","author":"Chou","year":"1998","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023013107242623100_bty628-B31","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1093\/protein\/12.2.107","article-title":"Protein subcellular location prediction","volume":"12","author":"Chou","year":"1999","journal-title":"Protein Eng."},{"key":"2023013107242623100_bty628-B32","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1021\/pr0255710","article-title":"Prediction of enzyme family classes","volume":"2","author":"Chou","year":"2003","journal-title":"J. Proteome Res."},{"key":"2023013107242623100_bty628-B33","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1016\/j.bbrc.2007.06.027","article-title":"MemType-2L: a Web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM","volume":"360","author":"Chou","year":"2007","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023013107242623100_bty628-B34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.ab.2007.07.006","article-title":"Review: recent progresses in protein subcellular location prediction","volume":"370","author":"Chou","year":"2007","journal-title":"Anal. Biochem."},{"key":"2023013107242623100_bty628-B35","first-page":"63","article-title":"Recent advances in developing web-servers for predicting protein attributes","volume":"01","author":"Chou","year":"2009","journal-title":"Nat. Sci."},{"key":"2023013107242623100_bty628-B36","doi-asserted-by":"crossref","first-page":"275","DOI":"10.3109\/10409239509083488","article-title":"Review: prediction of protein structural classes","volume":"30","author":"Chou","year":"1995","journal-title":"Crit. Rev. Biochem. Mol. Biol."},{"key":"2023013107242623100_bty628-B37","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1016\/j.jtbi.2014.09.029","article-title":"Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou\u2019s general PseAAC","volume":"364","author":"Dehzangi","year":"2015","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B38","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1016\/S1534-5807(02)00216-2","article-title":"Spatio-temporal regulation of Rac1 localization and lamellipodia dynamics during epithelial cell-cell adhesion","volume":"3","author":"Ehrlich","year":"2002","journal-title":"Dev. Cell"},{"key":"2023013107242623100_bty628-B39","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1006\/jmbi.2000.3903","article-title":"Predicting subcellular localization of proteins based on their N-terminal amino acid sequence","volume":"300","author":"Emanuelsson","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023013107242623100_bty628-B40","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/j.jtbi.2009.11.016","article-title":"Using the concept of Chou\u2019s pseudo amino acid composition for risk type prediction of human papillomaviruses","volume":"263","author":"Esmaeili","year":"2010","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B41","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.jtbi.2012.03.017","article-title":"Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou\u2019s pseudo amino acid composition","volume":"304","author":"Fan","year":"2012","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B42","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.omtn.2017.03.006","article-title":"iRNA-PseColl: identifying the occurrence sites of different RNA modifications by incorporating collective effects of nucleotides into PseKNC","volume":"7","author":"Feng","year":"2017","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023013107242623100_bty628-B43","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.ab.2013.05.024","article-title":"iHSP-PseRAAAC: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition","volume":"442","author":"Feng","year":"2013","journal-title":"Anal. Biochem."},{"key":"2023013107242623100_bty628-B44","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1093\/bioinformatics\/bti057","article-title":"PSORTb v.2.0: expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis","volume":"21","author":"Gardy","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B45","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1016\/j.devcel.2006.12.007","article-title":"Automated subcellular location determination and high-throughput microscopy","volume":"12","author":"Glory","year":"2007","journal-title":"Dev. Cell"},{"key":"2023013107242623100_bty628-B46","doi-asserted-by":"crossref","first-page":"e37155","DOI":"10.1371\/journal.pone.0037155","article-title":"Imbalanced multi-modal multi-label learning for subcellular localization prediction of human proteins with both single and multiple sites","volume":"7","author":"He","year":"2012","journal-title":"PLoS One"},{"key":"2023013107242623100_bty628-B47","doi-asserted-by":"crossref","first-page":"1158","DOI":"10.1093\/bioinformatics\/btl002","article-title":"MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition","volume":"22","author":"Hoglund","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B48","doi-asserted-by":"crossref","first-page":"W585","DOI":"10.1093\/nar\/gkm259","article-title":"WoLF PSORT: protein localization predictor","volume":"35","author":"Horton","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023013107242623100_bty628-B49","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.biosystems.2013.04.005","article-title":"Using radial basis function on the general form of Chou\u2019s pseudo amino acid composition and PSSM to predict subcellular locations of proteins with both single and multiple sites","volume":"113","author":"Huang","year":"2013","journal-title":"Biosystems"},{"key":"2023013107242623100_bty628-B50","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.jtbi.2015.04.011","article-title":"iPPI-Esml: an ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC","volume":"377","author":"Jia","year":"2015","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B51","doi-asserted-by":"crossref","first-page":"34558","DOI":"10.18632\/oncotarget.9148","article-title":"iCar-PseCp: identify carbonylation sites in proteins by Monto Carlo sampling and incorporating sequence coupled effects into general PseAAC","volume":"7","author":"Jia","year":"2016","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B52","doi-asserted-by":"crossref","first-page":"95","DOI":"10.3390\/molecules21010095","article-title":"iPPBS-Opt: a Sequence-Based Ensemble Classifier for Identifying Protein-Protein Binding Sites by Optimizing Imbalanced Training Datasets","volume":"21","author":"Jia","year":"2016","journal-title":"Molecules"},{"key":"2023013107242623100_bty628-B53","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.ab.2015.12.009","article-title":"iSuc-PseOpt: identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset","volume":"497","author":"Jia","year":"2016","journal-title":"Anal. Biochem."},{"key":"2023013107242623100_bty628-B54","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1016\/j.jtbi.2016.01.020","article-title":"pSuc-Lys: predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach","volume":"394","author":"Jia","year":"2016","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B55","doi-asserted-by":"crossref","first-page":"392","DOI":"10.2174\/092986608784246443","article-title":"Using the concept of Chou\u2019s pseudo amino acid composition to predict apoptosis proteins subcellular location: an approach by approximate entropy","volume":"15","author":"Jiang","year":"2008","journal-title":"Protein Pept. Lett."},{"key":"2023013107242623100_bty628-B56","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1007\/s00438-015-1108-5","article-title":"iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou\u2019s PseAAC to formulate DNA samples","volume":"291","author":"Kabir","year":"2016","journal-title":"Mol. Genet. Genomics"},{"key":"2023013107242623100_bty628-B57","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.jtbi.2016.12.004","article-title":"Unb-DPC: identify mycobacterial membrane protein types by incorporating un-biased dipeptide composition into Chou\u2019s general PseAAC","volume":"415","author":"Khan","year":"2017","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B58","doi-asserted-by":"crossref","first-page":"375","DOI":"10.2174\/092986612799789369","article-title":"Prediction of protein subcellular multi-localization based on the general form of Chou\u2019s pseudo amino acid composition","volume":"19","author":"Li","year":"2012","journal-title":"Protein Pept. Lett."},{"key":"2023013107242623100_bty628-B59","doi-asserted-by":"crossref","first-page":"12961","DOI":"10.1093\/nar\/gku1019","article-title":"iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition","volume":"42","author":"Lin","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"2023013107242623100_bty628-B60","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1039\/c3mb25466f","article-title":"iLoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins","volume":"9","author":"Lin","year":"2013","journal-title":"Mol. BioSyst."},{"key":"2023013107242623100_bty628-B61","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/bty458","article-title":"iEnhancer-EL: identifying enhancers and their strength with ensemble learning approach","author":"Liu","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B62","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1093\/bioinformatics\/btw539","article-title":"iRSpot-EL: identify recombination spots with an ensemble learning approach","volume":"33","author":"Liu","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B63","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/bty312","article-title":"iRO-3wPseKNC: identify DNA replication origins by three-window-based PseKNC","author":"Liu","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B64","doi-asserted-by":"crossref","first-page":"13338","DOI":"10.18632\/oncotarget.14524","article-title":"Pse-Analysis: a python package for DNA\/RNA and protein\/peptide sequence analysis based on pseudo components and kernel methods","volume":"8","author":"Liu","year":"2017","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B65","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1016\/j.omtn.2017.04.008","article-title":"2L-piRNA: a two-layer ensemble classifier for identifying piwi-interacting RNAs and their function","volume":"7","author":"Liu","year":"2017","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023013107242623100_bty628-B66","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1093\/bioinformatics\/btx579","article-title":"iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC","volume":"34","author":"Liu","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B67","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.ab.2014.12.009","article-title":"iDNA-Methyl: identifying DNA methylation sites via pseudo trinucleotide composition","volume":"474","author":"Liu","year":"2015","journal-title":"Anal. Biochem."},{"key":"2023013107242623100_bty628-B68","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1016\/j.ab.2015.12.017","article-title":"pRNAm-PC: predicting N-methyladenosine sites in RNA sequences via physical-chemical properties","volume":"497","author":"Liu","year":"2016","journal-title":"Anal. Biochem."},{"key":"2023013107242623100_bty628-B69","doi-asserted-by":"crossref","first-page":"2804","DOI":"10.1110\/ps.051597405","article-title":"A novel representation of protein sequences for prediction of subcellular location using support vector machines","volume":"14","author":"Matsuda","year":"2005","journal-title":"Protein Sci."},{"key":"2023013107242623100_bty628-B70","doi-asserted-by":"crossref","first-page":"42362","DOI":"10.1038\/srep42362","article-title":"Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou\u2019s general PseAAC","volume":"7","author":"Meher","year":"2017","journal-title":"Sci. Rep."},{"key":"2023013107242623100_bty628-B71","doi-asserted-by":"crossref","first-page":"2359","DOI":"10.1038\/s41598-018-20819-x","article-title":"Prediction of HIV-1 and HIV-2 proteins by using Chou\u2019s pseudo amino acid compositions and different classifiers","volume":"8","author":"Mei","year":"2018","journal-title":"Sci. Rep."},{"key":"2023013107242623100_bty628-B72","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1016\/j.jtbi.2012.06.028","article-title":"Predicting plant protein subcellular multi-localization by Chou\u2019s PseAAC formulation based multi-label homolog knowledge transfer learning","volume":"310","author":"Mei","year":"2012","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B73","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1016\/j.jtbi.2011.04.017","article-title":"Prediction of GABA(A) receptor proteins using the concept of Chou\u2019s pseudo amino acid composition and support vector machine","volume":"281","author":"Mohabatkar","year":"2011","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B74","doi-asserted-by":"crossref","first-page":"1610","DOI":"10.1016\/j.patrec.2007.04.001","article-title":"Using pseudo amino acid composition to predict protein subnuclear localization: approached with PSSM","volume":"28","author":"Mundra","year":"2007","journal-title":"Pattern Recognit. Lett."},{"key":"2023013107242623100_bty628-B75","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.jtbi.2018.02.008","article-title":"Using Chou\u2019s general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains","volume":"445","author":"Muthu Krishnan","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B76","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1016\/S0888-7543(05)80111-9","article-title":"A knowledge base for predicting protein localization sites in eukaryotic cells","volume":"14","author":"Nakai","year":"1992","journal-title":"Genomics"},{"key":"2023013107242623100_bty628-B77","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1109\/TNB.2013.2272014","article-title":"Predict subcellular locations of singleplex and multiplex proteins by semi-supervised learning and dimension-reducing general mode of Chou\u2019s PseAAC","volume":"12","author":"Pacharawongsakda","year":"2013","journal-title":"IEEE Trans. Nanobioscience"},{"key":"2023013107242623100_bty628-B78","doi-asserted-by":"crossref","first-page":"44310","DOI":"10.18632\/oncotarget.10027","article-title":"iHyd-PseCp: identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC","volume":"7","author":"Qiu","year":"2016","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B79","doi-asserted-by":"crossref","first-page":"3116","DOI":"10.1093\/bioinformatics\/btw380","article-title":"iPTM-mLys: identifying multiple lysine PTM sites and their different types","volume":"32","author":"Qiu","year":"2016","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B80","doi-asserted-by":"crossref","first-page":"51270","DOI":"10.18632\/oncotarget.9987","article-title":"iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier","volume":"7","author":"Qiu","year":"2016","journal-title":"Oncotarget"},{"key":"2023013107242623100_bty628-B81","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1016\/j.jtbi.2016.11.028","article-title":"OOgenesis_Pred: a sequence-based method for predicting oogenesis proteins by six different modes of Chou\u2019s pseudo amino acid composition","volume":"414","author":"Rahimi","year":"2017","journal-title":"J Theor. Biol."},{"key":"2023013107242623100_bty628-B82","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.jtbi.2018.05.006","article-title":"DPP-PseAAC: a DNA-binding protein prediction model using Chou\u2019s general PseAAC","volume":"452","author":"Rahman","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B83","doi-asserted-by":"crossref","first-page":"2230","DOI":"10.1093\/nar\/26.9.2230","article-title":"Using neural networks for prediction of the subcellular location of proteins","volume":"26","author":"Reinhardt","year":"1998","journal-title":"Nucleic Acids Res."},{"key":"2023013107242623100_bty628-B84","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jtbi.2018.04.037","article-title":"Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou\u2019s PseKNC","volume":"452","author":"Sabooh","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B85","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.jtbi.2018.05.006","article-title":"DPP-PseAAC: a DNA-binding Protein Prediction model using Chou\u2019s general PseAAC","volume":"452","author":"Saifur Rahman","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B86","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.jtbi.2015.08.020","article-title":"Subcellular localization for Gram positive and Gram negative bacterial proteins using linear interpolation smoothing model","volume":"386","author":"Saini","year":"2015","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B87","doi-asserted-by":"crossref","first-page":"915","DOI":"10.1109\/TNB.2015.2500186","article-title":"Predict Gram-positive and Gram-negative subcellular localization via incorporating evolutionary information and physicochemical features into Chou\u2019s general PseAAC","volume":"14","author":"Sharma","year":"2015","journal-title":"IEEE Trans. Nanobioscience"},{"key":"2023013107242623100_bty628-B88","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1007\/s00726-007-0623-z","article-title":"Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution","volume":"35","author":"Shi","year":"2008","journal-title":"Amino Acids"},{"key":"2023013107242623100_bty628-B89","doi-asserted-by":"crossref","first-page":"684","DOI":"10.1093\/bioinformatics\/btx670","article-title":"PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy","volume":"34","author":"Song","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B90","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1016\/j.jtbi.2018.01.023","article-title":"PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural and network features in a machine learning framework","volume":"443","author":"Song","year":"2018","journal-title":"J. Theor. Biol."},{"key":"2023013107242623100_bty628-B91","article-title":"iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites","author":"Song","year":"2018","journal-title":"Brief. Bioinform"},{"key":"2023013107242623100_bty628-B92","doi-asserted-by":"crossref","first-page":"4196","DOI":"10.1093\/bioinformatics\/bty508","article-title":"iLoc-lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC","volume":"34","author":"Su","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B93","doi-asserted-by":"crossref","first-page":"2587","DOI":"10.1039\/C6MB00221H","article-title":"iNuc-STNC: a sequence-based predictor for identification of nucleosome positioning in genomes by extending the concept of SAAC and Chou\u2019s PseAAC","volume":"12","author":"Tahir","year":"2016","journal-title":"Mol. Biosyst."},{"key":"2023013107242623100_bty628-B94","doi-asserted-by":"crossref","first-page":"2546","DOI":"10.1093\/bioinformatics\/bty155","article-title":"Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors","volume":"34","author":"Wang","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B95","doi-asserted-by":"crossref","first-page":"2756","DOI":"10.1093\/bioinformatics\/btx302","article-title":"POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles","volume":"33","author":"Wang","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013107242623100_bty628-B96","doi-asserted-by":"crossref","first-page":"e36317","DOI":"10.1371\/journal.pone.0036317","article-title":"A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins","volume":"7","author":"Wang","year":"2012","journal-title":"PLoS One"},{"key":"2023013107242623100_bty628-B97","first-page":"331","article-title":"pLoc-mGpos: incorporate key gene ontology information into general PseAAC for predicting subcellular localization of Gram-positive bacterial proteins","volume":"9","author":"Xiao","year":"2017","journal-title":"Nat. Sci."},{"key":"2023013107242623100_bty628-B98","doi-asserted-by":"crossref","first-page":"2221","DOI":"10.1080\/07391102.2014.998710","article-title":"iDrug-Target: predicting the interactions between drug compounds and target proteins in cellular networking via the benchmark dataset optimization approach","volume":"33","author":"Xiao","year":"2015","journal-title":"J. Biomol. Struct. Dyn."},{"key":"2023013107242623100_bty628-B99","doi-asserted-by":"crossref","first-page":"e171","DOI":"10.7717\/peerj.171","article-title":"iSNO-AAPair: incorporating amino acid pairwise coupling into PseAAC for predicting cysteine S-nitrosylation sites in proteins","volume":"1","author":"Xu","year":"2013","journal-title":"PeerJ"},{"key":"2023013107242623100_bty628-B100","doi-asserted-by":"crossref","first-page":"e105018","DOI":"10.1371\/journal.pone.0105018","article-title":"iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition","volume":"9","author":"Xu","year":"2014","journal-title":"PLoS One"},{"key":"2023013107242623100_bty628-B101","doi-asserted-by":"crossref","first-page":"883","DOI":"10.7150\/ijbs.24616","article-title":"iRSpot-Pse6NC: identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC","volume":"14","author":"Yang","year":"2018","journal-title":"Int. J. Biol. Sci."},{"key":"2023013107242623100_bty628-B102","doi-asserted-by":"crossref","first-page":"1523","DOI":"10.1016\/S0006-3495(92)81728-9","article-title":"Monte Carlo simulation studies on the prediction of protein folding types from amino acid composition","volume":"63","author":"Zhang","year":"1992","journal-title":"Biophys. J."},{"key":"2023013107242623100_bty628-B103","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1007\/BF01886884","article-title":"An analysis of protein folding type prediction by seed-propagated sampling and jackknife test","volume":"14","author":"Zhang","year":"1995","journal-title":"J. Protein Chem."},{"key":"2023013107242623100_bty628-B104","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1007\/s00726-007-0010-9","article-title":"Using the concept of Chou\u2019s pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies","volume":"34","author":"Zhang","year":"2008","journal-title":"Amino Acids"},{"key":"2023013107242623100_bty628-B105","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1002\/prot.1071","article-title":"Some insights into protein structural class prediction","volume":"44","author":"Zhou","year":"2001","journal-title":"Proteins"},{"key":"2023013107242623100_bty628-B106","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1002\/prot.10251","article-title":"Subcellular location prediction of apoptosis proteins","volume":"50","author":"Zhou","year":"2003","journal-title":"Proteins"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/3\/398\/48965275\/bioinformatics_35_3_398.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/3\/398\/48965275\/bioinformatics_35_3_398.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T10:18:25Z","timestamp":1675160305000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/3\/398\/5053319"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,7,13]]},"references-count":106,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2019,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty628","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,2,1]]},"published":{"date-parts":[[2018,7,13]]}}}