{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,17]],"date-time":"2026-01-17T20:57:50Z","timestamp":1768683470101,"version":"3.49.0"},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2020,8,7]],"date-time":"2020-08-07T00:00:00Z","timestamp":1596758400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Nature Scientific Foundation of China","doi-asserted-by":"crossref","award":["61772119"],"award-info":[{"award-number":["61772119"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Sichuan Provincial Science Fund for Distinguished Young Scholars","award":["2020JDJQ0012"],"award-info":[{"award-number":["2020JDJQ0012"]}]},{"name":"Science Strength Promotion Programme of UESTC"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Protein carbonylation is one of the most important oxidative stress-induced post-translational modifications, which is generally characterized as stability, irreversibility and relative early formation. It plays a significant role in orchestrating various biological processes and has been already demonstrated to be related to many diseases. However, the experimental technologies for carbonylation sites identification are not only costly and time consuming, but also unable of processing a large number of proteins at a time. Thus, rapidly and effectively identifying carbonylation sites by computational methods will provide key clues for the analysis of occurrence and development of diseases.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this study, we developed a predictor called iCarPS to identify carbonylation sites based on sequence information. A novel feature encoding scheme called residues conical coordinates combined with their physicochemical properties was proposed to formulate carbonylated protein and non-carbonylated protein samples. To remove potential redundant features and improve the prediction performance, a feature selection technique was used. The accuracy and robustness of iCarPS were proved by experiments on training and independent datasets. Comparison with other published methods demonstrated that the proposed method is powerful and could provide powerful performance for carbonylation sites identification.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>Based on the proposed model, a user-friendly webserver and a software package were constructed, which can be freely accessed at http:\/\/lin-group.cn\/server\/iCarPS.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa702","type":"journal-article","created":{"date-parts":[[2020,8,4]],"date-time":"2020-08-04T19:31:10Z","timestamp":1596569470000},"page":"171-177","source":"Crossref","is-referenced-by-count":82,"title":["iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features"],"prefix":"10.1093","volume":"37","author":[{"given":"Dan","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]},{"given":"Zhao-Chun","family":"Xu","sequence":"additional","affiliation":[{"name":"Jingdezhen Ceramic Institute Computer Department, , Jingdezhen 333403, China"}]},{"given":"Wei","family":"Su","sequence":"additional","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]},{"given":"Yu-He","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]},{"given":"Hao","family":"Lv","sequence":"additional","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]},{"given":"Hui","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6265-2862","authenticated-orcid":false,"given":"Hao","family":"Lin","sequence":"additional","affiliation":[{"name":"School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China , Chengdu 610054, China"}]}],"member":"286","published-online":{"date-parts":[[2020,8,7]]},"reference":[{"key":"2023051510595758500_btaa702-B1","doi-asserted-by":"crossref","first-page":"1669","DOI":"10.1093\/bib\/bby041","article-title":"Toward more accurate prediction of caspase cleavage sites: a comprehensive review of current methods, tools and features","volume":"20","author":"Bao","year":"2019","journal-title":"Brief. Bioinf"},{"key":"2023051510595758500_btaa702-B2","doi-asserted-by":"crossref","first-page":"1276","DOI":"10.1002\/med.21658","article-title":"Machine intelligence in peptide therapeutics: a next-generation tool for rapid disease screening","volume":"40","author":"Basith","year":"2020","journal-title":"Med. Res. Rev"},{"key":"2023051510595758500_btaa702-B3","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/j.omtn.2019.08.011","article-title":"SDM6A: a web-based integrative machine-learning framework for predicting 6mA sites in the rice genome","volume":"18","author":"Basith","year":"2019","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023051510595758500_btaa702-B4","doi-asserted-by":"crossref","first-page":"2338","DOI":"10.1016\/j.jprot.2011.07.002","article-title":"Identification of protein carbonylation sites by two-dimensional liquid chromatography in combination with MALDI- and ESI-MS","volume":"74","author":"Bollineni","year":"2011","journal-title":"J. Proteomics"},{"key":"2023051510595758500_btaa702-B5","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1016\/j.freeradbiomed.2013.11.030","article-title":"Proteome-wide profiling of carbonylated proteins and carbonylation sites in HeLa cells under mild oxidative stress conditions","volume":"68","author":"Bollineni","year":"2014","journal-title":"Free Radic. Biol. Med"},{"key":"2023051510595758500_btaa702-B6","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn"},{"key":"2023051510595758500_btaa702-B7","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1111\/j.1582-4934.2006.tb00407.x","article-title":"Protein carbonylation, cellular dysfunction, and disease progression","volume":"10","author":"Dalle-Donne","year":"2006","journal-title":"J. Cell. Mol. Med"},{"key":"2023051510595758500_btaa702-B8","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/S1471-4914(03)00031-5","article-title":"Protein carbonylation in human diseases","volume":"9","author":"Dalle-Donne","year":"2003","journal-title":"Trends Mol. Med"},{"key":"2023051510595758500_btaa702-B9","doi-asserted-by":"crossref","first-page":"2261","DOI":"10.1002\/cbic.200500141","article-title":"Protein flexibility and ligand rigidity: a thermodynamic and kinetic study of ITAM-based ligand binding to Syk tandem SH2","volume":"6","author":"de Mol","year":"2005","journal-title":"Chembiochem. Eur. J. Chem. Biol"},{"key":"2023051510595758500_btaa702-B10","doi-asserted-by":"crossref","first-page":"2479","DOI":"10.1093\/bioinformatics\/bth261","article-title":"Data mining in bioinformatics using Weka","volume":"20","author":"Frank","year":"2004","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B11","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/s00726-006-0410-2","article-title":"Detecting oxidative post-translational modifications in proteins","volume":"33","author":"Gianazza","year":"2007","journal-title":"Amino Acids"},{"key":"2023051510595758500_btaa702-B12","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1007\/s11030-009-9204-2","article-title":"Extensions to amino acid description","volume":"14","author":"Gottfries","year":"2010","journal-title":"Mol. Divers"},{"key":"2023051510595758500_btaa702-B13","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1042\/BST0351147","article-title":"Biochemistry of oxidative stress","volume":"35","author":"Halliwell","year":"2007","journal-title":"Biochem. Soc. Trans"},{"key":"2023051510595758500_btaa702-B14","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1148\/radiology.143.1.7063747","article-title":"The meaning and use of the area under a receiver operating characteristic (ROC) curve","volume":"143","author":"Hanley","year":"1982","journal-title":"Radiology"},{"key":"2023051510595758500_btaa702-B15","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.ab.2017.03.008","article-title":"predCar-site: carbonylation sites prediction in proteins using support vector machine with resolving data imbalanced issue","volume":"525","author":"Hasan","year":"2017","journal-title":"Anal. Biochem"},{"key":"2023051510595758500_btaa702-B16","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1093\/bioinformatics\/btq003","article-title":"CD-HIT Suite: a web server for clustering and comparing biological sequences","volume":"26","author":"Huang","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B17","doi-asserted-by":"crossref","first-page":"34558","DOI":"10.18632\/oncotarget.9148","article-title":"iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC","volume":"7","author":"Jia","year":"2016","journal-title":"Oncotarget"},{"key":"2023051510595758500_btaa702-B18","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1186\/s12918-017-0511-4","article-title":"MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs","volume":"11","author":"Kao","year":"2017","journal-title":"BMC Syst. Biol"},{"key":"2023051510595758500_btaa702-B19","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1016\/j.freeradbiomed.2016.11.004","article-title":"In situ visualization of carbonylation and its co-localization with proteins, lipids, DNA and RNA in Caenorhabditis elegans","volume":"101","author":"Kuzmic","year":"2016","journal-title":"Free Radic. Biol. Med"},{"key":"2023051510595758500_btaa702-B20","doi-asserted-by":"crossref","first-page":"3336","DOI":"10.1093\/bioinformatics\/btaa155","article-title":"iMRM: a platform for simultaneously identifying multiple kinds of RNA modifications","volume":"36","author":"Liu","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B21","doi-asserted-by":"crossref","DOI":"10.2174\/1389203721666200117153412","article-title":"An overview on predicting protein subchloroplast localization by using machine learning methods","author":"Liu","year":"2020","journal-title":"Curr. Protein Peptide Sci"},{"key":"2023051510595758500_btaa702-B22","first-page":"ECE591Q","article-title":"Implementation of Breiman's random forest machine learning algorithm","volume":"2005","author":"Livingston","year":"2005","journal-title":"Mach. Learn. J. Pap"},{"key":"2023051510595758500_btaa702-B23","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1007\/BF01808781","article-title":"The degeneracy rule of genetic code","volume":"18","author":"Luo","year":"1988","journal-title":"Orig. Life Evol. Biosph."},{"key":"2023051510595758500_btaa702-B24","doi-asserted-by":"crossref","first-page":"e111478","DOI":"10.1371\/journal.pone.0111478","article-title":"CarSPred: a computational tool for predicting carbonylation sites of human proteins","volume":"9","author":"Lv","year":"2014","journal-title":"PLoS One"},{"key":"2023051510595758500_btaa702-B25","article-title":"A computational method to predict carbonylation sites in yeast proteins","volume":"Genet. Mol. Res., 15, gmr8006.","author":"Lv","year":"2016"},{"key":"2023051510595758500_btaa702-B26","doi-asserted-by":"crossref","first-page":"2757","DOI":"10.1093\/bioinformatics\/bty1047","article-title":"mAHTPred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation","volume":"35","author":"Manavalan","year":"2019","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B27","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1016\/j.omtn.2019.04.019","article-title":"Meta-4mCpred: a sequence-based meta-predictor for accurate DNA 4mC site prediction using effective feature representation","volume":"16","author":"Manavalan","year":"2019","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023051510595758500_btaa702-B28","doi-asserted-by":"crossref","first-page":"2228","DOI":"10.1016\/j.jprot.2011.05.004","article-title":"Protein carbonylation and metal-catalyzed protein oxidation in a cellular perspective","volume":"74","author":"Moller","year":"2011","journal-title":"J. Proteomics"},{"key":"2023051510595758500_btaa702-B29","doi-asserted-by":"crossref","first-page":"4166","DOI":"10.1002\/pmic.201100223","article-title":"Pattern of occurrence and occupancy of carbonylation sites in proteins","volume":"11","author":"Rao","year":"2011","journal-title":"Proteomics"},{"key":"2023051510595758500_btaa702-B30","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1002\/jms.1725","article-title":"Identification of carbonylation sites in apomyoglobin after exposure to 4-hydroxy-2-nonenal by solid-phase enrichment and liquid chromatography-electrospray ionization tandem mass spectrometry","volume":"45","author":"Rauniyar","year":"2010","journal-title":"J. Mass Spectrom."},{"key":"2023051510595758500_btaa702-B31","doi-asserted-by":"crossref","first-page":"763","DOI":"10.3233\/JAD-2009-1013","article-title":"Oxidative stress in diabetes and Alzheimer's disease","volume":"16","author":"Reddy","year":"2009","journal-title":"J. Alzheimer's Dis."},{"key":"2023051510595758500_btaa702-B32","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1007\/978-1-4939-3578-9_17","article-title":"Introducing machine learning concepts with WEKA","volume":"1418","author":"Smith","year":"2016","journal-title":"Methods Mol. Biol"},{"key":"2023051510595758500_btaa702-B33","doi-asserted-by":"crossref","first-page":"752","DOI":"10.1093\/bioinformatics\/btq043","article-title":"Cascleave: towards more accurate prediction of caspase substrate cleavage sites","volume":"26","author":"Song","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B34","doi-asserted-by":"crossref","first-page":"2466","DOI":"10.3934\/mbe.2019123","article-title":"Identification of hormone binding proteins based on machine learning methods","volume":"16","author":"Tan","year":"2019","journal-title":"Math. Biosci. Eng"},{"key":"2023051510595758500_btaa702-B35","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1039\/C5MB00883B","article-title":"Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique","volume":"12","author":"Tang","year":"2016","journal-title":"Mol. BioSyst"},{"key":"2023051510595758500_btaa702-B36","doi-asserted-by":"crossref","first-page":"957","DOI":"10.7150\/ijbs.24174","article-title":"HBPred: a tool to identify growth hormone-binding proteins","volume":"14","author":"Tang","year":"2018","journal-title":"Int. J. Biol. Sci"},{"key":"2023051510595758500_btaa702-B37","doi-asserted-by":"crossref","first-page":"1536","DOI":"10.1093\/bioinformatics\/btl151","article-title":"Two Sample Logo: a graphical representation of the differences between two sets of sequence alignments","volume":"22","author":"Vacic","year":"2006","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B38","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1093\/bioinformatics\/btt603","article-title":"Cascleave 2.0, a new approach for predicting caspase and granzyme cleavage targets","volume":"30","author":"Wang","year":"2014","journal-title":"Bioinformatics"},{"key":"2023051510595758500_btaa702-B39","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1186\/s12859-017-1472-8","article-title":"Investigation and identification of protein carbonylation sites based on position-specific amino acid composition and physicochemical features","volume":"18","author":"Weng","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2023051510595758500_btaa702-B40","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.jtbi.2013.11.012","article-title":"Prediction of posttranslational modification sites from amino acid sequences with kernel methods","volume":"344","author":"Xu","year":"2014","journal-title":"J. Theor. Biol"},{"key":"2023051510595758500_btaa702-B41","doi-asserted-by":"crossref","first-page":"234","DOI":"10.2174\/1574893613666181113131415","article-title":"A brief survey of machine learning methods in protein sub-Golgi localization","volume":"14","author":"Yang","year":"2019","journal-title":"Curr. Bioinform"},{"key":"2023051510595758500_btaa702-B42","doi-asserted-by":"crossref","first-page":"1425","DOI":"10.1093\/bib\/bbz080","article-title":"Predicting disease-associated circular RNAs using deep forests combined with positive-unlabeled learning methods","volume":"21","author":"Zeng","year":"2020","journal-title":"Brief. Bioinf"},{"key":"2023051510595758500_btaa702-B43","doi-asserted-by":"crossref","first-page":"34817","DOI":"10.1038\/srep34817","article-title":"Prediction of phosphothreonine sites in human proteins by fusing different features","volume":"6","author":"Zhao","year":"2016","journal-title":"Sci. Rep"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa702\/33862470\/btaa702.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/2\/171\/50321514\/btaa702.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/2\/171\/50321514\/btaa702.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T09:19:16Z","timestamp":1723367956000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/2\/171\/5885082"}},"subtitle":[],"editor":[{"given":"Jinbo","family":"Xu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,8,7]]},"references-count":43,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,4,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa702","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,1,15]]},"published":{"date-parts":[[2020,8,7]]}}}