{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T14:13:01Z","timestamp":1753884781252,"version":"3.41.2"},"reference-count":33,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"DOI":"10.13039\/100014718","name":"Innovative Research Group Project of the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62071079"],"award-info":[{"award-number":["62071079"]}],"id":[{"id":"10.13039\/100014718","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2022,2]]},"abstract":"<jats:p> O-glycosylation is a protein posttranslational modification important in regulating almost all cells. It is related to a large number of physiological and pathological phenomena. Recognizing O-glycosylation sites is the key to further investigating the molecular mechanism of protein posttranslational modification. This study aimed to collect a reliable dataset on Homo sapiens and develop an O-glycosylation predictor for Homo sapiens, named Captor, through multiple features. A random undersampling method and a synthetic minority oversampling technique were employed to deal with imbalanced data. In addition, the Kruskal\u2013Wallis (K\u2013W) test was adopted to optimize feature vectors and improve the performance of the model. A support vector machine, due to its optimal performance, was used to train and optimize the final prediction model after a comprehensive comparison of various classifiers in traditional machine learning methods and deep learning. On the independent test set, Captor outperformed the existing O-glycosylation tool, suggesting that Captor could provide more instructive guidance for further experimental research on O-glycosylation. The source code and datasets are available at https:\/\/github.com\/YanZhu06\/Captor\/ . <\/jats:p>","DOI":"10.1142\/s0219720021500293","type":"journal-article","created":{"date-parts":[[2021,11,22]],"date-time":"2021-11-22T14:19:26Z","timestamp":1637590766000},"source":"Crossref","is-referenced-by-count":5,"title":["O-glycosylation site prediction for <i>Homo sapiens<\/i> by combining properties and sequence features with support vector machine"],"prefix":"10.1142","volume":"20","author":[{"given":"Yan","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Science, Dalian Maritime University, Dalian 116026, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuwan","family":"Yin","sequence":"additional","affiliation":[{"name":"School of Science, Dalian Maritime University, Dalian 116026, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jia","family":"Zheng","sequence":"additional","affiliation":[{"name":"School of Science, Dalian Maritime University, Dalian 116026, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yixia","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics, Lingnan Normal University, Zhanjiang 524048, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cangzhi","family":"Jia","sequence":"additional","affiliation":[{"name":"School of Science, Dalian Maritime University, Dalian 116026, P. R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2021,11,19]]},"reference":[{"key":"S0219720021500293BIB001","doi-asserted-by":"publisher","DOI":"10.1007\/s11517-015-1268-9"},{"key":"S0219720021500293BIB002","doi-asserted-by":"publisher","DOI":"10.4137\/BBI.S26864"},{"key":"S0219720021500293BIB003","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-21802-6_72"},{"key":"S0219720021500293BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/j.gpb.2020.05.003"},{"key":"S0219720021500293BIB005","doi-asserted-by":"publisher","DOI":"10.3233\/AIC-130580"},{"key":"S0219720021500293BIB006","doi-asserted-by":"publisher","DOI":"10.1016\/j.compag.2018.12.006"},{"issue":"4","key":"S0219720021500293BIB007","first-page":"1738","volume":"13","author":"Mu RH","year":"2019","journal-title":"KSII Trans Internet Inf Syst"},{"key":"S0219720021500293BIB008","doi-asserted-by":"publisher","DOI":"10.1016\/j.gpb.2018.04.007"},{"key":"S0219720021500293BIB009","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bty1051"},{"key":"S0219720021500293BIB010","doi-asserted-by":"publisher","DOI":"10.1021\/acs.analchem.1c00354"},{"key":"S0219720021500293BIB011","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btx496"},{"key":"S0219720021500293BIB012","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bty1068"},{"key":"S0219720021500293BIB013","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.8b00350"},{"key":"S0219720021500293BIB014","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-8-438"},{"key":"S0219720021500293BIB015","doi-asserted-by":"publisher","DOI":"10.1038\/emboj.2013.79"},{"key":"S0219720021500293BIB016","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0067008"},{"key":"S0219720021500293BIB017","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btu852"},{"key":"S0219720021500293BIB018","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1038\/s41598-016-0019-y","volume":"6","author":"Li F","year":"2016","journal-title":"Sci Rep"},{"key":"S0219720021500293BIB019","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-019-2700-1"},{"key":"S0219720021500293BIB020","doi-asserted-by":"publisher","DOI":"10.6026\/97320630014213"},{"key":"S0219720021500293BIB021","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq003"},{"key":"S0219720021500293BIB022","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btz721"},{"key":"S0219720021500293BIB023","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/28.1.374"},{"key":"S0219720021500293BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/TCBB.2019.2957758"},{"key":"S0219720021500293BIB025","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbz041"},{"key":"S0219720021500293BIB026","doi-asserted-by":"publisher","DOI":"10.1038\/srep10184"},{"key":"S0219720021500293BIB027","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-021-00314-z"},{"key":"S0219720021500293BIB028","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2021.07.017"},{"key":"S0219720021500293BIB029","doi-asserted-by":"publisher","DOI":"10.1016\/j.ab.2020.113592"},{"key":"S0219720021500293BIB030","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btz015"},{"key":"S0219720021500293BIB031","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btz408"},{"key":"S0219720021500293BIB032","doi-asserted-by":"publisher","DOI":"10.1016\/j.ecolind.2021.107416"},{"key":"S0219720021500293BIB033","doi-asserted-by":"publisher","DOI":"10.1016\/j.ab.2015.12.009"}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720021500293","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,3]],"date-time":"2022-03-03T08:15:27Z","timestamp":1646295327000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720021500293"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,19]]},"references-count":33,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2022,2]]}},"alternative-id":["10.1142\/S0219720021500293"],"URL":"https:\/\/doi.org\/10.1142\/s0219720021500293","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"type":"print","value":"0219-7200"},{"type":"electronic","value":"1757-6334"}],"subject":[],"published":{"date-parts":[[2021,11,19]]},"article-number":"2150029"}}