{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:08:23Z","timestamp":1753880903349,"version":"3.41.2"},"reference-count":56,"publisher":"World Scientific Pub Co Pte Ltd","issue":"05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2022,10]]},"abstract":"<jats:p>Glycoproteins play an important and ubiquitous role in many biological processes such as protein folding, cell-to-cell signaling, invading microorganism infection, tumor metastasis, and leukocyte trafficking. The key mechanism of glycoproteins must be revealed to model and refine glycosylated protein recognition, which will eventually assist in the design and discovery of carbohydrate-derived therapeutics. Experimental procedures involving wet-lab experiments to reveal glycoproteins are very time-consuming, laborious, and highly costly. However, costly and tedious experimental procedures can be assisted by ranking the most probable glycoproteins through computational methods with improved accuracy. In this study, we have proposed a novel machine learning-based predictive model for glycoproteins identification. Our proposed model is based on sequence-derived structural descriptors (SDSD) that fill the gap of unavailability of protein 3D structures and lack of accuracy in sequence information alone. Through a series of simulation studies, we have shown that our proposed model gives state-of-the-art generalization performance verified through various machine learning-centric and biologically relevant techniques and metrics. Through data mining in this study, we have also identified the role of descriptors in determining glycoproteins. Python-based standalone code together with a webserver implementation of our proposed model (COYOTE: identifiCation Of glYcoprOteins Through sEquences) is available at the URL: https:\/\/sites.google.com\/view\/wajidarshad\/software .<\/jats:p>","DOI":"10.1142\/s0219720022500196","type":"journal-article","created":{"date-parts":[[2022,8,8]],"date-time":"2022-08-08T03:28:35Z","timestamp":1659929315000},"source":"Crossref","is-referenced-by-count":0,"title":["COYOTE: Sequence-derived structural descriptors-based computational identification of glycoproteins"],"prefix":"10.1142","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7691-5715","authenticated-orcid":false,"given":"Wajid Arshad","family":"Abbasi","sequence":"first","affiliation":[{"name":"Computational Biology and Data Analysis Laboratory, Department of Computer Sciences & Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, AJ&K 13100 Pakistan"}]},{"given":"Asma","family":"Anjam","sequence":"additional","affiliation":[{"name":"Computational Biology and Data Analysis Laboratory, Department of Computer Sciences & Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, AJ&K 13100 Pakistan"}]},{"given":"Sadia","family":"Khalil","sequence":"additional","affiliation":[{"name":"Computational Biology and Data Analysis Laboratory, Department of Computer Sciences & Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, AJ&K 13100 Pakistan"}]},{"given":"Saiqa","family":"Andleeb","sequence":"additional","affiliation":[{"name":"Biotechnology Laboratory, Department of Zoology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad 13100 Pakistan"}]},{"given":"Maryum","family":"Bibi","sequence":"additional","affiliation":[{"name":"Computational Biology and Data Analysis Laboratory, Department of Computer Sciences & Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, AJ&K 13100 Pakistan"}]},{"given":"Syed Ali","family":"Abbas","sequence":"additional","affiliation":[{"name":"Computational Biology and Data Analysis Laboratory, Department of Computer Sciences & Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, AJ&K 13100 Pakistan"}]}],"member":"219","published-online":{"date-parts":[[2022,9,12]]},"reference":[{"key":"S0219720022500196BIB001","doi-asserted-by":"publisher","DOI":"10.1186\/1472-6807-7-1"},{"key":"S0219720022500196BIB002","doi-asserted-by":"publisher","DOI":"10.1002\/chem.200401030"},{"key":"S0219720022500196BIB003","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.6b00320"},{"key":"S0219720022500196BIB004","volume-title":"Human Anatomy","author":"McKinley M","year":"2014","edition":"4"},{"key":"S0219720022500196BIB005","doi-asserted-by":"publisher","DOI":"10.1046\/j.1365-2958.1998.00854.x"},{"key":"S0219720022500196BIB006","doi-asserted-by":"publisher","DOI":"10.1126\/science.1439808"},{"key":"S0219720022500196BIB007","first-page":"59","volume":"30","author":"Sharon N","year":"1995","journal-title":"Essays Biochem"},{"key":"S0219720022500196BIB008","doi-asserted-by":"publisher","DOI":"10.1002\/med.20216"},{"key":"S0219720022500196BIB009","doi-asserted-by":"publisher","DOI":"10.2174\/187152008783330833"},{"key":"S0219720022500196BIB010","doi-asserted-by":"publisher","DOI":"10.1016\/j.sbi.2010.06.008"},{"key":"S0219720022500196BIB011","doi-asserted-by":"publisher","DOI":"10.1016\/S0076-6879(10)78026-5"},{"key":"S0219720022500196BIB012","doi-asserted-by":"publisher","DOI":"10.1021\/ja511237n"},{"key":"S0219720022500196BIB013","doi-asserted-by":"crossref","first-page":"e289301","DOI":"10.1155\/2010\/289301","volume":"2010","author":"Someya S","year":"2010","journal-title":"Adv Bioinf"},{"key":"S0219720022500196BIB014","doi-asserted-by":"publisher","DOI":"10.1002\/cpps.75"},{"key":"S0219720022500196BIB015","doi-asserted-by":"crossref","first-page":"436036","DOI":"10.1155\/2010\/436036","volume":"2010","author":"Malik A","year":"2010","journal-title":"Adv Bioinf"},{"key":"S0219720022500196BIB016","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gky1049"},{"key":"S0219720022500196BIB017","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq003"},{"key":"S0219720022500196BIB018","doi-asserted-by":"publisher","DOI":"10.1007\/BF00994018"},{"key":"S0219720022500196BIB019","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"S0219720022500196BIB020","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1013203451"},{"key":"S0219720022500196BIB021","first-page":"2825","volume":"12","author":"Pedregosa F","year":"2011","journal-title":"J Mach Learn Res"},{"key":"S0219720022500196BIB022","doi-asserted-by":"publisher","DOI":"10.1186\/s13040-020-00231-w"},{"key":"S0219720022500196BIB023","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-15-291"},{"key":"S0219720022500196BIB024","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq112"},{"key":"S0219720022500196BIB025","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr513"},{"key":"S0219720022500196BIB026","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"S0219720022500196BIB027","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btv345"},{"key":"S0219720022500196BIB028","doi-asserted-by":"publisher","DOI":"10.1093\/protein\/5.5.373"},{"key":"S0219720022500196BIB029","doi-asserted-by":"publisher","DOI":"10.1016\/0022-5193(82)90191-6"},{"key":"S0219720022500196BIB030","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(76)90191-1"},{"key":"S0219720022500196BIB031","doi-asserted-by":"publisher","DOI":"10.1016\/0022-5193(67)90004-5"},{"key":"S0219720022500196BIB032","doi-asserted-by":"publisher","DOI":"10.1016\/0022-5193(81)90377-5"},{"key":"S0219720022500196BIB033","doi-asserted-by":"publisher","DOI":"10.2174\/157016409789973707"},{"key":"S0219720022500196BIB034","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010967008838"},{"key":"S0219720022500196BIB035","doi-asserted-by":"publisher","DOI":"10.1002\/bip.360270308"},{"key":"S0219720022500196BIB036","doi-asserted-by":"publisher","DOI":"10.1002\/ajpa.20250"},{"key":"S0219720022500196BIB037","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.92.19.8700"},{"key":"S0219720022500196BIB038","doi-asserted-by":"publisher","DOI":"10.1126\/science.185.4154.862"},{"key":"S0219720022500196BIB039","doi-asserted-by":"publisher","DOI":"10.1002\/prot.20045"},{"key":"S0219720022500196BIB040","doi-asserted-by":"publisher","DOI":"10.1002\/pmic.200401118"},{"key":"S0219720022500196BIB041","doi-asserted-by":"publisher","DOI":"10.1006\/bbrc.2000.3815"},{"key":"S0219720022500196BIB042","doi-asserted-by":"publisher","DOI":"10.1016\/S0006-3495(94)80782-9"},{"key":"S0219720022500196BIB043","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt072"},{"key":"S0219720022500196BIB044","first-page":"564","volume-title":"Pac Symp Biocomput","volume":"7","author":"Leslie C","year":"2002"},{"key":"S0219720022500196BIB045","doi-asserted-by":"publisher","DOI":"10.1038\/nbt0804-1035"},{"key":"S0219720022500196BIB046","doi-asserted-by":"publisher","DOI":"10.1002\/prot.25330"},{"key":"S0219720022500196BIB047","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts416"},{"key":"S0219720022500196BIB048","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-018-2448-z"},{"key":"S0219720022500196BIB049","first-page":"1137","volume-title":"Proc 14th Int Jt Conf Artif Intell","author":"Kohavi R","year":"1995"},{"key":"S0219720022500196BIB050","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143874"},{"key":"S0219720022500196BIB051","doi-asserted-by":"publisher","DOI":"10.1038\/75556"},{"key":"S0219720022500196BIB052","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-10-48"},{"key":"S0219720022500196BIB053","doi-asserted-by":"publisher","DOI":"10.1038\/234034a0"},{"key":"S0219720022500196BIB054","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720016500116"},{"issue":"1","key":"S0219720022500196BIB055","first-page":"168","volume":"17","author":"Tharwat A","year":"2020","journal-title":"Appl Comput Inf"},{"key":"S0219720022500196BIB056","doi-asserted-by":"publisher","DOI":"10.1002\/wcms.1225"}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720022500196","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,25]],"date-time":"2023-11-25T11:07:54Z","timestamp":1700910474000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0219720022500196"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,12]]},"references-count":56,"journal-issue":{"issue":"05","published-print":{"date-parts":[[2022,10]]}},"alternative-id":["10.1142\/S0219720022500196"],"URL":"https:\/\/doi.org\/10.1142\/s0219720022500196","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"type":"print","value":"0219-7200"},{"type":"electronic","value":"1757-6334"}],"subject":[],"published":{"date-parts":[[2022,9,12]]},"article-number":"2250019"}}