{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T05:42:51Z","timestamp":1764740571852,"version":"3.41.2"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T00:00:00Z","timestamp":1659571200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>A newly invented post-translational modification (PTM), phosphoglycerylation, has shown its essential role in the construction and functional properties of proteins and dangerous human diseases. Hence, it is very urgent to know about the molecular mechanism behind the phosphoglycerylation process to develop the drugs for related diseases. But accurately identifying of phosphoglycerylation site from a protein sequence in a laboratory is a very difficult and challenging task. Hence, the construction of an efficient computation model is greatly sought for this purpose. A little number of computational models are currently available for identifying the phosphoglycerylation sites, which are not able to reach their prediction capability at a satisfactory level. Therefore, an effective predictor named PLP_FS has been designed and constructed to identify phosphoglycerylation sites in this study. For the training purpose, an optimal number of feature sets was obtained by fusion of multiple F_Score feature selection techniques from the features generated by three types of sequence-based feature extraction methods and fitted with the support vector machine classification technique to the prediction model. On the other hand, the k-neighbor near cleaning and SMOTE methods were also implemented to balance the benchmark dataset. The suggested model in 10-fold cross-validation obtained an accuracy of 99.22%, a sensitivity of 98.17% and a specificity of 99.75% according to the experimental findings, which are better than other currently available predictors for accurately identifying the phosphoglycerylation sites.<\/jats:p>","DOI":"10.1093\/bib\/bbac306","type":"journal-article","created":{"date-parts":[[2022,8,5]],"date-time":"2022-08-05T08:00:52Z","timestamp":1659686452000},"source":"Crossref","is-referenced-by-count":8,"title":["PLP_FS: prediction of lysine phosphoglycerylation sites in protein using support vector machine and fusion of multiple F_Score feature selection"],"prefix":"10.1093","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8042-3040","authenticated-orcid":false,"given":"Md","family":"Sohrawordi","sequence":"first","affiliation":[{"name":"Dept. of Computer Science and Engineering, Rajshahi University of Engineering and Technology , Rajshahi, Bangladesh"},{"name":"Dept. of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University , Dinajpur, Bangladesh"}]},{"given":"Md Ali","family":"Hossain","sequence":"additional","affiliation":[{"name":"Dept. of Computer Science and Engineering, Rajshahi University of Engineering and Technology , Rajshahi, Bangladesh"}]},{"given":"Md Al Mehedi","family":"Hasan","sequence":"additional","affiliation":[{"name":"Dept. of Computer Science and Engineering, Rajshahi University of Engineering and Technology , Rajshahi, Bangladesh"}]}],"member":"286","published-online":{"date-parts":[[2022,8,4]]},"reference":[{"key":"2022092013223766300_ref1","doi-asserted-by":"crossref","first-page":"113955","DOI":"10.1016\/j.ab.2020.113955","article-title":"DeepPPSite: a deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information","volume":"612","author":"Ahmed","year":"2021","journal-title":"Anal Biochem"},{"key":"2022092013223766300_ref2","first-page":"1","article-title":"Succinylation site prediction based on protein sequences using the IFS-lightgbm (BO) model","volume":"2020","author":"Zhang","year":"2020","journal-title":"Comput Math Methods Med"},{"key":"2022092013223766300_ref3","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.3390\/genes11091023","article-title":"Accurately predicting glutarylation sites using sequential bi-peptide-based evolutionary features","volume":"11","author":"Arafat","year":"2020","journal-title":"Genes"},{"key":"2022092013223766300_ref4","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-020-3342-z","article-title":"DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction","volume":"21","author":"Thapa","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2022092013223766300_ref5","doi-asserted-by":"crossref","DOI":"10.3389\/fbioe.2019.00311","article-title":"Identifying acetylation protein by fusing its pseaac and functional domain annotation","volume":"7","author":"Qiu","year":"2019","journal-title":"Front Bioeng Biotechnol"},{"key":"2022092013223766300_ref6","doi-asserted-by":"crossref","first-page":"1524","DOI":"10.3390\/genes11121524","article-title":"Ram-PGK: prediction of lysine phosphoglycerylation based on residue adjacency matrix","volume":"11","author":"Chandra","year":"2020","journal-title":"Genes"},{"key":"2022092013223766300_ref7","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1186\/s12859-020-03916-5","article-title":"IDPGK: characterization and identification of lysine phosphoglycerylation sites based on sequence-based features","volume":"21","author":"Huang","year":"2020","journal-title":"BMC Bioinformatics"},{"volume-title":"2020 IEEE Region 10 Symposium (TENSYMP)","year":"2020","author":"Sohrawordi","key":"2022092013223766300_ref8"},{"key":"2022092013223766300_ref9","doi-asserted-by":"crossref","first-page":"1291","DOI":"10.1007\/s10989-019-09931-2","article-title":"Prediction of S-sulfenylation sites using statistical moments based features via Chou\u2019s 5-step rule","volume":"26","author":"Butt","year":"2019","journal-title":"International Journal of Peptide Research and Therapeutics"},{"key":"2022092013223766300_ref10","doi-asserted-by":"crossref","first-page":"13843","DOI":"10.1007\/s00521-020-04792-z","article-title":"Sulsite-GTB: identification of protein S-sulfenylation sites by fusing multiple feature information and gradient tree boosting","volume":"32","author":"Wang","year":"2020","journal-title":"Neural Computing and Applications"},{"key":"2022092013223766300_ref11","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1021\/acs.jproteome.0c00431","article-title":"DeepPSP: a global\u2013local information-based deep neural network for the prediction of protein phosphorylation sites","volume":"20","author":"Guo","year":"2020","journal-title":"J Proteome Res"},{"key":"2022092013223766300_ref12","doi-asserted-by":"crossref","first-page":"844","DOI":"10.1093\/glycob\/cwl009","article-title":"Analysis and prediction of mammalian protein glycation","volume":"16","author":"Johansen","year":"2006","journal-title":"Glycobiology"},{"key":"2022092013223766300_ref13","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1126\/science.1238327","article-title":"Functional lysine modification by an intrinsically reactive primary glycolytic metabolite","volume":"341","author":"Moellering","year":"2013","journal-title":"Science"},{"key":"2022092013223766300_ref14","doi-asserted-by":"crossref","first-page":"874","DOI":"10.1039\/C6MB00875E","article-title":"Predicting protein lysine phosphoglycerylation sites by hybridizing many sequence based features","volume":"13","author":"Chen","year":"2017","journal-title":"Mol Biosyst"},{"key":"2022092013223766300_ref15","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1051\/bmdcn\/2017070315","article-title":"Transglutaminase 2 in human diseases","volume":"7","author":"Szondy","year":"2017","journal-title":"Biomedicine"},{"key":"2022092013223766300_ref16","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1111\/j.1742-1241.2011.02795.x","article-title":"Disorders of glucose metabolism and insulin resistance in patients with obstructive sleep apnoea syndrome","volume":"66","author":"Bulcun","year":"2011","journal-title":"Int J Clin Pract"},{"key":"2022092013223766300_ref17","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1093\/cvr\/cvr071","article-title":"Glucose metabolism and cardiac hypertrophy","volume":"90","author":"Kolwicz","year":"2011","journal-title":"Cardiovasc Res"},{"key":"2022092013223766300_ref18","doi-asserted-by":"crossref","DOI":"10.1186\/s12860-019-0240-1","article-title":"Bigram-PGK: phosphoglycerylation prediction using the technique of bigram probabilities of position specific scoring matrix","volume":"20","author":"Chandra","year":"2019","journal-title":"BMC Molecular and Cell Biology"},{"key":"2022092013223766300_ref19","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1016\/j.ygeno.2017.10.008","article-title":"IKCR-PseEns: identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier","volume":"110","author":"Qiu","year":"2018","journal-title":"Genomics"},{"key":"2022092013223766300_ref20","doi-asserted-by":"crossref","first-page":"946","DOI":"10.7150\/ijbs.24121","article-title":"PTM-SSMP: a web server for predicting different types of post-translational modification sites using novel site-specific modification profile","volume":"14","author":"Liu","year":"2018","journal-title":"Int J Biol Sci"},{"key":"2022092013223766300_ref21","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1016\/j.jtbi.2015.04.016","article-title":"Phogly\u2013PSEAAC: prediction of lysine phosphoglycerylation in proteins incorporating with position-specific propensity","volume":"379","author":"Xu","year":"2015","journal-title":"J Theor Biol"},{"key":"2022092013223766300_ref22","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1016\/j.jtbi.2016.02.020","article-title":"Predicting lysine phosphoglycerylation with fuzzy SVM by incorporating K-spaced amino acid pairs into Chou\u2019s general pseaac","volume":"397","author":"Ju","year":"2016","journal-title":"J Theor Biol"},{"key":"2022092013223766300_ref23","doi-asserted-by":"crossref","DOI":"10.1186\/s12864-018-5383-5","article-title":"Evolstruct-Phogly: incorporating structural properties and evolutionary information from profile bigrams for the phosphoglycerylation prediction","volume":"19","author":"Chandra","year":"2019","journal-title":"BMC Genomics"},{"key":"2022092013223766300_ref24","article-title":"Predphogly-site: predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PSEAAC and addressing data imbalance","volume":"16","author":"Ahmed","year":"2021","journal-title":"PLOS ONE"},{"key":"2022092013223766300_ref25","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1016\/j.jgg.2017.03.007","article-title":"PLMD: an updated data resource of protein lysine modifications","volume":"44","author":"Xu","year":"2017","journal-title":"J Genet Genomics"},{"key":"2022092013223766300_ref26","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"CD-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2022092013223766300_ref27","article-title":"Characterization and identification of lysine crotonylation sites based on machine learning method on both plant and mammalian","volume":"10","author":"Wang","year":"2020","journal-title":"Sci Rep"},{"key":"2022092013223766300_ref28","article-title":"Deep-KCR: accurate detection of lysine crotonylation sites using deep learning method","volume":"22","author":"Lv","year":"2020","journal-title":"Brief Bioinform"},{"key":"2022092013223766300_ref29","first-page":"1","article-title":"IRICE-MS: an integrated XGBoost model for detecting multitype post-translational modification sites in Rice","volume":"23","author":"Lv","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022092013223766300_ref30","doi-asserted-by":"crossref","DOI":"10.1038\/s41598-020-67384-w","article-title":"Incorporating hybrid models into lysine malonylation sites prediction on mammalian and plant proteins","volume":"10","author":"Chung","year":"2020","journal-title":"Sci Rep"},{"key":"2022092013223766300_ref31","article-title":"Stallion: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction","author":"Basith","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022092013223766300_ref32","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-018-2249-4","article-title":"Detecting succinylation sites from protein sequences using ensemble support vector machine","volume":"19","author":"Ning","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2022092013223766300_ref33","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1016\/j.biochi.2021.10.001","article-title":"Prediction of lysine formylation sites using support vector machine based on the sample selection from majority classes and synthetic minority over-sampling techniques","volume":"192","author":"Sohrawordi","year":"2022","journal-title":"Biochimie"},{"key":"2022092013223766300_ref34","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-018-2394-9","article-title":"Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites","volume":"19","author":"Huang","year":"2019","journal-title":"BMC Bioinformatics"},{"volume-title":"2018 International Conference on Current Trends towards Converging Technologies (ICCTCT)","year":"2018","author":"Spelmen","key":"2022092013223766300_ref35"},{"key":"2022092013223766300_ref36","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1109\/TCBB.2019.2957758","article-title":"Formator: predicting lysine formylation sites based on the most distant undersampling and safe-level synthetic minority oversampling","volume":"18","author":"Jia","year":"2021","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2022092013223766300_ref37","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-021-04101-y","article-title":"Prediction and analysis of multiple protein lysine modified sites based on conditional Wasserstein generative adversarial networks","volume":"22","author":"Yang","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"2022092013223766300_ref38","doi-asserted-by":"crossref","first-page":"2120","DOI":"10.3390\/ijms22042120","article-title":"Pup-fuse: prediction of protein pupylation sites by integrating multiple sequence representations","volume":"22","author":"Auliah","year":"2021","journal-title":"Int J Mol Sci"},{"key":"2022092013223766300_ref39","doi-asserted-by":"crossref","first-page":"717","DOI":"10.3390\/genes12050717","article-title":"Ubicomb: a hybrid deep learning model for predicting plant-specific protein ubiquitylation sites","volume":"12","author":"Siraj","year":"2021","journal-title":"Genes"},{"key":"2022092013223766300_ref40","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"Smote: synthetic minority over-sampling technique","volume":"16","author":"Chawla","year":"2002","journal-title":"Journal of Artificial Intelligence Research"},{"key":"2022092013223766300_ref41","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-021-04134-3","article-title":"CarSite-II: an integrated classification algorithm for identifying carbonylated sites based on K-means similarity-based undersampling and synthetic minority oversampling techniques","volume":"22","author":"Zuo","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"2022092013223766300_ref42","doi-asserted-by":"crossref","first-page":"113903","DOI":"10.1016\/j.ab.2020.113903","article-title":"Prediction of protein crotonylation sites through lightgbm classifier based on smote and elastic net","volume":"609","author":"Liu","year":"2020","journal-title":"Anal Biochem"},{"key":"2022092013223766300_ref43","doi-asserted-by":"crossref","first-page":"2395","DOI":"10.1093\/bioinformatics\/bty995","article-title":"Protein\u2013protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique","volume":"35","author":"Wang","year":"2018","journal-title":"Bioinformatics"},{"key":"2022092013223766300_ref44","first-page":"1","article-title":"An improved feature selection based on effective range for classification","volume":"2014","author":"Wang","year":"2014","journal-title":"Scientific World Journal"},{"key":"2022092013223766300_ref45","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Machine Learning"},{"key":"2022092013223766300_ref46","doi-asserted-by":"crossref","first-page":"1536","DOI":"10.1093\/bioinformatics\/btl151","article-title":"Two sample logo: a graphical representation of the differences between two sets of sequence alignments","volume":"22","author":"Vacic","year":"2006","journal-title":"Bioinformatics"},{"key":"2022092013223766300_ref47","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkt1093","article-title":"CPLM: a database of protein lysine modifications","volume":"42","author":"Liu","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2022092013223766300_ref48","article-title":"The universal protein resource (UniProt)","volume":"37","author":"Amos","year":"2009","journal-title":"Nucleic Acids Res"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac306\/45938241\/bbac306.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac306\/45938241\/bbac306.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,20]],"date-time":"2022-09-20T17:59:10Z","timestamp":1663696750000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac306\/6655632"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,4]]},"references-count":48,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac306","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2022,9]]},"published":{"date-parts":[[2022,8,4]]},"article-number":"bbac306"}}