{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T19:05:28Z","timestamp":1770750328274,"version":"3.50.0"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T00:00:00Z","timestamp":1701907200000},"content-version":"vor","delay-in-days":15,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Korea Health Technology R&D Project"},{"DOI":"10.13039\/501100003710","name":"Korea Health Industry Development Institute","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003710","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ministry of Health & Welfare, Republic of Korea","award":["HI23C0701"],"award-info":[{"award-number":["HI23C0701"]}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014188","name":"Ministry of Science and ICT","doi-asserted-by":"publisher","award":["2021R1A2C1014338"],"award-info":[{"award-number":["2021R1A2C1014338"]}],"id":[{"id":"10.13039\/501100014188","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014188","name":"Ministry of Science and ICT","doi-asserted-by":"publisher","award":["RS-2023-00217881"],"award-info":[{"award-number":["RS-2023-00217881"]}],"id":[{"id":"10.13039\/501100014188","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014188","name":"Ministry of Science and ICT","doi-asserted-by":"publisher","award":["2021R1C1C1007833"],"award-info":[{"award-number":["2021R1C1C1007833"]}],"id":[{"id":"10.13039\/501100014188","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,11,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The worldwide appearance of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has generated significant concern and posed a considerable challenge to global health. Phosphorylation is a common post-translational modification that affects many vital cellular functions and is closely associated with SARS-CoV-2 infection. Precise identification of phosphorylation sites could provide more in-depth insight into the processes underlying SARS-CoV-2 infection and help alleviate the continuing COVID-19 crisis. Currently, available computational tools for predicting these sites lack accuracy and effectiveness. In this study, we designed an innovative meta-learning model, Meta-Learning for Serine\/Threonine Phosphorylation (MeL-STPhos), to precisely identify protein phosphorylation sites. We initially performed a comprehensive assessment of 29 unique sequence-derived features, establishing prediction models for each using 14 renowned machine learning methods, ranging from traditional classifiers to advanced deep learning algorithms. We then selected the most effective model for each feature by integrating the predicted values. Rigorous feature selection strategies were employed to identify the optimal base models and classifier(s) for each cell-specific dataset. To the best of our knowledge, this is the first study to report two cell-specific models and a generic model for phosphorylation site prediction by utilizing an extensive range of sequence-derived features and machine learning algorithms. Extensive cross-validation and independent testing revealed that MeL-STPhos surpasses existing state-of-the-art tools for phosphorylation site prediction. We also developed a publicly accessible platform at https:\/\/balalab-skku.org\/MeL-STPhos. We believe that MeL-STPhos will serve as a valuable tool for accelerating the discovery of serine\/threonine phosphorylation sites and elucidating their role in post-translational regulation.<\/jats:p>","DOI":"10.1093\/bib\/bbad433","type":"journal-article","created":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T06:35:40Z","timestamp":1701930940000},"source":"Crossref","is-referenced-by-count":35,"title":["Advancing the accuracy of SARS-CoV-2 phosphorylation site detection via meta-learning approach"],"prefix":"10.1093","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8086-6722","authenticated-orcid":false,"given":"Nhat Truong","family":"Pham","sequence":"first","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Le Thi","family":"Phan","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Jimin","family":"Seo","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Yeonwoo","family":"Kim","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Minkyung","family":"Song","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Sukchan","family":"Lee","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"given":"Young-Jun","family":"Jeon","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0697-9419","authenticated-orcid":false,"given":"Balachandran","family":"Manavalan","sequence":"additional","affiliation":[{"name":"Department of Integrative Biotechnology and of Biopharmaceutical Convergence , Sungkyunkwan University, Suwon 16419, Gyeonggi-do, Republic of Korea"}]}],"member":"286","published-online":{"date-parts":[[2023,12,6]]},"reference":[{"key":"2023122811533428800_ref1","doi-asserted-by":"crossref","DOI":"10.1016\/j.scitotenv.2020.138996","article-title":"Evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as coronavirus disease 2019 (COVID-19) pandemic: a global health emergency","volume":"730","author":"Acter","year":"2020","journal-title":"Sci Total Environ"},{"key":"2023122811533428800_ref2","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1038\/s41586-020-2286-9","article-title":"A SARS-CoV-2 protein interaction map reveals targets for drug repurposing","volume":"583","author":"Gordon","year":"2020","journal-title":"Nature"},{"key":"2023122811533428800_ref3","doi-asserted-by":"crossref","first-page":"916","DOI":"10.1158\/2159-8290.CD-20-0559","article-title":"The landscape of human cancer proteins targeted by SARS-CoV-2","volume":"10","author":"Tutuncuoglu","year":"2020","journal-title":"Cancer Discov"},{"key":"2023122811533428800_ref4","doi-asserted-by":"crossref","DOI":"10.3389\/fimmu.2022.829474","article-title":"SARS-CoV-2 infection triggers phosphorylation: potential target for anti-COVID-19 therapeutics","volume":"13","author":"Chatterjee","year":"2022","journal-title":"Front Immunol"},{"key":"2023122811533428800_ref5","doi-asserted-by":"crossref","DOI":"10.15252\/msb.202110823","article-title":"Human phospho-signaling networks of SARS-CoV-2 infection are rewired by population genetic variants","volume":"18","author":"Pellegrina","year":"2022","journal-title":"Mol Syst Biol"},{"key":"2023122811533428800_ref6","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/j.virusres.2007.07.012","article-title":"Regulation of positive-strand RNA virus replication: the emerging role of phosphorylation","volume":"129","author":"Jakubiec","year":"2007","journal-title":"Virus Res"},{"key":"2023122811533428800_ref7","doi-asserted-by":"crossref","DOI":"10.1016\/j.jsb.2022.107879","article-title":"Structural basis for SARS-CoV-2 nucleocapsid (N) protein recognition by 14-3-3 proteins","volume":"214","author":"Eisenreichova","year":"2022","journal-title":"J Struct Biol"},{"key":"2023122811533428800_ref8","doi-asserted-by":"crossref","DOI":"10.1016\/j.jmb.2021.166875","article-title":"The mechanism of SARS-CoV-2 nucleocapsid protein recognition by the human 14-3-3 proteins","volume":"433","author":"Tugaeva","year":"2021","journal-title":"J Mol Biol"},{"key":"2023122811533428800_ref9","article-title":"Novel inhibitors to ADP ribose phosphatase of SARS-CoV-2 identified by structure-based high throughput virtual screening and molecular dynamics simulations","volume":"140","author":"Patel","year":"2021","journal-title":"Comput Biol Med"},{"key":"2023122811533428800_ref10","doi-asserted-by":"crossref","first-page":"894","DOI":"10.1038\/s41592-019-0499-3","article-title":"High throughput discovery of functional protein modifications by Hotspot Thermal Profiling","volume":"16","author":"Huang","year":"2019","journal-title":"Nat Methods"},{"key":"2023122811533428800_ref11","doi-asserted-by":"crossref","first-page":"2586","DOI":"10.1074\/mcp.M110.001388","article-title":"Musite, a tool for global prediction of general and kinase-specific phosphorylation sites","volume":"9","author":"Gao","year":"2010","journal-title":"Mol Cell Proteomics"},{"key":"2023122811533428800_ref12","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.1007\/s00726-014-1711-5","article-title":"PhosphoSVM: prediction of phosphorylation sites by integrating various protein sequence attributes with a support vector machine","volume":"46","author":"Dou","year":"2014","journal-title":"Amino Acids"},{"key":"2023122811533428800_ref13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2016\/3281590","article-title":"RF-Phos: a novel general phosphorylation site prediction tool based on random forest","volume":"2016","author":"Ismail","year":"2016","journal-title":"Biomed Res Int"},{"key":"2023122811533428800_ref14","doi-asserted-by":"crossref","first-page":"6862","DOI":"10.1038\/s41598-017-07199-4","article-title":"PhosphoPredict: a bioinformatics tool for prediction of human kinase-specific phosphorylation substrates and sites by integrating heterogeneous feature selection","volume":"7","author":"Song","year":"2017","journal-title":"Sci Rep"},{"key":"2023122811533428800_ref15","doi-asserted-by":"crossref","first-page":"3909","DOI":"10.1093\/bioinformatics\/btx496","article-title":"MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction","volume":"33","author":"Wang","year":"2017","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref16","doi-asserted-by":"crossref","first-page":"2766","DOI":"10.1093\/bioinformatics\/bty1051","article-title":"DeepPhos: prediction of protein phosphorylation sites with deep learning","volume":"35","author":"Luo","year":"2019","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref17","doi-asserted-by":"crossref","first-page":"W140","DOI":"10.1093\/nar\/gkaa275","article-title":"MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization","volume":"48","author":"Wang","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2023122811533428800_ref18","article-title":"DeepIPs: comprehensive assessment and computational identification of phosphorylation sites of SARS-CoV-2 infection using a deep learning-based approach","volume":"22","author":"Lv","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref19","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023122811533428800_ref20","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1038\/s41586-021-03493-4","article-title":"Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV","volume":"594","author":"Stukalov","year":"2021","journal-title":"Nature"},{"key":"2023122811533428800_ref21","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1093\/bioinformatics\/btq003","article-title":"CD-HIT Suite: a web server for clustering and comparing biological sequences","volume":"26","author":"Huang","year":"2010","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref22","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1016\/j.cell.2020.06.034","article-title":"The global phosphorylation landscape of SARS-CoV-2 infection","volume":"182","author":"Bouhaddou","year":"2020","journal-title":"Cell"},{"key":"2023122811533428800_ref23","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkz740","article-title":"BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches","volume":"47","author":"Liu","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023122811533428800_ref24","doi-asserted-by":"crossref","first-page":"1047","DOI":"10.1093\/bib\/bbz041","article-title":"iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data","volume":"21","author":"Chen","year":"2020","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref25","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1093\/bioinformatics\/bty140","article-title":"iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref26","doi-asserted-by":"crossref","first-page":"W434","DOI":"10.1093\/nar\/gkac351","article-title":"iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets","volume":"50","author":"Chen","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023122811533428800_ref27","doi-asserted-by":"crossref","first-page":"2185","DOI":"10.1093\/bib\/bby079","article-title":"Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework","volume":"20","author":"Zhang","year":"2019","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref28","first-page":"1","article-title":"A Parkinson\u2019s auxiliary diagnosis algorithm based on a hyperparameter optimization method of deep learning","author":"Wang","year":"2023","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2023122811533428800_ref29","first-page":"1","article-title":"Towards automated optimization of residual convolutional neural networks for electrocardiogram classification","author":"Fki","year":"2023","journal-title":"Cognit Comput"},{"key":"2023122811533428800_ref30","first-page":"50","article-title":"Gougerot-Sjogren syndrome associated with a yersiniosis","volume":"14","author":"Fischer","year":"1985","journal-title":"Presse Med"},{"key":"2023122811533428800_ref31","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab252","article-title":"Integrative machine learning framework for the identification of cell-specific enhancers from the human genome","volume":"22","author":"Basith","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref32","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab376","article-title":"STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction","volume":"23","author":"Basith","year":"2022","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref33","doi-asserted-by":"crossref","DOI":"10.1016\/j.jmb.2022.167604","article-title":"MLCPP 2.0: an updated cell-penetrating peptides and their uptake efficiency predictor","volume":"434","author":"Manavalan","year":"2022","journal-title":"J Mol Biol"},{"key":"2023122811533428800_ref34","doi-asserted-by":"crossref","first-page":"0016","DOI":"10.34133\/research.0016","article-title":"An effective integrated machine learning framework for identifying severity of tomato yellow leaf curl virus and their experimental validation","volume":"6","author":"Bupi","year":"2023","journal-title":"Research"},{"key":"2023122811533428800_ref35","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac243","article-title":"TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization","volume":"23","author":"Jeon","year":"2022","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref36","doi-asserted-by":"crossref","DOI":"10.1016\/j.jmb.2022.167549","article-title":"THRONE: a new approach for accurate prediction of human RNA N7-methylguanosine sites","volume":"434","author":"Shoombuatong","year":"2022","journal-title":"J Mol Biol"},{"key":"2023122811533428800_ref37","doi-asserted-by":"crossref","first-page":"3350","DOI":"10.1093\/bioinformatics\/btaa160","article-title":"HLPpred-Fuse: improved and robust prediction of hemolytic peptide and its activity by fusing multiple feature representation","volume":"36","author":"Hasan","year":"2020","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref38","doi-asserted-by":"crossref","first-page":"2075","DOI":"10.1093\/bioinformatics\/bty943","article-title":"Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique","volume":"35","author":"Dao","year":"2019","journal-title":"Bioinformatics"},{"key":"2023122811533428800_ref39","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/j.omtn.2019.08.011","article-title":"SDM6A: a web-based integrative machine-learning framework for predicting 6mA sites in the rice genome","volume":"18","author":"Basith","year":"2019","journal-title":"Mol Ther Nucleic Acids"},{"key":"2023122811533428800_ref40","doi-asserted-by":"crossref","DOI":"10.1128\/JVI.01257-21","article-title":"The NF-kappaB transcriptional footprint is essential for SARS-CoV-2 replication","volume":"95","author":"Nilsson-Payant","year":"2021","journal-title":"J Virol"},{"key":"2023122811533428800_ref41","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.omtn.2023.02.027","article-title":"IPs-GRUAtt: an attention-based bidirectional gated recurrent unit network for predicting phosphorylation sites of SARS-CoV-2 infection","volume":"32","author":"Zhang","year":"2023","journal-title":"Mol Ther Nucleic Acids"},{"key":"2023122811533428800_ref42","article-title":"Phosphorylation time-course study of the response during adenovirus type 2 infection","volume":"20","author":"Valdes","year":"2020","journal-title":"Proteomics"},{"key":"2023122811533428800_ref43","doi-asserted-by":"crossref","first-page":"1749","DOI":"10.1007\/s40262-022-01180-9","article-title":"DeepIDC: a prediction framework of injectable drug combination based on heterogeneous information and deep learning","volume":"61","author":"Yang","year":"2022","journal-title":"Clin Pharmacokinet"},{"key":"2023122811533428800_ref44","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac395","article-title":"iLoc-miRNA: extracellular\/intracellular miRNA prediction using deep BiLSTM with attention mechanism","volume":"23","author":"Zhang","year":"2022","journal-title":"Brief Bioinform"},{"key":"2023122811533428800_ref45","article-title":"SiameseCPP: a sequence-based Siamese network to predict cell-penetrating peptides by contrastive learning","volume":"24","author":"Zhang","year":"2023","journal-title":"Brief Bioinform"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/1\/bbad433\/54878961\/bbad433.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/1\/bbad433\/54878961\/bbad433.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,28]],"date-time":"2023-12-28T11:54:23Z","timestamp":1703764463000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad433\/7459584"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,22]]},"references-count":45,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,11,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad433","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,1,1]]},"published":{"date-parts":[[2023,11,22]]},"article-number":"bbad433"}}