{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T07:44:34Z","timestamp":1773992674094,"version":"3.50.1"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["12101480"],"award-info":[{"award-number":["12101480"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Basic Research Program of Shaanxi","award":["2021JM-115"],"award-info":[{"award-number":["2021JM-115"]}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["JB210715"],"award-info":[{"award-number":["JB210715"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>5-Methylcytosine (m5C) is a crucial post-transcriptional modification. With the development of technology, it is widely found in various RNAs. Numerous studies have indicated that m5C plays an essential role in various activities of organisms, such as tRNA recognition, stabilization of RNA structure, RNA metabolism and so on. Traditional identification is costly and time-consuming by wet biological experiments. Therefore, computational models are commonly used to identify the m5C sites. 
Due to the vast computing advantages of deep learning, it is feasible to construct the predictive model through deep learning algorithms.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this study, we construct a model to identify m5C based on a deep fusion approach with an improved residual network. First, sequence features are extracted from the RNA sequences using Kmer, K-tuple nucleotide frequency component (KNFC), Pseudo dinucleotide composition (PseDNC) and Physical and chemical property (PCP). Kmer and KNFC extract information from a statistical point of view. PseDNC and PCP extract information from the physicochemical properties of RNA sequences. Then, two parts of information are fused with new features using bidirectional long- and short-term memory and attention mechanisms, respectively. Immediately after, the fused features are fed into the improved residual network for classification. Finally, 10-fold cross-validation and independent set testing are used to verify the credibility of the model. The results show that the accuracy reaches 91.87%, 95.55%, 92.27% and 95.60% on the training sets and independent test sets of Arabidopsis thaliana and M. musculus, respectively. 
This is a considerable improvement compared to previous studies and demonstrates the robust performance of our model.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The data and code related to the study are available at https:\/\/github.com\/alivelxj\/m5c-DFRESG.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac532","type":"journal-article","created":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T14:19:12Z","timestamp":1658499552000},"page":"4271-4277","source":"Crossref","is-referenced-by-count":19,"title":["An improved residual network using deep fusion for identifying RNA 5-methylcytosine sites"],"prefix":"10.1093","volume":"38","author":[{"given":"Xinjie","family":"Li","sequence":"first","affiliation":[{"name":"School of Mathematics and Statistics, Xidian University , Xi\u2019an 710071, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8786-0940","authenticated-orcid":false,"given":"Shengli","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics, Xidian University , Xi\u2019an 710071, P. R. China"}]},{"given":"Hongyan","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics, Xidian University , Xi\u2019an 710071, P. R. China"}]}],"member":"286","published-online":{"date-parts":[[2022,7,22]]},"reference":[{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1016\/j.ajhg.2012.03.021","article-title":"Mutations in NSUN2 cause autosomal-recessive intellectual disability","volume":"90","author":"Abbasi-Moheb","year":"2012","journal-title":"Am. J. Hum. 
Genet"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"4869","DOI":"10.1093\/bioinformatics\/btaa609","article-title":"iPromoter-BnCNN: a novel branched CNN-based predictor for identifying and classifying sigma promoters","volume":"36","author":"Amin","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"i237","DOI":"10.1093\/bioinformatics\/bty228","article-title":"Convolutional neural networks for classification of alignments of non-coding RNA sequences","volume":"34","author":"Aoki","year":"2018","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1027","DOI":"10.1016\/j.omtn.2021.10.012","article-title":"Staem5: a novel computational approach for accurate prediction of m5C site","volume":"26","author":"Chai","year":"2021","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"103899","DOI":"10.1016\/j.compbiomed.2020.103899","article-title":"Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier","volume":"123","author":"Chen","year":"2020","journal-title":"Comput. Biol. Med"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1016\/j.ab.2015.08.021","article-title":"iRNA-Methyl: identifying N6-methyladenosine sites using pseudo nucleotide composition","volume":"490","author":"Chen","year":"2015","journal-title":"Anal. 
Biochem"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"489","DOI":"10.1186\/s12859-020-03828-4","article-title":"m5CPred-SVM: a novel method for predicting m5C sites of RNA","volume":"21","author":"Chen","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"10249","DOI":"10.1021\/bi00089a047","article-title":"5-Methylcytidine is required for cooperative binding of Mg2+ and a conformational transition at the anticodon stem-loop of yeast phenylalanine tRNA","volume":"32","author":"Chen","year":"1993","journal-title":"Biochemistry"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1387","DOI":"10.1016\/j.molp.2017.09.013","article-title":"5-Methylcytosine RNA methylation in Arabidopsis thaliana","volume":"10","author":"Cui","year":"2017","journal-title":"Mol. Plant"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1016\/j.omtn.2020.06.004","article-title":"Prediction of m5C modifications in RNA sequences by combining multiple sequence features","volume":"21","author":"Dou","year":"2020","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0076-6879(07)25001-3","article-title":"Identifying modifications in RNA by MALDI mass spectrometry","volume":"425","author":"Douthwaite","year":"2007","journal-title":"Methods Enzymol"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"9373","DOI":"10.1073\/pnas.83.24.9373","article-title":"Improved free-energy parameters for predictions of RNA duplex stability","volume":"83","author":"Freier","year":"1986","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1827","DOI":"10.1073\/pnas.89.5.1827","article-title":"A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands","volume":"89","author":"Frommer","year":"1992","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1016\/j.canlet.2009.08.004","article-title":"Genomic gain of 5p15 leads to over-expression of Misu (NSUN2) in breast cancer","volume":"289","author":"Frye","year":"2010","journal-title":"Cancer Lett"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1346","DOI":"10.1126\/science.aau1646","article-title":"RNA modifications modulate gene expression during development","volume":"361","author":"Frye","year":"2018","journal-title":"Science"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1632","DOI":"10.1261\/rna.043398.113","article-title":"A cluster of methylations in the domain IV of 25S rRNA is required for ribosome stability","volume":"20","author":"Gigova","year":"2014","journal-title":"RNA"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1176","DOI":"10.1002\/humu.22897","article-title":"Defects in tRNA anticodon loop 2'-O-methylation are implicated in nonsyndromic X-linked intellectual disability due to mutations in FTSJ1","volume":"36","author":"Guy","year":"2015","journal-title":"Hum. 
Mutat"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"2009","DOI":"10.1093\/bioinformatics\/bty937","article-title":"Identifying antimicrobial peptides using word embedding with deep recurrent neural networks","volume":"35","author":"Hamid","year":"2019","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","first-page":"770","author":"He","year":"2016"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"3120","DOI":"10.1093\/bioinformatics\/btab354","article-title":"NeuralPolish: a novel nanopore polishing method based on alignment matrix construction and orthogonal Bi-GRU networks","volume":"37","author":"Huang","year":"2021","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1128\/MCB.01523-12","article-title":"The mouse cytosine-5 RNA methyltransferase NSun2 is a component of the chromatoid body and required for testis differentiation","volume":"33","author":"Hussain","year":"2013","journal-title":"Mol. Cell. Biol"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"2986","DOI":"10.1093\/bioinformatics\/btaa074","article-title":"PmliPred: a method based on hybrid model and fuzzy decision for plant miRNA-lncRNA interaction prediction","volume":"36","author":"Kang","year":"2020","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"458","DOI":"10.1038\/nbt.2566","article-title":"Identification of direct targets and modified bases of RNA cytosine methyltransferases","volume":"31","author":"Khoddami","year":"2013","journal-title":"Nat. Biotechnol"},{"key":"2023041408370176400_","first-page":"1","article-title":"DeepATT: a hybrid category attention neural network for identifying functional effects of DNA sequences","volume":"22","author":"Li","year":"2020","journal-title":"Brief. 
Bioinform"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1093\/bioinformatics\/17.12.1131","article-title":"Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA\/KNN method","volume":"17","author":"Li","year":"2001","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"732","DOI":"10.1093\/bioinformatics\/btx679","article-title":"Chromatin accessibility prediction via a hybrid deep convolutional neural network","volume":"34","author":"Liu","year":"2018","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"982","DOI":"10.1093\/bib\/bbz048","article-title":"Evaluation of different computational methods on 5-methylcytosine sites identification","volume":"21","author":"Lv","year":"2020","journal-title":"Brief. Bioinform"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"bbab031","DOI":"10.1093\/bib\/bbab031","article-title":"A sequence-based deep learning approach to predict CTCF-mediated chromatin loop","volume":"22","author":"Lv","year":"2021","journal-title":"Brief. Bioinform"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"2757","DOI":"10.1093\/bioinformatics\/bty1047","article-title":"mAHTPred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation","volume":"35","author":"Manavalan","year":"2019","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"3057","DOI":"10.1007\/s00018-017-2521-1","article-title":"Ultrastructural localization of 5-methylcytosine on DNA and RNA","volume":"74","author":"Masiello","year":"2017","journal-title":"Cell. Mol. 
Life Sci"},{"key":"2023041408370176400_","first-page":"511","article-title":"Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks","volume":"19","author":"Pan","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"106625","DOI":"10.1016\/j.cmpb.2022.106625","article-title":"iPro-GAN: a novel model based on generative adversarial learning for identifying promoters and their strength","volume":"215","author":"Qiao","year":"2022","journal-title":"Comput. Methods Programs Biomed"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1590","DOI":"10.1101\/gad.586710","article-title":"RNA methylation by Dnmt2 protects transfer RNAs against stress-induced cleavage","volume":"24","author":"Schaefer","year":"2010","journal-title":"Genes Dev"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"519","DOI":"10.3389\/fpls.2018.00519","article-title":"Transcriptome-wide annotation of m5C RNA modifications using machine learning","volume":"9","author":"Song","year":"2018","journal-title":"Front Plant Sci"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"5023","DOI":"10.1093\/nar\/gks144","article-title":"Widespread occurrence of 5-methylcytosine in human coding and non-coding RNA","volume":"40","author":"Squires","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"408","DOI":"10.1093\/bib\/bby124","article-title":"Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools","volume":"21","author":"Su","year":"2020","journal-title":"Brief. 
Bioinform"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1016\/j.omtn.2019.03.010","article-title":"iPseU-CNN: identifying RNA pseudouridine sites using convolutional neural networks","volume":"16","author":"Tahir","year":"2019","journal-title":"Mol. Ther. Nucleic Acids"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1536","DOI":"10.1093\/bioinformatics\/btl151","article-title":"Two sample logo: a graphical representation of the differences between two sets of sequence alignments","volume":"22","author":"Vacic","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1007\/s10930-021-10011-y","article-title":"UMAP-DBP: an improved DNA-Binding proteins prediction method based on uniform manifold approximation and projection","volume":"40","author":"Wang","year":"2021","journal-title":"Protein J"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"4007","DOI":"10.1093\/bioinformatics\/bty451","article-title":"ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides","volume":"34","author":"Wei","year":"2018","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"1326","DOI":"10.1093\/bioinformatics\/bty824","article-title":"Exploring sequence based features for the improved prediction of DNA N4-methylcytosine sites in multiple species","volume":"35","author":"Wei","year":"2019","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"4930","DOI":"10.1093\/bioinformatics\/btz408","article-title":"Iterative feature representations improve N4-methylcytosine site 
prediction","volume":"35","author":"Wei","year":"2019","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"14719","DOI":"10.1021\/bi9809425","article-title":"Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs","volume":"37","author":"Xia","year":"1998","journal-title":"Biochemistry"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"D327","DOI":"10.1093\/nar\/gkx934","article-title":"RMBase v2.0: deciphering the map of RNA modifications from epitranscriptome sequencing data","volume":"46","author":"Xuan","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"4668","DOI":"10.1093\/bioinformatics\/btab551","article-title":"PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein-protein interaction information","volume":"37","author":"Yang","year":"2021","journal-title":"Bioinformatics"},{"key":"2023041408370176400_","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1038\/cr.2017.55","article-title":"5-methylcytosine promotes mRNA export-NSUN2 as the methyltransferase and ALYREF as an m(5)C reader","volume":"27","author":"Yang","year":"2017","journal-title":"Cell 
Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac532\/45249830\/btac532.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/18\/4271\/49884747\/btac532.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/18\/4271\/49884747\/btac532.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,24]],"date-time":"2023-11-24T21:43:38Z","timestamp":1700862218000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/18\/4271\/6648463"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,7,22]]},"references-count":46,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2022,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac532","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,9,15]]},"published":{"date-parts":[[2022,7,22]]}}}