{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T07:02:51Z","timestamp":1758265371938},"reference-count":12,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2020,5,15]],"date-time":"2020-05-15T00:00:00Z","timestamp":1589500800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Start-up Foundation for Advanced Talents","award":["050\/804009"],"award-info":[{"award-number":["050\/804009"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,7,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>DNA N4-methylcytosine (4mC) modification is an important epigenetic modification in prokaryotic DNA due to its role in regulating DNA replication and protecting the host DNA against degradation. An efficient algorithm to identify 4mC sites is needed for downstream analyses.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we propose a new prediction method named SOMM4mC based on a second-order Markov model, which makes use of the transition probability between adjacent nucleotides to identify 4mC sites. The results show that the first-order and second-order Markov model are superior to the three existing algorithms in all six species (Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Escherichia coli, Geoalkalibacter subterruneus and Geobacter pickeringii) where benchmark datasets are available. However, the classification performance of SOMM4mC is more outstanding than that of first-order Markov model. Especially, for E.coli and C.elegans, the overall accuracy of SOMM4mC are 91.8% and 87.6%, which are 8.5% and 6.1% higher than those of the latest method 4mcPred-SVM, respectively. This shows that more discriminant sequence information is captured by SOMM4mC through the dependency between adjacent nucleotides.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The web server of SOMM4mC is freely accessible at www.insect-genome.com\/SOMM4mC.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Contact<\/jats:title>\n                  <jats:p>chenyuanyuan@njau.edu.cn or piancong@njau.edu.cn<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa507","type":"journal-article","created":{"date-parts":[[2020,5,8]],"date-time":"2020-05-08T19:11:53Z","timestamp":1588965113000},"page":"4103-4105","source":"Crossref","is-referenced-by-count":17,"title":["SOMM4mC: a second-order Markov model for DNA N4-methylcytosine site prediction in six species"],"prefix":"10.1093","volume":"36","author":[{"given":"Jiali","family":"Yang","sequence":"first","affiliation":[{"name":"Department of Mathematics , College of Science, Nanjing Agricultural University, Nanjing 210095, China"},{"name":"Department of Chinese Internal Medicine , College of integrated traditional Chinese and western medicine, Hebei university of Chinese medicine, Shijiazhuang 050200, China"}]},{"given":"Kun","family":"Lang","sequence":"additional","affiliation":[{"name":"Department of Information Resource Management , College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095, China"}]},{"given":"Guangle","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Mathematics , College of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095, China"}]},{"given":"Xiaodan","family":"Fan","sequence":"additional","affiliation":[{"name":"Department of Statistics , The Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, 999077, China"}]},{"given":"Yuanyuan","family":"Chen","sequence":"additional","affiliation":[{"name":"Department of Mathematics , College of Science, Nanjing Agricultural University, Nanjing 210095, China"}]},{"given":"Cong","family":"Pian","sequence":"additional","affiliation":[{"name":"Department of Mathematics , College of Science, Nanjing Agricultural University, Nanjing 210095, China"},{"name":"Department of Statistics , The Chinese University of Hong Kong, Sha Tin, Hong Kong SAR, 999077, China"}]}],"member":"286","published-online":{"date-parts":[[2020,5,15]]},"reference":[{"key":"2023062213541985100_btaa507-B1","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1016\/0022-5193(83)90251-5","article-title":"Markov analysis of DNA sequences","volume":"104","author":"Almagor","year":"1983","journal-title":"J. Theor. Biol"},{"key":"2023062213541985100_btaa507-B2","doi-asserted-by":"crossref","first-page":"3554","DOI":"10.1093\/nar\/23.17.3554","article-title":"Detection of new genes in a bacterial genome using Markov models for three gene classes","volume":"23","author":"Borodovsky","year":"1995","journal-title":"Nucleic Acids Res"},{"key":"2023062213541985100_btaa507-B3","doi-asserted-by":"crossref","first-page":"3518","DOI":"10.1093\/bioinformatics\/btx479","article-title":"iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties","volume":"33","author":"Chen","year":"2017","journal-title":"Bioinformatics"},{"key":"2023062213541985100_btaa507-B4","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/0959-440X(95)80003-J","article-title":"DNA modification by methyltransferases","volume":"5","author":"Cheng","year":"1995","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023062213541985100_btaa507-B5","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/nmeth.1459","article-title":"Direct detection of DNA methylation during single-molecule, real-time sequencing","volume":"7","author":"Flusberg","year":"2010","journal-title":"Nat. Methods"},{"key":"2023062213541985100_btaa507-B6","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1093\/bioinformatics\/bty668","article-title":"4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction","volume":"35","author":"He","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062213541985100_btaa507-B7","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","article-title":"Comparison of the predicted and observed secondary structure of T4 phage lysozyme","volume":"405","author":"Matthews","year":"1975","journal-title":"Biochim. Biophys. Acta"},{"key":"2023062213541985100_btaa507-B8","first-page":"388","article-title":"MM-6mAPred: identifying DNA N6-methyladenine sites based on Markov Model","author":"Pian","year":"2019"},{"key":"2023062213541985100_btaa507-B9","doi-asserted-by":"crossref","first-page":"633","DOI":"10.2144\/000112807","article-title":"Bacterial genetics: past achievements, present state of the field, and future challenges","volume":"44","author":"Schweizer","year":"2008","journal-title":"Biotechniques"},{"key":"2023062213541985100_btaa507-B10","doi-asserted-by":"crossref","first-page":"1326","DOI":"10.1093\/bioinformatics\/bty824","article-title":"Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species","volume":"35","author":"Wei","year":"2019","journal-title":"Bioinformatics"},{"key":"2023062213541985100_btaa507-B11","doi-asserted-by":"crossref","first-page":"4046","DOI":"10.1093\/bioinformatics\/bti657","article-title":"Markov model recognition and classification of DNA\/protein sequences within large text databases","volume":"21","author":"Wren","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062213541985100_btaa507-B12","first-page":"e148","article-title":"Base-resolution detection of N 4-methylcytosine in genomic DNA using 4mC-Tet-assisted-bisulfite-sequencing","volume":"43","author":"Yu","year":"2015","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa507\/33479781\/btaa507.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/14\/4103\/50676880\/btaa507.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/14\/4103\/50676880\/btaa507.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,23]],"date-time":"2023-06-23T15:28:55Z","timestamp":1687534135000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/14\/4103\/5837665"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,5,15]]},"references-count":12,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2020,7,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa507","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,7,15]]},"published":{"date-parts":[[2020,5,15]]}}}