{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:43:46Z","timestamp":1753875826685,"version":"3.41.2"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T00:00:00Z","timestamp":1739923200000},"content-version":"vor","delay-in-days":89,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["2020R1A2C2005612","2022R1G1A1004613"],"award-info":[{"award-number":["2020R1A2C2005612","2022R1G1A1004613"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Korea Big Data Station"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Anomalous DNA methylation has wide-ranging implications, spanning from neurological disorders to cancer and cardiovascular complications. Current methods for single-cell DNA methylation analysis face limitations in coverage, leading to information loss and hampering our understanding of disease associations. The primary goal of this study is the imputation of CpG site methylation states in a given cell by leveraging the CpG states of other cells of the same type. To address this, we introduce CpGFuse, a novel methodology that combines information from diverse genomic features. Leveraging two benchmark datasets, we employed a careful preprocessing approach and conducted a comprehensive ablation study to assess the individual and collective contributions of DNA sequence, intercellular, and intracellular features. Our proposed model, CpGFuse, employs a convolutional neural network with an attention mechanism, surpassing existing models across HCCs and HepG2 datasets. The results highlight the effectiveness of our approach in enhancing accuracy and providing a robust tool for CpG site prediction in genomics. CpGFuse\u2019s success underscores the importance of integrating multiple genomic features for accurate identification of methylation states of CpG site.<\/jats:p>","DOI":"10.1093\/bib\/bbaf063","type":"journal-article","created":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T08:15:11Z","timestamp":1739952911000},"source":"Crossref","is-referenced-by-count":0,"title":["CpGFuse: a holistic approach for accurate identification of methylation states of DNA CpG sites"],"prefix":"10.1093","volume":"26","author":[{"given":"Sehi","family":"Park","sequence":"first","affiliation":[{"name":"Department of Electronics and Information Engineering , Jeonbuk National University, Baekje-daero, Deokjin-gu, Jeonju 54896, South","place":["Korea"]}]},{"given":"Kil","family":"To Chong","sequence":"additional","affiliation":[{"name":"Department of Electronics and Information Engineering , Jeonbuk National University, Baekje-daero, Deokjin-gu, Jeonju 54896, South","place":["Korea"]},{"name":"Advances Electronics and Information Research Center , Jeonbuk National University, Baekje-daero, Deokjin-gu, Jeonju 54896, South","place":["Korea"]}]},{"given":"Hilal","family":"Tayara","sequence":"additional","affiliation":[{"name":"School of international Engineering and Science , Jeonbuk National University, Baekje-daero, Deokjin-gu, Jeonju 54896, South","place":["Korea"]}]}],"member":"286","published-online":{"date-parts":[[2025,2,19]]},"reference":[{"key":"2025021908145992200_ref1","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-023-40845-2","article-title":"Epigenetic inheritance is unfaithful at intermediately methylated CpG sites","volume":"14","author":"Hay","year":"2023","journal-title":"Nat Commun"},{"key":"2025021908145992200_ref2","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1038\/nrg1655","article-title":"DNA methylation and human disease","volume":"6","author":"Robertson","year":"2005","journal-title":"Nat Rev Genet"},{"key":"2025021908145992200_ref3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13148-023-01468-y","article-title":"DNA methylation and cardiovascular disease in humans: a systematic review and database of known CpG methylation sites","volume":"15","author":"Krolevets","year":"2023","journal-title":"Clin Epigenetics"},{"key":"2025021908145992200_ref4","doi-asserted-by":"publisher","first-page":"371","DOI":"10.1038\/s41576-018-0004-3","article-title":"DNA methylation-based biomarkers and the epigenetic clock theory of ageing","volume":"19","author":"Horvath","year":"2018","journal-title":"Nat Rev Genet"},{"key":"2025021908145992200_ref5","doi-asserted-by":"publisher","first-page":"iii","DOI":"10.2144\/000113493","article-title":"Advances in genome-wide DNA methylation analysis","volume":"49","author":"Gupta","year":"2010","journal-title":"Biotechniques"},{"key":"2025021908145992200_ref6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2014-15-2-r38","article-title":"MOABS: model based analysis of bisulfite sequencing data","volume":"15","author":"Sun","year":"2014","journal-title":"Genome Biol"},{"key":"2025021908145992200_ref7","doi-asserted-by":"publisher","first-page":"817","DOI":"10.1038\/nmeth.3035","article-title":"Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity","volume":"11","author":"Smallwood","year":"2014","journal-title":"Nat Methods"},{"key":"2025021908145992200_ref8","doi-asserted-by":"publisher","first-page":"2126","DOI":"10.1101\/gr.161679.113","article-title":"Single-cell methylome landscapes of mouse embryonic stem cells and early embryos analyzed using reduced representation bisulfite sequencing","volume":"23","author":"Guo","year":"2013","journal-title":"Genome Res"},{"key":"2025021908145992200_ref9","doi-asserted-by":"publisher","first-page":"1202","DOI":"10.1038\/s41587-021-00895-7","article-title":"Computational principles and challenges in single-cell data integration","volume":"39","author":"Argelaguet","year":"2021","journal-title":"Nat Biotechnol"},{"key":"2025021908145992200_ref10","doi-asserted-by":"publisher","first-page":"1386","DOI":"10.1016\/j.celrep.2015.02.001","article-title":"Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics","volume":"10","author":"Farlik","year":"2015","journal-title":"Cell Rep"},{"key":"2025021908145992200_ref11","doi-asserted-by":"publisher","first-page":"304","DOI":"10.1038\/cr.2016.23","article-title":"Single-cell triple omics sequencing reveals genetic, epigenetic, and transcriptomic heterogeneity in hepatocellular carcinomas","volume":"26","author":"Hou","year":"2016","journal-title":"Cell Res"},{"key":"2025021908145992200_ref12","doi-asserted-by":"publisher","first-page":"808","DOI":"10.1016\/j.stem.2016.10.019","article-title":"DNA methylation dynamics of human hematopoietic stem cell differentiation","volume":"19","author":"Farlik","year":"2016","journal-title":"Cell Stem Cell"},{"key":"2025021908145992200_ref13","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1016\/j.ab.2014.12.009","article-title":"iDNA-Methyl: identifying DNA methylation sites via pseudo trinucleotide composition","volume":"474","author":"Liu","year":"2015","journal-title":"Anal Biochem"},{"key":"2025021908145992200_ref14","doi-asserted-by":"publisher","first-page":"1541","DOI":"10.1101\/gr.152231.112","article-title":"Estimating absolute methylation levels at single-CpG resolution from methylation enrichment and restriction enzyme sequencing methods","volume":"23","author":"Stevens","year":"2013","journal-title":"Genome Res"},{"key":"2025021908145992200_ref15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-019-5488-5","article-title":"MRCNN: a deep learning model for regression of genome-wide DNA methylation","volume":"20","author":"Tian","year":"2019","journal-title":"BMC Genomics"},{"key":"2025021908145992200_ref16","doi-asserted-by":"publisher","first-page":"104997","DOI":"10.1016\/j.neubiorev.2022.104997","article-title":"Epigenome-wide DNA methylation in externalizing behaviours: a review and combined analysis","volume":"145","author":"Meijer","year":"2022","journal-title":"Neurosci Biobehav Rev"},{"key":"2025021908145992200_ref17","doi-asserted-by":"publisher","first-page":"2388","DOI":"10.1021\/acs.jcim.0c00064","article-title":"Accurately predicting mutation-caused stability changes from protein sequences using extreme gradient boosting","volume":"60","author":"Lv","year":"2020","journal-title":"J Chem Inf Model"},{"key":"2025021908145992200_ref18","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1093\/bioinformatics\/btab746","article-title":"CpG transformer for imputation of single-cell methylomes","volume":"38","author":"De Waele","year":"2022","journal-title":"Bioinformatics"},{"key":"2025021908145992200_ref19","first-page":"1","article-title":"DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning","volume":"18","author":"Angermueller","year":"2017","journal-title":"Genome Biol"},{"key":"2025021908145992200_ref20","first-page":"1","article-title":"LightCpG: a multi-view CpG sites detection on single-cell whole genome sequence data","volume":"20","author":"Jiang","year":"2019","journal-title":"BMC Genomics"},{"key":"2025021908145992200_ref21","doi-asserted-by":"publisher","first-page":"btad474","DOI":"10.1093\/bioinformatics\/btad474","article-title":"iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data","volume":"39","author":"Park","year":"2023","journal-title":"Bioinformatics"},{"key":"2025021908145992200_ref22","doi-asserted-by":"publisher","first-page":"408","DOI":"10.1016\/j.compbiomed.2011.12.008","article-title":"Prediction of methylation CpGs and their methylation degrees in human DNA sequences","volume":"42","author":"Zhou","year":"2012","journal-title":"Comput Biol Med"},{"key":"2025021908145992200_ref23","doi-asserted-by":"publisher","first-page":"4302","DOI":"10.1016\/j.febslet.2005.07.002","article-title":"Prediction of methylated CpGs in DNA sequences using a support vector machine","volume":"579","author":"Bhasin","year":"2005","journal-title":"FEBS Lett"},{"key":"2025021908145992200_ref24","first-page":"315","article-title":"Predicting DNA methylation susceptibility using CpG flanking sequences","volume-title":"Biocomputing 2008","author":"Kim","year":"2008"},{"key":"2025021908145992200_ref25","doi-asserted-by":"publisher","first-page":"2986","DOI":"10.1093\/bioinformatics\/btx316","article-title":"DIRECTION: a machine learning framework for predicting and characterizing DNA methylation and hydroxymethylation in mammalian genomes","volume":"33","author":"Pavlovic","year":"2017","journal-title":"Bioinformatics"},{"key":"2025021908145992200_ref26","doi-asserted-by":"crossref","first-page":"904","DOI":"10.1109\/TCBB.2022.3192572","article-title":"DL-m6A: identification of N6-methyladenosine sites in mammals using deep learning based on different encoding schemes","volume":"20","author":"Rehman","year":"2022","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025021908145992200_ref27","doi-asserted-by":"publisher","first-page":"3885","DOI":"10.1093\/bioinformatics\/btac434","article-title":"i6mA-Caps: a capsulenet-based framework for identifying DNA N6-methyladenine sites","volume":"38","author":"Rehman","year":"2022","journal-title":"Bioinformatics"},{"key":"2025021908145992200_ref28","doi-asserted-by":"publisher","first-page":"8314","DOI":"10.3390\/ijms23158314","article-title":"ENet-6mA: identification of 6mA modification sites in plant genomes using elasticnet and neural networks","volume":"23","author":"Abbas","year":"2022","journal-title":"Int J Mol Sci"},{"key":"2025021908145992200_ref29","doi-asserted-by":"publisher","first-page":"296","DOI":"10.3390\/genes12020296","article-title":"4mCPred-CNN\u2014Prediction of DNA N4-methylcytosine in the mouse genome using a convolutional neural network","volume":"12","author":"Abbas","year":"2021","journal-title":"Genes"},{"key":"2025021908145992200_ref30","doi-asserted-by":"publisher","first-page":"1529","DOI":"10.3390\/genes11121529","article-title":"pcPromoter-CNN: a CNN-based prediction and classification of promoters","volume":"11","author":"Shujaat","year":"2020","journal-title":"Genes"},{"key":"2025021908145992200_ref31","doi-asserted-by":"publisher","first-page":"81485","DOI":"10.1109\/ACCESS.2021.3086102","article-title":"Cr-Prom: a convolutional neural network-based model for the prediction of rice promoters","volume":"9","author":"Shujaat","year":"2021","journal-title":"IEEE Access"},{"key":"2025021908145992200_ref32","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1038\/nature06745","article-title":"Shotgun bisulphite sequencing of the arabidopsis genome reveals DNA methylation patterning","volume":"452","author":"Cokus","year":"2008","journal-title":"Nature"},{"key":"2025021908145992200_ref33","doi-asserted-by":"publisher","first-page":"1800","DOI":"10.1109\/ACCESS.2018.2886569","article-title":"Branch point selection in RNA splicing using deep learning","volume":"7","author":"Nazari","year":"2018","journal-title":"IEEE Access"},{"key":"2025021908145992200_ref34","doi-asserted-by":"publisher","first-page":"58826","DOI":"10.1109\/ACCESS.2018.2874208","article-title":"Deep learning models based on distributed feature representations for alternative splicing prediction","volume":"6","author":"Oubounyt","year":"2018","journal-title":"IEEE Access"},{"key":"2025021908145992200_ref35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.jtbi.2018.12.034","article-title":"iRNA-PseKNC (2methyl): identify RNA 2\u2019-O-methylation sites by convolution neural network and Chou\u2019s pseudo components","volume":"465","author":"Tahir","year":"2019","journal-title":"J Theor Biol"},{"key":"2025021908145992200_ref36","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1016\/j.csbj.2022.11.056","article-title":"CapsNh-Kcr: capsule network-based prediction of lysine crotonylation sites in human non-histone proteins","volume":"21","author":"Khanal","year":"2023","journal-title":"Comput Struct Biotechnol J"},{"key":"2025021908145992200_ref37","doi-asserted-by":"publisher","first-page":"615","DOI":"10.2174\/1574893617666220405133520","article-title":"iProm70: a convolutional neural network-based tool for $\\sigma $70Promoter classification","volume":"17","author":"Shujaat","year":"2022","journal-title":"Curr Bioinform"},{"key":"2025021908145992200_ref38","doi-asserted-by":"publisher","first-page":"2120","DOI":"10.3390\/app12042120","article-title":"iEnhancer-deep: a computational predictor for enhancer sites and their strength using deep learning","volume":"12","author":"Kamran","year":"2022","journal-title":"Appl Sci"},{"key":"2025021908145992200_ref39","doi-asserted-by":"publisher","first-page":"3030","DOI":"10.1016\/j.ygeno.2021.07.004","article-title":"ncRDense: a novel computational approach for classification of non-coding RNA family by deep learning","volume":"113","author":"Chantsalnyam","year":"2021","journal-title":"Genomics"},{"article-title":"Neural machine translation by jointly learning to align and translate","year":"2014","author":"Bahdanau","key":"2025021908145992200_ref40"},{"key":"2025021908145992200_ref41","doi-asserted-by":"publisher","first-page":"3883","DOI":"10.3390\/app11093883","article-title":"Examining attention mechanisms in deep learning models for sentiment analysis","volume":"11","author":"Kardakis","year":"2021","journal-title":"Appl Sci"},{"key":"2025021908145992200_ref42","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1016\/j.neucom.2021.03.091","article-title":"A review on the attention mechanism of deep learning","volume":"452","author":"Niu","year":"2021","journal-title":"Neurocomputing"},{"key":"2025021908145992200_ref43","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1108\/IJICC-06-2021-0109","article-title":"High accuracy offering attention mechanisms based deep learning approach using CNN\/bi-LSTM for sentiment analysis","volume":"15","author":"Kota","year":"2022","journal-title":"Int J Intell Comput Cybern"},{"key":"2025021908145992200_ref44","doi-asserted-by":"publisher","first-page":"103212","DOI":"10.1016\/j.bspc.2021.103212","article-title":"Novel architecture with selected feature vector for effective classification of mitotic and non-mitotic cells in breast cancer histology images","volume":"71","author":"Rehman","year":"2022","journal-title":"Biomed Signal Process Control"},{"key":"2025021908145992200_ref45","doi-asserted-by":"publisher","first-page":"6009","DOI":"10.1016\/j.csbj.2021.10.034","article-title":"DCNN-4mC: densely connected neural network based N4-methylcytosine site prediction in multiple species","volume":"19","author":"Rehman","year":"2021","journal-title":"Comput Struct Biotechnol J"},{"key":"2025021908145992200_ref46","doi-asserted-by":"publisher","DOI":"10.3390\/pharmaceutics13111906","article-title":"Drug therapeutic-use class prediction and repurposing using graph convolutional networks","volume":"13","author":"Chipofya","year":"2021","journal-title":"Pharmaceutics"},{"key":"2025021908145992200_ref47","doi-asserted-by":"publisher","first-page":"104391","DOI":"10.1016\/j.chemolab.2021.104391","article-title":"Intelligent and robust computational prediction model for DNA N4-methylcytosine sites via natural language processing","volume":"217","author":"Tahir","year":"2021","journal-title":"Chemom Intel Lab Syst"},{"key":"2025021908145992200_ref48","doi-asserted-by":"publisher","first-page":"1117","DOI":"10.3390\/genes12081117","article-title":"i4mC-Deep: an intelligent predictor of N4-methylcytosine sites using a deep learning approach with chemical properties","volume":"12","author":"Alam","year":"2021","journal-title":"Genes"},{"key":"2025021908145992200_ref49","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-020-80430-x","article-title":"DNA sequences performs as natural language processing by exploiting deep learning algorithm for the identification of N4-methylcytosine","volume":"11","author":"Wahab","year":"2021","journal-title":"Sci Rep"},{"key":"2025021908145992200_ref50","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13040-021-00244-z","article-title":"The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation","volume":"14","author":"Chicco","year":"2021","journal-title":"BioData Mining"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbaf063\/61947319\/bbaf063.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/1\/bbaf063\/61947319\/bbaf063.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T08:15:20Z","timestamp":1739952920000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf063\/8021142"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,22]]},"references-count":50,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,11,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf063","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2025,1]]},"published":{"date-parts":[[2024,11,22]]},"article-number":"bbaf063"}}