{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T11:19:44Z","timestamp":1763810384343,"version":"3.37.3"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2020,11,28]],"date-time":"2020-11-28T00:00:00Z","timestamp":1606521600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"publisher","award":["81570807","30900810","31271344","31071097"],"award-info":[{"award-number":["81570807","30900810","31271344","31071097"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002664","name":"Hunan Normal University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002664","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01 AR069055","U19 AG055373","R01 MH104680","R01 AR059781","P20 GM109036"],"award-info":[{"award-number":["R01 AR069055","U19 AG055373","R01 MH104680","R01 AR059781","P20 GM109036"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Edward G. Schlieder Endowment"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,6,16]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Although genome-wide association studies (GWASs) have identified thousands of variants for various traits, the causal variants and the mechanisms underlying the significant loci are largely unknown. In this study, we aim to predict non-coding variants that may functionally affect translation initiation through long-range chromatin interaction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>By incorporating the Hi-C data, we propose a novel and powerful deep learning model of artificial intelligence to classify interacting and non-interacting fragment pairs and predict the functional effects of sequence alteration of single nucleotide on chromatin interaction and thus on gene expression. The changes in chromatin interaction probability between the reference sequence and the altered sequence reflect the degree of functional impact for the variant. The model was effective and efficient with the classification of interacting and non-interacting fragment pairs. The predicted causal SNPs that had a larger impact on chromatin interaction were more likely to be identified by GWAS and eQTL analyses. We demonstrate that an integrative approach combining artificial intelligence\u2014deep learning with high throughput experimental evidence of chromatin interaction leads to prioritizing the functional variants in disease- and phenotype-related loci and thus will greatly expedite uncover of the biological mechanism underlying the association identified in genomic studies.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Source code used in data preparing and model training is available at the GitHub website (https:\/\/github.com\/biocai\/DeepHiC).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa970","type":"journal-article","created":{"date-parts":[[2020,11,5]],"date-time":"2020-11-05T12:46:13Z","timestamp":1604580373000},"page":"1339-1344","source":"Crossref","is-referenced-by-count":14,"title":["Combining artificial intelligence: deep learning with Hi-C data to predict the functional effects of non-coding variants"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8731-2899","authenticated-orcid":false,"given":"Xiang-He","family":"Meng","sequence":"first","affiliation":[{"name":"Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University , Changsha, Hunan 410008, China"},{"name":"Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University , New Orleans, LA 70112, USA"},{"name":"Centers of System Biology, Data Information and Reproductive Health, Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University , Changsha, Hunan 410081, China"}]},{"given":"Hong-Mei","family":"Xiao","sequence":"additional","affiliation":[{"name":"Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University , Changsha, Hunan 410008, China"}]},{"given":"Hong-Wen","family":"Deng","sequence":"additional","affiliation":[{"name":"Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University , Changsha, Hunan 410008, China"},{"name":"Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University , New Orleans, LA 70112, USA"},{"name":"Centers of System Biology, Data Information and Reproductive Health, Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University , Changsha, Hunan 410081, China"}]}],"member":"286","published-online":{"date-parts":[[2020,11,28]]},"reference":[{"key":"2023051709343336100_btaa970-B1","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1186\/s13059-015-0745-7","article-title":"Analysis methods for studying the 3D architecture of the genome","volume":"16","author":"Ay","year":"2015","journal-title":"Genome Biol"},{"key":"2023051709343336100_btaa970-B2","doi-asserted-by":"crossref","first-page":"2114","DOI":"10.1093\/bioinformatics\/btu170","article-title":"Trimmomatic: a flexible trimmer for Illumina sequence data","volume":"30","author":"Bolger","year":"2014","journal-title":"Bioinformatics"},{"key":"2023051709343336100_btaa970-B3","doi-asserted-by":"crossref","first-page":"1790","DOI":"10.1101\/gr.137323.112","article-title":"Annotation of functional variation in personal genomes using RegulomeDB","volume":"22","author":"Boyle","year":"2012","journal-title":"Genome Res"},{"key":"2023051709343336100_btaa970-B4","doi-asserted-by":"crossref","first-page":"3121","DOI":"10.1038\/s41467-018-05379-y","article-title":"Evaluation of chromatin accessibility in prefrontal cortex of individuals with schizophrenia","volume":"9","author":"Bryois","year":"2018","journal-title":"Nature Communications"},{"key":"2023051709343336100_btaa970-B5","doi-asserted-by":"crossref","first-page":"8641","DOI":"10.1093\/nar\/gkw519","article-title":"Explaining the disease phenotype of intergenic SNP through predicted long range regulation","volume":"44","author":"Chen","year":"2016","journal-title":"Nucleic Acids Research"},{"key":"2023051709343336100_btaa970-B6","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1038\/nature14222","article-title":"Chromatin architecture reorganization during stem cell differentiation","volume":"518","author":"Dixon","year":"2015","journal-title":"Nature"},{"key":"2023051709343336100_btaa970-B7","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature11082","article-title":"Topological domains in mammalian genomes identified by analysis of chromatin interactions","volume":"485","author":"Dixon","year":"2012","journal-title":"Nature"},{"key":"2023051709343336100_btaa970-B8","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1038\/nature03467","article-title":"A common sex-dependent mutation in a RET enhancer underlies Hirschsprung disease risk","volume":"434","author":"Emison","year":"2005","journal-title":"Nature"},{"key":"2023051709343336100_btaa970-B9","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.2249","article-title":"Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture","volume":"44","author":"Estrada","year":"2012","journal-title":"Nature Genetics"},{"key":"2023051709343336100_btaa970-B10","doi-asserted-by":"crossref","first-page":"e1004383","DOI":"10.1371\/journal.pgen.1004383","article-title":"Bayesian test for colocalisation between pairs of genetic association studies using summary statistics","volume":"10","author":"Giambartolomei","year":"2014","journal-title":"PLoS Genetics"},{"key":"2023051709343336100_btaa970-B11","doi-asserted-by":"crossref","first-page":"2538","DOI":"10.1093\/bioinformatics\/bty147","article-title":"A Bayesian framework for multiple trait colocalization from summary association statistics","volume":"34","author":"Giambartolomei","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051709343336100_btaa970-B12","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1016\/j.molcel.2010.05.004","article-title":"Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities","volume":"38","author":"Heinz","year":"2010","journal-title":"Molecular Cell"},{"key":"2023051709343336100_btaa970-B13","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1038\/ng.2871","article-title":"Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment","volume":"46","author":"Hughes","year":"2014","journal-title":"Nature Genetics"},{"key":"2023051709343336100_btaa970-B14","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1534\/genetics.118.301028","article-title":"Evolution of DNAase I Hypersensitive Sites in MHC Regulatory Regions of Primates","volume":"209","author":"Jin","year":"2018","journal-title":"Genetics"},{"volume":"1412.6980","year":"2014","key":"2023051709343336100_btaa970-B15"},{"key":"2023051709343336100_btaa970-B16","doi-asserted-by":"crossref","first-page":"2049","DOI":"10.1093\/hmg\/10.19.2049","article-title":"Aniridia-associated translocations, DNase hypersensitivity, sequence comparison and transgenic analysis redefine the functional domain of PAX6","volume":"10","author":"Kleinjan","year":"2001","journal-title":"Hum. Mol. Genet"},{"key":"2023051709343336100_btaa970-B17","doi-asserted-by":"crossref","first-page":"D980","DOI":"10.1093\/nar\/gkt1113","article-title":"ClinVar: public archive of relationships among sequence variation and human phenotype","volume":"42","author":"Landrum","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023051709343336100_btaa970-B18","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows\u2013Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051709343336100_btaa970-B19","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The Sequence Alignment\/Map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051709343336100_btaa970-B20","doi-asserted-by":"crossref","first-page":"2897","DOI":"10.1093\/bioinformatics\/btq540","article-title":"FastPval: a fast and memory efficient program to calculate very low P-values from empirical distribution","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051709343336100_btaa970-B21","doi-asserted-by":"crossref","first-page":"e108","DOI":"10.1093\/nar\/gkt214","article-title":"The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote","volume":"41","author":"Liao","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023051709343336100_btaa970-B22","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1038\/s41467-018-08270-y","article-title":"Biological relevance of computationally predicted pathogenicity of noncoding variants","volume":"10","author":"Liu","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051709343336100_btaa970-B23","doi-asserted-by":"crossref","first-page":"10391","DOI":"10.1093\/nar\/gkt785","article-title":"Combining Hi-C data with phylogenetic correlation to predict the target genes of distal regulatory elements in human genome","volume":"41","author":"Lu","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023051709343336100_btaa970-B24","doi-asserted-by":"crossref","first-page":"1190","DOI":"10.1126\/science.1222794","article-title":"Systematic localization of common disease-associated variation in regulatory DNA","volume":"337","author":"Maurano","year":"2012","journal-title":"Science"},{"key":"2023051709343336100_btaa970-B25","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/j.gdata.2016.04.014","article-title":"Combined sequencing of mRNA and DNA from human embryonic stem cells","volume":"8","author":"Mertes","year":"2016","journal-title":"Genomics Data"},{"key":"2023051709343336100_btaa970-B26","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1126\/science.3090685","article-title":"Upstream operators enhance repression of the lac promoter","volume":"233","author":"Mossing","year":"1986","journal-title":"Science"},{"key":"2023051709343336100_btaa970-B27","doi-asserted-by":"crossref","first-page":"1301","DOI":"10.1534\/genetics.117.300435","article-title":"Efficient prioritization of multiple causal eQTL variants via sparse polygenic modeling","volume":"207","author":"Nariai","year":"2017","journal-title":"Genetics"},{"key":"2023051709343336100_btaa970-B28","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1016\/j.tig.2016.10.008","article-title":"Mining the unknown: assigning function to noncoding single nucleotide polymorphisms","volume":"33","author":"Nishizaki","year":"2017","journal-title":"Trends Genet. TIG"},{"key":"2023051709343336100_btaa970-B29","doi-asserted-by":"crossref","first-page":"1665","DOI":"10.1016\/j.cell.2014.11.021","article-title":"A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping","volume":"159","author":"Rao","year":"2014","journal-title":"Cell"},{"key":"2023051709343336100_btaa970-B30","doi-asserted-by":"crossref","first-page":"1199","DOI":"10.1038\/ng.446","article-title":"Twenty bone-mineral-density loci identified by large-scale meta-analysis of genome-wide association studies","volume":"41","author":"Rivadeneira","year":"2009","journal-title":"Nat. Genet"},{"key":"2023051709343336100_btaa970-B31","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1038\/nrg3051","article-title":"Understanding the contribution of synonymous mutations to human disease","volume":"12","author":"Sauna","year":"2011","journal-title":"Nat. Rev. Genet"},{"key":"2023051709343336100_btaa970-B32","first-page":"1929","article-title":"Dropout: a simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res"},{"key":"2023051709343336100_btaa970-B33","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.ajhg.2015.05.016","article-title":"Disentangling the effects of colocalizing genomic annotations to functionally prioritize non-coding variants within complex-trait loci","volume":"97","author":"Trynka","year":"2015","journal-title":"Am. J. Hum. Genet"},{"key":"2023051709343336100_btaa970-B34","doi-asserted-by":"crossref","first-page":"11.10.11","DOI":"10.1002\/0471250953.bi1110s43","article-title":"From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline","volume":"43","author":"Van der Auwera","year":"2013","journal-title":"Curr. Protoc. Bioinf"},{"key":"2023051709343336100_btaa970-B35","doi-asserted-by":"crossref","first-page":"e69","DOI":"10.1093\/nar\/gky215","article-title":"DeFine: deep convolutional neural networks accurately quantify intensities of transcription factor-DNA binding and facilitate evaluation of functional non-coding variants","volume":"46","author":"Wang","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023051709343336100_btaa970-B36","doi-asserted-by":"crossref","first-page":"D1001","DOI":"10.1093\/nar\/gkt1229","article-title":"The NHGRI GWAS Catalog, a curated resource of SNP-trait associations","volume":"42","author":"Welter","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023051709343336100_btaa970-B37","doi-asserted-by":"crossref","first-page":"1238","DOI":"10.1038\/ng.2756","article-title":"Systematic identification of trans eQTLs as putative drivers of known disease associations","volume":"45","author":"Westra","year":"2013","journal-title":"Nat. Genet"},{"key":"2023051709343336100_btaa970-B38","first-page":"49","article-title":"RANKL\/RANK\/OPG: new therapeutic targets in bone tumours and associated osteolysis","volume":"1704","author":"Wittrant","year":"2004","journal-title":"Biochim. Biophys. Acta"},{"key":"2023051709343336100_btaa970-B39","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.cca.2010.09.024","article-title":"Genetic association of rs11610206 SNP on chromosome 12q13 with late-onset Alzheimer's disease in a Han Chinese population","volume":"412","author":"Yu","year":"2011","journal-title":"Clin. Chim. Acta Int. J. Clin. Chem"},{"key":"2023051709343336100_btaa970-B40","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1093\/hmg\/ddt575","article-title":"Multistage genome-wide association meta-analyses identified two new loci for bone mineral density","volume":"23","author":"Zhang","year":"2014","journal-title":"Hum. Mol. Genet"},{"key":"2023051709343336100_btaa970-B41","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1038\/nmeth.3547","article-title":"Predicting effects of noncoding variants with deep learning-based sequence model","volume":"12","author":"Zhou","year":"2015","journal-title":"Nat. Methods"},{"key":"2023051709343336100_btaa970-B42","doi-asserted-by":"crossref","first-page":"1335","DOI":"10.1002\/jbmr.3419","article-title":"Multiple functional variants at 13q14 risk locus for osteoporosis regulate RANKL expression through long-range super-enhancer","volume":"33","author":"Zhu","year":"2018","journal-title":"J. Bone Miner. Res Off. J. Am. Soc. Bone Miner. Res"},{"key":"2023051709343336100_btaa970-B43","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1038\/ng.3538","article-title":"Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets","volume":"48","author":"Zhu","year":"2016","journal-title":"Nat. Genet"},{"key":"2023051709343336100_btaa970-B44","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1038\/s41588-018-0295-5","article-title":"A primer on deep learning in genomics","volume":"51","author":"Zou","year":"2019","journal-title":"Nat. Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa970\/34321622\/btaa970.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/10\/1339\/50360967\/btaa970.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/10\/1339\/50360967\/btaa970.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,17]],"date-time":"2023-05-17T09:50:02Z","timestamp":1684317002000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/10\/1339\/5983320"}},"subtitle":[],"editor":[{"given":"Martelli","family":"Pier Luigi","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,11,28]]},"references-count":44,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2021,6,16]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa970","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2021,5,15]]},"published":{"date-parts":[[2020,11,28]]}}}