{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T20:04:32Z","timestamp":1781553872951,"version":"3.54.5"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T00:00:00Z","timestamp":1719532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Sony Faculty Research Award"},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["R01HG011466"],"award-info":[{"award-number":["R01HG011466"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["UM1HG011531"],"award-info":[{"award-number":["UM1HG011531"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,6,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>High-resolution Hi-C contact matrices reveal the detailed three-dimensional architecture of the genome, but high-coverage experimental Hi-C data are expensive to generate. Simultaneously, chromatin structure analyses struggle with extremely sparse contact matrices. To address this problem, computational methods to enhance low-coverage contact matrices have been developed, but existing methods are largely based on resolution enhancement methods for natural images and hence often employ models that do not distinguish between biologically meaningful contacts, such as loops and other stochastic contacts.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present Capricorn, a machine learning model for Hi-C resolution enhancement that incorporates small-scale chromatin features as additional views of the input Hi-C contact matrix and leverages a diffusion probability model backbone to generate a high-coverage matrix. We show that Capricorn outperforms the state of the art in a cross-cell-line setting, improving on existing methods by 17% in mean squared error and 26% in F1 score for chromatin loop identification from the generated high-coverage data. We also demonstrate that Capricorn performs well in the cross-chromosome setting and cross-chromosome, cross-cell-line setting, improving the downstream loop F1 score by 14% relative to existing methods. We further show that our multiview idea can also be used to improve several existing methods, HiCARN and HiCNN, indicating the wide applicability of this approach. Finally, we use DNA sequence to validate discovered loops and find that the fraction of CTCF-supported loops from Capricorn is similar to those identified from the high-coverage data. Capricorn is a powerful Hi-C resolution enhancement method that enables scientists to find chromatin features that cannot be identified in the low-coverage contact matrix.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Implementation of Capricorn and source code for reproducing all figures in this paper are available at https:\/\/github.com\/CHNFTQ\/Capricorn.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae211","type":"journal-article","created":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T09:33:42Z","timestamp":1719567222000},"page":"i471-i480","source":"Crossref","is-referenced-by-count":7,"title":["Enhancing Hi-C contact matrices for loop detection with Capricorn: a multiview diffusion model"],"prefix":"10.1093","volume":"40","author":[{"given":"Tangqi","family":"Fang","sequence":"first","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yifeng","family":"Liu","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Addie","family":"Woicik","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Minsi","family":"Lu","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3029-2086","authenticated-orcid":false,"given":"Anupama","family":"Jha","sequence":"additional","affiliation":[{"name":"Department of Genome Sciences, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiao","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Purdue University , West Lafayette, IN 47907, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Gang","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Genome Sciences, University of Washington , Seattle, WA 98195, United States"},{"name":"eScience Institute, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Borislav","family":"Hristov","sequence":"additional","affiliation":[{"name":"Department of Genome Sciences, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zixuan","family":"Liu","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hanwen","family":"Xu","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"William S","family":"Noble","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"},{"name":"Department of Genome Sciences, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sheng","family":"Wang","sequence":"additional","affiliation":[{"name":"Paul G. Allen School of Computer Science and Engineering, University of Washington , Seattle, WA 98195, United States"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2024,6,28]]},"reference":[{"key":"2024062809073943300_btae211-B1","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1038\/nature21411","article-title":"Complex multi-enhancer contacts captured by genome architecture mapping","volume":"543","author":"Beagrie","year":"2017","journal-title":"Nature"},{"key":"2024062809073943300_btae211-B2","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1038\/nrg.2016.112","article-title":"Organization and function of the 3D genome","volume":"17","author":"Bonev","year":"2016","journal-title":"Nat Rev Genet"},{"key":"2024062809073943300_btae211-B3","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/s13059-019-1913-y","article-title":"HIFI: estimating DNA-DNA interaction frequency from Hi-C data at restriction-fragment resolution","volume":"21","author":"Cameron","year":"2020","journal-title":"Genome Biol"},{"key":"2024062809073943300_btae211-B4","doi-asserted-by":"crossref","first-page":"2724","DOI":"10.1093\/bioinformatics\/bty1059","article-title":"Boost-HiC: computational enhancement of long-range contacts in chromosomal contact maps","volume":"35","author":"Carron","year":"2019","journal-title":"Bioinformatics"},{"key":"2024062809073943300_btae211-B5","doi-asserted-by":"crossref","first-page":"6827","DOI":"10.1038\/s41467-022-34626-6","article-title":"dcHiC detects differential compartments across multiple Hi-C datasets","volume":"13","author":"Chakraborty","year":"2022","journal-title":"Nat Commun"},{"key":"2024062809073943300_btae211-B6","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1038\/nature14450","article-title":"Condensin-driven remodelling of X chromosome topology during dosage compensation","volume":"523","author":"Crane","year":"2015","journal-title":"Nature"},{"key":"2024062809073943300_btae211-B7","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1038\/35066075","article-title":"Chromosome territories, nuclear architecture and gene regulation in mammalian cells","volume":"2","author":"Cremer","year":"2001","journal-title":"Nat Rev Genet"},{"key":"2024062809073943300_btae211-B8","doi-asserted-by":"crossref","first-page":"2994","DOI":"10.1093\/nar\/gkx145","article-title":"A critical assessment of topologically associating domain prediction tools","volume":"45","author":"Dali","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2024062809073943300_btae211-B9","author":"Dimmick","year":"2020"},{"key":"2024062809073943300_btae211-B10","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature11082","article-title":"Topological domains in mammalian genomes identified by analysis of chromatin interactions","volume":"485","author":"Dixon","year":"2012","journal-title":"Nature"},{"key":"2024062809073943300_btae211-B11","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1186\/1748-7188-9-14","article-title":"Identification of alternative topological domains in chromatin","volume":"9","author":"Filippova","year":"2014","journal-title":"Algorithms Mol Biol"},{"key":"2024062809073943300_btae211-B12","first-page":"10021","author":"Gao","year":"2023"},{"key":"2024062809073943300_btae211-B13","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1093\/bioinformatics\/btr064","article-title":"FIMO: scanning for occurrences of a given motif","volume":"27","author":"Grant","year":"2011","journal-title":"Bioinformatics"},{"key":"2024062809073943300_btae211-B14","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1038\/ng.857","article-title":"CTCF-mediated functional chromatin interactome in pluripotent cells","volume":"43","author":"Handoko","year":"2011","journal-title":"Nat Genet"},{"key":"2024062809073943300_btae211-B15","doi-asserted-by":"crossref","first-page":"2414","DOI":"10.1093\/bioinformatics\/btac156","article-title":"HiCARN: resolution enhancement of Hi-C data using cascading residual networks","volume":"38","author":"Hicks","year":"2022","journal-title":"Bioinformatics"},{"key":"2024062809073943300_btae211-B16","doi-asserted-by":"crossref","first-page":"8880","DOI":"10.1038\/s41598-021-88115-9","article-title":"VEHiCLE: a variationally encoded Hi-C loss enhancement algorithm for improving and generating Hi-C data","volume":"11","author":"Highsmith","year":"2021","journal-title":"Sci Rep"},{"key":"2024062809073943300_btae211-B17","first-page":"6840","volume-title":"Advances in Neural Information Processing Systems","author":"Ho","year":"2020"},{"key":"2024062809073943300_btae211-B18","doi-asserted-by":"crossref","first-page":"e1007287","DOI":"10.1371\/journal.pcbi.1007287","article-title":"DeepHiC: a generative adversarial network for enhancing Hi-C data resolution","volume":"16","author":"Hong","year":"2020","journal-title":"PLoS Comput Biol"},{"key":"2024062809073943300_btae211-B19","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.cell.2015.05.048","article-title":"Mapping nucleosome resolution chromosome folding in yeast by micro-C","volume":"162","author":"Hsieh","year":"2015","journal-title":"Cell"},{"key":"2024062809073943300_btae211-B20","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1016\/j.molcel.2020.03.002","article-title":"Resolving the 3D landscape of Transcription-Linked mammalian chromatin folding","volume":"78","author":"Hsieh","year":"2020","journal-title":"Mol Cell"},{"key":"2024062809073943300_btae211-B21","author":"Kagda","year":"2023"},{"key":"2024062809073943300_btae211-B22","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.neucom.2022.01.029","article-title":"SRDiff: single image super-resolution with diffusion probabilistic models","volume":"479","author":"Li","year":"2022","journal-title":"Neurocomputing"},{"key":"2024062809073943300_btae211-B23","doi-asserted-by":"crossref","first-page":"353","DOI":"10.3389\/fgene.2020.00353","article-title":"SRHiC: a deep learning model to enhance the resolution of Hi-C data","volume":"11","author":"Li","year":"2020","journal-title":"Front Genet"},{"key":"2024062809073943300_btae211-B24","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1126\/science.1181369","article-title":"Comprehensive mapping of long-range interactions reveals folding principles of the human genome","volume":"326","author":"Lieberman-Aiden","year":"2009","journal-title":"Science"},{"key":"2024062809073943300_btae211-B25","doi-asserted-by":"crossref","first-page":"i99","DOI":"10.1093\/bioinformatics\/btz317","article-title":"hicGAN infers super resolution Hi-C data with generative adversarial networks","volume":"35","author":"Liu","year":"2019","journal-title":"Bioinformatics"},{"key":"2024062809073943300_btae211-B26","doi-asserted-by":"crossref","first-page":"4222","DOI":"10.1093\/bioinformatics\/btz251","article-title":"HiCNN: a very deep convolutional neural network to better enhance the resolution of Hi-C data","volume":"35","author":"Liu","year":"2019","journal-title":"Bioinformatics"},{"key":"2024062809073943300_btae211-B27","doi-asserted-by":"publisher","first-page":"862","DOI":"10.3390\/genes10110862","article-title":"HiCNN2: Enhancing the resolution of hi-C data using an ensemble of convolutional neural networks","volume":"10","author":"Liu","year":"2019","journal-title":"Genes"},{"key":"2024062809073943300_btae211-B28","doi-asserted-by":"crossref","first-page":"5795","DOI":"10.1038\/s41467-020-19562-7","article-title":"Computer vision for pattern detection in chromosome contact maps","volume":"11","author":"Matthey-Doret","year":"2020","journal-title":"Nat Commun"},{"key":"2024062809073943300_btae211-B29","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1038\/nmeth.3999","article-title":"HiChIP: efficient and sensitive analysis of protein-directed genome architecture","volume":"13","author":"Mumbach","year":"2016","journal-title":"Nat Methods"},{"key":"2024062809073943300_btae211-B30","first-page":"16784","author":"Nichol","year":"2021"},{"key":"2024062809073943300_btae211-B31","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1016\/j.cell.2018.05.024","article-title":"Higher-order inter-chromosomal hubs shape 3D genome organization in the nucleus","volume":"174","author":"Quinodoz","year":"2018","journal-title":"Cell"},{"key":"2024062809073943300_btae211-B32","author":"Ramesh","year":"2022"},{"key":"2024062809073943300_btae211-B33","doi-asserted-by":"crossref","first-page":"1665","DOI":"10.1016\/j.cell.2014.11.021","article-title":"A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping","volume":"159","author":"Rao","year":"2014","journal-title":"Cell"},{"key":"2024062809073943300_btae211-B34","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1186\/s13059-020-02167-0","article-title":"Mustache: multi-scale detection of chromatin loops from Hi-C and Micro-C maps using scale-space representation","volume":"21","author":"Roayaei Ardakany","year":"2020","journal-title":"Genome Biol"},{"key":"2024062809073943300_btae211-B35","first-page":"10684","author":"Rombach"},{"key":"2024062809073943300_btae211-B36","volume-title":"Advances in Neural Information Processing Systems","author":"Saharia","year":"2022"},{"key":"2024062809073943300_btae211-B37","first-page":"4713","article-title":"Image super-resolution via iterative refinement","volume":"45","author":"Saharia","year":"2023","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2024062809073943300_btae211-B38","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1016\/j.ymeth.2009.03.001","article-title":"ChIP-seq: using high-throughput sequencing to discover protein-DNA interactions","volume":"48","author":"Schmidt","year":"2009","journal-title":"Methods"},{"key":"2024062809073943300_btae211-B39","doi-asserted-by":"crossref","first-page":"743","DOI":"10.1038\/nrm.2016.104","article-title":"Genome-wide mapping and analysis of chromosome architecture","volume":"17","author":"Schmitt","year":"2016","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2024062809073943300_btae211-B40","first-page":"2256","volume-title":"Proceedings of the 32nd International Conference on Machine Learning, Volume 37 of Proceedings of Machine Learning Research","author":"Sohl-Dickstein","year":"2015"},{"key":"2024062809073943300_btae211-B41","doi-asserted-by":"crossref","first-page":"750","DOI":"10.1038\/s41467-018-03113-2","article-title":"Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus","volume":"9","author":"Zhang","year":"2018","journal-title":"Nat Commun"},{"key":"2024062809073943300_btae211-B42","doi-asserted-by":"crossref","first-page":"i386","DOI":"10.1093\/bioinformatics\/btad266","article-title":"Reference panel-guided super-resolution inference of Hi-C data","volume":"39","author":"Zhang","year":"2023","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i471\/58354834\/btae211.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i471\/58354834\/btae211.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T09:34:06Z","timestamp":1719567246000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/40\/Supplement_1\/i471\/7700867"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,28]]},"references-count":42,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2024,6,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae211","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,6,28]]}}}