{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T05:06:54Z","timestamp":1773896814903,"version":"3.50.1"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2017,10,18]],"date-time":"2017-10-18T00:00:00Z","timestamp":1508284800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Eukaryotic chromosomes adapt a complex and highly dynamic three-dimensional (3D) structure, which profoundly affects different cellular functions and outcomes including changes in epigenetic landscape and in gene expression. Making the scenario even more complex, cancer cells harbor chromosomal abnormalities [e.g. copy number variations (CNVs) and translocations] altering their genomes both at the sequence level and at the level of 3D organization. High-throughput chromosome conformation capture techniques (e.g. Hi-C), which are originally developed for decoding the 3D structure of the chromatin, provide a great opportunity to simultaneously identify the locations of genomic rearrangements and to investigate the 3D genome organization in cancer cells. Even though Hi-C data has been used for validating known rearrangements, computational methods that can distinguish rearrangement signals from the inherent biases of Hi-C data and from the actual 3D conformation of chromatin, and can precisely detect rearrangement locations de novo have been missing.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In this work, we characterize how intra and inter-chromosomal Hi-C contacts are distributed for normal and rearranged chromosomes to devise a new set of algorithms (i) to identify genomic segments that correspond to CNV regions such as amplifications and deletions (HiCnv), (ii) to call inter-chromosomal translocations and their boundaries (HiCtrans) from Hi-C experiments and (iii) to simulate Hi-C data from genomes with desired rearrangements and abnormalities (AveSim) in order to select optimal parameters for and to benchmark the accuracy of our methods. Our results on 10 different cancer cell lines with Hi-C data show that we identify a total number of 105 amplifications and 45 deletions together with 90 translocations, whereas we identify virtually no such events for two karyotypically normal cell lines. Our CNV predictions correlate very well with whole genome sequencing data among chromosomes with CNV events for a breast cancer cell line (r\u2009=\u20090.89) and capture most of the CNVs we simulate using Avesim. For HiCtrans predictions, we report evidence from the literature for 30 out of 90 translocations for eight of our cancer cell lines. Furthermore, we show that our tools identify and correctly classify relatively understudied rearrangements such as double minutes and homogeneously staining regions. Considering the inherent limitations of existing techniques for karyotyping (i.e. missing balanced rearrangements and those near repetitive regions), the accurate identification of CNVs and translocations in a cost-effective and high-throughput setting is still a challenge. Our results show that the set of tools we develop effectively utilize moderately sequenced Hi-C libraries (100\u2013300 million reads) to identify known and de novo chromosomal rearrangements\/abnormalities in well-established cancer cell lines. With the decrease in required number of cells and the increase in attainable resolution, we believe that our framework will pave the way towards comprehensive mapping of genomic rearrangements in primary cells from cancer patients using Hi-C.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>CNV calling: https:\/\/github.com\/ay-lab\/HiCnv, Translocation calling: https:\/\/github.com\/ay-lab\/HiCtrans and Hi-C simulation: https:\/\/github.com\/ay-lab\/AveSim.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx664","type":"journal-article","created":{"date-parts":[[2017,10,17]],"date-time":"2017-10-17T07:10:08Z","timestamp":1508224208000},"page":"338-345","source":"Crossref","is-referenced-by-count":86,"title":["Identification of copy number variations and translocations in cancer cells from Hi-C data"],"prefix":"10.1093","volume":"34","author":[{"given":"Abhijit","family":"Chakraborty","sequence":"first","affiliation":[{"name":"Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0708-6914","authenticated-orcid":false,"given":"Ferhat","family":"Ay","sequence":"additional","affiliation":[{"name":"Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, CA, USA"},{"name":"School of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,10,18]]},"reference":[{"key":"2023012712231222300_btx664-B1","doi-asserted-by":"crossref","first-page":"999","DOI":"10.1101\/gr.160374.113","article-title":"Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts","volume":"24","author":"Ay","year":"2014","journal-title":"Genome Res"},{"key":"2023012712231222300_btx664-B2","doi-asserted-by":"crossref","first-page":"974","DOI":"10.1101\/gr.169417.113","article-title":"Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression","volume":"24","author":"Ay","year":"2014","journal-title":"Genome Res"},{"key":"2023012712231222300_btx664-B3","doi-asserted-by":"crossref","first-page":"121.","DOI":"10.1186\/s12864-015-1236-7","article-title":"Identifying multi-locus chromatin contacts in human cells using tethered multiple 3C","volume":"16","author":"Ay","year":"2015","journal-title":"BMC Genomics"},{"key":"2023012712231222300_btx664-B4","doi-asserted-by":"crossref","first-page":"739","DOI":"10.1126\/science.71759","article-title":"Double minute chromosomes and the homogeneously staining regions in chromosomes of a human neuroblastoma cell line","volume":"198","author":"Balaban-Malenbaum","year":"1977","journal-title":"Science"},{"key":"2023012712231222300_btx664-B5","doi-asserted-by":"crossref","first-page":"214.","DOI":"10.1186\/s13059-015-0768-0","article-title":"Chromatin interaction analysis reveals changes in small chromosome and telomere clustering between epithelial and breast cancer cells","volume":"16","author":"Barutcu","year":"2015","journal-title":"Genome Biol"},{"key":"2023012712231222300_btx664-B6","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.ceb.2013.10.002","article-title":"Large-scale chromatin organization: the good, the surprising, and the still perplexing","volume":"26","author":"Belmont","year":"2014","journal-title":"Curr. Opin. Cell Biol"},{"key":"2023012712231222300_btx664-B7","first-page":"1345","article-title":"Boundary adjusted density estimation and bandwidth selection","volume":"10","author":"Chiu","year":"2000","journal-title":"Stat. Sin"},{"key":"2023012712231222300_btx664-B8","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"Consortium EP","year":"2012","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B9","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.1054\/bjoc.2000.1458","article-title":"Molecular cytogenetic analysis of breast cancer cell lines","volume":"83","author":"Davidson","year":"2000","journal-title":"Br. J. Cancer"},{"key":"2023012712231222300_btx664-B10","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1007\/s10577-005-2168-x","article-title":"Array CGH technologies and their applications to cancer genomes","volume":"13","author":"Davies","year":"2005","journal-title":"Chromosome Res"},{"key":"2023012712231222300_btx664-B11","doi-asserted-by":"crossref","first-page":"1104","DOI":"10.1101\/gr.183699.114","article-title":"Topologically associating domains and their long-range contacts are established during early G1 coincident with the establishment of the replication-timing program","volume":"25","author":"Dileep","year":"2015","journal-title":"Genome Res"},{"key":"2023012712231222300_btx664-B12","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature11082","article-title":"Topological domains in mammalian genomes identified by analysis of chromatin interactions","volume":"485","author":"Dixon","year":"2012","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B13","author":"Dixon","year":"2017"},{"key":"2023012712231222300_btx664-B14","doi-asserted-by":"crossref","first-page":"e44196.","DOI":"10.1371\/journal.pone.0044196","article-title":"Three-dimensional genome architecture influences partner selection for chromosomal translocations in human disease","volume":"7","author":"Engreitz","year":"2012","journal-title":"PLoS One"},{"key":"2023012712231222300_btx664-B15","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1002\/ijc.23959","article-title":"Genomic instability and histone H3 phosphorylation induction by the Ras-mitogen activated protein kinase pathway in pancreatic cancer cells","volume":"124","author":"Espino","year":"2009","journal-title":"Int. J. Cancer"},{"key":"2023012712231222300_btx664-B16","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1016\/j.cell.2014.02.019","article-title":"A single oncogenic enhancer rearrangement causes concomitant EVI1 and GATA2 deregulation in leukemia","volume":"157","author":"Groschel","year":"2014","journal-title":"Cell"},{"key":"2023012712231222300_btx664-B17","doi-asserted-by":"crossref","first-page":"125.","DOI":"10.1186\/s13059-017-1253-8","article-title":"Hi-C as a tool for precise detection and characterisation of chromosomal rearrangements and copy number variation in human tumours","volume":"18","author":"Harewood","year":"2017","journal-title":"Genome Biol"},{"key":"2023012712231222300_btx664-B18","doi-asserted-by":"crossref","first-page":"3131","DOI":"10.1093\/bioinformatics\/bts570","article-title":"HiCNorm: removing biases in Hi-C data via Poisson regression","volume":"28","author":"Hu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012712231222300_btx664-B19","doi-asserted-by":"crossref","first-page":"999","DOI":"10.1038\/nmeth.2148","article-title":"Iterative correction of Hi-C data reveals hallmarks of chromosome organization","volume":"9","author":"Imakaev","year":"2012","journal-title":"Nat. Methods"},{"key":"2023012712231222300_btx664-B20","doi-asserted-by":"crossref","first-page":"1369","DOI":"10.1016\/j.cell.2016.09.037","article-title":"Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters","volume":"167","author":"Javierre","year":"2016","journal-title":"Cell"},{"key":"2023012712231222300_btx664-B21","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1038\/nature12644","article-title":"A high-resolution map of the three-dimensional chromatin interactome in human cells","volume":"503","author":"Jin","year":"2013","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B22","doi-asserted-by":"crossref","first-page":"3","DOI":"10.18637\/jss.v058.i03","article-title":"changepoint: an R package for changepoint analysis","volume":"58","author":"Killick","year":"2014","journal-title":"J. Stat. Softw"},{"key":"2023012712231222300_btx664-B23","doi-asserted-by":"crossref","first-page":"4181","DOI":"10.1093\/nar\/gkp552","article-title":"Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances","volume":"37","author":"LaFramboise","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2023012712231222300_btx664-B24","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1126\/science.1181369","article-title":"Comprehensive mapping of long-range interactions reveals folding principles of the human genome","volume":"326","author":"Lieberman-Aiden","year":"2009","journal-title":"Science"},{"key":"2023012712231222300_btx664-B25","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1016\/j.cell.2015.04.004","article-title":"Disruptions of topological chromatin domains cause pathogenic rewiring of gene\u2013enhancer interactions","volume":"161","author":"Lupianez","year":"2015","journal-title":"Cell"},{"key":"2023012712231222300_btx664-B26","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/j.tig.2016.01.003","article-title":"Breaking TADs: how alterations of chromatin domains result in disease","volume":"32","author":"Lupianez","year":"2016","journal-title":"Trends Genet"},{"key":"2023012712231222300_btx664-B27","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1038\/nmeth.3205","article-title":"Fine-scale chromatin interaction maps reveal the cis-regulatory landscape of human lincRNA genes","volume":"12","author":"Ma","year":"2015","journal-title":"Nat. Methods"},{"key":"2023012712231222300_btx664-B28","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1038\/nrc2091","article-title":"The impact of translocations and gene fusions on cancer causation","volume":"7","author":"Mitelman","year":"2007","journal-title":"Nat. Rev. Cancer"},{"key":"2023012712231222300_btx664-B30","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","article-title":"Circular binary segmentation for the analysis of array-based DNA copy number data","volume":"5","author":"Olshen","year":"2004","journal-title":"Biostatistics"},{"key":"2023012712231222300_btx664-B31","first-page":"113","article-title":"Characterization of two human lung adenocarcinoma cell lines by reciprocal chromosome painting","volume":"31","author":"Peng","year":"2010","journal-title":"Dongwuxue Yanjiu"},{"key":"2023012712231222300_btx664-B32","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1038\/nature13986","article-title":"Topologically associating domains are stable units of replication-timing regulation","volume":"515","author":"Pope","year":"2014","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B33","doi-asserted-by":"crossref","first-page":"1665","DOI":"10.1016\/j.cell.2014.11.021","article-title":"A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping","volume":"159","author":"Rao","year":"2014","journal-title":"Cell"},{"key":"2023012712231222300_btx664-B34","doi-asserted-by":"crossref","first-page":"8.","DOI":"10.1186\/1755-8166-7-8","article-title":"Differences and homologies of chromosomal alterations within and between breast cancer cell lines: a clustering analysis","volume":"7","author":"Rondon-Lagos","year":"2014","journal-title":"Mol. Cytogenet"},{"key":"2023012712231222300_btx664-B35","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1038\/243290a0","article-title":"Letter: A new consistent chromosomal abnormality in chronic myelogenous leukaemia identified by quinacrine fluorescence and Giemsa staining","volume":"243","author":"Rowley","year":"1973","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B36","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1038\/nature11279","article-title":"The long-range interaction landscape of gene promoters","volume":"489","author":"Sanyal","year":"2012","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B37","doi-asserted-by":"crossref","first-page":"13426.","DOI":"10.1038\/ncomms13426","article-title":"17q21 asthma-risk variants switch CTCF binding and regulate IL-2 production by T cells","volume":"7","author":"Schmiedel","year":"2016","journal-title":"Nat. Commun"},{"key":"2023012712231222300_btx664-B38","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1126\/science.273.5274.494","article-title":"Multicolor spectral karyotyping of human chromosomes","volume":"273","author":"Schrock","year":"1996","journal-title":"Science"},{"key":"2023012712231222300_btx664-B39","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1158\/1541-7786.MCR-16-0374","article-title":"Nucleome analysis reveals structure-function relationships for colon cancer","volume":"15","author":"Seaman","year":"2017","journal-title":"Mol. Cancer Res"},{"key":"2023012712231222300_btx664-B40","doi-asserted-by":"crossref","first-page":"259.","DOI":"10.1186\/s13059-015-0831-x","article-title":"HiC-Pro: an optimized and flexible pipeline for Hi-C data processing","volume":"16","author":"Servant","year":"2015","journal-title":"Genome Biol"},{"key":"2023012712231222300_btx664-B41","doi-asserted-by":"crossref","first-page":"1049","DOI":"10.1016\/j.cell.2015.02.040","article-title":"The role of chromosome domains in shaping the functional genome","volume":"160","author":"Sexton","year":"2015","journal-title":"Cell"},{"key":"2023012712231222300_btx664-B42","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1038\/ng0496-368","article-title":"Karyotyping human chromosomes by combinatorial multi-fluor FISH","volume":"12","author":"Speicher","year":"1996","journal-title":"Nat. Genet"},{"key":"2023012712231222300_btx664-B43","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature21429","article-title":"3D structures of individual mammalian genomes studied by single-cell Hi-C","volume":"544","author":"Stevens","year":"2017","journal-title":"Nature"},{"key":"2023012712231222300_btx664-B44","doi-asserted-by":"crossref","first-page":"1198","DOI":"10.1101\/gr.106252.110","article-title":"Gene amplification as double minutes or homogeneously staining regions in solid tumors: origin and structure","volume":"20","author":"Storlazzi","year":"2010","journal-title":"Genome Res"},{"key":"2023012712231222300_btx664-B45","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1126\/science.2554494","article-title":"p53: a frequent target for genetic abnormalities in lung cancer","volume":"246","author":"Takahashi","year":"1989","journal-title":"Science"},{"key":"2023012712231222300_btx664-B46","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1080\/10618600.1994.10474656","article-title":"Fast computation of multivariate Kernel estimators","volume":"3","author":"Wand","year":"1994","journal-title":"J. Comput. Graph. Stat"},{"key":"2023012712231222300_btx664-B47","doi-asserted-by":"crossref","first-page":"1134","DOI":"10.1038\/ng.2760","article-title":"Pan-cancer patterns of somatic copy number alteration","volume":"45","author":"Zack","year":"2013","journal-title":"Nat. Genet"},{"key":"2023012712231222300_btx664-B48","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1038\/nbt.3432","article-title":"Haplotyping germline and cancer genomes with high-throughput linked-read sequencing","volume":"34","author":"Zheng","year":"2016","journal-title":"Nat. Biotechnol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/2\/338\/48912983\/bioinformatics_34_2_338.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/2\/338\/48912983\/bioinformatics_34_2_338.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,26]],"date-time":"2025-06-26T10:53:53Z","timestamp":1750935233000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/2\/338\/4557186"}},"subtitle":[],"editor":[{"given":"Christina","family":"Curtis","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,10,18]]},"references-count":47,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx664","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/179275","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,1,15]]},"published":{"date-parts":[[2017,10,18]]}}}