{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:24:47Z","timestamp":1760171087019,"version":"3.37.3"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2018,8,8]],"date-time":"2018-08-08T00:00:00Z","timestamp":1533686400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100010941","name":"University of Vermont","doi-asserted-by":"publisher","award":["126773-IRG 14\u2013196-01"],"award-info":[{"award-number":["126773-IRG 14\u2013196-01"]}],"id":[{"id":"10.13039\/100010941","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,11,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Many viruses are capable of integrating in the human genome, particularly viruses involved in tumorigenesis. Viral integrations can be considered genetic markers for discovering virus-caused cancers and inferring cancer cell development. Next-generation sequencing (NGS) technologies have been widely used to screen for viral integrations in cancer genomes, and a number of bioinformatics tools have been developed to detect viral integrations using NGS data. However, there has been no systematic comparison of the methods or software. In this study, we performed a comprehensive comparative analysis of the designs, performance, functionality and limitations among the existing methods and software for detecting viral integrations. We further compared the sensitivity, precision and runtime of integration detection of four representative tools. Our analyses showed that each of the existing software had its own merits; however, none of them were sufficient for parallel or accurate virome-wide detection. After carefully evaluating the limitations shared by the existing methods, we proposed strategies and directions for developing virome-wide integration detection.<\/jats:p>","DOI":"10.1093\/bib\/bby070","type":"journal-article","created":{"date-parts":[[2018,7,14]],"date-time":"2018-07-14T14:48:43Z","timestamp":1531579723000},"page":"2088-2097","source":"Crossref","is-referenced-by-count":26,"title":["Comprehensive comparative analysis of methods and software for identifying viral integrations"],"prefix":"10.1093","volume":"20","author":[{"given":"Xun","family":"Chen","sequence":"first","affiliation":[{"name":"Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, Vermont 05405, USA"}]},{"given":"Jason","family":"Kost","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, Vermont 05405, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9214-4487","authenticated-orcid":false,"given":"Dawei","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, Vermont 05405, USA"},{"name":"Department of Computer Science, University of Vermont, Burlington, Vermont 05405, USA"},{"name":"Neuroscience, Behavior, and Health Initiative, University of Vermont, Burlington, Vermont 05405, USA"},{"name":"Cancer Center, University of Vermont, Burlington, Vermont 05405, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,8,8]]},"reference":[{"issue":"6","key":"2020011102350340100_ref1","doi-asserted-by":"crossref","first-page":"4897","DOI":"10.1002\/jcb.26717","article-title":"Searching for human oncoviruses: histories, challenges, and opportunities","volume":"119","author":"Cao","year":"2018","journal-title":"J Cell Biochem"},{"issue":"12","key":"2020011102350340100_ref2","doi-asserted-by":"crossref","first-page":"878","DOI":"10.1038\/nrc2961","article-title":"Why do viruses cause cancer? Highlights of the first century of human tumour virology","volume":"10","author":"Moore","year":"2010","journal-title":"Nat Rev Cancer"},{"issue":"7","key":"2020011102350340100_ref3","doi-asserted-by":"crossref","first-page":"760","DOI":"10.1038\/ng.2291","article-title":"Whole-genome sequencing of liver cancers identifies etiological influences on mutation patterns and recurrent mutations in chromatin regulators","volume":"44","author":"Fujimoto","year":"2012","journal-title":"Nat Genet"},{"issue":"7","key":"2020011102350340100_ref4","doi-asserted-by":"crossref","first-page":"765","DOI":"10.1038\/ng.2295","article-title":"Genome-wide survey of recurrent HBV integration in hepatocellular carcinoma","volume":"44","author":"Sung","year":"2012","journal-title":"Nat Genet"},{"issue":"2","key":"2020011102350340100_ref5","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1038\/ng.3178","article-title":"Genome-wide profiling of HPV integration in cervical cancer identifies clustered genomic hot spots and a potential microhomology-mediated integration mechanism","volume":"47","author":"Hu","year":"2015","journal-title":"Nat Genet"},{"issue":"11","key":"2020011102350340100_ref6","doi-asserted-by":"crossref","first-page":"1304","DOI":"10.1038\/ng.3415","article-title":"Integrated molecular analysis of adult T cell leukemia\/lymphoma","volume":"47","author":"Kataoka","year":"2015","journal-title":"Nat Genet"},{"issue":"5866","key":"2020011102350340100_ref7","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1126\/science.1152586","article-title":"Clonal integration of a polyomavirus in human Merkel cell carcinoma","volume":"319","author":"Feng","year":"2008","journal-title":"Science"},{"issue":"23","key":"2020011102350340100_ref8","doi-asserted-by":"crossref","first-page":"12100","DOI":"10.1128\/JVI.01169-10","article-title":"Herpesviruses and chromosomal integration","volume":"84","author":"Morissette","year":"2010","journal-title":"J Virol"},{"issue":"2","key":"2020011102350340100_ref9","doi-asserted-by":"crossref","first-page":"214","DOI":"10.7150\/jca.13150","article-title":"Genome-wide analysis of Epstein-Barr Virus (EBV) integration and strain in C666-1 and Raji cells","volume":"7","author":"Xiao","year":"2016","journal-title":"J Cancer"},{"issue":"10","key":"2020011102350340100_ref10","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1038\/ng.3389","article-title":"Recurrent AAV2-related insertional mutagenesis in human hepatocellular carcinomas","volume":"47","author":"Nault","year":"2015","journal-title":"Nat Genet"},{"issue":"20","key":"2020011102350340100_ref11","doi-asserted-by":"crossref","first-page":"9964","DOI":"10.1128\/JVI.01299-08","article-title":"Integration site preference of xenotropic murine leukemia virus-related virus, a new human retrovirus associated with prostate cancer","volume":"82","author":"Kim","year":"2008","journal-title":"J Virol"},{"issue":"3","key":"2020011102350340100_ref12","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1016\/j.ccr.2014.01.030","article-title":"Viral-human chimeric transcript predisposes risk to liver cancer development and progression","volume":"25","author":"Lau","year":"2014","journal-title":"Cancer Cell"},{"issue":"2","key":"2020011102350340100_ref13","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1101\/gr.164806.113","article-title":"Genome-wide analysis of HPV integration in human cancers reveals recurrent, focal genomic instability","volume":"24","author":"Akagi","year":"2014","journal-title":"Genome Res"},{"issue":"1","key":"2020011102350340100_ref14","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1128\/JVI.02570-14","article-title":"High-throughput RNA sequencing-based virome analysis of 50 lymphoma cell lines from the Cancer Cell Line Encyclopedia project","volume":"89","author":"Cao","year":"2015","journal-title":"J Virol"},{"issue":"4","key":"2020011102350340100_ref15","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0123175","article-title":"Identification of HBV-MLL4 integration and its molecular basis in chinese hepatocellular carcinoma","volume":"10","author":"Dong","year":"2015","journal-title":"PLoS ONE"},{"key":"2020011102350340100_ref16","doi-asserted-by":"crossref","DOI":"10.1038\/npjgenmed.2016.4","article-title":"Mechanistic signatures of HPV insertions in cervical carcinomas","volume":"1","author":"Holmes","year":"2016","journal-title":"Npj Genom Med"},{"key":"2020011102350340100_ref17","doi-asserted-by":"crossref","first-page":"348","DOI":"10.1186\/s12859-014-0348-4","article-title":"SummonChimera infers integrated viral genomes with nucleotide precision from NGS data","volume":"15","author":"Katz","year":"2014","journal-title":"BMC Bioinformatics"},{"issue":"7488","key":"2020011102350340100_ref18","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/nature12881","article-title":"Landscape of genomic alterations in cervical carcinomas","volume":"506","author":"Ojesina","year":"2014","journal-title":"Nature"},{"issue":"5","key":"2020011102350340100_ref19","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1038\/nbt.1868","article-title":"PathSeq: software to identify or discover microbes by deep sequencing of human tissue","volume":"29","author":"Kostic","year":"2011","journal-title":"Nat Biotechnol"},{"issue":"5","key":"2020011102350340100_ref20","doi-asserted-by":"crossref","first-page":"720","DOI":"10.1093\/bioinformatics\/btq722","article-title":"Identifying viral integration sites using SeqMap 2.0","volume":"27","author":"Hawkins","year":"2011","journal-title":"Bioinformatics"},{"key":"2020011102350340100_ref21","doi-asserted-by":"crossref","DOI":"10.1038\/srep11534","article-title":"Vy-PER: eliminating false positive detection of virus integration events in next generation sequencing data","volume":"5","author":"Forster","year":"2015","journal-title":"Sci Rep"},{"issue":"5","key":"2020011102350340100_ref22","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1093\/bioinformatics\/btt011","article-title":"ViralFusionSeq: accurately discover viral integration events and reconstruct fusion transcripts at single-base resolution","volume":"29","author":"Li","year":"2013","journal-title":"Bioinformatics"},{"issue":"Suppl 3","key":"2020011102350340100_ref23","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1186\/s12859-017-1470-x","article-title":"BATVI: fast, sensitive and accurate detection of virus integrations","volume":"18","author":"Tennakoon","year":"2017","journal-title":"BMC Bioinformatics"},{"issue":"25","key":"2020011102350340100_ref24","doi-asserted-by":"crossref","first-page":"20959","DOI":"10.18632\/oncotarget.4187","article-title":"Virus-Clip: a fast and memory-efficient viral integration site detection tool at single-base resolution with annotation capability","volume":"6","author":"Ho","year":"2015","journal-title":"Oncotarget"},{"issue":"2","key":"2020011102350340100_ref25","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1093\/bioinformatics\/bts665","article-title":"VirusSeq: software to identify viruses and their integration sites using next-generation sequencing of human cancer tissue","volume":"29","author":"Chen","year":"2013","journal-title":"Bioinformatics"},{"issue":"2","key":"2020011102350340100_ref26","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1093\/bioinformatics\/btw591","article-title":"Seeksv: an accurate tool for somatic structural variation and virus integration detection","volume":"33","author":"Liang","year":"2017","journal-title":"Bioinformatics"},{"issue":"7","key":"2020011102350340100_ref27","doi-asserted-by":"crossref","first-page":"3309","DOI":"10.1093\/nar\/gky180","article-title":"ViFi: accurate detection of viral integration and mRNA fusion reveals indiscriminate and unregulated transcription in proximal genomic regions in cervical cancer","volume":"46","author":"Nguyen","year":"2018","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2020011102350340100_ref28","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/s13073-015-0126-6","article-title":"VERSE: a novel approach to detect virus integration in host genomes through reference genome customization","volume":"7","author":"Wang","year":"2015","journal-title":"Genome Med"},{"issue":"5","key":"2020011102350340100_ref29","article-title":"VirusFinder: software for efficient and accurate detection of viruses and their integration sites in host genomes through next generation sequencing data","volume":"8","author":"Wang","year":"2013","journal-title":"PLoS ONE"},{"issue":"8","key":"2020011102350340100_ref30","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1093\/bioinformatics\/bts100","article-title":"Rapid identification of non-human sequences in high-throughput sequencing datasets","volume":"28","author":"Bhaduri","year":"2012","journal-title":"Bioinformatics"},{"key":"2020011102350340100_ref31","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1186\/1471-2105-13-206","article-title":"CaPSID: a bioinformatics platform for computational pathogen sequence identification in human genomes and transcriptomes","volume":"13","author":"Borozan","year":"2012","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"2020011102350340100_ref32","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1093\/bioinformatics\/bts684","article-title":"READSCAN: a fast and scalable pathogen discovery program with accurate genome relative abundance estimation","volume":"29","author":"Naeem","year":"2013","journal-title":"Bioinformatics"},{"issue":"7","key":"2020011102350340100_ref33","doi-asserted-by":"crossref","first-page":"1180","DOI":"10.1101\/gr.171934.113","article-title":"A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples","volume":"24","author":"Naccache","year":"2014","journal-title":"Genome Res"},{"key":"2020011102350340100_ref34","doi-asserted-by":"crossref","first-page":"D339","DOI":"10.1093\/nar\/gkl1004","article-title":"Genome Information Broker for Viruses (GIB-V): database for comparative analysis of virus genomes","volume":"35","author":"Hirahata","year":"2007","journal-title":"Nucleic Acids Res"},{"issue":"11","key":"2020011102350340100_ref35","doi-asserted-by":"crossref","first-page":"1533","DOI":"10.1093\/bioinformatics\/bts187","article-title":"pIRS: Profile-based Illumina pair-end reads simulator","volume":"28","author":"Hu","year":"2012","journal-title":"Bioinformatics"},{"article-title":"Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM","year":"2013","author":"Li","key":"2020011102350340100_ref36"},{"issue":"16","key":"2020011102350340100_ref37","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2020011102350340100_ref38","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"issue":"16","key":"2020011102350340100_ref39","doi-asserted-by":"crossref","first-page":"e164","DOI":"10.1093\/nar\/gkq603","article-title":"ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data","volume":"38","author":"Wang","year":"2010","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"2020011102350340100_ref40","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0090581","article-title":"MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping","volume":"9","author":"Lee","year":"2014","journal-title":"PLoS ONE"},{"issue":"4","key":"2020011102350340100_ref41","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with Bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat Methods"},{"issue":"7","key":"2020011102350340100_ref42","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1038\/nbt.1883","article-title":"Full-length transcriptome assembly from RNA-Seq data without a reference genome","volume":"29","author":"Grabherr","year":"2011","journal-title":"Nat Biotechnol"},{"issue":"15","key":"2020011102350340100_ref43","doi-asserted-by":"crossref","first-page":"1895","DOI":"10.1093\/bioinformatics\/btq293","article-title":"SVDetect: a tool to identify genomic structural variations from paired-end and mate-pair sequencing data","volume":"26","author":"Zeitouni","year":"2010","journal-title":"Bioinformatics"},{"issue":"8","key":"2020011102350340100_ref44","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1038\/nmeth.1628","article-title":"CREST maps somatic structural variation in cancer genomes with base-pair resolution","volume":"8","author":"Wang","year":"2011","journal-title":"Nat Methods"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/6\/2088\/31789367\/bby070.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/6\/2088\/31789367\/bby070.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,1,11]],"date-time":"2020-01-11T02:43:27Z","timestamp":1578710607000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/20\/6\/2088\/5066709"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8,8]]},"references-count":44,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2018,8,8]]},"published-print":{"date-parts":[[2019,11,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bby070","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2019,11]]},"published":{"date-parts":[[2018,8,8]]}}}