{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T19:50:29Z","timestamp":1768593029345,"version":"3.49.0"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010636","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,11,8]],"date-time":"2022-11-08T00:00:00Z","timestamp":1667865600000}}],"reference-count":31,"publisher":"Public Library of Science (PLoS)","issue":"10","license":[{"start":{"date-parts":[[2022,10,27]],"date-time":"2022-10-27T00:00:00Z","timestamp":1666828800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["K01MH123896"],"award-info":[{"award-number":["K01MH123896"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01HG012572"],"award-info":[{"award-number":["R01HG012572"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["U01DA053628"],"award-info":[{"award-number":["U01DA053628"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["5R01DA051906"],"award-info":[{"award-number":["5R01DA051906"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Early and accurate detection of viruses in clinical and environmental samples is essential for effective public healthcare, treatment, and therapeutics. While PCR detects potential pathogens with high sensitivity, it is difficult to scale and requires knowledge of the exact sequence of the pathogen. With the advent of next-gen single-cell sequencing, it is now possible to scrutinize viral transcriptomics at the finest possible resolution\u2013cells. This newfound ability to investigate individual cells opens new avenues to understand viral pathophysiology with unprecedented resolution. To leverage this ability, we propose an efficient and accurate computational pipeline, named Venus, for virus detection and integration site discovery in both single-cell and bulk-tissue RNA-seq data. Specifically, Venus addresses two main questions: whether a tissue\/cell type is infected by viruses or a virus of interest? And if infected, whether and where has the virus inserted itself into the human genome? Our analysis can be broken into two parts\u2013validation and discovery. Firstly, for validation, we applied Venus on well-studied viral datasets, such as HBV- hepatocellular carcinoma and HIV-infection treated with antiretroviral therapy. Secondly, for discovery, we analyzed datasets such as HIV-infected neurological patients and deeply sequenced T-cells. We detected viral transcripts in the novel target of the brain and high-confidence integration sites in immune cells. In conclusion, here we describe Venus, a publicly available software which we believe will be a valuable virus investigation tool for the scientific community at large.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010636","type":"journal-article","created":{"date-parts":[[2022,10,27]],"date-time":"2022-10-27T18:04:13Z","timestamp":1666893853000},"page":"e1010636","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":5,"title":["Venus: An efficient virus infection detection and fusion site discovery method using single-cell and bulk RNA-seq data"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3039-4894","authenticated-orcid":true,"given":"Che Yu","family":"Lee","sequence":"first","affiliation":[]},{"given":"Yuhang","family":"Chen","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4696-0042","authenticated-orcid":true,"given":"Ziheng","family":"Duan","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0881-5891","authenticated-orcid":true,"given":"Min","family":"Xu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1647-326X","authenticated-orcid":true,"given":"Matthew J.","family":"Girgenti","sequence":"additional","affiliation":[]},{"given":"Ke","family":"Xu","sequence":"additional","affiliation":[]},{"given":"Mark","family":"Gerstein","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5970-0509","authenticated-orcid":true,"given":"Jing","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,10,27]]},"reference":[{"issue":"32","key":"pcbi.1010636.ref001","doi-asserted-by":"crossref","first-page":"5798","DOI":"10.1002\/anie.200901917","article-title":"The search for infectious causes of human cancers: where and why (Nobel lecture)","volume":"48","author":"H. zur Hausen","year":"2009","journal-title":"Angew Chem Int Ed Engl"},{"issue":"COVID19-S4","key":"pcbi.1010636.ref002","doi-asserted-by":"crossref","first-page":"S73","DOI":"10.12669\/pjms.36.COVID19-S4.2638","article-title":"Coronavirus Disease 2019 (COVID-19) Pandemic and Economic Impact","volume":"36","author":"T Ahmad","year":"2020","journal-title":"Pak J Med Sci."},{"issue":"2","key":"pcbi.1010636.ref003","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1093\/bioinformatics\/bts665","article-title":"VirusSeq: software to identify viruses and their integration sites using next-generation sequencing of human cancer tissue","volume":"29","author":"Y Chen","year":"2013","journal-title":"Bioinformatics"},{"issue":"15","key":"pcbi.1010636.ref004","doi-asserted-by":"crossref","first-page":"2027","DOI":"10.1093\/bioinformatics\/btr349","article-title":"Pathogen detection using short-RNA deep sequencing subtraction and assembly","volume":"27","author":"O Isakov","year":"2011","journal-title":"Bioinformatics"},{"issue":"4","key":"pcbi.1010636.ref005","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1002\/1878-0261.12435","article-title":"Detection of human papillomavirus in cases of head and neck squamous cell carcinoma by RNA-seq and VirTect","volume":"13","author":"A Khan","year":"2019","journal-title":"Mol Oncol"},{"issue":"5","key":"pcbi.1010636.ref006","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1038\/nbt.1868","article-title":"PathSeq: software to identify or discover microbes by deep sequencing of human tissue","volume":"29","author":"AD Kostic","year":"2011","journal-title":"Nat Biotechnol"},{"key":"pcbi.1010636.ref007","doi-asserted-by":"crossref","first-page":"14049","DOI":"10.1038\/ncomms14049","article-title":"Massively parallel digital transcriptional profiling of single cells","volume":"8","author":"GX Zheng","year":"2017","journal-title":"Nat Commun."},{"issue":"4","key":"pcbi.1010636.ref008","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1038\/nprot.2017.149","article-title":"Exponential scaling of single-cell RNA-seq in the past decade","volume":"13","author":"V Svensson","year":"2018","journal-title":"Nat Protoc"},{"issue":"4","key":"pcbi.1010636.ref009","doi-asserted-by":"crossref","DOI":"10.1128\/mBio.01037-20","article-title":"Interactions of Monocytes, HIV, and ART Identified by an Innovative scRNAseq Pipeline: Pathways to Reservoirs and HIV-Associated Comorbidities","volume":"11","author":"R Le\u00f3n-Rivera","year":"2020","journal-title":"mBio"},{"issue":"10","key":"pcbi.1010636.ref010","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1093\/bioinformatics\/btaa859","article-title":"VIRTUS: a pipeline for comprehensive virus analysis from conventional RNA-seq data","volume":"37","author":"Y Yasumizu","year":"2021","journal-title":"Bioinformatics"},{"issue":"7","key":"pcbi.1010636.ref011","doi-asserted-by":"crossref","first-page":"1475","DOI":"10.1016\/j.cell.2020.05.006","article-title":"Host-Viral Infection Maps Reveal Signatures of Severe COVID-19 Patients","volume":"181","author":"P Bost","year":"2020","journal-title":"Cell"},{"key":"pcbi.1010636.ref012","article-title":"Viral Integration and Consequences on Host Gene Expression","author":"S Desfarges","year":"2012","journal-title":"Viruses: Essential Agents of Life"},{"key":"pcbi.1010636.ref013","article-title":"Retrovirus infection and reverse transcription","author":"E. Britannica","year":"2012","journal-title":"https:\/\/www.britannica.com\/science\/reverse-transcriptase#\/media\/1\/500460\/124682:Encyclop\u00e6dia Britannica"},{"key":"pcbi.1010636.ref014","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1007\/978-1-4939-3572-7_13","article-title":"Optimizing RNA-Seq Mapping with STAR","volume":"1415","author":"A Dobin","year":"2016","journal-title":"Methods Mol Biol"},{"issue":"1","key":"pcbi.1010636.ref015","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1186\/1759-8753-4-5","article-title":"Conserved structure and inferred evolutionary history of long terminal repeats (LTRs)","volume":"4","author":"F Benachenhou","year":"2013","journal-title":"Mob DNA"},{"issue":"2","key":"pcbi.1010636.ref016","doi-asserted-by":"crossref","first-page":"MDNA3-0027-2014","DOI":"10.1128\/microbiolspec.MDNA3-0027-2014","article-title":"Reverse Transcription of Retroviruses and LTR Retrotransposons","volume":"3","author":"SH Hughes","year":"2015","journal-title":"Microbiol Spectr."},{"key":"pcbi.1010636.ref017","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1186\/s12977-015-0205-1","article-title":"Gene activity in primary T cells infected with HIV89.6: intron retention and induction of genomic repeats","volume":"12","author":"S Sherrill-Mix","year":"2015","journal-title":"Retrovirology"},{"key":"pcbi.1010636.ref018","article-title":"Trim Galore","author":"F. Krueger","year":"2012","journal-title":"Babraham Bioinformatics"},{"issue":"5","key":"pcbi.1010636.ref019","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"A Butler","year":"2018","journal-title":"Nat Biotechnol"},{"issue":"Suppl B","key":"pcbi.1010636.ref020","first-page":"63","article-title":"HBV and liver cancer","volume":"60","author":"N. Leung","year":"2005","journal-title":"Med J Malaysia"},{"issue":"13","key":"pcbi.1010636.ref021","article-title":"Distinct Patterns of HBV Integration and","volume":"22","author":"JW Jang","year":"2021","journal-title":"Int J Mol Sci."},{"issue":"10","key":"pcbi.1010636.ref022","first-page":"a003236","article-title":"Oncogenes and tumor suppressor genes","volume":"2","author":"EY Lee","year":"2010","journal-title":"Cold Spring Harb Perspect Biol"},{"issue":"8","key":"pcbi.1010636.ref023","doi-asserted-by":"crossref","first-page":"1111","DOI":"10.1592\/phco.26.8.1111","article-title":"An update and review of antiretroviral therapy","volume":"26","author":"FJ Piacenti","year":"2006","journal-title":"Pharmacotherapy"},{"key":"pcbi.1010636.ref024","doi-asserted-by":"crossref","first-page":"646936","DOI":"10.3389\/fgene.2021.646936","article-title":"A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data","volume":"12","author":"R Xiang","year":"2021","journal-title":"Front Genet."},{"issue":"6","key":"pcbi.1010636.ref025","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1016\/j.tim.2012.03.009","article-title":"Viral disruption of the blood-brain barrier","volume":"20","author":"KR Spindler","year":"2012","journal-title":"Trends Microbiol"},{"key":"pcbi.1010636.ref026","doi-asserted-by":"crossref","first-page":"397","DOI":"10.3389\/fimmu.2016.00397","article-title":"Targeting the Brain Reservoirs: Toward an HIV Cure","volume":"7","author":"C Marban","year":"2016","journal-title":"Front Immunol"},{"issue":"11","key":"pcbi.1010636.ref027","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1016\/S1473-3099(13)70269-X","article-title":"HIV-associated neurocognitive disorder","volume":"13","author":"DB Clifford","year":"2013","journal-title":"Lancet Infect Dis"},{"key":"pcbi.1010636.ref028","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1146\/annurev.med.59.062806.123001","article-title":"Hide-and-seek: the challenge of viral persistence in HIV-1 infection","volume":"59","author":"L Geeraert","year":"2008","journal-title":"Annu Rev Med"},{"key":"pcbi.1010636.ref029","doi-asserted-by":"crossref","first-page":"676693","DOI":"10.3389\/fmicb.2021.676693","article-title":"SARS-CoV-2-Host Chimeric RNA-Sequencing Reads Do Not Necessarily Arise From Virus Integration Into the Host DNA","volume":"12","author":"A Kazachenka","year":"2021","journal-title":"Front Microbiol"},{"issue":"10","key":"pcbi.1010636.ref030","doi-asserted-by":"crossref","first-page":"e1005931","DOI":"10.1371\/journal.ppat.1005931","article-title":"HIV-1 Integrates Widely throughout the Genome of the Human Blood Fluke Schistosoma mansoni","volume":"12","author":"S Suttiprapa","year":"2016","journal-title":"PLoS Pathog"},{"issue":"31","key":"pcbi.1010636.ref031","doi-asserted-by":"crossref","first-page":"8783","DOI":"10.1073\/pnas.1609057113","article-title":"Defective HIV-1 proviruses produce novel protein-coding RNA species in HIV-infected patients on combination antiretroviral therapy","volume":"113","author":"H Imamichi","year":"2016","journal-title":"Proc Natl Acad Sci U S A"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010636","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,11,8]],"date-time":"2022-11-08T00:00:00Z","timestamp":1667865600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010636","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,8]],"date-time":"2022-11-08T18:58:24Z","timestamp":1667933904000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010636"}},"subtitle":[],"editor":[{"given":"Zhaolei","family":"Zhang","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,10,27]]},"references-count":31,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2022,10,27]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010636","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1010636","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,27]]}}}