{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,5]],"date-time":"2024-05-05T00:26:20Z","timestamp":1714868780829},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,5,4]],"date-time":"2024-05-04T00:00:00Z","timestamp":1714780800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,4]],"date-time":"2024-05-04T00:00:00Z","timestamp":1714780800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Ministry of Science and Technology of the People\u00b4s Republic of China","award":["No. ZK20230149","2022FY101104"],"award-info":[{"award-number":["No. ZK20230149","2022FY101104"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Hepatitis B virus (HBV) integrates into human chromosomes and can lead to genomic instability and hepatocarcinogenesis. Current tools for HBV integration site detection lack accuracy and stability.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>This study proposes a deep learning-based method, named ViroISDC, for detecting integration sites. ViroISDC generates corresponding grammar rules and encodes the characteristics of the language data to predict integration sites accurately. Compared with Lumpy, Pindel, Seeksv, and SurVirus, ViroISDC exhibits better overall performance and is less sensitive to sequencing depth and integration sequence length, displaying good reliability, stability, and generality. Further downstream analysis of integrated sites detected by ViroISDC reveals the integration patterns and features of HBV. It is observed that HBV integration exhibits specific chromosomal preferences and tends to integrate into cancerous tissue. Moreover, HBV integration frequency was higher in males than females, and high-frequency integration sites were more likely to be present on hepatocarcinogenesis- and anti-cancer-related genes, validating the reliability of the ViroISDC.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>ViroISDC pipeline exhibits superior precision, stability, and reliability across various datasets when compared to similar software. It is invaluable in exploring HBV infection in the human body, holding significant implications for the diagnosis, treatment, and prognosis assessment of HCC.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-024-05763-0","type":"journal-article","created":{"date-parts":[[2024,5,4]],"date-time":"2024-05-04T12:01:49Z","timestamp":1714824109000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["ViroISDC: a method for calling integration sites of hepatitis B virus based on feature encoding"],"prefix":"10.1186","volume":"25","author":[{"given":"Lei","family":"Qiao","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chang","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Lin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaoqi","family":"He","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jia","family":"Mi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yigang","family":"Tong","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingyang","family":"Gao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,5,4]]},"reference":[{"key":"5763_CR1","doi-asserted-by":"publisher","first-page":"209","DOI":"10.3322\/caac.21660","volume":"71","author":"H Sung","year":"2021","unstructured":"Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209\u201349.","journal-title":"CA Cancer J Clin"},{"key":"5763_CR2","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1146\/annurev-genom-090711-163752","volume":"13","author":"Z-G Han","year":"2012","unstructured":"Han Z-G. Functional genomic studies: insights into the pathogenesis of liver cancer. Annu Rev Genomics Hum Genet. 2012;13:171\u2013205.","journal-title":"Annu Rev Genomics Hum Genet"},{"key":"5763_CR3","doi-asserted-by":"publisher","first-page":"2166","DOI":"10.1038\/sj.onc.1210279","volume":"26","author":"SP Hussain","year":"2007","unstructured":"Hussain SP, Schwank J, Staib F, Wang XW, Harris CC. TP53 mutations and hepatocellular carcinoma: insights into the etiology and pathogenesis of liver cancer. Oncogene. 2007;26:2166\u201376.","journal-title":"Oncogene"},{"key":"5763_CR4","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1007\/s40471-019-0183-2","volume":"6","author":"T VoPham","year":"2019","unstructured":"VoPham T. Environmental risk factors for liver cancer and nonalcoholic fatty liver disease. Curr Epidemiol Rep. 2019;6:50\u201366.","journal-title":"Curr Epidemiol Rep"},{"key":"5763_CR5","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1007\/s12029-017-9957-2","volume":"48","author":"M Ozturk","year":"2017","unstructured":"Ozturk M, Batur T, Ekin U, Erdogan A, \u0130scan E, Keles U, Oz O, Ozen C. Molecular pathogenesis of liver cancer. J Gastrointest Cancer. 2017;48:222\u20134.","journal-title":"J Gastrointest Cancer"},{"key":"5763_CR6","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1016\/j.jhep.2019.11.006","volume":"72","author":"J-C Nault","year":"2020","unstructured":"Nault J-C, Cheng A-L, Sangro B, Llovet JM. Milestones in the pathogenesis and management of primary liver cancer. J Hepatol. 2020;72:209\u201314.","journal-title":"J Hepatol"},{"key":"5763_CR7","doi-asserted-by":"publisher","first-page":"12992","DOI":"10.1038\/ncomms12992","volume":"7","author":"L-H Zhao","year":"2016","unstructured":"Zhao L-H, Liu X, Yan H-X, et al. Genomic and oncogenic preference of HBV integration in hepatocellular carcinoma. Nat Commun. 2016;7:12992.","journal-title":"Nat Commun"},{"key":"5763_CR8","doi-asserted-by":"publisher","first-page":"41","DOI":"10.2147\/JHC.S61146","volume":"3","author":"J Balogh","year":"2016","unstructured":"Balogh J, Victor D, Asham EH, Burroughs SG, Boktour M, Saharia A, Li X, Ghobrial RM, Monsour HP. Hepatocellular carcinoma: a review. J Hepatocell Carcinoma. 2016;3:41\u201353.","journal-title":"J Hepatocell Carcinoma"},{"key":"5763_CR9","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1016\/j.neures.2013.11.007","volume":"80","author":"S Uemura","year":"2014","unstructured":"Uemura S, Nagaoka T, Yokoyama M, Igarashi M, Kishi M. A simple and highly efficient method to identify the integration site of a transgene in the animal genome. Neurosci Res. 2014;80:91\u20134.","journal-title":"Neurosci Res"},{"key":"5763_CR10","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1038\/nrgastro.2013.107","volume":"10","author":"B Hajarizadeh","year":"2013","unstructured":"Hajarizadeh B, Grebely J, Dore GJ. Epidemiology and natural history of HCV infection. Nat Rev Gastroenterol Hepatol. 2013;10:553\u201362.","journal-title":"Nat Rev Gastroenterol Hepatol"},{"key":"5763_CR11","doi-asserted-by":"publisher","first-page":"25075","DOI":"10.18632\/oncotarget.25308","volume":"9","author":"M Furuta","year":"2018","unstructured":"Furuta M, Tanaka H, Shiraishi Y, et al. Characterization of HBV integration patterns and timing in liver cancer and HBV-infected livers. Oncotarget. 2018;9:25075\u201388.","journal-title":"Oncotarget"},{"key":"5763_CR12","doi-asserted-by":"publisher","first-page":"593","DOI":"10.1101\/gr.133926.111","volume":"22","author":"Z Jiang","year":"2012","unstructured":"Jiang Z, Jhunjhunwala S, Liu J, et al. The effects of hepatitis B virus integration into the genomes of hepatocellular carcinoma patients. Genome Res. 2012;22:593\u2013601.","journal-title":"Genome Res"},{"key":"5763_CR13","doi-asserted-by":"publisher","DOI":"10.1128\/cmr.00046-19","author":"MH Nguyen","year":"2020","unstructured":"Nguyen MH, Wong G, Gane E, Kao J-H, Dusheiko G. Hepatitis B virus: advances in prevention, diagnosis, and therapy. Clin Microbiol Rev. 2020. https:\/\/doi.org\/10.1128\/cmr.00046-19.","journal-title":"Clin Microbiol Rev"},{"key":"5763_CR14","doi-asserted-by":"publisher","first-page":"1560","DOI":"10.1002\/hep.29800","volume":"67","author":"NA Terrault","year":"2018","unstructured":"Terrault NA, Lok ASF, McMahon BJ, Chang K-M, Hwang JP, Jonas MM, Brown RS Jr, Bzowej NH, Wong JB. Update on prevention, diagnosis, and treatment of chronic hepatitis B: AASLD 2018 hepatitis B guidance. Hepatology. 2018;67:1560\u201399.","journal-title":"Hepatology"},{"key":"5763_CR15","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkaa1237","volume":"49","author":"R Rajaby","year":"2021","unstructured":"Rajaby R, Zhou Y, Meng Y, Zeng X, Li G, Wu P, Sung W-K. SurVirus: a repeat-aware virus integration caller. Nucleic Acids Res. 2021;49: e33.","journal-title":"Nucleic Acids Res"},{"key":"5763_CR16","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1093\/bioinformatics\/btw591","volume":"33","author":"Y Liang","year":"2017","unstructured":"Liang Y, Qiu K, Liao B, Zhu W, Huang X, Li L, Chen X, Li K. Seeksv: an accurate tool for somatic structural variation and virus integration detection. Bioinforma Oxf Engl. 2017;33:184\u201391.","journal-title":"Bioinforma Oxf Engl"},{"key":"5763_CR17","doi-asserted-by":"publisher","first-page":"R84","DOI":"10.1186\/gb-2014-15-6-r84","volume":"15","author":"RM Layer","year":"2014","unstructured":"Layer RM, Chiang C, Quinlan AR, Hall IM. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15:R84.","journal-title":"Genome Biol"},{"key":"5763_CR18","doi-asserted-by":"publisher","first-page":"2865","DOI":"10.1093\/bioinformatics\/btp394","volume":"25","author":"K Ye","year":"2009","unstructured":"Ye K, Schulz MH, Long Q, Apweiler R, Ning Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinforma Oxf Engl. 2009;25:2865\u201371.","journal-title":"Bioinforma Oxf Engl"},{"key":"5763_CR19","doi-asserted-by":"publisher","first-page":"5582","DOI":"10.1093\/bioinformatics\/btaa1081","volume":"36","author":"T Yun","year":"2021","unstructured":"Yun T, Li H, Chang P-C, Lin MF, Carroll A, McLean CY. Accurate, scalable cohort variant calls using DeepVariant and GLnexus. Bioinforma Oxf Engl. 2021;36:5582\u20139.","journal-title":"Bioinforma Oxf Engl"},{"key":"5763_CR20","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1016\/j.neucom.2021.05.103","volume":"470","author":"I Lauriola","year":"2022","unstructured":"Lauriola I, Lavelli A, Aiolli F. An introduction to deep learning in natural language processing: models, techniques, and tools. Neurocomputing. 2022;470:443\u201356.","journal-title":"Neurocomputing"},{"key":"5763_CR21","unstructured":"Entrez Sequences Help [Internet]. Bethesda (MD): National Center for Biotechnology Information (US); 2010-. Available from: https:\/\/www.ncbi.nlm.nih.gov\/books\/NBK44864\/"},{"issue":"16","key":"5763_CR22","doi-asserted-by":"publisher","first-page":"7762","DOI":"10.1093\/nar\/gkv784","volume":"43","author":"Y Chen","year":"2015","unstructured":"Chen Y, Ye W, Zhang Y, et al. High speed BLASTN: an accelerated MegaBLAST search tool. Nucleic Acids Res. 2015;43(16):7762\u20138.","journal-title":"Nucleic Acids Res"},{"key":"5763_CR23","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1186\/s13059-022-02607-z","volume":"23","author":"L Dressler","year":"2022","unstructured":"Dressler L, Bortolomeazzi M, Keddar MR, et al. Comparative assessment of genes driving cancer and somatic evolution in non-cancer tissues: an update of the Network of Cancer Genes (NCG) resource. Genome Biol. 2022;23:35.","journal-title":"Genome Biol"},{"key":"5763_CR24","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1016\/j.gpb.2018.07.003","volume":"16","author":"Q Lian","year":"2018","unstructured":"Lian Q, Wang S, Zhang G, Wang D, Luo G, Tang J, Chen L, Gu J. HCCDB: a database of hepatocellular carcinoma expression atlas. Genom Proteom Bioinf. 2018;16:269\u201375.","journal-title":"Genom Proteom Bioinf"},{"key":"5763_CR25","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1186\/s13046-015-0121-1","volume":"34","author":"X Xu","year":"2015","unstructured":"Xu X, Liu Z, Zhou L, Xie H, Cheng J, Ling Q, Wang J, Guo H, Wei X, Zheng S. Characterization of genome-wide TFCP2 targets in hepatocellular carcinoma: implication of targets FN1 and TJP1 in metastasis. J Exp Clin Cancer Res CR. 2015;34:6.","journal-title":"J Exp Clin Cancer Res CR"},{"key":"5763_CR26","doi-asserted-by":"publisher","first-page":"921","DOI":"10.1016\/j.jcmgh.2023.01.001","volume":"15","author":"S-H Yeh","year":"2023","unstructured":"Yeh S-H, Li C-L, Lin Y-Y, Ho M-C, Wang Y-C, Tseng S-T, Chen P-J. Hepatitis B virus DNA integration drives carcinogenesis and provides a new biomarker for HBV-related HCC. Cell Mol Gastroenterol Hepatol. 2023;15:921\u20139.","journal-title":"Cell Mol Gastroenterol Hepatol"},{"key":"5763_CR27","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1038\/ng.2295","volume":"44","author":"W-K Sung","year":"2012","unstructured":"Sung W-K, Zheng H, Li S, et al. Genome-wide survey of recurrent HBV integration in hepatocellular carcinoma. Nat Genet. 2012;44:765\u20139.","journal-title":"Nat Genet"},{"key":"5763_CR28","doi-asserted-by":"crossref","unstructured":"Zhao B W, Su X R, Hu P W, et al. iGRLDTI: an improved graph representation learning method for predicting drug\u2013target interactions over heterogeneous biological information network. Bioinformatics, 2023, 39(8): btad451.","DOI":"10.1093\/bioinformatics\/btad451"},{"key":"5763_CR29","doi-asserted-by":"crossref","unstructured":"Hu L, Yang Y, Tang Z, et al. FCAN-MOPSO: An improved fuzzy-based graph clustering algorithm for complex networks with multi-objective particle swarm optimization. IEEE Trans Fuzzy Syst 2023.","DOI":"10.1109\/TFUZZ.2023.3259726"},{"issue":"1","key":"5763_CR30","doi-asserted-by":"publisher","first-page":"188","DOI":"10.1186\/s12859-023-05309-w","volume":"24","author":"L Wong","year":"2023","unstructured":"Wong L, Wang L, You ZH, et al. GKLOMLI: a link prediction model for inferring miRNA\u2013lncRNA interactions by using Gaussian kernel-based method on network profile and linear optimization algorithm. BMC Bioinf. 2023;24(1):188.","journal-title":"BMC Bioinf"},{"key":"5763_CR31","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1002\/hep.30201","volume":"69","author":"C-L Li","year":"2019","unstructured":"Li C-L, Li C-Y, Lin Y-Y, Ho M-C, Chen D-S, Chen P-J, Yeh S-H. Androgen receptor enhances hepatic telomerase reverse transcriptase gene transcription after Hepatitis B virus integration or point mutation in promoter region. Hepatol Baltim Md. 2019;69:498\u2013512.","journal-title":"Hepatol Baltim Md"},{"key":"5763_CR32","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. https:\/\/doi.org\/10.48550\/arXiv.1810.04805","DOI":"10.48550\/arXiv.1810.04805"},{"key":"5763_CR33","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05763-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-024-05763-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05763-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,4]],"date-time":"2024-05-04T12:02:13Z","timestamp":1714824133000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-024-05763-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,4]]},"references-count":33,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["5763"],"URL":"https:\/\/doi.org\/10.1186\/s12859-024-05763-0","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,4]]},"assertion":[{"value":"22 September 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 March 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"177"}}