{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T05:29:55Z","timestamp":1774330195170,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T00:00:00Z","timestamp":1731974400000},"content-version":"vor","delay-in-days":18,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004047","name":"Karolinska Institute","doi-asserted-by":"publisher","award":["2020-01082"],"award-info":[{"award-number":["2020-01082"]}],"id":[{"id":"10.13039\/501100004047","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Repeat elements, such as transposable elements (TE), are highly repetitive DNA sequences that compose around 50% of the genome. TEs such as Alu, SVA, HERV, and L1 elements can cause disease through disrupting genes, causing frameshift mutations or altering splicing patters. These are elements challenging to characterize using short-read genome sequencing, due to its read length and TEs repetitive nature. Long-read genome sequencing (lrGS) enables bridging of TEs, allowing increased resolution across repetitive DNA sequences. lrGS therefore present an opportunity for improved TE detection and analysis not only from a research perspective but also for future clinical detection. When choosing an lrGS TE caller, parameters such as runtime, CPU hours, sensitivity, precision, and compatibility with inclusion into pipelines are crucial for efficient detection.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We therefore developed sTELLeR, (s) Transposable ELement in Long (e) Read, for accurate, fast, and effective TE detection. Particularly, sTELLeR exhibit higher precision and sensitivity for calling of Alu elements than similar tools. The caller is 5\u201348\u00d7 as fast and uses &amp;lt;2% of the CPU hours compared to competitive callers. The caller is haplotype aware and output results in a variant call format (VCF) file, enabling compatibility with other variant callers and downstream analysis.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>sTELLeR is a python-based tool and is available at https:\/\/github.com\/kristinebilgrav\/sTELLeR. Altogether, we show that sTELLeR is a fast, sensitive, and precise caller for detection of TE elements, and can easily be implemented into variant calling workflows.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae686","type":"journal-article","created":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T06:25:53Z","timestamp":1731997553000},"source":"Crossref","is-referenced-by-count":5,"title":["Detecting transposable elements in long-read genomes using sTELLeR"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7077-0279","authenticated-orcid":false,"given":"Kristine","family":"Bilgrav Saether","sequence":"first","affiliation":[{"name":"Department of Molecular Medicine and Surgery, Karolinska Institute , Stockholm 171 76,","place":["Sweden"]},{"name":"Clinical Genomics Facility, Science for Life Laboratory , Stockholm 171 76,","place":["Sweden"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3716-4917","authenticated-orcid":false,"given":"Jesper","family":"Eisfeldt","sequence":"additional","affiliation":[{"name":"Department of Molecular Medicine and Surgery, Karolinska Institute , Stockholm 171 76,","place":["Sweden"]},{"name":"Clinical Genomics Facility, Science for Life Laboratory , Stockholm 171 76,","place":["Sweden"]},{"name":"Department of Clinical Genetics and Genomics, Karolinska University Hospital , Stockholm 171 77,","place":["Sweden"]}]}],"member":"286","published-online":{"date-parts":[[2024,11,18]]},"reference":[{"key":"2024112717340409900_btae686-B1","doi-asserted-by":"crossref","first-page":"610","DOI":"10.3390\/cancers12030610","article-title":"Human endogenous retroviruses (HERVs): shaping the innate immune response in cancers","volume":"12","author":"Alcazer","year":"2020","journal-title":"Cancers (Basel)"},{"key":"2024112717340409900_btae686-B2","doi-asserted-by":"crossref","first-page":"12507","DOI":"10.1128\/JVI.79.19.12507-12514.2005","article-title":"Genomewide screening reveals high levels of insertional polymorphism in the human endogenous retrovirus family HERV-K(HML2): implications for present-day activity","volume":"79","author":"Belshaw","year":"2005","journal-title":"J Virol"},{"key":"2024112717340409900_btae686-B3","doi-asserted-by":"crossref","first-page":"e0289346","DOI":"10.1371\/journal.pone.0289346","article-title":"Transposable element insertions in 1000 Swedish individuals","volume":"18","author":"Bilgrav Saether","year":"2023","journal-title":"PLoS One"},{"key":"2024112717340409900_btae686-B4","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1186\/s13100-021-00256-w","article-title":"Whole-genome analysis reveals the contribution of non-coding de novo transposon insertions to autism spectrum disorder","volume":"12","author":"Borges-Monroy","year":"2021","journal-title":"Mob DNA"},{"key":"2024112717340409900_btae686-B5","doi-asserted-by":"crossref","first-page":"2551","DOI":"10.3390\/ijms23052551","article-title":"Transposable elements and human diseases: mechanisms and implication in the response to environmental pollutants","volume":"23","author":"Chenais","year":"2022","journal-title":"Int J Mol Sci"},{"key":"2024112717340409900_btae686-B6","doi-asserted-by":"crossref","first-page":"3836","DOI":"10.1038\/s41467-021-24041-8","article-title":"Comprehensive identification of transposable element insertions using multiple sequencing technologies","volume":"12","author":"Chu","year":"2021","journal-title":"Nat Commun"},{"key":"2024112717340409900_btae686-B7","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","article-title":"A structural variation reference for medical and population genetics","volume":"581","author":"Collins","year":"2020","journal-title":"Nature"},{"key":"2024112717340409900_btae686-B8","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1038\/nbt.3820","article-title":"Nextflow enables reproducible computational workflows","volume":"35","author":"Di Tommaso","year":"2017","journal-title":"Nat Biotechnol"},{"key":"2024112717340409900_btae686-B9","doi-asserted-by":"crossref","first-page":"1262","DOI":"10.1101\/gr.106419.110","article-title":"High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes","volume":"20","author":"Ewing","year":"2010","journal-title":"Genome Res"},{"key":"2024112717340409900_btae686-B10","doi-asserted-by":"crossref","first-page":"915","DOI":"10.1016\/j.molcel.2020.10.024","article-title":"Nanopore sequencing enables comprehensive transposable element epigenomic profiling","volume":"80","author":"Ewing","year":"2020","journal-title":"Mol Cell"},{"key":"2024112717340409900_btae686-B11","doi-asserted-by":"crossref","first-page":"1567","DOI":"10.1101\/gr.247965.118","article-title":"Pedigree-based estimation of human mobile element retrotransposition rates","volume":"29","author":"Feusier","year":"2019","journal-title":"Genome Res"},{"key":"2024112717340409900_btae686-B12","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1080\/1040841X.2018.1501345","article-title":"Human endogenous retrovirus-K (HML-2): a comprehensive review","volume":"44","author":"Garcia-Montojo","year":"2018","journal-title":"Crit Rev Microbiol"},{"key":"2024112717340409900_btae686-B13","doi-asserted-by":"crossref","first-page":"1916","DOI":"10.1101\/gr.218032.116","article-title":"The mobile element locator tool (MELT): population-scale mobile element discovery and biology","volume":"27","author":"Gardner","year":"2017","journal-title":"Genome Res"},{"key":"2024112717340409900_btae686-B14","doi-asserted-by":"crossref","first-page":"e124","DOI":"10.1093\/nar\/gkac794","article-title":"Local assembly of long reads enables phylogenomics of transposable elements in a polyploid cell line","volume":"50","author":"Han","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2024112717340409900_btae686-B15","doi-asserted-by":"crossref","first-page":"5519","DOI":"10.1093\/bioinformatics\/btaa1034","article-title":"SVIM-asm: structural variant detection from haploid and diploid genome assemblies","volume":"36","author":"Heller","year":"2021","journal-title":"Bioinformatics"},{"key":"2024112717340409900_btae686-B16","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1093\/bioinformatics\/bts697","article-title":"RetroSeq: transposable element discovery from next-generation sequencing data","volume":"29","author":"Keane","year":"2013","journal-title":"Bioinformatics"},{"key":"2024112717340409900_btae686-B17","doi-asserted-by":"crossref","first-page":"1644","DOI":"10.1056\/NEJMoa1813279","article-title":"Patient-customized oligonucleotide therapy for a rare genetic disease","volume":"381","author":"Kim","year":"2019","journal-title":"N Engl J Med"},{"key":"2024112717340409900_btae686-B18","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2024112717340409900_btae686-B19","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2024112717340409900_btae686-B20","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1038\/s41586-023-05896-x","article-title":"A draft human pangenome reference","volume":"617","author":"Liao","year":"2023","journal-title":"Nature"},{"key":"2024112717340409900_btae686-B21","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1038\/s41576-020-0236-x","article-title":"Long-read human genome sequencing and its applications","volume":"21","author":"Logsdon","year":"2020","journal-title":"Nat Rev Genet"},{"key":"2024112717340409900_btae686-B22","doi-asserted-by":"crossref","first-page":"3586","DOI":"10.1038\/s41467-021-23918-y","article-title":"Cas9 targeted enrichment of mobile elements using nanopore sequencing","volume":"12","author":"McDonald","year":"2021","journal-title":"Nat Commun"},{"key":"2024112717340409900_btae686-B23","doi-asserted-by":"crossref","first-page":"33","DOI":"10.12688\/f1000research.29032.2","article-title":"Sustainable data analysis with Snakemake","volume":"10","author":"M\u00f6lder","year":"2021","journal-title":"F1000Res"},{"key":"2024112717340409900_btae686-B24","doi-asserted-by":"crossref","first-page":"lqac092","DOI":"10.1093\/nargab\/lqac092","article-title":"PBSIM3: a simulator for all types of PacBio and ONT long reads","volume":"4","author":"Ono","year":"2022","journal-title":"NAR Genom Bioinform"},{"key":"2024112717340409900_btae686-B25","first-page":"908","article-title":"Benchmarking computational tools for polymorphic transposable element detection","volume":"18","author":"Rishishwar","year":"2017","journal-title":"Brief Bioinform"},{"key":"2024112717340409900_btae686-B26","doi-asserted-by":"crossref","first-page":"D20","DOI":"10.1093\/nar\/gkab1112","article-title":"Database resources of the national center for biotechnology information","volume":"50","author":"Sayers","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2024112717340409900_btae686-B27","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1038\/s41587-020-0503-6","article-title":"Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes","volume":"38","author":"Shafin","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2024112717340409900_btae686-B28","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/j.pbi.2019.12.012","article-title":"The current revolution in transposable element biology enabled by long reads","volume":"54","author":"Shahid","year":"2020","journal-title":"Curr Opin Plant Biol"},{"key":"2024112717340409900_btae686-B29","author":"Smit","year":"2013"},{"key":"2024112717340409900_btae686-B30","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/gm311","article-title":"Mobile elements in the human genome: implications for disease","volume":"4","author":"Solyom","year":"2012","journal-title":"Genome Med"},{"key":"2024112717340409900_btae686-B31","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/nature15394","article-title":"An integrated map of structural variation in 2,504 human genomes","volume":"526","author":"Sudmant","year":"2015","journal-title":"Nature"},{"key":"2024112717340409900_btae686-B32","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/s13100-019-0197-9","article-title":"A benchmark of transposon insertion detection tools using real data","volume":"10","author":"Vendrell-Mir","year":"2019","journal-title":"Mob DNA"},{"key":"2024112717340409900_btae686-B33","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1038\/s41586-022-04601-8","article-title":"The Human Pangenome Project: a global resource to map genomic diversity","volume":"604","author":"Wang","year":"2022","journal-title":"Nature"},{"key":"2024112717340409900_btae686-B34","doi-asserted-by":"crossref","first-page":"1690","DOI":"10.3389\/fmicb.2020.01690","article-title":"Human endogenous retrovirus K (HML-2) in health and disease","volume":"11","author":"Xue","year":"2020","journal-title":"Front Microbiol"},{"key":"2024112717340409900_btae686-B35","doi-asserted-by":"crossref","first-page":"160025","DOI":"10.1038\/sdata.2016.25","article-title":"Extensive sequencing of seven human genomes to characterize benchmark reference materials","volume":"3","author":"Zook","year":"2016","journal-title":"Sci Data"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae686\/60744083\/btae686.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/11\/btae686\/60833209\/btae686.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/11\/btae686\/60833209\/btae686.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,27]],"date-time":"2024-11-27T17:34:20Z","timestamp":1732728860000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae686\/7903282"}},"subtitle":[],"editor":[{"given":"Peter","family":"Robinson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,11,1]]},"references-count":35,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2024,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae686","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,11]]},"published":{"date-parts":[[2024,11,1]]},"article-number":"btae686"}}