{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T06:22:39Z","timestamp":1772864559813,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"23","license":[{"start":{"date-parts":[[2021,7,13]],"date-time":"2021-07-13T00:00:00Z","timestamp":1626134400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Nigerian Government"},{"name":"NEEDS Assessment Scholarship"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Probabilistic Identification of bacterial essential genes using transposon-directed insertion-site sequencing (TraDIS) data based on Tn5 libraries has received relatively little attention in the literature; most methods are designed for mariner transposon insertions. Analysis of Tn5 transposon-based genomic data is challenging due to the high insertion density and genomic resolution. We present a novel probabilistic Bayesian approach for classifying bacterial essential genes using transposon insertion density derived from transposon insertion sequencing data. We implement a Markov chain Monte Carlo sampling procedure to estimate the posterior probability that any given gene is essential. We implement a Bayesian decision theory approach to selecting essential genes. We assess the effectiveness of our approach via analysis of both simulated data and three previously published Escherichia coli, Salmonella Typhimurium and Staphylococcus aureus datasets. These three bacteria have relatively well characterized essential genes which allows us to test our classification procedure using receiver operating characteristic curves and area under the curves. We compare the classification performance with that of Bio-Tradis, a standard tool for bacterial gene classification.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Our method is able to classify genes in the three datasets with areas under the curves between 0.967 and 0.983. Our simulated synthetic datasets show that both the number of insertions and the extent to which insertions are tolerated in the distal regions of essential genes are both important in determining classification accuracy. Importantly our method gives the user the option of classifying essential genes based on the user-supplied costs of false discovery and false non-discovery.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>An R package that implements the method presented in this paper is available for download from https:\/\/github.com\/Kevin-walters\/insdens.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab508","type":"journal-article","created":{"date-parts":[[2021,7,9]],"date-time":"2021-07-09T11:17:45Z","timestamp":1625829465000},"page":"4343-4349","source":"Crossref","is-referenced-by-count":3,"title":["Probabilistic identification of bacterial essential genes via insertion density using TraDIS data with Tn5 libraries"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2884-8244","authenticated-orcid":false,"given":"Valentine U","family":"Nlebedim","sequence":"first","affiliation":[{"name":"Department of Statistics, School of Mathematics, University of Leeds , LS2 9JT, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5037-2695","authenticated-orcid":false,"given":"Roy R","family":"Chaudhuri","sequence":"additional","affiliation":[{"name":"Department of Molecular Biology and Biotechnology, University of Sheffield , Sheffield S10 2TN, UK"}]},{"given":"Kevin","family":"Walters","sequence":"additional","affiliation":[{"name":"School of Mathematics and Statistics, University of Sheffield , Sheffield, S10 2TN, UK"}]}],"member":"286","published-online":{"date-parts":[[2021,7,13]]},"reference":[{"key":"2023061310474384000_btab508-B1","doi-asserted-by":"crossref","first-page":"690","DOI":"10.1002\/gepi.22213","article-title":"Bayesian variable selection using partially observed categorical prior information in fine mapping association studies","volume":"43","author":"Alenazi","year":"2019","journal-title":"Genet. Epidemiol"},{"key":"2023061310474384000_btab508-B2","doi-asserted-by":"crossref","DOI":"10.1038\/msb4100050","article-title":"Construction of Escherichia coli k-12 in-frame, single-gene knockout mutants: the Keio collection","volume":"2,","author":"Baba","year":"2006","journal-title":"Mol. Syst. Biol"},{"key":"2023061310474384000_btab508-B3","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.4161\/rna.24765","article-title":"Approaches to querying bacterial genomes with transposon-insertion sequencing","volume":"10","author":"Barquist","year":"2013","journal-title":"RNA Biol"},{"key":"2023061310474384000_btab508-B4","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1093\/bioinformatics\/btw022","article-title":"The tradis toolkit: sequencing and analysis for dense transposon mutant libraries","volume":"32","author":"Barquist","year":"2016","journal-title":"Bioinformatics"},{"key":"2023061310474384000_btab508-B5","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1002\/gepi.21961","article-title":"equips: eqtl analysis using informed partitioning of snps\u2013a fully Bayesian approach","volume":"40","author":"Boggis","year":"2016","journal-title":"Genet. Epidemiol"},{"key":"2023061310474384000_btab508-B6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-018-1577-z","article-title":"Ten things you should know about transposable elements","volume":"19","author":"Bourque","year":"2018","journal-title":"Genome Biol"},{"key":"2023061310474384000_btab508-B7","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1038\/nrmicro.2015.7","article-title":"The design and analysis of transposon insertion sequencing experiments","volume":"14","author":"Chao","year":"2016","journal-title":"Nat. Rev. Microbiol"},{"key":"2023061310474384000_btab508-B8","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1186\/1471-2164-10-291","article-title":"Comprehensive identification of essential Staphylococcus aureus genes using transposon-mediated differential hybridisation (tmdh)","volume":"10","author":"Chaudhuri","year":"2009","journal-title":"BMC Genomics"},{"key":"2023061310474384000_btab508-B9","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1038\/msb.2011.58","article-title":"The essential genome of a bacterium","volume":"7","author":"Christen","year":"2011","journal-title":"Mol. Syst. Biol"},{"key":"2023061310474384000_btab508-B10","doi-asserted-by":"crossref","first-page":"e89018","DOI":"10.1371\/journal.pone.0089018","article-title":"Genome-wide high-throughput screening to investigate essential genes involved in methicillin-resistant Staphylococcus aureus sequence type 398 survival","volume":"9","author":"Christiansen","year":"2014","journal-title":"PLoS One"},{"key":"2023061310474384000_btab508-B11","doi-asserted-by":"crossref","first-page":"e1004401","DOI":"10.1371\/journal.pcbi.1004401","article-title":"Transit-a software tool for himar1 tnseq analysis","volume":"11","author":"DeJesus","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023061310474384000_btab508-B12","doi-asserted-by":"crossref","first-page":"e00537","DOI":"10.1128\/mBio.00537-12","article-title":"A genetic resource for rapid and comprehensive phenotype screening of non-essential Staphylococcus aureus genes","volume":"4","author":"Fey","year":"2013","journal-title":"MBio"},{"key":"2023061310474384000_btab508-B13","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1093\/molbev\/msg017","article-title":"The temporal distribution of gene duplication events in a set of highly conserved human gene families","volume":"20","author":"Friedman","year":"2003","journal-title":"Mol. Biol. Evol"},{"key":"2023061310474384000_btab508-B14","doi-asserted-by":"crossref","first-page":"16422","DOI":"10.1073\/pnas.0906627106","article-title":"Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for haemophilus genes required in the lung","volume":"106","author":"Gawronski","year":"2009","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023061310474384000_btab508-B15","doi-asserted-by":"crossref","first-page":"e02096","DOI":"10.1128\/mBio.02096-17","article-title":"The essential genome of Escherichia coli k-12","volume":"9","author":"Goodall","year":"2018","journal-title":"MBio"},{"key":"2023061310474384000_btab508-B16","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1016\/j.chom.2009.08.003","article-title":"Identifying genetic determinants needed to establish a human gut symbiont in its habitat","volume":"6","author":"Goodman","year":"2009","journal-title":"Cell Host Microbe"},{"key":"2023061310474384000_btab508-B17","doi-asserted-by":"crossref","first-page":"12758","DOI":"10.1021\/acs.chemrev.6b00003","article-title":"DNA transposition at work","volume":"116","author":"Hickman","year":"2016","journal-title":"Chem. Rev"},{"key":"2023061310474384000_btab508-B18","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1016\/S0966-842X(00)01865-5","article-title":"Transposon-based approaches to identify essential bacterial genes","volume":"8","author":"Judson","year":"2000","journal-title":"Trends Microbiol"},{"key":"2023061310474384000_btab508-B19","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1186\/s12864-018-4986-1","article-title":"Iron-dependent essential genes in Salmonella typhimurium","volume":"19","author":"Karash","year":"2018","journal-title":"BMC Genomics"},{"key":"2023061310474384000_btab508-B20","doi-asserted-by":"crossref","first-page":"e01351","DOI":"10.1128\/mBio.01351-16","article-title":"The nucleoid binding protein h-ns biases genome-wide transposon insertion landscapes","volume":"7","author":"Kimura","year":"2016","journal-title":"MBio"},{"key":"2023061310474384000_btab508-B21","doi-asserted-by":"crossref","first-page":"e1000976","DOI":"10.1371\/journal.pcbi.1000976","article-title":"The mycobacterium tuberculosis drugome and its polypharmacological implications","volume":"6","author":"Kinnings","year":"2010","journal-title":"PLoS Comput. Biol"},{"key":"2023061310474384000_btab508-B22","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1186\/1471-2164-13-578","article-title":"Identification of essential genes of the periodontal pathogen Porphyromonas gingivalis","volume":"13","author":"Klein","year":"2012","journal-title":"BMC Genom"},{"key":"2023061310474384000_btab508-B23","doi-asserted-by":"crossref","first-page":"2308","DOI":"10.1101\/gr.097097.109","article-title":"Simultaneous assay of every salmonella typhi gene using one million transposon mutants","volume":"19","author":"Langridge","year":"2009","journal-title":"Genome Res"},{"key":"2023061310474384000_btab508-B24","author":"Lariviere","year":"2020"},{"key":"2023061310474384000_btab508-B25","doi-asserted-by":"crossref","first-page":"9838","DOI":"10.1038\/srep09838","article-title":"Essential genes in the core genome of the human pathogen Streptococcus pyogenes","volume":"5","author":"Le Breton","year":"2015","journal-title":"Sci. Rep"},{"key":"2023061310474384000_btab508-B26","first-page":"1303","author":"Li","year":"2013"},{"key":"2023061310474384000_btab508-B27","doi-asserted-by":"crossref","first-page":"10","DOI":"10.14806\/ej.17.1.200","article-title":"Cutadapt removes adapter sequences from high-throughput sequencing reads","volume":"17","author":"Martin","year":"2011","journal-title":"EMBnet J"},{"key":"2023061310474384000_btab508-B28","first-page":"207384","article-title":"Systematic identification of essential genes by in vitro transposon mutagenesis","volume":"6","author":"Mekalanos","year":"2001","journal-title":"US Patent"},{"key":"2023061310474384000_btab508-B29","doi-asserted-by":"crossref","first-page":"e1007980","DOI":"10.1371\/journal.pcbi.1007980","article-title":"Albatradis: comparative analysis of large datasets from parallel transposon mutagenesis experiments","volume":"16","author":"Page","year":"2020","journal-title":"PLoS Comput. Biol"},{"key":"2023061310474384000_btab508-B30","doi-asserted-by":"crossref","first-page":"2331","DOI":"10.3389\/fmicb.2017.02331","article-title":"A comprehensive overview of online resources to identify and predict bacterial essential genes","volume":"8","author":"Peng","year":"2017","journal-title":"Front. Microbiol"},{"key":"2023061310474384000_btab508-B31","doi-asserted-by":"crossref","first-page":"e99820","DOI":"10.1371\/journal.pone.0099820","article-title":"Defined single-gene and multi-gene deletion mutant collections in salmonella enterica sv typhimurium","volume":"9","author":"Porwollik","year":"2014","journal-title":"PLoS One"},{"key":"2023061310474384000_btab508-B32","doi-asserted-by":"crossref","first-page":"e1004782","DOI":"10.1371\/journal.pgen.1004782","article-title":"Artist: high-resolution genome-wide assessment of fitness using transposon-insertion sequencing","volume":"10","author":"Pritchard","year":"2014","journal-title":"PLoS Genet"},{"key":"2023061310474384000_btab508-B33","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"Quinlan ar, hall im. bedtools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023061310474384000_btab508-B34","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1093\/bib\/bbt086","article-title":"Comparison of software packages for detecting differential expression in RNA-seq studies","volume":"16","author":"Seyednasrollah","year":"2015","journal-title":"Brief. Bioinform"},{"key":"2023061310474384000_btab508-B35","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1002\/gepi.21956","article-title":"Incorporating functional genomic information in genetic association studies using an empirical Bayes approach","volume":"40","author":"Spencer","year":"2016","journal-title":"Genet. Epidemiol"},{"key":"2023061310474384000_btab508-B36","doi-asserted-by":"crossref","DOI":"10.1128\/JCM.01405-18","article-title":"Genome-based prediction of bacterial antibiotic resistance","volume":"57","author":"Su","year":"2019","journal-title":"J. Clin. Microbiol"},{"key":"2023061310474384000_btab508-B37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-020-62287-2","article-title":"Genome-wide identification of essential genes in mycobacterium intracellulare by transposon sequencing\u2014implication for metabolic remodelling","volume":"10","author":"Tateishi","year":"2020","journal-title":"Sci. Rep"},{"key":"2023061310474384000_btab508-B38","doi-asserted-by":"crossref","first-page":"767","DOI":"10.1038\/nmeth.1377","article-title":"Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms","volume":"6","author":"Van Opijnen","year":"2009","journal-title":"Nat. Methods"},{"key":"2023061310474384000_btab508-B39","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1038\/nrmicro3033","article-title":"Transposon insertion sequencing: a new tool for systems-level analysis of microorganisms","volume":"11","author":"Van Opijnen","year":"2013","journal-title":"Nat. Rev. Microbiol"},{"key":"2023061310474384000_btab508-B40","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1002\/gepi.22375","article-title":"The utility of the laplace effect size prior distribution in Bayesian fine-mapping studies","volume":"45","author":"Walters","year":"2021","journal-title":"Genet. Epidemiol"},{"key":"2023061310474384000_btab508-B41","doi-asserted-by":"crossref","first-page":"41923","DOI":"10.1038\/srep41923","article-title":"A noise trimming and positional significance of transposon insertion system to identify essential genes in Yersinia pestis","volume":"7","author":"Yang","year":"2017","journal-title":"Sci. Rep"},{"key":"2023061310474384000_btab508-B42","doi-asserted-by":"crossref","first-page":"e43012","DOI":"10.1371\/journal.pone.0043012","article-title":"Essentials: software for rapid analysis of high throughput transposon insertion sequencing data","volume":"7","author":"Zomer","year":"2012","journal-title":"PLoS One"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab508\/40357598\/btab508.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/23\/4343\/50578886\/btab508.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/23\/4343\/50578886\/btab508.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,5]],"date-time":"2023-11-05T21:18:28Z","timestamp":1699219108000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/23\/4343\/6320781"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,7,13]]},"references-count":42,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2021,12,7]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab508","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,12,1]]},"published":{"date-parts":[[2021,7,13]]}}}