{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T08:55:46Z","timestamp":1768985746540,"version":"3.49.0"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2018,5,2]],"date-time":"2018-05-02T00:00:00Z","timestamp":1525219200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene prediction, very few studies have been conducted to quantitatively characterize ILDs of various species.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We developed a set of computer programs (fitild, compild, etc.) to build statistical models of ILDs and compare them with one another. Each ILD of more than 1000 genomes was fitted with fitild to a statistical model consisting of one, two, or three components of Frechet distributions. Several measures of distances between ILDs were calculated by compild. A theoretical model was presented to better understand the origin of the observed shape of an ILD.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The C++\u2009source codes are available at https:\/\/github.com\/ogotoh\/fitild.git\/.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty353","type":"journal-article","created":{"date-parts":[[2018,5,1]],"date-time":"2018-05-01T11:09:02Z","timestamp":1525172942000},"page":"3258-3264","source":"Crossref","is-referenced-by-count":8,"title":["Modeling one thousand intron length distributions with fitild"],"prefix":"10.1093","volume":"34","author":[{"given":"Osamu","family":"Gotoh","sequence":"first","affiliation":[{"name":"Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), AIST Tokyo Waterfront Bio-IT Research Building, Koto-ku, Tokyo, Japan"},{"name":"Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, Japan"}]}],"member":"286","published-online":{"date-parts":[[2018,5,2]]},"reference":[{"key":"2023012712492502600_bty353-B1","first-page":"267","volume-title":"Second International Symposium on Information Theory","author":"Akaike","year":"1973"},{"key":"2023012712492502600_bty353-B2","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1038\/sj.hdy.6800791","article-title":"The rise and falls of introns","volume":"96","author":"Belshaw","year":"2006","journal-title":"Heredity (Edinb)"},{"key":"2023012712492502600_bty353-B3","doi-asserted-by":"crossref","first-page":"2411","DOI":"10.1074\/jbc.270.6.2411","article-title":"Exon recognition in vertebrate splicing","volume":"270","author":"Berget","year":"1995","journal-title":"J. Biol. Chem"},{"key":"2023012712492502600_bty353-B4","doi-asserted-by":"crossref","first-page":"e0161476","DOI":"10.1371\/journal.pone.0161476","article-title":"Evolution of the exon-intron structure in ciliate genomes","volume":"11","author":"Bondarenko","year":"2016","journal-title":"PLoS One"},{"key":"2023012712492502600_bty353-B5","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1006\/jmbi.1997.0951","article-title":"Prediction of complete gene structures in human genomic DNA","volume":"268","author":"Burge","year":"1997","journal-title":"J. Mol. Biol"},{"key":"2023012712492502600_bty353-B6","doi-asserted-by":"crossref","first-page":"5094","DOI":"10.1073\/pnas.95.9.5094","article-title":"Toward a resolution of the introns early\/late debate: only phase zero introns are correlated with the structure of ancient proteins","volume":"95","author":"de Souza","year":"1998","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012712492502600_bty353-B7","doi-asserted-by":"crossref","first-page":"D136","DOI":"10.1093\/nar\/gkr1178","article-title":"The NCBI Taxonomy database","volume":"40","author":"Federhen","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023012712492502600_bty353-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1086\/284325","article-title":"Phylogenies and the comparative method","volume":"125","author":"Felsenstein","year":"1985","journal-title":"Am. Nat"},{"key":"2023012712492502600_bty353-B9","doi-asserted-by":"crossref","first-page":"2693","DOI":"10.1002\/bip.1977.360161209","article-title":"Theory of DNA melting curves","volume":"16","author":"Fixman","year":"1977","journal-title":"Biopolymers"},{"key":"2023012712492502600_bty353-B10","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1101\/gr.119834.110","article-title":"Changes in exon-intron structure during vertebrate evolution affect the splicing pattern of exons","volume":"22","author":"Gelfman","year":"2012","journal-title":"Genome Res"},{"key":"2023012712492502600_bty353-B11","doi-asserted-by":"crossref","first-page":"1447","DOI":"10.1093\/oxfordjournals.molbev.a025872","article-title":"Divergent structures of Caenorhabditis elegans cytochrome P450 genes suggest the frequent loss and gain of introns during the evolution of nematodes","volume":"15","author":"Gotoh","year":"1998","journal-title":"Mol. Biol. Evol"},{"key":"2023012712492502600_bty353-B12","doi-asserted-by":"crossref","first-page":"2438","DOI":"10.1093\/bioinformatics\/btn460","article-title":"Direct mapping and alignment of protein sequences onto genomic sequence","volume":"24","author":"Gotoh","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012712492502600_bty353-B13","doi-asserted-by":"crossref","first-page":"2630","DOI":"10.1093\/nar\/gkn105","article-title":"A space-efficient and accurate method for mapping and aligning cDNA sequences onto genomic sequence","volume":"36","author":"Gotoh","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023012712492502600_bty353-B14","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198546412.001.0001","volume-title":"The Comparative Method in Evolutionary Biology","author":"Harvey","year":"1991"},{"key":"2023012712492502600_bty353-B15","doi-asserted-by":"crossref","first-page":"9893","DOI":"10.1093\/nar\/16.21.9893","article-title":"A survey on intron and exon lengths","volume":"16","author":"Hawkins","year":"1988","journal-title":"Nucleic Acids Res"},{"key":"2023012712492502600_bty353-B16","doi-asserted-by":"crossref","first-page":"2392","DOI":"10.1093\/molbev\/msl111","article-title":"Intron size, abundance, and distribution within untranslated regions of genes","volume":"23","author":"Hong","year":"2006","journal-title":"Mol. Biol. Evol"},{"key":"2023012712492502600_bty353-B17","doi-asserted-by":"crossref","first-page":"391.","DOI":"10.1038\/377391a0","article-title":"Small genomes for better flyers","volume":"377","author":"Hughes","year":"1995","journal-title":"Nature"},{"key":"2023012712492502600_bty353-B18","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1093\/molbev\/msm220","article-title":"Complex selection on intron size in Cryptococcus neoformans","volume":"25","author":"Hughes","year":"2008","journal-title":"Mol. Biol. Evol"},{"key":"2023012712492502600_bty353-B19","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1186\/1471-2164-12-45","article-title":"Comparative analysis of information contents relevant to recognition of introns in many species","volume":"12","author":"Iwata","year":"2011","journal-title":"BMC Genomics"},{"key":"2023012712492502600_bty353-B20","doi-asserted-by":"crossref","first-page":"59.","DOI":"10.1186\/1471-2105-5-59","article-title":"Gene finding in novel genomes","volume":"5","author":"Korf","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012712492502600_bty353-B21","doi-asserted-by":"crossref","DOI":"10.1142\/p191","volume-title":"Extreme Value Distributions: Theory and Applications","author":"Kotz","year":"2000"},{"key":"2023012712492502600_bty353-B22","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1128\/EC.3.5.1088-1100.2004","article-title":"Introns and splicing elements of five diverse fungi","volume":"3","author":"Kupfer","year":"2004","journal-title":"Eukaryot. Cell"},{"key":"2023012712492502600_bty353-B23","doi-asserted-by":"crossref","first-page":"11193","DOI":"10.1073\/pnas.201407298","article-title":"A computational analysis of sequence features involved in recognition of short introns","volume":"98","author":"Lim","year":"2001","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012712492502600_bty353-B24","doi-asserted-by":"crossref","first-page":"6494","DOI":"10.1093\/nar\/gki937","article-title":"Gene identification in novel eukaryotic genomes by self-training algorithm","volume":"33","author":"Lomsadze","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023012712492502600_bty353-B25","doi-asserted-by":"crossref","first-page":"770","DOI":"10.1093\/oxfordjournals.molbev.a025980","article-title":"Genome size and intron size in Drosophila","volume":"15","author":"Moriyama","year":"1998","journal-title":"Mol. Biol. Evol"},{"key":"2023012712492502600_bty353-B26","doi-asserted-by":"crossref","first-page":"4255","DOI":"10.1093\/nar\/20.16.4255","article-title":"Splicing signals in Drosophila: intron size, information content, and consensus sequences","volume":"20","author":"Mount","year":"1992","journal-title":"Nucleic Acids Res"},{"key":"2023012712492502600_bty353-B27","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1093\/comjnl\/7.4.308","article-title":"A simplex method for function minimization","volume":"7","author":"Nelder","year":"1965","journal-title":"Computer J"},{"key":"2023012712492502600_bty353-B28","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1016\/0014-5793(96)00636-9","article-title":"The size differences among mammalian introns are due to the accumulation of small deletions","volume":"390","author":"Ogata","year":"1996","journal-title":"FEBS Lett"},{"key":"2023012712492502600_bty353-B29","doi-asserted-by":"crossref","first-page":"1859","DOI":"10.1002\/bip.1974.360130916","article-title":"Recursion relation generation of probability profiles for specific-sequence macromolecules with long-range correlations","volume":"13","author":"Poland","year":"1974","journal-title":"Biopolymers"},{"key":"2023012712492502600_bty353-B30","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden Markov models and selected applications in speech recognition","volume":"77","author":"Rabiner","year":"1989","journal-title":"Proc. IEEE"},{"key":"2023012712492502600_bty353-B31","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1080\/01621459.1971.10482356","article-title":"Objective criteria for the evaluation of clustering methods","volume":"66","author":"Rand","year":"1971","journal-title":"J. Am. Stat. Assoc"},{"key":"2023012712492502600_bty353-B32","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1101\/gr.10.4.529","article-title":"Genie\u2014Gene finding in Drosophila melanogaster","volume":"10","author":"Reese","year":"2000","journal-title":"Genome Res"},{"key":"2023012712492502600_bty353-B33","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1146\/annurev.genet.40.110405.090625","article-title":"Origins and evolution of spliceosomal introns","volume":"40","author":"Rodr\u00edguez-Trelles","year":"2006","journal-title":"Annu. Rev. Genet"},{"key":"2023012712492502600_bty353-B34","doi-asserted-by":"crossref","first-page":"11.","DOI":"10.1186\/1745-6150-7-11","article-title":"Origin and evolution of spliceosomal introns","volume":"7","author":"Rogozin","year":"2012","journal-title":"Biol. Direct"},{"key":"2023012712492502600_bty353-B35","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.tig.2008.11.004","article-title":"Mystery of intron gain: new data and new models","volume":"25","author":"Roy","year":"2009","journal-title":"Trends Genet"},{"key":"2023012712492502600_bty353-B36","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1101\/gr.10.4.516","article-title":"Ab initio gene finding in Drosophila genomic DNA","volume":"10","author":"Salamov","year":"2000","journal-title":"Genome Res"},{"key":"2023012712492502600_bty353-B37","first-page":"461","article-title":"Estimating the dimension of a model","volume":"6","author":"Schwarz","year":"1978","journal-title":"Anal. Stat"},{"key":"2023012712492502600_bty353-B38","doi-asserted-by":"crossref","first-page":"1699","DOI":"10.1093\/molbev\/msp081","article-title":"Evolution of ultrasmall spliceosomal introns in highly reduced nuclear genomes","volume":"26","author":"Slamovits","year":"2009","journal-title":"Mol. Biol. Evol"},{"key":"2023012712492502600_bty353-B39","doi-asserted-by":"crossref","first-page":"ii215","DOI":"10.1093\/bioinformatics\/btg1080","article-title":"Gene prediction with a hidden Markov model and a new intron submodel","volume":"19","author":"Stanke","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012712492502600_bty353-B40","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1126\/science.8023140","article-title":"Testing the exon theory of genes: the evidence from protein structure","volume":"265","author":"Stoltzfus","year":"1994","journal-title":"Science"},{"key":"2023012712492502600_bty353-B41","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature12031","article-title":"The genomes of four tapeworm species reveal adaptations to parasitism","volume":"496","author":"Tsai","year":"2013","journal-title":"Nature"},{"key":"2023012712492502600_bty353-B42","doi-asserted-by":"crossref","first-page":"1260","DOI":"10.1016\/j.cub.2012.05.011","article-title":"Birth of new spliceosomal introns in fungi by multiplication of introner-like elements","volume":"22","author":"van der Burgt","year":"2012","journal-title":"Curr. Biol"},{"key":"2023012712492502600_bty353-B43","doi-asserted-by":"crossref","first-page":"e24.","DOI":"10.1371\/journal.pgen.0020024","article-title":"SPA: a probabilistic algorithm for spliced alignment","volume":"2","author":"van Nimwegen","year":"2006","journal-title":"PLoS Genet"},{"key":"2023012712492502600_bty353-B44","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1007\/PL00006561","article-title":"Intron-genome size relationship on a large evolutionary scale","volume":"49","author":"Vinogradov","year":"1999","journal-title":"J. Mol. Evol"},{"key":"2023012712492502600_bty353-B45","first-page":"27","article-title":"Genome size and metabolic intensity in tetrapods: a tale of two lines","volume":"273","author":"Vinogradov","year":"2006","journal-title":"Proc. Biol. Sci"},{"key":"2023012712492502600_bty353-B46","doi-asserted-by":"crossref","first-page":"12845.","DOI":"10.1038\/ncomms12845","article-title":"Comparative genomics reveals adaptive evolution of Asian tapeworm in switching to a new intermediate host","volume":"7","author":"Wang","year":"2016","journal-title":"Nat. Commun"},{"key":"2023012712492502600_bty353-B47","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1007\/s11427-013-4540-y","article-title":"Systematic analysis of intron size and abundance parameters in diverse lineages","volume":"56","author":"Yan","year":"2013","journal-title":"Sci. China"},{"key":"2023012712492502600_bty353-B48","doi-asserted-by":"crossref","first-page":"1033","DOI":"10.1093\/gbe\/evs070","article-title":"The evolution of intron size in amniotes: a role for powered flight?","volume":"4","author":"Zhang","year":"2012","journal-title":"Genome Biol. Evol"},{"key":"2023012712492502600_bty353-B49","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2164-10-47","article-title":"Patterns of exon-intron architecture variation of genes in eukaryotic genomes","volume":"10","author":"Zhu","year":"2009","journal-title":"BMC Genomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/19\/3258\/48919130\/bioinformatics_34_19_3258.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/19\/3258\/48919130\/bioinformatics_34_19_3258.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T01:19:53Z","timestamp":1751591993000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/19\/3258\/4990825"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,5,2]]},"references-count":49,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2018,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty353","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,10,1]]},"published":{"date-parts":[[2018,5,2]]}}}