{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T22:59:34Z","timestamp":1769641174894,"version":"3.49.0"},"reference-count":54,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Distribution analysis is one of the most basic forms of statistical analysis. Thanks to improved analytical methods, accurate and extensive quantitative measurements can now be made of the mRNA, protein and metabolite from biological systems. Here, we report a large-scale analysis of the population abundance distributions of the transcriptomes, proteomes and metabolomes from varied biological systems.<\/jats:p>\n               <jats:p>Results: We compared the observed empirical distributions with a number of distributions: power law, lognormal, loglogistic, loggamma, right Pareto-lognormal (PLN) and double PLN (dPLN). The best-fit for mRNA, protein and metabolite population abundance distributions was found to be the dPLN. This distribution behaves like a lognormal distribution around the centre, and like a power law distribution in the tails. To better understand the cause of this observed distribution, we explored a simple stochastic model based on geometric Brownian motion. The distribution indicates that multiplicative effects are causally dominant in biological systems. We speculate that these effects arise from chemical reactions: the central-limit theorem then explains the central lognormal, and a number of possible mechanisms could explain the long tails: positive feedback, network topology, etc. Many of the components in the central lognormal parts of the empirical distributions are unidentified and\/or have unknown function. This indicates that much more biology awaits discovery.<\/jats:p>\n               <jats:p>Contact: \u00a0rdk@aber.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp360","type":"journal-article","created":{"date-parts":[[2009,6,18]],"date-time":"2009-06-18T01:45:34Z","timestamp":1245289534000},"page":"2020-2027","source":"Crossref","is-referenced-by-count":45,"title":["An investigation into the population abundance distribution of mRNAs, proteins, and metabolites in biological systems"],"prefix":"10.1093","volume":"25","author":[{"given":"Chuan","family":"Lu","sequence":"first","affiliation":[{"name":"Department of Computer Science, Aberystwyth University, Ceredigion SY23 3DB, UK"}]},{"given":"Ross D.","family":"King","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Aberystwyth University, Ceredigion SY23 3DB, UK"}]}],"member":"286","published-online":{"date-parts":[[2009,6,17]]},"reference":[{"key":"2023013112095356400_B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/jb\/mvi094","article-title":"Scale-freeness and biological networks","volume":"138","author":"Arita","year":"2005","journal-title":"J. Biochem."},{"key":"2023013112095356400_B2","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1007\/s00216-007-1486-6","article-title":"Quantitative mass spectrometry in proteomics: a critical review","volume":"389","author":"Bantscheff","year":"2007","journal-title":"Anal. Bioanal. Chem."},{"key":"2023013112095356400_B3","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1126\/science.286.5439.509","article-title":"Emergence of scaling in random networks","volume":"286","author":"Barab\u00e1si","year":"1999","journal-title":"Science"},{"key":"2023013112095356400_B4","doi-asserted-by":"crossref","first-page":"2242","DOI":"10.1126\/science.1103388","article-title":"Global identification of human transcribed sequences with genome tiling arrays","volume":"306","author":"Bertone","year":"2004","journal-title":"Science"},{"key":"2023013112095356400_B5","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1038\/nbt1239","article-title":"The microarray quality control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements","volume":"24","author":"Canales","year":"2006","journal-title":"Nat. Biotechnol."},{"key":"2023013112095356400_B6","doi-asserted-by":"crossref","first-page":"14458","DOI":"10.1073\/pnas.0503955102","article-title":"Hierarchical metabolomics demonstrates substantial coompositional similarity between genetically modified and conventional potato crops","volume":"102","author":"Catchpole","year":"2005","journal-title":"Proc. Natl Acad. Sci.USA"},{"key":"2023013112095356400_B7","first-page":"49","article-title":"Pareto-gamma statistic reveals global rescaling in transcriptomes of low and high aggressive breast cancer phenotypes","volume-title":"Pattern Recognition in Bioinformatics, International workshop, PRIB 2006","author":"Chua Alvin","year":"2006"},{"key":"2023013112095356400_B8","doi-asserted-by":"crossref","first-page":"991","DOI":"10.1038\/nature06525","article-title":"The biological impact of mass-spectrometry-based proteomics","volume":"450","author":"Cravatt Benjamin","year":"2007","journal-title":"Nature"},{"key":"2023013112095356400_B9","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1016\/S0006-3495(03)74839-5","article-title":"Near-critical phenomena in intracellular metabolite pools","volume":"84","author":"Elf","year":"2003","journal-title":"Biophys. J."},{"key":"2023013112095356400_B10","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1023\/A:1013713905833","article-title":"Metabolomics - the link between genotypes and phenotypes","volume":"48","author":"Fiehn","year":"2002","journal-title":"Plant Mol. Biol."},{"key":"2023013112095356400_B11","doi-asserted-by":"crossref","first-page":"16830","DOI":"10.1103\/PhysRevLett.97.168302","article-title":"Linking stochastic dynamics to population distribution: an analytical framework of gene expression","volume":"97","author":"Friedman","year":"2006","journal-title":"Phys. Rev. Lett."},{"key":"2023013112095356400_B12","doi-asserted-by":"crossref","first-page":"088102","DOI":"10.1103\/PhysRevLett.90.088102","article-title":"Zipf's law in gene expression","volume":"90","author":"Furusawa","year":"2003","journal-title":"Phys. Rev. Lett."},{"key":"2023013112095356400_B13","doi-asserted-by":"crossref","first-page":"e8","DOI":"10.1371\/journal.pcbi.0040008","article-title":"Noise propagation and signaling sensitivity in biological networks: a role for positive feedback","volume":"4","author":"Hornung","year":"2008","journal-title":"PLoS Comput. Biol."},{"key":"2023013112095356400_B14","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1093\/bioinformatics\/18.4.576","article-title":"Making sense of microarray data distributions","volume":"18","author":"Hoyle","year":"2002","journal-title":"Bioinformatics"},{"key":"2023013112095356400_B15","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1186\/1471-2164-9-102","article-title":"Protein abundance profiling of the Escherichia coli cytosol","volume":"9","author":"Ishihama","year":"2008","journal-title":"BMC Genomics"},{"key":"2023013112095356400_B16","volume-title":"Probability Theory: The Logic of Science","author":"Jaynes","year":"2005"},{"key":"2023013112095356400_B17","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1038\/35036627","article-title":"The large-scale organization of metabolic networks","volume":"407","author":"Jeong","year":"2000","journal-title":"Nature"},{"key":"2023013112095356400_B18","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/35075138","article-title":"Lethality and centrality in protein networks","volume":"411","author":"Jeong","year":"2001","journal-title":"Nature"},{"key":"2023013112095356400_B19","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.mib.2004.04.012","article-title":"Metabolomics and systems biology: making sense of the soup","volume":"7","author":"Kell","year":"2004","journal-title":"Curr. Opin Microbiol."},{"key":"2023013112095356400_B20","volume-title":"Modern Epidemiology.","author":"Kenneth","year":"1998","edition":"2"},{"key":"2023013112095356400_B21","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1089\/cmb.2006.13.810","article-title":"How scale-free are biological networks","volume":"13","author":"Khanin","year":"2006","journal-title":"J. Computat. Biol."},{"key":"2023013112095356400_B22","doi-asserted-by":"crossref","first-page":"2395","DOI":"10.1093\/nar\/gkn087","article-title":"Specificity of DNA microarray hybridization: characterization, effectors and approaches for data correction","volume":"36","author":"Koltai","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023013112095356400_B23","first-page":"1471","article-title":"Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment","volume":"5","author":"Konishi","year":"2004","journal-title":"BMC Bioinformatics."},{"key":"2023013112095356400_B24","doi-asserted-by":"crossref","first-page":"1321","DOI":"10.1093\/genetics\/161.3.1321","article-title":"General statistics of stochastic process of gene expression in eukaryotic cells","volume":"161","author":"Kuznetsov","year":"2002","journal-title":"Genetics"},{"key":"2023013112095356400_B25","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1641\/0006-3568(2001)051[0341:LNDATS]2.0.CO;2","article-title":"Log-normal distributions across the sciences: keys and clues","volume":"51","author":"Limpert","year":"2001","journal-title":"Bioscience"},{"key":"2023013112095356400_B26","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1038\/35015701","article-title":"Genomics, gene expression and DNA arrays","volume":"405","author":"Lockhart","year":"2000","journal-title":"Nature"},{"key":"2023013112095356400_B27","doi-asserted-by":"crossref","first-page":"13629","DOI":"10.1073\/pnas.0601476103","article-title":"Accurately quantifying low-abundant targets amid similar sequences by revealing hidden correlations in oligonucleotide microarray data","volume":"103","author":"Marcelino","year":"2006","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112095356400_B28","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1126\/science.298.5594.824","article-title":"Network motifs: simple building blocks of complex networks","volume":"298","author":"Milo","year":"2002","journal-title":"Science"},{"key":"2023013112095356400_B29","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1080\/15427951.2004.10129088","article-title":"A brief history of generative models for power law and lognormal distributions","volume":"1","author":"Mitzenmacher","year":"2004","journal-title":"Internet Math."},{"key":"2023013112095356400_B30","volume-title":"Chance and Necessity.","author":"Monod","year":"1971"},{"key":"2023013112095356400_B31","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1126\/science.1158441","article-title":"The transcriptional landscape of the yeast genome defined by RNA sequencing","volume":"320","author":"Nagalakshmi","year":"2008","journal-title":"Science"},{"key":"2023013112095356400_B32","doi-asserted-by":"crossref","first-page":"840","DOI":"10.1038\/nature04785","article-title":"Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise","volume":"441","author":"Newman","year":"2006","journal-title":"Nature"},{"key":"2023013112095356400_B33","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1016\/j.physleta.2004.07.045","article-title":"A constructive approach to gene expression dynamics","volume":"330","author":"Ochiai","year":"2004","journal-title":"Phys. Lett. A"},{"key":"2023013112095356400_B34","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nature02257","article-title":"Summing up the noise in gene networks","volume":"427","author":"Paulsson","year":"2004","journal-title":"Nature"},{"key":"2023013112095356400_B35","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.plrev.2005.03.003","article-title":"Models of stochastic gene expression","volume":"2","author":"Paulsson","year":"2005","journal-title":"Phys. Life Rev."},{"key":"2023013112095356400_B36","doi-asserted-by":"crossref","first-page":"963","DOI":"10.1126\/science.1093669","article-title":"Transitions from nonliving to living matter","volume":"303","author":"Rasmussen","year":"2004","journal-title":"Science"},{"key":"2023013112095356400_B37","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1016\/S0378-4371(02)01507-8","article-title":"The Pareto law of incomes - an explaination and an extention","volume":"319","author":"Reed","year":"2003","journal-title":"Physica A"},{"key":"2023013112095356400_B38","doi-asserted-by":"crossref","first-page":"1733","DOI":"10.1081\/STA-120037438","article-title":"The double Pareto-lognormal distribution - a new parametric model for size distributions","volume":"33","author":"Reed","year":"2004","journal-title":"Com. Stats Theory Methods"},{"key":"2023013112095356400_B39","doi-asserted-by":"crossref","first-page":"E190","DOI":"10.1038\/35087138","article-title":"Navigating gene expression using microarrays a technology review","volume":"3","author":"Schulze","year":"2001","journal-title":"Nat. Cell Biol."},{"key":"2023013112095356400_B40","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1073\/pnas.23.4.224","article-title":"The relation of gene to character in quantitative inheritance","volume":"23","author":"Sinnott","year":"1937","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112095356400_B41","doi-asserted-by":"crossref","first-page":"675","DOI":"10.1016\/j.cell.2004.09.008","article-title":"Robustness of cellular functions","volume":"118","author":"Stelling","year":"2004","journal-title":"Cell"},{"key":"2023013112095356400_B42","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1080\/01621459.1974.10480196","article-title":"EDF statistics for goodness of fit and some comparisons","volume":"69","author":"Stephens","year":"1974","journal-title":"J. Am. Stat. Assoc."},{"key":"2023013112095356400_B43","first-page":"65","article-title":"Statistical model selection applied to biological network data","volume":"3","author":"Stumpf","year":"2005","journal-title":"Proc. Computat. Syst. Biology"},{"key":"2023013112095356400_B44","doi-asserted-by":"crossref","first-page":"4221","DOI":"10.1073\/pnas.0501179102","article-title":"Subnets of scale-free networks are not scale-free: Sampling properties of the networks","volume":"102","author":"Stumpf","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112095356400_B45","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/j.ecoinf.2005.12.003","article-title":"Statistical mechanics of relative species abundance","volume":"1","author":"Tokita","year":"2006","journal-title":"Ecol. Inform."},{"key":"2023013112095356400_B46","doi-asserted-by":"crossref","first-page":"808","DOI":"10.1126\/science.1091317","article-title":"Global mapping of the yeast genetic interaction network","volume":"303","author":"Tong","year":"2004","journal-title":"Science"},{"key":"2023013112095356400_B47","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"The Gene Ontology Consortium","year":"2000","journal-title":"Nat. Genet."},{"key":"2023013112095356400_B48","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1038\/nature01510","article-title":"From genomics to proteomics","volume":"422","author":"Tyers","year":"2003","journal-title":"Nature"},{"key":"2023013112095356400_B49","doi-asserted-by":"crossref","first-page":"3765","DOI":"10.1073\/pnas.0306244101","article-title":"Universality and flexibility in gene expression from bacteria to human","volume":"101","author":"Ueda","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112095356400_B50","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1016\/S0168-9525(03)00203-8","article-title":"Scaling laws in the functional content of genomes","volume":"9","author":"Van Nimwegen","year":"2003","journal-title":"Trends Genet."},{"key":"2023013112095356400_B51","doi-asserted-by":"crossref","first-page":"1239","DOI":"10.1038\/nature07002","article-title":"Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution","volume":"453","author":"Wilhelm","year":"2008","journal-title":"Nature"},{"key":"2023013112095356400_B52","doi-asserted-by":"crossref","first-page":"D521","DOI":"10.1093\/nar\/gkl923","article-title":"HMDB: the human metabolome database","volume":"35","author":"Wishart","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023013112095356400_B53","doi-asserted-by":"crossref","first-page":"882","DOI":"10.1089\/cmb.2005.12.882","article-title":"Stochastic models inspired by hybridization theory for short oligonucleotide arrays","volume":"12","author":"Wu","year":"2005","journal-title":"J. Comput. Biol."},{"key":"2023013112095356400_B54","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1098\/rstb.1925.0002","article-title":"A mathematical theory of evolution based on the conclusions of Dr. J.C. Willis","volume":"213","author":"Yule","year":"1925","journal-title":"Philos. Trans. R. Soc. Lond. B"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/16\/2020\/48994458\/bioinformatics_25_16_2020.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/16\/2020\/48994458\/bioinformatics_25_16_2020.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:31:21Z","timestamp":1675200681000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/16\/2020\/204951"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,6,17]]},"references-count":54,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2009,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp360","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,8,15]]},"published":{"date-parts":[[2009,6,17]]}}}