{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T06:15:51Z","timestamp":1773382551611,"version":"3.50.1"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T00:00:00Z","timestamp":1653955200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T00:00:00Z","timestamp":1653955200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>In integrative bioinformatic analyses, it is of great interest to stablish the equivalence between gene or (more in general) feature lists, up to a given level and in terms of their annotations in the Gene Ontology. The aim of this article is to present an equivalence test based on the proportion of GO terms which are declared as enriched in both lists simultaneously.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>On the basis of these data, the dissimilarity between gene lists is measured by means of the Sorensen\u2013Dice index. We present two flavours of the same test: One of them based on the asymptotic normality of the test statistic and the other based on the bootstrap method.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The accuracy of these tests is studied by means of simulation and their possible interest is illustrated by using them over two real datasets: A collection of gene lists related to cancer and a collection of gene lists related to kidney rejection after transplantation.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-022-04739-2","type":"journal-article","created":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T15:05:18Z","timestamp":1654009518000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["An equivalence test between features lists, based on the Sorensen\u2013Dice index and the joint frequencies of GO term enrichment"],"prefix":"10.1186","volume":"23","author":[{"given":"Pablo","family":"Flores","sequence":"first","affiliation":[]},{"given":"Miquel","family":"Salicr\u00fa","sequence":"additional","affiliation":[]},{"given":"Alex","family":"S\u00e1nchez-Pla","sequence":"additional","affiliation":[]},{"given":"Jordi","family":"Oca\u00f1a","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,5,31]]},"reference":[{"key":"4739_CR1","unstructured":"Micheel CM, Nass SJ, Omenn GS, Trials, C.o.t.R.o.O.-B.T.f.P.P.O.i.C., Services, B.o.H.C., Policy, B.o.H.S., of Medicine, I.: Omics-Based Clinical Discovery: Science, Technology, and Applications 2012"},{"issue":"01","key":"4739_CR2","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1055\/s-0038-1667085","volume":"27","author":"E Math\u00e9","year":"2018","unstructured":"Math\u00e9 E, Hays JL, Stover DG, Chen JL. The omics revolution continues: the maturation of high-throughput biological data sources. Yearb Med Inform. 2018;27(01):211\u201322.","journal-title":"Yearb Med Inform"},{"issue":"1","key":"4739_CR3","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1016\/J.JNEUROIM.2012.04.008","volume":"248","author":"A S\u00e1nchez-Pla","year":"2012","unstructured":"S\u00e1nchez-Pla A, Reverter F, Ru\u00edz de Villa MC, Comabella M. Transcriptomics: mRNA and alternative splicing. J Neuroimmunol. 2012;248(1):23\u201331. https:\/\/doi.org\/10.1016\/J.JNEUROIM.2012.04.008.","journal-title":"J Neuroimmunol"},{"issue":"2","key":"4739_CR4","doi-asserted-by":"publisher","first-page":"1002375","DOI":"10.1371\/journal.pcbi.1002375","volume":"8","author":"P Khatri","year":"2012","unstructured":"Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol. 2012;8(2):1002375. https:\/\/doi.org\/10.1371\/journal.pcbi.1002375.","journal-title":"PLoS Comput Biol"},{"issue":"2","key":"4739_CR5","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1016\/S0888-7543(02)00021-6","volume":"81","author":"S Dr\u01ceghici","year":"2003","unstructured":"Dr\u01ceghici S, Khatri P, Martins RP, Ostermeier GC, Krawetz SA. Global functional profiling of gene expression. Genomics. 2003;81(2):98\u2013104.","journal-title":"Genomics"},{"issue":"43","key":"4739_CR6","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Nat Acad Sci. 2005;102(43):15545\u201350.","journal-title":"Proc Nat Acad Sci"},{"issue":"5","key":"4739_CR7","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1089\/omi.2011.0118","volume":"16","author":"G Yu","year":"2012","unstructured":"Yu G, Wang L-G, Han Y, He Q-Y. Clusterprofiler: an r package for comparing biological themes among gene clusters. Omics: J Integr Biol. 2012;16(5):284\u20137.","journal-title":"Omics: J Integr Biol"},{"issue":"4","key":"4739_CR8","doi-asserted-by":"publisher","first-page":"e0154315","DOI":"10.1371\/journal.pone.0154315","volume":"11","author":"G Lin","year":"2016","unstructured":"Lin G, Chai J, Yuan S, Mai C, Cai L, Murphy RW, Zhou W, Luo J. Vennpainter: a tool for the comparison and identification of candidate genes based on venn diagrams. PloS one. 2016;11(4):e0154315.","journal-title":"PloS one"},{"issue":"Web Server issu","key":"4739_CR9","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1093\/nar\/gkq418","volume":"38","author":"V Kaimal","year":"2010","unstructured":"Kaimal V, Bardes EE, Tabar SC, Jegga AG, Aronow BJ. ToppCluster: a multiple gene list feature analyzer for comparative enrichment clustering and network-based dissection of biological systems. Nucleic Acids Res. 2010;38(Web Server issue):96\u2013102. https:\/\/doi.org\/10.1093\/nar\/gkq418.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"4739_CR10","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1186\/1471-2105-12-401","volume":"12","author":"M Salicr\u00fa","year":"2011","unstructured":"Salicr\u00fa M, Oca\u00f1a J, S\u00e1nchez-Pla A. Comparison of lists of genes based on functional profiles. BMC Bioinform. 2011;12(1):401.","journal-title":"BMC Bioinform"},{"issue":"12","key":"4739_CR11","doi-asserted-by":"publisher","first-page":"3975","DOI":"10.1016\/j.jspi.2007.04.015","volume":"137","author":"A S\u00e1nchez-Pla","year":"2007","unstructured":"S\u00e1nchez-Pla A, Salicr\u00fa M, Oca\u00f1a J. Statistical methods for the analysis of high-throughput data based on functional profiles derived from the gene ontology. J Stat Plan Inference. 2007;137(12):3975\u201389.","journal-title":"J Stat Plan Inference"},{"issue":"1","key":"4739_CR12","doi-asserted-by":"publisher","first-page":"441","DOI":"10.1186\/s12859-019-3008-x","volume":"20","author":"A S\u00e1nchez-Pla","year":"2019","unstructured":"S\u00e1nchez-Pla A, Salicr\u00fa M, Oca\u00f1a J. An equivalence approach to the integrative analysis of feature lists. BMC Bioinform. 2019;20(1):441.","journal-title":"BMC Bioinform"},{"key":"4739_CR13","unstructured":"S\u00e1nchez-Pla A, Salicr\u00fa M, Ocana J. goProfiles: an R package for the statistical analysis of functional profiles. \u2019https:\/\/www.bioconductor.org\/packages\/release\/bioc\/html\/goProfiles.html\u2019. Accessed 2021-11-28."},{"issue":"1","key":"4739_CR14","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25\u20139.","journal-title":"Nat Genet"},{"issue":"D1","key":"4739_CR15","doi-asserted-by":"publisher","first-page":"325","DOI":"10.1093\/nar\/gkaa1113","volume":"49","author":"C Logie","year":"2021","unstructured":"Logie C, Consortium GO, et al. The gene ontology resource: enriching a gold mine. Nucleic Acids Res. 2021;49(D1):325\u201334.","journal-title":"Nucleic Acids Res"},{"key":"4739_CR16","doi-asserted-by":"publisher","DOI":"10.1201\/EBK1439808184","volume-title":"Testing statistical hypotheses of equivalence and noninferiority","author":"S Wellek","year":"2010","unstructured":"Wellek S. Testing statistical hypotheses of equivalence and noninferiority. London: Chapman and Hall\/CRC; 2010."},{"key":"4739_CR17","first-page":"1","volume":"5","author":"T S\u00f8rensen","year":"1948","unstructured":"S\u00f8rensen T. A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on danish commons. Biol Skar. 1948;5:1\u201334.","journal-title":"Biol Skar"},{"key":"4739_CR18","doi-asserted-by":"publisher","first-page":"148","DOI":"10.1111\/j.1461-0248.2004.00707.x","volume":"8","author":"A Chao","year":"2005","unstructured":"Chao A, Chazdon RL, Colwell RK, Shen T-J. A new statistical approach for assessing compositional similarity based on incidence and abundance data. Ecol Lett. 2005;8:148\u201359.","journal-title":"Ecol Lett"},{"issue":"3","key":"4739_CR19","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1214\/aoms\/1177732594","volume":"6","author":"JL Doob","year":"1935","unstructured":"Doob JL. The limiting distributions of certain statistics. Ann Math Stat. 1935;6(3):160\u20139.","journal-title":"Ann Math Stat"},{"key":"4739_CR20","unstructured":"Food U, (FDA), DA. Guidance on statistical procedures for bioequivalence using a standard two-treatment crossover design. Technical report, FDA, Division of Bioequivalence, Office of Generic Drugs, Centre for Drug Evaluation and Research, Rockville, MD 1992"},{"issue":"2","key":"4739_CR21","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1177\/009286150003400225","volume":"34","author":"JJ Chen","year":"2000","unstructured":"Chen JJ, Tsong Y, Kang S-H. Tests for equivalence or noninferiority between two proportions. Drug Inf J. 2000;34(2):569\u201378.","journal-title":"Drug Inf J"},{"key":"4739_CR22","unstructured":"Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. 1979;65\u201370"},{"issue":"1","key":"4739_CR23","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc: Ser B (Methodol). 1995;57(1):289\u2013300.","journal-title":"J R Stat Soc: Ser B (Methodol)"},{"key":"4739_CR24","unstructured":"R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2021). R Foundation for Statistical Computing. https:\/\/www.R-project.org\/"},{"issue":"6","key":"4739_CR25","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1016\/0167-9473(94)00023-C","volume":"19","author":"J Oca\u00f1a","year":"1995","unstructured":"Oca\u00f1a J, Vegas E. Variance reduction for Bernoulli response variables in simulation. Comput Stat Data Anal. 1995;19(6):631\u201340.","journal-title":"Comput Stat Data Anal"},{"key":"4739_CR26","doi-asserted-by":"crossref","unstructured":"Pesquita C. Semantic similarity in the gene ontology. In: The gene ontology handbook, New York, NY: Humana Press; 2017. pp. 161\u2013173.","DOI":"10.1007\/978-1-4939-3743-1_12"},{"key":"4739_CR27","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1007\/978-1-0716-0301-7_11","volume":"2117","author":"G Yu","year":"2020","unstructured":"Yu G. Gene ontology semantic similarity analysis using gosemsim. Methods Mol Biol. 2020;2117:207\u201315. https:\/\/doi.org\/10.1007\/978-1-0716-0301-7_11.","journal-title":"Methods Mol Biol"},{"issue":"7","key":"4739_CR28","doi-asserted-by":"publisher","first-page":"976","DOI":"10.1093\/bioinformatics\/btq064","volume":"26","author":"G Yu","year":"2010","unstructured":"Yu G, Li F, Qin Y, Bo X, Wu Y, Wang S. Gosemsim: an r package for measuring semantic similarity among go terms and gene products. Bioinformatics. 2010;26(7):976\u20138. https:\/\/doi.org\/10.1093\/bioinformatics\/btq064.","journal-title":"Bioinformatics"},{"issue":"2","key":"4739_CR29","first-page":"209","volume":"27","author":"N Mantel","year":"1967","unstructured":"Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967;27(2):209\u201320.","journal-title":"Cancer Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04739-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04739-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04739-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,26]],"date-time":"2024-09-26T05:03:21Z","timestamp":1727327001000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04739-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,31]]},"references-count":29,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4739"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04739-2","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,31]]},"assertion":[{"value":"20 January 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 May 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 May 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable because the data used is external and it comes from sources outside our control.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"207"}}