{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:11:04Z","timestamp":1772165464257,"version":"3.50.1"},"reference-count":99,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T00:00:00Z","timestamp":1658361600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T00:00:00Z","timestamp":1658361600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000265","name":"Medical Research Council","doi-asserted-by":"publisher","award":["MC UU 00002\/4"],"award-info":[{"award-number":["MC UU 00002\/4"]}],"id":[{"id":"10.13039\/501100000265","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000265","name":"Medical Research Council","doi-asserted-by":"publisher","award":["MC UU 00002\/13"],"award-info":[{"award-number":["MC UU 00002\/13"]}],"id":[{"id":"10.13039\/501100000265","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018956","name":"Cambridge Biomedical Research Centre","doi-asserted-by":"publisher","award":["BRC-1215-20014"],"award-info":[{"award-number":["BRC-1215-20014"]}],"id":[{"id":"10.13039\/501100018956","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010269","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["WT107881"],"award-info":[{"award-number":["WT107881"]}],"id":[{"id":"10.13039\/100010269","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>Cluster analysis is an integral part of precision medicine and systems biology, used to define groups of patients or biomolecules. Consensus clustering is an ensemble approach that is widely used in these areas, which combines the output from multiple runs of a non-deterministic clustering algorithm. Here we consider the application of consensus clustering to a broad class of heuristic clustering algorithms that can be derived from Bayesian mixture models (and extensions thereof) by adopting an early stopping criterion when performing sampling-based inference for these models. While the resulting approach is non-Bayesian, it inherits the usual benefits of consensus clustering, particularly in terms of computational scalability and providing assessments of clustering stability\/robustness.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In simulation studies, we show that our approach can successfully uncover the target clustering structure, while also exploring different plausible clusterings of the data. We show that, when a parallel computation environment is available, our approach offers significant reductions in runtime compared to performing sampling-based Bayesian inference for the underlying model, while retaining many of the practical benefits of the Bayesian approach, such as exploring different numbers of clusters. We propose a heuristic to decide upon ensemble size and the early stopping criterion, and then apply consensus clustering to a clustering algorithm derived from a Bayesian integrative clustering method. We use the resulting approach to perform an integrative analysis of three \u2019omics datasets for budding yeast and find clusters of co-expressed genes with shared regulatory proteins. We validate these clusters using data external to the analysis.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclustions<\/jats:title>\n                    <jats:p>Our approach can be used as a wrapper for essentially any existing sampling-based Bayesian clustering implementation, and enables meaningful clustering analyses to be performed using such implementations, even when computational Bayesian inference is not feasible, e.g. due to poor exploration of the target density (often as a result of increasing numbers of features) or a limited computational budget that does not along sufficient samples to drawn from a single chain. This enables researchers to straightforwardly extend the applicability of existing software to much larger datasets, including implementations of sophisticated models such as those that jointly model multiple datasets.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-022-04830-8","type":"journal-article","created":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T13:03:05Z","timestamp":1658408585000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":28,"title":["Consensus clustering for Bayesian mixture models"],"prefix":"10.1186","volume":"23","author":[{"given":"Stephen","family":"Coleman","sequence":"first","affiliation":[]},{"given":"Paul D. W.","family":"Kirk","sequence":"additional","affiliation":[]},{"given":"Chris","family":"Wallace","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,7,21]]},"reference":[{"issue":"6","key":"4830_CR1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1004310","volume":"11","author":"BP Hejblum","year":"2015","unstructured":"Hejblum BP, Skinner J, Thi\u00e9baut R. Time-course gene set analysis for longitudinal gene expression data. PLoS Comput Biol. 2015;11(6): e1004310.","journal-title":"PLoS Comput Biol."},{"issue":"2","key":"4830_CR2","doi-asserted-by":"publisher","first-page":"427","DOI":"10.1208\/s12248-012-9447-1","volume":"15","author":"JP Bai","year":"2013","unstructured":"Bai JP, Alekseyenko AV, Statnikov A, Wang IM, Wong PH. Strategic applications of gene expression: from drug discovery\/development to bedside. AAPS J. 2013;15(2):427\u201337.","journal-title":"AAPS J."},{"key":"4830_CR3","doi-asserted-by":"publisher","first-page":"38","DOI":"10.3389\/fcell.2014.00038","volume":"2","author":"F Emmert-Streib","year":"2014","unstructured":"Emmert-Streib F, Dehmer M, Haibe-Kains B. Gene regulatory networks and their applications: understanding biological and medical problems in terms of networks. Front Cell Dev Biol. 2014;2:38.","journal-title":"Front Cell Dev Biol."},{"issue":"2","key":"4830_CR4","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1109\/TIT.1982.1056489","volume":"28","author":"S Lloyd","year":"1982","unstructured":"Lloyd S. Least squares quantization in PCM. IEEE Trans Inf Theory. 1982;28(2):129\u201337.","journal-title":"IEEE Trans Inf Theory."},{"key":"4830_CR5","first-page":"768","volume":"21","author":"EW Forgy","year":"1965","unstructured":"Forgy EW. Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics. 1965;21:768\u20139.","journal-title":"Biometrics."},{"key":"4830_CR6","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","volume":"20","author":"PJ Rousseeuw","year":"1987","unstructured":"Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53\u201365.","journal-title":"J Comput Appl Math."},{"key":"4830_CR7","unstructured":"Arthur D, Vassilvitskii S. K-Means++: The Advantages of Careful Seeding. In: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms. SODA \u201907. USA: Society for Industrial and Applied Mathematics; 2007. p. 1027\u20131035."},{"issue":"1","key":"4830_CR8","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Mach Learn. 2001;45(1):5\u201332.","journal-title":"Mach Learn."},{"issue":"4","key":"4830_CR9","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","volume":"38","author":"JH Friedman","year":"2002","unstructured":"Friedman JH. Stochastic gradient boosting. Comput Stat Data Anal. 2002;38(4):367\u201378.","journal-title":"Comput Stat Data Anal."},{"issue":"1\u20132","key":"4830_CR10","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/A:1023949509487","volume":"52","author":"S Monti","year":"2003","unstructured":"Monti S, Tamayo P, Mesirov J, Golub T. Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Mach Learn. 2003;52(1\u20132):91\u2013118.","journal-title":"Mach Learn."},{"issue":"12","key":"4830_CR11","doi-asserted-by":"publisher","first-page":"1572","DOI":"10.1093\/bioinformatics\/btq170","volume":"26","author":"MD Wilkerson","year":"2010","unstructured":"Wilkerson MD, Hayes DN. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics. 2010;26(12):1572\u20133.","journal-title":"Bioinformatics."},{"issue":"1","key":"4830_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-020-58766-1","volume":"10","author":"CR John","year":"2020","unstructured":"John CR, Watson D, Russ D, Goldmann K, Ehrenstein M, Pitzalis C, et al. M3C: Monte Carlo reference-based consensus clustering. Sci Rep. 2020;10(1):1\u201314.","journal-title":"Sci Rep."},{"key":"4830_CR13","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkaa1146","author":"Z Gu","year":"2020","unstructured":"Gu Z, Schlesner M, H\u00fcbschmann D. cola: an R\/Bioconductor package for consensus partitioning through a general framework. Nucleic Acids Res. 2020. https:\/\/doi.org\/10.1093\/nar\/gkaa1146.","journal-title":"Nucleic Acids Res."},{"issue":"7","key":"4830_CR14","doi-asserted-by":"publisher","first-page":"2750","DOI":"10.1172\/JCI45014","volume":"121","author":"BD Lehmann","year":"2011","unstructured":"Lehmann BD, Bauer JA, Chen X, Sanders ME, Chakravarthy AB, Shyr Y, et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J Clin Invest. 2011;121(7):2750\u201367.","journal-title":"J Clin Invest."},{"issue":"1","key":"4830_CR15","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1016\/j.ccr.2009.12.020","volume":"17","author":"RG Verhaak","year":"2010","unstructured":"Verhaak RG, Hoadley KA, Purdom E, Wang V, Qi Y, Wilkerson MD, et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell. 2010;17(1):98\u2013110.","journal-title":"Cancer Cell."},{"issue":"5","key":"4830_CR16","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1038\/nmeth.4236","volume":"14","author":"VY Kiselev","year":"2017","unstructured":"Kiselev VY, Kirschner K, Schaub MT, Andrews T, Yiu A, Chandra T, et al. SC3: consensus clustering of single-cell RNA-seq data. Nat Methods. 2017;14(5):483\u20136.","journal-title":"Nat Methods."},{"key":"4830_CR17","doi-asserted-by":"crossref","unstructured":"Li T, Ding C. Weighted consensus clustering. In: Proceedings of the 2008 SIAM international conference on data mining. Society for Industrial and Applied Mathematics; 2008. p. 798\u2013809.","DOI":"10.1137\/1.9781611972788.72"},{"issue":"12","key":"4830_CR18","doi-asserted-by":"publisher","first-page":"2315","DOI":"10.1109\/TPAMI.2012.80","volume":"34","author":"C Carpineto","year":"2012","unstructured":"Carpineto C, Romano G. Consensus clustering based on a new probabilistic rand index with application to subtopic retrieval. IEEE Trans Pattern Anal Mach Intell. 2012;34(12):2315\u201326.","journal-title":"IEEE Trans Pattern Anal Mach Intell."},{"key":"4830_CR19","first-page":"583","volume":"3","author":"A Strehl","year":"2002","unstructured":"Strehl A, Ghosh J. Cluster ensembles\u2014a knowledge reuse framework for combining multiple partitions. J Mach Learn Res. 2002;3:583\u2013617.","journal-title":"J Mach Learn Res."},{"key":"4830_CR20","first-page":"636","volume":"50","author":"R Ghaemi","year":"2009","unstructured":"Ghaemi R, Sulaiman MN, Ibrahim H, Mustapha N, et al. A survey: clustering ensembles techniques. World Acad Sci Eng Technol. 2009;50:636\u201345.","journal-title":"World Acad Sci Eng Technol."},{"key":"4830_CR21","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1016\/j.eswa.2019.01.074","volume":"125","author":"R \u00dcnl\u00fc","year":"2019","unstructured":"\u00dcnl\u00fc R, Xanthopoulos P. Estimating the number of clusters in a dataset via consensus clustering. Expert Syst Appl. 2019;125:33\u20139.","journal-title":"Expert Syst Appl."},{"issue":"458","key":"4830_CR22","doi-asserted-by":"publisher","first-page":"611","DOI":"10.1198\/016214502760047131","volume":"97","author":"C Fraley","year":"2002","unstructured":"Fraley C, Raftery AE. Model-based clustering, discriminant analysis, and density estimation. J Am Stat Assoc. 2002;97(458):611\u201331.","journal-title":"J Am Stat Assoc."},{"issue":"8","key":"4830_CR23","doi-asserted-by":"publisher","first-page":"578","DOI":"10.1093\/comjnl\/41.8.578","volume":"41","author":"C Fraley","year":"1998","unstructured":"Fraley C. How many clusters? Which clustering method? Answers via model-based cluster analysis. Comput J. 1998;41(8):578\u201388.","journal-title":"Comput J."},{"issue":"6","key":"4830_CR24","doi-asserted-by":"publisher","first-page":"1152","DOI":"10.1214\/aos\/1176342871","volume":"2","author":"CE Antoniak","year":"1974","unstructured":"Antoniak CE. Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann Stat. 1974;2(6):1152\u201374.","journal-title":"Ann Stat."},{"key":"4830_CR25","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1016\/B978-0-12-589320-6.50018-6","volume-title":"Recent advances in statistics","author":"TS Ferguson","year":"1983","unstructured":"Ferguson TS. Bayesian density estimation by mixtures of normal distributions. In: Rizvi MH, Rustagi JS, Siegmund D, editors. Recent advances in statistics. London: Academic Press; 1983. p. 287\u2013302."},{"issue":"1","key":"4830_CR26","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1214\/aos\/1176346412","volume":"12","author":"AY Lo","year":"1984","unstructured":"Lo AY. On a class of Bayesian nonparametric estimates: I. Density estimates. Ann Stat. 1984;12(1):351\u20137.","journal-title":"Ann Stat."},{"issue":"4","key":"4830_CR27","doi-asserted-by":"publisher","first-page":"731","DOI":"10.1111\/1467-9868.00095","volume":"59","author":"S Richardson","year":"1997","unstructured":"Richardson S, Green PJ. On Bayesian analysis of mixtures with an unknown number of components. J R Stat Soc Ser B. 1997;59(4):731\u201392.","journal-title":"J R Stat Soc Ser B."},{"issue":"521","key":"4830_CR28","doi-asserted-by":"publisher","first-page":"340","DOI":"10.1080\/01621459.2016.1255636","volume":"113","author":"JW Miller","year":"2018","unstructured":"Miller JW, Harrison MT. Mixture models with a prior on the number of components. J Am Stat Assoc. 2018;113(521):340\u201356.","journal-title":"J Am Stat Assoc"},{"issue":"5","key":"4830_CR29","doi-asserted-by":"publisher","first-page":"689","DOI":"10.1111\/j.1467-9868.2011.00781.x","volume":"73","author":"J Rousseau","year":"2011","unstructured":"Rousseau J, Mengersen K. Asymptotic behaviour of the posterior distribution in overfitted mixture models. J R Stat Soc Ser B (Stat Methodol). 2011;73(5):689\u2013710.","journal-title":"J R Stat Soc Ser B (Stat Methodol)."},{"issue":"24","key":"4830_CR30","doi-asserted-by":"publisher","first-page":"3290","DOI":"10.1093\/bioinformatics\/bts595","volume":"28","author":"P Kirk","year":"2012","unstructured":"Kirk P, Griffin JE, Savage RS, Ghahramani Z, Wild DL. Bayesian correlated clustering to integrate multiple datasets. Bioinformatics. 2012;28(24):3290\u20137.","journal-title":"Bioinformatics."},{"issue":"20","key":"4830_CR31","doi-asserted-by":"publisher","first-page":"2610","DOI":"10.1093\/bioinformatics\/btt425","volume":"29","author":"EF Lock","year":"2013","unstructured":"Lock EF, Dunson DB. Bayesian consensus clustering. Bioinformatics. 2013;29(20):2610\u20136. https:\/\/doi.org\/10.1093\/bioinformatics\/btt425.","journal-title":"Bioinformatics."},{"issue":"10","key":"4830_CR32","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1005781","volume":"13","author":"E Gabasova","year":"2017","unstructured":"Gabasova E, Reid J, Wernisch L. Clusternomics: integrative context-dependent clustering for heterogeneous datasets. PLoS Comput Biol. 2017;13(10): e1005781.","journal-title":"PLoS Comput Biol."},{"issue":"9","key":"4830_CR33","doi-asserted-by":"publisher","first-page":"1194","DOI":"10.1093\/bioinformatics\/18.9.1194","volume":"18","author":"M Medvedovic","year":"2002","unstructured":"Medvedovic M, Sivaganesan S. Bayesian infinite mixture model based clustering of gene expression profiles. Bioinformatics. 2002;18(9):1194\u2013206.","journal-title":"Bioinformatics"},{"issue":"8","key":"4830_CR34","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1002\/cyto.a.20583","volume":"73","author":"C Chan","year":"2008","unstructured":"Chan C, Feng F, Ottinger J, Foster D, West M, Kepler TB. Statistical mixture modeling for cell subtype identification in flow cytometry. Cytom A J Int Soc Anal Cytol. 2008;73(8):693\u2013701.","journal-title":"Cytom A J Int Soc Anal Cytol."},{"issue":"1","key":"4830_CR35","doi-asserted-by":"publisher","first-page":"638","DOI":"10.1214\/18-AOAS1209","volume":"13","author":"BP Hejblum","year":"2019","unstructured":"Hejblum BP, Alkhassim C, Gottardo R, Caron F, Thi\u00e9baut R, et al. Sequential Dirichlet process mixtures of multivariate skew t-distributions for model-based clustering of flow cytometry data. Ann Appl Stat. 2019;13(1):638\u201360.","journal-title":"Ann Appl Stat."},{"key":"4830_CR36","unstructured":"Prabhakaran S, Azizi E, Carr A, Pe\u2019er D. Dirichlet process mixture model for correcting technical variation in single-cell gene expression data. In: International conference on machine learning; 2016. p. 1070\u20131079."},{"issue":"11","key":"4830_CR37","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1006516","volume":"14","author":"OM Crook","year":"2018","unstructured":"Crook OM, Mulvey CM, Kirk PD, Lilley KS, Gatto L. A Bayesian mixture modelling approach for spatial proteomics. PLoS Comput Biol. 2018;14(11): e1006516.","journal-title":"PLoS Comput Biol"},{"key":"4830_CR38","unstructured":"Martin GM, Frazier DT, Robert CP. Computing Bayes: Bayesian computation from 1763 to the 21st century. arXiv preprint arXiv:2004.06425 2020;."},{"issue":"5","key":"4830_CR39","doi-asserted-by":"crossref","first-page":"1484","DOI":"10.1093\/bioinformatics\/btz778","volume":"36","author":"ME Strauss","year":"2020","unstructured":"Strauss ME, Kirk PD, Reid JE, Wernisch L. GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution. Bioinformatics. 2020;36(5):1484\u201391.","journal-title":"Bioinformatics."},{"issue":"2","key":"4830_CR40","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1080\/17509653.2016.1142191","volume":"11","author":"SL Scott","year":"2016","unstructured":"Scott SL, Blocker AW, Bonassi FV, Chipman HA, George EI, McCulloch RE. Bayes and big data: the consensus Monte Carlo algorithm. Int J Manag Sci Eng Manag. 2016;11(2):78\u201388. https:\/\/doi.org\/10.1080\/17509653.2016.1142191.","journal-title":"Int J Manag Sci Eng Manag."},{"issue":"1","key":"4830_CR41","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1080\/10618600.2019.1624366","volume":"29","author":"Y Ni","year":"2020","unstructured":"Ni Y, M\u00fcller P, Diesendruck M, Williamson S, Zhu Y, Ji Y. Scalable Bayesian nonparametric clustering and classification. J Comput Graph Stat. 2020;29(1):53\u201365. https:\/\/doi.org\/10.1080\/10618600.2019.1624366.","journal-title":"J Comput Graph Stat."},{"issue":"4","key":"4830_CR42","doi-asserted-by":"publisher","first-page":"703","DOI":"10.1080\/10618600.2020.1737085","volume":"29","author":"Y Ni","year":"2020","unstructured":"Ni Y, Ji Y, M\u00fcller P. Consensus Monte Carlo for random subsets using shared anchors. J Comput Graph Stat. 2020;29(4):703\u201314. https:\/\/doi.org\/10.1080\/10618600.2020.1737085.","journal-title":"J Comput Graph Stat."},{"key":"4830_CR43","unstructured":"Welling M, Teh YW. Bayesian learning via stochastic gradient Langevin dynamics. In: Proceedings of the 28th international conference on international conference on machine learning. ICML\u201911. Madison, WI: Omnipress; 2011. p. 681-688."},{"issue":"1","key":"4830_CR44","first-page":"193","volume":"17","author":"YW Teh","year":"2016","unstructured":"Teh YW, Thiery AH, Vollmer SJ. Consistency and fluctuations for stochastic gradient Langevin dynamics. J Mach Learn Res. 2016;17(1):193\u2013225.","journal-title":"J Mach Learn Res."},{"key":"4830_CR45","unstructured":"Johndrow JE, Pillai NS, Smith A. No free lunch for approximate MCMC. arXiv; 2020. arXiv:2010.12514."},{"issue":"533","key":"4830_CR46","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1080\/01621459.2020.1847120","volume":"116","author":"C Nemeth","year":"2021","unstructured":"Nemeth C, Fearnhead P. Stochastic gradient Markov chain Monte Carlo. J Am Stat Assoc. 2021;116(533):433\u201350. https:\/\/doi.org\/10.1080\/01621459.2020.1847120.","journal-title":"J Am Stat Assoc."},{"issue":"3","key":"4830_CR47","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1111\/rssb.12336","volume":"82","author":"PE Jacob","year":"2020","unstructured":"Jacob PE, O\u2019Leary J, Atchad\u00e9 YF. Unbiased Markov chain Monte Carlo methods with couplings. J R Stat Soc Ser B (Stat Methodol). 2020;82(3):543\u2013600.","journal-title":"J R Stat Soc Ser B (Stat Methodol)."},{"issue":"5","key":"4830_CR48","doi-asserted-by":"publisher","DOI":"10.1002\/wics.1435","volume":"10","author":"CP Robert","year":"2018","unstructured":"Robert CP, Elvira V, Tawn N, Wu C. Accelerating MCMC algorithms. Wiley Interdiscip Rev Comput Stat. 2018;10(5): e1435.","journal-title":"Wiley Interdiscip Rev Comput Stat."},{"issue":"1","key":"4830_CR49","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1198\/1061860043001","volume":"13","author":"S Jain","year":"2004","unstructured":"Jain S, Neal RM. A split\u2013merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model. J Comput Graph Stat. 2004;13(1):158\u201382. https:\/\/doi.org\/10.1198\/1061860043001.","journal-title":"J Comput Graph Stat."},{"issue":"3","key":"4830_CR50","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1214\/07-BA219","volume":"2","author":"S Jain","year":"2007","unstructured":"Jain S, Neal RM. Splitting and merging components of a nonconjugate Dirichlet process mixture model. Bayesian Anal. 2007;2(3):445\u201372. https:\/\/doi.org\/10.1214\/07-BA219.","journal-title":"Bayesian Anal."},{"issue":"1","key":"4830_CR51","first-page":"868","volume":"18","author":"A Bouchard-C\u00f4t\u00e9","year":"2017","unstructured":"Bouchard-C\u00f4t\u00e9 A, Doucet A, Roth A. Particle Gibbs split\u2013merge sampling for Bayesian inference in mixture models. J Mach Learn Res. 2017;18(1):868\u2013906.","journal-title":"J Mach Learn Res."},{"issue":"7","key":"4830_CR52","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1080\/00949655.2021.1998502","volume":"92","author":"DB Dahl","year":"2022","unstructured":"Dahl DB, Newcomb S. Sequentially allocated merge\u2013split samplers for conjugate Bayesian nonparametric models. J Stat Comput Simul. 2022;92(7):1487\u2013511. https:\/\/doi.org\/10.1080\/00949655.2021.1998502.","journal-title":"J Stat Comput Simul."},{"key":"4830_CR53","doi-asserted-by":"publisher","unstructured":"Broder A, Garcia-Pueyo L, Josifovski V, Vassilvitskii S, Venkatesan S. Scalable K-means by ranked retrieval. In: Proceedings of the 7th ACM international conference on web search and data mining. WSDM \u201914. New York: Association for Computing Machinery; 2014. p. 233\u201342. https:\/\/doi.org\/10.1145\/2556195.2556260.","DOI":"10.1145\/2556195.2556260"},{"key":"4830_CR54","doi-asserted-by":"publisher","unstructured":"Bachem O, Lucic M, Krause A. Scalable k-means clustering via lightweight coresets. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining. KDD \u201918. New York: Association for Computing Machinery; 2018. p. 1119\u201327. https:\/\/doi.org\/10.1145\/3219819.3219973.","DOI":"10.1145\/3219819.3219973"},{"issue":"8","key":"4830_CR55","doi-asserted-by":"publisher","first-page":"1669","DOI":"10.1109\/TCYB.2014.2358564","volume":"45","author":"D Cai","year":"2015","unstructured":"Cai D, Chen X. Large scale spectral clustering via landmark-based sparse representation. IEEE Trans Cybern. 2015;45(8):1669\u201380.","journal-title":"IEEE Trans Cybern."},{"issue":"3","key":"4830_CR56","doi-asserted-by":"publisher","first-page":"1058","DOI":"10.1109\/TCYB.2018.2794998","volume":"49","author":"L He","year":"2019","unstructured":"He L, Ray N, Guan Y, Zhang H. Fast large-scale spectral clustering via explicit feature mapping. IEEE Trans Cybern. 2019;49(3):1058\u201371.","journal-title":"IEEE Trans Cybern."},{"key":"4830_CR57","first-page":"905","volume":"13","author":"A Rinaldo","year":"2012","unstructured":"Rinaldo A, Singh A, Nugent R, Wasserman L. Stability of density-based clustering. J Mach Learn Res. 2012;13:905.","journal-title":"J Mach Learn Res."},{"key":"4830_CR58","unstructured":"Kent BP, Rinaldo A, Verstynen T. DeBaCl: a python package for interactive density-based clustering. arXiv; 2013. Available from: arXiv:1307.8136."},{"key":"4830_CR59","unstructured":"Von\u00a0Luxburg U, Ben-David S. Towards a statistical theory of clustering. In: Pascal workshop on statistics and optimization of clustering. Citeseer; 2005. p. 20\u20136."},{"issue":"4","key":"4830_CR60","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","volume":"72","author":"N Meinshausen","year":"2010","unstructured":"Meinshausen N, B\u00fchlmann P. Stability selection. J R Stat Soc Ser B (Stat Methodol). 2010;72(4):417\u201373.","journal-title":"J R Stat Soc Ser B (Stat Methodol)."},{"issue":"3","key":"4830_CR61","first-page":"235","volume":"2","author":"U Von Luxburg","year":"2010","unstructured":"Von Luxburg U. Clustering stability: an overview. Found Trends Mach Learn. 2010;2(3):235\u201374.","journal-title":"Found Trends Mach Learn."},{"issue":"1\u20133","key":"4830_CR62","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/0169-7439(87)80084-9","volume":"2","author":"S Wold","year":"1987","unstructured":"Wold S, Esbensen K, Geladi P. Principal component analysis. Chemometr Intell Lab Syst. 1987;2(1\u20133):37\u201352.","journal-title":"Chemometr Intell Lab Syst."},{"issue":"2","key":"4830_CR63","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1214\/09-BA414","volume":"4","author":"A Fritsch","year":"2009","unstructured":"Fritsch A, Ickstadt K. Improved criteria for clustering based on the posterior similarity matrix. Bayesian Anal. 2009;4(2):367\u201391.","journal-title":"Bayesian Anal"},{"key":"4830_CR64","unstructured":"Fritsch A. mcclust: process an MCMC sample of clusterings; 2012. R package version 1.0. https:\/\/CRAN.R-project.org\/package=mcclust."},{"issue":"2","key":"4830_CR65","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1214\/17-BA1073","volume":"13","author":"S Wade","year":"2018","unstructured":"Wade S, Ghahramani Z. Bayesian cluster analysis: point estimation and credible balls (with discussion). Bayesian Anal. 2018;13(2):559\u2013626.","journal-title":"Bayesian Anal."},{"issue":"1","key":"4830_CR66","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1007\/s10994-013-5339-6","volume":"98","author":"A Louren\u00e7o","year":"2015","unstructured":"Louren\u00e7o A, Rota Bul\u00f2 S, Rebagliati N, Fred ALN, Figueiredo MAT, Pelillo M. Probabilistic consensus clustering using evidence accumulation. Mach Learn. 2015;98(1):331\u201357.","journal-title":"Mach Learn."},{"key":"4830_CR67","doi-asserted-by":"crossref","unstructured":"Dahl DB, Johnson DJ, Mueller P. Search algorithms and loss functions for Bayesian clustering. 2021. arXiv:2105.04451 [stat].","DOI":"10.1080\/10618600.2022.2069779"},{"issue":"5","key":"4830_CR68","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1111\/rssb.12158","volume":"78","author":"PG Bissiri","year":"2016","unstructured":"Bissiri PG, Holmes CC, Walker SG. A general framework for updating belief distributions. J R Stat Soc Ser B (Stat Methodol). 2016;78(5):1103\u201330. https:\/\/doi.org\/10.1111\/rssb.12158.","journal-title":"J R Stat Soc Ser B (Stat Methodol)."},{"issue":"6","key":"4830_CR69","doi-asserted-by":"publisher","first-page":"442","DOI":"10.3390\/e20060442","volume":"20","author":"J Jewson","year":"2018","unstructured":"Jewson J, Smith JQ, Holmes C. Principles of Bayesian inference using general divergence criteria. Entropy. 2018;20(6):442.","journal-title":"Entropy."},{"key":"4830_CR70","doi-asserted-by":"crossref","unstructured":"Matsubara T, Knoblauch J, Briol FX, Oates C, et\u00a0al. Robust generalised Bayesian inference for intractable likelihoods. arXiv preprint arXiv:2104.07359. 2021;.","DOI":"10.1111\/rssb.12500"},{"key":"4830_CR71","unstructured":"Law M, Jain A, Figueiredo M. Feature selection in mixture-based clustering. In: Becker S, Thrun S, Obermayer K, editors. Advances in neural information processing systems. vol. 15. MIT Press; 2002. Available from: https:\/\/proceedings.neurips.cc\/paper\/2002\/file\/e58aea67b01fa747687f038dfde066f6-Paper.pdf."},{"issue":"1","key":"4830_CR72","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"L Hubert","year":"1985","unstructured":"Hubert L, Arabie P. Comparing partitions. J Classif. 1985;2(1):193\u2013218.","journal-title":"J Classif."},{"issue":"1","key":"4830_CR73","doi-asserted-by":"publisher","first-page":"289","DOI":"10.32614\/RJ-2016-021","volume":"8","author":"L Scrucca","year":"2016","unstructured":"Scrucca L, Fop M, Murphy BT, Raftery AE. mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. R J. 2016;8(1):289\u2013317. https:\/\/doi.org\/10.32614\/RJ-2016-021.","journal-title":"R J."},{"issue":"2","key":"4830_CR74","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz G, et al. Estimating the dimension of a model. Ann Stat. 1978;6(2):461\u20134.","journal-title":"Ann Stat."},{"key":"4830_CR75","doi-asserted-by":"crossref","unstructured":"Geweke J, et al. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments, vol. 196. Federal Reserve Bank of Minneapolis, Research Department Minneapolis, MN; 1991.","DOI":"10.21034\/sr.148"},{"issue":"4","key":"4830_CR76","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1214\/ss\/1177011136","volume":"7","author":"A Gelman","year":"1992","unstructured":"Gelman A, Rubin DB, et al. Inference from iterative simulation using multiple sequences. Stat Sci. 1992;7(4):457\u201372.","journal-title":"Stat Sci."},{"key":"4830_CR77","unstructured":"Vats D, Knudson C. Revisiting the Gelman\u2013Rubin diagnostic. arXiv preprint arXiv:1812.09384. 2018."},{"issue":"3\/4","key":"4830_CR78","doi-asserted-by":"publisher","first-page":"591","DOI":"10.2307\/2333709","volume":"52","author":"SS Shapiro","year":"1965","unstructured":"Shapiro SS, Wilk MB. An analysis of variance test for normality (complete samples). Biometrika. 1965;52(3\/4):591\u2013611.","journal-title":"Biometrika."},{"key":"4830_CR79","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1007\/978-1-4419-9863-7_16","volume-title":"Encyclopedia of systems biology","author":"JJ Tyson","year":"2013","unstructured":"Tyson JJ, Chen KC, Nov\u00e1k B. Cell cycle, budding yeast. In: Dubitzky W, Wolkenhauer O, Cho KH, Yokota H, editors. Encyclopedia of systems biology. New York: Springer; 2013. p. 337\u201341."},{"issue":"8","key":"4830_CR80","doi-asserted-by":"publisher","first-page":"3841","DOI":"10.1091\/mbc.e03-11-0794","volume":"15","author":"KC Chen","year":"2004","unstructured":"Chen KC, Calzone L, Csikasz-Nagy A, Cross FR, Novak B, Tyson JJ. Integrative analysis of cell cycle control in budding yeast. Mol Biol Cell. 2004;15(8):3841\u201362.","journal-title":"Mol Biol Cell."},{"key":"4830_CR81","first-page":"983","volume":"4","author":"B Alberts","year":"2002","unstructured":"Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P. The cell cycle and programmed cell death. Mol Biol Cell. 2002;4:983\u20131027.","journal-title":"Mol Biol Cell."},{"key":"4830_CR82","doi-asserted-by":"publisher","first-page":"117693510700300","DOI":"10.1177\/117693510700300020","volume":"3","author":"B Ingalls","year":"2007","unstructured":"Ingalls B, Duncker B, Kim D, McConkey B. Systems level modeling of the cell cycle using budding yeast. Cancer Inform. 2007;3:117693510700300020.","journal-title":"Cancer Inform."},{"issue":"3","key":"4830_CR83","doi-asserted-by":"publisher","first-page":"62","DOI":"10.15698\/mic2015.03.191","volume":"2","author":"J Jim\u00e9nez","year":"2015","unstructured":"Jim\u00e9nez J, Bru S, Ribeiro M, Clotet J. Live fast, die soon: cell cycle progression and lifespan in yeast cells. Microb Cell. 2015;2(3):62.","journal-title":"Microb Cell."},{"issue":"3","key":"4830_CR84","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2010-11-3-r24","volume":"11","author":"MV Granovskaia","year":"2010","unstructured":"Granovskaia MV, Jensen LJ, Ritchie ME, Toedling J, Ning Y, Bork P, et al. High-resolution transcription atlas of the mitotic cell cycle in budding yeast. Genome Biol. 2010;11(3):1\u201311.","journal-title":"Genome Biol."},{"issue":"7004","key":"4830_CR85","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1038\/nature02800","volume":"431","author":"CT Harbison","year":"2004","unstructured":"Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, et al. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004;431(7004):99\u2013104.","journal-title":"Nature."},{"issue":"suppl_1","key":"4830_CR86","doi-asserted-by":"publisher","first-page":"D535","DOI":"10.1093\/nar\/gkj109","volume":"34","author":"C Stark","year":"2006","unstructured":"Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic acids Res. 2006;34(suppl_1):D535\u20139.","journal-title":"Nucleic acids Res."},{"issue":"6","key":"4830_CR87","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1016\/S0092-8674(01)00494-9","volume":"106","author":"I Simon","year":"2001","unstructured":"Simon I, Barnett J, Hannett N, Harbison CT, Rinaldi NJ, Volkert TL, et al. Serial regulation of transcriptional regulators in the yeast cell cycle. Cell. 2001;106(6):697\u2013708.","journal-title":"Cell."},{"issue":"6819","key":"4830_CR88","doi-asserted-by":"publisher","first-page":"533","DOI":"10.1038\/35054095","volume":"409","author":"VR Iyer","year":"2001","unstructured":"Iyer VR, Horak CE, Scafe CS, Botstein D, Snyder M, Brown PO. Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature. 2001;409(6819):533\u20138.","journal-title":"Nature."},{"key":"4830_CR89","unstructured":"Carlson M, Falcon S, Pages H, Li N. Org. sc. sgd. db: Genome wide annotation for yeast. R package version. 2014;2(1)."},{"issue":"49","key":"4830_CR90","doi-asserted-by":"publisher","first-page":"34355","DOI":"10.1074\/jbc.M109.065730","volume":"284","author":"M Bando","year":"2009","unstructured":"Bando M, Katou Y, Komata M, Tanaka H, Itoh T, Sutani T, et al. Csm3, Tof1, and Mrc1 form a heterotrimeric mediator complex that associates with DNA replication forks. J Biol Chem. 2009;284(49):34355\u201365.","journal-title":"J Biol Chem."},{"issue":"12","key":"4830_CR91","doi-asserted-by":"crossref","first-page":"3931","DOI":"10.1534\/g3.118.200767","volume":"8","author":"JP Lao","year":"2018","unstructured":"Lao JP, Ulrich KM, Johnson JR, Newton BW, Vashisht AA, Wohlschlegel JA, et al. The yeast DNA damage checkpoint kinase Rad53 targets the exoribonuclease, Xrn1. G3 Genes Genomes Genet. 2018;8(12):3931\u201344.","journal-title":"G3 Genes Genomes Genet."},{"issue":"3","key":"4830_CR92","doi-asserted-by":"publisher","first-page":"320","DOI":"10.1101\/gad.13.3.320","volume":"13","author":"A T\u00f3th","year":"1999","unstructured":"T\u00f3th A, Ciosk R, Uhlmann F, Galova M, Schleiffer A, Nasmyth K. Yeast cohesin complex requires a conserved protein, Eco1p (Ctf7), to establish cohesion between sister chromatids during DNA replication. Genes Dev. 1999;13(3):320\u201333.","journal-title":"Genes Dev."},{"issue":"15","key":"4830_CR93","doi-asserted-by":"publisher","first-page":"2299","DOI":"10.1016\/j.febslet.2013.06.035","volume":"587","author":"GD Mehta","year":"2013","unstructured":"Mehta GD, Kumar R, Srivastava S, Ghosh SK. Cohesin: functions beyond sister chromatid cohesion. FEBS Lett. 2013;587(15):2299\u2013312.","journal-title":"FEBS Lett."},{"issue":"2","key":"4830_CR94","doi-asserted-by":"publisher","first-page":"172","DOI":"10.1016\/S0955-0674(03)00013-9","volume":"15","author":"W Fischle","year":"2003","unstructured":"Fischle W, Wang Y, Allis CD. Histone and chromatin cross-talk. Curr Opin Cell Biol. 2003;15(2):172\u201383.","journal-title":"Curr Opin Cell Biol."},{"issue":"3","key":"4830_CR95","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1038\/cr.2011.22","volume":"21","author":"AJ Bannister","year":"2011","unstructured":"Bannister AJ, Kouzarides T. Regulation of chromatin by histone modifications. Cell Res. 2011;21(3):381\u201395.","journal-title":"Cell Res."},{"issue":"4","key":"4830_CR96","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1016\/j.molcel.2006.06.025","volume":"23","author":"RA de Bruin","year":"2006","unstructured":"de Bruin RA, Kalashnikova TI, Chahwan C, McDonald WH, Wohlschlegel J, Yates J III, et al. Constraining G1-specific transcription to late G1 phase: the MBF-associated corepressor Nrm1 acts via negative feedback. Mol Cell. 2006;23(4):483\u201396.","journal-title":"Mol Cell."},{"issue":"8","key":"4830_CR97","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pgen.1000626","volume":"5","author":"S Aligianni","year":"2009","unstructured":"Aligianni S, Lackner DH, Klier S, Rustici G, Wilhelm BT, Marguerat S, et al. The fission yeast homeodomain protein Yox1p binds to MBF and confines MBF-dependent cell-cycle transcription to G1-S via negative feedback. PLoS Genet. 2009;5(8): e1000626.","journal-title":"PLoS Genet."},{"issue":"6","key":"4830_CR98","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1016\/S0092-8674(00)81211-8","volume":"93","author":"R Ciosk","year":"1998","unstructured":"Ciosk R, Zachariae W, Michaelis C, Shevchenko A, Mann M, Nasmyth K. An ESP1\/PDS1 complex regulates loss of sister chromatid cohesion at the metaphase to anaphase transition in yeast. Cell. 1998;93(6):1067\u201376.","journal-title":"Cell."},{"issue":"1","key":"4830_CR99","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1534\/genetics.108.095513","volume":"181","author":"KF Cooper","year":"2009","unstructured":"Cooper KF, Mallory MJ, Guacci V, Lowe K, Strich R. Pds1p is required for meiotic recombination and prophase I progression in Saccharomyces cerevisiae. Genetics. 2009;181(1):65\u201379.","journal-title":"Genetics."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04830-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04830-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04830-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,24]],"date-time":"2023-11-24T15:57:09Z","timestamp":1700841429000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04830-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,21]]},"references-count":99,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["4830"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04830-8","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.12.17.423244","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,21]]},"assertion":[{"value":"17 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 July 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 July 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"290"}}