{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:21:22Z","timestamp":1772173282620,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1011443","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2023,12,22]],"date-time":"2023-12-22T00:00:00Z","timestamp":1703203200000}}],"reference-count":46,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2023,12,1]],"date-time":"2023-12-01T00:00:00Z","timestamp":1701388800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["785907"],"award-info":[{"award-number":["785907"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["945539"],"award-info":[{"award-number":["945539"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004837","name":"Ministerio de Ciencia e Innovaci\u00f3n","doi-asserted-by":"publisher","award":["PID2019-109247GB-I00"],"award-info":[{"award-number":["PID2019-109247GB-I00"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004837","name":"Ministerio de Ciencia e Innovaci\u00f3n","doi-asserted-by":"publisher","award":["TED2021-131310B-I00"],"award-info":[{"award-number":["TED2021-131310B-I00"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>We present the Fast Greedy Equivalence Search (FGES)-Merge, a new method for learning the structure of gene regulatory networks via merging locally learned Bayesian networks, based on the fast greedy equivalent search algorithm. The method is competitive with the state of the art in terms of the Matthews correlation coefficient, which takes into account both precision and recall, while also improving upon it in terms of speed, scaling up to tens of thousands of variables and being able to use empirical knowledge about the topological structure of gene regulatory networks. To showcase the ability of our method to scale to massive networks, we apply it to learning the gene regulatory network for the full human genome using data from samples of different brain structures (from the Allen Human Brain Atlas). Furthermore, this Bayesian network model should predict interactions between genes in a way that is clear to experts, following the current trends in explainable artificial intelligence. To achieve this, we also present a new open-access visualization tool that facilitates the exploration of massive networks and can aid in finding nodes of interest for experimental tests.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1011443","type":"journal-article","created":{"date-parts":[[2023,12,1]],"date-time":"2023-12-01T13:47:47Z","timestamp":1701438467000},"page":"e1011443","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":10,"title":["Learning massive interpretable gene regulatory networks of the human brain by merging Bayesian networks"],"prefix":"10.1371","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8780-6837","authenticated-orcid":true,"given":"Niko","family":"Bernaola","sequence":"first","affiliation":[]},{"given":"Mario","family":"Michiels","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0652-9872","authenticated-orcid":true,"given":"Pedro","family":"Larra\u00f1aga","sequence":"additional","affiliation":[]},{"given":"Concha","family":"Bielza","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2023,12,1]]},"reference":[{"key":"pcbi.1011443.ref001","first-page":"223","volume-title":"In Situ Hybridization Protocols","author":"GJ Nuovo","year":"1995"},{"key":"pcbi.1011443.ref002","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nprot.2007.514","article-title":"High-resolution in situ hybridization to whole-mount zebrafish embryos","volume":"3","author":"C Thisse","year":"2008","journal-title":"Nature Protocols"},{"key":"pcbi.1011443.ref003","doi-asserted-by":"crossref","first-page":"527","DOI":"10.2119\/2006-00107.Trevino","article-title":"DNA microarrays: A powerful genomic tool for biomedical and clinical research","volume":"13","author":"V Trevino","year":"2007","journal-title":"Molecular Medicine"},{"key":"pcbi.1011443.ref004","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1002\/0470094419.ch13","volume-title":"Data Analysis and Visualization in Genomics and Proteomics","author":"P Larra\u00f1aga","year":"2005"},{"key":"pcbi.1011443.ref005","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4939-8882-2","volume-title":"Gene Regulatory Networks: Methods and Protocols","author":"G Sanguinetti","year":"2019"},{"key":"pcbi.1011443.ref006","doi-asserted-by":"crossref","DOI":"10.1201\/9781420011432","volume-title":"An Introduction to Systems Biology: Design Principles of Biological Circuits","author":"U Alon","year":"2006"},{"key":"pcbi.1011443.ref007","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/j.artmed.2018.10.006","article-title":"Computational methods for gene regulatory networks reconstruction and analysis: A review","volume":"95","author":"FM Delgado","year":"2019","journal-title":"Artificial Intelligence in Medicine"},{"key":"pcbi.1011443.ref008","doi-asserted-by":"crossref","first-page":"1770","DOI":"10.3389\/fpls.2018.01770","article-title":"Statistical and machine learning approaches to predict gene regulatory networks from transcriptome datasets","volume":"9","author":"K Mochida","year":"2018","journal-title":"Frontiers in Plant Science"},{"key":"pcbi.1011443.ref009","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1007\/s41060-016-0032-z","article-title":"A million variables and more: The fast greedy equivalence search algorithm for learning high-dimensional graphical causal models, with an application to functional magnetic resonance images","volume":"3","author":"J Ramsey","year":"2017","journal-title":"International Journal of Data Science and Analytics"},{"key":"pcbi.1011443.ref010","volume-title":"Probabilistic Reasoning in Intelligent Systems","author":"J Pearl","year":"1988"},{"issue":"3\u20134","key":"pcbi.1011443.ref011","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1089\/106652700750050961","article-title":"Using Bayesian networks to analyze expression data","volume":"7","author":"N Friedman","year":"2000","journal-title":"Journal of Computational Biology"},{"key":"pcbi.1011443.ref012","volume-title":"Probabilistic Graphical Models: Principles and Techniques","author":"D Koller","year":"2009"},{"key":"pcbi.1011443.ref013","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1007\/978-1-4612-2404-4_12","volume-title":"Learning from Data: Artificial Intelligence and Statistics V. Lecture Notes in Statistics","author":"DM Chickering","year":"1996"},{"issue":"2\u20133","key":"pcbi.1011443.ref014","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1016\/0004-3702(90)90060-D","article-title":"The computational complexity of probabilistic inference using Bayesian belief networks","volume":"42","author":"GF Cooper","year":"1990","journal-title":"Artificial Intelligence"},{"key":"pcbi.1011443.ref015","first-page":"1","article-title":"A survey of Bayesian network structure learning","author":"NK Kitson","year":"2023","journal-title":"Artificial Intelligence Review"},{"issue":"2","key":"pcbi.1011443.ref016","doi-asserted-by":"crossref","first-page":"12","DOI":"10.3390\/bioengineering3020012","article-title":"Stable gene regulatory network modeling from steady-state data","volume":"3","author":"JE Larvie","year":"2016","journal-title":"Bioengineering"},{"key":"pcbi.1011443.ref017","first-page":"1","article-title":"Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks","volume":"11","author":"Y Li","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"7416","key":"pcbi.1011443.ref018","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1038\/nature11405","article-title":"An anatomically comprehensive atlas of the adult human brain transcriptome","volume":"489","author":"MJ Hawrylycz","year":"2012","journal-title":"Nature"},{"key":"pcbi.1011443.ref019","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1007\/978-1-4939-8882-2_15","volume-title":"Gene Regulatory Networks: Methods and Protocols","author":"A Angelin-Bonnet","year":"2019"},{"issue":"9","key":"pcbi.1011443.ref020","doi-asserted-by":"crossref","first-page":"2449","DOI":"10.1039\/C5MB00122F","article-title":"Improving gene regulatory network inference using network topology information","volume":"11","author":"A Nair","year":"2015","journal-title":"Molecular BioSystems"},{"key":"pcbi.1011443.ref021","author":"P Spirtes","year":"2000","journal-title":"Constructing Bayesian network models of gene expression networks from microarray data"},{"issue":"1","key":"pcbi.1011443.ref022","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1186\/1752-0509-3-85","article-title":"From gene expression to gene regulatory networks in Arabidopsis thaliana","volume":"3","author":"CJ Needham","year":"2009","journal-title":"BMC Systems Biology"},{"issue":"1","key":"pcbi.1011443.ref023","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-11-18","article-title":"Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests","volume":"11","author":"D Edwards","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"pcbi.1011443.ref024","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1282","article-title":"Reconstructing gene regulatory networks with Bayesian networks by combining expression data with multiple sources of prior knowledge","volume":"6","author":"AV Werhli","year":"2007","journal-title":"Statistical Applications in Genetics and Molecular Biology"},{"issue":"3","key":"pcbi.1011443.ref025","doi-asserted-by":"crossref","DOI":"10.1515\/sagmb-2018-0042","article-title":"Combining gene expression data and prior knowledge for inferring gene regulatory networks via Bayesian networks using structural restrictions","volume":"18","author":"LM de Campos","year":"2019","journal-title":"Statistical Applications in Genetics and Molecular Biology"},{"issue":"5643","key":"pcbi.1011443.ref026","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1126\/science.1087447","article-title":"A gene-coexpression network for global discovery of conserved genetic modules","volume":"302","author":"JM Stuart","year":"2003","journal-title":"Science"},{"key":"pcbi.1011443.ref027","doi-asserted-by":"crossref","first-page":"e1005024","DOI":"10.1371\/journal.pcbi.1005024","article-title":"Inference of gene regulatory networks based on local Bayesian networks","volume":"12","author":"F Liu","year":"2016","journal-title":"PLoS Computational Biology"},{"key":"pcbi.1011443.ref028","doi-asserted-by":"crossref","unstructured":"Tsamardinos I, Aliferis CF, Statnikov A. Time and sample efficient discovery of Markov blankets and direct causal relations. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003; 673\u2013678.","DOI":"10.1145\/956750.956838"},{"issue":"7","key":"pcbi.1011443.ref029","first-page":"171","article-title":"Local causal and Markov blanket induction for causal discovery and feature selection for classication. Part I: Algorithms and empirical evaluation","volume":"1","author":"CF Aliferis","year":"2010","journal-title":"Journal of Machine Learning Research"},{"issue":"1","key":"pcbi.1011443.ref030","first-page":"235","article-title":"Local causal and Markov blanket induction for causal discovery and feature selection for classification. Part II: Analysis and extensions","volume":"11","author":"CF Aliferis","year":"2010","journal-title":"Journal of Machine Learning Research"},{"key":"pcbi.1011443.ref031","doi-asserted-by":"crossref","first-page":"796","DOI":"10.1038\/nmeth.2016","article-title":"Wisdom of crowds for robust gene network inference","volume":"9","author":"D Marbach","year":"2012","journal-title":"Nature Methods"},{"key":"pcbi.1011443.ref032","first-page":"1","article-title":"Catnet: Categorical Bayesian network inference","author":"N Balov","year":"2012","journal-title":"R Package Version 1.13.4"},{"key":"pcbi.1011443.ref033","first-page":"127","article-title":"Speaker, environment and channel change detection and clustering via the Bayesian information criterion","volume":"8","author":"S Chen","year":"1998","journal-title":"In Proceedings of the Broadcast News Transcription and Understanding Workshop"},{"issue":"1","key":"pcbi.1011443.ref034","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1207\/s15327906mbr3301_3","article-title":"The TETRAD project: Constraint based aids to causal model specification","volume":"33","author":"R Scheines","year":"1998","journal-title":"Multivariate Behavioral Research"},{"key":"pcbi.1011443.ref035","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.neucom.2020.11.066","article-title":"BayeSuites: An open web framework for massive Bayesian networks focused on neuroscience","volume":"428","author":"M Michiels","year":"2021","journal-title":"Neurocomputing"},{"issue":"6","key":"pcbi.1011443.ref036","doi-asserted-by":"crossref","first-page":"e98679","DOI":"10.1371\/journal.pone.0098679","article-title":"ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software","volume":"9","author":"M Jacomy","year":"2014","journal-title":"PLoS ONE"},{"key":"pcbi.1011443.ref037","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1007\/978-3-319-11683-9_12","volume-title":"Artificial Evolution","author":"O Gach","year":"2014"},{"key":"pcbi.1011443.ref038","doi-asserted-by":"crossref","first-page":"bav028","DOI":"10.1093\/database\/bav028","article-title":"DisGeNET: A discovery platform for the dynamical exploration of human diseases and their genes","volume":"2015","author":"J Pinero","year":"2015","journal-title":"Database"},{"issue":"239","key":"pcbi.1011443.ref039","first-page":"2","article-title":"Docker: Lightweight Linux containers for consistent development and deployment","author":"D Merkel","year":"2014","journal-title":"Linux Journal"},{"key":"pcbi.1011443.ref040","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","article-title":"Comparison of the predicted and observed secondary structure of T4 phage lysozyme","volume":"405","author":"BW Matthews","year":"1975","journal-title":"Biochimica et Biophysica Acta (BBA)\u2014Protein Structure"},{"issue":"5","key":"pcbi.1011443.ref041","doi-asserted-by":"crossref","first-page":"e1000790","DOI":"10.1371\/journal.pcbi.1000790","article-title":"Analysis and computational dissection of molecular signature multiplicity","volume":"6","author":"A Statnikov","year":"2010","journal-title":"PLoS Computational Biology"},{"issue":"3","key":"pcbi.1011443.ref042","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v035.i03","article-title":"Learning Bayesian networks with the bnlearn R Package","volume":"35","author":"M Scutari","year":"2010","journal-title":"Journal of Statistical Software"},{"issue":"9","key":"pcbi.1011443.ref043","doi-asserted-by":"crossref","first-page":"e12776","DOI":"10.1371\/journal.pone.0012776","article-title":"Inferring regulatory networks from expression data using tree-based methods","volume":"5","author":"VA Huynh-Thu","year":"2010","journal-title":"PLoS ONE"},{"issue":"10","key":"pcbi.1011443.ref044","doi-asserted-by":"crossref","first-page":"e46935","DOI":"10.1371\/journal.pone.0046935","article-title":"Non-Gaussian distributions affect identification of expression patterns, functional annotation, and prospective classification in human cancer genomes","volume":"7","author":"NF Marko","year":"2012","journal-title":"PLoS ONE"},{"issue":"21","key":"pcbi.1011443.ref045","first-page":"1","article-title":"The shape of gene expression distributions matter: How incorporating distribution shape improves the interpretation of cancer transcriptomic data","volume":"21","author":"L de Torrent\u00e9","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1011443.ref046","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1016\/j.ins.2021.10.074","article-title":"Semiparametric Bayesian networks","volume":"584","author":"D Atienza","year":"2022","journal-title":"Information Sciences"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1011443","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2023,12,22]],"date-time":"2023-12-22T00:00:00Z","timestamp":1703203200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011443","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,22]],"date-time":"2023-12-22T13:47:01Z","timestamp":1703252821000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011443"}},"subtitle":[],"editor":[{"given":"Mingyao","family":"Li","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,12,1]]},"references-count":46,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2023,12,1]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1011443","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.02.05.935007","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,1]]}}}