{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T06:31:16Z","timestamp":1768458676417,"version":"3.49.0"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,12,9]],"date-time":"2021-12-09T00:00:00Z","timestamp":1639008000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation under","doi-asserted-by":"crossref","award":["IIS-1841351"],"award-info":[{"award-number":["IIS-1841351"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,2,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Reconstruction of genome-scale networks from gene expression data is an actively studied problem. A wide range of methods that differ between the types of interactions they uncover with varying trade-offs between sensitivity and specificity have been proposed. To leverage benefits of multiple such methods, ensemble network methods that combine predictions from resulting networks have been developed, promising results better than or as good as the individual networks. Perhaps owing to the difficulty in obtaining accurate training examples, these ensemble methods hitherto are unsupervised.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article, we introduce EnGRaiN, the first supervised ensemble learning method to construct gene networks. 
The supervision for training is provided by small training datasets of true edge connections (positives) and edges known to be absent (negatives) among gene pairs. We demonstrate the effectiveness of EnGRaiN using simulated datasets as well as a curated collection of Arabidopsis thaliana datasets we created from microarray datasets available from public repositories. EnGRaiN shows better results not only in terms of receiver operating characteristic and PR characteristics for both real and simulated datasets compared with unsupervised methods for ensemble network construction, but also generates networks that can be mined for elucidating complex biological interactions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>EnGRaiN software and the datasets used in the study are publicly available at the github repository: https:\/\/github.com\/AluruLab\/EnGRaiN.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab829","type":"journal-article","created":{"date-parts":[[2021,12,3]],"date-time":"2021-12-03T20:12:43Z","timestamp":1638562363000},"page":"1312-1319","source":"Crossref","is-referenced-by-count":19,"title":["<i>EnGRaiN<\/i>: a supervised ensemble learning method for recovery of large-scale gene regulatory networks"],"prefix":"10.1093","volume":"38","author":[{"given":"Maneesha","family":"Aluru","sequence":"first","affiliation":[{"name":"Department of Biology, Georgia Institute of Technology , Atlanta, GA 30308, USA"}]},{"given":"Harsh","family":"Shrivastava","sequence":"additional","affiliation":[{"name":"Microsoft , Redmond, WA 98052, 
USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1358-7691","authenticated-orcid":false,"given":"Sriram P","family":"Chockalingam","sequence":"additional","affiliation":[{"name":"Institute for Data Engineering and Science, Georgia Institute of Technology , Atlanta, GA 30308, USA"}]},{"given":"Shruti","family":"Shivakumar","sequence":"additional","affiliation":[{"name":"Department of Computational Science and Engineering, Georgia Institute of Technology , Atlanta, GA 30308, USA"}]},{"given":"Srinivas","family":"Aluru","sequence":"additional","affiliation":[{"name":"Institute for Data Engineering and Science, Georgia Institute of Technology , Atlanta, GA 30308, USA"},{"name":"Department of Computational Science and Engineering, Georgia Institute of Technology , Atlanta, GA 30308, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,12,9]]},"reference":[{"key":"2023020108551616500_btab829-B1","doi-asserted-by":"crossref","first-page":"1083","DOI":"10.1038\/nmeth.4463","article-title":"SCENIC: single-cell regulatory network inference and clustering","volume":"14","author":"Aibar","year":"2017","journal-title":"Nat. 
Methods"},{"key":"2023020108551616500_btab829-B2","doi-asserted-by":"crossref","first-page":"1600","DOI":"10.1093\/bioinformatics\/btl140","article-title":"Improved scoring of functional groups from gene expression data by decorrelating GO graph structure","volume":"22","author":"Alexa","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020108551616500_btab829-B3","doi-asserted-by":"crossref","first-page":"e24","DOI":"10.1093\/nar\/gks904","article-title":"Reverse engineering and analysis of large genome-scale gene networks","volume":"41","author":"Aluru","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023020108551616500_btab829-B4","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1016\/j.jmb.2006.04.029","article-title":"Comprehensive analysis of combinatorial regulation using the transcriptional regulatory network of yeast","volume":"360","author":"Balaji","year":"2006","journal-title":"J. Mol. Biol"},{"key":"2023020108551616500_btab829-B5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-015-0728-4","article-title":"NetBenchmark: a bioconductor package for reproducible benchmarks of gene regulatory network inference","volume":"16","author":"Bellot","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2023020108551616500_btab829-B6","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1007\/978-1-4939-8882-2_12","volume-title":"Gene Regulatory Networks","author":"Bellot","year":"2019"},{"key":"2023020108551616500_btab829-B7","author":"Bhattacharya","year":"2017"},{"key":"2023020108551616500_btab829-B8","doi-asserted-by":"crossref","first-page":"R36","DOI":"10.1186\/gb-2006-7-5-r36","article-title":"The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo","volume":"7","author":"Bonneau","year":"2006","journal-title":"Genome 
Biol"},{"key":"2023020108551616500_btab829-B9","doi-asserted-by":"crossref","first-page":"e9514","DOI":"10.1371\/journal.pone.0009514","article-title":"Transgenerational adaptation of Arabidopsis to stress requires DNA methylation and the function of Dicer-like proteins","volume":"5","author":"Boyko","year":"2010","journal-title":"PLoS One"},{"key":"2023020108551616500_btab829-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-09522-1","article-title":"Network Walking charts transcriptional dynamics of nitrogen signaling by integrating validated and predicted genome-wide interactions","volume":"10","author":"Brooks","year":"2019","journal-title":"Nat. Commun"},{"key":"2023020108551616500_btab829-B11","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1613\/jair.953","article-title":"SMOTE: synthetic minority over-sampling technique","volume":"16","author":"Chawla","year":"2002","journal-title":"J. Artif. Intell. Res"},{"key":"2023020108551616500_btab829-B12","first-page":"7","author":"Cheng","year":"2016"},{"key":"2023020108551616500_btab829-B13","doi-asserted-by":"crossref","first-page":"23","DOI":"10.3390\/microarrays5030023","article-title":"Microarray data processing techniques for genome-scale network inference from large public repositories","volume":"5","author":"Chockalingam","year":"2016","journal-title":"Microarrays"},{"key":"2023020108551616500_btab829-B14","doi-asserted-by":"crossref","first-page":"e8","DOI":"10.1371\/journal.pbio.0050008","article-title":"Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles","volume":"5","author":"Faith","year":"2007","journal-title":"PLoS Biol"},{"key":"2023020108551616500_btab829-B15","doi-asserted-by":"crossref","first-page":"554","DOI":"10.1038\/nbt0505-554","article-title":"Reverse engineering gene regulatory networks","volume":"23","author":"Hartemink","year":"2005","journal-title":"Nat. 
Biotechnol"},{"key":"2023020108551616500_btab829-B16","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1186\/1752-0509-6-145","article-title":"TIGRESS: trustful inference of gene regulation using stability selection","volume":"6","author":"Haury","year":"2012","journal-title":"BMC Syst. Biol"},{"key":"2023020108551616500_btab829-B17","doi-asserted-by":"crossref","first-page":"2377","DOI":"10.1093\/nar\/gkr902","article-title":"Gene network inference and visualization tools for biologists: application to new human transcriptome datasets","volume":"40","author":"Hurley","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020108551616500_btab829-B18","doi-asserted-by":"crossref","first-page":"e12776","DOI":"10.1371\/journal.pone.0012776","article-title":"Inferring regulatory networks from expression data using tree-based methods","volume":"5","author":"Huynh-Thu","year":"2010","journal-title":"PLoS One"},{"key":"2023020108551616500_btab829-B19","doi-asserted-by":"crossref","first-page":"1767","DOI":"10.1093\/molbev\/msv058","article-title":"An Arabidopsis Transcriptional Regulatory Map reveals distinct functional and evolutionary features of novel transcription factors","volume":"32","author":"Jin","year":"2015","journal-title":"Mol. Biol. 
Evol"},{"key":"2023020108551616500_btab829-B20","doi-asserted-by":"crossref","first-page":"D1003","DOI":"10.1093\/nar\/gku1200","article-title":"Araport: the Arabidopsis information portal","volume":"43","author":"Krishnakumar","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020108551616500_btab829-B21","doi-asserted-by":"crossref","first-page":"2233","DOI":"10.1093\/bioinformatics\/btw216","article-title":"ARACNe-AP: gene network reverse engineering through adaptive partitioning inference of mutual information","volume":"32","author":"Lachmann","year":"2016","journal-title":"Bioinformatics"},{"key":"2023020108551616500_btab829-B22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-9-559","article-title":"WGCNA: an R package for weighted correlation network analysis","volume":"9","author":"Langfelder","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020108551616500_btab829-B23","doi-asserted-by":"crossref","first-page":"796","DOI":"10.1038\/nmeth.2016","article-title":"Wisdom of crowds for robust gene network inference","volume":"9","author":"Marbach","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020108551616500_btab829-B24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-9-461","article-title":"minet: A R\/Bioconductor package for inferring large transcriptional networks using mutual information","volume":"9","author":"Meyer","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020108551616500_btab829-B25","doi-asserted-by":"crossref","first-page":"1095","DOI":"10.4161\/psb.21218","article-title":"Effect of salt stress on genes encoding translation-associated proteins in Arabidopsis thaliana","volume":"7","author":"Omidbakhshfard","year":"2012","journal-title":"Plant Signal. 
Behav"},{"key":"2023020108551616500_btab829-B26","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1186\/1752-0509-1-37","article-title":"From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data","volume":"1","author":"Opgen-Rhein","year":"2007","journal-title":"BMC Syst. Biol"},{"key":"2023020108551616500_btab829-B27","doi-asserted-by":"crossref","first-page":"i197","DOI":"10.1093\/bioinformatics\/btv268","article-title":"Integrative random forest for gene regulatory network inference","volume":"31","author":"Petralia","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020108551616500_btab829-B28","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1038\/s41592-019-0690-6","article-title":"Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data","volume":"17","author":"Pratapa","year":"2020","journal-title":"Nat. Methods"},{"key":"2023020108551616500_btab829-B29","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1208","article-title":"Using complexity for the estimation of Bayesian networks","volume":"5","author":"Salzman","year":"2006","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023020108551616500_btab829-B30","first-page":"707","author":"Shrivastava","year":"2015"},{"key":"2023020108551616500_btab829-B31","doi-asserted-by":"crossref","first-page":"3640","DOI":"10.1105\/tpc.113.113803","article-title":"Alternative splicing at the intersection of biological timing, development, and stress responses","volume":"25","author":"Staiger","year":"2013","journal-title":"Plant Cell"},{"key":"2023020108551616500_btab829-B32","doi-asserted-by":"crossref","first-page":"e1004755","DOI":"10.1371\/journal.pcbi.1004755","article-title":"FastGGM: an efficient algorithm for the inference of Gaussian graphical model in biological networks","volume":"12","author":"Wang","year":"2016","journal-title":"PLoS Comput. 
Biol"},{"key":"2023020108551616500_btab829-B33","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1038\/nrg2348","article-title":"Coordination of gene expression between organellar and nuclear genomes","volume":"9","author":"Woodson","year":"2008","journal-title":"Nat. Rev. Genet"},{"key":"2023020108551616500_btab829-B34","doi-asserted-by":"crossref","first-page":"1952","DOI":"10.1105\/tpc.16.00808","article-title":"Mutations in eIF5B confer thermosensitive and pleiotropic phenotypes via translation defects in Arabidopsis thaliana","volume":"29","author":"Zhang","year":"2017","journal-title":"Plant Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab829\/41764729\/btab829.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/5\/1312\/49008923\/btab829.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/5\/1312\/49008923\/btab829.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T20:21:42Z","timestamp":1675282902000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/5\/1312\/6458321"}},"subtitle":[],"editor":[{"given":"Pier 
Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,12,9]]},"references-count":34,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,2,7]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab829","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,3,1]]},"published":{"date-parts":[[2021,12,9]]}}}