{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T08:48:22Z","timestamp":1772268502917,"version":"3.50.1"},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Microarray data discretization is a basic preprocess for many algorithms of gene regulatory network inference. Some common discretization methods in informatics are used to discretize microarray data. Selection of the discretization method is often arbitrary and no systematic comparison of different discretization has been conducted, in the context of gene regulatory network inference from time series gene expression data.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In this study, we propose a new discretization method \"bikmeans\", and compare its performance with four other widely-used discretization methods using different datasets, modeling algorithms and number of intervals. Sensitivities, specificities and total accuracies were calculated and statistical analysis was carried out. Bikmeans method always gave high total accuracies.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>Our results indicate that proper discretization methods can consistently improve gene regulatory network inference independent of network modeling algorithms and datasets. Our new method, bikmeans, resulted in significant better total accuracies than other methods.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-11-520","type":"journal-article","created":{"date-parts":[[2010,10,19]],"date-time":"2010-10-19T06:17:01Z","timestamp":1287469021000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":42,"title":["Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks"],"prefix":"10.1186","volume":"11","author":[{"given":"Yong","family":"Li","sequence":"first","affiliation":[]},{"given":"Lili","family":"Liu","sequence":"additional","affiliation":[]},{"given":"Xi","family":"Bai","sequence":"additional","affiliation":[]},{"given":"Hua","family":"Cai","sequence":"additional","affiliation":[]},{"given":"Wei","family":"Ji","sequence":"additional","affiliation":[]},{"given":"Dianjing","family":"Guo","sequence":"additional","affiliation":[]},{"given":"Yanming","family":"Zhu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,10,19]]},"reference":[{"issue":"4","key":"4103_CR1","doi-asserted-by":"publisher","first-page":"523","DOI":"10.1016\/j.jtbi.2004.04.037","volume":"229","author":"R Laubenbacher","year":"2004","unstructured":"Laubenbacher R, Stigler B: A computational algebra approach to the reverse engineering of gene regulatory networks. J Theor Biol 2004, 229(4):523\u2013537. 10.1016\/j.jtbi.2004.04.037","journal-title":"J Theor Biol"},{"key":"4103_CR2","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1002\/cplx.6130010612","volume":"1","author":"R Somogyi","year":"1996","unstructured":"Somogyi R, Sniegoski C: Modeling the complexity of genetic networks: Understanding multigenic and pleiotropic regulation. Complexity 1996, 1: 45\u201363.","journal-title":"Complexity"},{"key":"4103_CR3","first-page":"17","volume-title":"Pac Symp Biocomput","author":"T Akutsu","year":"1999","unstructured":"Akutsu T, Miyano S, Kuhara S: Identification of genetic networks from a small number of gene expression patterns under the Boolean network model. Pac Symp Biocomput 1999, 17\u201328."},{"issue":"11","key":"4103_CR4","doi-asserted-by":"publisher","first-page":"R92","DOI":"10.1186\/gb-2004-5-11-r92","volume":"5","author":"A Wille","year":"2004","unstructured":"Wille A, Zimmermann P, Vranova E, Furholz A, Laule O, Bleuler S, Hennig L, Prelic A, von Rohr P, Thiele L, et al.: Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana. Genome Biol 2004, 5(11):R92. 10.1186\/gb-2004-5-11-r92","journal-title":"Genome Biol"},{"issue":"3-4","key":"4103_CR5","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1089\/106652700750050961","volume":"7","author":"N Friedman","year":"2000","unstructured":"Friedman N, Linial M, Nachman I, Pe'er D: Using Bayesian networks to analyze expression data. J Comput Biol 2000, 7(3\u20134):601\u2013620. 10.1089\/106652700750050961","journal-title":"J Comput Biol"},{"key":"4103_CR6","first-page":"437","volume-title":"Pac Symp Biocomput","author":"AJ Hartemink","year":"2002","unstructured":"Hartemink AJ, Gifford DK, Jaakkola TS, Young RA: Combining location and expression data for principled discovery of genetic regulatory network models. Pac Symp Biocomput 2002, 437\u2013449."},{"key":"4103_CR7","volume-title":"Computer Science Division, University of California, Berkeley, CA","author":"K Murphy","year":"1999","unstructured":"Murphy K, Mian S: Modeling gene expression data using dynamic Bayesian networks. Technical report. Computer Science Division, University of California, Berkeley, CA 1999. [http:\/\/www.cs.ubc.ca\/~murphyk\/Papers\/ismb99.pdf]"},{"issue":"1","key":"4103_CR8","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1109\/TCBB.2008.34","volume":"7","author":"SC Madeira","year":"2010","unstructured":"Madeira SC, Teixeira MC, Sa-Correia I, Oliveira AL: Identification of Regulatory Modules in Time Series Gene Expression Data Using a Linear Time Biclustering Algorithm. IEEE\/ACM Trans Comput Biol Bioinformatics 2010, 7(1):153\u2013165. 10.1109\/TCBB.2008.34","journal-title":"IEEE\/ACM Trans Comput Biol Bioinformatics"},{"key":"4103_CR9","first-page":"24","volume-title":"4th ACM SIGKDD Workshop on Data Mining in Bioinformatics","author":"R Pensa","year":"2004","unstructured":"Pensa R, Leschi C, Besson J, Boulicaut JF: Assessment of discretization techniques for relevant pattern discovery from gene expression data. 4th ACM SIGKDD Workshop on Data Mining in Bioinformatics 2004, 24\u201330. BIOKDD'04, ACM BIOKDD'04, ACM"},{"key":"4103_CR10","first-page":"164","volume-title":"Proceedings of the European working session on learning on Machine learning","author":"J Catlett","year":"1991","unstructured":"Catlett J: On changing continuous attributes into ordered discrete attributes. In Proceedings of the European working session on learning on Machine learning. Porto, Portugal: Springer-Verlag New York, Inc; 1991:164\u2013178."},{"key":"4103_CR11","first-page":"194","volume-title":"Proceedings of the Twelfth International Conference on Machine Learning: 1995; Tahoe City, California, USA","author":"J Dougherty","year":"1995","unstructured":"Dougherty J, Kohavi R, Sahami M: Supervised and Unsupervised Discretization of Continuous Features. Proceedings of the Twelfth International Conference on Machine Learning: 1995; Tahoe City, California, USA 1995, 194\u2013202."},{"key":"4103_CR12","volume-title":"Proceedings Tenth National Conference on Artificial Intelligence","author":"K Randy","year":"1992","unstructured":"Randy K: Chimerge: discretization of numeric attributes. In Proceedings Tenth National Conference on Artificial Intelligence. Publ by AAAI, Menlo Park, CA, United States; 1992."},{"key":"4103_CR13","first-page":"281","volume-title":"Proc of the fifth Berkeley Symposium on Mathematical Statistics and Probability","author":"JB MacQueen","year":"1967","unstructured":"MacQueen JB: Some Methods for Classification and Analysis of MultiVariate Observations. In Proc of the fifth Berkeley Symposium on Mathematical Statistics and Probability. Volume 1. Edited by: Cam LML, Neyman J. University of California Press; 1967:281\u2013297."},{"issue":"5","key":"4103_CR14","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1016\/j.ygeno.2009.08.009","volume":"94","author":"Y Li","year":"2009","unstructured":"Li Y, Zhu Y, Bai X, Cai H, Ji W, Guo D: ReTRN: A retriever of real transcriptional regulatory network and expression data for evaluating structure learning algorithm. Genomics 2009, 94(5):349\u2013354. 10.1016\/j.ygeno.2009.08.009","journal-title":"Genomics"},{"issue":"16","key":"4103_CR15","doi-asserted-by":"publisher","first-page":"2266","DOI":"10.1101\/gad.1450606","volume":"20","author":"T Pramila","year":"2006","unstructured":"Pramila T, Wu W, Miles S, Noble WS, Breeden LL: The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle. Genes Dev 2006, 20(16):2266\u20132278. 10.1101\/gad.1450606","journal-title":"Genes Dev"},{"key":"4103_CR16","doi-asserted-by":"crossref","unstructured":"Monteiro PT, Mendes ND, Teixeira MC, d'Orey S, Tenreiro S, Mira NP, Pais H, Francisco AP, Carvalho AM, Lourenco AB, et al.: YEASTRACT-DISCOVERER: new tools to improve the analysis of transcriptional regulatory associations in Saccharomyces cerevisiae. Nucleic Acids Res 2008, (36 Database):D132\u2013136.","DOI":"10.1093\/nar\/gkm976"},{"key":"4103_CR17","doi-asserted-by":"crossref","unstructured":"Teixeira MC, Monteiro P, Jain P, Tenreiro S, Fernandes AR, Mira NP, Alenquer M, Freitas AT, Oliveira AL, Sa-Correia I: The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae. Nucleic Acids Res 2006, (34 Database):D446\u2013451. 10.1093\/nar\/gkj013","DOI":"10.1093\/nar\/gkj013"},{"issue":"4","key":"4103_CR18","first-page":"309","volume":"9","author":"GF Cooper","year":"1992","unstructured":"Cooper GF, Herskovits E: A Bayesian method for the induction of probabilistic networks from data. Machine Learning 1992, 9(4):309\u2013347.","journal-title":"Machine Learning"},{"issue":"Suppl 1","key":"4103_CR19","doi-asserted-by":"publisher","first-page":"S7","DOI":"10.1186\/1471-2105-7-S1-S7","volume":"7","author":"AA Margolin","year":"2006","unstructured":"Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Dalla Favera R, Califano A: ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics 2006, 7(Suppl 1):S7. 10.1186\/1471-2105-7-S1-S7","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"4103_CR20","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1093\/bioinformatics\/bth463","volume":"21","author":"M Zou","year":"2005","unstructured":"Zou M, Conzen SD: A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data. Bioinformatics 2005, 21(1):71\u201379. 10.1093\/bioinformatics\/bth463","journal-title":"Bioinformatics"},{"issue":"2","key":"4103_CR21","doi-asserted-by":"publisher","first-page":"145","DOI":"10.1109\/TKDE.2004.1269594","volume":"16","author":"LA Kurgan","year":"2004","unstructured":"Kurgan LA, Cios KJ: CAIM Discretization Algorithm. IEEE Trans on Knowl and Data Eng 2004, 16(2):145\u2013153. 10.1109\/TKDE.2004.1269594","journal-title":"IEEE Trans on Knowl and Data Eng"},{"issue":"4","key":"4103_CR22","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1109\/TCBB.2007.1057","volume":"4","author":"R Xu","year":"2007","unstructured":"Xu R, Wunsch Ii D, Frank R: Inference of genetic regulatory networks with recurrent neural network models using particle swarm optimization. IEEE\/ACM Trans Comput Biol Bioinform 2007, 4(4):681\u2013692. 10.1109\/TCBB.2007.1057","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-520.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T05:28:54Z","timestamp":1630474134000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-520"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,10,19]]},"references-count":22,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["4103"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-520","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,10,19]]},"assertion":[{"value":"27 June 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 October 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 October 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"520"}}