{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T23:29:24Z","timestamp":1773271764360,"version":"3.50.1"},"reference-count":24,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. 
Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-449","type":"journal-article","created":{"date-parts":[[2006,10,12]],"date-time":"2006-10-12T21:40:10Z","timestamp":1160689210000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["Integrative missing value estimation for microarray data"],"prefix":"10.1186","volume":"7","author":[{"given":"Jianjun","family":"Hu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haifeng","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael S","family":"Waterman","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xianghong Jasmine","family":"Zhou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,10,12]]},"reference":[{"key":"1188_CR1","doi-asserted-by":"publisher","first-page":"200","DOI":"10.1038\/nrg1809","volume":"7","author":"JD 
Hoheisel","year":"2006","unstructured":"Hoheisel JD: Microarray technology: beyond transcript profiling and genotype analysis. Nat Rev Genet 2006, 7: 200\u2013210. 10.1038\/nrg1809","journal-title":"Nat Rev Genet"},{"key":"1188_CR2","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1186\/1471-2105-5-114","volume":"5","author":"AG de Brevern","year":"2004","unstructured":"de Brevern AG, Hazout S, Malpertuy A: Influence of microarrays experiments missing values on the stability of gene groups by hierarchical clustering. BMC Bioinformatics 2004, 5: 114. 10.1186\/1471-2105-5-114","journal-title":"BMC Bioinformatics"},{"key":"1188_CR3","doi-asserted-by":"publisher","first-page":"e34","DOI":"10.1093\/nar\/gnh026","volume":"32","author":"TH Bo","year":"2004","unstructured":"Bo TH, Dysvik B, Jonassen I: LSimpute: accurate estimation of missing values in microarray data with least squares methods. Nucleic Acids Res 2004, 32: e34. 10.1093\/nar\/gnh026","journal-title":"Nucleic Acids Res"},{"key":"1188_CR4","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1093\/bioinformatics\/bth499","volume":"21","author":"H Kim","year":"2005","unstructured":"Kim H, Golub GH, Park H: Missing value estimation for DNA microarray gene expression data: local least squares imputation. Bioinformatics 2005, 21: 187\u2013198. 10.1093\/bioinformatics\/bth499","journal-title":"Bioinformatics"},{"key":"1188_CR5","first-page":"3887","volume-title":"Bioinformatics","author":"M Scholz","year":"2005","unstructured":"Scholz M, Kaplan F, Guy CL, Kopka J, Selbig J: Non-linear PCA: a missing data approach. Bioinformatics 2005, 21: 3887\u20133895. 
10.1093\/bioinformatics\/bti634"},{"key":"1188_CR6","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","volume":"17","author":"O Troyanskaya","year":"2001","unstructured":"Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, Botstein D, Altman RB: Missing value estimation methods for DNA microarrays. Bioinformatics 2001, 17: 520\u2013525. 10.1093\/bioinformatics\/17.6.520","journal-title":"Bioinformatics"},{"key":"1188_CR7","doi-asserted-by":"publisher","first-page":"2302","DOI":"10.1093\/bioinformatics\/btg323","volume":"19","author":"X Zhou","year":"2003","unstructured":"Zhou X, Wang X, Dougherty ER: Missing-value estimation using linear and non-linear regression with Bayesian gene selection. Bioinformatics 2003, 19: 2302\u20132307. 10.1093\/bioinformatics\/btg323","journal-title":"Bioinformatics"},{"key":"1188_CR8","doi-asserted-by":"publisher","first-page":"2088","DOI":"10.1093\/bioinformatics\/btg287","volume":"19","author":"S Oba","year":"2003","unstructured":"Oba S, Sato MA, Takemasa I, Monden M, Matsubara K, Ishii S: A Bayesian missing value estimation method for gene expression profile data. Bioinformatics 2003, 19: 2088\u20132096. 10.1093\/bioinformatics\/btg287","journal-title":"Bioinformatics"},{"key":"1188_CR9","doi-asserted-by":"publisher","first-page":"2417","DOI":"10.1093\/bioinformatics\/bti345","volume":"21","author":"MS Sehgal","year":"2005","unstructured":"Sehgal MS, Gondal I, Dooley LS: Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data. Bioinformatics 2005, 21: 2417\u20132423. 
10.1093\/bioinformatics\/bti345","journal-title":"Bioinformatics"},{"key":"1188_CR10","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1186\/1471-2105-7-32","volume":"7","author":"X Wang","year":"2006","unstructured":"Wang X, Li A, Jiang Z, Feng H: Missing value estimation for DNA microarray gene expression data by Support Vector Regression imputation and orthogonal coding scheme. BMC Bioinformatics 2006, 7: 32. 10.1186\/1471-2105-7-32","journal-title":"BMC Bioinformatics"},{"key":"1188_CR11","doi-asserted-by":"publisher","first-page":"917","DOI":"10.1093\/bioinformatics\/bth007","volume":"20","author":"M Ouyang","year":"2004","unstructured":"Ouyang M, Welsh WJ, Georgopoulos P: Gaussian mixture clustering and imputation of microarray data. Bioinformatics 2004, 20: 917\u2013923. 10.1093\/bioinformatics\/bth007","journal-title":"Bioinformatics"},{"key":"1188_CR12","doi-asserted-by":"publisher","first-page":"4155","DOI":"10.1093\/bioinformatics\/bti638","volume":"21","author":"R Jornsten","year":"2005","unstructured":"Jornsten R, Wang HY, Welsh WJ, Ouyang M: DNA microarray data imputation and significance analysis of differential expression. Bioinformatics 2005, 21: 4155\u20134161. 10.1093\/bioinformatics\/bti638","journal-title":"Bioinformatics"},{"key":"1188_CR13","doi-asserted-by":"publisher","first-page":"566","DOI":"10.1093\/bioinformatics\/btk019","volume":"22","author":"J Tuikkala","year":"2006","unstructured":"Tuikkala J, Elo L, Nevalainen OS, Aittokallio T: Improving missing value estimation in microarray data with gene ontology. Bioinformatics 2006, 22: 566\u2013572. 10.1093\/bioinformatics\/btk019","journal-title":"Bioinformatics"},{"key":"1188_CR14","unstructured":"Princeton SGD Lite yeast datasets. 2005. 
[http:\/\/sgdlite.princeton.edu\/download\/yeast_datasets\/]"},{"key":"1188_CR15","doi-asserted-by":"publisher","first-page":"680","DOI":"10.1126\/science.278.5338.680","volume":"278","author":"JL DeRisi","year":"1997","unstructured":"DeRisi JL, Iyer VR, Brown PO: Exploring the metabolic and genetic control of gene expression on a genomic scale. Science 1997, 278: 680\u2013686. 10.1126\/science.278.5338.680","journal-title":"Science"},{"key":"1188_CR16","doi-asserted-by":"publisher","first-page":"4309","DOI":"10.1091\/mbc.11.12.4309","volume":"11","author":"N Ogawa","year":"2000","unstructured":"Ogawa N, DeRisi J, Brown PO: New components of a system for phosphate accumulation and polyphosphate metabolism in Saccharomyces cerevisiae revealed by genomic expression analysis. Mol Biol Cell 2000, 11: 4309\u20134321.","journal-title":"Mol Biol Cell"},{"key":"1188_CR17","doi-asserted-by":"publisher","first-page":"9721","DOI":"10.1073\/pnas.96.17.9721","volume":"96","author":"TL Ferea","year":"1999","unstructured":"Ferea TL, Botstein D, Brown PO, Rosenzweig RF: Systematic changes in gene expression patterns following adaptive evolution in yeast. Proc Natl Acad Sci U S A 1999, 96: 9721\u20139726. 10.1073\/pnas.96.17.9721","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1188_CR18","doi-asserted-by":"publisher","first-page":"3273","DOI":"10.1091\/mbc.9.12.3273","volume":"9","author":"PT Spellman","year":"1998","unstructured":"Spellman PT, Sherlock G, Zhang MQ, Iyer VR, Anders K, Eisen MB, Brown PO, Botstein D, Futcher B: Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. 
Mol Biol Cell 1998, 9: 3273\u20133297.","journal-title":"Mol Biol Cell"},{"key":"1188_CR19","doi-asserted-by":"publisher","first-page":"238","DOI":"10.1038\/nbt1058","volume":"23","author":"XJ Zhou","year":"2005","unstructured":"Zhou XJ, Kao MC, Huang H, Wong A, Nunez-Iglesias J, Primig M, Aparicio OM, Finch CE, Morgan TE, Wong WH: Functional annotation and network reconstruction through cross-platform integration of microarray data. Nat Biotechnol 2005, 23: 238\u2013243. 10.1038\/nbt1058","journal-title":"Nat Biotechnol"},{"key":"1188_CR20","first-page":"4427","volume":"62","author":"DR Rhodes","year":"2002","unstructured":"Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM: Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Res 2002, 62: 4427\u20134433.","journal-title":"Cancer Res"},{"key":"1188_CR21","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1093\/bioinformatics\/btg1010","volume":"19","author":"JK Choi","year":"2003","unstructured":"Choi JK, Yu U, Kim S, Yoo OJ: Combining multiple microarray studies and modeling interstudy variation. Bioinformatics 2003, 19: 184\u2013190. 10.1093\/bioinformatics\/btg1010","journal-title":"Bioinformatics"},{"key":"1188_CR22","doi-asserted-by":"publisher","first-page":"579","DOI":"10.1038\/ng1578","volume":"37","author":"DR Rhodes","year":"2005","unstructured":"Rhodes DR, Kalyana-Sundaram S, Mahavisno V, Barrette TR, Ghosh D, Chinnaiyan AM: Mining for regulatory programs in the cancer transcriptome. Nat Genet 2005, 37: 579\u2013583. 10.1038\/ng1578","journal-title":"Nat Genet"},{"key":"1188_CR23","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1016\/S0092-8674(03)00570-1","volume":"114","author":"J Lamb","year":"2003","unstructured":"Lamb J, Ramaswamy S, Ford HL, Contreras B, Martinez RV, Kittrell FS, Zahnow CA, Patterson N, Golub TR, Ewen ME: A mechanism of cyclin D1 action encoded in the patterns of gene expression in human cancer. Cell 2003, 114: 323\u2013334. 
10.1016\/S0092-8674(03)00570-1","journal-title":"Cell"},{"key":"1188_CR24","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1126\/science.1087447","volume":"302","author":"JM Stuart","year":"2003","unstructured":"Stuart JM, Segal E, Koller D, Kim SK: A gene-coexpression network for global discovery of conserved genetic modules. Science 2003, 302: 249\u2013255. 10.1126\/science.1087447","journal-title":"Science"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-449.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T03:19:02Z","timestamp":1630466342000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-449"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,10,12]]},"references-count":24,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1188"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-449","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,10,12]]},"assertion":[{"value":"16 August 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 October 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 October 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"449"}}