{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T20:04:25Z","timestamp":1780689865361,"version":"3.54.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2014,11,5]],"date-time":"2014-11-05T00:00:00Z","timestamp":1415145600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2014,12]]},"DOI":"10.1186\/s12859-014-0346-6","type":"journal-article","created":{"date-parts":[[2014,11,4]],"date-time":"2014-11-04T14:01:47Z","timestamp":1415109707000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":124,"title":["Missing value imputation in high-dimensional phenomic data: imputable or not, and how?"],"prefix":"10.1186","volume":"15","author":[{"given":"Serena G","family":"Liao","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yan","family":"Lin","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dongwan D","family":"Kang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Divay","family":"Chandra","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jessica","family":"Bon","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Naftali","family":"Kaminski","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Frank C","family":"Sciurba","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"George C","family":"Tseng","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2014,11,5]]},"reference":[{"issue":"9","key":"346_CR1","doi-asserted-by":"publisher","first-page":"1205","DOI":"10.1093\/bioinformatics\/btq126","volume":"26","author":"JC Denny","year":"2010","unstructured":"Denny JC, Ritchie MD, Basford MA, Pulley JM, Bastarache L, Brown-Gentry K, Wang D, Masys DR, Roden DM, Crawford DC: PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics. 2010, 26 (9): 1205-1210. 10.1093\/bioinformatics\/btq126.","journal-title":"Bioinformatics"},{"issue":"2","key":"346_CR2","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1136\/amiajnl-2012-001117","volume":"20","author":"DA Hanauer","year":"2013","unstructured":"Hanauer DA, Ramakrishnan N: Modeling temporal relationships in large scale clinical associations. J Am Med Inform Assoc. 2013, 20 (2): 332-341. 10.1136\/amiajnl-2012-001117.","journal-title":"J Am Med Inform Assoc"},{"issue":"e2","key":"346_CR3","doi-asserted-by":"publisher","first-page":"e297","DOI":"10.1136\/amiajnl-2013-001933","volume":"20","author":"S Lyalina","year":"2013","unstructured":"Lyalina S, Percha B, Lependu P, Iyer SV, Altman RB, Shah NH: Identifying phenotypic signatures of neuropsychiatric disorders from electronic medical records. J Am Med Inform Assoc. 2013, 20 (e2): e297-e305. 10.1136\/amiajnl-2013-001933.","journal-title":"J Am Med Inform Assoc"},{"issue":"13","key":"346_CR4","doi-asserted-by":"publisher","first-page":"1377","DOI":"10.1161\/CIRCULATIONAHA.112.000604","volume":"127","author":"MD Ritchie","year":"2013","unstructured":"Ritchie MD, Denny JC, Zuvich RL, Crawford DC, Schildcrout JS, Bastarache L, Ramirez AH, Mosley JD, Pulley JM, Basford MA, Bradford Y, Rasmussen LV, Pathak J, Chute CG, Kullo IJ, McCarty CA, Chisholm RL, Kho AN, Carlson CS, Larson EB, Jarvik GP, Sotoodehnia N, Manolio TA, Li R, Masys DR, Haines JL, Roden DM: Genome- and phenome-wide analyses of cardiac conduction identifies markers of arrhythmia risk. Circulation. 2013, 127 (13): 1377-1385. 10.1161\/CIRCULATIONAHA.112.000604.","journal-title":"Circulation"},{"issue":"4","key":"346_CR5","doi-asserted-by":"publisher","first-page":"696","DOI":"10.1136\/amiajnl-2012-001355","volume":"20","author":"JL Warner","year":"2013","unstructured":"Warner JL, Alterovitz G, Bodio K, Joyce RM: External phenome analysis enables a rational federated query strategy to detect changing rates of treatment-related complications associated with multiple myeloma. J Am Med Inform Assoc. 2013, 20 (4): 696-699. 10.1136\/amiajnl-2012-001355.","journal-title":"J Am Med Inform Assoc"},{"issue":"13","key":"346_CR6","doi-asserted-by":"publisher","first-page":"1741","DOI":"10.1093\/bioinformatics\/btr295","volume":"27","author":"GH Fernald","year":"2011","unstructured":"Fernald GH, Capriotti E, Daneshjou R, Karczewski KJ, Altman RB: Bioinformatics challenges for personalized medicine. Bioinformatics. 2011, 27 (13): 1741-1748. 10.1093\/bioinformatics\/btr295.","journal-title":"Bioinformatics"},{"issue":"6","key":"346_CR7","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/nm0605-583a","volume":"11","author":"E Singer","year":"2005","unstructured":"Singer E: \"Phenome\" project set to pin down subgroups of autism. Nat Med. 2005, 11 (6): 583-10.1038\/nm0605-583a.","journal-title":"Nat Med"},{"issue":"jun29 1","key":"346_CR8","doi-asserted-by":"publisher","first-page":"b2393","DOI":"10.1136\/bmj.b2393","volume":"338","author":"JA Sterne","year":"2009","unstructured":"Sterne JA, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, Wood AM, Carpenter JR: Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009, 338 (jun29 1): b2393-10.1136\/bmj.b2393.","journal-title":"BMJ"},{"key":"346_CR9","doi-asserted-by":"publisher","first-page":"1355","DOI":"10.1056\/NEJMsr1203730","volume":"367","author":"RJ Little","year":"2012","unstructured":"Little RJ, D'Agostino R, Cohen ML, Dickersin K, Emerson SS, Farrar JT, Frangakis C, Hogan JW, Molenberghs G, Murphy SA, Neaton JD, Rotnitzky A, Scharfstein D, Shih WJ, Siegel JP, Stern H: The prevention and treatment of missing data in clinical trials. N Engl J Med. 2012, 367: 1355-1360. 10.1056\/NEJMsr1203730.","journal-title":"N Engl J Med"},{"key":"346_CR10","doi-asserted-by":"publisher","first-page":"528","DOI":"10.1080\/01621459.1987.10478458","volume":"82","author":"MA Tanner","year":"1987","unstructured":"Tanner MA, Wong WH: The calculation of posterior distributions by data augmentation. J Am Stat Assoc. 1987, 82: 528-550. 10.1080\/01621459.1987.10478458.","journal-title":"J Am Stat Assoc"},{"key":"346_CR11","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4024-2","volume-title":"Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions","author":"MA Tanner","year":"1996","unstructured":"Tanner MA: Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions. 1996, Springer-Verlag, New York"},{"issue":"1","key":"346_CR12","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1006\/jmva.1995.1029","volume":"53","author":"C Liu","year":"1995","unstructured":"Liu C: Missing data imputation using the multivariate t distribution. J Multivar Anal. 1995, 53 (1): 139-158. 10.1006\/jmva.1995.1029.","journal-title":"J Multivar Anal"},{"key":"346_CR13","doi-asserted-by":"publisher","DOI":"10.1002\/9781119013563","volume-title":"Statistical Analysis with Missing Data","author":"RJA Little","year":"2002","unstructured":"Little RJA, Rubin DB: Statistical Analysis with Missing Data. 2002, John Wiley, New York"},{"issue":"1","key":"346_CR14","first-page":"85","volume":"27","author":"TE Raghunathan","year":"2001","unstructured":"Raghunathan TE, Lepkowski JM, Hoewyk JV, Solenberger P: A multivariate technique for multiply imputing missing values using a sequence of regression models. Survey Methodology. 2001, 27 (1): 85-95.","journal-title":"Survey Methodology"},{"key":"346_CR15","first-page":"83","volume-title":"Proceeding of the Statistical Computing Section of the American Statistical Association","author":"DB Rubin","year":"1990","unstructured":"Rubin DB, Schafer JL: Efficiently creating multiple imputations for incomplete multivariate normal data. Proceeding of the Statistical Computing Section of the American Statistical Association. 1990, 83-88."},{"issue":"3","key":"346_CR16","first-page":"1","volume":"45","author":"S van Buuren KG-O","year":"2011","unstructured":"van Buuren KG-O S: Mice: multivariate imputation by chained equations in R. J Stat Softw. 2011, 45 (3): 1-67.","journal-title":"J Stat Softw"},{"issue":"1","key":"346_CR17","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1111\/j.1751-5823.2010.00103.x","volume":"78","author":"RR Andridge","year":"2010","unstructured":"Andridge RR, Little RJ: A review of hot deck imputation for survey non-response. Int Stat Rev. 2010, 78 (1): 40-64. 10.1111\/j.1751-5823.2010.00103.x.","journal-title":"Int Stat Rev"},{"issue":"1","key":"346_CR18","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1002\/sim.2939","volume":"27","author":"RJ Little","year":"2008","unstructured":"Little RJ, Yosef M, Cain KC, Nan B, Harlow SD: A hot-deck multiple imputation procedure for gaps in longitudinal data on recurrent events. Stat Med. 2008, 27 (1): 103-120. 10.1002\/sim.2939.","journal-title":"Stat Med"},{"key":"346_CR19","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316696","volume-title":"Multiple Imputation for Nonresponse in Surveys","author":"DB Rubin","year":"1987","unstructured":"Rubin DB: Multiple Imputation for Nonresponse in Surveys. 1987, Wiley, New York"},{"key":"346_CR20","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1080\/01621459.1995.10476488","volume":"90","author":"TE Raghunathan","year":"1995","unstructured":"Raghunathan TE, Grizzle JE: A split questionnaire survey design. J Am Stat Assoc. 1995, 90: 54-63. 10.1080\/01621459.1995.10476488.","journal-title":"J Am Stat Assoc"},{"key":"346_CR21","doi-asserted-by":"publisher","first-page":"335","DOI":"10.2307\/2986092","volume":"45","author":"TE Raghunathan","year":"1996","unstructured":"Raghunathan TE, Siscovick DS: A multile imputation analysis of a case-control study of the risk of primary cardiac arrest among pharmacologically treated hypertensives. Appl Stat. 1996, 45: 335-352. 10.2307\/2986092.","journal-title":"Appl Stat"},{"key":"346_CR22","doi-asserted-by":"publisher","DOI":"10.1201\/9781439821862","volume-title":"Analysis of Incomplete Multivariate Data by Simulation","author":"JL Schafer","year":"1997","unstructured":"Schafer JL: Analysis of Incomplete Multivariate Data by Simulation. 1997, Chapman and Hall, New York"},{"issue":"2","key":"346_CR23","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1016\/j.artmed.2010.05.002","volume":"50","author":"JM Jerez","year":"2010","unstructured":"Jerez JM, Molina I, Garcia-Laencina PJ, Alba E, Ribelles N, Martin M, Franco L: Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artif Intell Med. 2010, 50 (2): 105-115. 10.1016\/j.artmed.2010.05.002.","journal-title":"Artif Intell Med"},{"key":"346_CR24","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1186\/1471-2105-9-12","volume":"9","author":"GN Brock","year":"2008","unstructured":"Brock GN, Shaffer JR, Blakesley RE, Lotz MJ, Tseng GC: Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes. BMC Bioinformatics. 2008, 9: 12-10.1186\/1471-2105-9-12.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"346_CR25","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1093\/bioinformatics\/btq613","volume":"27","author":"DDK Sunghee Oh","year":"2011","unstructured":"Sunghee Oh DDK, Brock GN, Tseng GC: Biological impact of missing-value imputation on downstream analyses of gene expression profiles. Bioinformatics. 2011, 27 (1): 78-86. 10.1093\/bioinformatics\/btq613.","journal-title":"Bioinformatics"},{"key":"346_CR26","first-page":"113","volume":"28","author":"DJSP Buhlmann","year":"2011","unstructured":"Buhlmann DJSP: MissForest - nonparametric missing value imputation for mixed-type data. Bioinformatics. 2011, 28: 113-118.","journal-title":"Bioinformatics"},{"key":"346_CR27","doi-asserted-by":"publisher","first-page":"639","DOI":"10.1007\/978-3-642-17103-1_60","volume-title":"Clustering and Data Mining Applications","author":"E Acuna","year":"2004","unstructured":"Acuna E, Rodriguez C: The treatment of missing values and its effect in the classifier accuracy. Clustering and Data Mining Applications. 2004, 639-648. 10.1007\/978-3-642-17103-1_60."},{"issue":"3","key":"346_CR28","doi-asserted-by":"publisher","first-page":"e34","DOI":"10.1093\/nar\/gnh026","volume":"32","author":"TH B\u00f8","year":"2004","unstructured":"B\u00f8 TH, Dysvik B, Jonassen I: LSimpute: accurate estimation of missing values in microarray data with least squares methods. Nucleic Acids Res. 2004, 32 (3): e34-10.1093\/nar\/gnh026.","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"346_CR29","doi-asserted-by":"publisher","first-page":"448","DOI":"10.1214\/aoms\/1177705052","volume":"32","author":"I Olkin","year":"1961","unstructured":"Olkin I, Tate RF: Multivariate correlation models with mixed disrete and continuous variables. Ann Math Stat. 1961, 32 (2): 448-465. 10.1214\/aoms\/1177705052.","journal-title":"Ann Math Stat"},{"issue":"375","key":"346_CR30","doi-asserted-by":"publisher","first-page":"524","DOI":"10.1080\/01621459.1981.10477679","volume":"76","author":"A Agresti","year":"1981","unstructured":"Agresti A: Measures of nominal-ordinal association. J Am Stat Assoc. 1981, 76 (375): 524-529. 10.1080\/01621459.1981.10477679.","journal-title":"J Am Stat Assoc"},{"issue":"3","key":"346_CR31","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1007\/BF02294164","volume":"47","author":"FD Ulf Olsson","year":"1982","unstructured":"Ulf Olsson FD, Dorans NJ: The polyserial correlation coefficient. Psychometrika. 1982, 47 (3): 337-347. 10.1007\/BF02294164.","journal-title":"Psychometrika"},{"issue":"4","key":"346_CR32","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1007\/BF02296207","volume":"44","author":"U Olsson","year":"1979","unstructured":"Olsson U: Maximum likelihood estimation of the polychoric correlation coefficient. Psychometrika. 1979, 44 (4): 443-460. 10.1007\/BF02296207.","journal-title":"Psychometrika"},{"key":"346_CR33","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1126\/science.29.751.823","volume":"29","author":"F Boas","year":"1909","unstructured":"Boas F: Determination of the coefficient of correlation. Science. 1909, 29: 823-824. 10.1126\/science.29.751.823.","journal-title":"Science"},{"key":"346_CR34","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1098\/rsta.1900.0022","volume":"195","author":"K Pearson","year":"1900","unstructured":"Pearson K: Mathematical contributions to the theory of evolution. VII. On the correlation of characters not quantitatively measurable. Philos Trans R Soc Lond Ser A Math Phys Eng Sci. 1900, 195: 1-47. 10.1098\/rsta.1900.0022.","journal-title":"Philos Trans R Soc Lond Ser A Math Phys Eng Sci"},{"key":"346_CR35","doi-asserted-by":"publisher","first-page":"579","DOI":"10.2307\/2340126","volume":"75","author":"GU Yule","year":"1912","unstructured":"Yule GU: On the methods of measuring the association between two attributes. J Roy Statist Soc. 1912, 75: 579-652. 10.2307\/2340126.","journal-title":"J Roy Statist Soc"},{"key":"346_CR36","volume-title":"Mathematical Methods of Statistics","author":"H Cram\u00e9r","year":"1946","unstructured":"Cram\u00e9r H: Mathematical Methods of Statistics. 1946, Princeton University Press, Princeton"},{"issue":"4","key":"346_CR37","doi-asserted-by":"publisher","first-page":"857","DOI":"10.2307\/2528823","volume":"27","author":"JC Gower","year":"1971","unstructured":"Gower JC: A general coefficient of similarity and some of its properties. Biometrics. 1971, 27 (4): 857-871. 10.2307\/2528823.","journal-title":"Biometrics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-014-0346-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-014-0346-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-014-0346-6","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-014-0346-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T06:16:56Z","timestamp":1630563416000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-014-0346-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,11,5]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,12]]}},"alternative-id":["346"],"URL":"https:\/\/doi.org\/10.1186\/s12859-014-0346-6","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,11,5]]},"assertion":[{"value":"6 March 2014","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 October 2014","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 November 2014","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"346"}}