{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T15:37:51Z","timestamp":1770565071983,"version":"3.49.0"},"reference-count":25,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2006,6,1]],"date-time":"2006-06-01T00:00:00Z","timestamp":1149120000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGKDD Explor. Newsl."],"published-print":{"date-parts":[[2006,6]]},"abstract":"<jats:p>\n            Missing data is a well-recognized problem in large datasets, widely discussed in the statistics and data analysis literature. Many programming environments provide explicit codes for missing data, but these are not standardized and are not always used. This lack of standardization is one of the leading causes of the subtle problem of\n            <jats:italic>disguised missing data<\/jats:italic>\n            , in which unknown, inapplicable, or otherwise nonspecified responses are encoded as valid data values. Following a brief overview of the problem of explicitly coded missing data, this paper discusses sources, consequences, and detection of disguised missing data, including two real-world examples. As the first of these examples illustrates, the consequences of disguised missing data can be quite serious. The key to its detection lies in first, recognizing disguised missing data as a possibility and second, finding a sufficiently informative view of the data to reveal its presence.\n          <\/jats:p>","DOI":"10.1145\/1147234.1147247","type":"journal-article","created":{"date-parts":[[2007,1,17]],"date-time":"2007-01-17T18:32:02Z","timestamp":1169058722000},"page":"83-92","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":59,"title":["The problem of disguised missing data"],"prefix":"10.1145","volume":"8","author":[{"given":"Ronald K.","family":"Pearson","sequence":"first","affiliation":[{"name":"ProSanos Corporation, Harrisburg, PA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2006,6]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Addison-Wesley","author":"Adriaans P.","year":"1996"},{"key":"e_1_2_1_2_1","unstructured":"V. Barnett and T. Lewis. Outliers in Statistical Data. Wiley 3rd edition 1994.  V. Barnett and T. Lewis. Outliers in Statistical Data. Wiley 3rd edition 1994."},{"key":"e_1_2_1_3_1","volume-title":"Proc. 33rd Symposium on the Interface, Computing Science and Statistics","author":"Breault J.","year":"2001"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018054314350"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1032181158"},{"key":"e_1_2_1_6_1","unstructured":"C. Date. An Introduction to Database Systems. Addison-Wesley 7th edition 2000.   C. Date. An Introduction to Database Systems. Addison-Wesley 7th edition 2000."},{"key":"e_1_2_1_7_1","volume-title":"Proc. SAS User's Group Intl. Conf., SUGI26","author":"DesJardins D.","year":"2001"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"A. Feelders. Handling missing data in trees: surrogate splits or statistical imputation? In Principles of Data Mining and Knowledge Discovery (PKDD99) pages 329--334 1999.   A. Feelders. Handling missing data in trees: surrogate splits or statistical imputation? In Principles of Data Mining and Knowledge Discovery (PKDD99) pages 329--334 1999.","DOI":"10.1007\/978-3-540-48247-5_38"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.2307\/2532251"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176348396"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1198\/000313001317098266"},{"key":"e_1_2_1_12_1","first-page":"69","article-title":"Missing data in behavioral science research: Investigation of a collection of datasets","volume":"57","author":"Huisman M.","year":"1998","journal-title":"Kwantitatieve Methoden"},{"key":"e_1_2_1_13_1","first-page":"325","volume-title":"Proc. 14th Symposium Computational Statistics, COMPSTAT 2000","author":"Huisman M.","year":"2000"},{"key":"e_1_2_1_14_1","volume-title":"University of Groningen","author":"Huisman M.","year":"1998"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1214\/009053605000000363"},{"key":"e_1_2_1_16_1","first-page":"121","volume-title":"Machine Learning: Proc. 11th International Conf.","author":"John G.","year":"1994"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/21412"},{"key":"e_1_2_1_18_1","volume-title":"Wiley","author":"McLachlan G.","year":"1997"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"J. Mistiaen and M. Ravallion. Survey compliance and the distribution of income. Policy Research Working Paper WPS2956 The World Bank Development Research Group Poverty Team available at http:\/\/econ.worldbank.org 2003.  J. Mistiaen and M. Ravallion. Survey compliance and the distribution of income. Policy Research Working Paper WPS2956 The World Bank Development Research Group Poverty Team available at http:\/\/econ.worldbank.org 2003.","DOI":"10.1596\/1813-9450-2956"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/371920.372183"},{"key":"e_1_2_1_21_1","volume-title":"SIAM","author":"Pearson R.","year":"2005"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1037\/1082-989X.7.2.147"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0011-393X(01)80070-9"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/1965392"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/lt.500030102"}],"container-title":["ACM SIGKDD Explorations Newsletter"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1147234.1147247","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1147234.1147247","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T15:06:24Z","timestamp":1750259184000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1147234.1147247"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,6]]},"references-count":25,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,6]]}},"alternative-id":["10.1145\/1147234.1147247"],"URL":"https:\/\/doi.org\/10.1145\/1147234.1147247","relation":{},"ISSN":["1931-0145","1931-0153"],"issn-type":[{"value":"1931-0145","type":"print"},{"value":"1931-0153","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,6]]},"assertion":[{"value":"2006-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}