{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T13:06:09Z","timestamp":1775912769543,"version":"3.50.1"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2017,1]]},"abstract":"<jats:p>Releasing full data records is one of the most challenging problems in data privacy. On the one hand, many of the popular techniques such as data de-identification are problematic because of their dependence on the background knowledge of adversaries. On the other hand, rigorous methods such as the exponential mechanism for differential privacy are often computationally impractical to use for releasing high dimensional data or cannot preserve high utility of original data due to their extensive data perturbation.<\/jats:p>\n          <jats:p>\n            This paper presents a criterion called\n            <jats:italic>plausible deniability<\/jats:italic>\n            that provides a formal privacy guarantee, notably for releasing sensitive datasets: an output record can be released only if a certain amount of input records are indistinguishable, up to a privacy parameter. This notion does not depend on the background knowledge of an adversary. Also, it can efficiently be checked by privacy tests. We present mechanisms to generate\n            <jats:italic>synthetic datasets<\/jats:italic>\n            with similar statistical properties to the input data and the same format. We study this technique both theoretically and experimentally. A key theoretical result shows that, with proper randomization, the plausible deniability mechanism generates differentially private synthetic data. We demonstrate the efficiency of this generative technique on a large dataset; it is shown to preserve the utility of original data with respect to various statistical analysis and machine learning measures.\n          <\/jats:p>","DOI":"10.14778\/3055540.3055542","type":"journal-article","created":{"date-parts":[[2017,3,15]],"date-time":"2017-03-15T14:27:29Z","timestamp":1489588049000},"page":"481-492","source":"Crossref","is-referenced-by-count":99,"title":["Plausible deniability for privacy-preserving data synthesis"],"prefix":"10.14778","volume":"10","author":[{"given":"Vincent","family":"Bindschaedler","sequence":"first","affiliation":[{"name":"UIUC"}]},{"given":"Reza","family":"Shokri","sequence":"additional","affiliation":[{"name":"Cornell Tech"}]},{"given":"Carl A.","family":"Gunter","sequence":"additional","affiliation":[{"name":"UIUC"}]}],"member":"320","published-online":{"date-parts":[[2017,1]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-87471-3_20"},{"key":"e_1_2_1_2_1","volume-title":"A face is exposed for aol searcher no. 4417749","author":"Barbaro M.","year":"2006","unstructured":"M. Barbaro and T. Z. Jr . A face is exposed for aol searcher no. 4417749 , 2006 . M. Barbaro and T. Z. Jr. A face is exposed for aol searcher no. 4417749, 2006."},{"key":"e_1_2_1_3_1","volume-title":"Synthesizing plausible privacy-preserving location traces","author":"Bindschaedler V.","year":"2016","unstructured":"V. Bindschaedler and R. Shokri . Synthesizing plausible privacy-preserving location traces . In IEEE S &P, 2016 . V. Bindschaedler and R. Shokri. Synthesizing plausible privacy-preserving location traces. In IEEE S&P, 2016."},{"key":"e_1_2_1_4_1","volume-title":"Plausible Deniability for Privacy-Preserving Data Synthesis (Extended Version). arXiv","author":"Bindschaedler V.","year":"2016","unstructured":"V. Bindschaedler , R. Shokri , and C. Gunter . Plausible Deniability for Privacy-Preserving Data Synthesis (Extended Version). arXiv , 2016 . V. Bindschaedler, R. Shokri, and C. Gunter. Plausible Deniability for Privacy-Preserving Data Synthesis (Extended Version). arXiv, 2016."},{"key":"e_1_2_1_5_1","volume-title":"Synthetics Generation Tool. https:\/\/vbinds.ch\/projects\/sgf","author":"Bindschaedler V.","year":"2016","unstructured":"V. Bindschaedler , R. Shokri , and C. Gunter . Synthetics Generation Tool. https:\/\/vbinds.ch\/projects\/sgf , 2016 . V. Bindschaedler, R. Shokri, and C. Gunter. Synthetics Generation Tool. https:\/\/vbinds.ch\/projects\/sgf, 2016."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14722\/ndss.2016.23328"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2450142.2450148"},{"key":"e_1_2_1_8_1","volume-title":"Differentially private data synthesis methods. arXiv preprint","author":"Bowen C. M.","year":"2016","unstructured":"C. M. Bowen and F. Liu . Differentially private data synthesis methods. arXiv preprint , 2016 . C. M. Bowen and F. Liu. Differentially private data synthesis methods. arXiv preprint, 2016."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.29012\/jpc.v2i2.589"},{"key":"e_1_2_1_10_1","volume-title":"JMLR","author":"Chaudhuri K.","year":"2011","unstructured":"K. Chaudhuri , C. Monteleoni , and A. D. Sarwate . Differentially private empirical risk minimization . JMLR , 2011 . K. Chaudhuri, C. Monteleoni, and A. D. Sarwate. Differentially private empirical risk minimization. JMLR, 2011."},{"key":"e_1_2_1_11_1","volume-title":"PVLDB","author":"Chen R.","year":"2011","unstructured":"R. Chen , N. Mohammed , B. C. Fung , B. C. Desai , and L. Xiong . Publishing set-valued data via differential privacy . PVLDB , 2011 . R. Chen, N. Mohammed, B. C. Fung, B. C. Desai, and L. Xiong. Publishing set-valued data via differential privacy. PVLDB, 2011."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2274576.2274608"},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4614-0326-5","volume-title":"Synthetic datasets for statistical disclosure control: theory and implementation","author":"Drechsler J.","year":"2011","unstructured":"J. Drechsler . Synthetic datasets for statistical disclosure control: theory and implementation . Springer Science & Business Media , 2011 . J. Drechsler. Synthetic datasets for statistical disclosure control: theory and implementation. Springer Science & Business Media, 2011."},{"key":"e_1_2_1_14_1","volume-title":"TAMC","author":"Dwork C.","year":"2008","unstructured":"C. Dwork . Differential privacy : A survey of results . In TAMC , 2008 . C. Dwork. Differential privacy: A survey of results. In TAMC, 2008."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/11681878_14"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1561\/0400000042"},{"key":"e_1_2_1_17_1","volume-title":"ISR","author":"Gibbs A. L.","year":"2002","unstructured":"A. L. Gibbs and F. E. Su . On choosing and bounding probability metrics . ISR , 2002 . A. L. Gibbs and F. E. Su. On choosing and bounding probability metrics. ISR, 2002."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01516-8_26"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/191839.191886"},{"key":"e_1_2_1_21_1","volume-title":"PJSM","author":"Hawala S.","year":"2008","unstructured":"S. Hawala . Producing partially synthetic data to avoid disclosure . In PJSM , 2008 . S. Hawala. Producing partially synthetic data to avoid disclosure. In PJSM, 2008."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2508859.2516707"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2007.06.013"},{"key":"e_1_2_1_24_1","volume-title":"Education and synthetic work-life earnings estimates. american community survey reports. acs-14","author":"Julian T.","year":"2011","unstructured":"T. Julian and R. Kominski . Education and synthetic work-life earnings estimates. american community survey reports. acs-14 . US Census Bureau , 2011 . T. Julian and R. Kominski. Education and synthetic work-life earnings estimates. american community survey reports. acs-14. US Census Bureau, 2011."},{"key":"e_1_2_1_25_1","author":"Keller S. A.","year":"2016","unstructured":"S. A. Keller , S. Shipp , and A. Schroeder . Does big data change the privacy landscape? a review of the issues. Annual Review of Statistics and Its Application , 2016 . S. A. Keller, S. Shipp, and A. Schroeder. Does big data change the privacy landscape? a review of the issues. Annual Review of Statistics and Its Application, 2016.","journal-title":"a review of the issues. Annual Review of Statistics and Its Application"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989345"},{"key":"e_1_2_1_27_1","volume-title":"Statistical Journal of the IAOS","author":"Kinney S. K.","year":"2014","unstructured":"S. K. Kinney , J. P. Reiter , and J. Miranda . Synlbd 2.0: improving the synthetic longitudinal business database . Statistical Journal of the IAOS , 2014 . S. K. Kinney, J. P. Reiter, and J. Miranda. Synlbd 2.0: improving the synthetic longitudinal business database. Statistical Journal of the IAOS, 2014."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1751-5823.2011.00153.x"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2808769.2808776"},{"key":"e_1_2_1_30_1","volume-title":"American Mathematical Soc.","author":"Levin D. A.","year":"2009","unstructured":"D. A. Levin , Y. Peres , and E. L. Wilmer . Markov chains and mixing times . American Mathematical Soc. , 2009 . D. A. Levin, Y. Peres, and E. L. Wilmer. Markov chains and mixing times. American Mathematical Soc., 2009."},{"key":"e_1_2_1_31_1","volume-title":"EDBT","author":"Li H.","year":"2014","unstructured":"H. Li , L. Xiong , and X. Jiang . Differentially private synthesization of multi-dimensional data using copula functions . In EDBT , 2014 . H. Li, L. Xiong, and X. Jiang. Differentially private synthesization of multi-dimensional data using copula functions. In EDBT, 2014."},{"key":"e_1_2_1_32_1","volume-title":"Model-based differential private data synthesis. arXiv","author":"Liu F.","year":"2016","unstructured":"F. Liu . Model-based differential private data synthesis. arXiv , 2016 . F. Liu. Model-based differential private data synthesis. arXiv, 2016."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2008.4497436"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1217299.1217302"},{"key":"e_1_2_1_35_1","volume-title":"US Army","author":"Margaritis D.","year":"2003","unstructured":"D. Margaritis . Learning Bayesian network model structure from data. PhD thesis , US Army , 2003 . D. Margaritis. Learning Bayesian network model structure from data. PhD thesis, US Army, 2003."},{"key":"e_1_2_1_36_1","volume-title":"TDP","author":"McClure D.","year":"2012","unstructured":"D. McClure and J. P. Reiter . Differential privacy and statistical disclosure risk measures: An investigation with binary synthetic data . TDP , 2012 . D. McClure and J. P. Reiter. Differential privacy and statistical disclosure risk measures: An investigation with binary synthetic data. TDP, 2012."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/FOCS.2007.41"},{"key":"e_1_2_1_38_1","author":"Mora M. T.","year":"2005","unstructured":"M. T. Mora , D. J. Villa , and A. D\u00e1vila . Language maintenance among the children of immigrants: A comparison of border states with other regions of the us. Southwest Journal of Linguistics , 2005 . M. T. Mora, D. J. Villa, and A. D\u00e1vila. Language maintenance among the children of immigrants: A comparison of border states with other regions of the us. Southwest Journal of Linguistics, 2005.","journal-title":"Southwest Journal of Linguistics"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2008.33"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2009.22"},{"key":"e_1_2_1_41_1","volume-title":"PVLDB","author":"Pedersen K. H.","year":"2006","unstructured":"K. H. Pedersen , K. Torp , and R. Wind . Simple and realistic data generation . In PVLDB , 2006 . K. H. Pedersen, K. Torp, and R. Wind. Simple and realistic data generation. In PVLDB, 2006."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.29012\/jpc.v1i1.567"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.29012\/jpc.v6i1.635"},{"key":"e_1_2_1_44_1","volume-title":"Journal of official Statistics","author":"Rubin D. B.","year":"1993","unstructured":"D. B. Rubin . Statistical disclosure limitation. Journal of official Statistics , 1993 . D. B. Rubin. Statistical disclosure limitation. Journal of official Statistics, 1993."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/275487.275508"},{"key":"e_1_2_1_46_1","unstructured":"A. Tockar. Riding with the stars: Passenger privacy in the nyc taxicab dataset. http:\/\/research.neustar.biz\/2014\/09\/15\/riding-with-the-stars-passenger-privacy-in-the-nyc-taxicab-dataset\/ 2014.  A. Tockar. Riding with the stars: Passenger privacy in the nyc taxicab dataset. http:\/\/research.neustar.biz\/2014\/09\/15\/riding-with-the-stars-passenger-privacy-in-the-nyc-taxicab-dataset\/ 2014."},{"key":"e_1_2_1_47_1","unstructured":"US Census Bureau. American community survey (acs). http:\/\/www.census.gov\/programs-surveys\/acs\/.  US Census Bureau. American community survey (acs). http:\/\/www.census.gov\/programs-surveys\/acs\/."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1653662.1653726"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1198\/jasa.2009.tm08651"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1377966.1377977"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0309-y"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2588573"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3055540.3055542","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T11:12:45Z","timestamp":1672225965000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3055540.3055542"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,1]]},"references-count":51,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2017,1]]}},"alternative-id":["10.14778\/3055540.3055542"],"URL":"https:\/\/doi.org\/10.14778\/3055540.3055542","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2017,1]]}}}