{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,21]],"date-time":"2026-07-21T03:10:22Z","timestamp":1784603422914,"version":"3.55.0"},"reference-count":35,"publisher":"Association for Computing Machinery (ACM)","issue":"10","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2018,6]]},"abstract":"<jats:p>\n            Privacy is an important concern for our society where sharing data with partners or releasing data to the public is a frequent occurrence. Some of the techniques that are being used to achieve privacy are to remove identifiers, alter quasi-identifiers, and perturb values. Unfortunately, these approaches suffer from two limitations. First, it has been shown that private information can still be leaked if attackers possess some background knowledge or other information sources. Second, they do not take into account the adverse impact these methods will have on the utility of the released data. In this paper, we propose a method that meets both requirements. Our method, called\n            <jats:italic>table-GAN<\/jats:italic>\n            , uses generative adversarial networks (GANs) to synthesize fake tables that are statistically similar to the original table yet do not incur information leakage. We show that the machine learning models trained using our synthetic tables exhibit performance that is similar to that of models trained using the original table for unknown testing cases. We call this property\n            <jats:italic>model compatibility<\/jats:italic>\n            . We believe that anonymization\/perturbation\/synthesis methods without model compatibility are of little value. We used four real-world datasets from four different domains for our experiments and conducted indepth comparisons with state-of-the-art anonymization, perturbation, and generation techniques. Throughout our experiments, only our method consistently shows balance between privacy level and model compatibility.\n          <\/jats:p>","DOI":"10.14778\/3231751.3231757","type":"journal-article","created":{"date-parts":[[2018,7,27]],"date-time":"2018-07-27T12:21:07Z","timestamp":1532694067000},"page":"1071-1083","source":"Crossref","is-referenced-by-count":436,"title":["Data synthesis based on generative adversarial networks"],"prefix":"10.14778","volume":"11","author":[{"given":"Noseong","family":"Park","sequence":"first","affiliation":[{"name":"University of North Carolina"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mahmoud","family":"Mohammadi","sequence":"additional","affiliation":[{"name":"University of North Carolina"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kshitij","family":"Gorde","sequence":"additional","affiliation":[{"name":"University of North Carolina"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sushil","family":"Jajodia","sequence":"additional","affiliation":[{"name":"George Mason University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hongkyu","family":"Park","sequence":"additional","affiliation":[{"name":"ETRI, Daejeon, South Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Youngmin","family":"Kim","sequence":"additional","affiliation":[{"name":"ETRI, Daejeon, South Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,6]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Adult Dataset. https:\/\/archive.ics.uci.edu\/ml\/machine-learning-databases\/adult.  Adult Dataset. https:\/\/archive.ics.uci.edu\/ml\/machine-learning-databases\/adult."},{"key":"e_1_2_1_2_1","unstructured":"Airline Dataset. https:\/\/www.transtats.bts.gov\/DataIndex.asp.  Airline Dataset. https:\/\/www.transtats.bts.gov\/DataIndex.asp."},{"key":"e_1_2_1_3_1","unstructured":"ARX - Data Anonymization Tool. http:\/\/arx.deidentifier.org.  ARX - Data Anonymization Tool. http:\/\/arx.deidentifier.org."},{"key":"e_1_2_1_4_1","unstructured":"Health Dataset. https:\/\/wwwn.cdc.gov\/Nchs\/Nhanes\/Default.aspx.  Health Dataset. https:\/\/wwwn.cdc.gov\/Nchs\/Nhanes\/Default.aspx."},{"key":"e_1_2_1_5_1","unstructured":"Los Angeles City Government Employee Payroll Dataset. https:\/\/controllerdata.lacity.org\/Payroll\/City-Employee-Payroll\/pazn-qyym.  Los Angeles City Government Employee Payroll Dataset. https:\/\/controllerdata.lacity.org\/Payroll\/City-Employee-Payroll\/pazn-qyym."},{"key":"e_1_2_1_6_1","unstructured":"Scikit-learn: Machine learning in Python. http:\/\/scikit-learn.org.  Scikit-learn: Machine learning in Python. http:\/\/scikit-learn.org."},{"key":"e_1_2_1_7_1","unstructured":"sdcMicro: Statistical Disclosure Control Methods for Anonymization of Microdata and Risk Estimation. https:\/\/cran.r-project.org\/web\/packages\/sdcMicro\/index.html.  sdcMicro: Statistical Disclosure Control Methods for Anonymization of Microdata and Risk Estimation. https:\/\/cran.r-project.org\/web\/packages\/sdcMicro\/index.html."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-24741-8_12"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/375551.375602"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/335191.335438"},{"key":"e_1_2_1_11_1","volume-title":"Sensegen: A deep learning architecture for synthetic sensor data generation. CoRR, abs\/1701.08886","author":"Alzantot M.","year":"2017","unstructured":"M. Alzantot , S. Chakraborty , and M. B. Srivastava . Sensegen: A deep learning architecture for synthetic sensor data generation. CoRR, abs\/1701.08886 , 2017 . M. Alzantot, S. Chakraborty, and M. B. Srivastava. Sensegen: A deep learning architecture for synthetic sensor data generation. CoRR, abs\/1701.08886, 2017."},{"key":"e_1_2_1_12_1","volume-title":"Towards principled methods for training generative adversarial networks. CoRR, abs\/1701.04862","author":"Arjovsky M.","year":"2017","unstructured":"M. Arjovsky and L. Bottou . Towards principled methods for training generative adversarial networks. CoRR, abs\/1701.04862 , 2017 . M. Arjovsky and L. Bottou. Towards principled methods for training generative adversarial networks. CoRR, abs\/1701.04862, 2017."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1561\/2200000016"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401904"},{"key":"e_1_2_1_15_1","volume-title":"Generating multi-label discrete electronic health records using generative adversarial networks. CoRR, abs\/1703.06490","author":"Choi E.","year":"2017","unstructured":"E. Choi , S. Biswal , B. Malin , J. Duke , W. F. Stewart , and J. Sun . Generating multi-label discrete electronic health records using generative adversarial networks. CoRR, abs\/1703.06490 , 2017 . E. Choi, S. Biswal, B. Malin, J. Duke, W. F. Stewart, and J. Sun. Generating multi-label discrete electronic health records using generative adversarial networks. CoRR, abs\/1703.06490, 2017."},{"key":"e_1_2_1_16_1","first-page":"286","volume-title":"Proceedings of the 2nd Machine Learning for Healthcare Conference","author":"Choi E.","year":"2017","unstructured":"E. Choi , S. Biswal , B. Malin , J. Duke , W. F. Stewart , and J. Sun . Generating multi-label discrete patient records using generative adversarial networks. In F. Doshi-Velez, J. Fackler, D. Kale, R. Ranganath, B. Wallace, and J. Wiens, editors , Proceedings of the 2nd Machine Learning for Healthcare Conference , pages 286 -- 305 , 2017 . E. Choi, S. Biswal, B. Malin, J. Duke, W. F. Stewart, and J. Sun. Generating multi-label discrete patient records using generative adversarial networks. In F. Doshi-Velez, J. Fackler, D. Kale, R. Ranganath, B. Wallace, and J. Wiens, editors, Proceedings of the 2nd Machine Learning for Healthcare Conference, pages 286--305, 2017."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1754239.1754271"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-70992-5_3"},{"key":"e_1_2_1_19_1","first-page":"2672","volume-title":"Advances in neural information processing systems","author":"Goodfellow I.","year":"2014","unstructured":"I. Goodfellow , J. Pouget-Abadie , M. Mirza , B. Xu , D. Warde-Farley , S. Ozair , A. Courville , and Y. Bengio . Generative adversarial nets . In Advances in neural information processing systems , pages 2672 -- 2680 , 2014 . I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio. Generative adversarial nets. In Advances in neural information processing systems, pages 2672--2680, 2014."},{"key":"e_1_2_1_20_1","first-page":"448","volume-title":"Proceedings of the 32nd International Conference on Machine Learning","author":"Ioffe S.","year":"2015","unstructured":"S. Ioffe and C. Szegedy . Batch normalization: Accelerating deep network training by reducing internal covariate shift . In Proceedings of the 32nd International Conference on Machine Learning , pages 448 -- 456 , 2015 . S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on Machine Learning, pages 448--456, 2015."},{"key":"e_1_2_1_21_1","volume-title":"Progressive growing of gans for improved quality, stability, and variation. CoRR, abs\/1710.10196","author":"Karras T.","year":"2017","unstructured":"T. Karras , T. Aila , S. Laine , and J. Lehtinen . Progressive growing of gans for improved quality, stability, and variation. CoRR, abs\/1710.10196 , 2017 . T. Karras, T. Aila, S. Laine, and J. Lehtinen. Progressive growing of gans for improved quality, stability, and variation. CoRR, abs\/1710.10196, 2017."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2007.367856"},{"key":"e_1_2_1_23_1","volume-title":"ICML Workshop on Deep Learning for Audio, Speech, and Language Processing","author":"Maas A. L.","year":"2013","unstructured":"A. L. Maas , A. Y. Hannun , and A. Y. Ng . Rectifier nonlinearities improve neural network acoustic models . In ICML Workshop on Deep Learning for Audio, Speech, and Language Processing , 2013 . A. L. Maas, A. Y. Hannun, and A. Y. Ng. Rectifier nonlinearities improve neural network acoustic models. In ICML Workshop on Deep Learning for Audio, Speech, and Language Processing, 2013."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1217299.1217302"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-25955-8_16"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020487"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/3104322.3104425"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/3305890.3305954"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.14778\/3231751.3231757"},{"key":"e_1_2_1_30_1","volume-title":"Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR, abs\/1511.06434","author":"Radford A.","year":"2015","unstructured":"A. Radford , L. Metz , and S. Chintala . Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR, abs\/1511.06434 , 2015 . A. Radford, L. Metz, and S. Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR, abs\/1511.06434, 2015."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976604773135104"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the IEEE Symposium on Research in Security and Privacy","author":"Samarati P.","year":"1998","unstructured":"P. Samarati and L. Sweeney . Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression . In Proceedings of the IEEE Symposium on Research in Security and Privacy , 1998 . P. Samarati and L. Sweeney. Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. In Proceedings of the IEEE Symposium on Research in Security and Privacy, 1998."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2017.41"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1029179.1029202"},{"key":"e_1_2_1_35_1","volume-title":"Distributed stochastic optimization for deep learning. CoRR, abs\/1605.02216","author":"Zhang S.","year":"2016","unstructured":"S. Zhang . Distributed stochastic optimization for deep learning. CoRR, abs\/1605.02216 , 2016 . S. Zhang. Distributed stochastic optimization for deep learning. CoRR, abs\/1605.02216, 2016."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3231751.3231757","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:40:56Z","timestamp":1672224056000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3231751.3231757"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6]]},"references-count":35,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2018,6]]}},"alternative-id":["10.14778\/3231751.3231757"],"URL":"https:\/\/doi.org\/10.14778\/3231751.3231757","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2018,6]]}}}