{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T03:40:49Z","timestamp":1760240449474,"version":"build-2065373602"},"reference-count":27,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2019,6,20]],"date-time":"2019-06-20T00:00:00Z","timestamp":1560988800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Future Internet"],"abstract":"<jats:p>Set-valued database publication has been increasing its importance recently due to its benefit for various applications such as marketing analysis and advertising. However, publishing a raw set-valued database may cause individual privacy breach such as the leakage of sensitive information like personal tendencies when data recipients perform data analysis. Even though imposing data anonymization methods such as suppression-based methods and random data swapping methods to such a database can successfully hide personal tendency, it induces item loss from records and causes significant distortion in record structure that degrades database utility. To avoid the problems, we proposed a method based on swapping technique where an individual\u2019s items in a record are swapped to items of the other record. Our swapping technique is distinct from existing one called random data swapping which yields much structure distortion. Even though the technique results in inaccuracy at a record level, it can preserve every single item in a database from loss. Thus, data recipients may obtain all the item information in an anonymized database. In addition, by carefully selecting a pair of records for item swapping, we can avoid excessive record structure distortion that leads to alter database content immensely. More importantly, such a strategy allows one to successfully hide personal tendency without sacrificing a lot of database utility.<\/jats:p>","DOI":"10.3390\/fi11060138","type":"journal-article","created":{"date-parts":[[2019,6,20]],"date-time":"2019-06-20T10:49:59Z","timestamp":1561027799000},"page":"138","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Data Anonymization for Hiding Personal Tendency in Set-Valued Database Publication"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1816-3142","authenticated-orcid":false,"given":"Dedi","family":"Gunawan","sequence":"first","affiliation":[{"name":"Division of Electrical Engineering and Computer Science, Graduate School of Natural Science and Technology, Kanazawa University, Kanazawa, Ishikawa 920-1192, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masahiro","family":"Mambo","sequence":"additional","affiliation":[{"name":"Faculty of Electrical and Computer Engineering, Institute of Science and Engineering, Kanazawa University, Kanazawa, Ishikawa 920-1192, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2019,6,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1016\/j.dss.2011.01.017","article-title":"The Role of Affect and Cognition on Online Consumers\u2019 Decision to Disclose Personal Information to Unfamiliar Online Vendors","volume":"51","author":"Li","year":"2011","journal-title":"Decis. Support Syst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1142\/S0218488502001648","article-title":"K-anonymity: A Model For Protecting Privacy","volume":"10","author":"Sweeney","year":"2002","journal-title":"Int. J. Uncertain. Fuzziness Knowl.-Based Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"115","DOI":"10.14778\/1453856.1453874","article-title":"Privacy-preserving anonymization of set-valued data","volume":"1","author":"Terrovitis","year":"2008","journal-title":"Proc. VLDB Endow."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Xu, Y., Fung, B.C.M., Wang, K., Fu, A.W.C., and Pei, J. (2008, January 15\u201319). Publishing Sensitive Transactions for Itemset Utility. Proceedings of the 2008 Eighth ICDM \u201908 IEEE International Conference on Data Mining, Pisa, Italy.","DOI":"10.1109\/ICDM.2008.98"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1007\/s11390-012-1203-6","article-title":"Publishing set-valued data against realistic adversaries","volume":"27","author":"Liu","year":"2012","journal-title":"J. Comput. Sci. Technol."},{"key":"ref_6","unstructured":"\u00d6zcan, F. (2005). Incognito: Efficient Full-Domain K-Anonymity. SIGMOD Conference, ACM."},{"key":"ref_7","unstructured":"Jia, X., Pan, C., Xu, X., Zhu, K.Q., and Lo, E. (2014). Suppression. International Conference on Database Systems for Advanced Applications, Springer."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1109\/TKDE.2010.101","article-title":"Anonymous Publication of Sensitive Transactional Data","volume":"23","author":"Ghinita","year":"2011","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Reiss, S.P., Post, M.J., and Dalenius, T. (1982, January 29\u201331). Non-reversible Privacy Transformations. Proceedings of the 1st ACM PODS \u201982 SIGACT-SIGMOD Symposium on Principles of Database Systems, Los Angeles, CA, USA.","DOI":"10.1145\/588131.588134"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Evfimievski, A., Gehrke, J., and Srikant, R. (2003). Limiting privacy breaches in privacy preserving data mining. Pods, 211\u2013222.","DOI":"10.1145\/773153.773174"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Atluri, V., and Diaz, C. (2011). Don\u2019t Reveal My Intension: Protecting User Privacy Using Declarative Preferences during Distributed Query Processing. Computer Security\u2013ESORICS 2011, Springer.","DOI":"10.1007\/978-3-642-23822-2"},{"key":"ref_12","unstructured":"Adar, E. (2007, January 8\u201312). User 4xxxxx9: Anonymizing query logs. Proceedings of the of Query Log Analysis Workshop, International Conference on World Wide Web, Banff, AB, Canada."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1145\/348.349","article-title":"Practical Data-swapping: The First Steps","volume":"9","author":"Reiss","year":"1984","journal-title":"ACM Trans. Database Syst."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1016\/j.knosys.2016.08.014","article-title":"A Survey of Serendipity in Recommender Systems","volume":"111","author":"Kotkov","year":"2016","journal-title":"Know.-Based Syst."},{"key":"ref_15","unstructured":"Domingo-Ferrer, J., and Torra, V. (2002). Theory and Practical Applications for Statistical Agencies, Chapter A Quantitative Comparison of Disclosure Control Methods for Microdata, Confidentiality, Disclosure and Data Access, Elsevier."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.datak.2007.07.006","article-title":"Rethinking rank swapping to decrease disclosure risk","volume":"64","author":"Nin","year":"2008","journal-title":"Data Knowl. Eng."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3219","DOI":"10.1002\/sec.1527","article-title":"An Effective Value Swapping Method for Privacy Preserving Data Publishing","volume":"9","author":"Hasan","year":"2016","journal-title":"Secur. Commun. Netw."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Machanavajjhala, A., Kifer, D., Gehrke, J., and Venkitasubramaniam, M. (2007). L-diversity: Privacy Beyond K-anonymity. ACM Trans. Knowl. Discov. Data, 1.","DOI":"10.1145\/1217299.1217302"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1016\/S0377-0427(03)00643-5","article-title":"Disclosure Risk Assessment in Statistical Data Protection","volume":"164-165","author":"Torra","year":"2004","journal-title":"J. Comput. Appl. Math."},{"key":"ref_20","first-page":"219","article-title":"Personalized Anonymization for Set-Valued Data by Partial Suppression","volume":"11","author":"Nakagawa","year":"2018","journal-title":"Trans. Data Priv."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/335191.335372","article-title":"Mining Frequent Patterns Without Candidate Generation","volume":"29","author":"Han","year":"2000","journal-title":"SIGMOD Rec."},{"key":"ref_22","first-page":"43","article-title":"Privacy Preserving Frequent Itemset Mining","volume":"Volume 14","author":"Clifton","year":"2002","journal-title":"Proceedings of the IEEE ICDM Workshop on Privacy, Security and Data Mining (PSDM 2002)"},{"key":"ref_23","unstructured":"WarpDrive-Project (2018, June 01). Web-Based Attack Response with Practical and Deployable Research Initiative. Available online: https:\/\/warpdrive-project.jp\/."},{"key":"ref_24","unstructured":"Debatty, T. (2018, October 14). Java String Similarity. Available online: https:\/\/github.com\/tdebatty\/java-string-similarity."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Shen, D., Ruvini, J.D., Somaiya, M., and Sundaresan, N. (2011, January 24\u201328). Item Categorization in the e-Commerce Domain. Proceedings of the 20th CIKM \u201911 ACM International Conference on Information and Knowledge Management, Glasgow, UK.","DOI":"10.1145\/2063576.2063855"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"934","DOI":"10.14778\/1687627.1687733","article-title":"Anonymization of Set-Valued Data via Top-Down, Local Generalization","volume":"2","author":"He","year":"2009","journal-title":"Proc. VLDB Endow."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Fournier-Viger, P., Lin, J.C., Gomariz, A., Gueniche, T., Soltani, A., Deng, Z., and Lam, H.T. (2016, January 19\u201323). The SPMF Open-Source Data Mining Library Version 2. Proceedings of the Part III Machine Learning and Knowledge Discovery in Databases\u2013European Conference, ECML PKDD 2016, Riva del Garda, Italy.","DOI":"10.1007\/978-3-319-46131-1_8"}],"container-title":["Future Internet"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-5903\/11\/6\/138\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:00:00Z","timestamp":1760187600000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-5903\/11\/6\/138"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,6,20]]},"references-count":27,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2019,6]]}},"alternative-id":["fi11060138"],"URL":"https:\/\/doi.org\/10.3390\/fi11060138","relation":{},"ISSN":["1999-5903"],"issn-type":[{"type":"electronic","value":"1999-5903"}],"subject":[],"published":{"date-parts":[[2019,6,20]]}}}