{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,2]],"date-time":"2025-11-02T06:07:49Z","timestamp":1762063669510,"version":"build-2065373602"},"reference-count":100,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2022,8,15]],"date-time":"2022-08-15T00:00:00Z","timestamp":1660521600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JCP"],"abstract":"<jats:p>Two factors are crucial for the effective operation of modern-day smart services: Initially, IoT-enabled technologies have to capture and combine huge amounts of data on data subjects. Then, all these data have to be processed exhaustively by means of techniques from the area of big data analytics. With regard to the latter, thorough data refinement in terms of data cleansing and data transformation is the decisive cornerstone. Studies show that data refinement reaches its full potential only by involving domain experts in the process. However, this means that these experts need full insight into the data in order to be able to identify and resolve any issues therein, e.g., by correcting or removing inaccurate, incorrect, or irrelevant data records. In particular for sensitive data (e.g., private data or confidential data), this poses a problem, since these data are thereby disclosed to third parties such as domain experts. To this end, we introduce SMARTEN, a sample-based approach towards privacy-friendly data refinement to smarten up big data analytics and smart services. SMARTEN applies a revised data refinement process that fully involves domain experts in data pre-processing but does not expose any sensitive data to them or any other third-party. 
To achieve this, domain experts obtain a representative sample of the entire data set that meets all privacy policies and confidentiality guidelines. Based on this sample, domain experts define data cleaning and transformation steps. Subsequently, these steps are converted into executable data refinement rules and applied to the entire data set. Domain experts can request further samples and define further rules until the data quality required for the intended use case is reached. Evaluation results confirm that our approach is effective in terms of both data quality and data privacy.<\/jats:p>","DOI":"10.3390\/jcp2030031","type":"journal-article","created":{"date-parts":[[2022,8,15]],"date-time":"2022-08-15T03:33:20Z","timestamp":1660534400000},"page":"606-628","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["SMARTEN\u2014A Sample-Based Approach towards Privacy-Friendly Data Refinement"],"prefix":"10.3390","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3795-7909","authenticated-orcid":false,"given":"Christoph","family":"Stach","sequence":"first","affiliation":[{"name":"Institute for Parallel and Distributed Systems, University of Stuttgart, Universit\u00e4tsstra\u00dfe 38, 70569 Stuttgart, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0410-5307","authenticated-orcid":false,"given":"Michael","family":"Behringer","sequence":"additional","affiliation":[{"name":"Institute for Parallel and Distributed Systems, University of Stuttgart, Universit\u00e4tsstra\u00dfe 38, 70569 Stuttgart, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4676-6410","authenticated-orcid":false,"given":"Julia","family":"Br\u00e4cker","sequence":"additional","affiliation":[{"name":"Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 5B, 70569 Stuttgart, 
Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0835-8678","authenticated-orcid":false,"given":"Cl\u00e9mentine","family":"Gritti","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Software Engineering, University of Canterbury, Christchurch 8041, New Zealand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bernhard","family":"Mitschang","sequence":"additional","affiliation":[{"name":"Institute for Parallel and Distributed Systems, University of Stuttgart, Universit\u00e4tsstra\u00dfe 38, 70569 Stuttgart, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,15]]},"reference":[{"key":"ref_1","unstructured":"Schwab, K., Marcus, A., Oyola, J.R., Hoffman, W., and Luzi, M. (2011). Personal Data: The Emergence of a New Asset Class, World Economic Forum."},{"key":"ref_2","unstructured":"Toonders, J. (2014). Data is the New Oil of the Digital Economy. WIRED, Cond\u00e9 Nast."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1186\/s40504-021-00118-6","article-title":"\u201cData is the new oil\u201d: Citizen science and informed consent in an era of researchers handling of an economically valuable resource","volume":"17","author":"Quigley","year":"2021","journal-title":"Life Sci. Soc. 
Policy"},{"key":"ref_4","first-page":"8","article-title":"Data Strategy and Data Trust\u2013Drivers for Business Development","volume":"54","author":"Jesse","year":"2021","journal-title":"IFAC Pap."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3","DOI":"10.5334\/fce.116","article-title":"A Novel Model for Data-Driven Smart Sustainable Cities of the Future: A Strategic Roadmap to Transformational Change in the Era of Big Data","volume":"7","author":"Bibri","year":"2021","journal-title":"Future Cities Environ."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Das, S., and Gochhait, S. (2021). Entertainment in Era of AI, Big Data & IoT. Digital Entertainment: The Next Evolution in Service Sector, Springer.","DOI":"10.1007\/978-981-15-9724-4"},{"key":"ref_7","unstructured":"Jossen, S. (Economist, 2017). The World\u2019s Most Valuable Resource Is No Longer Oil, But Data, Economist."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1016\/j.future.2017.09.083","article-title":"Toward efficient smartification of the Internet of Things (IoT) services","volume":"92","author":"Bello","year":"2019","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_9","unstructured":"Bhageshpur, K. (2019). Data is the New Oil\u2014And That\u2019s a Good Thing, Forbes Technology Council."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Taffel, S. (2021). Data and oil: Metaphor, materiality and metabolic rifts. New Media Soc. (OnlineFirst), 14614448211017887.","DOI":"10.1177\/14614448211017887"},{"key":"ref_11","first-page":"1","article-title":"Understanding Data, Information, Knowledge And Their Inter-Relationships","volume":"8","author":"Liew","year":"2007","journal-title":"J. Knowl. Manag. 
Pract."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1177\/0165551506070706","article-title":"The wisdom hierarchy: Representations of the DIKW hierarchy","volume":"33","author":"Rowley","year":"2007","journal-title":"J. Inf. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Hashemi, S.H., Faghri, F., Rausch, P., and Campbell, R.H. (2016, January 4\u20138). World of Empowered IoT Users. Proceedings of the 2016 IEEE First International Conference on Internet-of-Things Design and Implementation (IoTDI), Berlin, Germany.","DOI":"10.1109\/IoTDI.2015.39"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"69","DOI":"10.5204\/lthj.1470","article-title":"Revising the DIKW Pyramid and the Real Relationship between Data, Information, Knowledge and Wisdom","volume":"2","year":"2020","journal-title":"Law Technol. Hum."},{"key":"ref_15","first-page":"4102","article-title":"Review of Data Preprocessing Techniques in Data Mining","volume":"12","author":"Alasadi","year":"2017","journal-title":"J. Eng. Appl. Sci."},{"key":"ref_16","unstructured":"Elgendy, N., and Elragal, A. (2014, January 16\u201320). Big Data Analytics: A Literature Review Paper. Proceedings of the 14th Industrial Conference on Data Mining (ICDM), St. Petersburg, Russia."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Maimon, O., and Rokach, L. (2010). Data Cleansing: A Prelude to Knowledge Discovery. Data Mining and Knowledge Discovery Handbook, Springer.","DOI":"10.1007\/978-0-387-09823-4"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Tawalbeh, L., Muheidat, F., Tawalbeh, M., and Quwaider, M. (2020). IoT Privacy and Security: Challenges and Solutions. Appl. 
Sci., 10.","DOI":"10.3390\/app10124102"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"275","DOI":"10.3390\/iot2020015","article-title":"Using Citizen Science to Complement IoT Data Collection: A Survey of Motivational and Engagement Factors in Technology-Centric Citizen Science Projects","volume":"2","author":"Ali","year":"2021","journal-title":"IoT"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"265","DOI":"10.2218\/ijdc.v8i1.259","article-title":"Data Management of Confidential Data","volume":"8","author":"Lagoze","year":"2013","journal-title":"Int. J. Digit. Curation"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Ukil, A., Bandyopadhyay, S., and Pal, A. (May, January 27). IoT-Privacy: To be private or not to be private. Proceedings of the 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada.","DOI":"10.1109\/INFCOMW.2014.6849186"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1016\/j.cose.2018.04.002","article-title":"Explaining the privacy paradox: A systematic review of literature investigating privacy attitude and behavior","volume":"77","author":"Gerber","year":"2018","journal-title":"Comput. Secur."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1007\/s42979-021-00765-8","article-title":"Data Science and Analytics: An Overview from Data-Driven Smart Computing, Decision-Making and Applications Perspective","volume":"2","author":"Sarker","year":"2021","journal-title":"SN Comput. Sci."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Matignon, R. (2007). Data Mining Using SAS Enterprise Miner, Wiley.","DOI":"10.1002\/9780470171431"},{"key":"ref_25","first-page":"13","article-title":"The CRISP-DM Model: The New Blueprint for Data Mining","volume":"5","author":"Shearer","year":"2000","journal-title":"J. 
Data Warehous."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/240455.240464","article-title":"The KDD Process for Extracting Useful Knowledge from Volumes of Data","volume":"39","author":"Fayyad","year":"1996","journal-title":"Commun. ACM"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Kutzias, D., Dukino, C., and Kett, H. (2021, January 25\u201329). Towards a Continuous Process Model for Data Science Projects. Proceedings of the 12th International Conference on Applied Human Factors and Ergonomics (AHFE), New York, NY, USA.","DOI":"10.1007\/978-3-030-80840-2_23"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1109\/TKDE.2008.133","article-title":"Monitoring Online Tests through Data Visualization","volume":"21","author":"Costagliola","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_29","unstructured":"Uttamchandani, S. (2020). The Self-Service Data Roadmap: Democratize Data and Reduce Time to Insight, O\u2019Reilly."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Azeroual, O. (2020). Data Wrangling in Database Systems: Purging of Dirty Data. Data, 50.","DOI":"10.3390\/data5020050"},{"key":"ref_31","unstructured":"Delen, D. (2019). Prescriptive Analytics: The Final Frontier for Evidence-Based Management and Optimal Decision Making, Pearson FT Press."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Luengo, J., Garc\u00eda-Gil, D., Ram\u00edrez-Gallego, S., Garc\u00eda, S., and Herrera, F. (2020). Big Data Preprocessing: Enabling Smart Data, Springer.","DOI":"10.1007\/978-3-030-39105-8"},{"key":"ref_33","unstructured":"European Parliament and Council of the European Union (2016). Regulation on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95\/46\/EC (Data Protection Directive). Legislative Acts L119. Off. J. Eur. 
Union, Available online: https:\/\/gdpr-info.eu\/."},{"key":"ref_34","first-page":"102896","article-title":"Guidelines for GDPR compliance in Big Data systems","volume":"61","author":"Rhahla","year":"2021","journal-title":"J. Inf. Secur. Appl."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2012","DOI":"10.1109\/TIFS.2019.2954652","article-title":"Data Disclosure Under Perfect Sample Privacy","volume":"15","author":"Rassouli","year":"2020","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1109\/MSEC.2018.2888775","article-title":"Privacy-Preserving Machine Learning: Threats and Solutions","volume":"17","author":"Chang","year":"2019","journal-title":"IEEE Secur. Priv."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1495","DOI":"10.1007\/s12652-020-02801-6","article-title":"A secure and efficient privacy-preserving data aggregation algorithm","volume":"13","author":"Dou","year":"2022","journal-title":"J. Ambient. Intell. Humaniz. Comput."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"7250","DOI":"10.1109\/JIOT.2020.2983213","article-title":"Smart Meter Data Obfuscation Using Correlated Noise","volume":"7","author":"Khwaja","year":"2020","journal-title":"IEEE Internet Things J."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Gangarde, R., Sharma, A., Pawar, A., Joshi, R., and Gonge, S. (2021). Privacy Preservation in Online Social Networks Using Multiple-Graph-Properties-Based Clustering to Ensure k-Anonymity, l-Diversity, and t-Closeness. Electronics, 10.","DOI":"10.3390\/electronics10222877"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Stach, C. (2015, January 15\u201318). How to Deal with Third Party Apps in a Privacy System\u2014The PMP Gatekeeper. 
Proceedings of the 2015 IEEE 16th International Conference on Mobile Data Management (MDM), Pittsburgh, PA, USA.","DOI":"10.1109\/MDM.2015.17"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Stach, C. (2013, January 3\u20136). How to Assure Privacy on Android Phones and Devices?. Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management (MDM), Milan, Italy.","DOI":"10.1109\/MDM.2013.54"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Stach, C., and Mitschang, B. (2013, January 3\u20136). Privacy Management for Mobile Platforms\u2014A Review of Concepts and Approaches. Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management (MDM), Milan, Italy.","DOI":"10.1109\/MDM.2013.45"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Hou, W.C., Ozsoyoglu, G., and Taneja, B.K. (1988, January 21\u201323). Statistical Estimators for Relational Algebra Expressions. Proceedings of the Seventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), Austin, TX, USA.","DOI":"10.1145\/308386.308455"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Mori, P., Furnell, S., and Camp, O. (2018, January 22\u201324). Fine-Grained Privacy Control for Fitness and Health Applications Using the Privacy Management Platform. Proceedings of the Information Systems Security and Privacy: 4th International Conference, ICISSP 2018, Funchal, Portugal. Revised Selected Papers.","DOI":"10.1007\/978-3-030-25109-3"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1145\/125137.125166","article-title":"Evaluation of Relational Algebras Incorporating the Time Dimension in Databases","volume":"23","author":"McKenzie","year":"1991","journal-title":"ACM Comput. 
Surv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"566","DOI":"10.1145\/32204.32219","article-title":"Extending Relational Algebra and Relational Calculus with Set-Valued Attributes and Aggregate Functions","volume":"12","author":"Matos","year":"1987","journal-title":"ACM Trans. Database Syst."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Li, J., Maier, D., Tufte, K., Papadimos, V., and Tucker, P.A. (2005, January 14\u201316). Semantics and Evaluation Techniques for Window Aggregates in Data Streams. Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data (SIGMOD), Baltimore, MD, USA.","DOI":"10.1145\/1066157.1066193"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Olejnik, K., Dacosta, I., Machado, J.S., Huguenin, K., Khan, M.E., and Hubaux, J.P. (2017, January 22\u201326). SmarPer: Context-Aware and Automatic Runtime-Permissions for Mobile Devices. Proceedings of the 2017 IEEE Symposium on Security and Privacy (SP), San Jose, CA, USA.","DOI":"10.1109\/SP.2017.25"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Navidan, H., Moghtadaiee, V., Nazaran, N., and Alishahi, M. (2022, January 6\u201310). Hide me Behind the Noise: Local Differential Privacy for Indoor Location Privacy. Proceedings of the 2022 IEEE European Symposium on Security and Privacy Workshops (EuroS & PW), Genoa, Italy.","DOI":"10.1109\/EuroSPW55150.2022.00061"},{"key":"ref_50","first-page":"3619","article-title":"Publishing Sensitive Time-Series Data under Preservation of Privacy and Distance Orders","volume":"8","author":"Choi","year":"2012","journal-title":"Int. J. Innov. Comput. Inf. Control."},{"key":"ref_51","first-page":"1","article-title":"When Machine Learning Meets Privacy: A Survey and Outlook","volume":"54","author":"Liu","year":"2021","journal-title":"ACM Comput. 
Surv."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Alpers, S., Oberweis, A., Pieper, M., Betz, S., Fritsch, A., Schiefer, G., and Wagner, M. (2017, January 13\u201316). PRIVACY-AVARE: An approach to manage and distribute privacy settings. Proceedings of the 2017 3rd IEEE International Conference on Computer and Communications (ICCC), Chengdu, China.","DOI":"10.1109\/CompComm.2017.8322784"},{"key":"ref_53","unstructured":"Kido, H., Yanagisawa, Y., and Satoh, T. (2005, January 11\u201314). An anonymous communication technique using dummies for location-based services. Proceedings of the 2005 International Conference on Pervasive Services (ICPS), Santorini, Greece."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Cliquet, A., Wiebe, S., Anderson, P., Saggio, G., Zwiggelaar, R., Gamboa, H., Fred, A., and Berm\u00fadez i Badia, S. (2018, January 19\u201321). How to Realize Device Interoperability and Information Security in mHealth Applications. Proceedings of the Biomedical Engineering Systems and Technologies: 11th International Joint Conference, BIOSTEC 2018, Funchal, Portugal. Revised Selected Papers.","DOI":"10.1007\/978-3-030-29196-9"},{"key":"ref_55","unstructured":"Stach, C. (2019, January 27\u201331). VAULT: A Privacy Approach towards High-Utility Time Series Data. Proceedings of the Thirteenth International Conference on Emerging Security Information, Systems and Technologies (SECURWARE), Nice, France."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"101523","DOI":"10.1016\/j.pmcj.2021.101523","article-title":"A survey on privacy issues and solutions for Voice-controlled Digital Assistants","volume":"80","author":"Reinhardt","year":"2022","journal-title":"Pervasive Mob. Comput."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Oh, S.J., Benenson, R., Fritz, M., and Schiele, B. (2016, January 11\u201314). Faceless Person Recognition: Privacy Implications in Social Media. 
Proceedings of the 14th European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46487-9_2"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Alpers, S., Betz, S., Fritsch, A., Oberweis, A., Schiefer, G., and Wagner, M. (2018, January 19\u201321). Citizen Empowerment by a Technical Approach for Privacy Enforcement. Proceedings of the 8th International Conference on Cloud Computing and Services Science (CLOSER), Funchal, Portugal.","DOI":"10.5220\/0006789805890595"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Stach, C., D\u00fcrr, F., Mindermann, K., Palanisamy, S.M., and Wagner, S. (2018, January 19\u201323). How a Pattern-based Privacy System Contributes to Improve Context Recognition. Proceedings of the 2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), Athens, Greece.","DOI":"10.1109\/PERCOMW.2018.8480227"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1007\/s10506-014-9155-5","article-title":"\u201cI am Spartacus\u201d: Privacy enhancing technologies, collaborative obfuscation and privacy as a public good","volume":"22","author":"Kwecka","year":"2014","journal-title":"Artif. Intell. Law"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"102488","DOI":"10.1016\/j.cose.2021.102488","article-title":"k-Anonymity in practice: How generalisation and suppression affect machine learning classifiers","volume":"111","author":"Henzl","year":"2021","journal-title":"Comput. Secur."},{"key":"ref_62","unstructured":"Dwork, C. (2006, January 10\u201314). Differential Privacy. 
Proceedings of the 33rd International Colloquium on Automata, Languages, and Programming (ICALP), Venice, Italy."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s40537-018-0124-9","article-title":"Differential privacy: Its technological prescriptive using big data","volume":"5","author":"Jain","year":"2018","journal-title":"J. Big Data"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Zhu, T., Li, G., Zhou, W., and Yu, P.S. (2017). Differentially Private Recommender System. Differential Privacy and Applications, Springer.","DOI":"10.1007\/978-3-319-62004-6_10"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Machanavajjhala, A., He, X., and Hay, M. (2017, January 14\u201319). Differential Privacy in the Wild: A Tutorial on Current Practices & Open Challenges. Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD), Chicago, IL, USA.","DOI":"10.1145\/3035918.3054779"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Stach, C., Alpers, S., Betz, S., D\u00fcrr, F., Fritsch, A., Mindermann, K., Palanisamy, S.M., Schiefer, G., Wagner, M., and Mitschang, B. (2018, January 26\u201328). The AVARE PATRON\u2014A Holistic Privacy Approach for the Internet of Things. Proceedings of the 15th International Joint Conference on e-Business and Telecommunications (SECRYPT), Porto, Portugal.","DOI":"10.5220\/0006850305380545"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Chai, Q., and Gong, G. (2012, January 10\u201315). Verifiable symmetric searchable encryption for semi-honest-but-curious cloud servers. 
Proceedings of the 2012 IEEE International Conference on Communications (ICC), Ottawa, ON, Canada.","DOI":"10.1109\/ICC.2012.6364125"},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1145\/3479587","article-title":"The Design of Reciprocal Learning Between Human and Artificial Intelligence","volume":"5","author":"Zagalsky","year":"2021","journal-title":"Proc. ACM Hum.-Comput. Interact."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Arcolezi, H.H., Couchot, J.F., Al Bouna, B., and Xiao, X. (2021, January 1\u20135). Random Sampling Plus Fake Data: Multidimensional Frequency Estimates With Local Differential Privacy. Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM), Gold Coast, QLD, Australia.","DOI":"10.1145\/3459637.3482467"},{"key":"ref_70","first-page":"57","article-title":"Technical Privacy Metrics: A Systematic Survey","volume":"51","author":"Wagner","year":"2018","journal-title":"ACM Comput. Surv."},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Oppold, S., and Herschel, M. (2020, January 8\u201312). A System Framework for Personalized and Transparent Data-Driven Decisions. Proceedings of the 32nd International Conference on Advanced Information Systems Engineering (CAiSE), Grenoble, France.","DOI":"10.1007\/978-3-030-49435-3_10"},{"key":"ref_72","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/s13222-021-00401-y","article-title":"Metrics and Algorithms for Locally Fair and Accurate Classifications using Ensembles","volume":"22","author":"Oppold","year":"2022","journal-title":"Datenbank Spektrum"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Gemp, I., Theocharous, G., and Ghavamzadeh, M. (2017, January 4\u20139). Automated Data Cleansing through Meta-Learning. 
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, USA.","DOI":"10.1609\/aaai.v31i2.19107"},{"key":"ref_74","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1007\/s12597-020-00467-4","article-title":"Automated Data Harmonization (ADH) using Artificial Intelligence (AI)","volume":"58","author":"Dutta","year":"2021","journal-title":"OPSEARCH"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Hammoudi, S., \u015amia\u0142ek, M., Camp, O., and Filipe, J. (2017, January 26\u201329). A Human-Centered Approach for Interactive Data Processing and Analytics. Proceedings of the Enterprise Information Systems: 19th International Conference, ICEIS 2017, Porto, Portugal. Revised Selected Papers.","DOI":"10.1007\/978-3-319-93375-7"},{"key":"ref_76","doi-asserted-by":"crossref","first-page":"105587","DOI":"10.1016\/j.clsr.2021.105587","article-title":"Legal aspects of data cleansing in medical AI","volume":"42","author":"Schneeberger","year":"2021","journal-title":"Comput. Law Secur. Rev."},{"key":"ref_77","unstructured":"El Emam, K., Mosquera, L., and Hoptroff, R. (2020). Practical Synthetic Data Generation, O\u2019Reilly."},{"key":"ref_78","unstructured":"Stach, C., Br\u00e4cker, J., Eichler, R., Giebler, C., and Mitschang, B. (December, January 29). Demand-Driven Data Provisioning in Data Lakes: BARENTS\u2014A Tailorable Data Preparation Zone. Proceedings of the 23rd International Conference on Information Integration and Web Intelligence (iiWAS), Linz, Austria."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Hosseinzadeh, M., Azhir, E., Ahmed, O.H., Ghafour, M.Y., Ahmed, S.H., Rahmani, A.M., and Vo, B. (2021). Data cleansing mechanisms and approaches for big data analytics: A systematic study. J. Ambient. Intell. Humaniz. Comput., 1\u201313.","DOI":"10.1007\/s12652-021-03590-2"},{"key":"ref_80","unstructured":"Sharma, B. (2018). 
Architecting Data Lakes: Data Management Architectures for Advanced Business Use Cases, O\u2019Reilly. [2nd ed.]."},{"key":"ref_81","first-page":"88","article-title":"How to Provide High-Utility Time Series Data in a Privacy-Aware Manner: A VAULT to Manage Time Series Data","volume":"13","author":"Stach","year":"2020","journal-title":"Int. J. Adv. Secur."},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Stach, C., Giebler, C., Wagner, M., Weber, C., and Mitschang, B. (2020, January 25\u201327). AMNESIA: A Technical Solution towards GDPR-compliant Machine Learning. Proceedings of the 6th International Conference on Information Systems Security and Privacy (ICISSP), Valletta, Malta.","DOI":"10.5220\/0008916700210032"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Mindermann, K., Riedel, F., Abdulkhaleq, A., Stach, C., and Wagner, S. (2017, January 4\u20138). Exploratory Study of the Privacy Extension for System Theoretic Process Analysis (STPA-Priv) to elicit Privacy Risks in eHealth. Proceedings of the 2017 IEEE 25th International Requirements Engineering Conference Workshops, 4th International Workshop on Evolving Security & Privacy Requirements Engineering (REW\/ESPRE), Lisbon, Portugal.","DOI":"10.1109\/REW.2017.30"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Shapiro, S.S. (2016, January 22\u201326). Privacy Risk Analysis Based on System Control Structures: Adapting System-Theoretic Process Analysis for Privacy Engineering. Proceedings of the 2016 IEEE Security and Privacy Workshops (SPW), San Jose, CA, USA.","DOI":"10.1109\/SPW.2016.15"},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Stach, C., and Mitschang, B. (2018, January 22\u201324). ACCESSORS: A Data-Centric Permission Model for the Internet of Things. 
Proceedings of the 4th International Conference on Information Systems Security and Privacy (ICISSP), Funchal, Portugal.","DOI":"10.5220\/0006572100300040"},{"key":"ref_86","doi-asserted-by":"crossref","unstructured":"Stach, C., and Steimle, F. (2019, January 8\u201312). Recommender-based Privacy Requirements Elicitation\u2014EPICUREAN: An Approach to Simplify Privacy Settings in IoT Applications with Respect to the GDPR. Proceedings of the 34th ACM\/SIGAPP Symposium On Applied Computing (SAC), Limassol, Cyprus.","DOI":"10.1145\/3297280.3297432"},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Mori, P., Furnell, S., and Camp, O. (2018, January 22\u201324). Elicitation of Privacy Requirements for the Internet of Things Using ACCESSORS. Proceedings of the Information Systems Security and Privacy: 4th International Conference, ICISSP 2018, Funchal, Portugal. Revised Selected Papers.","DOI":"10.1007\/978-3-030-25109-3"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Gritti, C., Chen, R., Susilo, W., and Plantard, T. (2017, January 13\u201315). Dynamic Provable Data Possession Protocols with Public Verifiability and Data Privacy. Proceedings of the 13th International Conference on Information Security Practice and Experience (ISPEC), Melbourne, VIC, Australia.","DOI":"10.1007\/978-3-319-72359-4_29"},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Gritti, C. (2020, January 17\u201319). Publicly Verifiable Proofs of Data Replication and Retrievability for Cloud Storage. Proceedings of the 2020 International Computer Symposium (ICS), Tainan, Taiwan.","DOI":"10.1109\/ICS51289.2020.00091"},{"key":"ref_90","unstructured":"Stach, C., Gritti, C., and Mitschang, B. (April, January 30). Bringing Privacy Control Back to Citizens: DISPEL\u2014A Distributed Privacy Management Platform for the Internet of Things. 
Proceedings of the 35th ACM\/SIGAPP Symposium on Applied Computing (SAC), Brno, Czech Republic."},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Gritti, C., \u00d6nen, M., and Molva, R. (2018, January 28\u201330). CHARIOT: Cloud-Assisted Access Control for the Internet of Things. Proceedings of the 2018 16th Annual Conference on Privacy, Security and Trust (PST), Belfast, Ireland.","DOI":"10.1109\/PST.2018.8514217"},{"key":"ref_92","doi-asserted-by":"crossref","unstructured":"Gritti, C., \u00d6nen, M., and Molva, R. (2019, January 8\u201312). Privacy-Preserving Delegable Authentication in the Internet of Things. Proceedings of the 34th ACM\/SIGAPP Symposium on Applied Computing (SAC), Limassol, Cyprus.","DOI":"10.1145\/3297280.3297365"},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Chaum, D., Damg\u00e5rd, I.B., and van de Graaf, J. (1988, January 16\u201320). Multiparty Computations Ensuring Privacy of Each Party\u2019s Input and Correctness of the Result. Proceedings of the 7th Annual International Cryptology Conference (CRYPTO), Santa Barbara, CA, USA.","DOI":"10.1007\/3-540-48184-2_7"},{"key":"ref_94","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1145\/359168.359176","article-title":"How to Share a Secret","volume":"22","author":"Shamir","year":"1979","journal-title":"Commun. ACM"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Barker, E. (2020). Recommendation for Key Management: Part 1\u2014General.","DOI":"10.6028\/NIST.SP.800-57pt1r5"},{"key":"ref_96","first-page":"405","article-title":"The Impact of Quantum Computing on Present Cryptography","volume":"9","author":"Mavroeidis","year":"2018","journal-title":"Int. J. Adv. Comput. Sci. 
Appl."},{"key":"ref_97","doi-asserted-by":"crossref","first-page":"142413","DOI":"10.1109\/ACCESS.2020.3013250","article-title":"A Comparison of Security and its Performance for Key Agreements in Post-Quantum Cryptography","volume":"8","author":"Borges","year":"2020","journal-title":"IEEE Access"},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Behringer, M., Hirmer, P., Fritz, M., and Mitschang, B. (2020, January 8\u201310). Empowering Domain Experts to Preprocess Massive Distributed Datasets. Proceedings of the 23rd International Conference on Business Information Systems (BIS), Colorado Springs, CO, USA.","DOI":"10.1007\/978-3-030-53337-3_5"},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Stach, C., and Brodt, A. (2011, January 6\u20139). vHike\u2014A Dynamic Ride-Sharing Service for Smartphones. Proceedings of the 2011 IEEE 12th International Conference on Mobile Data Management (MDM), Lule\u00e5, Sweden.","DOI":"10.1109\/MDM.2011.33"},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Stach, C. (2016, January 13\u201316). Secure Candy Castle\u2014A Prototype for Privacy-Aware mHealth Apps. 
Proceedings of the 2016 IEEE 17th International Conference on Mobile Data Management (MDM), Porto, Portugal.","DOI":"10.1109\/MDM.2016.64"}],"container-title":["Journal of Cybersecurity and Privacy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2624-800X\/2\/3\/31\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:08:43Z","timestamp":1760141323000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2624-800X\/2\/3\/31"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,15]]},"references-count":100,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2022,9]]}},"alternative-id":["jcp2030031"],"URL":"https:\/\/doi.org\/10.3390\/jcp2030031","relation":{},"ISSN":["2624-800X"],"issn-type":[{"type":"electronic","value":"2624-800X"}],"subject":[],"published":{"date-parts":[[2022,8,15]]}}}