{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T19:25:19Z","timestamp":1774121119543,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2019,11,8]],"date-time":"2019-11-08T00:00:00Z","timestamp":1573171200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The rapid progress in genome sequencing has led to high availability of genomic data. Studying these data can greatly help answer the key questions about disease associations and our evolution. However, due to growing privacy concerns about the sensitive information of participants, accessing key results and data of genomic studies (such as genome-wide association studies) is restricted to only trusted individuals. On the other hand, paving the way to biomedical breakthroughs and discoveries requires granting open access to genomic datasets. Privacy-preserving mechanisms can be a solution for granting wider access to such data while protecting their owners. In particular, there has been growing interest in applying the concept of differential privacy (DP) while sharing summary statistics about genomic data. DP provides a mathematically rigorous approach to prevent the risk of membership inference while sharing statistical information about a dataset. However, DP does not consider the dependence between tuples in the dataset, which may degrade the privacy guarantees offered by the DP.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this work, focusing on genomic datasets, we show this drawback of the DP and we propose techniques to mitigate it. First, using a real-world genomic dataset, we demonstrate the feasibility of an inference attack on differentially private query results by utilizing the correlations between the entries in the dataset. The results show the scale of vulnerability when we have dependent tuples in the dataset. We show that the adversary can infer sensitive genomic data about a user from the differentially private results of a query by exploiting the correlations between the genomes of family members. Second, we propose a mechanism for privacy-preserving sharing of statistics from genomic datasets to attain privacy guarantees while taking into consideration the dependence between tuples. By evaluating our mechanism on different genomic datasets, we empirically demonstrate that our proposed mechanism can achieve up to 50% better privacy than traditional DP-based solutions.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>https:\/\/github.com\/nourmadhoun\/Differential-privacy-genomic-inference-attack.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz837","type":"journal-article","created":{"date-parts":[[2019,11,6]],"date-time":"2019-11-06T12:15:00Z","timestamp":1573042500000},"page":"1696-1703","source":"Crossref","is-referenced-by-count":34,"title":["Differential privacy under dependent tuples\u2014the case of genomic privacy"],"prefix":"10.1093","volume":"36","author":[{"given":"Nour","family":"Almadhoun","sequence":"first","affiliation":[{"name":"Computer Engineering Department, Bilkent University , 06800 Ankara, Turkey"}]},{"given":"Erman","family":"Ayday","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, Bilkent University , 06800 Ankara, Turkey"},{"name":"Department of Electrical Engineering and Computer Science, Case Western Reserve University , Cleveland, OH 44106, USA"}]},{"given":"\u00d6zg\u00fcr","family":"Ulusoy","sequence":"additional","affiliation":[{"name":"Computer Engineering Department, Bilkent University , 06800 Ankara, Turkey"}]}],"member":"286","published-online":{"date-parts":[[2019,11,8]]},"reference":[{"key":"2023060911574189000_btz837-B1","first-page":"237","volume-title":"Data Privacy Management, and Security Assurance","author":"Alser","year":"2015"},{"key":"2023060911574189000_btz837-B2","doi-asserted-by":"crossref","first-page":"4255","DOI":"10.1093\/bioinformatics\/btz234","article-title":"Shouji: a fast and efficient pre-alignment filter for sequence alignment","volume":"35","author":"Alser","year":"2019","journal-title":"Bioinformatics"},{"key":"2023060911574189000_btz837-B3","doi-asserted-by":"crossref","first-page":"3355","DOI":"10.1093\/bioinformatics\/btx342","article-title":"Gatekeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping","volume":"33","author":"Alser","year":"2017","journal-title":"Bioinformatics"},{"key":"2023060911574189000_btz837-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2450142.2450148","article-title":"A learning theory approach to noninteractive database privacy","volume":"60","author":"Blum","year":"2013","journal-title":"JACM"},{"key":"2023060911574189000_btz837-B5","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1016\/j.ajhg.2018.07.015","article-title":"A one-penny imputed genome from next-generation reference panels","volume":"103","author":"Browning","year":"2018","journal-title":"Am. J. Hum. Genet"},{"key":"2023060911574189000_btz837-B6","first-page":"821","author":"Cao","year":"2017"},{"key":"2023060911574189000_btz837-B7","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1038\/gim.2015.187","article-title":"The Geisinger MyCode community health initiative: an electronic health record\u2013linked biobank for precision medicine research","volume":"18","author":"Carey","year":"2016","journal-title":"Genet. Med"},{"key":"2023060911574189000_btz837-B8","author":"Chaabane","year":"2012"},{"key":"2023060911574189000_btz837-B9","doi-asserted-by":"crossref","first-page":"653","DOI":"10.1007\/s00778-013-0344-8","article-title":"Correlated network data publication via differential privacy","volume":"23","author":"Chen","year":"2014","journal-title":"VLDB J"},{"key":"2023060911574189000_btz837-B10","author":"Commission","year":"2003"},{"key":"2023060911574189000_btz837-B11","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1186\/1751-0473-8-13","article-title":"Crowdsourcing the Corpasome","volume":"8","author":"Corpas","year":"2013","journal-title":"Source Code Biol. Med"},{"key":"2023060911574189000_btz837-B12","doi-asserted-by":"crossref","first-page":"989","DOI":"10.1126\/science.1133807","article-title":"HTRA1 promoter polymorphism in wet age-related macular degeneration","volume":"314","author":"DeWan","year":"2006","journal-title":"Science"},{"key":"2023060911574189000_btz837-B13","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1126\/science.1181498","article-title":"Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays","volume":"327","author":"Drmanac","year":"2010","journal-title":"Science"},{"key":"2023060911574189000_btz837-B14","first-page":"1","author":"Dwork","year":"2008"},{"key":"2023060911574189000_btz837-B15","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1038\/nrg3723","article-title":"Routes for breaching and protecting genetic privacy","volume":"15","author":"Erlich","year":"2014","journal-title":"Nat. Rev. Genet"},{"key":"2023060911574189000_btz837-B16","first-page":"17","author":"Fredrikson","year":"2014"},{"key":"2023060911574189000_btz837-B17","doi-asserted-by":"crossref","first-page":"D1","DOI":"10.1093\/nar\/gku1241","article-title":"The 2015 nucleic acids research database issue and molecular biology database collection","volume":"43","author":"Galperin","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023060911574189000_btz837-B18","first-page":"1447","author":"He","year":"2014"},{"key":"2023060911574189000_btz837-B19","doi-asserted-by":"crossref","first-page":"4618","DOI":"10.1002\/elps.200800456","article-title":"Advantages and limitations of next-generation sequencing technologies: a comparison of electrophoresis and non-electrophoresis methods","volume":"29","author":"Hert","year":"2008","journal-title":"Electrophoresis"},{"key":"2023060911574189000_btz837-B20","doi-asserted-by":"crossref","first-page":"e1000167","DOI":"10.1371\/journal.pgen.1000167","article-title":"Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays","volume":"4","author":"Homer","year":"2008","journal-title":"PLoS Genet"},{"key":"2023060911574189000_btz837-B21","first-page":"1141","author":"Humbert","year":"2013"},{"key":"2023060911574189000_btz837-B22","doi-asserted-by":"crossref","first-page":"1696","DOI":"10.1002\/ajmg.a.32322","article-title":"Relationship between public attitudes toward genomic studies related to medicine and their level of genomic literacy in Japan","volume":"146","author":"Ishiyama","year":"2008","journal-title":"Am. J. Med. Genet. A"},{"key":"2023060911574189000_btz837-B23","first-page":"1079","author":"Johnson","year":"2013"},{"key":"2023060911574189000_btz837-B24","first-page":"193","author":"Kifer","year":"2011"},{"key":"2023060911574189000_btz837-B25","first-page":"77","author":"Kifer","year":"2012"},{"key":"2023060911574189000_btz837-B26","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1007\/s10561-009-9145-0","article-title":"Public involvement in pharmacogenomics research: a national survey on public attitudes towards pharmacogenomics research and the willingness to donate DNA samples to a DNA bank in Japan","volume":"10","author":"Kobayashi","year":"2009","journal-title":"Cell Tissue Bank"},{"key":"2023060911574189000_btz837-B27","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1080\/15265161.2018.1431322","article-title":"Beyond consent: building trusting relationships with diverse populations in precision medicine research","volume":"18","author":"Kraft","year":"2018","journal-title":"Am. J. Bioeth"},{"key":"2023060911574189000_btz837-B28","doi-asserted-by":"crossref","first-page":"1019","DOI":"10.1002\/asi.20591","article-title":"The link-prediction problem for social networks","volume":"58","author":"Liben-Nowell","year":"2007","journal-title":"J. Am. Soc. Inf. Sci. Tec"},{"key":"2023060911574189000_btz837-B29","first-page":"21","author":"Liu","year":"2016"},{"key":"2023060911574189000_btz837-B30","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/j.cose.2018.12.017","article-title":"Achieving correlated differential privacy of big data publication","volume":"82","author":"Lv","year":"2019","journal-title":"Comput. Secur"},{"key":"2023060911574189000_btz837-B31","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1038\/gim.2015.138","article-title":"A systematic literature review of individuals\u2019 perspectives on broad consent and data sharing in the United States","volume":"18","author":"Nanibaa\u2019A","year":"2016","journal-title":"Genet. Med"},{"key":"2023060911574189000_btz837-B32","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1109\/TMC.2016.2561281","article-title":"Quantifying interdependent privacy risks with location data","volume":"16","author":"Olteanu","year":"2017","journal-title":"IEEE Trans. Mob. Comput"},{"key":"2023060911574189000_btz837-B33","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/s10561-007-9051-2","article-title":"Attitudes and perceptions of patients towards methods of establishing a DNA biobank","volume":"9","author":"Pulley","year":"2008","journal-title":"Cell Tissue Bank"},{"key":"2023060911574189000_btz837-B34","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1007\/s12687-013-0146-0","article-title":"Biobanking for research: a survey of patient population attitudes and understanding","volume":"4","author":"Rahm","year":"2013","journal-title":"J. Community Genet"},{"key":"2023060911574189000_btz837-B35","first-page":"1291","author":"Song","year":"2017"},{"key":"2023060911574189000_btz837-B36","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1007\/s12687-014-0191-3","article-title":"Genetic research participation in a young adult community sample","volume":"5","author":"Storr","year":"2014","journal-title":"J. Commun. Genet"},{"key":"2023060911574189000_btz837-B37","first-page":"137","article-title":"Privacy-preserving data sharing for genome-wide association studies","volume":"5","author":"Uhlerop","year":"2013","journal-title":"J. Priv. Confid"},{"key":"2023060911574189000_btz837-B38","first-page":"747","author":"Yang","year":"2015"},{"key":"2023060911574189000_btz837-B39","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/j.jbi.2014.01.008","article-title":"Scalable privacy-preserving data sharing methodology for genome-wide association studies","volume":"50","author":"Yu","year":"2014","journal-title":"J. Biomed. Inform"},{"key":"2023060911574189000_btz837-B40","first-page":"1","author":"Zhao","year":"2017"},{"key":"2023060911574189000_btz837-B41","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1109\/TIFS.2014.2368363","article-title":"Correlated differential privacy: hiding information in non-IID data set","volume":"10","author":"Zhu","year":"2015","journal-title":"IEEE Trans. Inf. Forensics Secur"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz837\/30490542\/btz837.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/6\/1696\/50553768\/bioinformatics_36_6_1696.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/6\/1696\/50553768\/bioinformatics_36_6_1696.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,9]],"date-time":"2023-06-09T11:58:11Z","timestamp":1686311891000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/6\/1696\/5614817"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,11,8]]},"references-count":41,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2020,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz837","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,3,15]]},"published":{"date-parts":[[2019,11,8]]}}}