{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T22:25:04Z","timestamp":1780439104821,"version":"3.54.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2016,10,4]],"date-time":"2016-10-04T00:00:00Z","timestamp":1475539200000},"content-version":"vor","delay-in-days":707,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0"}],"content-domain":{"domain":["bmj.com"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Objective To propose a new approach to privacy preserving data selection, which helps the data users access human genomic datasets efficiently without undermining patients\u2019 privacy.<\/jats:p><jats:p>Methods Our idea is to let each data owner publish a set of differentially-private pilot data, on which a data user can test-run arbitrary association-test algorithms, including those not known to the data owner a priori. We developed a suite of new techniques, including a pilot-data generation approach that leverages the linkage disequilibrium in the human genome to preserve both the utility of the data and the privacy of the patients, and a utility evaluation method that helps the user assess the value of the real data from its pilot version with high confidence.<\/jats:p><jats:p>Results We evaluated our approach on real human genomic data using four popular association tests. Our study shows that the proposed approach can help data users make the right choices in most cases.<\/jats:p><jats:p>Conclusions Even though the pilot data cannot be directly used for scientific discovery, it provides a useful indication of which datasets are more likely to be useful to data users, who can therefore approach the appropriate data owners to gain access to the data.<\/jats:p>","DOI":"10.1136\/amiajnl-2014-003043","type":"journal-article","created":{"date-parts":[[2014,10,29]],"date-time":"2014-10-29T03:49:40Z","timestamp":1414554580000},"page":"100-108","update-policy":"https:\/\/doi.org\/10.1136\/crossmarkpolicy","source":"Crossref","is-referenced-by-count":20,"title":["Choosing blindly but wisely: differentially private solicitation of DNA datasets for disease marker discovery"],"prefix":"10.1093","volume":"22","author":[{"given":"Yongan","family":"Zhao","sequence":"first","affiliation":[{"name":"Indiana University, Bloomington, Indiana, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaofeng","family":"Wang","sequence":"additional","affiliation":[{"name":"Indiana University, Bloomington, Indiana, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoqian","family":"Jiang","sequence":"additional","affiliation":[{"name":"University of California, San Diego (UCSD), San Diego, California, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lucila","family":"Ohno-Machado","sequence":"additional","affiliation":[{"name":"University of California, San Diego (UCSD), San Diego, California, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haixu","family":"Tang","sequence":"additional","affiliation":[{"name":"Indiana University, Bloomington, Indiana, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2014,10,28]]},"reference":[{"key":"2020110613002385500_R1","doi-asserted-by":"crossref","first-page":"1759","DOI":"10.1056\/NEJMra0808700","article-title":"Genomewide association studies and human disease","volume":"360","author":"Hardy","year":"2009","journal-title":"N Engl J Med"},{"key":"2020110613002385500_R2","doi-asserted-by":"crossref","first-page":"1166","DOI":"10.1056\/NEJMsb022863","article-title":"Race and genomics","volume":"348","author":"Cooper","year":"2003","journal-title":"N Engl J Med"},{"key":"2020110613002385500_R3","first-page":"698","article-title":"Frequency of three Hex A mutant alleles among Jewish and non-Jewish carriers identified in a Tay-Sachs screening program","volume":"47","author":"Paw","year":"1990","journal-title":"Am J Hum Genet"},{"key":"2020110613002385500_R4","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1002\/humu.1380010304","article-title":"Mutations and sequence variations detected in the cystic fibrosis transmembrane conductance regulator (CFTR) gene: a report from the Cystic Fibrosis Genetic Analysis Consortium","volume":"1","author":"Tsui","year":"1992","journal-title":"Hum Mutat"},{"key":"2020110613002385500_R5","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1046\/j.1365-2141.2003.04769.x","article-title":"Genetic insights into the clinical diversity of \u03b2 thalassaemia","volume":"124","author":"Thein","year":"2004","journal-title":"Br J Haematol"},{"key":"2020110613002385500_R6","doi-asserted-by":"crossref","first-page":"e1000167","DOI":"10.1371\/journal.pgen.1000167","article-title":"Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays","volume":"4","author":"Homer","year":"2008","journal-title":"PLoS Genet"},{"key":"2020110613002385500_R7","doi-asserted-by":"crossref","first-page":"965","DOI":"10.1038\/ng.436","article-title":"Genomic privacy and limits of individual detection in a pool","volume":"41","author":"Sankararaman","year":"2009","journal-title":"Nat Genet"},{"key":"2020110613002385500_R8"},{"key":"2020110613002385500_R9","article-title":"Learning your identity and disease from research papers: information leaks in genome wide association study","year":"2009"},{"key":"2020110613002385500_R10","doi-asserted-by":"crossref","first-page":"e1002822","DOI":"10.1371\/journal.pcbi.1002822","article-title":"Genome-wide association studies","volume":"8","author":"Bush","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"2020110613002385500_R11","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1038\/nrg2813","article-title":"New approaches to population stratification in genome-wide association studies","volume":"11","author":"Price","year":"2010","journal-title":"Nat Rev Genet"},{"key":"2020110613002385500_R12","article-title":"Revealing information while preserving privacy","year":"2003"},{"key":"2020110613002385500_R13","first-page":"265","article-title":"Calibrating noise to sensitivity in private data analysis","author":"Dwork","year":"2006"},{"key":"2020110613002385500_R14","article-title":"Privacy preserving GWAS data sharing","year":"2011"},{"key":"2020110613002385500_R15","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1038\/nrg3067","article-title":"Assessing and managing risk when sharing aggregate genetic variant data","volume":"12","author":"Craig","year":"2011","journal-title":"Nat Rev Genet"},{"key":"2020110613002385500_R16","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1038\/nrg2573","article-title":"Data sharing in genomics\u2014re-shaping scientific practice","volume":"10","author":"Kaye","year":"2009","journal-title":"Nat Rev Genet"},{"key":"2020110613002385500_R17","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1038\/ijo.2008.277","article-title":"Subtyping obesity with microarrays: implications for the diagnosis and treatment of obesity","volume":"33","author":"Wang","year":"2009","journal-title":"Int J Obes"},{"key":"2020110613002385500_R18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/11787006_1","article-title":"Differential privacy","volume-title":"Automata, languages and programming","author":"Dwork","year":"2006"},{"key":"2020110613002385500_R19","doi-asserted-by":"crossref","first-page":"2225","DOI":"10.1126\/science.1069424","article-title":"The structure of haplotype blocks in the human genome","volume":"296","author":"Gabriel","year":"2002","journal-title":"Science"},{"key":"2020110613002385500_R20","doi-asserted-by":"crossref","first-page":"908","DOI":"10.1101\/gr.1837404","article-title":"Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies","volume":"14","author":"Zhang","year":"2004","journal-title":"Genome Res"},{"key":"2020110613002385500_R21","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1111\/j.1469-1809.2010.00580.x","article-title":"A Validation Study of Type 2 Diabetes-related Variants of the TCF7L2, HHEX, KCNJ11, and ADIPOQ Genes in one Endogamous Ethnic Group of North India","volume":"74","author":"Gupta","year":"2010","journal-title":"Ann Hum Genet"},{"key":"2020110613002385500_R22","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/S1470-2045(11)70302-3","article-title":"Independent validation of genes and polymorphisms reported to be associated with radiation toxicity: a prospective analysis study","volume":"13","author":"Barnett","year":"2012","journal-title":"Lancet Oncol"},{"key":"2020110613002385500_R23","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1038\/nature05911","article-title":"Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls","volume":"447","author":"Burton","year":"2007","journal-title":"Nature"},{"key":"2020110613002385500_R24","first-page":"16","article-title":"Transforming growth factor-\u03b2 signaling pathway in patients with Kawasaki disease","volume":"4","author":"Shimizu","year":"2011","journal-title":"Circulation"},{"key":"2020110613002385500_R25","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1016\/S0140-6736(04)16814-1","article-title":"Kawasaki syndrome","volume":"364","author":"Burns","year":"2004","journal-title":"Lancet"},{"key":"2020110613002385500_R26","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J R Stat Soc B"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/1\/100\/34145394\/amiajnl-2014-003043.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/1\/100\/34145394\/amiajnl-2014-003043.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,4]],"date-time":"2024-06-04T13:10:00Z","timestamp":1717506600000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/22\/1\/100\/834712"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,10,28]]},"references-count":26,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2014,10,28]]},"published-print":{"date-parts":[[2015,1,1]]}},"URL":"https:\/\/doi.org\/10.1136\/amiajnl-2014-003043","relation":{},"ISSN":["1527-974X","1067-5027"],"issn-type":[{"value":"1527-974X","type":"electronic"},{"value":"1067-5027","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,1]]},"published":{"date-parts":[[2014,10,28]]}}}