{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T17:46:46Z","timestamp":1761155206362,"version":"3.37.0"},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: We address the problem of identifying differentially expressed genes between two conditions in the scenario where the data arise from an observational study, in which confounding factors are likely to be present.<\/jats:p><jats:p>Results: We suggest to use matching methods to balance two groups of observed cases on measured covariates, and to identify differentially expressed genes using a test suited to matched data. We illustrate this approach on two microarray studies: the first study consists of data from patients with two cancer subtypes, and the second study consists of data from AMKL patients with and without Down syndrome.<\/jats:p><jats:p>Availability: R code (www.r-project.org) for implementing our approach is included as Supplementary Material.<\/jats:p><jats:p>Contact: \u00a0ruheller@whatron.upenn.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn650","type":"journal-article","created":{"date-parts":[[2008,12,20]],"date-time":"2008-12-20T01:14:42Z","timestamp":1229735682000},"page":"904-909","source":"Crossref","is-referenced-by-count":20,"title":["Matching methods for observational microarray studies"],"prefix":"10.1093","volume":"25","author":[{"given":"Ruth","family":"Heller","sequence":"first","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340 and 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elisabetta","family":"Manduchi","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340 and 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dylan S.","family":"Small","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340 and 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,12,19]]},"reference":[{"key":"2023013110171746900_B1","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate - a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Stat. Soc. B Met."},{"key":"2023013110171746900_B2","doi-asserted-by":"crossref","first-page":"3339","DOI":"10.1073\/pnas.0511150103","article-title":"Identification of distinct molecular phenotypes in acute megakaryoblastic leukemia by gene expression profiling","volume":"103","author":"Bourquina","year":"2006","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013110171746900_B3","doi-asserted-by":"crossref","first-page":"2771","DOI":"10.1182\/blood-2003-09-3243","article-title":"Gene expression profile of adult t-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival","volume":"103","author":"Chiaretti","year":"2004","journal-title":"Blood"},{"key":"2023013110171746900_B4","doi-asserted-by":"crossref","DOI":"10.1007\/b137845","volume-title":"Statistical Methods in Bioinformatics: An Introduction","author":"Ewens","year":"2005"},{"key":"2023013110171746900_B5","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1080\/10618600.1993.10474623","article-title":"Comparison of multivariate matching methods: structures, distances, and algorithms","volume":"2","author":"Gu","year":"1993","journal-title":"J. Comput. Graph. Stat."},{"key":"2023013110171746900_B6","doi-asserted-by":"crossref","first-page":"609","DOI":"10.1198\/016214504000000647","article-title":"Full matching in an observational study of coaching for the sat","volume":"99","author":"Hansen","year":"2004","journal-title":"J. Am. Stat. Assoc."},{"journal-title":"Functions for optimal matching.","year":"2008","author":"Hansen","key":"2023013110171746900_B7"},{"key":"2023013110171746900_B8","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1037\/1082-989X.12.3.247","article-title":"Combining propensity score matching and group-based trajectory analysis in an observational study","volume":"12","author":"Haviland","year":"2007","journal-title":"Psychol. Methods"},{"key":"2023013110171746900_B9","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1214\/aoms\/1177704575","article-title":"Rank methods for combination of independent experiments in analysis of variance","volume":"33","author":"Hodges","year":"1962","journal-title":"Ann. Math. Stat."},{"key":"2023013110171746900_B10","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1093\/bioinformatics\/btm531","article-title":"Globalancova: exploration and assessment of gene group effects","volume":"24","author":"Hummel","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013110171746900_B11","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1111\/j.1467-985X.2007.00527.x","article-title":"Misunderstandings between experimentalists and observationalists about causal inference","volume":"171","author":"Imai","year":"2008","journal-title":"J. R. Stat. Soc. [ser. A]"},{"key":"2023013110171746900_B12","doi-asserted-by":"crossref","first-page":"3553","DOI":"10.1093\/bioinformatics\/bth442","article-title":"The baumgartner-wei ss-schindler test for the detection of differentially expressed genes in replicated microarray experiments","volume":"20","author":"Neuhauser","year":"2004","journal-title":"Bioinformatics"},{"key":"2023013110171746900_B13","doi-asserted-by":"crossref","first-page":"1215","DOI":"10.2307\/2533460","article-title":"A cautionary note on applying scores in stratified data","volume":"50","author":"Podgor","year":"1994","journal-title":"Biometrics"},{"key":"2023013110171746900_B14","doi-asserted-by":"crossref","first-page":"690","DOI":"10.1016\/j.tig.2003.10.005","article-title":"Epidemiology, cancer genetics and microarrays: making correct inferences, using appropriate designs","volume":"19","author":"Potter","year":"2003","journal-title":"Trends Genet."},{"key":"2023013110171746900_B15","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1111\/j.2517-6161.1991.tb01848.x","article-title":"A characterization of optimal designs for observational studies","volume":"53","author":"Rosenbaum","year":"1991","journal-title":"J. R. Stat. Soc."},{"key":"2023013110171746900_B16","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3692-2","volume-title":"Observational Studies","author":"Rosenbaum","year":"2002"},{"key":"2023013110171746900_B17","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1080\/01621459.1984.10478078","article-title":"Reducing bias in observational studies using subclassification on the propensity score","volume":"79","author":"Rosenbaum","year":"1984","journal-title":"J. Am. Stat. Assoc."},{"key":"2023013110171746900_B18","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1080\/00031305.1985.10479383","article-title":"Constructing a control-group using multivariate matched sampling methods that incorporate the propensity score","volume":"39","author":"Rosenbaum","year":"1985","journal-title":"Am. Stat."},{"key":"2023013110171746900_B19","first-page":"318","article-title":"Using multivariate matched sampling and regression adjustment to control bias in observational studies","volume":"74","author":"Rubin","year":"1979","journal-title":"J. Am. Stat. Assoc."},{"key":"2023013110171746900_B20","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1002\/sim.2739","article-title":"The design versus the analysis of observational studies for causal effects: parallels with the design of randomization trials","volume":"26","author":"Rubin","year":"2007","journal-title":"Stat. Med."},{"key":"2023013110171746900_B21","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1080\/01621459.2000.10474233","article-title":"Combining propensity score matching with additional adjustments for prognostic covariates","volume":"95","author":"Rubin","year":"2000","journal-title":"J. Am. Stat. Assoc."},{"key":"2023013110171746900_B22","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/0-387-29362-0_23","article-title":"Limma: linear models for microarray data","volume-title":"Bioinformatics and Computational Biology Solutions using R and Bioconductor.","author":"Smyth","year":"2005"},{"key":"2023013110171746900_B23","doi-asserted-by":"crossref","first-page":"1454","DOI":"10.1093\/bioinformatics\/18.11.1454","article-title":"Nonparametric methods for identifying differentially expressed genes in microarray data","volume":"18","author":"Troyanskaya","year":"2002","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/7\/904\/48985751\/bioinformatics_25_7_904.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/7\/904\/48985751\/bioinformatics_25_7_904.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,6]],"date-time":"2025-02-06T09:14:00Z","timestamp":1738833240000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/7\/904\/209969"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,12,19]]},"references-count":23,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2009,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn650","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2009,4,1]]},"published":{"date-parts":[[2008,12,19]]}}}