{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,7]],"date-time":"2025-10-07T14:30:19Z","timestamp":1759847419503},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"19","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Expression databases, including the Gene Expression Omnibus and ArrayExpress, have experienced significant growth over the past decade and now hold hundreds of thousands of arrays from multiple species. Since most drugs are initially tested on model organisms, the ability to compare expression experiments across species may help identify pathways that are activated in a similar way in humans and other organisms. However, while several methods exist for finding co-expressed genes in the same species as a query gene, looking at co-expression of homologs or arbitrary genes in other species is challenging. Unlike sequence, which is static, expression is dynamic and changes between tissues, conditions and time. Thus, to carry out cross-species analysis using these databases, we need methods that can match experiments in one species with experiments in another species.<\/jats:p>\n               <jats:p>Results: To facilitate queries in large databases, we developed a new method for comparing expression experiments from different species. We define a distance metric between the ranking of orthologous genes in the two species. We show how to solve an optimization problem for learning the parameters of this function using a training dataset of known similar expression experiments pairs. The function we learn outperforms previous methods and simpler rank comparison methods that have been used in the past for single species analysis. We used our method to compare millions of array pairs from mouse and human expression experiments. The resulting matches can be used to find functionally related genes, to hypothesize about biological response mechanisms and to highlight conditions and diseases that are activating similar pathways in both species.<\/jats:p>\n               <jats:p>Availability: Supporting methods, results and a Matlab implementation are available from http:\/\/sb.cs.cmu.edu\/ExpQ\/<\/jats:p>\n               <jats:p>Contact: \u00a0zivbj@cs.cmu.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq451","type":"journal-article","created":{"date-parts":[[2010,8,12]],"date-time":"2010-08-12T00:32:12Z","timestamp":1281573132000},"page":"2416-2423","source":"Crossref","is-referenced-by-count":25,"title":["Cross-species queries of large gene expression databases"],"prefix":"10.1093","volume":"26","author":[{"given":"Hai-Son","family":"Le","sequence":"first","affiliation":[{"name":"1 Machine Learning Department, Carnegie Mellon University, 2Department of Pathology, University of Pittsburgh Medical School and 3Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zolt\u00e1n N.","family":"Oltvai","sequence":"additional","affiliation":[{"name":"1 Machine Learning Department, Carnegie Mellon University, 2Department of Pathology, University of Pittsburgh Medical School and 3Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ziv","family":"Bar-Joseph","sequence":"additional","affiliation":[{"name":"1 Machine Learning Department, Carnegie Mellon University, 2Department of Pathology, University of Pittsburgh Medical School and 3Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA"},{"name":"1 Machine Learning Department, Carnegie Mellon University, 2Department of Pathology, University of Pittsburgh Medical School and 3Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2010,8,11]]},"reference":[{"key":"2023012508172603600_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012508172603600_B2","first-page":"937","article-title":"Learning a mahalanobis metric from equivalence constraints","volume":"6","author":"Bar-Hillel","year":"2005","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508172603600_B3","doi-asserted-by":"crossref","first-page":"4164","DOI":"10.1073\/pnas.0308531101","article-title":"Metagenes and molecular pattern discovery using matrix factorization","volume":"101","author":"Brunet","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508172603600_B4","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1517\/17425255.4.7.871","article-title":"Species selection considerations for preclinical toxicology studies for biotherapeutics","volume":"4","author":"Bussiere","year":"2008","journal-title":"Expert Opin. Drug Metab. Toxicol."},{"key":"2023012508172603600_B5","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1186\/jbiol130","article-title":"Conservation of core gene expression in vertebrate tissues","volume":"8","author":"Chan","year":"2009","journal-title":"J. Biol."},{"key":"2023012508172603600_B6","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1145\/1143844.1143874","article-title":"The relationship between precision-recall and ROC curves","volume-title":"ICML'06: Proceedings of the 23rd International Conference on Machine Learning.","author":"Davis","year":"2006"},{"key":"2023012508172603600_B7","volume-title":"Group Representations in Probability and Statistics. Institute of Mathematical Statistics Lecture Notes\u2014Monograph Series, 11.","author":"Diaconis","year":"1988"},{"key":"2023012508172603600_B8","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1186\/1471-2105-7-191","article-title":"STEM: a tool for the analysis of short time series gene expression data","volume":"7","author":"Ernst","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508172603600_B9","doi-asserted-by":"crossref","first-page":"3103","DOI":"10.1093\/bioinformatics\/btm462","article-title":"CellMontage: similar expression profile search server","volume":"23","author":"Fujibuchi","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508172603600_B10","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The Elements of Statistical Learning.","author":"Hastie","year":"2009"},{"issue":"Suppl. 1","key":"2023012508172603600_B11","doi-asserted-by":"crossref","first-page":"S115","DOI":"10.1093\/bioinformatics\/17.suppl_1.S115","article-title":"GEST: a gene expression search tool based on a novel Bayesian similarity metric","volume":"17","author":"Hunter","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012508172603600_B12","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1038\/nature05186","article-title":"Co-evolution of transcriptional and post-translational cell-cycle regulation","volume":"443","author":"Jensen","year":"2006","journal-title":"Nature."},{"key":"2023012508172603600_B13","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1038\/nrm2281","article-title":"Predicting protein function from sequence and structure","volume":"8","author":"Lee","year":"2007","journal-title":"Nat. Rev. Mol. Cell Biol."},{"key":"2023012508172603600_B14","doi-asserted-by":"crossref","first-page":"R164","DOI":"10.1186\/gb-2008-9-11-r164","article-title":"Genome adaptation to chemical stress: clues from comparative transcriptomics in Saccharomyces cerevisiae and Candida glabrata","volume":"9","author":"Lelandais","year":"2008","journal-title":"Genome Biol."},{"key":"2023012508172603600_B15","doi-asserted-by":"crossref","first-page":"W105","DOI":"10.1093\/nar\/gkm408","article-title":"Cross-species microarray analysis with the OSCAR system suggests an INSR\u2013Pax6\u2013NQO1 neuro-protective pathway in aging and Alzheimer's disease","volume":"35","author":"Lu","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508172603600_B16","doi-asserted-by":"crossref","first-page":"1476","DOI":"10.1093\/bioinformatics\/btp247","article-title":"Cross species analysis of microarray expression data","volume":"25","author":"Lu","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508172603600_B17","volume-title":"Numerical Optimization. Springer Series in Operations Research.","author":"Nocedal","year":"2006"},{"key":"2023012508172603600_B18","doi-asserted-by":"crossref","first-page":"1828","DOI":"10.1101\/gr.1125403","article-title":"A gene recommender algorithm to identify coexpressed genes in C. elegans","volume":"13","author":"Owen","year":"2003","journal-title":"Genome Res."},{"key":"2023012508172603600_B19","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1038\/nrd2110","article-title":"The mighty mouse: genetically engineered mouse models in cancer drug development","volume":"5","author":"Sharpless","year":"2006","journal-title":"Nat. Rev. Drug Discov."},{"key":"2023012508172603600_B20","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1126\/science.1087447","article-title":"A gene-coexpression network for global discovery of conserved genetic modules","volume":"302","author":"Stuart","year":"2003","journal-title":"Science"},{"key":"2023012508172603600_B21","doi-asserted-by":"crossref","first-page":"6062","DOI":"10.1073\/pnas.0400782101","article-title":"A gene atlas of the mouse and human protein-encoding transcriptomes","volume":"101","author":"Su","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508172603600_B22","doi-asserted-by":"crossref","first-page":"5959","DOI":"10.1073\/pnas.0701068104","article-title":"Metagene projection for cross-platform, cross-species characterization of global transcriptional states","volume":"104","author":"Tamayo","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508172603600_B23","doi-asserted-by":"crossref","first-page":"1977","DOI":"10.1091\/mbc.02-02-0030","article-title":"Identification of genes periodically expressed in the human cell cycle and their expression in tumors","volume":"13","author":"Whitfield","year":"2002","journal-title":"Mol. Biol. Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/19\/2416\/48857397\/bioinformatics_26_19_2416.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/19\/2416\/48857397\/bioinformatics_26_19_2416.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:19:24Z","timestamp":1674634764000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/19\/2416\/229773"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,8,11]]},"references-count":23,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2010,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq451","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,10,1]]},"published":{"date-parts":[[2010,8,11]]}}}