{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T22:53:24Z","timestamp":1762210404828},"reference-count":58,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: A critical challenge of the post-genomic era is to understand how genes are differentially regulated even when they belong to a given network. Because the fundamental mechanism controlling gene expression operates at the level of transcription initiation, computational techniques have been developed that identify cis regulatory features and map such features into expression patterns to classify genes into distinct networks. However, these methods are not focused on distinguishing between differentially regulated genes within a given network. Here we describe an unsupervised machine learning method, termed GPS for gene promoter scan, that discriminates among co-regulated promoters by simultaneously considering both cis-acting regulatory features and gene expression. GPS is particularly useful for knowledge discovery in environments with reduced datasets and high levels of uncertainty.<\/jats:p>\n               <jats:p>Results: Application of this method to the enteric bacteria Escherichia coli and Salmonella enterica uncovered novel members, as well as regulatory interactions in the regulon controlled by the PhoP protein that were not discovered using previous approaches. 
The predictions made by GPS were experimentally validated to establish that the PhoP protein uses multiple mechanisms to control gene transcription, and is a central element in a highly connected network.<\/jats:p>\n               <jats:p>Availability: The scripts and programs used in this work are accessible from the gps-tools.wustl.edu website. Data and predictions are available by request.<\/jats:p>\n               <jats:p>Contact: \u00a0groisman@borcim.wustl.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0<\/jats:p>","DOI":"10.1093\/bioinformatics\/bti672","type":"journal-article","created":{"date-parts":[[2005,9,14]],"date-time":"2005-09-14T03:13:12Z","timestamp":1126667592000},"page":"4073-4083","source":"Crossref","is-referenced-by-count":38,"title":["Analysis of differentially-regulated genes within a regulatory network by GPS genome navigation"],"prefix":"10.1093","volume":"21","author":[{"given":"Igor","family":"Zwir","sequence":"first","affiliation":[{"name":"Department of Molecular Microbiology, Howard Hughes Medical Institute, Washington University School of Medicine \u00a0 Campus Box 8230, 660 S. Euclid Avenue, St Louis, MO 63110, USA"}]},{"given":"Henry","family":"Huang","sequence":"additional","affiliation":[{"name":"Department of Molecular Microbiology, Howard Hughes Medical Institute, Washington University School of Medicine \u00a0 Campus Box 8230, 660 S. Euclid Avenue, St Louis, MO 63110, USA"}]},{"given":"Eduardo A.","family":"Groisman","sequence":"additional","affiliation":[{"name":"Department of Molecular Microbiology, Howard Hughes Medical Institute, Washington University School of Medicine \u00a0 Campus Box 8230, 660 S. 
Euclid Avenue, St Louis, MO 63110, USA"}]}],"member":"286","published-online":{"date-parts":[[2005,9,13]]},"reference":[{"key":"2023061007105431400_b1","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1109\/69.553164","article-title":"Parallel mining of association rules","volume":"8","author":"Agrawal","year":"1996","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"2023061007105431400_b2","doi-asserted-by":"crossref","first-page":"6745","DOI":"10.1073\/pnas.96.12.6745","article-title":"Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays","volume":"96","author":"Alon","year":"1999","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061007105431400_b3","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023061007105431400_b4","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1038\/nature03178","article-title":"The simplicity of metazoan cell lineages","volume":"433","author":"Azevedo","year":"2005","journal-title":"Nature"},{"key":"2023061007105431400_b5","doi-asserted-by":"crossref","first-page":"1337","DOI":"10.1038\/nbt890","article-title":"Computational discovery of gene modules and regulatory networks","volume":"21","author":"Bar-Joseph","year":"2003","journal-title":"Nat. 
Biotechnol."},{"key":"2023061007105431400_b6","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1016\/S0092-8674(04)00304-6","article-title":"Predicting gene expression from sequence","volume":"117","author":"Beer","year":"2004","journal-title":"Cell"},{"key":"2023061007105431400_b7","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-3-research0013","article-title":"Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA","volume":"3","author":"Benitez-Bellon","year":"2002","journal-title":"Genome Biol."},{"key":"2023061007105431400_b8","doi-asserted-by":"crossref","first-page":"F6.1.1","DOI":"10.1887\/0750304278\/b438c75","article-title":"Pattern analysis","volume-title":"Handbook of Fuzzy Computation","author":"Bezdek","year":"1998"},{"key":"2023061007105431400_b9","volume-title":"Fuzzy Models for Pattern Recognition: Methods that Search for Structures in Data","author":"Bezdek","year":"1992"},{"key":"2023061007105431400_b10","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-2660-4","volume-title":"Selecting Models from Data: Artificial Intelligence and Statistics IV","author":"Cheeseman","year":"1994"},{"key":"2023061007105431400_b11","first-page":"507","article-title":"Optimal structure identification with greedy search","volume":"3","author":"Chickering","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023061007105431400_b12","doi-asserted-by":"crossref","first-page":"3339","DOI":"10.1073\/pnas.0630591100","article-title":"Integrating regulatory motif discovery and genome-wide expression analysis","volume":"100","author":"Conlon","year":"2003","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2023061007105431400_b13","first-page":"222","article-title":"Elvira: an environment for probabilistic graphical models","author":"Consortium","year":"2002"},{"key":"2023061007105431400_b14","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1109\/51.940050","article-title":"Structural mining of molecular biology data","volume":"20","author":"Cook","year":"2001","journal-title":"IEEE Eng. Med. Biol. Mag."},{"key":"2023061007105431400_b15","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1128\/mr.55.3.371-394.1991","article-title":"Control site location and transcriptional regulation in Escherichia coli","volume":"55","author":"Collado-Vides","year":"1991","journal-title":"Microbiol. Rev."},{"key":"2023061007105431400_b16","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/BF00994110","article-title":"A Bayesian method for the induction of probabilistic networks from data","volume":"9","author":"Cooper","year":"1992","journal-title":"Machine Learning"},{"key":"2023061007105431400_b17","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1109\/91.983275","article-title":"Linguistic modeling by hierarchical systems of linguistic rules","volume":"10","author":"Cordon","year":"2002","journal-title":"IEEE Trans. Fuzzy Syst."},{"key":"2023061007105431400_b18","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.fss.2004.10.016","article-title":"A hybrid promoter analysis methodology for prokaryotic genomes","volume":"152","author":"Cotik","year":"2005","journal-title":"Fuzzy Set. 
Syst."},{"key":"2023061007105431400_b19","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1101\/gr.849004","article-title":"WebLogo: a sequence logo generator","volume":"14","author":"Crooks","year":"2004","journal-title":"Genome Res."},{"key":"2023061007105431400_b20","volume-title":"Multi-Objective Optimization Using Evolutionary Algorithms","author":"Deb","year":"2001"},{"key":"2023061007105431400_b21","doi-asserted-by":"crossref","first-page":"3006","DOI":"10.1128\/JB.186.10.3006-3014.2004","article-title":"Signal transduction cascade between EvgA\/EvgS and PhoP\/PhoQ two-component systems of Escherichia coli","volume":"186","author":"Eguchi","year":"2004","journal-title":"J. Bacteriol."},{"key":"2023061007105431400_b22","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-4547-1","volume-title":"A Handbook of Statistical Analysis using SAS","author":"Everitt","year":"1996"},{"key":"2023061007105431400_b23","volume-title":"Genetic Algorithms and Grouping Problems","author":"Falkenauer","year":"1998"},{"key":"2023061007105431400_b24","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-11-research0059","article-title":"Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering","volume":"3","author":"Gasch","year":"2002","journal-title":"Genome Biol."},{"key":"2023061007105431400_b25","doi-asserted-by":"crossref","first-page":"1835","DOI":"10.1128\/JB.183.6.1835-1842.2001","article-title":"The pleiotropic two-component regulatory system PhoP-PhoQ","volume":"183","author":"Groisman","year":"2001","journal-title":"J. 
Bacteriol."},{"key":"2023061007105431400_b26","doi-asserted-by":"crossref","first-page":"2435","DOI":"10.1101\/gr.1387003","article-title":"Regulatory network of Escherichia coli: consistency between literature knowledge and microarray profiles","volume":"13","author":"Gutierrez-Rios","year":"2003","journal-title":"Genome Res."},{"key":"2023061007105431400_b27","doi-asserted-by":"crossref","first-page":"2302","DOI":"10.1101\/gad.1230804","article-title":"Connecting two-component regulatory systems by a protein that protects a response regulator from dephosphorylation by its cognate sensor","volume":"18","author":"Kato","year":"2004","journal-title":"Genes Dev."},{"key":"2023061007105431400_b28","doi-asserted-by":"crossref","first-page":"4706","DOI":"10.1073\/pnas.0836837100","article-title":"Closing the loop: the PmrA\/PmrB two-component system negatively controls expression of its posttranscriptional activator PmrD","volume":"100","author":"Kato","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061007105431400_b29","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/S0004-3702(97)00043-X","article-title":"Wrappers for feature subset selection","volume":"97","author":"Kohavi","year":"1997","journal-title":"Artif. Intell."},{"key":"2023061007105431400_b30","doi-asserted-by":"crossref","first-page":"912","DOI":"10.1109\/34.537345","article-title":"Structure learning of Bayesian networks by genetic algorithms: a performance analysis of control parameters","volume":"18","author":"Larranaga","year":"1996","journal-title":"IEEE J. Pattern Anal. Mach. Intell."},{"key":"2023061007105431400_b31","doi-asserted-by":"crossref","first-page":"6287","DOI":"10.1128\/JB.185.21.6287-6294.2003","article-title":"Molecular characterization of the Mg2+-responsive PhoP-PhoQ regulon in Salmonella enterica","volume":"185","author":"Lejona","year":"2003","journal-title":"J. 
Bacteriol."},{"key":"2023061007105431400_b32","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1073\/pnas.98.1.31","article-title":"Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection","volume":"98","author":"Li","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061007105431400_b33","doi-asserted-by":"crossref","first-page":"11772","DOI":"10.1073\/pnas.112341999","article-title":"Identification of the binding sites of regulatory proteins in bacterial genomes","volume":"99","author":"Li","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023061007105431400_b34","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1016\/j.mib.2003.09.002","article-title":"Identifying global regulators in transcriptional regulatory networks in bacteria","volume":"6","author":"Martinez-Antonio","year":"2003","journal-title":"Curr. Opin. Microbiol."},{"key":"2023061007105431400_b35","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1046\/j.1365-2958.2003.03477.x","article-title":"Regulatory network of acid resistance genes in Escherichia coli","volume":"48","author":"Masuda","year":"2003","journal-title":"Mol. Microbiol."},{"key":"2023061007105431400_b36","doi-asserted-by":"crossref","first-page":"774","DOI":"10.1093\/nar\/29.3.774","article-title":"Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes","volume":"29","author":"McCue","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023061007105431400_b37","doi-asserted-by":"crossref","first-page":"3696","DOI":"10.1128\/JB.185.13.3696-3702.2003","article-title":"Identification and molecular characterization of the Mg2+ stimulon of Escherichia coli","volume":"185","author":"Minagawa","year":"2003","journal-title":"J. 
Bacteriol."},{"key":"2023061007105431400_b38","volume-title":"Machine Learning","author":"Mitchell","year":"1997"},{"key":"2023061007105431400_b39","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1046\/j.1365-2958.2002.03170.x","article-title":"Transcriptome analysis of all two-component regulatory system mutants of Escherichia coli K-12","volume":"46","author":"Oshima","year":"2002","journal-title":"Mol. Microbiol."},{"key":"2023061007105431400_b40","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1038\/nbt802","article-title":"Identification of co-regulated genes through Bayesian clustering of predicted regulatory binding sites","volume":"21","author":"Qin","year":"2003","journal-title":"Nat. Biotechnol."},{"key":"2023061007105431400_b41","volume-title":"C4.5: Programs for Machine Learning","author":"Quinlan","year":"1993"},{"key":"2023061007105431400_b42","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/S0167-7152(00)00079-1","article-title":"Characterization of maximum probability points in the Multivariate Hypergeometric distribution","volume":"50","author":"Requena","year":"2000","journal-title":"Stat. Probab. Lett."},{"key":"2023061007105431400_b43","volume-title":"Stochastic Complexity in Statistical Inquiry","author":"Rissanen","year":"1989"},{"key":"2023061007105431400_b44","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1142\/9789812567796_0018","article-title":"Generalized analysis of promoters: a method for DNA sequence description","volume-title":"Applications of Multi-Objective Evolutionary Algorithms","author":"Romero Zaliz","year":"2004"},{"key":"2023061007105431400_b45","doi-asserted-by":"crossref","first-page":"10555","DOI":"10.1073\/pnas.152046799","article-title":"Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics","volume":"99","author":"Ronen","year":"2002","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2023061007105431400_b46","volume-title":"Fundamentals of Biostatistics","author":"Rosner","year":"1986"},{"key":"2023061007105431400_b47","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1142\/9789812386533_0016","article-title":"Automated generation of qualitative representations of complex objects by hybrid soft-computing methods","volume-title":"Pattern Recognition: from Classical to Modern Approaches","author":"Ruspini","year":"2001"},{"key":"2023061007105431400_b48","doi-asserted-by":"crossref","first-page":"D303","DOI":"10.1093\/nar\/gkh140","article-title":"RegulonDB (version 4.0): transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12","volume":"32","author":"Salgado","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023061007105431400_b49","article-title":"Probabilit\u00e9s, analyse des donn\u00e9es et statistiques","author":"Saporta","year":"1996"},{"key":"2023061007105431400_b50","doi-asserted-by":"crossref","first-page":"38618","DOI":"10.1074\/jbc.M406149200","article-title":"Transcriptional control of the antimicrobial peptide resistance ugtL gene by the Salmonella PhoP and SlyA regulatory proteins","volume":"279","author":"Shi","year":"2004","journal-title":"J. Biol. Chem."},{"key":"2023061007105431400_b51","doi-asserted-by":"crossref","first-page":"4089","DOI":"10.1074\/jbc.M412741200","article-title":"Signal-dependent binding of the response regulators PhoP and PmrA to their target promoters in vivo","volume":"280","author":"Shin","year":"2005","journal-title":"J. Biol. 
Chem."},{"key":"2023061007105431400_b52","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1093\/bioinformatics\/16.1.16","article-title":"DNA binding sites: representation and discovery","volume":"16","author":"Stormo","year":"2000","journal-title":"Bioinformatics"},{"key":"2023061007105431400_b53","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"Tavazoie","year":"1999","journal-title":"Nat. Genet."},{"key":"2023061007105431400_b54","doi-asserted-by":"crossref","first-page":"6551","DOI":"10.1128\/JB.184.23.6551-6558.2002","article-title":"Gene expression profiling of the pH response in Escherichia coli","volume":"184","author":"Tucker","year":"2002","journal-title":"J. Bacteriol."},{"key":"2023061007105431400_b55","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1093\/bioinformatics\/17.9.763","article-title":"Principal component analysis for clustering gene expression data","volume":"17","author":"Yeung","year":"2001","journal-title":"Bioinformatics"},{"key":"2023061007105431400_b56","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/4235.797969","article-title":"Multiobjective evolutionary algorithms: a comparative case study and the Strength Pareto approach","volume":"3","author":"Zitzler","year":"1999","journal-title":"IEEE Trans. Evol. Comput."},{"key":"2023061007105431400_b57","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1111\/j.1749-6632.2002.tb04889.x","article-title":"Automated biological sequence description by genetic multiobjective generalized clustering","volume":"980","author":"Zwir","year":"2002","journal-title":"Ann. N. Y. Acad. 
Sci."},{"key":"2023061007105431400_b58","doi-asserted-by":"crossref","first-page":"2862","DOI":"10.1073\/pnas.0408238102","article-title":"Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica","volume":"102","author":"Zwir","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/22\/4073\/50566212\/bioinformatics_21_22_4073.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/22\/4073\/50566212\/bioinformatics_21_22_4073.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,10]],"date-time":"2023-06-10T07:12:08Z","timestamp":1686381128000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/21\/22\/4073\/194554"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,9,13]]},"references-count":58,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2005,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bti672","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2005,11,15]]},"published":{"date-parts":[[2005,9,13]]}}}