{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,1]],"date-time":"2025-02-01T00:40:01Z","timestamp":1738370401814,"version":"3.35.0"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>There is an increasing need in transcriptome research for gene expression data and pattern warehouses. It is of importance to integrate in these warehouses both raw transcriptomic data, as well as some properties encoded in these data, like local patterns.<\/jats:p><\/jats:sec><jats:sec><jats:title>Description<\/jats:title><jats:p>We have developed an application called SQUAT (SAGE Querying and Analysis Tools) which is available at:<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/bsmc.insa-lyon.fr\/squat\/\" ext-link-type=\"uri\">http:\/\/bsmc.insa-lyon.fr\/squat\/<\/jats:ext-link>. This database gives access to both raw SAGE data and patterns mined from these data, for three species (human, mouse and chicken). This database allows to make simple queries like \"In which biological situations is my favorite gene expressed?\" as well as much more complex queries like: \u226awhat are the genes that are frequently co-over-expressed with my gene of interest in given biological situations?\u226b. Connections with external web databases enrich biological interpretations, and enable sophisticated queries. To illustrate the power of SQUAT, we show and analyze the results of three different queries, one of which led to a biological hypothesis that was experimentally validated.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>SQUAT is a user-friendly information retrieval platform, which aims at bringing some of the state-of-the-art mining tools to biologists.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-9-378","type":"journal-article","created":{"date-parts":[[2008,9,18]],"date-time":"2008-09-18T18:13:34Z","timestamp":1221761614000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["SQUAT: A web tool to mine human, murine and avian SAGE data"],"prefix":"10.1186","volume":"9","author":[{"given":"Johan","family":"Leyritz","sequence":"first","affiliation":[]},{"given":"St\u00e9phane","family":"Schicklin","sequence":"additional","affiliation":[]},{"given":"Sylvain","family":"Blachon","sequence":"additional","affiliation":[]},{"given":"C\u00e9line","family":"Keime","sequence":"additional","affiliation":[]},{"given":"C\u00e9line","family":"Robardet","sequence":"additional","affiliation":[]},{"given":"Jean-Fran\u00e7ois","family":"Boulicaut","sequence":"additional","affiliation":[]},{"given":"J\u00e9r\u00e9my","family":"Besson","sequence":"additional","affiliation":[]},{"given":"Ruggero G","family":"Pensa","sequence":"additional","affiliation":[]},{"given":"Olivier","family":"Gandrillon","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,9,18]]},"reference":[{"key":"2363_CR1","volume-title":"Springer","author":"O Maimon","year":"2005","unstructured":"Maimon O, Rokach L: The Data Mining and Knowledge Discovery Handbook. Springer 2005."},{"issue":"5235","key":"2363_CR2","doi-asserted-by":"publisher","first-page":"484","DOI":"10.1126\/science.270.5235.484","volume":"270","author":"VE Velculescu","year":"1995","unstructured":"Velculescu VE, L Zhang, B Vogelstein, KW Kinzler: Serial analysis of gene expression. Science 1995, 270(5235):484\u2013487. 10.1126\/science.270.5235.484","journal-title":"Science"},{"key":"2363_CR3","unstructured":"SAGEGenie[http:\/\/cgap.nci.nih.gov\/SAGE]"},{"issue":"Web Server issu","key":"2363_CR4","doi-asserted-by":"publisher","first-page":"W693","DOI":"10.1093\/nar\/gki444","volume":"33","author":"J Pylouster","year":"2005","unstructured":"Pylouster J, Senamaud-Beaufort C, Saison-Behmoaras TE: WEBSAGE: a web tool for visual analysis of differentially expressed human SAGE tags. Nucleic Acids Res 2005, 33(Web Server issue):W693\u2013695. 10.1093\/nar\/gki444","journal-title":"Nucleic Acids Res"},{"issue":"Web Server","key":"2363_CR5","doi-asserted-by":"publisher","first-page":"W693","DOI":"10.1093\/nar\/gki444","volume":"33","author":"J Pylouster","year":"2005","unstructured":"Pylouster J, Senamaud-Beaufort C, Saison-Behmoaras TE: WEBSAGE: a web tool for visual analysis of differentially expressed human SAGE tags. Nucleic Acids Res 2005, 33(Web Server):W693\u2013695. 10.1093\/nar\/gki444","journal-title":"Nucleic Acids Res"},{"key":"2363_CR6","first-page":"109","volume-title":"SAGE: current technologies and applications","author":"C Romualdi","year":"2005","unstructured":"Romualdi C, Bortoluzzi S: Web tools for statistical Analysis of SAGE data. In SAGE: current technologies and applications. Edited by: SM W. Horizon Bioscience; 2005:109\u2013128."},{"key":"2363_CR7","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/j.ab.2006.03.023","volume":"353","author":"M Severgnini","year":"2006","unstructured":"Severgnini M, Bicciato S, Mangano E, Scarlatti F, Mezzelani A, Mattioli M, Ghidoni R, Peano C, Bonnal R, Viti F, Milanesi L, De Bellis G, Battaglia C: Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment. Anal Biochem 2006, 353: 43\u201356. 10.1016\/j.ab.2006.03.023","journal-title":"Anal Biochem"},{"key":"2363_CR8","volume-title":"workshop on Data Mining in BioInformatics with SIGKDD '01","author":"TR Ng","year":"2001","unstructured":"Ng TR, Sander J, Sleumer M: Hierarchical Cluster Analysis of SAGE Data for Cancer Profiling. workshop on Data Mining in BioInformatics with SIGKDD '01 2001."},{"key":"2363_CR9","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1109\/TCBB.2004.2","volume":"1","author":"SC Madeira","year":"2004","unstructured":"Madeira SC, Oliveira AL: Biclustering algorithms for biological data analysis: a survey. IEEE\/ACM Transactions on Computational Biology and Bioinformatics 2004, 1: 24\u201345. 10.1109\/TCBB.2004.2","journal-title":"IEEE\/ACM Transactions on Computational Biology and Bioinformatics"},{"issue":"9","key":"2363_CR10","doi-asserted-by":"publisher","first-page":"1122","DOI":"10.1093\/bioinformatics\/btl060","volume":"22","author":"A Prelic","year":"2006","unstructured":"Prelic A, Bleuler S, Zimmermann P, Wille A, Buhlmann P, Gruissem W, Hennig L, Thiele L, Zitzler E: A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics 2006, 22(9):1122\u20131129. 10.1093\/bioinformatics\/btl060","journal-title":"Bioinformatics"},{"issue":"12","key":"2363_CR11","doi-asserted-by":"publisher","first-page":"RESEARCH0067","DOI":"10.1186\/gb-2002-3-12-research0067","volume":"3","author":"C Becquet","year":"2002","unstructured":"Becquet C, Blachon S, Jeudy B, Boulicaut JF, Gandrillon O: Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data. Genome Biol 2002, 3(12):RESEARCH0067. 10.1186\/gb-2002-3-12-research0067","journal-title":"Genome Biol"},{"issue":"1","key":"2363_CR12","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1093\/bioinformatics\/19.1.79","volume":"19","author":"C Creighton","year":"2003","unstructured":"Creighton C, Hanash S: Mining gene expression databases for association rules. Bioinformatics 2003, 19(1):79\u201386. 10.1093\/bioinformatics\/19.1.79","journal-title":"Bioinformatics"},{"key":"2363_CR13","volume-title":"Actes des Journ\u00e9es Ouvertes de Biologie Informatique et Math\u00e9matiques (JOBIM): 2005; Lyon","author":"M Elati","year":"2005","unstructured":"Elati M, Radvanyi F, Rouveirol C: Mining transcriptional regulation from expression data. Actes des Journ\u00e9es Ouvertes de Biologie Informatique et Math\u00e9matiques (JOBIM): 2005; Lyon 2005."},{"issue":"Suppl 2","key":"2363_CR14","doi-asserted-by":"publisher","first-page":"ii123","DOI":"10.1093\/bioinformatics\/bti1121","volume":"21","author":"E Georgii","year":"2005","unstructured":"Georgii E, Richter L, Ruckert U, Kramer S: Analyzing microarray data using quantitative association rules. Bioinformatics 2005, 21(Suppl 2):ii123-ii129. 10.1093\/bioinformatics\/bti1121","journal-title":"Bioinformatics"},{"key":"2363_CR15","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1093\/bioinformatics\/19.1.71","volume":"19","author":"J Li","year":"2003","unstructured":"Li J, Liu H, Downing JR, Yeoh AE, Wong L: Simple rules underlying gene expression profiles of more than six subtypes of acute lymphoblastic leukemia (ALL) patients. Bioinformatics 2003, 19: 71\u201378. 10.1093\/bioinformatics\/19.1.71","journal-title":"Bioinformatics"},{"key":"2363_CR16","first-page":"107","volume-title":"2nd Int Workshop Knowledge Discovery in Inductive Databases KDID'03 co-located with ECML-PKDD 2003: September 22 2003; Cavtat-Dubrovnik (Croatia)","author":"F Rioult","year":"2003","unstructured":"Rioult F, Robardet C, Blachon S, Cr\u00e9milleux B, Gandrillon O, Boulicaut JF: Mining concepts from large SAGE gene expression matrices. 2nd Int Workshop Knowledge Discovery in Inductive Databases KDID'03 co-located with ECML-PKDD 2003: September 22 2003; Cavtat-Dubrovnik (Croatia) 2003, 107\u2013118."},{"key":"2363_CR17","doi-asserted-by":"crossref","first-page":"0033","DOI":"10.3233\/ISI-2007-00321","volume":"7","author":"S Blachon","year":"2007","unstructured":"Blachon S, Pensa RG, Besson J, Robardet C, Boulicaut J-F, Gandrillon O: Clustering formal concepts to discover biologically relevant knowledge from gene expression data. Silico Biol 2007, 7: 0033.","journal-title":"Silico Biol"},{"key":"2363_CR18","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1007\/11504245_8","volume":"35329","author":"R Pensa","year":"2005","unstructured":"Pensa R, Boulicaut JF: Boolean property encoding for local set pattern discovery: an application to gene expression data analysis. Local Pattern Detection Springer-Verlag LNAI 2005, 35329: 115\u2013134.","journal-title":"Local Pattern Detection Springer-Verlag LNAI"},{"key":"2363_CR19","unstructured":"SAGE N[ftp:\/\/ftp1.nci.nih.gov\/pub\/SAGE\/]"},{"key":"2363_CR20","doi-asserted-by":"publisher","first-page":"390","DOI":"10.1186\/1471-2164-8-390","volume":"8","author":"C Bresson","year":"2007","unstructured":"Bresson C, Keime C, Faure C, Letrillard Y, Barbado M, Sanfilippo S, Benhra N, Gandrillon O, Gonin-Giraud S: Large-scale analysis by SAGE reveals new mechanisms of v-erbA oncogene action. BMC Genomics 2007, 8: 390. 10.1186\/1471-2164-8-390","journal-title":"BMC Genomics"},{"key":"2363_CR21","doi-asserted-by":"publisher","first-page":"7628","DOI":"10.1038\/sj.onc.1208061","volume":"23","author":"F Damiola","year":"2004","unstructured":"Damiola F, Keime C, Gonin-Giraud S, Dazy S, Gandrillon O: Global transcription analysis of immature avian erythrocytic progenitors: from self-renewal to differentiation. Oncogene 2004, 23: 7628\u20137643. 10.1038\/sj.onc.1208061","journal-title":"Oncogene"},{"issue":"1","key":"2363_CR22","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1186\/1471-2164-5-98","volume":"5","author":"MB Wahl","year":"2004","unstructured":"Wahl MB, Caldwell RB, Kierzek AM, Arakawa H, Eyras E, Hubner N, Jung C, Soeldenwagner M, Cervelli M, Wang YD, Liebscher V, Buerstedde JM: Evaluation of the chicken transcriptome by SAGE of B cells and the DT40 cell line. BMC Genomics 2004, 5(1):98. 10.1186\/1471-2164-5-98","journal-title":"BMC Genomics"},{"key":"2363_CR23","unstructured":"GEO[http:\/\/www.ncbi.nlm.nih.gov\/geo\/]"},{"issue":"1","key":"2363_CR24","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1186\/1471-2105-5-143","volume":"5","author":"C Keime","year":"2004","unstructured":"Keime C, Damiola F, Mouchiroud D, Duret L, Gandrillon O: Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries. BMC Bioinformatics 2004, 5(1):143. 10.1186\/1471-2105-5-143","journal-title":"BMC Bioinformatics"},{"key":"2363_CR25","unstructured":"National Center for Biotechnology Information[http:\/\/www.ncbi.nlm.nih.gov\/]"},{"issue":"Database issue","key":"2363_CR26","doi-asserted-by":"publisher","first-page":"D61","DOI":"10.1093\/nar\/gkl842","volume":"35","author":"KD Pruitt","year":"2007","unstructured":"Pruitt KD, Tatusova T, Maglott DR: NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2007, 35(Database issue):D61\u201365. 10.1093\/nar\/gkl842","journal-title":"Nucleic Acids Res"},{"key":"2363_CR27","first-page":"90","volume-title":"19th IEEE International Symposium on Computer-Based Medical Systems: 2006; Salt Lake City, Utah","author":"J Klema","year":"2006","unstructured":"Klema J, Soulet A, Cr\u00e9milleux B, Blachon S, Gandrillon O: Mining Plausible Patterns from Genomic Data. 19th IEEE International Symposium on Computer-Based Medical Systems: 2006; Salt Lake City, Utah 2006, 90\u2013101."},{"key":"2363_CR28","unstructured":"BioMiner[http:\/\/liris.cnrs.fr\/dmidb\/BioMiner\/]"},{"issue":"1","key":"2363_CR29","doi-asserted-by":"crossref","first-page":"59","DOI":"10.3233\/IDA-2005-9105","volume":"9","author":"J Besson","year":"2005","unstructured":"Besson J, Robardet C, Boulicaut J-F, Rome S: Constraint-based concept mining and its application to microarray data analysis. Intelligent Data Analysis 2005, 9(1):59\u201382.","journal-title":"Intelligent Data Analysis"},{"key":"2363_CR30","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1080\/15216540500037794","volume":"56","author":"T Hankeln","year":"2004","unstructured":"Hankeln T, Wystub S, Laufs T, Schmidt M, Gerlach F, Saaler-Reinhardt S, Reuss S, Burmester T: The cellular and subcellular localization of neuroglobin and cytoglobin \u2013 a clue to their function? IUBMB Life 2004, 56: 671\u2013679. 10.1080\/15216540500037794","journal-title":"IUBMB Life"},{"key":"2363_CR31","doi-asserted-by":"publisher","first-page":"1016","DOI":"10.1167\/iovs.05-0465","volume":"47","author":"J Ostojic","year":"2006","unstructured":"Ostojic J, Sakaguchi D, de Lathouder Y, Hargrove M, Trent J 3rd, Kwon Y, Kardon R, Kuehn M, Betts D, Grozdanic S: Neuroglobin and cytoglobin: oxygen-binding proteins in retinal neurons. Invest Ophthalmol Vis Sci 2006, 47: 1016\u20131023. 10.1167\/iovs.05-0465","journal-title":"Invest Ophthalmol Vis Sci"},{"issue":"5","key":"2363_CR32","doi-asserted-by":"crossref","first-page":"1955","DOI":"10.4049\/jimmunol.153.5.1955","volume":"153","author":"TJ Fleming","year":"1994","unstructured":"Fleming TJ, Malek TR: Multiple glycosylphosphatidylinositol-anchored Ly-6 molecules and transmembrane Ly-6E mediate inhibition of IL-2 production. J Immunol 1994, 153(5):1955\u20131962.","journal-title":"J Immunol"},{"key":"2363_CR33","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1111\/j.1365-2184.2008.00554.x","volume":"41","author":"C Bresson","year":"2008","unstructured":"Bresson C, Gandrillon O, Gonin-Giraud S: sca2: a new gene involved in the self-renewal of erythroid progenitors. Cell Proliferation 2008, 41: 726\u2013738. 10.1111\/j.1365-2184.2008.00554.x","journal-title":"Cell Proliferation"},{"issue":"9","key":"2363_CR34","doi-asserted-by":"publisher","first-page":"R81","DOI":"10.1186\/gb-2005-6-9-r81","volume":"6","author":"JC Newman","year":"2005","unstructured":"Newman JC, Weiner AM: L2L: a simple tool for discovering the hidden significance in microarray expression data. Genome Biol 2005, 6(9):R81. 10.1186\/gb-2005-6-9-r81","journal-title":"Genome Biol"},{"issue":"5","key":"2363_CR35","doi-asserted-by":"publisher","first-page":"P3","DOI":"10.1186\/gb-2003-4-5-p3","volume":"4","author":"G Dennis Jr","year":"2003","unstructured":"Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, Lempicki RA: DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol 2003, 4(5):P3. 10.1186\/gb-2003-4-5-p3","journal-title":"Genome Biol"},{"key":"2363_CR36","first-page":"3","volume-title":"Database support for Data Mining Applications \u2013 Discovering Knowledge with Inductive Queries","author":"JF Boulicaut","year":"2004","unstructured":"Boulicaut JF: Inductive databases and multiple uses of frequent itemsets: the cInQ approach. In Database support for Data Mining Applications \u2013 Discovering Knowledge with Inductive Queries. Volume 2682. Springer-Verlag LNCS; 2004:3\u201326."},{"issue":"6","key":"2363_CR37","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1038\/nrg1615","volume":"6","author":"M Kaern","year":"2005","unstructured":"Kaern M, Elston TC, Blake WJ, Collins JJ: Stochasticity in gene expression: from theories to phenotypes. Nat Rev Genet 2005, 6(6):451\u2013464. 10.1038\/nrg1615","journal-title":"Nat Rev Genet"},{"key":"2363_CR38","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1007\/978-3-540-31841-5_10","volume":"3377","author":"A Soulet","year":"2005","unstructured":"Soulet A, Cr\u00e9milleux B, Rioult F: Condensed Representation of EPs and Patterns Quantified by Frequency-Based Measures. Lecture Notes in Computer Science 2005, 3377: 173\u2013189.","journal-title":"Lecture Notes in Computer Science"},{"key":"2363_CR39","unstructured":"Database of Transcriptional Start Sites[http:\/\/dbtss.hgc.jp]"},{"issue":"4","key":"2363_CR40","doi-asserted-by":"publisher","first-page":"656","DOI":"10.1101\/gr.229202. Article published online before March 2002","volume":"12","author":"WJ Kent","year":"2002","unstructured":"Kent WJ: BLAT \u2013 the BLAST-like alignment tool. Genome Res 2002, 12(4):656\u2013664.","journal-title":"Genome Res"},{"key":"2363_CR41","unstructured":"Ensembl[http:\/\/www.ensembl.org]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-378.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,1]],"date-time":"2025-02-01T00:02:02Z","timestamp":1738368122000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-378"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,9,18]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2363"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-378","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2008,9,18]]},"assertion":[{"value":"12 February 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 September 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 September 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"378"}}