{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T13:00:09Z","timestamp":1777554009330,"version":"3.51.4"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3015,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: At the heart of many important bioinformatics problems, such as gene finding and function prediction, is the classification of biological sequences. Frequently the most accurate classifiers are obtained by training support vector machines (SVMs) with complex sequence kernels. However, a cumbersome shortcoming of SVMs is that their learned decision rules are very hard to understand for humans and cannot easily be related to biological facts.<\/jats:p><jats:p>Results: To make SVM-based sequence classifiers more accessible and profitable, we introduce the concept of positional oligomer importance matrices (POIMs) and propose an efficient algorithm for their computation. In contrast to the raw SVM feature weighting, POIMs take the underlying correlation structure of k-mer features induced by overlaps of related k-mers into account. 
POIMs can be seen as a powerful generalization of sequence logos: they allow to capture and visualize sequence patterns that are relevant for the investigated biological phenomena.<\/jats:p><jats:p>Availability: All source code, datasets, tables and figures are available at http:\/\/www.fml.tuebingen.mpg.de\/raetsch\/projects\/POIM.<\/jats:p><jats:p>Contact: \u00a0Soeren.Sonnenburg@first.fraunhofer.de<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn170","type":"journal-article","created":{"date-parts":[[2008,6,27]],"date-time":"2008-06-27T07:43:13Z","timestamp":1214552593000},"page":"i6-i14","source":"Crossref","is-referenced-by-count":47,"title":["POIMs: positional oligomer importance matrices\u2014understanding support vector machine-based signal detectors"],"prefix":"10.1093","volume":"24","author":[{"given":"S\u00f6ren","family":"Sonnenburg","sequence":"first","affiliation":[{"name":"1 Fraunhofer Institute FIRST, Department IDA, Kekul\u00e8str. 7, 12489 Berlin, 2Friedrich Miescher Laboratory, Max Planck Society, Spemannstr. 39 and 3Max Planck Institute for Biological Cybernetics, Spemannstr. 38, 72076 T\u00fcbingen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Zien","sequence":"additional","affiliation":[{"name":"1 Fraunhofer Institute FIRST, Department IDA, Kekul\u00e8str. 
7, 12489 Berlin, 2Friedrich Miescher Laboratory, Max Planck Society, Spemannstr. 39 and 3Max Planck Institute for Biological Cybernetics, Spemannstr. 38, 72076 T\u00fcbingen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Petra","family":"Philips","sequence":"additional","affiliation":[{"name":"1 Fraunhofer Institute FIRST, Department IDA, Kekul\u00e8str. 7, 12489 Berlin, 2Friedrich Miescher Laboratory, Max Planck Society, Spemannstr. 39 and 3Max Planck Institute for Biological Cybernetics, Spemannstr. 38, 72076 T\u00fcbingen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gunnar","family":"R\u00e4tsch","sequence":"additional","affiliation":[{"name":"1 Fraunhofer Institute FIRST, Department IDA, Kekul\u00e8str. 7, 12489 Berlin, 2Friedrich Miescher Laboratory, Max Planck Society, Spemannstr. 39 and 3Max Planck Institute for Biological Cybernetics, Spemannstr. 38, 72076 T\u00fcbingen, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,7,1]]},"reference":[{"key":"2023020210404665700_B1","doi-asserted-by":"crossref","first-page":"1359","DOI":"10.1093\/genetics\/139.3.1359","article-title":"Promoter elements in D. melanogaster revealed by sequence analysis","volume":"139","author":"Arkhipova","year":"1995","journal-title":"Genetics"},{"key":"2023020210404665700_B2","article-title":"Modeling depend. 
in protein-DNA binding sites","author":"Barash","year":"2003"},{"key":"2023020210404665700_B3","doi-asserted-by":"crossref","first-page":"2657","DOI":"10.1093\/bioinformatics\/bti410","article-title":"Identification of transcription factor binding sites with variable-order Bayesian networks","volume":"21","author":"Ben-Gal","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210404665700_B4","doi-asserted-by":"crossref","first-page":"3020","DOI":"10.1101\/gad.11.22.3020","article-title":"The downstream core promoter element, DPE, is conserved from Drosophila to humans and is recognized by TAFII60 of Drosophila","volume":"11","author":"Burke","year":"1997","journal-title":"Genes Dev"},{"key":"2023020210404665700_B5","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1093\/bioinformatics\/bti025","article-title":"Prediction of splice sites with dependency graphs and their expanded Bayesian networks","volume":"21","author":"Chen","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210404665700_B6","doi-asserted-by":"crossref","first-page":"458","DOI":"10.1101\/gr.216102","article-title":"Computational detection and location of transcription start sites in mammalian genomic DNA","volume":"12","author":"Down","year":"2002","journal-title":"Genome Res"},{"key":"2023020210404665700_B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1261\/rna.596707","article-title":"C. 
elegans sequences that control trans-splicing and operon pre-mRNA processing","volume":"13","author":"Graber","year":"2007","journal-title":"RNA"},{"key":"2023020210404665700_B8","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1162\/089976606774841611","article-title":"Classification of faces in man and machine","volume":"18","author":"Graf","year":"2006","journal-title":"Neural Comput"},{"key":"2023020210404665700_B9","doi-asserted-by":"crossref","first-page":"3015","DOI":"10.1093\/nar\/18.10.3015","article-title":"Distribution and consensus of branch point signals in eukaryotic genes: a computerized statistical analysis","volume":"18","author":"Harris","year":"1990","journal-title":"Nucleic Acids Res"},{"key":"2023020210404665700_B10","first-page":"27","article-title":"Learning the kernel matrix with semidefinite programming","volume":"5","author":"Lanckriet","year":"2004","journal-title":"J. Mach. Learn. Res"},{"key":"2023020210404665700_B11","article-title":"The spectrum kernel: a string kernel for SVM protein classification","volume-title":"Proceedings of the PSB","author":"Leslie","year":"2002"},{"key":"2023020210404665700_B12","first-page":"1417","article-title":"Mismatch string kernels for SVM protein classification","volume-title":"Advances in Neural Information Processing Systems 15","author":"Leslie","year":"2003"},{"key":"2023020210404665700_B13","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1186\/1471-2105-5-169","article-title":"Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites","volume":"5","author":"Meinicke","year":"2004","journal-title":"BMC Bioinf"},{"key":"2023020210404665700_B14","doi-asserted-by":"crossref","first-page":"5943","DOI":"10.1093\/nar\/gkl608","article-title":"Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction","volume":"34","author":"Ohler","year":"2006","journal-title":"Nucleic 
Acids Res"},{"key":"2023020210404665700_B15","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0087","article-title":"Computational analysis of core promoters in the Drosophila genome","volume":"3","author":"Ohler","year":"2002","journal-title":"Genome Biol"},{"key":"2023020210404665700_B16","doi-asserted-by":"crossref","first-page":"277","DOI":"10.7551\/mitpress\/4057.003.0018","article-title":"Accurate splice site detection for C. elegans","volume-title":"Kernel Methods in Computational Biology","author":"R\u00e4tsch","year":"2004"},{"issue":"Suppl. 1","key":"2023020210404665700_B17","first-page":"i369","article-title":"RASE: recognition of alternatively spliced exons in C. elegans","volume":"21","author":"R\u00e4tsch","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020210404665700_B18","doi-asserted-by":"crossref","first-page":"S9","DOI":"10.1186\/1471-2105-7-S1-S9","article-title":"Learning interpretable SVMs for biological sequence classification","volume":"7","author":"R\u00e4tsch","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020210404665700_B19","doi-asserted-by":"crossref","first-page":"i418","DOI":"10.1093\/bioinformatics\/btm177","article-title":"Translation initiation site prediction on a genomic scale: beauty in simplicity","volume":"23","author":"Saeys","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210404665700_B20","volume-title":"Learning with Kernels","author":"Sch\u00f6lkopf","year":"2002"},{"key":"2023020210404665700_B21","first-page":"389","article-title":"Learning interpretable SVMs for biological sequence classification","volume-title":"RECOMB 2005, LNBI 3500","author":"Sonnenburg","year":"2005"},{"key":"2023020210404665700_B22","first-page":"1531","article-title":"Large scale multiple kernel learning","volume":"7","author":"Sonnenburg","year":"2006","journal-title":"J. Mach. Learn. 
Res"},{"key":"2023020210404665700_B23","doi-asserted-by":"crossref","first-page":"e472","DOI":"10.1093\/bioinformatics\/btl250","article-title":"ARTS: accurate recognition of transcription starts in human","volume":"22","author":"Sonnenburg","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210404665700_B24","doi-asserted-by":"crossref","first-page":"73","DOI":"10.7551\/mitpress\/7496.003.0006","article-title":"Large scale learning with string kernels","volume-title":"Large Scale Kernel Machines","author":"Sonnenburg","year":"2007"},{"key":"2023020210404665700_B25","first-page":"S7","article-title":"Accurate splice site prediction","volume":"8","author":"Sonnenburg","year":"2007","journal-title":"BMC Bioinformatics Special Issue from NIPS Workshop on New Problems and Methods in Computational Biology, Whistler, Canada, December 18, 2006"},{"key":"2023020210404665700_B26","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1016\/j.aca.2007.03.023","article-title":"Visualisation and interpretation of support vector regression models","volume":"595","author":"\u00dcst\u00fcn","year":"2007","journal-title":"Anal. Chim. Acta"},{"key":"2023020210404665700_B27","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The Nature of Statistical Learning Theory","author":"Vapnik","year":"1995"},{"key":"2023020210404665700_B28","article-title":"Computing positional oligomer importance matrices (POIMs). 
Research Report, Electronic Publishing 2, Fraunhofer FIRST","author":"Zien","year":"2007"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i6\/49053683\/bioinformatics_24_13_i6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/13\/i6\/49053683\/bioinformatics_24_13_i6.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,27]],"date-time":"2024-02-27T22:21:18Z","timestamp":1709072478000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/13\/i6\/233341"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,7,1]]},"references-count":28,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2008,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn170","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,7,1]]},"published":{"date-parts":[[2008,7,1]]}}}