{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T13:52:11Z","timestamp":1773323531961,"version":"3.50.1"},"reference-count":16,"publisher":"Oxford University Press (OUP)","issue":"5","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Summary: Given the growing amount of biological data, data mining methods have become an integral part of bioinformatics research. Unfortunately, standard data mining tools are often not sufficiently equipped for handling raw data such as amino acid sequences. One popular and freely available framework that contains many well-known data mining algorithms is the Waikato Environment for Knowledge Analysis (Weka). In the BioWeka project, we introduce various input formats for bioinformatics data and bioinformatics methods like alignments to Weka. This allows users to easily combine them with Weka's classification, clustering, validation and visualization facilities on a single platform and therefore reduces the overhead of converting data between different data formats as well as the need to write custom evaluation procedures that can deal with many different programs. 
We encourage users to participate in this project by adding their own components and data formats to BioWeka.<\/jats:p><jats:p>Availability: The software, documentation and tutorial are available at http:\/\/www.bioweka.org.<\/jats:p><jats:p>Contact: support@bioweka.org<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl671","type":"journal-article","created":{"date-parts":[[2007,1,20]],"date-time":"2007-01-20T01:12:50Z","timestamp":1169255570000},"page":"651-653","source":"Crossref","is-referenced-by-count":38,"title":["BioWeka\u2014extending the Weka framework for bioinformatics"],"prefix":"10.1093","volume":"23","author":[{"given":"Jan E.","family":"Gewehr","sequence":"first","affiliation":[]},{"given":"Martin","family":"Szugat","sequence":"additional","affiliation":[]},{"given":"Ralf","family":"Zimmer","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2007,1,19]]},"reference":[{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. 
Biol"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"2247","DOI":"10.1093\/nar\/19.suppl.2247","article-title":"The SWISS-PROT protein sequence data bank","volume":"19","author":"Bairoch","year":"1991","journal-title":"Nucleic Acids Res."},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"2963","DOI":"10.1093\/nar\/21.13.2963","article-title":"GenBank","volume":"21","author":"Benson","year":"1993","journal-title":"Nucleic Acids Res"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"D189","DOI":"10.1093\/nar\/gkh034","article-title":"The ASTRAL compendium in 2004","volume":"32","author":"Chandonia","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023041109374598000_","unstructured":"EL-Manzalawy Y, Honavar V. WLSVM: Integrating LibSVM into Weka Environment. 2005. http:\/\/www.cs.iastate.edu\/~yasser\/wlsvm"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"2479","DOI":"10.1093\/bioinformatics\/bth261","article-title":"Data mining in bioinformatics using Weka","volume":"20","author":"Frank","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"1383","DOI":"10.1093\/bioinformatics\/bti200","article-title":"Support vector machines for separation of mixed plant-pathogen EST collections based on codon usage","volume":"21","author":"Friedel","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1006\/jmbi.1999.2583","article-title":"GenTHREADER: An efficient and reliable protein fold recognition method for genomic 
sequences","volume":"287","author":"Jones","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/gkh120","article-title":"The EMBL nucleotide sequence database","volume":"32","author":"Kulikova","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.jtbi.2005.11.036","article-title":"A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility","volume":"241","author":"Moore","year":"2006","journal-title":"J. Theor. Biol"},{"key":"2023041109374598000_","unstructured":"Moustafa A. JAligner: Open Source Java Implementation of Smith-Waterman. 2006. http:\/\/jaligner.sourceforge.net\/"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"2444","DOI":"10.1073\/pnas.85.8.2444","article-title":"Improved tools for biological sequence comparison","volume":"85","author":"Pearson","year":"1988","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-9-research0046","article-title":"Design and implementation of microarray gene expression markup language (MAGE-ML)","volume":"3","author":"Spellman","year":"2002","journal-title":"Genome Biol"},{"key":"2023041109374598000_","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005","edition":"2nd edn"},{"key":"2023041109374598000_","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1093\/bioinformatics\/17.9.847","article-title":"InterProScan \u2013 an integration platform for the signature-recognition methods in InterPro","volume":"17","author":"Zdobnov","year":"2001","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/5\/651\/49829927\/bioinformatics_23_5_651.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/5\/651\/49829927\/bioinformatics_23_5_651.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,10]],"date-time":"2024-02-10T09:42:48Z","timestamp":1707558168000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/5\/651\/239018"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,1,19]]},"references-count":16,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2007,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl671","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,3]]},"published":{"date-parts":[[2007,1,19]]}}}