{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,4]],"date-time":"2025-10-04T18:25:19Z","timestamp":1759602319460,"version":"3.37.3"},"reference-count":8,"publisher":"Oxford University Press (OUP)","issue":"19","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Summary: High-throughput data can be used in conjunction with clinical information to develop predictive models. Automating the process of developing, evaluating and testing such predictive models on different datasets would minimize operator errors and facilitate the comparison of different modeling approaches on the same dataset. Complete automation would also yield unambiguous documentation of the process followed to develop each model. We present the BDVal suite of programs that fully automate the construction of predictive classification models from high-throughput data and generate detailed reports about the model construction process. We have used BDVal to construct models from microarray and proteomics data, as well as from DNA-methylation datasets. The programs are designed for scalability and support the construction of thousands of alternative models from a given dataset and prediction task.<\/jats:p><jats:p>Availability and Implementation: The BDVal programs are implemented in Java, provided under the GNU General Public License and freely available at http:\/\/bdval.campagnelab.org<\/jats:p><jats:p>Contact: \u00a0fac2003@med.cornell.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq463","type":"journal-article","created":{"date-parts":[[2010,8,12]],"date-time":"2010-08-12T00:32:12Z","timestamp":1281573132000},"page":"2472-2473","source":"Crossref","is-referenced-by-count":6,"title":["BDVal: reproducible large-scale predictive model development and validation in high-throughput datasets"],"prefix":"10.1093","volume":"26","author":[{"given":"Kevin C.","family":"Dorff","sequence":"first","affiliation":[{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"}]},{"given":"Nyasha","family":"Chambwe","sequence":"additional","affiliation":[{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"},{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"}]},{"given":"Marko","family":"Srdanovic","sequence":"additional","affiliation":[{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"}]},{"given":"Fabien","family":"Campagne","sequence":"additional","affiliation":[{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"},{"name":"1 Department of Physiology and Biophysics and 2Institute for Computational Biomedicine, Weill Medical College of Cornell University, New York, NY, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,8,11]]},"reference":[{"key":"2023012508170339000_B1","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1023\/A:1012487302797","article-title":"Gene selection for cancer classification using Support Vector Machines","volume":"46","author":"Guyon","year":"2002","journal-title":"Mach. Learn."},{"key":"2023012508170339000_B2","doi-asserted-by":"crossref","first-page":"e22","DOI":"10.1371\/journal.pcbi.0040022","article-title":"Microarray based diagnosis profits from better documentation of gene expression signatures","volume":"4","author":"Kostka","year":"2008","journal-title":"PLoS Comput. Biol."},{"key":"2023012508170339000_B3","first-page":"313","article-title":"Meeting the Challenges of Functional Genomics: From the Laboratory to the Clinic","volume":"2","author":"Quackenbush","year":"2004","journal-title":"Preclinica"},{"key":"2023012508170339000_B4","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1111\/j.2517-6161.1974.tb00994.x","article-title":"Cross-Validatory Choice and Assessment of Statistical Predictions","volume":"36","author":"Stone","year":"1974","journal-title":"J. R. Stat. Soc. Ser. B (Methodological)"},{"key":"2023012508170339000_B5","first-page":"822","article-title":"A bias correction for the minimum error rate in cross-validation","volume":"822","author":"Tibshirani","year":"2009","journal-title":"Appl. Stat."},{"key":"2023012508170339000_B6","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1186\/1471-2105-7-91","article-title":"Bias in error estimation when using cross-validation for model selection","volume":"7","author":"Varma","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508170339000_B7","doi-asserted-by":"crossref","first-page":"1878","DOI":"10.1101\/gr.190001","article-title":"Biomarker identification by feature wrappers","volume":"11","author":"Xiong","year":"2001","journal-title":"Genome Res."},{"key":"2023012508170339000_B8","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1038\/nbt.1665","article-title":"The MAQC-II Project: A comprehensive study of common practices for the development and validation of microarray-based predictive models. The MicroArray Quality Control","volume":"28","author":"MAQC","year":"2010","journal-title":"Nat. Biotechnol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/19\/2472\/48855748\/bioinformatics_26_19_2472.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/19\/2472\/48855748\/bioinformatics_26_19_2472.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,24]],"date-time":"2025-02-24T06:03:31Z","timestamp":1740377011000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/19\/2472\/230253"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,8,11]]},"references-count":8,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2010,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq463","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2010,10,1]]},"published":{"date-parts":[[2010,8,11]]}}}