{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:25:28Z","timestamp":1772166328404,"version":"3.50.1"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,7,23]],"date-time":"2021-07-23T00:00:00Z","timestamp":1626998400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,7,23]],"date-time":"2021-07-23T00:00:00Z","timestamp":1626998400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"National Agency for Research and Development","award":["Scholarship Program\/DOCTORADO BECAS CHILE\/2019:Grant N. 21190261."],"award-info":[{"award-number":["Scholarship Program\/DOCTORADO BECAS CHILE\/2019:Grant N. 21190261."]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>Calcific aortic valve stenosis (CAVS) is a fatal disease and there is no pharmacological treatment to prevent the progression of CAVS. This study aims to identify genes potentially implicated with CAVS in patients with congenital bicuspid aortic valve (BAV) and tricuspid aortic valve (TAV) in comparison with patients having normal valves, using a knowledge-slanted random forest (RF).<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      This study implemented a knowledge-slanted random forest (RF) using information extracted from a protein-protein interactions network to rank genes in order to modify their selection probability to draw the candidate split-variables. A total of 15,191 genes were assessed in 19 valves with CAVS (BAV,\n                      <jats:italic>n<\/jats:italic>\n                      \u2009=\u200910; TAV,\n                      <jats:italic>n<\/jats:italic>\n                      \u2009=\u20099) and 8 normal valves. The performance of the model was evaluated using accuracy, sensitivity, and specificity to discriminate cases with CAVS. A comparison with conventional RF was also performed. The performance of this proposed approach reported improved accuracy in comparison with conventional RF to classify cases separately with BAV and TAV (Slanted RF: 59.3% versus 40.7%). When patients with BAV and TAV were grouped against patients with normal valves, the addition of prior biological information was not relevant with an accuracy of 92.6%.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>The knowledge-slanted RF approach reflected prior biological knowledge, leading to better precision in distinguishing between cases with BAV, TAV, and normal valves. The results of this study suggest that the integration of biological knowledge can be useful during difficult classification tasks.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s13040-021-00269-4","type":"journal-article","created":{"date-parts":[[2021,7,23]],"date-time":"2021-07-23T15:03:11Z","timestamp":1627052591000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis"],"prefix":"10.1186","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3320-6032","authenticated-orcid":false,"given":"Erika","family":"Cantor","sequence":"first","affiliation":[]},{"given":"Rodrigo","family":"Salas","sequence":"additional","affiliation":[]},{"given":"Harvey","family":"Rosas","sequence":"additional","affiliation":[]},{"given":"Sandra","family":"Guauque-Olarte","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,7,23]]},"reference":[{"issue":"11","key":"269_CR1","doi-asserted-by":"publisher","first-page":"1002","DOI":"10.1016\/j.jacc.2013.05.015","volume":"62","author":"RLJ Osnabrugge","year":"2013","unstructured":"Osnabrugge RLJ, Mylotte D, Head SJ, Van Mieghem NM, Nkomo VT, LeReun CM, et al. Aortic stenosis in the elderly: disease prevalence and number of candidates for transcatheter aortic valve replacement: a meta-analysis and modeling study. J Am Coll Cardiol. 2013;62(11):1002\u201312. https:\/\/doi.org\/10.1016\/j.jacc.2013.05.015.","journal-title":"J Am Coll Cardiol"},{"key":"269_CR2","doi-asserted-by":"publisher","first-page":"196037","DOI":"10.1155\/2012\/196037","volume":"2012","author":"I Mordi","year":"2012","unstructured":"Mordi I, Tzemos N. Bicuspid aortic valve disease: a comprehensive review. Cardiol Res Pract. 2012;2012:196037.","journal-title":"Cardiol Res Pract"},{"key":"269_CR3","doi-asserted-by":"publisher","first-page":"685","DOI":"10.3389\/fphar.2020.00685","volume":"11","author":"B Alushi","year":"2020","unstructured":"Alushi B, Curini L, Christopher MR, Grubitzch H, Landmesser U, Amedei A, et al. Calcific aortic valve disease-natural history and future therapeutic strategies. Front Pharmacol. 2020;11:685. https:\/\/doi.org\/10.3389\/fphar.2020.00685.","journal-title":"Front Pharmacol"},{"issue":"3","key":"269_CR4","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1007\/s40484-018-0144-7","volume":"6","author":"WV Li","year":"2018","unstructured":"Li WV, Li JJ. Modeling and analysis of RNA-seq data: a review from a statistical perspective. Quant Biol. 2018;6(3):195\u2013209. https:\/\/doi.org\/10.1007\/s40484-018-0144-7.","journal-title":"Quant Biol"},{"issue":"4","key":"269_CR5","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1515\/sagmb-2015-0072","volume":"15","author":"C Wang","year":"2016","unstructured":"Wang C, Gevertz JL. Finding causative genes from high-dimensional data: an appraisal of statistical and machine learning approaches. Stat Appl Genet Mol Biol. 2016;15(4):321\u201347. https:\/\/doi.org\/10.1515\/sagmb-2015-0072.","journal-title":"Stat Appl Genet Mol Biol"},{"issue":"530","key":"269_CR6","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1080\/01621459.2020.1762613","volume":"115","author":"B Efron","year":"2020","unstructured":"Efron B. Prediction, estimation, and attribution. J Am Stat Assoc. 2020;115(530):636\u201355. https:\/\/doi.org\/10.1080\/01621459.2020.1762613.","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"269_CR7","doi-asserted-by":"publisher","first-page":"270","DOI":"10.1186\/s12859-018-2264-5","volume":"19","author":"R Couronn\u00e9","year":"2018","unstructured":"Couronn\u00e9 R, Probst P, Boulesteix A-L. Random forest versus logistic regression: a large-scale benchmark experiment. BMC Bioinformatics. 2018;19(1):270. https:\/\/doi.org\/10.1186\/s12859-018-2264-5.","journal-title":"BMC Bioinformatics."},{"issue":"15","key":"269_CR8","doi-asserted-by":"publisher","first-page":"4301","DOI":"10.1093\/bioinformatics\/btaa483","volume":"36","author":"S Seifert","year":"2020","unstructured":"Seifert S, Gundlach S, Junge O, Szymczak S. Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study. Bioinformatics. 2020;36(15):4301\u20138. https:\/\/doi.org\/10.1093\/bioinformatics\/btaa483.","journal-title":"Bioinformatics."},{"issue":"14","key":"269_CR9","doi-asserted-by":"publisher","first-page":"2395","DOI":"10.1093\/bioinformatics\/bty995","volume":"35","author":"X Wang","year":"2019","unstructured":"Wang X, Yu B, Ma A, Chen C, Liu B, Ma Q. Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique. Bioinformatics. 2019;35(14):2395\u2013402. https:\/\/doi.org\/10.1093\/bioinformatics\/bty995.","journal-title":"Bioinformatics."},{"key":"269_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13040-021-00260-z","volume":"14","author":"SS Saharan","year":"2021","unstructured":"Saharan SS, Nagar P, Creasy KT, Stock EO, Feng J, Malloy MJ, et al. Machine learning and statistical approaches for classification of risk of coronary artery disease using plasma cytokines. BioData Min. 2021;14:1\u201314.","journal-title":"BioData Min"},{"issue":"3","key":"269_CR11","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1016\/j.cmpb.2015.02.010","volume":"119","author":"JA Nepomuceno","year":"2015","unstructured":"Nepomuceno JA, Troncoso A, Nepomuceno-Chamorro IA, Aguilar-Ruiz JS. Integrating biological knowledge based on functional annotations for biclustering of gene expression data. Comput Methods Prog Biomed. 2015;119(3):163\u201380. https:\/\/doi.org\/10.1016\/j.cmpb.2015.02.010.","journal-title":"Comput Methods Prog Biomed"},{"issue":"1","key":"269_CR12","doi-asserted-by":"publisher","first-page":"15918","DOI":"10.1038\/s41598-019-52093-w","volume":"9","author":"A Oskooei","year":"2019","unstructured":"Oskooei A, Manica M, Mathis R, Mart\u00ednez MR. Network-based biased tree ensembles (NetBiTE) for drug sensitivity prediction and drug sensitivity biomarker identification in cancer. Sci Rep. 2019;9(1):15918. https:\/\/doi.org\/10.1038\/s41598-019-52093-w.","journal-title":"Sci Rep"},{"key":"269_CR13","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1016\/j.copbio.2019.12.021","volume":"63","author":"J Crawford","year":"2020","unstructured":"Crawford J, Greene CS. Incorporating biological structure into machine learning models in biomedicine. Curr Opin Biotechnol. 2020;63:126\u201334. https:\/\/doi.org\/10.1016\/j.copbio.2019.12.021.","journal-title":"Curr Opin Biotechnol"},{"key":"269_CR14","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1186\/s12859-020-3344-x","volume":"21","author":"X Guan","year":"2020","unstructured":"Guan X, Runger G, Liu L. Dynamic incorporation of prior knowledge from multiple domains in biomarker discovery. BMC Bioinformatics. 2020;21:77.","journal-title":"BMC Bioinformatics"},{"issue":"10","key":"269_CR15","doi-asserted-by":"publisher","first-page":"749","DOI":"10.1152\/physiolgenomics.00041.2016","volume":"48","author":"S Guauque-Olarte","year":"2016","unstructured":"Guauque-Olarte S, Droit A, Tremblay-Marchand J, Gaudreault N, Kalavrouziotis D, Dagenais F, et al. RNA expression profile of calcified bicuspid, tricuspid, and normal human aortic valves by RNA sequencing. Physiol Genomics. 2016;48(10):749\u201361. https:\/\/doi.org\/10.1152\/physiolgenomics.00041.2016.","journal-title":"Physiol Genomics"},{"key":"269_CR16","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1016\/j.neucom.2015.09.136","volume":"217","author":"J Zhang","year":"2016","unstructured":"Zhang J, Yang J, Huang T, Shu Y, Chen L. Identification of novel proliferative diabetic retinopathy related genes on protein\u2013protein interaction network. Neurocomputing. 2016;217:63\u201372. https:\/\/doi.org\/10.1016\/j.neucom.2015.09.136.","journal-title":"Neurocomputing."},{"issue":"D1","key":"269_CR17","doi-asserted-by":"publisher","first-page":"D447","DOI":"10.1093\/nar\/gku1003","volume":"43","author":"D Szklarczyk","year":"2015","unstructured":"Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(D1):D447\u201352. https:\/\/doi.org\/10.1093\/nar\/gku1003.","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"269_CR18","doi-asserted-by":"publisher","first-page":"949","DOI":"10.1016\/j.ajhg.2008.02.013","volume":"82","author":"S K\u00f6hler","year":"2008","unstructured":"K\u00f6hler S, Bauer S, Horn D, Robinson PN. Walking the interactome for prioritization of candidate disease genes. Am J Hum Genet. 2008;82(4):949\u201358. https:\/\/doi.org\/10.1016\/j.ajhg.2008.02.013.","journal-title":"Am J Hum Genet"},{"issue":"3","key":"269_CR19","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1152\/physiolgenomics.00115.2014","volume":"47","author":"R Padang","year":"2015","unstructured":"Padang R, Bagnall RD, Tsoutsman T, Bannon PG, Semsarian C. Comparative transcriptome profiling in human bicuspid aortic valve disease using RNA sequencing. Physiol Genomics. 2015;47(3):75\u201387. https:\/\/doi.org\/10.1152\/physiolgenomics.00115.2014.","journal-title":"Physiol Genomics"},{"issue":"11","key":"269_CR20","doi-asserted-by":"publisher","first-page":"1108","DOI":"10.1038\/nmeth.2651","volume":"10","author":"M Hofree","year":"2013","unstructured":"Hofree M, Shen JP, Carter H, Gross A, Ideker T. Network-based stratification of tumor mutations. Nat Methods. 2013;10(11):1108\u201315. https:\/\/doi.org\/10.1038\/nmeth.2651.","journal-title":"Nat Methods"},{"issue":"1","key":"269_CR21","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L. Random forests. Mach Learn Springer. 2001;45(1):5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324.","journal-title":"Mach Learn Springer"},{"key":"269_CR22","first-page":"2579","volume":"9","author":"L Van der Maaten","year":"2008","unstructured":"Van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9:2579\u2013605.","journal-title":"J Mach Learn Res"},{"key":"269_CR23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v077.i01","volume":"77","author":"MN Wright","year":"2017","unstructured":"Wright MN, Ziegler A. Ranger: a fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw. 2017;77:1\u201317.","journal-title":"J Stat Softw"},{"key":"269_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v028.i05","volume":"28","author":"M Kuhn","year":"2008","unstructured":"Kuhn M. Building predictive models in R using the caret package. J Stat Softw. 2008;28:1\u201326.","journal-title":"J Stat Softw"},{"key":"269_CR25","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.amjcard.2020.05.008","volume":"128","author":"C-H Sia","year":"2020","unstructured":"Sia C-H, Ho JS-Y, Chua JJ-L, Tan BY-Q, Ngiam NJ, Chew N, et al. Comparison of clinical and echocardiographic features of asymptomatic patients with stenotic bicuspid versus tricuspid aortic valves. Am J Cardiol. 2020;128:210\u20135. https:\/\/doi.org\/10.1016\/j.amjcard.2020.05.008.","journal-title":"Am J Cardiol"},{"key":"269_CR26","doi-asserted-by":"publisher","unstructured":"Heuschkel MA, Skenteris NT, Hutcheson JD, van der Valk DD, Bremer J, Goody P, et al. Integrative multi-omics analysis in calcific aortic valve disease reveals a link to the formation of amyloid-like deposits. Cells. 2020;9(10). https:\/\/doi.org\/10.3390\/cells9102164.","DOI":"10.3390\/cells9102164"},{"key":"269_CR27","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1016\/j.eswa.2019.05.028","volume":"134","author":"JL Speiser","year":"2019","unstructured":"Speiser JL, Miller ME, Tooze J, Ip E. A comparison of random forest variable selection methods for classification prediction modeling. Expert Syst Appl. 2019;134:93\u2013101. https:\/\/doi.org\/10.1016\/j.eswa.2019.05.028.","journal-title":"Expert Syst Appl"},{"key":"269_CR28","doi-asserted-by":"publisher","first-page":"106839","DOI":"10.1016\/j.csda.2019.106839","volume":"143","author":"A Bommert","year":"2020","unstructured":"Bommert A, Sun X, Bischl B, Rahnenf\u00fchrer J, Lang M. Benchmark for filter methods for feature selection in high-dimensional classification data. Comput Stat Data Anal. 2020;143:106839. https:\/\/doi.org\/10.1016\/j.csda.2019.106839.","journal-title":"Comput Stat Data Anal"},{"key":"269_CR29","doi-asserted-by":"publisher","first-page":"198363","DOI":"10.1155\/2015\/198363","volume":"2015","author":"ZM Hira","year":"2015","unstructured":"Hira ZM, Gillies DF. A review of feature selection and feature extraction methods applied on microarray data. Adv Bioinforma. 2015;2015:198363.","journal-title":"Adv Bioinforma"},{"issue":"4","key":"269_CR30","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1038\/nmeth.4642","volume":"15","author":"D Bzdok","year":"2018","unstructured":"Bzdok D, Altman N, Krzywinski M. Statistics versus machine learning. Nat Methods. 2018;15(4):233\u20134. https:\/\/doi.org\/10.1038\/nmeth.4642.","journal-title":"Nat Methods"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00269-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13040-021-00269-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00269-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,23]],"date-time":"2021-07-23T15:03:15Z","timestamp":1627052595000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/s13040-021-00269-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,23]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["269"],"URL":"https:\/\/doi.org\/10.1186\/s13040-021-00269-4","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-490040\/v1","asserted-by":"object"}]},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,23]]},"assertion":[{"value":"2 May 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 July 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 July 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The primary study was approved by the ethics committee of <i>the Institut universitaire de cardiologie et de pneumologie de Qu\u00e9bec, Laval University, Quebec, Canada.<\/i> Written informed consent was obtained from all participants.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"None declared.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"35"}}