{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T04:47:13Z","timestamp":1776833233783,"version":"3.51.2"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2017,5,13]],"date-time":"2017-05-13T00:00:00Z","timestamp":1494633600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Objective<\/jats:title><jats:p>Data integration methods that combine data from different molecular levels such as genome, epigenome, transcriptome, etc., have received a great deal of interest in the past few years. It has been demonstrated that the synergistic effects of different biological data types can boost learning capabilities and lead to a better understanding of the underlying interactions among molecular levels.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>In this paper we present a graph-based semi-supervised classification algorithm that incorporates latent biological knowledge in the form of biological pathways with gene expression and DNA methylation data. The process of graph construction from biological pathways is based on detecting condition-responsive genes, where 3 sets of genes are finally extracted: all condition responsive genes, high-frequency condition-responsive genes, and P-value\u2013filtered genes.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>The proposed approach is applied to ovarian cancer data downloaded from the Human Genome Atlas. Extensive numerical experiments demonstrate superior performance of the proposed approach compared to other state-of-the-art algorithms, including the latest graph-based classification techniques.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Simulation results demonstrate that integrating various data types enhances classification performance and leads to a better understanding of interrelations between diverse omics data types. The proposed approach outperforms many of the state-of-the-art data integration algorithms.<\/jats:p><\/jats:sec>","DOI":"10.1093\/jamia\/ocx032","type":"journal-article","created":{"date-parts":[[2017,3,14]],"date-time":"2017-03-14T20:56:40Z","timestamp":1489525000000},"page":"99-108","source":"Crossref","is-referenced-by-count":19,"title":["Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification"],"prefix":"10.1093","volume":"25","author":[{"given":"Abolfazl","family":"Doostparast Torshizi","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of California, Santa Barbara, CA, USA"}]},{"given":"Linda R","family":"Petzold","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of California, Santa Barbara, CA, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,5,13]]},"reference":[{"issue":"e1","key":"2020110612460243700_ocx032-B1","doi-asserted-by":"crossref","first-page":"e2","DOI":"10.1136\/amiajnl-2012-000969","article-title":"The coming age of data-driven medicine: translational bioinformatics\u2019 next frontier","volume":"19","author":"Shah","year":"2012","journal-title":"J Am Med Inform Assoc."},{"issue":"4","key":"2020110612460243700_ocx032-B2","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1136\/amiajnl-2013-002028","article-title":"Making it personal: translational bioinformatics","volume":"20","author":"Butte","year":"2013","journal-title":"J Am Med Inform Assoc."},{"key":"2020110612460243700_ocx032-B3","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1016\/j.compbiomed.2014.06.017","article-title":"Alpha-plane based automatic general type-ii fuzzy clustering based on simulated annealing meta-heuristic algorithm for analyzing gene expression data","volume":"64","author":"Doostparast Torshizi","year":"2015","journal-title":"Comp Bio Med."},{"key":"2020110612460243700_ocx032-B4","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"van\u2019t Veer","year":"2002","journal-title":"Nature."},{"key":"2020110612460243700_ocx032-B5","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1158\/1078-0432.CCR-09-1815","article-title":"DNA microarrays are predictive of cancer prognosis: a re-evaluation","volume":"16","author":"Fan","year":"2010","journal-title":"Clin Canc Res."},{"key":"2020110612460243700_ocx032-B6","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1016\/j.ins.2015.04.012","article-title":"Hidden Markov models for cancer classification using gene expression profiles","volume":"316","author":"Nguyen","year":"2015","journal-title":"Inf Sci."},{"key":"2020110612460243700_ocx032-B7","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1016\/j.compbiomed.2015.07.008","article-title":"Similarity-balanced discriminant neighbor embedding and its application to cancer classification based on gene expression data","volume":"64","author":"Zhang","year":"2015","journal-title":"Comp Bio Med."},{"key":"2020110612460243700_ocx032-B8","doi-asserted-by":"crossref","first-page":"1081","DOI":"10.1016\/j.molonc.2015.01.003","article-title":"Gene expression\u2013based classifications of fibroadenomas and phyllodes tumors of the breast","volume":"9","author":"Vidal","year":"2015","journal-title":"Mol Onc."},{"issue":"6","key":"2020110612460243700_ocx032-B9","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1016\/j.jbi.2013.07.008","article-title":"A simulation to analyze feature selection methods utilizing gene ontology for gene expression classification","volume":"46","author":"Gillies","year":"2013","journal-title":"J Bio Inf."},{"key":"2020110612460243700_ocx032-B10","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1038\/nrg3868","article-title":"Methods of integrating data to uncover genotype-phenotype interactions","volume":"16","author":"Ritchie","year":"2015","journal-title":"Nat Rev."},{"key":"2020110612460243700_ocx032-B11","doi-asserted-by":"crossref","first-page":"710","DOI":"10.1038\/ng1589","article-title":"An integrative genomics approach to infer causal associations between gene expression and disease","volume":"37","author":"Schadt","year":"2005","journal-title":"Nat Gen."},{"key":"2020110612460243700_ocx032-B12","doi-asserted-by":"crossref","first-page":"1353","DOI":"10.1093\/bioinformatics\/bts163","article-title":"Matrix eQTL: ultra fast eQTL analysis via large matrix operations","volume":"28","author":"Shabalin","year":"2012","journal-title":"Bioinformatics."},{"issue":"1","key":"2020110612460243700_ocx032-B13","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1086\/302698","article-title":"A general test of association for quantitative traits in nuclear families","volume":"66","author":"Abecasis","year":"2000","journal-title":"Amer J Hum Gen."},{"issue":"e164","key":"2020110612460243700_ocx032-B14","article-title":"ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data","volume":"38","author":"Wang","year":"2010","journal-title":"Nuc Acid Res."},{"key":"2020110612460243700_ocx032-B15","doi-asserted-by":"crossref","first-page":"D930","DOI":"10.1093\/nar\/gkr917","article-title":"HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants","volume":"40","author":"Ward","year":"2012","journal-title":"Nuc Acid Res."},{"issue":"9","key":"2020110612460243700_ocx032-B16","doi-asserted-by":"crossref","first-page":"1790","DOI":"10.1101\/gr.137323.112","article-title":"Annotation of functional variation in personal genomes using RegulomeDB","volume":"22","author":"Boyle","year":"2012","journal-title":"Gen Res."},{"issue":"4","key":"2020110612460243700_ocx032-B17","doi-asserted-by":"crossref","first-page":"352","DOI":"10.1002\/gepi.21628","article-title":"Bayesian integrative genomic model for pathway analysis of complex traits","volume":"36","author":"Fridley","year":"2011","journal-title":"Gen Epi."},{"issue":"11","key":"2020110612460243700_ocx032-B18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0024709","article-title":"Time to recurrence and survival in serous ovarian tumors predicted from integrated genomic profiles","volume":"6","author":"Mankoo","year":"2011","journal-title":"PloS One."},{"issue":"16","key":"2020110612460243700_ocx032-B19","doi-asserted-by":"crossref","first-page":"2626","DOI":"10.1093\/bioinformatics\/bth294","article-title":"A statistical framework for genomic data fusion","volume":"20","author":"Lanckriet","year":"2004","journal-title":"Bioinformatics."},{"key":"2020110612460243700_ocx032-B20","doi-asserted-by":"crossref","first-page":"ii59","DOI":"10.1093\/bioinformatics\/bti1110","article-title":"Fast protein classification with multiple networks","volume":"21","author":"Tsuda","year":"2005","journal-title":"Bioinformatics."},{"issue":"1","key":"2020110612460243700_ocx032-B21","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1093\/bioinformatics\/19.1.98","article-title":"Predicting HIV drug resistance with neural networks","volume":"19","author":"Draghici","year":"2003","journal-title":"Bioinformatics."},{"issue":"6","key":"2020110612460243700_ocx032-B22","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1016\/j.cell.2010.11.013","article-title":"An integrated approach to uncover drivers of cancer","volume":"143","author":"Akavia","year":"2010","journal-title":"Cell."},{"issue":"23","key":"2020110612460243700_ocx032-B23","doi-asserted-by":"crossref","first-page":"3217","DOI":"10.1093\/bioinformatics\/btm511","article-title":"Graph sharpening plus graph integration: a synergy that improves protein functional classification","volume":"23","author":"Shin","year":"2007","journal-title":"Bioinformatics."},{"issue":"23","key":"2020110612460243700_ocx032-B24","first-page":"3217","article-title":"Fast protein classification with multiple networks","volume":"21","author":"Tsuda","year":"2007","journal-title":"Bioinformatics."},{"key":"2020110612460243700_ocx032-B25","doi-asserted-by":"crossref","first-page":"1191","DOI":"10.1016\/j.jbi.2012.07.008","article-title":"Synergistic effect of different levels of genomic data for cancer clinical outcome prediction","volume":"45","author":"Kim","year":"2012","journal-title":"J Biomed Inf."},{"issue":"3","key":"2020110612460243700_ocx032-B26","first-page":"1","article-title":"Intra-relation reconstruction from inter-relation: miRNA to gene expression","volume":"7","author":"Kim","year":"2013","journal-title":"BMC Syst Bio."},{"key":"2020110612460243700_ocx032-B27","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1016\/j.ymeth.2014.02.003","article-title":"Incorporating inter-relationships between different levels of genomic data into cancer clinical outcome prediction","volume":"67","author":"Kim","year":"2014","journal-title":"Methods."},{"key":"2020110612460243700_ocx032-B28","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1136\/amiajnl-2013-002481","article-title":"Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction","volume":"22","author":"Kim","year":"2015","journal-title":"J Am Med Inform Assoc."},{"issue":"11","key":"2020110612460243700_ocx032-B29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1000217","article-title":"Inferring pathway activity toward precise disease classification","volume":"4","author":"Lee","year":"2008","journal-title":"PLoS Comp Bio."},{"key":"2020110612460243700_ocx032-B30","article-title":"Learning with local and global consistency","author":"Zhou","year":"2004","journal-title":"Proc Adv Neural Inform Process Syst."},{"key":"2020110612460243700_ocx032-B31","doi-asserted-by":"crossref","DOI":"10.2200\/S00196ED1V01Y200906AIM006","volume-title":"Introduction to Semi-Supervised Learning","author":"Zhu","year":"2009"},{"key":"2020110612460243700_ocx032-B32","first-page":"209","article-title":"Semi-supervised learning on Riemannian manifolds","volume":"56","author":"Belkin","year":"2004","journal-title":"Mach Lrn."},{"key":"2020110612460243700_ocx032-B33","article-title":"Transductive learning via spectral graph partitioning","volume-title":"Proceedings of International Conference on Machine Learning","author":"Joachims","year":"2003"},{"key":"2020110612460243700_ocx032-B34","article-title":"Learning from labeled and unlabeled data using graph mincuts","volume-title":"Proceedings of International Conference on Machine Learning","author":"Blum","year":"2001"},{"key":"2020110612460243700_ocx032-B35","article-title":"Semi-supervised learning using Gaussian fields and harmonic functions","volume-title":"Proceedings of International Conference on Machine Learning","author":"Zhu","year":"2003"},{"key":"2020110612460243700_ocx032-B36","unstructured":"Doostparast Torshizi A . http:\/\/www.cancergenome.nih.gov\/. Accessed October 2016."},{"key":"2020110612460243700_ocx032-B37","doi-asserted-by":"crossref","DOI":"10.1109\/NAFIPS.2012.6291067","article-title":"A new validation criteria for type-2 fuzzy c-means and possibilistic c-means","volume-title":"2012 Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS)","author":"Fazel Zarandi","year":"2012"},{"key":"2020110612460243700_ocx032-B38","doi-asserted-by":"crossref","DOI":"10.1109\/NORBERT.2014.6893882","article-title":"A two-stage meta-heuristic approach to general type-ii fuzzy clustering for microarray data analysis","volume-title":"IEEE Conference on Norbert Wiener in the 21st Century (21CW)","author":"Doostparast Torshizi","year":"2014"},{"issue":"19","key":"2020110612460243700_ocx032-B39","doi-asserted-by":"crossref","first-page":"e146","DOI":"10.1093\/nar\/gks615","article-title":"Co-clustering phenome\u2013genome for phenotype classification and disease gene discovery","volume":"40","author":"Hwang","year":"2012","journal-title":"Nucl. Acid Res."}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/1\/99\/34149455\/ocx032.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/25\/1\/99\/34149455\/ocx032.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,26]],"date-time":"2022-07-26T05:10:51Z","timestamp":1658812251000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/25\/1\/99\/3826530"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,5,13]]},"references-count":39,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2017,5,13]]},"published-print":{"date-parts":[[2018,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocx032","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,1]]},"published":{"date-parts":[[2017,5,13]]}}}