{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,10]],"date-time":"2024-05-10T09:27:25Z","timestamp":1715333245832},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We describe here the vision, motivations, and research plans of the National Institutes of Health Center for Excellence in Big Data Computing at the University of Illinois, Urbana-Champaign. The Center is organized around the construction of \u201cKnowledge Engine for Genomics\u201d (KnowEnG), an E-science framework for genomics where biomedical scientists will have access to powerful methods of data mining, network mining, and machine learning to extract knowledge out of genomics data. The scientist will come to KnowEnG with their own data sets in the form of spreadsheets and ask KnowEnG to analyze those data sets in the light of a massive knowledge base of community data sets called the \u201cKnowledge Network\u201d that will be at the heart of the system. The Center is undertaking discovery projects aimed at testing the utility of KnowEnG for transforming big data to knowledge. These projects span a broad range of biological enquiry, from pharmacogenomics (in collaboration with Mayo Clinic) to transcriptomics of human behavior.<\/jats:p>","DOI":"10.1093\/jamia\/ocv090","type":"journal-article","created":{"date-parts":[[2015,7,24]],"date-time":"2015-07-24T01:10:15Z","timestamp":1437700215000},"page":"1115-1119","source":"Crossref","is-referenced-by-count":13,"title":["KnowEnG: a knowledge engine for genomics"],"prefix":"10.1093","volume":"22","author":[{"given":"Saurabh","family":"Sinha","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA"},{"name":"Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA"}]},{"given":"Jun","family":"Song","sequence":"additional","affiliation":[{"name":"Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA"},{"name":"Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA"},{"name":"Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA"}]},{"given":"Richard","family":"Weinshilboum","sequence":"additional","affiliation":[{"name":"Department of Pharmacology, Mayo Clinic, Rochester, MN, USA"}]},{"given":"Victor","family":"Jongeneel","sequence":"additional","affiliation":[{"name":"Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA"}]},{"given":"Jiawei","family":"Han","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA"},{"name":"Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA"}]}],"member":"286","published-online":{"date-parts":[[2015,7,22]]},"reference":[{"issue":"6018","key":"2020110613035567000_ocv090-B1","doi-asserted-by":"crossref","first-page":"666","DOI":"10.1126\/science.331.6018.666","article-title":"Human genome 10th anniversary. Will computers crash genomics [published online ahead of print February 12, 2011]?","volume":"331","author":"Pennisi","year":"2011","journal-title":"Sci."},{"issue":"4","key":"2020110613035567000_ocv090-B2","doi-asserted-by":"crossref","first-page":"e1002487","DOI":"10.1371\/journal.pcbi.1002487","article-title":"Rise and demise of bioinformatics? Promise and progress","volume":"8","author":"Ouzounis","year":"2012","journal-title":"PLoS Computational Biol."},{"issue":"5","key":"2020110613035567000_ocv090-B3","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1016\/j.jbi.2008.01.008","article-title":"State of the nation in data integration for bioinformatics","volume":"41","author":"Goble","year":"2008","journal-title":"J Biomed Inform."},{"issue":"3","key":"2020110613035567000_ocv090-B4","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol."},{"issue":"43","key":"2020110613035567000_ocv090-B5","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc Natl Acad Sci USA."},{"issue":"5","key":"2020110613035567000_ocv090-B6","doi-asserted-by":"crossref","first-page":"P3","DOI":"10.1186\/gb-2003-4-5-p3","article-title":"DAVID: Database for Annotation, Visualization, and Integrated Discovery [published online ahead of print May 8, 2003]","volume":"4","author":"Dennis","year":"2003","journal-title":"Genome Biol."},{"issue":"1","key":"2020110613035567000_ocv090-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nar\/gkn923","article-title":"Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists","volume":"37","author":"Huang da","year":"2009","journal-title":"Nucleic Acids Res."},{"issue":"1","key":"2020110613035567000_ocv090-B8","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium [published online ahead of print May 10, 2000]","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat Genetics."},{"issue":"Database issue","key":"2020110613035567000_ocv090-B9","doi-asserted-by":"crossref","first-page":"D504","DOI":"10.1093\/nar\/gkj126","article-title":"Pathguide: a pathway resource list [published online ahead of print December 31, 2005]","volume":"34","author":"Bader","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2020110613035567000_ocv090-B10","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/gb-2008-9-s1-s4","article-title":"GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function","volume":"9","author":"Mostafavi","year":"2008","journal-title":"Genome Biol."},{"key":"2020110613035567000_ocv090-B11","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-01902-9","volume-title":"Mining Heterogeneous Information Networks: Principles and Methodologies","author":"Sun","year":"2012"},{"key":"2020110613035567000_ocv090-B12","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2011"},{"key":"2020110613035567000_ocv090-B13","volume-title":"Hadoop: The Definitive Guide","author":"White","year":"2012"},{"issue":"Database issue","key":"2020110613035567000_ocv090-B14","doi-asserted-by":"crossref","first-page":"D561","DOI":"10.1093\/nar\/gkq973","article-title":"The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored","volume":"39","author":"Szklarczyk","year":"2011","journal-title":"Nucleic Acids Res."},{"issue":"Web Server issue","key":"2020110613035567000_ocv090-B15","doi-asserted-by":"crossref","first-page":"W115","DOI":"10.1093\/nar\/gkp406","article-title":"VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology [published online ahead of print May 26, 2009]","volume":"37","author":"Hu","year":"2009","journal-title":"Nucleic Acids Res."},{"issue":"10","key":"2020110613035567000_ocv090-B16","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1101\/gr.4086505","article-title":"Galaxy: a platform for interactive large-scale genome analysis","volume":"15","author":"Giardine","year":"2005","journal-title":"Genome Res."},{"issue":"8","key":"2020110613035567000_ocv090-B17","doi-asserted-by":"crossref","first-page":"R86","DOI":"10.1186\/gb-2010-11-8-r86","article-title":"Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences [published online ahead of print August 27, 2010]","volume":"11","author":"Goecks","year":"2010","journal-title":"Genome Biol."},{"key":"2020110613035567000_ocv090-B18","doi-asserted-by":"crossref","DOI":"10.1002\/0471142727.mb1910s89","article-title":"Galaxy: a web-based genome analysis tool for experimentalists [published online ahead of print January 14, 2010]","author":"Blankenberg","year":"2010","journal-title":"Curr Protocol Mol Biol."},{"key":"2020110613035567000_ocv090-B19","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1007\/978-3-319-16706-0_9","article-title":"Diffusion component analysis: unraveling functional topology in biological networks","volume-title":"Research in Computational Molecular Biology","author":"Cho","year":"2015"},{"issue":"1","key":"2020110613035567000_ocv090-B20","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1093\/bioinformatics\/btn577","article-title":"A novel signaling pathway impact analysis","volume":"25","author":"Tarca","year":"2009","journal-title":"Bioinform."},{"issue":"12","key":"2020110613035567000_ocv090-B21","doi-asserted-by":"crossref","first-page":"i237","DOI":"10.1093\/bioinformatics\/btq182","article-title":"Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM","volume":"26","author":"Vaske","year":"2010","journal-title":"Bioinform."},{"issue":"4","key":"2020110613035567000_ocv090-B22","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1089\/omi.2011.0018","article-title":"Managing data within the HUBzero platform","volume":"15","author":"McLennan","year":"2011","journal-title":"OMICS."},{"issue":"12","key":"2020110613035567000_ocv090-B23","doi-asserted-by":"crossref","first-page":"1739","DOI":"10.1093\/bioinformatics\/btr260","article-title":"Molecular signatures database (MSigDB) 3.0 [published online ahead of print May 7, 2011]","volume":"27","author":"Liberzon","year":"2011","journal-title":"Bioinform."},{"issue":"Database issue","key":"2020110613035567000_ocv090-B24","doi-asserted-by":"crossref","first-page":"D472","DOI":"10.1093\/nar\/gkt1102","article-title":"The Reactome pathway knowledgebase","volume":"42","author":"Croft","year":"2014","journal-title":"Nucleic Acids Res."},{"issue":"Database issue","key":"2020110613035567000_ocv090-B25","doi-asserted-by":"crossref","first-page":"D452","DOI":"10.1093\/nar\/gkh052","article-title":"IntAct: an open source molecular interaction database","volume":"32","author":"Hermjakob","year":"2004","journal-title":"Nucleic Acids Res."},{"issue":"1","key":"2020110613035567000_ocv090-B26","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1093\/nar\/29.1.37","article-title":"The InterPro database, an integrated documentation resource for protein families, domains and functional sites","volume":"29","author":"Apweiler","year":"2001","journal-title":"Nucleic Acids Res."},{"issue":"10","key":"2020110613035567000_ocv090-B27","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1038\/ng.2764","article-title":"The Cancer Genome Atlas Pan-Cancer analysis project","volume":"45","author":"Cancer Genome Atlas Research N","year":"2013","journal-title":"Nat Genetics."},{"issue":"Database issue","key":"2020110613035567000_ocv090-B28","first-page":"D955","article-title":"Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells","volume":"41","author":"Yang","year":"2013","journal-title":"Nucleic Acids Res."},{"issue":"6","key":"2020110613035567000_ocv090-B29","doi-asserted-by":"crossref","first-page":"796","DOI":"10.1109\/TKDE.2007.190745","article-title":"Truth discovery with multiple conflicting information providers on the web","volume":"20","author":"Yin","year":"2008","journal-title":"IEEE Trans Knowledge Data Eng."},{"issue":"1","key":"2020110613035567000_ocv090-B30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1456650.1456651","article-title":"Data fusion","volume":"41","author":"Bleiholder","year":"2009","journal-title":"ACM Computing Surveys."},{"issue":"6","key":"2020110613035567000_ocv090-B31","doi-asserted-by":"crossref","first-page":"550","DOI":"10.14778\/2168651.2168656","article-title":"A Bayesian approach to discovering truth from conflicting sources for data integration","volume":"5","author":"Zhao","year":"2012","journal-title":"Proc VLDB Endowment."}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/6\/1115\/34146474\/ocv090.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/22\/6\/1115\/34146474\/ocv090.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,12]],"date-time":"2023-08-12T13:23:35Z","timestamp":1691846615000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/22\/6\/1115\/2357880"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,7,22]]},"references-count":31,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2015,7,22]]},"published-print":{"date-parts":[[2015,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocv090","relation":{},"ISSN":["1527-974X","1067-5027"],"issn-type":[{"value":"1527-974X","type":"electronic"},{"value":"1067-5027","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,11]]},"published":{"date-parts":[[2015,7,22]]}}}