{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:49:38Z","timestamp":1753876178804,"version":"3.41.2"},"reference-count":42,"publisher":"Oxford University Press (OUP)","license":[{"start":{"date-parts":[[2020,4,15]],"date-time":"2020-04-15T00:00:00Z","timestamp":1586908800000},"content-version":"vor","delay-in-days":105,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["U01-CA198941"],"award-info":[{"award-number":["U01-CA198941"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["R01-LM012980"],"award-info":[{"award-number":["R01-LM012980"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000092","name":"National Library of Medicine","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000092","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Biomolecular data stored in public databases is increasingly specialized to organisms, context\/pathology and tissue type, potentially resulting in significant overhead for analyses. These networks are often specializations of generic interaction sets, presenting opportunities for reducing storage and computational cost. Therefore, it is desirable to develop effective compression and storage techniques, along with efficient algorithms and a flexible query interface capable of operating on compressed data structures. Current graph databases offer varying levels of support for network integration. However, these solutions do not provide efficient methods for the storage and querying of versioned networks.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present VerTIoN, a framework consisting of novel data structures and associated query mechanisms for integrated querying of versioned context-specific biological networks. As a use case for our framework, we study network proximity queries in which the user can select and compose a combination of tissue-specific and generic networks. Using our compressed version tree data structure, in conjunction with state-of-the-art numerical techniques, we demonstrate real-time querying of large network databases.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Our results show that it is possible to support flexible queries defined on heterogeneous networks composed at query time while drastically reducing response time for multiple simultaneous queries. The flexibility offered by VerTIoN in composing integrated network versions opens significant new avenues for the utilization of ever increasing volume of context-specific network data in a broad range of biomedical applications.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and Implementation<\/jats:title><jats:p>VerTIoN is implemented as a C++ library and is available at http:\/\/compbio.case.edu\/omics\/software\/vertion and https:\/\/github.com\/tjcowman\/vertion<\/jats:p><\/jats:sec><jats:sec><jats:title>Contact<\/jats:title><jats:p>tyler.cowman@case.edu<\/jats:p><\/jats:sec>","DOI":"10.1093\/database\/baaa018","type":"journal-article","created":{"date-parts":[[2020,3,2]],"date-time":"2020-03-02T20:16:55Z","timestamp":1583180215000},"source":"Crossref","is-referenced-by-count":4,"title":["Integrated querying and version control of context-specific biological networks"],"prefix":"10.1093","volume":"2020","author":[{"given":"Tyler","family":"Cowman","sequence":"first","affiliation":[{"name":"Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mustafa","family":"Co\u015fkun","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Abdullah G\u00fcl University, Kayseri 38080, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ananth","family":"Grama","sequence":"first","affiliation":[{"name":"Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mehmet","family":"Koyut\u00fcrk","sequence":"first","affiliation":[{"name":"Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, USA"},{"name":"Center for Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, OH 44106, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,4,15]]},"reference":[{"key":"2020041511154908200_ref1","doi-asserted-by":"crossref","first-page":"3346","DOI":"10.1093\/bioinformatics\/bth402","article-title":"Conserved network motifs allow protein\u2013protein interaction prediction","volume":"20","author":"Albert","year":"2004","journal-title":"Bioinformatics"},{"key":"2020041511154908200_ref2","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1006678","article-title":"Cophosk: a method for comprehensive kinase substrate annotation using co-phosphorylation analysis","volume":"15","author":"Ayati","year":"2019","journal-title":"PLoS Comput. Biol."},{"key":"2020041511154908200_ref3","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nrg2918","article-title":"Network medicine: a network-based approach to human disease","volume":"12","author":"Barab\u00e1si","year":"2011","journal-title":"Nat. Rev. Genet."},{"key":"2020041511154908200_ref4","doi-asserted-by":"crossref","first-page":"D267","DOI":"10.1093\/nar\/gkh061","article-title":"The unified medical language system (umls): integrating biomedical terminology","volume":"32","author":"Bodenreider","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2020041511154908200_ref5","first-page":"D204","article-title":"Uniprot: a hub for protein information","volume-title":"Nucleic Acids Res","author":"Consortium","year":"2015"},{"key":"2020041511154908200_ref6","doi-asserted-by":"crossref","first-page":"1515","DOI":"10.1145\/2939672.2939828","article-title":"Efficient processing of network proximity queries via chebyshev acceleration","volume-title":"Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","author":"Coskun","year":"2016"},{"key":"2020041511154908200_ref7","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1038\/nrg.2017.38","article-title":"Network propagation: a universal amplifier of genetic associations","volume":"18","author":"Cowen","year":"2017","journal-title":"Nat. Rev. Genet."},{"key":"2020041511154908200_ref8","first-page":"1","article-title":"The igraph software package for complex network research","volume":"1695","author":"Csardi","year":"2006","journal-title":"InterJournal Complex Systems"},{"key":"2020041511154908200_ref9","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1002\/nme.1620180804","article-title":"Yale sparse matrix package i: the symmetric codes","volume":"18","author":"Eisenstat","year":"1982","journal-title":"Internat. J. Numer. Methods Engrg."},{"key":"2020041511154908200_ref10","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1186\/1756-0381-4-19","article-title":"Dada: degree-aware algorithms for network-based disease gene prioritization","volume":"4","author":"Erten","year":"2011","journal-title":"BioData Min."},{"key":"2020041511154908200_ref11","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1089\/cmb.2011.0154","article-title":"Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks","volume":"18","author":"Erten","year":"2011","journal-title":"J. Comput. Biol."},{"key":"2020041511154908200_ref12","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1004791","article-title":"Context specific and differential gene co-expression networks via bayesian biclustering","volume":"12","author":"Gao","year":"2016","journal-title":"PLoS Comput. Biol."},{"key":"2020041511154908200_ref13","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1038\/ng.3259","article-title":"Understanding multicellular function and disease with human tissue-specific networks","volume":"47","author":"Greene","year":"2015","journal-title":"Nat. Genet."},{"key":"2020041511154908200_ref14","first-page":"895","article-title":"Functional cartography of complex metabolic networks. Nature","author":"Guimera","year":"2005"},{"key":"2020041511154908200_ref15","doi-asserted-by":"crossref","first-page":"D514","DOI":"10.1093\/nar\/gki033","article-title":"Online Mendelian inheritance in man (omim), a knowledgebase of human genes and genetic disorders","volume":"33","author":"Hamosh","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2020041511154908200_ref16","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1038\/nmeth.2651","article-title":"Network-based stratification of tumor mutations","volume":"10","author":"Hofree","year":"2013","journal-title":"Nat. Methods"},{"key":"2020041511154908200_ref17","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1145\/2457317.2457351","article-title":"Performance of graph query languages: comparison of cypher, gremlin and native access in neo4j","volume-title":"Proceedings of the Joint EDBT\/ICDT 2013 Workshops","author":"Holzschuher","year":"2013"},{"key":"2020041511154908200_ref18","first-page":"S2","article-title":"Constructing a gene semantic similarity network for the inference of disease genes","volume-title":"BMC Syst. Biol.","author":"Jiang","year":"2011"},{"key":"2020041511154908200_ref19","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1038\/nature09208","article-title":"Diverse somatic mutation patterns and pathway alterations in human cancers","volume":"466","author":"Kan","year":"2010","journal-title":"Nature"},{"key":"2020041511154908200_ref20","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1016\/j.ajhg.2008.02.013","article-title":"Walking the interactome for prioritization of candidate disease genes","volume":"82","author":"K\u00f6hler","year":"2008","journal-title":"Am. J. Hum. Genet."},{"key":"2020041511154908200_ref21","doi-asserted-by":"crossref","first-page":"D966","DOI":"10.1093\/nar\/gkt1026","article-title":"The human phenotype ontology project: linking molecular biology and disease through phenotype data","volume":"42","author":"K\u00f6hler","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2020041511154908200_ref22","doi-asserted-by":"crossref","first-page":"D536","DOI":"10.1093\/nar\/gkv1115","article-title":"Integrated interactions database: tissue-specific view of the human and model organism interactomes","volume":"44","author":"Kotlyar","year":"2015","journal-title":"Nucleic Acids Res."},{"key":"2020041511154908200_ref23","doi-asserted-by":"crossref","first-page":"i200","DOI":"10.1093\/bioinformatics\/bth919","article-title":"An efficient algorithm for detecting frequent subgraphs in biological networks","volume":"20","author":"Koyut\u00fcrk","year":"2004","journal-title":"Bioinformatics"},{"key":"2020041511154908200_ref24","doi-asserted-by":"crossref","first-page":"pii: 1","DOI":"10.1145\/2898361","article-title":"Snap: a general-purpose network analysis and graph-mining library","volume":"8","author":"Leskovec","year":"2016","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"2020041511154908200_ref25","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005502","article-title":"Co-occurring protein phosphorylation are functionally associated","volume":"13","author":"Li","year":"2017","journal-title":"PLoS Comput. Biol."},{"key":"2020041511154908200_ref26","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1038\/ng.2653","article-title":"The genotype-tissue expression (GTEx) project","volume":"45","author":"Lonsdale","year":"2013","journal-title":"Nat. Genet."},{"key":"2020041511154908200_ref27","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1002690","article-title":"Enhancing the prioritization of disease-causing genes through tissue specific protein interaction networks","volume":"8","author":"Magger","year":"2012","journal-title":"PLoS Comput. Biol."},{"key":"2020041511154908200_ref28","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1038\/nmeth.3799","article-title":"Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases","volume":"13","author":"Marbach","year":"2016","journal-title":"Nat. Methods"},{"key":"2020041511154908200_ref29","doi-asserted-by":"crossref","first-page":"1354","DOI":"10.1093\/bioinformatics\/btw733","article-title":"Linearity of network proximity measures: implications for set-based queries and significance testing","volume":"33","author":"Maxwell","year":"2017","journal-title":"Bioinformatics"},{"key":"2020041511154908200_ref30","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1186\/s12918-015-0253-0","article-title":"Scope and limitations of yeast as a model organism for studying human tissue-specific pathways","volume":"9","author":"Mohammadi","year":"2015","journal-title":"BMC Syst. Biol."},{"key":"2020041511154908200_ref31","doi-asserted-by":"crossref","first-page":"929","DOI":"10.1016\/j.sbi.2013.07.005","article-title":"Towards a detailed atlas of protein\u2013protein interactions","volume":"23","author":"Mosca","year":"2013","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2020041511154908200_ref32","doi-asserted-by":"crossref","DOI":"10.1093\/database\/bav028","article-title":"Disgenet: a discovery platform for the dynamical exploration of human diseases and their genes","volume":"2015","author":"Pi\u00f1ero","year":"2015","journal-title":"Database"},{"key":"2020041511154908200_ref33","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1016\/j.cels.2015.10.001","article-title":"Ndex, the network data exchange","volume":"1","author":"Pratt","year":"2015","journal-title":"Cell Syst."},{"key":"2020041511154908200_ref34","doi-asserted-by":"crossref","first-page":"340","DOI":"10.1093\/bioinformatics\/btg415","article-title":"Functional topology in a network of protein interactions","volume":"20","author":"Pr\u017eulj","year":"2004","journal-title":"Bioinformatics"},{"key":"2020041511154908200_ref35","doi-asserted-by":"crossref","first-page":"356","DOI":"10.1016\/j.tibtech.2014.04.007","article-title":"Signaling hypergraphs","volume":"32","author":"Ritz","year":"2014","journal-title":"Trends Biotechnol."},{"key":"2020041511154908200_ref36","doi-asserted-by":"crossref","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: a software environment for integrated models of biomolecular interaction networks","volume":"13","author":"Shannon","year":"2003","journal-title":"Genome Res."},{"key":"2020041511154908200_ref37","doi-asserted-by":"crossref","first-page":"40321","DOI":"10.1038\/srep40321","article-title":"Drug response prediction as a link prediction problem","volume":"7","author":"Stanfield","year":"2017","journal-title":"Sci. Rep."},{"key":"2020041511154908200_ref38","doi-asserted-by":"crossref","first-page":"e1000641","DOI":"10.1371\/journal.pcbi.1000641","article-title":"Associating genes and protein complexes with disease via network propagation","volume":"6","author":"Vanunu","year":"2010","journal-title":"PLoS Comput. Biol."},{"key":"2020041511154908200_ref39","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1093\/bfgp\/elr024","article-title":"Network-based methods for human disease gene prediction","volume":"10","author":"Wang","year":"2011","journal-title":"Brief. Funct. Genom."},{"key":"2020041511154908200_ref40","doi-asserted-by":"crossref","first-page":"257","DOI":"10.3389\/fgene.2015.00257","article-title":"Human protein interaction networks across tissues and diseases","volume":"6","author":"Yeger-Lotem","year":"2015","journal-title":"Front. Genet."},{"key":"2020041511154908200_ref41","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1007\/978-3-319-23525-7_37","article-title":"Fast inbound top-k query for random walk with restart","volume-title":"Joint European Conference on Machine Learning and Knowledge Discovery in Databases","author":"Zhang","year":"2015"},{"key":"2020041511154908200_ref42","doi-asserted-by":"crossref","first-page":"i484","DOI":"10.1093\/bioinformatics\/bty247","article-title":"Classifying tumors by supervised network propagation","volume":"34","author":"Zhang","year":"2018","journal-title":"Bioinformatics"}],"container-title":["Database"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baaa018\/33047390\/baaa018.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baaa018\/33047390\/baaa018.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,28]],"date-time":"2023-09-28T00:08:25Z","timestamp":1695859705000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baaa018\/5819652"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,1]]},"references-count":42,"URL":"https:\/\/doi.org\/10.1093\/database\/baaa018","relation":{},"ISSN":["1758-0463"],"issn-type":[{"type":"electronic","value":"1758-0463"}],"subject":[],"published-other":{"date-parts":[[2020]]},"published":{"date-parts":[[2020,1,1]]},"article-number":"baaa018"}}