{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T22:59:34Z","timestamp":1772319574207,"version":"3.50.1"},"reference-count":63,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,1,3]],"date-time":"2020-01-03T00:00:00Z","timestamp":1578009600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,1,3]],"date-time":"2020-01-03T00:00:00Z","timestamp":1578009600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"LASIGE Strategic Project","award":["UID\/CEC\/00408\/2019"],"award-info":[{"award-number":["UID\/CEC\/00408\/2019"]}]},{"name":"LASIGE Strategic Project","award":["UID\/CEC\/00408\/2019"],"award-info":[{"award-number":["UID\/CEC\/00408\/2019"]}]},{"name":"LASIGE Strategic Project","award":["UID\/CEC\/00408\/2019"],"award-info":[{"award-number":["UID\/CEC\/00408\/2019"]}]},{"name":"SMILAX","award":["PTDC\/EEI-ESS\/4633\/2014"],"award-info":[{"award-number":["PTDC\/EEI-ESS\/4633\/2014"]}]},{"name":"SMILAX","award":["PTDC\/EEI-ESS\/4633\/2014"],"award-info":[{"award-number":["PTDC\/EEI-ESS\/4633\/2014"]}]},{"name":"PERSEIDS","award":["PTDC\/EMS-SIS\/0642\/2014"],"award-info":[{"award-number":["PTDC\/EMS-SIS\/0642\/2014"]}]},{"name":"BINDER","award":["PTDC\/CCI-INF\/29168\/2017"],"award-info":[{"award-number":["PTDC\/CCI-INF\/29168\/2017"]}]},{"name":"PREDICT","award":["PTDC\/CCI-CIF\/29877\/2017"],"award-info":[{"award-number":["PTDC\/CCI-CIF\/29877\/2017"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>In recent 
years, biomedical ontologies have become important for describing existing biological knowledge in the form of knowledge graphs. Data mining approaches that work with knowledge graphs have been proposed, but they are based on vector representations that do not capture the full underlying semantics. An alternative is to use machine learning approaches that explore semantic similarity. However, since ontologies can model multiple perspectives, semantic similarity computations for a given learning task need to be fine-tuned to account for this. Obtaining the best combination of semantic similarity aspects for each learning task is not trivial and typically depends on expert knowledge.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We have developed a novel approach, evoKGsim, that applies Genetic Programming over a set of semantic similarity features, each based on a semantic aspect of the data, to obtain the best combination for a given supervised learning task. The approach was evaluated on several benchmark datasets for protein-protein interaction prediction using the Gene Ontology as the knowledge graph to support semantic similarity, and it outperformed competing strategies, including manually selected combinations of semantic aspects emulating expert knowledge. evoKGsim was also able to learn species-agnostic models with different combinations of species for training and testing, effectively addressing the limitations of predicting protein-protein interactions for species with fewer known interactions.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>evoKGsim can overcome one of the limitations in knowledge graph-based semantic similarity applications: the need to expertly select which aspects should be taken into account for a given application. 
Applying this methodology to protein-protein interaction prediction proved successful, paving the way to broader applications.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-019-3296-1","type":"journal-article","created":{"date-parts":[[2020,1,3]],"date-time":"2020-01-03T15:03:16Z","timestamp":1578063796000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":36,"title":["Evolving knowledge graph similarity for supervised learning in complex biomedical domains"],"prefix":"10.1186","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7241-8970","authenticated-orcid":false,"given":"Rita T.","family":"Sousa","sequence":"first","affiliation":[]},{"given":"Sara","family":"Silva","sequence":"additional","affiliation":[]},{"given":"Catia","family":"Pesquita","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,1,3]]},"reference":[{"key":"3296_CR1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-68856-3","volume-title":"Logical and Relational Learning","author":"L De Raedt","year":"2008","unstructured":"De Raedt L. Logical and Relational Learning. Berlin Heidelberg: Springer; 2008."},{"key":"3296_CR2","volume-title":"The Semantic Web \u2013 ISWC 2014","author":"M Schmachtenberg","year":"2014","unstructured":"Schmachtenberg M, Bizer C, Paulheim H. Adoption of the linked data best practices in different topical domains In: Mika P, Tudorache T, Bernstein A, Welty C, Knoblock C, Vrande\u010di\u0107 D, Groth P, Noy N, Janowicz K, Goble C, editors. The Semantic Web \u2013 ISWC 2014. Cham: Springer: 2014. p. 245\u201360."},{"issue":"5-6","key":"3296_CR3","doi-asserted-by":"publisher","first-page":"907","DOI":"10.1006\/ijhc.1995.1081","volume":"43","author":"TR Gruber","year":"1995","unstructured":"Gruber TR. Toward principles for the design of ontologies used for knowledge sharing?. Int J Hum-Comput Stud. 
1995; 43(5-6):907\u201328.","journal-title":"Int J Hum-Comput Stud"},{"key":"3296_CR4","volume-title":"Joint Proceedings of the Posters and Demos Track of the 12th International Conference on Semantic Systems - SEMANTiCS2016, Leipzig, Germany, September 12-15, CEUR Workshop Proceedings, vol. 1695","author":"L Ehrlinger","year":"2016","unstructured":"Ehrlinger L, W\u00f6\u00df W. Towards a definition of knowledge graphs. In: Martin M, Cuquet M, Folmer E, editors. Joint Proceedings of the Posters and Demos Track of the 12th International Conference on Semantic Systems - SEMANTiCS2016, Leipzig, Germany, September 12-15, CEUR Workshop Proceedings, vol. 1695. Leipzig: CEUR-WS.org: 2016. http:\/\/nbn-resolving.de\/urn:nbn:de:0074-1695-3."},{"issue":"1","key":"3296_CR5","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000; 25(1):25\u201329.","journal-title":"Nat Genet"},{"key":"3296_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.websem.2016.01.001","volume":"36","author":"P Ristoski","year":"2016","unstructured":"Ristoski P, Paulheim H. Semantic Web in data mining and knowledge discovery: A comprehensive survey. J Web Semant. 2016; 36:1\u201322.","journal-title":"J Web Semant"},{"key":"3296_CR7","doi-asserted-by":"publisher","DOI":"10.1145\/2254129.2254168","volume-title":"Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics, WIMS \u201912","author":"H Paulheim","year":"2012","unstructured":"Paulheim H, F\u00fcrnkranz J. Unsupervised generation of data mining features from linked open data. In: Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics, WIMS \u201912. New York: ACM: 2012. p. 31\u201313112. 
https:\/\/doi.org\/10.1145\/2254129.2254168. http:\/\/doi.acm.org\/10.1145\/2254129.2254168."},{"key":"3296_CR8","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1016\/j.websem.2015.06.004","volume":"35","author":"P Ristoski","year":"2015","unstructured":"Ristoski P, Bizer C, Paulheim H. Mining the web of linked data with rapidminer. J Web Semant. 2015; 35:142\u201351.","journal-title":"J Web Semant"},{"key":"3296_CR9","volume-title":"Proceedings of the 2013 International Conference on Data Mining on Linked Data - Volume 1082, DMoLD\u201913","author":"GKD De Vries","year":"2013","unstructured":"De Vries GKD, De Rooij S. A fast and simple graph kernel for RDF. In: Proceedings of the 2013 International Conference on Data Mining on Linked Data - Volume 1082, DMoLD\u201913. Aachen: CEUR-WS.org: 2013. p. 23\u201334. http:\/\/dl.acm.org\/citation.cfm?id=3053776.3053781."},{"key":"3296_CR10","volume-title":"The Semantic Web \u2013 ISWC 2016","author":"P Ristoski","year":"2016","unstructured":"Ristoski P, Paulheim H. Rdf2Vec: RDF graph embeddings for data mining In: Groth P, Simperl E, Gray A, Sabou M, Kr\u00f6tzsch M, Lecue F, Fl\u00f6ck F, Gil Y, editors. The Semantic Web \u2013 ISWC 2016. Cham: Springer: 2016. p. 498\u2013514."},{"issue":"4","key":"3296_CR11","doi-asserted-by":"publisher","first-page":"762","DOI":"10.1109\/TCBB.2016.2555304","volume":"14","author":"S Bandyopadhyay","year":"2017","unstructured":"Bandyopadhyay S, Mallick K. A new feature vector based on gene ontology terms for protein-protein interaction prediction. IEEE\/ACM Trans Comput Biol Bioinformatics. 2017; 14(4):762\u201370.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinformatics"},{"issue":"13","key":"3296_CR12","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1093\/bioinformatics\/bty259","volume":"34","author":"FZ Smaili","year":"2018","unstructured":"Smaili FZ, Gao X, Hoehndorf R. 
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations. Bioinformatics. 2018; 34(13):52\u201360.","journal-title":"Bioinformatics"},{"issue":"1","key":"3296_CR13","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1093\/bioinformatics\/btr610","volume":"28","author":"SR Maetschke","year":"2011","unstructured":"Maetschke SR, Simonsen M, Davis MJ, Ragan MA. Gene ontology-driven inference of protein\u2013protein interactions using inducers. Bioinformatics. 2011; 28(1):69\u201375.","journal-title":"Bioinformatics"},{"issue":"7","key":"3296_CR14","doi-asserted-by":"publisher","first-page":"1000443","DOI":"10.1371\/journal.pcbi.1000443","volume":"5","author":"C Pesquita","year":"2009","unstructured":"Pesquita C, Faria D, Falcao AO, Lord P, Couto FM. Semantic similarity in biomedical ontologies. PLOS Comput Biol. 2009; 5(7):1000443.","journal-title":"PLOS Comput Biol"},{"issue":"1","key":"3296_CR15","doi-asserted-by":"publisher","first-page":"12100","DOI":"10.1038\/s41598-018-30455-0","volume":"8","author":"W Liu","year":"2018","unstructured":"Liu W, Liu J, Rajapakse JC. Gene ontology enrichment improves performances of functional similarity of genes. Sci Rep. 2018; 8(1):12100.","journal-title":"Sci Rep"},{"key":"3296_CR16","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1016\/j.jtbi.2016.04.020","volume":"401","author":"S-B Zhang","year":"2016","unstructured":"Zhang S-B, Tang Q-R. Protein\u2013protein interaction inference based on semantic similarity of gene ontology terms. J Theor Biol. 2016; 401:30\u20137.","journal-title":"J Theor Biol"},{"issue":"1","key":"3296_CR17","doi-asserted-by":"publisher","first-page":"562","DOI":"10.1186\/1471-2105-11-562","volume":"11","author":"S Jain","year":"2010","unstructured":"Jain S, Bader GD. An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology. BMC Bioinformatics. 2010; 11(1):562. 
https:\/\/doi.org\/10.1186\/1471-2105-11-562.","journal-title":"BMC Bioinformatics"},{"issue":"7","key":"3296_CR18","doi-asserted-by":"publisher","first-page":"2137","DOI":"10.1093\/nar\/gkl219","volume":"34","author":"X Wu","year":"2006","unstructured":"Wu X, Zhu L, Guo J, Zhang D-Y, Lin K. Prediction of yeast protein\u2013protein interaction network: insights from the gene ontology and annotations. Nucleic Acids Res. 2006; 34(7):2137\u201350.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"3296_CR19","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1186\/1471-2105-6-100","volume":"6","author":"A Patil","year":"2005","unstructured":"Patil A, Nakamura H. Filtering high-throughput protein-protein interaction data using a combination of genomic features. BMC Bioinformatics. 2005; 6(1):100.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"3296_CR20","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1186\/1471-2105-5-154","volume":"5","author":"N Lin","year":"2004","unstructured":"Lin N, Wu B, Jansen R, Gerstein M, Zhao H. Information assessment on predicting protein-protein interactions. BMC Bioinformatics. 2004; 5(1):154.","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"3296_CR21","doi-asserted-by":"publisher","first-page":"1064","DOI":"10.1007\/s11427-014-4747-6","volume":"57","author":"M Li","year":"2014","unstructured":"Li M, Li Q, Ganegoda GU, Wang J, Wu F, Pan Y. Prioritization of orphan disease-causing genes using topological feature and GO similarity between proteins in interaction networks. Sci China Life Sci. 2014; 57(11):1064\u201371.","journal-title":"Sci China Life Sci"},{"issue":"1","key":"3296_CR22","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1186\/1471-2105-7-135","volume":"7","author":"P Zhang","year":"2006","unstructured":"Zhang P, Zhang J, Sheng H, Russo JJ, Osborne B, Buetow K. Gene functional similarity search tool (GFSST). BMC Bioinformatics. 
2006; 7(1):135.","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"3296_CR23","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1186\/gb-2003-4-11-r75","volume":"4","author":"FS Turner","year":"2003","unstructured":"Turner FS, Clutterbuck DR, Semple CA. POCUS: mining genomic sequence annotation to predict disease genes. Genome Biol. 2003; 4(11):75.","journal-title":"Genome Biol"},{"issue":"3","key":"3296_CR24","doi-asserted-by":"publisher","first-page":"316","DOI":"10.1038\/ng895","volume":"31","author":"C Perez-Iratxeta","year":"2002","unstructured":"Perez-Iratxeta C, Bork P, Andrade MA. Association of genes to genetically inherited diseases using data mining. Nat Genet. 2002; 31(3):316.","journal-title":"Nat Genet"},{"issue":"suppl_2","key":"3296_CR25","doi-asserted-by":"publisher","first-page":"110","DOI":"10.1093\/bioinformatics\/18.suppl_2.S110","volume":"18","author":"J Freudenberg","year":"2002","unstructured":"Freudenberg J, Propping P. A similarity-based method for genome-wide prediction of disease-relevant human genes. Bioinformatics. 2002; 18(suppl_2):110\u201315.","journal-title":"Bioinformatics"},{"issue":"4","key":"3296_CR26","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1186\/1471-2105-7-S4-S11","volume":"7","author":"Z-H Duan","year":"2006","unstructured":"Duan Z-H, Hughes B, Reichel L, Perez DM, Shi T. The relationship between protein sequences and their gene ontology functions. BMC Bioinformatics. 2006; 7(4):11.","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"3296_CR27","doi-asserted-by":"publisher","first-page":"2739","DOI":"10.1093\/bioinformatics\/bti406","volume":"21","author":"PH Lee","year":"2005","unstructured":"Lee PH, Lee D. Modularized learning of genetic interaction networks from biological annotations and mRNA expression data. Bioinformatics. 
2005; 21(11):2739\u201347.","journal-title":"Bioinformatics"},{"issue":"1","key":"3296_CR28","doi-asserted-by":"publisher","first-page":"491","DOI":"10.1186\/1471-2105-7-491","volume":"7","author":"Z Lei","year":"2006","unstructured":"Lei Z, Dai Y. Assessing protein similarity with gene ontology and its use in subnuclear localization prediction. BMC Bioinformatics. 2006; 7(1):491.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"3296_CR29","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1186\/1747-5333-1-19","volume":"1","author":"FM Couto","year":"2006","unstructured":"Couto FM, Silva MJ, Lee V, Dimmer E, Camon E, Apweiler R, Kirsch H, Rebholz-Schuhmann D. GOAnnotator: linking protein GO annotations to evidence text. J Biomed Discov Collab. 2006; 1(1):19.","journal-title":"J Biomed Discov Collab"},{"issue":"5","key":"3296_CR30","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1016\/j.ajhg.2008.09.017","volume":"83","author":"PN Robinson","year":"2008","unstructured":"Robinson PN, K\u00f6hler S, Bauer S, Seelow D, Horn D, Mundlos S. The human phenotype ontology: a tool for annotating and analyzing human hereditary disease. Am J Hum Genet. 2008; 83(5):610\u20135.","journal-title":"Am J Hum Genet"},{"issue":"4","key":"3296_CR31","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1016\/j.ajhg.2009.09.003","volume":"85","author":"S K\u00f6hler","year":"2009","unstructured":"K\u00f6hler S, Schulz MH, Krawitz P, Bauer S, D\u00f6lken S, Ott CE, Mundlos C, Horn D, Mundlos S, Robinson PN. Clinical diagnostics in human genetics with semantic similarity searches in ontologies. Am J Hum Genet. 2009; 85(4):457\u201364.","journal-title":"Am J Hum Genet"},{"issue":"18","key":"3296_CR32","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1093\/nar\/gkr538","volume":"39","author":"R Hoehndorf","year":"2011","unstructured":"Hoehndorf R, Schofield PN, Gkoutos GV. PhenomeNET: a whole-phenome approach to disease gene discovery. 
Nucleic Acids Res. 2011; 39(18):119.","journal-title":"Nucleic Acids Res"},{"key":"3296_CR33","unstructured":"Poli R, Langdon WB, McPhee NF, Koza JR. A Field Guide to Genetic Programming. Freely available at http:\/\/www.gp-field-guide.org.uk: Published via http:\/\/lulu.com; 2008."},{"key":"3296_CR34","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E. Scikit-learn: Machine learning in Python. J Mach Learn Res. 2011; 12:2825\u201330.","journal-title":"J Mach Learn Res"},{"issue":"3","key":"3296_CR35","doi-asserted-by":"publisher","first-page":"579","DOI":"10.1093\/biomet\/57.3.579","volume":"57","author":"N Breslow","year":"1970","unstructured":"Breslow N. A generalized Kruskal-Wallis test for comparing k samples subject to unequal patterns of censorship. Biometrika. 1970; 57(3):579\u201394.","journal-title":"Biometrika"},{"key":"3296_CR36","unstructured":"Jones E, Oliphant T, Peterson P, et al. SciPy: Open source scientific tools for Python. 2001."},{"issue":"suppl_1","key":"3296_CR37","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1093\/bioinformatics\/bti1016","volume":"21","author":"A Ben-Hur","year":"2005","unstructured":"Ben-Hur A, Noble WS. Kernel methods for predicting protein\u2013protein interactions. Bioinformatics. 2005; 21(suppl_1):38\u201346.","journal-title":"Bioinformatics"},{"issue":"20","key":"3296_CR38","doi-asserted-by":"publisher","first-page":"2610","DOI":"10.1093\/bioinformatics\/btq483","volume":"26","author":"J Yu","year":"2010","unstructured":"Yu J, Guo M, Needham CJ, Huang Y, Cai L, Westhead DR. Simple sequence-based kernels do not predict protein\u2013protein interactions. Bioinformatics. 
2010; 26(20):2610\u20134.","journal-title":"Bioinformatics"},{"key":"3296_CR39","doi-asserted-by":"publisher","first-page":"103","DOI":"10.7717\/peerj-cs.103","volume":"3","author":"A Meurer","year":"2017","unstructured":"Meurer A, Smith CP, Paprocki M, \u010cert\u00edk O, Kirpichev SB, Rocklin M, Kumar A, Ivanov S, Moore JK, Singh S, Rathnayake T, Vig S, Granger BE, Muller RP, Bonazzi F, Gupta H, Vats S, Johansson F, Pedregosa F, Curry MJ, Terrel AR, Rou\u010dka v, Saboo A, Fernando I, Kulal S, Cimrman R, Scopatz A. SymPy: symbolic computing in Python. PeerJ Comput Sci. 2017; 3:103. https:\/\/doi.org\/10.7717\/peerj-cs.103.","journal-title":"PeerJ Comput Sci"},{"key":"3296_CR40","volume-title":"Graph Drawing","author":"J Ellson","year":"2002","unstructured":"Ellson J, Gansner E, Koutsofios L, North SC, Woodhull G. Graphviz \u2013 open source graph drawing tools In: Mutzel P, J\u00fcnger M, Leipert S, editors. Graph Drawing. Berlin, Heidelberg: Springer: 2002. p. 483\u2013484."},{"issue":"2","key":"3296_CR41","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1007\/s10710-011-9150-5","volume":"13","author":"S Silva","year":"2012","unstructured":"Silva S, Dignum S, Vanneschi L. Operator equalisation for bloat free genetic programming and a survey of bloat control methods. Genet Program Evolvable Mach. 2012; 13(2):197\u2013238. https:\/\/doi.org\/10.1007\/s10710-011-9150-5.","journal-title":"Genet Program Evolvable Mach"},{"issue":"1","key":"3296_CR42","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/1471-2148-3-21","volume":"3","author":"JD Bloom","year":"2003","unstructured":"Bloom JD, Adami C. Apparent dependence of protein evolutionary rate on number of interactions is linked to biases in protein\u2013protein interactions data sets. BMC Evol Biol. 
2003; 3(1):21.","journal-title":"BMC Evol Biol"},{"issue":"1","key":"3296_CR43","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1186\/1471-2105-10-419","volume":"10","author":"Y Park","year":"2009","unstructured":"Park Y. Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences. BMC Bioinformatics. 2009; 10(1):419.","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"3296_CR44","doi-asserted-by":"publisher","first-page":"740","DOI":"10.1093\/bioinformatics\/btt581","volume":"30","author":"S Harispe","year":"2013","unstructured":"Harispe S, Ranwez S, Janaqi S, Montmain J. The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies. Bioinformatics. 2013; 30(5):740\u20132.","journal-title":"Bioinformatics"},{"issue":"D1","key":"3296_CR45","doi-asserted-by":"publisher","first-page":"1057","DOI":"10.1093\/nar\/gku1113","volume":"43","author":"RP Huntley","year":"2014","unstructured":"Huntley RP, Sawford T, Mutowo-Meullenet P, Shypitsyna A, Bonilla C, Martin MJ, O\u2019donovan C. The GOA database: gene ontology annotation updates for 2015. Nucleic Acids Res. 2014; 43(D1):1057\u201363.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"3296_CR46","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1186\/1471-2105-8-401","volume":"8","author":"RG C\u00f4t\u00e9","year":"2007","unstructured":"C\u00f4t\u00e9 RG, Jones P, Martens L, Kerrien S, Reisinger F, Lin Q, Leinonen R, Apweiler R, Hermjakob H. The protein identifier cross-referencing (PICR) service: reconciling protein identifiers across multiple source databases. BMC Bioinformatics. 2007; 8(1):401.","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"3296_CR47","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1093\/bib\/bbr066","volume":"13","author":"PH Guzzi","year":"2011","unstructured":"Guzzi PH, Mina M, Guerra C, Cannataro M. 
Semantic similarity analysis of protein data: assessment with biological features and issues. Brief Bioinformatics. 2011; 13(5):569\u201385.","journal-title":"Brief Bioinformatics"},{"key":"3296_CR48","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1016\/j.jbi.2013.11.006","volume":"48","author":"S Harispe","year":"2014","unstructured":"Harispe S, S\u00e1nchez D, Ranwez S, Janaqi S, Montmain J. A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain. J Biomed Inform. 2014; 48:38\u201353.","journal-title":"J Biomed Inform"},{"key":"3296_CR49","unstructured":"Pesquita C, Faria D, Bastos H, Falcao A, Couto F. Evaluating GO-based semantic similarity measures. In: Proceedings of the 10th Annual Bio-Ontologies Meeting. Vienna: 2007. p. 37\u201340."},{"key":"3296_CR50","volume-title":"Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 1, IJCAI\u201995","author":"P Resnik","year":"1995","unstructured":"Resnik P. Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 1, IJCAI\u201995. San Francisco: Morgan Kaufmann Publishers Inc.: 1995. p. 448\u2013453. http:\/\/dl.acm.org\/citation.cfm?id=1625855.1625914."},{"key":"3296_CR51","volume-title":"Proceedings of the 16th European Conference on Artificial Intelligence, ECAI\u201904","author":"N Seco","year":"2004","unstructured":"Seco N, Veale T, Hayes J. An intrinsic information content metric for semantic similarity in WordNet. In: Proceedings of the 16th European Conference on Artificial Intelligence, ECAI\u201904. Amsterdam: IOS Press: 2004. p. 1089\u20131090. http:\/\/dl.acm.org\/citation.cfm?id=3000001.3000272."},{"key":"3296_CR52","volume-title":"The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World","author":"P Domingos","year":"2015","unstructured":"Domingos P. 
The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World. New York: Basic Books, Inc.; 2015."},{"key":"3296_CR53","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-05094-1","volume-title":"Introduction to Evolutionary Computing, 53","author":"AE Eiben","year":"2003","unstructured":"Eiben AE, Smith JE, et al. Introduction to Evolutionary Computing, 53. Berlin Heidelberg: Springer; 2003."},{"key":"3296_CR54","volume-title":"Foundations of Genetic Programming","author":"WB Langdon","year":"2013","unstructured":"Langdon WB, Poli R. Foundations of Genetic Programming. Berlin Heidelberg: Springer; 2013."},{"key":"3296_CR55","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-20883-1","volume-title":"Handbook of Genetic Programming Applications","author":"AH Gandomi","year":"2015","unstructured":"Gandomi AH, Alavi AH, Ryan C. Handbook of Genetic Programming Applications. Cham: Springer; 2015."},{"issue":"9","key":"3296_CR56","doi-asserted-by":"publisher","first-page":"1159","DOI":"10.1093\/bioinformatics\/btm066","volume":"23","author":"M Brameier","year":"2007","unstructured":"Brameier M, Krings A, MacCallum RM. Nucpred\u2014predicting nuclear localization of proteins. Bioinformatics. 2007; 23(9):1159\u201360.","journal-title":"Bioinformatics"},{"issue":"10","key":"3296_CR57","doi-asserted-by":"publisher","first-page":"3263","DOI":"10.1093\/nar\/gki644","volume":"33","author":"P S\u00e6trom","year":"2005","unstructured":"S\u00e6trom P, Sneve R, Kristiansen KI, Sn\u00f8ve O, Gr\u00fcnfeld T, Rognes T, Seeberg E. Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming. Nucleic Acids Res. 2005; 33(10):3263\u201370.","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"3296_CR58","doi-asserted-by":"publisher","first-page":"0202685","DOI":"10.1371\/journal.pone.0202685","volume":"13","author":"CA Bannister","year":"2018","unstructured":"Bannister CA, Halcox JP, Currie CJ, Preece A, Spasi\u0107 I. 
A genetic programming approach to development of clinical prediction models: A case study in symptomatic cardiovascular disease. PloS One. 2018; 13(9):0202685.","journal-title":"PloS One"},{"issue":"3","key":"3296_CR59","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1007\/s10710-010-9112-3","volume":"11","author":"J. R. Koza","year":"2010","unstructured":"Koza JR. Human-competitive results produced by genetic programming. Genet Program Evolvable Mach. 2010; 11(3):251\u201384. https:\/\/doi.org\/10.1007\/s10710-010-9112-3.","journal-title":"Genet Program Evolvable Mach"},{"key":"3296_CR60","volume-title":"Genetic Programming: On the Programming of Computers by Means of Natural Selection","author":"JR Koza","year":"1992","unstructured":"Koza JR. Genetic Programming: On the Programming of Computers by Means of Natural Selection. Cambridge, USA: MIT Press; 1992."},{"key":"3296_CR61","doi-asserted-by":"publisher","unstructured":"Sipper M, Fu W, Ahuja K, Moore JH. Investigating the parameter space of evolutionary algorithms. BioData Min. 2018; 11(1). https:\/\/doi.org\/10.1186\/s13040-018-0164-x.","DOI":"10.1186\/s13040-018-0164-x"},{"key":"3296_CR62","doi-asserted-by":"crossref","unstructured":"Espejo PG, Ventura S, Herrera F. Applications and Reviews IEEE Trans Syst Man Cybern Part C Appl Rev. 2009; 40(2):121\u201344.","DOI":"10.1109\/TSMCC.2009.2033566"},{"key":"3296_CR63","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1016\/j.swevo.2017.11.003","volume":"39","author":"S Silva","year":"2018","unstructured":"Silva S, Vanneschi L, Cabral AIR, Vasconcelos MJ. A semi-supervised genetic programming method for dealing with noisy labels and hidden overfitting. Swarm Evol Comput. 2018; 39:323\u201338. 
https:\/\/doi.org\/10.1016\/j.swevo.2017.11.003.","journal-title":"Swarm Evol Comput"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3296-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-019-3296-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3296-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,1,2]],"date-time":"2021-01-02T00:10:08Z","timestamp":1609546208000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-019-3296-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,3]]},"references-count":63,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3296"],"URL":"https:\/\/doi.org\/10.1186\/s12859-019-3296-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,1,3]]},"assertion":[{"value":"6 May 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 November 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 January 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for 
publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"6"}}