{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,28]],"date-time":"2025-06-28T23:21:01Z","timestamp":1751152861580,"version":"3.37.3"},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,8,30]],"date-time":"2016-08-30T00:00:00Z","timestamp":1472515200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,8,30]],"date-time":"2016-08-30T00:00:00Z","timestamp":1472515200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["GM072014","GM081680"],"award-info":[{"award-number":["GM072014","GM081680"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Sequence matching is extremely important for applications throughout biology, particularly for discovering information such as functional and evolutionary relationships, and also for discriminating between unimportant and disease mutants. At present the functions of a large fraction of genes are unknown; improvements in sequence matching will improve gene annotations. Universal amino acid substitution matrices such as Blosum62 are used to measure sequence similarities and to identify distant homologues, regardless of the structure class. However, such single matrices do not take into account important structural information evident within the different topologies of proteins and treats substitutions within all protein folds identically. Others have suggested that the use of structural information can lead to significant improvements in sequence matching but this has not yet been very effective. Here we develop novel substitution matrices that include not only general sequence information but also have a topology specific component that is unique for each CATH topology. This novel feature of using a combination of sequence and structure information for each protein topology significantly improves the sequence matching scores for the sequence pairs tested. We have used a novel multi-structure alignment method for each homology level of CATH in order to extract topological information.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We obtain statistically significant improved sequence matching scores for 73\u00a0% of the alpha helical test cases. On average, 61\u00a0% of the test cases showed improvements in homology detection when structure information was incorporated into the substitution matrices. On average z-scores for homology detection are improved by more than 54\u00a0% for all cases, and some individual cases have z-scores more than twice those obtained using generic matrices. Our topology specific similarity matrices also outperform other traditional similarity matrices and single matrix based structure methods. When default amino acid substitution matrix in the Psi-blast algorithm is replaced by our structure-based matrices, the structure matching is significantly improved over conventional Psi-blast. It also outperforms results obtained for the corresponding HMM profiles generated for each topology.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>We show that by incorporating topology-specific structure information in addition to sequence information into specific amino acid substitution matrices, the sequence matching scores and homology detection are significantly improved. Our topology specific similarity matrices outperform other traditional similarity matrices, single matrix based structure methods, also show improvement over conventional Psi-blast and HMM profile based methods in sequence matching. The results support the discriminatory ability of the new amino acid similarity matrices to distinguish between distant homologs and structurally dissimilar pairs.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-016-1198-z","type":"journal-article","created":{"date-parts":[[2016,8,30]],"date-time":"2016-08-30T13:20:23Z","timestamp":1472563223000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Fold-specific sequence scoring improves protein sequence matching"],"prefix":"10.1186","volume":"17","author":[{"given":"Sumudu P.","family":"Leelananda","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrzej","family":"Kloczkowski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robert L.","family":"Jernigan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2016,8,30]]},"reference":[{"key":"1198_CR1","doi-asserted-by":"publisher","first-page":"1777","DOI":"10.1101\/gr.3866105","volume":"15","author":"MR Brent","year":"2005","unstructured":"Brent MR. Genome annotation past, present, and future: How to define an ORF at each locus. Genome Res. 2005;15:1777\u201386.","journal-title":"Genome Res"},{"key":"1198_CR2","doi-asserted-by":"crossref","unstructured":"Reed J, Famili I, Thiele I, Palsson B. Towards multidimensional genome annotation. Nat Rev Genet.\u00a02006;7:130\u201341.","DOI":"10.1038\/nrg1769"},{"key":"1198_CR3","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1146\/annurev.genom.4.070802.110300","volume":"4","author":"JL Ashurst","year":"2003","unstructured":"Ashurst JL, Collins JE. Gene annotation: Prediction and testing. Annu Rev Genom Human Genet. 2003;4:69\u201388.","journal-title":"Annu Rev Genom Human Genet"},{"key":"1198_CR4","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1038\/nrg3174","volume":"13","author":"M Yandell","year":"2012","unstructured":"Yandell M, Ence D. A beginner\u2019s guide to eukaryotic genome annotation. Nat Rev Genet. 2012;13:329\u201342.","journal-title":"Nat Rev Genet"},{"key":"1198_CR5","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/0079-6107(89)90011-4","volume":"54","author":"WR Taylor","year":"1989","unstructured":"Taylor WR. A template based method of pattern matching in protein sequences. Prog Biophys Mol Biol. 1989;54:159\u2013252.","journal-title":"Prog Biophys Mol Biol"},{"key":"1198_CR6","doi-asserted-by":"crossref","unstructured":"Barton GJ Protein multiple sequence alignment and flexible pattern matching. In Methods in Enzymology. Volume 183 edition: Academic Press, San Diego CA; 1990:403\u2013428.","DOI":"10.1016\/0076-6879(90)83027-7"},{"key":"1198_CR7","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1038\/35080529","volume":"2","author":"L Stein","year":"2001","unstructured":"Stein L. Genome annotation: From sequence to biology. Nat Rev Genet. 2001;2:493\u2013503.","journal-title":"Nat Rev Genet"},{"key":"1198_CR8","doi-asserted-by":"crossref","unstructured":"Lambert C, Campenhout JV, DeBolle X, Depiereux E. Review of common sequence alignment methods: clues to enhance reliability. Curr Genomics. 2003;4:131\u201346.","DOI":"10.2174\/1389202033350038"},{"key":"1198_CR9","doi-asserted-by":"publisher","first-page":"891","DOI":"10.1002\/prot.21770","volume":"71","author":"M Kosloff","year":"2008","unstructured":"Kosloff M, Kolodny R. Sequence-similar, structure-dissimilar protein pairs in the PDB. Proteins. 2008;71:891\u2013902.","journal-title":"Proteins"},{"key":"1198_CR10","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1093\/protein\/12.2.85","volume":"12","author":"B Rost","year":"1999","unstructured":"Rost B. Twilight zone of protein sequence alignments. Protein Eng. 1999;12:85\u201394.","journal-title":"Protein Eng"},{"key":"1198_CR11","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1002\/prot.22458","volume":"77","author":"K Illergard","year":"2009","unstructured":"Illergard K, Ardell D, Elofison A. Structure is three to ten times more conserved than sequence\u2212A study of structural response in protein cores. Proteins. 2009;77:499\u2013508.","journal-title":"Proteins"},{"key":"1198_CR12","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1002\/prot.21434","volume":"67","author":"AD Solis","year":"2007","unstructured":"Solis AD, Rackovsky S. Property-based sequence representations do not adequately encode local protein folding information. Proteins. 2007;67:785\u20138.","journal-title":"Proteins"},{"key":"1198_CR13","doi-asserted-by":"publisher","first-page":"14345","DOI":"10.1073\/pnas.0903433106","volume":"106","author":"S Rackovsky","year":"2009","unstructured":"Rackovsky S. Sequence physical properties encode the global organization of protein structure space. Proc Natl Acad Sci. 2009;106:14345\u20138.","journal-title":"Proc Natl Acad Sci"},{"key":"1198_CR14","doi-asserted-by":"publisher","first-page":"1681","DOI":"10.1002\/prot.24328","volume":"81","author":"S Rackovsky","year":"2013","unstructured":"Rackovsky S. Sequence determinants of protein architecture. Proteins. 2013;81:1681\u20135.","journal-title":"Proteins"},{"key":"1198_CR15","doi-asserted-by":"crossref","unstructured":"Schwartz RM, Dayhoff MO. Origins of prokaryotes, eukaryotes, mitochondria, and chloroplasts. Science. 1978;199:395-403.","DOI":"10.1126\/science.202030"},{"key":"1198_CR16","doi-asserted-by":"publisher","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","volume":"89","author":"S Henikoff","year":"1992","unstructured":"Henikoff S, Henikoff J. Amino-acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992;89:10915\u20139.","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1198_CR17","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1093\/protein\/6.3.267","volume":"6","author":"S Miyazawa","year":"1993","unstructured":"Miyazawa S, Jernigan RL. A new substitution matrix for protein sequence searches based on contact frequencies in protein structures. Protein Eng. 1993;6:267\u201378.","journal-title":"Protein Eng"},{"key":"1198_CR18","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1002\/prot.21020","volume":"64","author":"Y Tan","year":"2006","unstructured":"Tan Y, Huang H, Kihara D. Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences. Proteins. 2006;64:587\u2013600.","journal-title":"Proteins"},{"key":"1198_CR19","doi-asserted-by":"publisher","first-page":"847","DOI":"10.1093\/bioinformatics\/btg492","volume":"20","author":"RB Vilim","year":"2004","unstructured":"Vilim RB, Cunningham RM, Lu B, Kheradpour P, Stevens FJ. Fold-specific substitution matrices for protein classification. Bioinformatics. 2004;20:847\u201353.","journal-title":"Bioinformatics"},{"key":"1198_CR20","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1002\/(SICI)1097-0134(1997)1+<134::AID-PROT18>3.0.CO;2-P","volume":"29","author":"K Karplus","year":"1998","unstructured":"Karplus K, Sjolander K, Barrett C, Cline M, Haussler D, Hughey R, Holm L, Sander C. Predicting protein structure using hidden Markov models. Proteins. 1998;29:134\u20139.","journal-title":"Proteins"},{"key":"1198_CR21","doi-asserted-by":"crossref","unstructured":"Di Francesco V, Geetha V, Garnier J, Munson PJ. Fold recognition using predicted secondary structure sequences and hidden Markov models of protein folds. Proteins. 1997;1:123-31.","DOI":"10.1002\/(SICI)1097-0134(1997)1+<123::AID-PROT16>3.0.CO;2-Q"},{"key":"1198_CR22","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1093\/oxfordjournals.molbev.a003985","volume":"19","author":"T Muller","year":"2002","unstructured":"Muller T, Spang R, Vingron M. Estimating amino acid substitution models: A comparison of Dayhoff\u2019s estimator, the resolvent approach and a maximum likelihood method. Mol Biol Evol. 2002;19:8\u201313.","journal-title":"Mol Biol Evol"},{"key":"1198_CR23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1756-0500-4-296","volume":"4","author":"IB Kuznetsov","year":"2011","unstructured":"Kuznetsov IB. Protein sequence alignment with family-specific amino acid similarity matrices. BMC Res Notes. 2011;4:1\u201310.","journal-title":"BMC Res Notes"},{"key":"1198_CR24","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1002\/prot.340100307","volume":"10","author":"R Luthy","year":"1991","unstructured":"Luthy R, McLachlan AD, Eisenberg D. Secondary structure-based profiles: Use of structure-conserving scoring tables in searching protein sequence databases for structural similarities. Proteins. 1991;10:229\u201339.","journal-title":"Proteins"},{"key":"1198_CR25","doi-asserted-by":"publisher","first-page":"481","DOI":"10.1016\/0022-2836(91)90188-C","volume":"219","author":"K Niefind","year":"1991","unstructured":"Niefind K, Schomburg D. Amino acid similarity coefficients for protein modeling and sequence alignment derived from main-chain folding angles. J Mol Biol. 1991;219:481\u201397.","journal-title":"J Mol Biol"},{"key":"1198_CR26","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1002\/pro.5560010203","volume":"1","author":"J Overington","year":"1992","unstructured":"Overington J, Donnelly D, Johnson MS, Sali A, Blundell TL. Environment-specific amino acid substitution tables: Tertiary templates and prediction of protein folds. Protein Sci. 1992;1:216\u201326.","journal-title":"Protein Sci"},{"key":"1198_CR27","doi-asserted-by":"publisher","first-page":"641","DOI":"10.1093\/protein\/8.7.641","volume":"8","author":"JM Koshi","year":"1995","unstructured":"Koshi JM, Goldstein RA. Context-dependent optimal substitution matrices. Protein Eng. 1995;8:641\u20135.","journal-title":"Protein Eng"},{"key":"1198_CR28","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1006\/jmbi.1997.1019","volume":"269","author":"RB Russell","year":"1997","unstructured":"Russell RB, Saqi MAS, Sayle RA, Bates PA, Sternberg MJE. Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation. J Mol Biol. 1997;269:423\u201339.","journal-title":"J Mol Biol"},{"key":"1198_CR29","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1093\/protein\/9.1.27","volume":"9","author":"K Tomii","year":"1996","unstructured":"Tomii K, Kanehisa M. Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng. 1996;9:27\u201336.","journal-title":"Protein Eng"},{"key":"1198_CR30","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1093\/bioinformatics\/btt694","volume":"30","author":"K Yamada","year":"2014","unstructured":"Yamada K, Tomii K. Revisiting amino acid substitution matrices for identifying distantly related proteins. Bioinformatics. 2014;30:317\u201325.","journal-title":"Bioinformatics"},{"key":"1198_CR31","doi-asserted-by":"publisher","first-page":"1323","DOI":"10.1093\/protein\/7.11.1323","volume":"7","author":"SA Bennet","year":"1994","unstructured":"Bennet SA, Cohen MA, Gonnet GH. Amino acid substitution during functionally constrained divergent evolution of protein sequences. Protein Eng. 1994;7:1323\u201332.","journal-title":"Protein Eng"},{"key":"1198_CR32","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1093\/protein\/13.8.545","volume":"13","author":"A Prlic","year":"2000","unstructured":"Prlic A, Domingues F, Sippl M. Structure-derived substitution matrices for alignment of distantly related sequences. Protein Eng Des Sel. 2000;13:545\u201350.","journal-title":"Protein Eng Des Sel"},{"key":"1198_CR33","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1002\/prot.10474","volume":"54","author":"O Teodorescu","year":"2004","unstructured":"Teodorescu O, Galor T, Pillardy J, Elber R. Enriching the sequence substitution matrix by structural information. Proteins. 2004;54:41\u20138.","journal-title":"Proteins"},{"key":"1198_CR34","doi-asserted-by":"publisher","first-page":"716","DOI":"10.1006\/jmbi.1993.1548","volume":"233","author":"MS Johnson","year":"1993","unstructured":"Johnson MS, Overington JP. A Structural Basis for Sequence Comparisons: An Evaluation of Scoring Methodologies. J Mol Biol. 1993;233:716\u201338.","journal-title":"J Mol Biol"},{"key":"1198_CR35","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1006\/jmbi.2001.4495","volume":"307","author":"JD Blake","year":"2001","unstructured":"Blake JD, Cohen FE. Pairwise sequence alignment below the twilight zone. J Mol Biol. 2001;307:721\u201335.","journal-title":"J Mol Biol"},{"key":"1198_CR36","doi-asserted-by":"publisher","first-page":"S19","DOI":"10.1186\/1471-2164-13-S6-S19","volume":"13","author":"J Ali","year":"2012","unstructured":"Ali J, Thummala S, Ranjan A. The parasite specific substitution matrices improve the annotation of apicomplexan proteins. BMC Genomics. 2012;13:S19.","journal-title":"BMC Genomics"},{"key":"1198_CR37","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","volume":"5","author":"CA Orengo","year":"1997","unstructured":"Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM. CATH-a hierarchic classification of protein domain structures. Structure. 1997;5:1093\u2013109.","journal-title":"Structure"},{"key":"1198_CR38","doi-asserted-by":"publisher","first-page":"172","DOI":"10.1002\/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.0.CO;2-F","volume":"29","author":"I Bahar","year":"1997","unstructured":"Bahar I, Atilgan A, Jernigan R, Erman B. Understanding the recognition of protein structural classes by amino acid composition. Proteins. 1997;29:172\u201385.","journal-title":"Proteins"},{"key":"1198_CR39","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1016\/0014-5793(95)00245-5","volume":"363","author":"KC Chou","year":"1995","unstructured":"Chou KC. Does the folding type of a protein depend on its amino acid composition? FEBS Lett. 1995;363:127\u201331.","journal-title":"FEBS Lett"},{"key":"1198_CR40","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1002\/prot.20921","volume":"64","author":"A Konagurthu","year":"2006","unstructured":"Konagurthu A, Whisstock J, Stuckey P, Lesk A. MUSTANG: A multiple structural alignment algorithm. Proteins. 2006;64:559\u201374.","journal-title":"Proteins"},{"key":"1198_CR41","doi-asserted-by":"crossref","unstructured":"Muller T, Vingron M. Modeling Amino Acid Replacement. J Comput Biol. 2000;7:761-76.","DOI":"10.1089\/10665270050514918"},{"key":"1198_CR42","doi-asserted-by":"publisher","first-page":"350","DOI":"10.1110\/ps.18602","volume":"11","author":"I Friedberg","year":"2002","unstructured":"Friedberg I, Margalit H. Persistently conserved positions in structurally similar, sequence dissimilar proteins: Roles in preserving protein fold and function. Protein Sci. 2002;11:350\u201360.","journal-title":"Protein Sci"},{"key":"1198_CR43","doi-asserted-by":"crossref","unstructured":"Gniewek P, Kolinski A, Gront D. Optimization of Profile-to-Profile Alignment Parameters for One-Dimensional Threading. J Comput Biol. 2012;19:879-86.","DOI":"10.1089\/cmb.2011.0307"},{"key":"1198_CR44","doi-asserted-by":"publisher","first-page":"621","DOI":"10.1093\/bioinformatics\/btk037","volume":"22","author":"D Gront","year":"2006","unstructured":"Gront D, Kolinski A. BioShell\u2212a package of tools for structural biology computations. Bioinformatics. 2006;22:621\u20132.","journal-title":"Bioinformatics"},{"key":"1198_CR45","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389\u2013402.","journal-title":"Nucleic Acids Res"},{"key":"1198_CR46","doi-asserted-by":"publisher","first-page":"951","DOI":"10.1093\/bioinformatics\/bti125","volume":"21","author":"J Soding","year":"2005","unstructured":"Soding J. Protein homology detection by HMM\u0393\u00c7\u00f4HMM comparison. Bioinformatics. 2005;21:951\u201360.","journal-title":"Bioinformatics"},{"key":"1198_CR47","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1186\/1471-2105-8-435","volume":"8","author":"J Bernardes","year":"2007","unstructured":"Bernardes J, Davila A, Costa V, Zaverucha G. Improving model construction of profile HMMs for remote homology detection through structural alignment. BMC Bioinformatics. 2007;8:435.","journal-title":"BMC Bioinformatics"},{"key":"1198_CR48","doi-asserted-by":"publisher","first-page":"3541","DOI":"10.1016\/j.proeng.2012.06.408","volume":"38","author":"A Pal","year":"2012","unstructured":"Pal A, Mishra D, Mishra S, Satapathy SK, Das K. A Study on Protein (P-glycoprotein) Homology Detection using Hidden Markov Model. Procedia Eng. 2012;38:3541\u20136.","journal-title":"Procedia Eng"},{"key":"1198_CR49","doi-asserted-by":"publisher","first-page":"755","DOI":"10.1093\/bioinformatics\/14.9.755","volume":"14","author":"SR Eddy","year":"1998","unstructured":"Eddy SR. Profile hidden Markov models. Bioinformatics. 1998;14:755\u201363.","journal-title":"Bioinformatics"},{"key":"1198_CR50","doi-asserted-by":"publisher","first-page":"e1002195","DOI":"10.1371\/journal.pcbi.1002195","volume":"7","author":"SR Eddy","year":"2011","unstructured":"Eddy SR. Accelerated Profile HMM Searches. PLoS Comput Biol. 2011;7:e1002195.","journal-title":"PLoS Comput Biol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1198-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1198-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1198-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1198-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T18:09:07Z","timestamp":1706810947000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1198-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,30]]},"references-count":50,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1198"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1198-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2016,8,30]]},"assertion":[{"value":"21 March 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 August 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 August 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"328"}}