{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T16:15:58Z","timestamp":1772727358932,"version":"3.50.1"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Elucidating protein-protein interactions (PPIs) is essential to constructing protein interaction networks and facilitating our understanding of the general principles of biological systems. Previous studies have revealed that interacting protein pairs can be predicted by their primary structure. Most of these approaches have achieved satisfactory performance on datasets comprising equal number of interacting and non-interacting protein pairs. However, this ratio is highly unbalanced in nature, and these techniques have not been comprehensively evaluated with respect to the effect of the large number of non-interacting pairs in realistic datasets. Moreover, since highly unbalanced distributions usually lead to large datasets, more efficient predictors are desired when handling such challenging tasks.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>This study presents a method for PPI prediction based only on sequence information, which contributes in three aspects. First, we propose a probability-based mechanism for transforming protein sequences into feature vectors. Second, the proposed predictor is designed with an efficient classification algorithm, where the efficiency is essential for handling highly unbalanced datasets. Third, the proposed PPI predictor is assessed with several unbalanced datasets with different positive-to-negative ratios (from 1:1 to 1:15). This analysis provides solid evidence that the degree of dataset imbalance is important to PPI predictors.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Dealing with data imbalance is a key issue in PPI prediction since there are far fewer interacting protein pairs than non-interacting ones. This article provides a comprehensive study on this issue and develops a practical tool that achieves both good prediction performance and efficiency using only protein sequence information.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-11-167","type":"journal-article","created":{"date-parts":[[2010,4,2]],"date-time":"2010-04-02T18:13:50Z","timestamp":1270232030000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":65,"title":["Predicting protein-protein interactions in unbalanced data using the primary structure of proteins"],"prefix":"10.1186","volume":"11","author":[{"given":"Chi-Yuan","family":"Yu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lih-Ching","family":"Chou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Darby Tien-Hao","family":"Chang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2010,4,2]]},"reference":[{"issue":"10","key":"3624_CR1","doi-asserted-by":"publisher","first-page":"551","DOI":"10.1016\/j.tig.2003.08.009","volume":"19","author":"H Ge","year":"2003","unstructured":"Ge H, Walhout AJM, Vidal M: Integrating 'omic' information: a bridge between genomics and systems biology. Trends Genet 2003, 19(10):551\u2013560. 10.1016\/j.tig.2003.08.009","journal-title":"Trends Genet"},{"issue":"1","key":"3624_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.physa.2004.12.030","volume":"352","author":"V Colizza","year":"2005","unstructured":"Colizza V, Flammini A, Maritan A, Vespignani A: Characterization and modeling of protein-protein interaction networks. Physica a-Statistical Mechanics and Its Applications 2005, 352(1):1\u201327. 10.1016\/j.physa.2004.12.030","journal-title":"Physica a-Statistical Mechanics and Its Applications"},{"issue":"6230","key":"3624_CR3","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1038\/340245a0","volume":"340","author":"S Fields","year":"1989","unstructured":"Fields S, Song OK: A Novel Genetic System to Detect Protein Protein Interactions. Nature 1989, 340(6230):245\u2013246. 10.1038\/340245a0","journal-title":"Nature"},{"issue":"8","key":"3624_CR4","doi-asserted-by":"publisher","first-page":"4569","DOI":"10.1073\/pnas.061034498","volume":"98","author":"T Ito","year":"2001","unstructured":"Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci USA 2001, 98(8):4569\u20134574. 10.1073\/pnas.061034498","journal-title":"Proc Natl Acad Sci USA"},{"issue":"6868","key":"3624_CR5","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1038\/415141a","volume":"415","author":"AC Gavin","year":"2002","unstructured":"Gavin AC, Bosche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon AM, Cruciat CM, et al.: Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 2002, 415(6868):141\u2013147. 10.1038\/415141a","journal-title":"Nature"},{"issue":"6868","key":"3624_CR6","doi-asserted-by":"publisher","first-page":"180","DOI":"10.1038\/415180a","volume":"415","author":"Y Ho","year":"2002","unstructured":"Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams SL, Millar A, Taylor P, Bennett K, Boutilier K, et al.: Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature 2002, 415(6868):180\u2013183. 10.1038\/415180a","journal-title":"Nature"},{"issue":"7084","key":"3624_CR7","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1038\/nature04532","volume":"440","author":"AC Gavin","year":"2006","unstructured":"Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, Marzioch M, Rau C, Jensen LJ, Bastuck S, Dumpelfeld B, et al.: Proteome survey reveals modularity of the yeast cell machinery. Nature 2006, 440(7084):631\u2013636. 10.1038\/nature04532","journal-title":"Nature"},{"issue":"5537","key":"3624_CR8","doi-asserted-by":"publisher","first-page":"2101","DOI":"10.1126\/science.1062191","volume":"293","author":"H Zhu","year":"2001","unstructured":"Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, Bertone P, Lan N, Jansen R, Bidlingmaier S, Houfek T, et al.: Global analysis of protein activities using proteome chips. Science 2001, 293(5537):2101\u20132105. 10.1126\/science.1062191","journal-title":"Science"},{"issue":"5553","key":"3624_CR9","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1126\/science.1064987","volume":"295","author":"AHY Tong","year":"2002","unstructured":"Tong AHY, Drees B, Nardelli G, Bader GD, Brannetti B, Castagnoli L, Evangelista M, Ferracuti S, Nelson B, Paoluzi S, et al.: A combined experimental and computational strategy to define protein interaction networks for peptide recognition modules. Science 2002, 295(5553):321\u2013324. 10.1126\/science.1064987","journal-title":"Science"},{"issue":"1","key":"3624_CR10","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1093\/nar\/gkg056","volume":"31","author":"GD Bader","year":"2003","unstructured":"Bader GD, Betel D, Hogue CWV: BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res 2003, 31(1):248\u2013250. 10.1093\/nar\/gkg056","journal-title":"Nucleic Acids Res"},{"key":"3624_CR11","doi-asserted-by":"publisher","first-page":"D449","DOI":"10.1093\/nar\/gkh086","volume":"32","author":"L Salwinski","year":"2004","unstructured":"Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res 2004, 32: D449-D451. 10.1093\/nar\/gkh086","journal-title":"Nucleic Acids Res"},{"key":"3624_CR12","doi-asserted-by":"publisher","first-page":"D436","DOI":"10.1093\/nar\/gkj003","volume":"34","author":"U Guldener","year":"2006","unstructured":"Guldener U, Munsterkotter M, Oesterheld M, Pagel P, Ruepp A, Mewes HW, Stumpflen V: MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res 2006, 34: D436-D441. 10.1093\/nar\/gkj003","journal-title":"Nucleic Acids Res"},{"key":"3624_CR13","doi-asserted-by":"publisher","first-page":"D561","DOI":"10.1093\/nar\/gkl958","volume":"35","author":"S Kerrien","year":"2007","unstructured":"Kerrien S, Alam-Faruque Y, Aranda B, Bancarz I, Bridge A, Derow C, Dimmer E, Feuermann M, Friedrichsen A, Huntley R, et al.: IntAct - open source resource for molecular interaction data. Nucleic Acids Res 2007, 35: D561-D565. 10.1093\/nar\/gkl958","journal-title":"Nucleic Acids Res"},{"issue":"7","key":"3624_CR14","doi-asserted-by":"publisher","first-page":"839","DOI":"10.1038\/nbt1116","volume":"23","author":"JDJ Han","year":"2005","unstructured":"Han JDJ, Dupuy D, Bertin N, Cusick ME, Vidal M: Effect of sampling on topology predictions of protein-protein interaction networks. Nat Biotechnol 2005, 23(7):839\u2013844. 10.1038\/nbt1116","journal-title":"Nat Biotechnol"},{"key":"3624_CR15","doi-asserted-by":"crossref","unstructured":"Hart GT, Ramani AK, Marcotte EM: How complete are current yeast and human protein-interaction networks? Genome Biol 2006., 7(11): 10.1186\/gb-2006-7-11-120","DOI":"10.1186\/gb-2006-7-11-120"},{"issue":"8","key":"3624_CR16","doi-asserted-by":"publisher","first-page":"4285","DOI":"10.1073\/pnas.96.8.4285","volume":"96","author":"M Pellegrini","year":"1999","unstructured":"Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO: Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles. Proc Natl Acad Sci USA 1999, 96(8):4285\u20134288. 10.1073\/pnas.96.8.4285","journal-title":"Proc Natl Acad Sci USA"},{"issue":"9","key":"3624_CR17","doi-asserted-by":"publisher","first-page":"5896","DOI":"10.1073\/pnas.092147999","volume":"99","author":"P Aloy","year":"2002","unstructured":"Aloy P, Russell RB: Interrogating protein interaction networks through structural biology. Proc Natl Acad Sci USA 2002, 99(9):5896\u20135901. 10.1073\/pnas.092147999","journal-title":"Proc Natl Acad Sci USA"},{"issue":"1","key":"3624_CR18","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1093\/bioinformatics\/19.1.161","volume":"19","author":"P Aloy","year":"2003","unstructured":"Aloy P, Russell RB: InterPreTS: protein Interaction Prediction through Tertiary Structure. Bioinformatics 2003, 19(1):161\u2013162. 10.1093\/bioinformatics\/19.1.161","journal-title":"Bioinformatics"},{"key":"3624_CR19","doi-asserted-by":"publisher","first-page":"W331","DOI":"10.1093\/nar\/gki585","volume":"33","author":"U Ogmen","year":"2005","unstructured":"Ogmen U, Keskin O, Aytuna AS, Nussinov R, Gursoy A: PRISM: protein interactions by structural matching. Nucleic Acids Res 2005, 33: W331-W336. 10.1093\/nar\/gki585","journal-title":"Nucleic Acids Res"},{"issue":"5428","key":"3624_CR20","doi-asserted-by":"publisher","first-page":"751","DOI":"10.1126\/science.285.5428.751","volume":"285","author":"EM Marcotte","year":"1999","unstructured":"Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D: Detecting protein function and protein-protein interactions from genome sequences. Science 1999, 285(5428):751\u2013753. 10.1126\/science.285.5428.751","journal-title":"Science"},{"issue":"22","key":"3624_CR21","doi-asserted-by":"publisher","first-page":"2608","DOI":"10.1093\/bioinformatics\/btn498","volume":"24","author":"TT Soong","year":"2008","unstructured":"Soong TT, Wrzeszczynski KO, Rost B: Physical protein-protein interactions predicted from microarrays. Bioinformatics 2008, 24(22):2608\u20132614. 10.1093\/bioinformatics\/btn498","journal-title":"Bioinformatics"},{"issue":"17","key":"3624_CR22","doi-asserted-by":"publisher","first-page":"3273","DOI":"10.1093\/bioinformatics\/bth366","volume":"20","author":"TW Huang","year":"2004","unstructured":"Huang TW, Tien AC, Lee YCG, Huang WS, Lee YCG, Peng CL, Tseng HH, Kao CY, Huang CYF: POINT: a database for the prediction of protein-protein interactions based on the orthologous interactome. Bioinformatics 2004, 20(17):3273\u20133276. 10.1093\/bioinformatics\/bth366","journal-title":"Bioinformatics"},{"issue":"16","key":"3624_CR23","doi-asserted-by":"publisher","first-page":"3360","DOI":"10.1093\/bioinformatics\/bti522","volume":"21","author":"J Espadaler","year":"2005","unstructured":"Espadaler J, Romero-Isart O, Jackson RM, Oliva B: Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships. Bioinformatics 2005, 21(16):3360\u20133368. 10.1093\/bioinformatics\/bti522","journal-title":"Bioinformatics"},{"issue":"4","key":"3624_CR24","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1371\/journal.pcbi.0030043","volume":"3","author":"BA Shoemaker","year":"2007","unstructured":"Shoemaker BA, Panchenko AR: Deciphering protein-protein interactions. Part II. Computational methods to predict protein and domain interaction partners. PLoS Comput Biol 2007, 3(4):595\u2013601. 10.1371\/journal.pcbi.0030043","journal-title":"PLoS Comput Biol"},{"issue":"5","key":"3624_CR25","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1093\/bioinformatics\/17.5.455","volume":"17","author":"JR Bock","year":"2001","unstructured":"Bock JR, Gough DA: Predicting protein-protein interactions from primary structure. Bioinformatics 2001, 17(5):455\u2013460. 10.1093\/bioinformatics\/17.5.455","journal-title":"Bioinformatics"},{"issue":"4","key":"3624_CR26","doi-asserted-by":"publisher","first-page":"876","DOI":"10.1002\/pmic.200401118","volume":"5","author":"SL Lo","year":"2005","unstructured":"Lo SL, Cai CZ, Chen YZ, Chung MCM: Effect of training datasets on support vector machine prediction of protein-protein interactions. Proteomics 2005, 5(4):876\u2013884. 10.1002\/pmic.200401118","journal-title":"Proteomics"},{"key":"3624_CR27","doi-asserted-by":"publisher","first-page":"I38","DOI":"10.1093\/bioinformatics\/bti1016","volume":"21","author":"A Ben-Hur","year":"2005","unstructured":"Ben-Hur A, Noble WS: Kernel methods for predicting protein-protein interactions. Bioinformatics 2005, 21: I38-I46. 10.1093\/bioinformatics\/bti1016","journal-title":"Bioinformatics"},{"issue":"24","key":"3624_CR28","doi-asserted-by":"publisher","first-page":"4394","DOI":"10.1093\/bioinformatics\/bti721","volume":"21","author":"XW Chen","year":"2005","unstructured":"Chen XW, Liu M: Prediction of protein-protein interactions using random decision forest framework. Bioinformatics 2005, 21(24):4394\u20134400. 10.1093\/bioinformatics\/bti721","journal-title":"Bioinformatics"},{"issue":"2","key":"3624_CR29","doi-asserted-by":"publisher","first-page":"218","DOI":"10.1093\/bioinformatics\/bth483","volume":"21","author":"S Martin","year":"2005","unstructured":"Martin S, Roe D, Faulon JL: Predicting protein-protein interactions using signature products. Bioinformatics 2005, 21(2):218\u2013226. 10.1093\/bioinformatics\/bth483","journal-title":"Bioinformatics"},{"issue":"2","key":"3624_CR30","doi-asserted-by":"publisher","first-page":"316","DOI":"10.1021\/pr050331g","volume":"5","author":"KC Chou","year":"2006","unstructured":"Chou KC, Cai YD: Predicting protein-protein interactions from sequences in a hybridization space. J Proteome Res 2006, 5(2):316\u2013322. 10.1021\/pr050331g","journal-title":"J Proteome Res"},{"issue":"10","key":"3624_CR31","doi-asserted-by":"publisher","first-page":"1207","DOI":"10.1093\/bioinformatics\/btl055","volume":"22","author":"L Nanni","year":"2006","unstructured":"Nanni L, Lumini A: An ensemble of K-local hyperplanes for predicting protein-protein interactions. Bioinformatics 2006, 22(10):1207\u20131210. 10.1093\/bioinformatics\/btl055","journal-title":"Bioinformatics"},{"key":"3624_CR32","volume-title":"BMC Bioinformatics","author":"S Pitre","year":"2006","unstructured":"Pitre S, Dehne F, Chan A, Cheetham J, Duong A, Emili A, Gebbia M, Greenblatt J, Jessulat M, Krogan N, et al.: PIPE: a protein-protein interaction prediction engine based on the re-occurring short polypeptide sequences between known interacting protein pairs. BMC Bioinformatics 2006., 7: 10.1186\/1471-2105-7-365"},{"issue":"11","key":"3624_CR33","doi-asserted-by":"publisher","first-page":"4337","DOI":"10.1073\/pnas.0607879104","volume":"104","author":"JW Shen","year":"2007","unstructured":"Shen JW, Zhang J, Luo XM, Zhu WL, Yu KQ, Chen KX, Li YX, Jiang HL: Predicting protein-protein interactions based only on sequences information. Proc Natl Acad Sci USA 2007, 104(11):4337\u20134341. 10.1073\/pnas.0607879104","journal-title":"Proc Natl Acad Sci USA"},{"issue":"9","key":"3624_CR34","doi-asserted-by":"publisher","first-page":"3025","DOI":"10.1093\/nar\/gkn159","volume":"36","author":"YZ Guo","year":"2008","unstructured":"Guo YZ, Yu LZ, Wen ZN, Li ML: Using support vector machine combined with auto covariance to predict proteinprotein interactions from protein sequences. Nucleic Acids Res 2008, 36(9):3025\u20133030. 10.1093\/nar\/gkn159","journal-title":"Nucleic Acids Res"},{"key":"3624_CR35","doi-asserted-by":"crossref","unstructured":"Najafabadi HS, Salavati R: Sequence-based prediction of protein-protein interactions by means of codon usage. Genome Biol 2008., 9(5): 10.1186\/gb-2008-9-5-r87","DOI":"10.1186\/gb-2008-9-5-r87"},{"issue":"10","key":"3624_CR36","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1038\/nbt1002-991","volume":"20","author":"GD Bader","year":"2002","unstructured":"Bader GD, Hogue CWV: Analyzing yeast protein-protein interaction data obtained from different sources. Nat Biotechnol 2002, 20(10):991\u2013997. 10.1038\/nbt1002-991","journal-title":"Nat Biotechnol"},{"issue":"6868","key":"3624_CR37","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1038\/415123a","volume":"415","author":"A Kumar","year":"2002","unstructured":"Kumar A, Snyder M: Protein complexes take the bait. Nature 2002, 415(6868):123\u2013124. 10.1038\/415123a","journal-title":"Nature"},{"issue":"1","key":"3624_CR38","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1109\/TNN.2004.836229","volume":"16","author":"YJ Oyang","year":"2005","unstructured":"Oyang YJ, Hwang SC, Ou YY, Chen CY, Chen ZW: Data classification with radial basis function networks based on a novel kernel density estimation algorithm. IEEE Transactions on Neural Networks 2005, 16(1):225\u2013236. 10.1109\/TNN.2004.836229","journal-title":"IEEE Transactions on Neural Networks"},{"key":"3624_CR39","doi-asserted-by":"crossref","first-page":"515","DOI":"10.3233\/ISB-00263","volume":"6","author":"S Dohkan","year":"2006","unstructured":"Dohkan S, Koike A, Takagi T: Improving the Performance of an SVM-Based Method for Predicting Protein-Protein Interactions. In Silico Biol 2006, 6: 515\u2013529.","journal-title":"In Silico Biol"},{"key":"3624_CR40","volume-title":"LIBSVM: a library for support vector machines","author":"CC Chang","year":"2001","unstructured":"Chang CC, Lin CJ: LIBSVM: a library for support vector machines.2001. [http:\/\/www.csie.ntu.edu.tw\/~cjlin\/libsvm]"},{"key":"3624_CR41","volume-title":"BMC Research Notes","author":"DT-H Chang","year":"2008","unstructured":"Chang DT-H, Ou Y-Y, Hung H-G, Yang M-H, Chen C-Y, Oyang Y-J: Prediction of protein secondary structures with a novel kernel density estimation based classifier. BMC Research Notes 2008., 1: 10.1186\/1756-0500-1-51"},{"issue":"10","key":"3624_CR42","doi-asserted-by":"publisher","first-page":"2363","DOI":"10.1101\/gr.1680803","volume":"13","author":"S Peri","year":"2003","unstructured":"Peri S, Navarro JD, Amanchy R, Kristiansen TZ, Jonnalagadda CK, Surendranath V, Niranjan V, Muthusamy B, Gandhi TKB, Gronborg M, et al.: Development of human protein reference database as an initial platform for approaching systems biology in humans. Genome Res 2003, 13(10):2363\u20132371. 10.1101\/gr.1680803","journal-title":"Genome Res"},{"key":"3624_CR43","doi-asserted-by":"publisher","first-page":"D411","DOI":"10.1093\/nar\/gkj141","volume":"34","author":"GR Mishra","year":"2006","unstructured":"Mishra GR, Suresh M, Kumaran K, Kannabiran N, Suresh S, Bala P, Shivakumar K, Anuradha N, Reddy R, Raghavan TM, et al.: Human protein reference database - 2006 update. Nucleic Acids Res 2006, 34: D411-D414. 10.1093\/nar\/gkj141","journal-title":"Nucleic Acids Res"},{"issue":"19","key":"3624_CR44","doi-asserted-by":"publisher","first-page":"6959","DOI":"10.1073\/pnas.0708078105","volume":"105","author":"MPH Stumpf","year":"2008","unstructured":"Stumpf MPH, Thorne T, de Silva E, Stewart R, An HJ, Lappe M, Wiuf C: Estimating the size of the human interactome. Proceedings of the National Academy of Sciences of the United States of America 2008, 105(19):6959\u20136964. 10.1073\/pnas.0708078105","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"3624_CR45","doi-asserted-by":"crossref","unstructured":"Kato M, Hata N, Banerjee N, Futcher B, Zhang MQ: Identifying combinatorial regulation of transcription factors and binding motifs. Genome Biology 2004., 5(8): 10.1186\/gb-2004-5-8-r56","DOI":"10.1186\/gb-2004-5-8-r56"},{"issue":"3","key":"3624_CR46","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1038\/nbt1068","volume":"23","author":"MA Fabian","year":"2005","unstructured":"Fabian MA, Biggs WH, Treiber DK, Atteridge CE, Azimioara MD, Benedetti MG, Carter TA, Ciceri P, Edeen PT, Floyd M, et al.: A small molecule-kinase interaction map for clinical kinase inhibitors. Nature Biotechnology 2005, 23(3):329\u2013336. 10.1038\/nbt1068","journal-title":"Nature Biotechnology"},{"issue":"3","key":"3624_CR47","doi-asserted-by":"publisher","first-page":"203","DOI":"10.2307\/2684655","volume":"50","author":"MJ Schervish","year":"1996","unstructured":"Schervish MJ: P Values: What They Are and What They Are Not. The American Statistician 1996, 50(3):203\u2013206. 10.2307\/2684655","journal-title":"The American Statistician"},{"key":"3624_CR48","volume-title":"The Gamma Function","author":"E Artin","year":"1964","unstructured":"Artin E: The Gamma Function. New York: Holt, Rinehart and Winston; 1964."},{"key":"3624_CR49","doi-asserted-by":"publisher","first-page":"D190","DOI":"10.1093\/nar\/gkn141","volume":"36","author":"A Bairoch","year":"2008","unstructured":"Bairoch A, Bougueleret L, Altairac S, Amendolia V, Auchincloss A, Puy GA, Axelsen K, Baratin D, Blatter MC, Boeckmann B, et al.: The Universal Protein Resource (UniProt). Nucleic Acids Res 2008, 36: D190-D195. 10.1093\/nar\/gkn141","journal-title":"Nucleic Acids Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-167.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,19]],"date-time":"2025-02-19T20:51:28Z","timestamp":1739998288000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-167"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,2]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3624"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-167","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,4,2]]},"assertion":[{"value":"9 October 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 April 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 April 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"167"}}