{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,31]],"date-time":"2025-12-31T08:58:12Z","timestamp":1767171492034,"version":"build-2238731810"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,5,24]],"date-time":"2021-05-24T00:00:00Z","timestamp":1621814400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,24]],"date-time":"2021-05-24T00:00:00Z","timestamp":1621814400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>Moonlighting proteins (MPs) are a subclass of multifunctional proteins in which more than one independent or usually distinct function occurs in a single polypeptide chain. Identification of unknown cellular processes, understanding novel protein mechanisms, improving the prediction of protein functions, and gaining information about protein evolution are the main reasons to study MPs. They also play an important role in disease pathways and drug-target discovery. Since detecting MPs experimentally is quite a challenge, most of them are detected randomly. Therefore, introducing an appropriate computational approach to predict MPs seems reasonable.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In this study, we introduced a competent model for detecting moonlighting and non-MPs through extracted features from protein sequences. We attempted to set up a well-judged scheme for detecting outlier proteins. Consequently, 37 distinct feature vectors were utilized to study each protein\u2019s impact on detecting MPs. Furthermore, 8 different classification methods were assessed to find the best performance. To detect outliers, each one of the classifications was executed 100 times by tenfold cross-validation on feature vectors; proteins which misclassified 90 times or more were grouped. This process was applied to every single feature vector and eventually the intersection of these groups was determined as the outlier proteins. The results of tenfold cross-validation on a dataset of 351 samples (containing 215 moonlighting and 136 non-moonlighting proteins) reveal that the SVM method on all feature vectors has the highest performance among all methods in this study and other available methods. Besides, the study of outliers showed that 57 of 351 proteins in the dataset could be an appropriate candidate for the outlier. Among the outlier proteins, there were non-MPs (such as P69797) that have been misclassified in 8 different classification methods with 16 different feature vectors. Because these proteins have been obtained by computational methods, the results of this study could reduce the likelihood of hypothesizing whether these proteins are non-moonlighting at all.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>MPs are difficult to be identified through experimentation. Using distinct feature vectors, our method enabled identification of novel moonlighting proteins. The study also pinpointed that a number of non-MPs are likely to be moonlighting.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-021-04194-5","type":"journal-article","created":{"date-parts":[[2021,5,24]],"date-time":"2021-05-24T01:02:39Z","timestamp":1621818159000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["Moonlighting protein prediction using physico-chemical and evolutional properties via machine learning methods"],"prefix":"10.1186","volume":"22","author":[{"given":"Farshid","family":"Shirafkan","sequence":"first","affiliation":[]},{"given":"Sajjad","family":"Gharaghani","sequence":"additional","affiliation":[]},{"given":"Karim","family":"Rahimian","sequence":"additional","affiliation":[]},{"given":"Reza Hasan","family":"Sajedi","sequence":"additional","affiliation":[]},{"given":"Javad","family":"Zahiri","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,5,24]]},"reference":[{"key":"4194_CR1","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1002\/pmic.201800083","volume":"18","author":"A Jain","year":"2018","unstructured":"Jain A, Gali H, Kihara D. Identification of moonlighting proteins in genomes using text mining techniques. Proteomics. 2018;18:21\u20132.","journal-title":"Proteomics"},{"key":"4194_CR2","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1016\/j.bbamcr.2010.01.022","volume":"1803","author":"DHEW Huberts","year":"2010","unstructured":"Huberts DHEW, van der Klei IJ. Moonlighting proteins: an intriguing mode of multitasking. Biochim Biophys Acta. 2010;1803:520\u20135.","journal-title":"Biochim Biophys Acta"},{"key":"4194_CR3","doi-asserted-by":"crossref","unstructured":"Jeffery CJ. Intracellular\/surface moonlighting proteins. Biochem Mol Biol. 2018.","DOI":"10.1096\/fasebj.31.1_supplement.926.10"},{"key":"4194_CR4","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1016\/S0968-0004(98)01335-8","volume":"24","author":"CJ Jeffery","year":"1999","unstructured":"Jeffery CJ. Moonlighting proteins. Trends Biochem Sci. 1999;24:8\u201311.","journal-title":"Trends Biochem Sci"},{"issue":"2","key":"4194_CR5","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1016\/0092-8674(89)90956-2","volume":"57","author":"J Piatigorsky","year":"1989","unstructured":"Piatigorsky J, Wistow GJ. Enzyme\/crystallins: gene sharing as an evolutionary strategy. Cell. 1989;57(2):197\u20139.","journal-title":"Cell"},{"issue":"1","key":"4194_CR6","doi-asserted-by":"publisher","first-page":"D640","DOI":"10.1093\/nar\/gkx1043","volume":"46","author":"C Chen","year":"2018","unstructured":"Chen C, et al. Moonprot 2.0: an expansion and update of the moonlighting proteins database. Nucl Acids Res. 2018;46(1):D640\u20134.","journal-title":"Nucl Acids Res"},{"issue":"D1","key":"4194_CR7","doi-asserted-by":"publisher","first-page":"D645","DOI":"10.1093\/nar\/gkx1066","volume":"46","author":"L Franco-Serrano","year":"2018","unstructured":"Franco-Serrano L, et al. Multitaskprotdb-ii: an update of a database of multitasking\/moonlighting proteins. Nucl Acids Res. 2018;46(D1):D645\u20138.","journal-title":"Nucl Acids Res"},{"issue":"1","key":"4194_CR8","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1093\/nar\/gky1039","volume":"47","author":"DM Ribeiro","year":"2019","unstructured":"Ribeiro DM, et al. Moondb 2.0: an updated database of extreme multifunctional and moonlighting proteins. Nucl Acids Res. 2019;47(1):398\u2013402.","journal-title":"Nucl Acids Res"},{"key":"4194_CR9","volume-title":"Moonlighting proteins: novel virulence factors in bacterial infections, p 53","author":"S Das","year":"2017","unstructured":"Das S, Khan I, Kihara D, Orengo C. Exploring structure function relationships in moonlighting proteins. In: Henderson B, editor. Moonlighting proteins: novel virulence factors in bacterial infections, p 53. London: Wiley; 2017."},{"issue":"1738","key":"4194_CR10","doi-asserted-by":"publisher","first-page":"20160523","DOI":"10.1098\/rstb.2016.0523","volume":"373","author":"CJ Jeffery","year":"2017","unstructured":"Jeffery CJ. Protein moonlighting: What is it, and why is it important? Philos Trans R Soc B Biol Sci. 2017;373(1738):20160523.","journal-title":"Philos Trans R Soc B Biol Sci"},{"key":"4194_CR11","doi-asserted-by":"publisher","first-page":"444","DOI":"10.1007\/s10930-018-9790-x","volume":"37","author":"L Franco-Serrano","year":"2018","unstructured":"Franco-Serrano L, et al. Multifunctional proteins: involvement in human diseases and targets of current drugs. Protein J. 2018;37:444\u201353.","journal-title":"Protein J"},{"issue":"20","key":"4194_CR12","doi-asserted-by":"publisher","first-page":"13442","DOI":"10.1016\/S0021-9258(18)98859-9","volume":"266","author":"H Watanabe","year":"1991","unstructured":"Watanabe H, et al. Purification of human tumor cell autocrine motility factor and molecular cloning of its receptor. J Biol Chem. 1991;266(20):13442\u20138.","journal-title":"J Biol Chem"},{"key":"4194_CR13","doi-asserted-by":"crossref","unstructured":"C. J. Jeffery. Why study moonlighting proteins? Front Genet. 2015; 6(JUN).","DOI":"10.3389\/fgene.2015.00211"},{"issue":"8","key":"4194_CR14","doi-asserted-by":"publisher","first-page":"2379","DOI":"10.1039\/c1mb05180f","volume":"7","author":"A Gomez","year":"2011","unstructured":"Gomez A, et al. Do protein\u2013protein interaction databases identify moonlighting proteins? Mol Biosyst. 2011;7(8):2379\u201382.","journal-title":"Mol Biosyst"},{"key":"4194_CR15","doi-asserted-by":"crossref","unstructured":"Hernandez S et al. Bioinformatics and moonlighting proteins. Front Bioeng Biotechnol. 2015.","DOI":"10.3389\/fbioe.2015.00090"},{"key":"4194_CR16","doi-asserted-by":"crossref","unstructured":"Chapple C, Robisson B, Spinelli L et al. Extreme multifunctional proteins identified from a human protein interaction network. Nat Com 2015; 7412.","DOI":"10.1038\/ncomms8412"},{"key":"4194_CR17","doi-asserted-by":"crossref","unstructured":"Khan et al. Genome-scale identification and characterization of moonlighting proteins. Biol Direct 2014; 30.","DOI":"10.1186\/s13062-014-0030-9"},{"key":"4194_CR18","doi-asserted-by":"crossref","unstructured":"Khan IK, Kihara D. Genome-scale prediction of moonlighting proteins using diverse protein association information. Bioinformatics, March 2016.","DOI":"10.1093\/bioinformatics\/btw166"},{"key":"4194_CR19","unstructured":"Amerifar S, Zahiri J. ftrcool: Feature extraction from biological sequences, 2020."},{"issue":"4","key":"4194_CR20","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1016\/j.ygeno.2013.05.006","volume":"102","author":"J Zahiri","year":"2013","unstructured":"Zahiri J, et al. Ppievo: Protein\u2013protein interaction prediction from pssm based evolutionary information. Genomics. 2013;102(4):237\u201342.","journal-title":"Genomics"},{"issue":"3","key":"4194_CR21","first-page":"273","volume":"20","author":"C Cortes","year":"1995","unstructured":"Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273\u201397.","journal-title":"Mach Learn"},{"issue":"1","key":"4194_CR22","first-page":"21","volume":"13","author":"TM Cover","year":"1967","unstructured":"Cover TM, Hart PE. Nearest neighbor pattern classification. IEEE Trans. 1967;13(1):21\u20137.","journal-title":"IEEE Trans"},{"key":"4194_CR23","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1007\/978-3-642-14571-1_10","volume":"6160","author":"MR Guarracino","year":"2010","unstructured":"Guarracino MR, Nebbia A. Predicting protein\u2013protein interactions with k-nearest neighbors classification algorithm. Lect Notes Comput Sci. 2010;6160:139\u201350.","journal-title":"Lect Notes Comput Sci"},{"issue":"10","key":"4194_CR24","doi-asserted-by":"publisher","first-page":"1773","DOI":"10.1007\/s00726-017-2474-6","volume":"49","author":"SS Hu","year":"2017","unstructured":"Hu SS, Chen P, Wang B, Li J. Protein binding hot spots prediction from sequence only by a new ensemble learning method. Amino Acids. 2017;49(10):1773\u201385.","journal-title":"Amino Acids"},{"key":"4194_CR25","first-page":"2008","volume":"788\u2013798","author":"H Geng","year":"2015","unstructured":"Geng H, Lu T, Lin X, Yan F. Prediction of protein\u2013protein interaction sites based on Naive Bayes classifier. Biochem. 2015;788\u2013798:2008.","journal-title":"Biochem"},{"key":"4194_CR26","doi-asserted-by":"publisher","first-page":"788","DOI":"10.1007\/978-3-540-89197-0_73","volume":"5351","author":"CD Nguyen","year":"2008","unstructured":"Nguyen CD, Gardiner KJ, Nguyen D, Cios KJ. Prediction of protein functions from protein interaction networks: a Naive Bayes approach. Lect Notes Comput Sci. 2008;5351:788\u201398.","journal-title":"Lect Notes Comput Sci"},{"issue":"1","key":"4194_CR27","first-page":"81","volume":"1","author":"JR Quinlan","year":"1986","unstructured":"Quinlan JR. Induction of decision trees. Springer. 1986;1(1):81\u2013106.","journal-title":"Springer"},{"key":"4194_CR28","doi-asserted-by":"publisher","first-page":"22108","DOI":"10.1109\/ACCESS.2018.2807811","volume":"6","author":"A Sikandar","year":"2018","unstructured":"Sikandar A, et al. Decision tree based approaches for detecting protein complex in protein protein interaction network (ppi) via link and sequence analysis. IEEE Access. 2018;6:22108\u201320.","journal-title":"IEEE Access"},{"key":"4194_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.jtbi.2016.04.025","volume":"402","author":"M Akbaripour-Elahabad","year":"2016","unstructured":"Akbaripour-Elahabad M, et al. rpicool: A tool for in silico rna-protein interaction detection using random forest. Theor Biol. 2016;402:1\u20138.","journal-title":"Theor Biol"},{"issue":"9","key":"4194_CR30","doi-asserted-by":"publisher","first-page":"1423","DOI":"10.1109\/5.784219","volume":"87","author":"X Yao","year":"1999","unstructured":"Yao X. Evolving artificial neural networks. Proc IEEE. 1999;87(9):1423\u201347.","journal-title":"Proc IEEE"},{"key":"4194_CR31","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1016\/j.csbj.2019.12.005","volume":"18","author":"X Yang","year":"2020","unstructured":"Yang X, Yang S, Li Q, Zhang Z. Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method. Comput Struct Biotechnol J. 2020;18:153\u201361.","journal-title":"Comput Struct Biotechnol J"},{"key":"4194_CR32","doi-asserted-by":"crossref","unstructured":"Ni Q et al. Using logistic regression method to predict protein function from protein\u2013protein interaction data. IEEE, June 2009.","DOI":"10.1109\/ICBBE.2009.5163737"},{"key":"4194_CR33","first-page":"175","volume-title":"Measuring the performance of a classifier","author":"M Bramer","year":"2013","unstructured":"Bramer M. Measuring the performance of a classifier. London: Springer; 2013. p. 175\u201387."},{"key":"4194_CR34","first-page":"12","volume":"97","author":"K Malik","year":"2014","unstructured":"Malik K, Kalra HS, Gursharanjeet K. Comparative analysis of outlier detection techniques. Int J Comput Appl. 2014;97:12\u201321.","journal-title":"Int J Comput Appl"},{"key":"4194_CR35","doi-asserted-by":"publisher","first-page":"18408","DOI":"10.1074\/jbc.M111.309500","volume":"287","author":"MA Scranton","year":"2012","unstructured":"Scranton MA, Yee A, Park SY. Plant leucine aminopeptidases moonlight as molecular chaperones to alleviate stress-induced damage. J Biol Chem. 2012;287:18408\u201317.","journal-title":"J Biol Chem"}],"updated-by":[{"DOI":"10.1186\/s12859-021-04257-7","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2021,7,9]],"date-time":"2021-07-09T00:00:00Z","timestamp":1625788800000}}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-04194-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-021-04194-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-04194-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,31]],"date-time":"2024-08-31T07:40:04Z","timestamp":1725090004000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-021-04194-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,24]]},"references-count":35,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["4194"],"URL":"https:\/\/doi.org\/10.1186\/s12859-021-04194-5","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,24]]},"assertion":[{"value":"11 December 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 May 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 May 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 July 2021","order":4,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Correction","order":5,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"A Correction to this paper has been published:","order":6,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"https:\/\/doi.org\/10.1186\/s12859-021-04257-7","URL":"https:\/\/doi.org\/10.1186\/s12859-021-04257-7","order":7,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"261"}}