{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T01:05:38Z","timestamp":1775178338922,"version":"3.50.1"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2019,6,20]],"date-time":"2019-06-20T00:00:00Z","timestamp":1560988800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"PRIN 2017","award":["2017483NH8"],"award-info":[{"award-number":["2017483NH8"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>The correct localization of proteins in cell compartments is a key issue for their function. Particularly, mitochondrial proteins are physiologically active in different compartments and their aberrant localization contributes to the pathogenesis of human mitochondrial pathologies. Many computational methods exist to assign protein sequences to subcellular compartments such as nucleus, cytoplasm and organelles. However, a substantial lack of experimental evidence in public sequence databases hampered so far a finer grain discrimination, including also intra-organelle compartments.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We describe DeepMito, a novel method for predicting protein sub-mitochondrial cellular localization. Taking advantage of powerful deep-learning approaches, such as convolutional neural networks, our method is able to achieve very high prediction performances when discriminating among four different mitochondrial compartments (matrix, outer, inner and intermembrane regions). The method is trained and tested in cross-validation on a newly generated, high-quality dataset comprising 424 mitochondrial proteins with experimental evidence for sub-organelle localizations. We benchmark DeepMito towards the only one recent approach developed for the same task. Results indicate that DeepMito performances are superior. Finally, genomic-scale prediction on a highly-curated dataset of human mitochondrial proteins further confirms the effectiveness of our approach and suggests that DeepMito is a good candidate for genome-scale annotation of mitochondrial protein subcellular localization.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The DeepMito web server as well as all datasets used in this study are available at http:\/\/busca.biocomp.unibo.it\/deepmito. A standalone version of DeepMito is available on DockerHub at https:\/\/hub.docker.com\/r\/bolognabiocomp\/deepmito. DeepMito source code is available on GitHub at https:\/\/github.com\/BolognaBiocomp\/deepmito<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz512","type":"journal-article","created":{"date-parts":[[2019,6,17]],"date-time":"2019-06-17T19:11:28Z","timestamp":1560798688000},"page":"56-64","source":"Crossref","is-referenced-by-count":100,"title":["DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7359-0633","authenticated-orcid":false,"given":"Castrense","family":"Savojardo","sequence":"first","affiliation":[{"name":"Biocomputing Group, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna , Bologna, Italy"}]},{"given":"Niccol\u00f2","family":"Bruciaferri","sequence":"additional","affiliation":[{"name":"Biocomputing Group, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna , Bologna, Italy"}]},{"given":"Giacomo","family":"Tartari","sequence":"additional","affiliation":[{"name":"Biocomputing Group, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna , Bologna, Italy"},{"name":"Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), Italian National Research Council (CNR) , Bari, Italy"}]},{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[{"name":"Biocomputing Group, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna , Bologna, Italy"}]},{"given":"Rita","family":"Casadio","sequence":"additional","affiliation":[{"name":"Biocomputing Group, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna , Bologna, Italy"},{"name":"Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), Italian National Research Council (CNR) , Bari, Italy"}]}],"member":"286","published-online":{"date-parts":[[2019,6,20]]},"reference":[{"key":"2023013109510102500_btz512-B2","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","article-title":"Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023013109510102500_btz512-B3","doi-asserted-by":"crossref","first-page":"3387","DOI":"10.1093\/bioinformatics\/btx431","article-title":"DeepLoc: prediction of protein subcellular localization using deep learning","volume":"33","author":"Almagro Armenteros","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B4","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023013109510102500_btz512-B5","doi-asserted-by":"crossref","first-page":"878.","DOI":"10.15252\/msb.20156651","article-title":"Deep learning for computational biology","volume":"12","author":"Angermueller","year":"2016","journal-title":"Mol. Syst. Biol"},{"key":"2023013109510102500_btz512-B6","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1093\/bioinformatics\/16.5.412","article-title":"Assessing the accuracy of prediction algorithms for classification: an overview","volume":"16","author":"Baldi","year":"2000","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B7","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1093\/bioinformatics\/18.2.298","article-title":"Extensive feature detection of N-terminal protein sorting signals","volume":"18","author":"Bannai","year":"2002","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B9","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1186\/1471-2105-7-518","article-title":"Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence","volume":"7","author":"Du","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023013109510102500_btz512-B10","doi-asserted-by":"crossref","first-page":"263829.","DOI":"10.1155\/2013\/263829","article-title":"SubMito-PSPCP: predicting protein submitochondrial locations by hybridizing positional specific physicochemical properties with pseudoamino acid compositions","volume":"2013","author":"Du","year":"2013","journal-title":"Biomed. Res. Int"},{"key":"2023013109510102500_btz512-B11","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1016\/j.bbamcr.2012.05.028","article-title":"Mitochondrial protein import: common principles and physiological networks","volume":"1833","author":"Dudek","year":"2013","journal-title":"Biochim. Biophys. Acta"},{"key":"2023013109510102500_btz512-B12","doi-asserted-by":"crossref","first-page":"953","DOI":"10.1038\/nprot.2007.131","article-title":"Locating proteins in the cell using TargetP, SignalP and related tools","volume":"2","author":"Emanuelsson","year":"2007","journal-title":"Nat. Protoc"},{"key":"2023013109510102500_btz512-B13","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1007\/s00726-011-1143-4","article-title":"Predicting protein submitochondria locations by combining different descriptors into the general form of Chou's pseudo amino acid composition","volume":"43","author":"Fan","year":"2012","journal-title":"Amino Acids"},{"key":"2023013109510102500_btz512-B14","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1074\/mcp.M114.043083","article-title":"MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites","volume":"14","author":"Fukasawa","year":"2015","journal-title":"Mol. Cell Proteomics"},{"key":"2023013109510102500_btz512-B15","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/BF01025492","article-title":"Statistical analysis of the physical properties of the 20 naturally occurring amino acids","volume":"4","author":"Kidera","year":"1985","journal-title":"J. Prot. Chem"},{"key":"2023013109510102500_btz512-B16","doi-asserted-by":"crossref","first-page":"e98345.","DOI":"10.1371\/journal.pone.0098345","article-title":"Protein sub-nuclear localization prediction using SVM and PFAM domain information","volume":"9","author":"Kumar","year":"2014","journal-title":"PLoS One"},{"key":"2023013109510102500_btz512-B17","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.mito.2017.10.004","article-title":"Proteome-wide prediction and annotation of mitochondrial and sub-mitochondrial proteins by incorporating domain information","volume":"42","author":"Kumar","year":"2018","journal-title":"Mitochondrion"},{"key":"2023013109510102500_btz512-B18","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"2023013109510102500_btz512-B19","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B20","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1007\/s10441-013-9181-9","article-title":"Using over-represented tetrapeptides to predict protein submitochondria locations","volume":"61","author":"Lin","year":"2013","journal-title":"Acta Biotheor"},{"key":"2023013109510102500_btz512-B21","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1007\/978-1-4939-2257-4_27","article-title":"Computer-based prediction of mitochondria-targeting peptides","volume":"1264","author":"Martelli","year":"2015","journal-title":"Methods Mol. Biol"},{"key":"2023013109510102500_btz512-B22","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1016\/j.jtbi.2011.10.015","article-title":"Multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization","volume":"293","author":"Mei","year":"2012","journal-title":"J. Theor. Biol"},{"key":"2023013109510102500_btz512-B23","doi-asserted-by":"crossref","first-page":"653","DOI":"10.1007\/s00726-007-0018-1","article-title":"Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization","volume":"34","author":"Nanni","year":"2008","journal-title":"Amino Acids"},{"key":"2023013109510102500_btz512-B24","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/S1672-0229(06)60016-8","article-title":"PredSL: a tool for the N-terminal sequence-based prediction of protein subcellular localization","volume":"4","author":"Petsalaki","year":"2006","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2023013109510102500_btz512-B25","doi-asserted-by":"crossref","first-page":"e408","DOI":"10.1093\/bioinformatics\/btl222","article-title":"BaCelLo: a balanced subcellular localization predictor","volume":"22","author":"Pierleoni","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B26","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1007\/s00441-016-2492-9","article-title":"The versatility of the mitochondrial presequence processing machinery: cleavage, quality control and turnover","volume":"367","author":"Poveda-Huertes","year":"2017","journal-title":"Cell Tissue Res"},{"key":"2023013109510102500_btz512-B27","doi-asserted-by":"crossref","first-page":"2973","DOI":"10.1093\/bioinformatics\/btu411","article-title":"TPpred2: improving the prediction of mitochondrial targeting peptide cleavage sites by exploiting sequence motifs","volume":"30","author":"Savojardo","year":"2014","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B28","doi-asserted-by":"crossref","first-page":"3269","DOI":"10.1093\/bioinformatics\/btv367","article-title":"TPpred3 detects and discriminates mitochondrial and chloroplastic targeting peptides in eukaryotic proteins","volume":"31","author":"Savojardo","year":"2015","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B29","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1093\/bioinformatics\/btw656","article-title":"SChloro: directing Viridiplantae proteins to six chloroplastic sub-compartments","volume":"33","author":"Savojardo","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B30","doi-asserted-by":"crossref","first-page":"1690","DOI":"10.1093\/bioinformatics\/btx818","article-title":"DeepSig: deep learning improves signal peptide detection in proteins","volume":"34","author":"Savojardo","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B31","doi-asserted-by":"crossref","first-page":"424","DOI":"10.1016\/j.bbamcr.2011.01.011","article-title":"Identify submitochondria and subchloroplast locations with pseudo amino acid composition: approach from the strategy of discrete wavelet transform feature extraction","volume":"1813","author":"Shi","year":"2011","journal-title":"Biochim. Biophys. Acta"},{"key":"2023013109510102500_btz512-B32","doi-asserted-by":"crossref","first-page":"1581","DOI":"10.1002\/pmic.200300776","article-title":"Predotar: a tool for rapidly screening proteomes for N-terminal targeting sequences","volume":"4","author":"Small","year":"2004","journal-title":"Proteomics"},{"key":"2023013109510102500_btz512-B33","doi-asserted-by":"crossref","first-page":"eaal3321.","DOI":"10.1126\/science.aal3321","article-title":"A subcellular map of the human proteome","volume":"356","author":"Thul","year":"2017","journal-title":"Science"},{"key":"2023013109510102500_btz512-B34","doi-asserted-by":"crossref","first-page":"2639","DOI":"10.1093\/bioinformatics\/btv212","article-title":"MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou\u2019s pseudo amino acid composition and a novel multi-label classifier","volume":"31","author":"Wang","year":"2015","journal-title":"Bioinformatics"},{"key":"2023013109510102500_btz512-B35","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1016\/j.jtbi.2009.03.028","article-title":"Using the augmented Chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach","volume":"259","author":"Zeng","year":"2009","journal-title":"J. Theor. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz512\/28928422\/btz512.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/1\/56\/48981747\/bioinformatics_36_1_56.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/1\/56\/48981747\/bioinformatics_36_1_56.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,17]],"date-time":"2023-09-17T14:24:16Z","timestamp":1694960656000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/1\/56\/5520945"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,6,20]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,1,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz512","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,1,1]]},"published":{"date-parts":[[2019,6,20]]}}}