{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T10:14:36Z","timestamp":1776248076624,"version":"3.50.1"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T00:00:00Z","timestamp":1659571200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T00:00:00Z","timestamp":1659571200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Essential Proteins are demonstrated to exert vital functions on cellular processes and are indispensable for the survival and reproduction of the organism. Traditional centrality methods perform poorly on complex protein\u2013protein interaction (PPI) networks. Machine learning approaches based on high-throughput data lack the exploitation of the temporal and spatial dimensions of biological information.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We put forward a deep learning framework to predict essential proteins by integrating features obtained from the PPI network, subcellular localization, and gene expression profiles. In our model, the node2vec method is applied to learn continuous feature representations for proteins in the PPI network, which capture the diversity of connectivity patterns in the network. The concept of depthwise separable convolution is employed on gene expression profiles to extract properties and observe the trends of gene expression over time under different experimental conditions. Subcellular localization information is mapped into a long one-dimensional vector to capture its characteristics. Additionally, we use a sampling method to mitigate the impact of imbalanced learning when training the model. With experiments carried out on the data of Saccharomyces cerevisiae, results show that our model outperforms traditional centrality methods and machine learning methods. Likewise, the comparative experiments have manifested that our process of various biological information is preferable.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Our proposed deep learning framework effectively identifies essential proteins by integrating multiple biological data, proving a broader selection of subcellular localization information significantly improves the results of prediction and depthwise separable convolution implemented on gene expression profiles enhances the performance.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-04868-8","type":"journal-article","created":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T11:06:13Z","timestamp":1659611173000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":27,"title":["A deep learning framework for identifying essential proteins based on multiple biological information"],"prefix":"10.1186","volume":"23","author":[{"given":"Yi","family":"Yue","sequence":"first","affiliation":[]},{"given":"Chen","family":"Ye","sequence":"additional","affiliation":[]},{"given":"Pei-Yun","family":"Peng","sequence":"additional","affiliation":[]},{"given":"Hui-Xin","family":"Zhai","sequence":"additional","affiliation":[]},{"given":"Iftikhar","family":"Ahmad","sequence":"additional","affiliation":[]},{"given":"Chuan","family":"Xia","sequence":"additional","affiliation":[]},{"given":"Yun-Zhi","family":"Wu","sequence":"additional","affiliation":[]},{"given":"You-Hua","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,8,4]]},"reference":[{"key":"4868_CR1","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1038\/35075138","volume":"411","author":"H Jeong","year":"2001","unstructured":"Jeong H, Mason SP, Barab\u00e1si AL, Oltvai ZN. Lethality and centrality in protein networks. Nature. 2001;411:41\u20132.","journal-title":"Nature"},{"key":"4868_CR2","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1186\/1471-2164-7-165","volume":"7","author":"SJ Furney","year":"2006","unstructured":"Furney SJ, Alb\u00e0 MM, L\u00f3pez-Bigas N. Differences in the evolutionary history of disease genes affected by dominant or recessive mutations. BMC Genomics. 2006;7:165.","journal-title":"BMC Genomics"},{"key":"4868_CR3","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1186\/1471-2180-5-8","volume":"5","author":"SA Becker","year":"2005","unstructured":"Becker SA, Palsson B\u00d8. Genome-scale reconstruction of the metabolic network in Staphylococcus aureus N315: an initial draft to the two-dimensional annotation. BMC Microbiol. 2005;5:8.","journal-title":"BMC Microbiol"},{"key":"4868_CR4","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1038\/nature00935","volume":"418","author":"G Giaever","year":"2002","unstructured":"Giaever G, Chu AM, Ni L, Connelly C, Riles L, V\u00e9ronneau S, et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature. 2002;418:387\u201391.","journal-title":"Nature"},{"key":"4868_CR5","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1111\/j.1440-1711.2005.01332.x","volume":"83","author":"LM Cullen","year":"2005","unstructured":"Cullen LM, Arndt GM. Genome-wide screening for gene function using RNAi in mammalian cells. Immunol Cell Biol. 2005;83:217\u201323.","journal-title":"Immunol Cell Biol"},{"key":"4868_CR6","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1046\/j.1365-2958.2003.03697.x","volume":"50","author":"T Roemer","year":"2003","unstructured":"Roemer T, Jiang B, Davison J, Ketela T, Veillette K, Breton A, et al. Large-scale essential gene identification in Candida albicans and applications to antifungal drug discovery. Mol Microbiol. 2003;50:167\u201381.","journal-title":"Mol Microbiol"},{"key":"4868_CR7","doi-asserted-by":"publisher","first-page":"4569","DOI":"10.1073\/pnas.061034498","volume":"98","author":"I Takashi","year":"2001","unstructured":"Takashi I, Tomoko C, Ritsuko O, Mikio Y, Masahira H, Yoshiyuki S. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci. 2001;98:4569\u201374.","journal-title":"Proc Natl Acad Sci"},{"key":"4868_CR8","doi-asserted-by":"publisher","first-page":"218","DOI":"10.1006\/meth.2001.1183","volume":"24","author":"O Puig","year":"2001","unstructured":"Puig O, Caspary F, Rigaut G, Rutz B, Bouveret E, Bragado-Nilsson E, et al. The tandem affinity purification (TAP) method: a general procedure of protein complex purification. Methods. 2001;24:218\u201329.","journal-title":"Methods"},{"key":"4868_CR9","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1038\/nbt0198-40","volume":"16","author":"G Ramsay","year":"1998","unstructured":"Ramsay G. DNA chips: state-of-the art. Nat Biotechnol. 1998;16:40\u20134.","journal-title":"Nat Biotechnol"},{"key":"4868_CR10","doi-asserted-by":"publisher","first-page":"566","DOI":"10.1093\/bib\/bbz017","volume":"21","author":"X Li","year":"2020","unstructured":"Li X, Li W, Zeng M, Zheng R, Li M. Network-based methods for predicting essential genes or proteins: a survey. Brief Bioinform. 2020;21:566\u201383.","journal-title":"Brief Bioinform"},{"key":"4868_CR11","doi-asserted-by":"publisher","first-page":"440","DOI":"10.1038\/30918","volume":"393","author":"DJ Watts","year":"1998","unstructured":"Watts DJ, Strogatz SH. Collective dynamics of \u2018small-world\u2019 networks. Nature. 1998;393:440\u20132.","journal-title":"Nature"},{"key":"4868_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1752-0509-6-S2-S1","volume":"6","author":"M Li","year":"2012","unstructured":"Li M, Zhang H, Wang JX, Pan Y. A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data. BMC Syst Biol. 2012;6:1\u20139.","journal-title":"BMC Syst Biol"},{"key":"4868_CR13","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1109\/TCBB.2013.2295318","volume":"11","author":"X Tang","year":"2014","unstructured":"Tang X, Wang J, Zhong J, Pan Y. Predicting essential proteins basedon weighted degree centrality. IEEE\/ACM Trans Comput Biol Bioinforma. 2014;11:407\u201318.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"4868_CR14","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1186\/s12859-021-04175-8","volume":"22","author":"J Zhong","year":"2021","unstructured":"Zhong J, Tang C, Peng W, Xie M, Sun Y, Tang Q, et al. A novel essential protein identification method based on PPI networks and gene expression data. BMC Bioinform. 2021;22:248.","journal-title":"BMC Bioinform"},{"key":"4868_CR15","doi-asserted-by":"publisher","first-page":"430","DOI":"10.1186\/s12859-021-04300-7","volume":"22","author":"S Li","year":"2021","unstructured":"Li S, Zhang Z, Li X, Tan Y, Wang L, Chen Z. An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information. BMC Bioinform. 2021;22:430.","journal-title":"BMC Bioinform"},{"key":"4868_CR16","doi-asserted-by":"publisher","first-page":"645932","DOI":"10.3389\/fgene.2021.645932","volume":"12","author":"Z Meng","year":"2021","unstructured":"Meng Z, Kuang L, Chen Z, Zhang Z, Tan Y, Li X, et al. Method for essential protein prediction based on a novel weighted protein-domain interaction network. Front Genet. 2021;12:645932.","journal-title":"Front Genet"},{"key":"4868_CR17","doi-asserted-by":"publisher","first-page":"491","DOI":"10.1109\/TST.2015.7297748","volume":"20","author":"J Zhong","year":"2015","unstructured":"Zhong J, Wang J, Peng W, Zhang Z, Li M. A feature selection method for prediction essential protein. Tsinghua Sci Technol. 2015;20:491\u20139.","journal-title":"Tsinghua Sci Technol"},{"key":"4868_CR18","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1109\/TNB.2018.2842219","volume":"17","author":"J Zhong","year":"2018","unstructured":"Zhong J, Sun Y, Peng W, Xie M, Yang J, Tang X. XGBFEMF: an XGBoost-based framework for essential protein prediction. IEEE Trans Nanobiosci. 2018;17:243\u201350.","journal-title":"IEEE Trans Nanobiosci"},{"key":"4868_CR19","doi-asserted-by":"crossref","unstructured":"Grover A, Leskovec J. Node2vec: scalable feature learning for networks. Proc ACM SIGKDD Int Conf Knowl Discov Data Min. 2016;13\u201317-Augu:855\u201364.","DOI":"10.1145\/2939672.2939754"},{"issue":"Suppl 16","key":"4868_CR20","first-page":"1","volume":"20","author":"M Zeng","year":"2019","unstructured":"Zeng M, Li M, Wu FX, Li Y, Pan Y. DeepEP: a deep learning framework for identifying essential proteins. BMC Bioinform. 2019;20(Suppl 16):1\u201310.","journal-title":"BMC Bioinform"},{"key":"4868_CR21","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1109\/TCBB.2020.2983958","volume":"18","author":"M Zeng","year":"2021","unstructured":"Zeng M, Li M, Fei Z, Wu F-X, Li Y, Pan Y, et al. A deep learning framework for identifying essential proteins by integrating multiple types of biological information. IEEE\/ACM Trans Comput Biol Bioinform. 2021;18:296\u2013305.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"4868_CR22","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1008229","volume":"16","author":"X Zhang","year":"2020","unstructured":"Zhang X, Xiao W, Xiao W. DeepHE: accurately predicting human essential genes based on deep learning. PLoS Comput Biol. 2020;16: e1008229.","journal-title":"PLoS Comput Biol"},{"key":"4868_CR23","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0074939","volume":"8","author":"W Sha","year":"2013","unstructured":"Sha W, Martins AM, Laubenbacher R, Mendes P, Shulaev V. The genome-wide early temporal response of saccharomyces cerevisiae to oxidative stress induced by cumene hydroperoxide. PLoS ONE. 2013;8: e74939.","journal-title":"PLoS ONE"},{"key":"4868_CR24","first-page":"1","volume":"21","author":"CJ Kowalski","year":"1972","unstructured":"Kowalski CJ. On the effects of non-normality on the distribution of the sample product-moment correlation coefficient. J R Stat Soc Ser C Appl Stat. 1972;21:1\u201312.","journal-title":"J R Stat Soc Ser C Appl Stat"},{"key":"4868_CR25","unstructured":"SIfre L, Mallat S. Rigid-Motion Scattering for Texture Classification. 2014."},{"key":"4868_CR26","unstructured":"Ioffe S, Learning CSBT-P of the 32nd IC on M. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. 37:448\u201356."},{"key":"4868_CR27","doi-asserted-by":"publisher","first-page":"657","DOI":"10.1016\/j.neunet.2021.06.030","volume":"143","author":"Y Guo","year":"2021","unstructured":"Guo Y, Chen Y, Tan M, Jia K, Chen J, Wang J. Content-aware convolutional neural networks. Neural Netw. 2021;143:657\u201368.","journal-title":"Neural Netw"},{"key":"4868_CR28","doi-asserted-by":"publisher","first-page":"bau012","DOI":"10.1093\/database\/bau012","volume":"2014","author":"JX Binder","year":"2014","unstructured":"Binder JX, Pletscher-Frankild S, Tsafou K, Stolte C, O\u2019Donoghue SI, Schneider R, et al. COMPARTMENTS: unification and visualization of protein subcellular localization evidence. Database. 2014;2014:bau012.","journal-title":"Database"},{"key":"4868_CR29","doi-asserted-by":"publisher","first-page":"535","DOI":"10.1093\/nar\/gkj109","volume":"34","author":"C Stark","year":"2006","unstructured":"Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34:535\u20139.","journal-title":"Nucleic Acids Res"},{"key":"4868_CR30","doi-asserted-by":"publisher","first-page":"574","DOI":"10.1093\/nar\/gkt1131","volume":"42","author":"H Luo","year":"2014","unstructured":"Luo H, Lin Y, Gao F, Zhang C-T, Zhang R. DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements. Nucleic Acids Res. 2014;42:574\u201380.","journal-title":"Nucleic Acids Res"},{"key":"4868_CR31","doi-asserted-by":"publisher","first-page":"D998","DOI":"10.1093\/nar\/gkaa884","volume":"49","author":"S Gurumayum","year":"2021","unstructured":"Gurumayum S, Jiang P, Hao X, Campos TL, Young ND, Korhonen PK, et al. OGEE v3: online GEne Essentiality database with increased coverage of organisms and human cell lines. Nucleic Acids Res. 2021;49:D998-1003.","journal-title":"Nucleic Acids Res"},{"key":"4868_CR32","unstructured":"Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, et al. TensorFlow: large-scale machine learning on heterogeneous distributed systems. 2016."},{"key":"4868_CR33","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735\u201380.","journal-title":"Neural Comput"},{"key":"4868_CR34","doi-asserted-by":"crossref","unstructured":"Cho K, Van Merri\u00ebnboer B, Bahdanau D, Bengio Y. On the properties of neural machine translation: encoder\u2013decoder approaches. https:\/\/arxiv.org\/abs\/14091259.2014.","DOI":"10.3115\/v1\/W14-4012"},{"key":"4868_CR35","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1155\/JBB.2005.96","volume":"2005","author":"MP Joy","year":"2005","unstructured":"Joy MP, Brock A, Ingber DE, Huang S. High-betweenness proteins in the yeast protein interaction network. J Biomed Biotechnol. 2005;2005:96\u2013103.","journal-title":"J Biomed Biotechnol"},{"key":"4868_CR36","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1016\/S0022-5193(03)00071-7","volume":"223","author":"S Wuchty","year":"2003","unstructured":"Wuchty S, Stadler PF. Centers of complex networks. J Theor Biol. 2003;223:45\u201353.","journal-title":"J Theor Biol"},{"key":"4868_CR37","doi-asserted-by":"publisher","first-page":"1170","DOI":"10.1086\/228631","volume":"92","author":"P Bonacich","year":"1987","unstructured":"Bonacich P. Power and centrality: a family of measures. Am J Sociol. 1987;92:1170\u201382.","journal-title":"Am J Sociol"},{"key":"4868_CR38","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1103\/PhysRevE.71.056103","volume":"71","author":"E Estrada","year":"2005","unstructured":"Estrada E, Rodr\u00edguez-Vel\u00e1zquez JA. Subgraph centrality in complex networks. Phys Rev E Stat Nonlinear Soft Matter Phys. 2005;71:1\u20139.","journal-title":"Phys Rev E Stat Nonlinear Soft Matter Phys"},{"key":"4868_CR39","first-page":"87","volume":"6674","author":"H Wang","year":"2011","unstructured":"Wang H, Li M, Wang J, Pan Y. A new method for identifying essential proteins based on edge clustering coefficient. Lect Not Comput Sci Include Subser Lect Notes Artif Intell Lect Not Bioinform. 2011;6674:87\u201398.","journal-title":"Lect Not Comput Sci Include Subser Lect Notes Artif Intell Lect Not Bioinform."},{"issue":"suppl_2","key":"4868_CR40","doi-asserted-by":"publisher","first-page":"W438","DOI":"10.1093\/nar\/gkn257","volume":"36","author":"C-Y Lin","year":"2008","unstructured":"Lin C-Y, Chin C-H, Wu H-H, Chen S-H, Ho C-W, Ko M-T. Hubba: hub objects analyzer\u2014a framework of interactome hubs identification for network biology. Nucleic Acids Res. 2008;36(suppl_2):W438\u201343.","journal-title":"Nucleic Acids Res"},{"key":"4868_CR41","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1016\/j.compbiolchem.2011.04.002","volume":"35","author":"M Li","year":"2011","unstructured":"Li M, Wang J, Chen X, Wang H, Pan Y. A local average connectivity-based method for identifying essential proteins from the network level. Comput Biol Chem. 2011;35:143\u201350.","journal-title":"Comput Biol Chem"},{"key":"4868_CR42","doi-asserted-by":"publisher","first-page":"1170","DOI":"10.1109\/TCBB.2015.2509989","volume":"13","author":"Y Qi","year":"2016","unstructured":"Qi Y, Luo J. Prediction of essential proteins based on local interaction density. IEEE\/ACM Trans Comput Biol Bioinforma. 2016;13:1170\u201382.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinforma"},{"key":"4868_CR43","doi-asserted-by":"publisher","first-page":"5686","DOI":"10.1158\/1078-0432.CCR-13-0405","volume":"19","author":"X Yin","year":"2013","unstructured":"Yin X, Luistro L, Zhong H, Smith M, Nevins T, Schostack K, et al. RG7212 anti-TWEAK mAb inhibits tumor growth through inhibition of tumor cell proliferation and survival signaling and by enhancing the host antitumor immune response. Clin Cancer Res Off J Am Assoc Cancer Res. 2013;19:5686\u201398.","journal-title":"Clin Cancer Res Off J Am Assoc Cancer Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04868-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04868-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04868-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T11:08:29Z","timestamp":1659611309000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04868-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,4]]},"references-count":43,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4868"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04868-8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,4]]},"assertion":[{"value":"7 June 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 July 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 August 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no conflict of interest.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"318"}}