{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T07:11:10Z","timestamp":1773645070437,"version":"3.50.1"},"reference-count":37,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T00:00:00Z","timestamp":1773619200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>\n                    Identification of essential proteins is fundamental for understanding cellular processes and disease mechanisms. However, many existing computational methods do not adequately model dynamic expression activity and often underutilize global network context, which limits prediction accuracy. To address these issues, we propose a Correlation-guided Subgraph Graph Neural Network (CSGNN) for essential protein identification by integrating correlation-guided graph construction with attention-based representation learning. First, we derive an activity-aware expression matrix from periodic gene expression patterns, and we construct a weighted protein network by computing Pearson correlation coefficients between gene pairs. This correlation-guided network further defines first-order and second-order neighborhoods, which provide multi-scale subgraph contexts for each protein. Next, we employ a two-layer attention-based graph convolution to learn node embeddings by aggregating information within these correlation-defined neighborhoods. Finally, we form an interaction-aware node representation by integrating each protein embedding with its neighborhood context, and we use a lightweight multilayer perceptron to output an essentiality probability for each protein. Proteins are then ranked by the predicted scores to identify essential candidates. Experiments on yeast and\n                    <jats:italic>E. coli<\/jats:italic>\n                    datasets demonstrate that CSGNN consistently outperforms traditional baselines, indicating improved accuracy and robustness for essential protein identification.\n                  <\/jats:p>","DOI":"10.3389\/fbinf.2026.1731178","type":"journal-article","created":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T06:39:36Z","timestamp":1773643176000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["A CSGNN model-based method for essential protein identification"],"prefix":"10.3389","volume":"6","author":[{"given":"Zixuan","family":"Li","sequence":"first","affiliation":[{"name":"School of Informatics, Hunan University of Chinese Medicine","place":["Changsha, Hunan, China"]}]},{"given":"Zhiguo","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Informatics, Hunan University of Chinese Medicine","place":["Changsha, Hunan, China"]}]},{"given":"Peng","family":"Li","sequence":"additional","affiliation":[{"name":"School of Informatics, Hunan University of Chinese Medicine","place":["Changsha, Hunan, China"]}]}],"member":"1965","published-online":{"date-parts":[[2026,3,16]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.ymeth.2024.04.004","article-title":"DEEP-EP: identification of epigenetic protein by ensemble residual convolutional neural network for drug discovery","volume":"226","author":"Ali","year":"2024","journal-title":"Methods"},{"key":"B2","doi-asserted-by":"publisher","first-page":"899","DOI":"10.1056\/NEJMoa2404143","article-title":"Plozasiran, an RNA interference agent targeting APOC3, for mixed hyperlipidemia","volume":"391","author":"Ballantyne","year":"2024","journal-title":"N. Engl. J. Med."},{"key":"B3","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1016\/j.tig.2020.09.018","article-title":"Gene regulation and cellular metabolism: an essential partnership","volume":"37","author":"Carthew","year":"2021","journal-title":"Trends Genet."},{"key":"B4","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1007\/978-1-4939-3578-9_5","volume-title":"The gene expression omnibus database statistical genomics: Methods and protocols","author":"Clough","year":"2016"},{"key":"B5","doi-asserted-by":"publisher","first-page":"1936","DOI":"10.1038\/s41467-021-21953-3","article-title":"The SARS-CoV-2 nucleocapsid protein is dynamic, disordered, and phase separates with RNA","volume":"12","author":"Cubuk","year":"2021","journal-title":"Nat. Communications"},{"key":"B6","doi-asserted-by":"publisher","first-page":"428","DOI":"10.1016\/j.tifs.2021.12.020","article-title":"Food proteins from animals and plants: differences in the nutritional and functional properties","volume":"119","author":"Day","year":"2022","journal-title":"Trends Food Sci. & Technol."},{"key":"B8","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1016\/j.joca.2021.02.008","article-title":"Circadian time series proteomics reveals daily dynamics in cartilage physiology","volume":"29","author":"Dudek","year":"2021","journal-title":"Osteoarthr. Cartilage"},{"key":"B9","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1093\/nar\/30.1.69","article-title":"Saccharomyces genome database (SGD) provides secondary gene annotation using the gene ontology (GO)","volume":"30","author":"Dwight","year":"2002","journal-title":"Nucleic Acids Research"},{"key":"B10","doi-asserted-by":"publisher","first-page":"420","DOI":"10.1111\/tpj.14534","article-title":"Single organelle function and organization as estimated from arabidopsis mitochondrial proteomics","volume":"101","author":"Fuchs","year":"2020","journal-title":"Plant J."},{"key":"B11","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1534\/genetics.114.161620","article-title":"The yeast deletion collection: a decade of functional genomics","volume":"197","author":"Giaever","year":"2014","journal-title":"Genetics"},{"key":"B12","doi-asserted-by":"publisher","first-page":"1711","DOI":"10.1038\/s44319-024-00111-y","article-title":"The TAM, a translocation and assembly module for protein assembly and potential conduit for phospholipid transfer","volume":"25","author":"Goh","year":"2024","journal-title":"EMBO Reports"},{"key":"B13","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1146\/annurev-ecolsys-012220-120819","article-title":"The structure of ecological networks across levels of organization","volume":"51","author":"Guimaraes","year":"2020","journal-title":"Annu. Rev. Ecol. Evol. Syst."},{"key":"B14","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1093\/molbev\/msi072","article-title":"Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks","volume":"22","author":"Hahn","year":"2005","journal-title":"Mol. Biology Evolution"},{"key":"B15","doi-asserted-by":"publisher","first-page":"2122","DOI":"10.1109\/TCBB.2022.3233473","article-title":"Identification of essential protein using chemical reaction optimization and machine learning technique","volume":"20","author":"Inzamam-Ul-Hossain","year":"2023","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinforma."},{"key":"B16","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1073\/pnas.93.1.13","article-title":"Principles of protein-protein interactions","volume":"93","author":"Jones","year":"1996","journal-title":"Proc. Natl. Acad. Sci."},{"key":"B17","doi-asserted-by":"publisher","first-page":"1386","DOI":"10.1109\/TCBB.2017.2665482","article-title":"Construction of refined protein interaction network for predicting essential proteins","volume":"16","author":"Li","year":"2017","journal-title":"IEEE\/ACM Transactions Computational Biology Bioinformatics"},{"key":"B18","doi-asserted-by":"publisher","first-page":"115686","DOI":"10.1016\/j.lwt.2023.115686","article-title":"Characterization of thyme essential oil composite film based on soy protein isolate and its application in the preservation of cherry tomatoes","volume":"191","author":"Liu","year":"2024","journal-title":"Lwt"},{"key":"B19","doi-asserted-by":"publisher","first-page":"130","DOI":"10.1016\/j.tcb.2020.11.008","article-title":"The ways of actin: why tunneling nanotubes are unique cell protrusions","volume":"31","author":"Ljubojevic","year":"2021","journal-title":"Trends Cell Biol."},{"key":"B20","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1038\/s41586-020-2134-y","article-title":"A conserved dendritic-cell regulatory program limits antitumour immunity","volume":"580","author":"Maier","year":"2020","journal-title":"Nature"},{"key":"B21","doi-asserted-by":"publisher","first-page":"546","DOI":"10.1038\/s42255-023-00783-1","article-title":"Multifaceted mitochondria: moving mitochondrial science beyond function and dysfunction","volume":"5","author":"Monzel","year":"2023","journal-title":"Nat. Metabolism"},{"key":"B22","doi-asserted-by":"publisher","first-page":"255","DOI":"10.1042\/EBC20200108","article-title":"Uncovering protein function: from classification to complexes","volume":"66","author":"Morris","year":"2022","journal-title":"Essays Biochem."},{"key":"B23","doi-asserted-by":"publisher","first-page":"1076","DOI":"10.1016\/j.cell.2024.01.042","article-title":"Past, present, and future of CRISPR genome editing technologies","volume":"187","author":"Pacesa","year":"2024","journal-title":"Cell"},{"key":"B24","doi-asserted-by":"publisher","first-page":"832","DOI":"10.1093\/bioinformatics\/bti115","article-title":"The MIPS Mammalian protein\u2013protein interaction database","volume":"21","author":"Pagel","year":"2005","journal-title":"Bioinformatics"},{"key":"B25","doi-asserted-by":"publisher","first-page":"6452","DOI":"10.1074\/jbc.R116.771915","article-title":"Essential protein s and pathways that regulate lifespan","volume":"292","author":"Pan","year":"2017","journal-title":"J. Biol. Chem."},{"key":"B7","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1016\/j.eswa.2017.12.020","article-title":"The use of machine learning algorithms in recommender systems: a systematic review","volume":"97","author":"Portugal","year":"2018","journal-title":"Expert Systems with Applications"},{"key":"B26","doi-asserted-by":"publisher","first-page":"e17010","DOI":"10.7717\/peerj.17010","article-title":"Epi-sf: essential protein identification in protein interaction networks using sequence features","volume":"12","author":"Saha","year":"2024","journal-title":"PeerJ"},{"key":"B27","doi-asserted-by":"publisher","first-page":"347","DOI":"10.1007\/s00726-007-0555-7","article-title":"SVCT1 and SVCT2: essential protein s for vitamin C uptake","volume":"34","author":"Savini","year":"2008","journal-title":"Amino Acids"},{"key":"B28","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1038\/s41592-024-02174-0","article-title":"CombFold: predicting structures of large protein assemblies using a combinatorial assembly algorithm and AlphaFold2","volume":"21","author":"Shor","year":"2024","journal-title":"Nat. Methods"},{"key":"B29","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1109\/TCBB.2013.2295318","article-title":"Predicting essential proteins based on weighted degree centrality","volume":"11","author":"Tang","year":"2013","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinforma."},{"key":"B30","doi-asserted-by":"publisher","first-page":"115949","DOI":"10.1016\/j.ab.2025.115949","article-title":"HCNS: a deep learning model for identifying essential proteins based on hypergraph convolution and sequence features","volume":"707","author":"Tian","year":"2025","journal-title":"Anal. Biochem."},{"key":"B31","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1002\/med.2610120403","article-title":"Protein targets for structure-based drug design","volume":"12","author":"Walkinshaw","year":"1992","journal-title":"Med. Research Reviews"},{"key":"B32","doi-asserted-by":"publisher","first-page":"a041472","DOI":"10.1101\/cshperspect.a041472","article-title":"Protein design using structure-prediction networks: alphafold and RoseTTAFold as protein structure foundation models","volume":"16","author":"Wang","year":"2024","journal-title":"Cold Spring Harb. Perspect. Biol."},{"key":"B33","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1093\/nar\/28.1.289","article-title":"DIP: the database of interacting proteins","volume":"28","author":"Xenarios","year":"2000","journal-title":"Nucleic Acids Research"},{"key":"B34","doi-asserted-by":"publisher","first-page":"058902","DOI":"10.1088\/1674-1056\/acb9f9","article-title":"AG-GATCN: a novel method for predicting essential proteins","volume":"32","author":"Yang","year":"2023","journal-title":"Chin. Phys. B"},{"key":"B35","first-page":"98","article-title":"Improving human essential protein prediction using only protein sequences via ensemble learning","author":"Zeng","year":"2021"},{"key":"B36","doi-asserted-by":"publisher","first-page":"D455","DOI":"10.1093\/nar\/gkn858","article-title":"DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes","volume":"37","author":"Zhang","year":"2009","journal-title":"Nucleic Acids Research"},{"key":"B37","doi-asserted-by":"publisher","first-page":"933","DOI":"10.1186\/s12870-025-06878-z","article-title":"MultiRepPI: a cross-modal feature fusion-based multiple characterization framework for plant peptide-protein interaction prediction","volume":"25","author":"Zhiguo","year":"2025","journal-title":"BMC Plant Biol."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2026.1731178\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T06:39:36Z","timestamp":1773643176000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2026.1731178\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,16]]},"references-count":37,"alternative-id":["10.3389\/fbinf.2026.1731178"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2026.1731178","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,16]]},"article-number":"1731178"}}