{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T17:41:47Z","timestamp":1775324507061,"version":"3.50.1"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T00:00:00Z","timestamp":1740009600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62302075"],"award-info":[{"award-number":["62302075"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>S-sulfhydration, a crucial post-translational protein modification, is pivotal in cellular recognition, signaling processes, and the development and progression of cardiovascular and neurological disorders, so identifying S-sulfhydration sites is crucial for studies in cell biology. Deep learning shows high efficiency and accuracy in identifying protein sites compared to traditional methods that often lack sensitivity and specificity in accurately locating nonsulfhydration sites. Therefore, we employ deep learning methods to tackle the challenge of pinpointing S-sulfhydration sites.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this work, we introduce a deep learning approach called Sul-BertGRU, designed specifically for predicting S-sulfhydration sites in proteins, which integrates multi-directional gated recurrent unit (GRU) and BERT. First, Sul-BertGRU proposes an information entropy-enhanced BERT (IE-BERT) to preprocess protein sequences and extract initial features. Subsequently, confidence learning is employed to eliminate potential S-sulfhydration samples from the nonsulfhydration samples and select reliable negative samples. Then, considering the directional nature of the modification process, protein sequences are categorized into left, right, and full sequences centered on cysteines. We build a multi-directional GRU to enhance the extraction of directional sequence features and model the details of the enzymatic reaction involved in S-sulfhydration. Ultimately, we apply a parallel multi-head self-attention mechanism alongside a convolutional neural network to deeply analyze sequence features that might be missed at a local level. Sul-BertGRU achieves sensitivity, specificity, precision, accuracy, Matthews correlation coefficient, and area under the curve scores of 85.82%, 68.24%, 74.80%, 77.44%, 55.13%, and 77.03%, respectively. Sul-BertGRU demonstrates exceptional performance and proves to be a reliable method for predicting protein S-sulfhydration sites.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code and data are available at https:\/\/github.com\/Severus0902\/Sul-BertGRU\/.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf078","type":"journal-article","created":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T18:07:41Z","timestamp":1740074861000},"source":"Crossref","is-referenced-by-count":2,"title":["Sul-BertGRU: an ensemble deep learning method integrating information entropy-enhanced BERT and directional multi-GRU for S-sulfhydration sites prediction"],"prefix":"10.1093","volume":"41","author":[{"given":"Xirun","family":"Wei","sequence":"first","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4809-5104","authenticated-orcid":false,"given":"Qiao","family":"Ning","sequence":"additional","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]},{"name":"School of Artificial Intelligence and Computer Science, Jiangnan University , Wuxi 214122,","place":["P.R. China"]},{"name":"Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University , Changchun 130012,","place":["P.R. China"]},{"name":"Neutech Group Limited , Dalian 116023,","place":["P.R. China"]}]},{"given":"Kuiyang","family":"Che","sequence":"additional","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]}]},{"given":"Zhaowei","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]}]},{"given":"Hui","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]}]},{"given":"Shikai","family":"Guo","sequence":"additional","affiliation":[{"name":"Department of Information Science and Technology, Dalian Maritime University , Dalian 116026,","place":["P.R. China"]},{"name":"The Dalian Key Laboratory of Artificial Intelligence , Dalian 116026,","place":["P.R. China"]}]}],"member":"286","published-online":{"date-parts":[[2025,2,20]]},"reference":[{"key":"2025031417311535100_btaf078-B1","doi-asserted-by":"crossref","first-page":"e25369","DOI":"10.1016\/j.heliyon.2024.e25369","article-title":"A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction","volume":"10","author":"Abbasi","year":"2024","journal-title":"Heliyon"},{"key":"2025031417311535100_btaf078-B2","doi-asserted-by":"crossref","first-page":"125375","DOI":"10.1016\/j.energy.2022.125375","article-title":"Soc estimation for lithium-ion battery using the LSTM-RNN with extended input and constrained output","volume":"262","author":"Chen","year":"2023","journal-title":"Energy"},{"key":"2025031417311535100_btaf078-B3","author":"Chung","year":"2014"},{"key":"2025031417311535100_btaf078-B4","first-page":"4171","author":"Devlin","year":"2019"},{"key":"2025031417311535100_btaf078-B5","first-page":"278","author":"Ding","year":"2018"},{"key":"2025031417311535100_btaf078-B6","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1023\/B:NERE.0000014832.78725.1a","article-title":"Redox control of signal transduction, gene expression and cellular senescence","volume":"29","author":"Esposito","year":"2004","journal-title":"Neurochem Res"},{"key":"2025031417311535100_btaf078-B7","doi-asserted-by":"crossref","first-page":"101579","DOI":"10.1016\/j.arr.2022.101579","article-title":"Protein s-sulfhydration: unraveling the prospective of hydrogen sulfide in the brain, vasculature and neurological manifestations","volume":"76","author":"Gupta","year":"2022","journal-title":"Ageing Res Rev"},{"key":"2025031417311535100_btaf078-B8","doi-asserted-by":"crossref","first-page":"2545","DOI":"10.1039\/C7MB00491E","article-title":"Computational identification of protein s-sulfenylation sites by incorporating the multiple sequence features information","volume":"13","author":"Hasan","year":"2017","journal-title":"Mol Biosyst"},{"key":"2025031417311535100_btaf078-B9","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1089\/cmb.2017.0069","article-title":"Prediction of rare palmitoylation events in proteins","volume":"25","author":"Kumari","year":"2018","journal-title":"J Comput Biol"},{"key":"2025031417311535100_btaf078-B10","doi-asserted-by":"crossref","first-page":"bbab005","DOI":"10.1093\/bib\/bbab005","article-title":"A transformer architecture based on BERT and 2D convolutional neural network to identify dna enhancers from sequence information","volume":"22","author":"Le","year":"2021","journal-title":"Brief Bioinform"},{"key":"2025031417311535100_btaf078-B11","doi-asserted-by":"crossref","first-page":"12581","DOI":"10.1109\/TPAMI.2023.3282631","article-title":"Uniformer: unifying convolution and self-attention for visual recognition","volume":"45","author":"Li","year":"2023","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025031417311535100_btaf078-B12","doi-asserted-by":"crossref","first-page":"617366","DOI":"10.3389\/fcell.2021.617366","article-title":"pCysMod: prediction of multiple cysteine modifications based on deep learning framework","volume":"9","author":"Li","year":"2021","journal-title":"Front Cell Dev Biol"},{"key":"2025031417311535100_btaf078-B13","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1016\/j.molcel.2014.10.019","article-title":"Cytokine-induced GAPDH sulfhydration affects PSD95 degradation and memory","volume":"56","author":"Mir","year":"2014","journal-title":"Mol Cell"},{"key":"2025031417311535100_btaf078-B14","doi-asserted-by":"crossref","first-page":"ra72","DOI":"10.1126\/scisignal.2000464","article-title":"H2s signals through protein s-sulfhydration","volume":"2","author":"Mustafa","year":"2009","journal-title":"Sci Signal"},{"key":"2025031417311535100_btaf078-B15","doi-asserted-by":"crossref","first-page":"bbac323","DOI":"10.1093\/bib\/bbac323","article-title":"DLF-Sul: a multi-module deep learning framework for prediction of s-sulfinylation sites in proteins","volume":"23","author":"Ning","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025031417311535100_btaf078-B16","doi-asserted-by":"crossref","first-page":"104398","DOI":"10.1016\/j.chemolab.2021.104398","article-title":"m7G-DLSTM: intergrating directional double-lstm and fully connected network for RNA N7-methlguanosine sites prediction in human","volume":"217","author":"Ning","year":"2021","journal-title":"Chemometr Intell Lab Syst"},{"key":"2025031417311535100_btaf078-B17","doi-asserted-by":"crossref","first-page":"1836","DOI":"10.1093\/bib\/bbaa038","article-title":"GPS-Palm: a deep learning-based graphic presentation system for the prediction of s-palmitoylation sites in proteins","volume":"22","author":"Ning","year":"2021","journal-title":"Brief Bioinform"},{"key":"2025031417311535100_btaf078-B18","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1613\/jair.1.12125","article-title":"Confident learning: estimating uncertainty in dataset labels","volume":"70","author":"Northcutt","year":"2021","journal-title":"Jair"},{"key":"2025031417311535100_btaf078-B19","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.imavis.2018.04.004","article-title":"Beyond one-hot encoding: lower dimensional target embedding","volume":"75","author":"Rodr\u00edguez","year":"2018","journal-title":"Image Vision Comput"},{"key":"2025031417311535100_btaf078-B20","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.molcel.2011.10.021","article-title":"Hydrogen sulfide-linked sulfhydration of Nf-\u03bab mediates its antiapoptotic actions","volume":"45","author":"Sen","year":"2012","journal-title":"Mol Cell"},{"key":"2025031417311535100_btaf078-B21","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1021\/acs.accounts.9b00562","article-title":"Activity-based sensing for site-specific proteomic analysis of cysteine oxidation","volume":"53","author":"Shi","year":"2020","journal-title":"Acc Chem Res"},{"key":"2025031417311535100_btaf078-B22","article-title":"Attention is all you need","author":"Vaswani"},{"key":"2025031417311535100_btaf078-B23","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1002\/anie.201305876","article-title":"Detection of protein s-sulfhydration by a tag-switch technique","volume":"53","author":"Zhang","year":"2014","journal-title":"Angew Chem Int Ed Engl"},{"key":"2025031417311535100_btaf078-B24","doi-asserted-by":"crossref","first-page":"108032","DOI":"10.1016\/j.cmpb.2024.108032","article-title":"Label correlation guided discriminative label feature learning for multi-label chest image classification","volume":"245","author":"Zhang","year":"2024","journal-title":"Comput Methods Programs Biomed"},{"key":"2025031417311535100_btaf078-B25","doi-asserted-by":"crossref","first-page":"1394","DOI":"10.1021\/acs.jcim.3c02025","article-title":"FCMSTrans: accurate prediction of disease-associated nsSNPs by utilizing multiscale convolution and deep feature combination within a transformer framework","volume":"64","author":"Zhang","year":"2024","journal-title":"J Chem Inf Model"},{"key":"2025031417311535100_btaf078-B26","doi-asserted-by":"crossref","first-page":"107145","DOI":"10.1016\/j.compbiomed.2023.107145","article-title":"BIGRUD-SA: protein S-sulfenylation sites prediction based on BiGRU and self-attention","volume":"163","author":"Zhang","year":"2023","journal-title":"Comput Biol Med"},{"key":"2025031417311535100_btaf078-B27","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1016\/j.intimp.2019.05.052","article-title":"Hydrogen sulfide donor NaHs alters antibody structure and function via sulfhydration","volume":"73","author":"Zhang","year":"2019","journal-title":"Int Immunopharmacol"},{"key":"2025031417311535100_btaf078-B28","doi-asserted-by":"crossref","first-page":"2379","DOI":"10.1109\/JBHI.2021.3123503","article-title":"Mul-SNO: a novel prediction tool for s-nitrosylation sites based on deep learning methods","volume":"26","author":"Zhao","year":"2022","journal-title":"IEEE J Biomed Health Inform"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf078\/62014674\/btaf078.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/3\/btaf078\/62014674\/btaf078.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/3\/btaf078\/62014674\/btaf078.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,14]],"date-time":"2025-03-14T17:31:34Z","timestamp":1741973494000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf078\/8026686"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,2,20]]},"references-count":28,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf078","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,3]]},"published":{"date-parts":[[2025,2,20]]},"article-number":"btaf078"}}