{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T09:51:51Z","timestamp":1771235511246,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T00:00:00Z","timestamp":1756252800000},"content-version":"vor","delay-in-days":57,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R35-GM126985"],"award-info":[{"award-number":["R35-GM126985"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"US National Science Foundation","doi-asserted-by":"crossref","award":["DBI-2145226"],"award-info":[{"award-number":["DBI-2145226"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,7,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Protein targeting, often guided by targeting peptides, is a critical biological process that directs proteins to their specific cellular destinations, ensuring proper cellular functionality and organization. Accurate classification and detection of targeting peptides are fundamental to understanding protein sorting mechanisms. This study introduces MULoc-Target, a novel deep-learning method designed to detect and classify targeting peptides in eukaryotic proteins. To support its development and evaluation, we curated a benchmark dataset comprising eight types of eukaryotic targeting peptides with manually curated annotations. Comprehensive evaluations on this dataset and external datasets from the literature demonstrate that MULoc-Target achieves state-of-the-art or competitive performance in detecting and classifying targeting peptides. Additionally, it enables the extraction of enriched motif patterns, offering valuable insights into their properties and the underlying targeting mechanisms. The identified motifs align closely with established biological features, further validating MULoc-Target's capabilities. A web server for MULoc-Target is integrated into our MULocDeep localization suite as a new toolkit, publicly accessible at https:\/\/mu-loc.org\/MULoc-Target, and the inference code is available at https:\/\/github.com\/yuexujiang\/MULoc-Target.<\/jats:p>","DOI":"10.1093\/bib\/bbaf436","type":"journal-article","created":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T11:12:32Z","timestamp":1756293152000},"source":"Crossref","is-referenced-by-count":1,"title":["MULoc-target: Targeting peptide classification and detection using a protein language model"],"prefix":"10.1093","volume":"26","author":[{"given":"Yuexu","family":"Jiang","sequence":"first","affiliation":[{"name":"Department of Chemical and Materials Engineering, University of Kentucky , 177, 512 Administration Dr F. Paul Anderson Tower, Lexington, KY 40506 ,","place":["United States"]},{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Duolin","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Shuai","family":"Zeng","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Yichuan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Lei","family":"Jiang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Mahdi","family":"Pourmirzaei","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Negin","family":"Manshour","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Farzaneh","family":"Esmaili","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Weinan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]},{"given":"Ian M","family":"M\u00f8ller","sequence":"additional","affiliation":[{"name":"Department of Molecular Biology and Genetics, Aarhus University, Fors\u00f8gsvej 1 , Slagelse DK-4200 ,","place":["Denmark"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4809-0514","authenticated-orcid":false,"given":"Dong","family":"Xu","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri , 1201 Rollins St, Columbia, MO 65211 ,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,8,27]]},"reference":[{"key":"2025092303113160200_ref1","author":"Lodish"},{"key":"2025092303113160200_ref2","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1016\/j.atherosclerosis.2015.11.027","article-title":"Protein sorting gone wrong \u2013 VPS10P domain receptors in cardiovascular and metabolic diseases","volume":"245","author":"Schmidt","year":"2016","journal-title":"Atherosclerosis"},{"key":"2025092303113160200_ref3","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1146\/annurev-cellbio-100913-013012","article-title":"Protein sorting at the trans -Golgi network","volume":"30","author":"Guo","year":"2014","journal-title":"Annu Rev Cell Dev Biol"},{"key":"2025092303113160200_ref4","doi-asserted-by":"publisher","first-page":"835","DOI":"10.1083\/jcb.67.3.835","article-title":"Transfer of proteins across membranes. I. Presence of proteolytically processed and unprocessed nascent immunoglobulin light chains on membrane-bound ribosomes of murine myeloma","volume":"67","author":"Blobel","year":"1975","journal-title":"J Cell Biol"},{"key":"2025092303113160200_ref5","doi-asserted-by":"publisher","first-page":"3381","DOI":"10.1242\/jcs.089110","article-title":"Protein localization in disease and therapy","volume":"124","author":"Hung","year":"2011","journal-title":"J Cell Sci"},{"key":"2025092303113160200_ref6","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1111\/tra.12310","article-title":"Mechanisms regulating protein localization: Mechanisms regulating protein localization","volume":"16","author":"Bauer","year":"2015","journal-title":"Traffic"},{"key":"2025092303113160200_ref7","doi-asserted-by":"publisher","first-page":"338","DOI":"10.1083\/jcb1703fta1","article-title":"Lost in translation","volume":"170","author":"Leslie","year":"2005","journal-title":"J Cell Biol"},{"key":"2025092303113160200_ref8","doi-asserted-by":"publisher","first-page":"889","DOI":"10.1091\/mbc.e02-08-0468","article-title":"KDEL and KKXX retrieval signals appended to the same reporter protein determine different trafficking between endoplasmic reticulum, intermediate compartment, and Golgi complex","volume":"14","author":"Stornaiuolo","year":"2003","journal-title":"MBoC"},{"key":"2025092303113160200_ref9","doi-asserted-by":"publisher","first-page":"e201900429","DOI":"10.26508\/lsa.201900429","article-title":"Detecting sequence signals in targeting peptides using deep learning","volume":"2","author":"Almagro Armenteros","year":"2019","journal-title":"Life Sci Alliance"},{"key":"2025092303113160200_ref10","doi-asserted-by":"publisher","DOI":"10.1038\/s41587-021-01156-3","article-title":"SignalP 6.0 predicts all five types of signal peptides using protein language models","volume":"40","author":"Teufel","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025092303113160200_ref11","doi-asserted-by":"publisher","first-page":"W228","DOI":"10.1093\/nar\/gkac278","article-title":"DeepLoc 2.0: Multi-label subcellular localization prediction using protein language models","volume":"50","author":"Thumuluri","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025092303113160200_ref12","doi-asserted-by":"publisher","first-page":"4825","DOI":"10.1016\/j.csbj.2021.08.027","article-title":"MULocDeep: A deep-learning framework for protein subcellular and suborganellar localization prediction with residue-level interpretation","volume":"19","author":"Jiang","year":"2021","journal-title":"Comput Struct Biotechnol J"},{"key":"2025092303113160200_ref13","doi-asserted-by":"publisher","first-page":"W343","DOI":"10.1093\/nar\/gkad374","article-title":"MULocDeep web service for protein localization prediction and visualization at subcellular and suborganellar levels","volume":"51","author":"Jiang","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025092303113160200_ref14","doi-asserted-by":"publisher","first-page":"1123","DOI":"10.1126\/science.ade2574","article-title":"Evolutionary-scale prediction of atomic-level protein structure with a language model","volume":"379","author":"Lin","year":"2023","journal-title":"Science"},{"key":"2025092303113160200_ref15","doi-asserted-by":"publisher","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: A worldwide hub of protein knowledge","volume":"47","author":"The UniProt Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2025092303113160200_ref16","doi-asserted-by":"publisher","first-page":"2444","DOI":"10.1073\/pnas.85.8.2444","article-title":"Improved tools for biological sequence comparison","volume":"85","author":"Pearson","year":"1988","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025092303113160200_ref17","author":"Edward"},{"key":"2025092303113160200_ref18","author":"Houlsby"},{"key":"2025092303113160200_ref19","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1093\/nar\/gkg571","article-title":"LGA: A method for finding 3D similarities in protein structures","volume":"31","author":"Zemla","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2025092303113160200_ref20","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1002\/(SICI)1097-0134(19990201)34:2&lt;220::AID-PROT7&gt;3.0.CO;2-K","article-title":"A modified definition of sov, a segment-based measure for protein secondary structure prediction assessment","volume":"34","author":"Zemla","year":"1999","journal-title":"Proteins"},{"key":"2025092303113160200_ref21","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1111\/j.1469-8137.1912.tb05611.x","article-title":"The distribution of the flora in the alpine zone","volume":"11","author":"Jaccard","year":"1912","journal-title":"New Phytol"},{"key":"2025092303113160200_ref22","doi-asserted-by":"publisher","first-page":"113565","DOI":"10.1016\/j.ab.2019.113565","article-title":"Discovering nuclear targeting signal sequence through protein language learning and multivariate analysis","volume":"591","author":"Guo","year":"2020","journal-title":"Anal Biochem"},{"key":"2025092303113160200_ref23","doi-asserted-by":"publisher","first-page":"e1003841","DOI":"10.1371\/journal.pcbi.1003841","article-title":"NESmapper: Accurate prediction of leucine-rich nuclear export signals using activity-based profiles","volume":"10","author":"Kosugi","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2025092303113160200_ref24","doi-asserted-by":"publisher","first-page":"e1000071","DOI":"10.1371\/journal.pcbi.1000071","article-title":"Discovering sequence motifs with arbitrary insertions and deletions","volume":"4","author":"Frith","year":"2008","journal-title":"PLoS Comput Biol"},{"key":"2025092303113160200_ref25","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1093\/protein\/4.1.33","article-title":"Cleavage-site motifs in mitochondrial targeting peptides","volume":"4","author":"Gavel","year":"1990","journal-title":"Protein Eng Des Sel"},{"key":"2025092303113160200_ref26","doi-asserted-by":"publisher","first-page":"856","DOI":"10.1111\/j.1432-1033.1996.00856.x","article-title":"Determinants in the Presequence of cytochrome b2 for import into mitochondria and for proteolytic processing","volume":"236","author":"Klaus","year":"1996","journal-title":"Eur J Biochem"},{"key":"2025092303113160200_ref27","doi-asserted-by":"publisher","first-page":"1938","DOI":"10.1021\/acschembio.3c00068","article-title":"An inherent difference between serine and threonine phosphorylation: Phosphothreonine strongly prefers a highly ordered, compact","volume":"18","author":"Pandey","year":"2023","journal-title":"Cyclic Conformation ACS Chem Biol"},{"key":"2025092303113160200_ref28","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1083\/jcb.147.1.33","article-title":"Stromal processing peptidase binds transit peptides and initiates their Atp-dependent turnover in chloroplasts","volume":"147","author":"Richter","year":"1999","journal-title":"J Cell Biol"},{"key":"2025092303113160200_ref29","doi-asserted-by":"publisher","first-page":"2715","DOI":"10.1002\/j.1460-2075.1995.tb07272.x","article-title":"A new type of signal peptide: Central role of a twin-arginine motif in transfer signals for the delta pH-dependent thylakoidal protein translocase","volume":"14","author":"Chaddock","year":"1995","journal-title":"EMBO J"},{"key":"2025092303113160200_ref30","doi-asserted-by":"publisher","first-page":"17202","DOI":"10.1016\/S0021-9258(17)44889-7","article-title":"Structural features in the NH2-terminal region of a model eukaryotic signal peptide influence the site of its cleavage by signal peptidase","volume":"265","author":"Nothwehr","year":"1990","journal-title":"J Biol Chem"},{"key":"2025092303113160200_ref31","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1016\/S0022-2836(83)80341-6","article-title":"A putative signal peptidase recognition site and sequence in eukaryotic and prokaryotic signal peptides","volume":"167","author":"Perlman","year":"1983","journal-title":"J Mol Biol"},{"key":"2025092303113160200_ref32","doi-asserted-by":"publisher","first-page":"1317","DOI":"10.1074\/jbc.M008522200","article-title":"Dissection of a nuclear localization signal","volume":"276","author":"Hodel","year":"2001","journal-title":"J Biol Chem"},{"key":"2025092303113160200_ref33","doi-asserted-by":"publisher","first-page":"1136","DOI":"10.1038\/nature07975","article-title":"Structural basis for leucine-rich nuclear export signal recognition by CRM1","volume":"458","author":"Dong","year":"2009","journal-title":"Nature"},{"key":"2025092303113160200_ref34","doi-asserted-by":"publisher","first-page":"563","DOI":"10.1038\/225563a0","article-title":"Natural selection and the concept of a protein space","volume":"225","author":"Maynard","year":"1970","journal-title":"Nature"},{"key":"2025092303113160200_ref35","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1038\/nature23320","article-title":"Proteins evolve on the edge of supramolecular self-assembly","volume":"548","author":"Garcia-Seisdedos","year":"2017","journal-title":"Nature"},{"key":"2025092303113160200_ref36","doi-asserted-by":"publisher","volume":"387","author":"Hayes","DOI":"10.1126\/science.ads0018"},{"key":"2025092303113160200_ref37","doi-asserted-by":"publisher","first-page":"1091","DOI":"10.1038\/81930","article-title":"Peroxisomal targeting signal-1 recognition by the TPR domains of human PEX5","volume":"7","author":"Berg","year":"2000","journal-title":"Nat Struct Biol"},{"key":"2025092303113160200_ref38","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1038\/s41586-024-07487-w","article-title":"Accurate structure prediction of biomolecular interactions with AlphaFold 3","volume":"630","author":"Abramson","year":"2024","journal-title":"Nature"},{"key":"2025092303113160200_ref39","doi-asserted-by":"publisher","DOI":"10.1126\/science.adq2634","volume":"387","author":"","journal-title":"Science"},{"key":"2025092303113160200_ref40","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2021.3095381","volume":"44","author":"Elnaggar","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025092303113160200_ref41","doi-asserted-by":"publisher","DOI":"10.1101\/2023.08.06.552203","volume":"13","author":"Wang","journal-title":"bioRxiv [Preprint]"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf436\/64143059\/bbaf436.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/4\/bbaf436\/64143059\/bbaf436.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T07:11:39Z","timestamp":1758611499000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf436\/8242128"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7]]},"references-count":41,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,7,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf436","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,7]]},"published":{"date-parts":[[2025,7]]},"article-number":"bbaf436"}}