{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T04:25:36Z","timestamp":1770956736413,"version":"3.50.1"},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"W1","license":[{"start":{"date-parts":[[2021,5,17]],"date-time":"2021-05-17T00:00:00Z","timestamp":1621209600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004681","name":"HEC","doi-asserted-by":"publisher","award":["21-320SRGP\/R&D\/HEC\/2014"],"award-info":[{"award-number":["21-320SRGP\/R&D\/HEC\/2014"]}],"id":[{"id":"10.13039\/501100004681","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004681","name":"HEC","doi-asserted-by":"publisher","award":["20-2269\/NRPU\/R&D\/HEC\/12\/4792"],"award-info":[{"award-number":["20-2269\/NRPU\/R&D\/HEC\/12\/4792"]}],"id":[{"id":"10.13039\/501100004681","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004681","name":"HEC","doi-asserted-by":"publisher","award":["20-3629\/NRPU\/R&D\/HEC\/14\/585"],"award-info":[{"award-number":["20-3629\/NRPU\/R&D\/HEC\/14\/585"]}],"id":[{"id":"10.13039\/501100004681","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ignite","award":["SRG-209"],"award-info":[{"award-number":["SRG-209"]}]},{"DOI":"10.13039\/100004457","name":"TWAS","doi-asserted-by":"publisher","award":["RG 14-319 RG\/ITC\/AS_C"],"award-info":[{"award-number":["RG 14-319 RG\/ITC\/AS_C"]}],"id":[{"id":"10.13039\/100004457","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004457","name":"TWAS","doi-asserted-by":"publisher","award":["LUMS"],"award-info":[{"award-number":["LUMS"]}],"id":[{"id":"10.13039\/100004457","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004457","name":"TWAS","doi-asserted-by":"publisher","award":["STG-BIO-1008"],"award-info":[{"award-number":["STG-BIO-1008"]}],"id":[{"id":"10.13039\/100004457","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004457","name":"TWAS","doi-asserted-by":"publisher","award":["FIF-BIO-2052"],"award-info":[{"award-number":["FIF-BIO-2052"]}],"id":[{"id":"10.13039\/100004457","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004457","name":"TWAS","doi-asserted-by":"publisher","award":["FIF-BIO-0255"],"award-info":[{"award-number":["FIF-BIO-0255"]}],"id":[{"id":"10.13039\/100004457","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,7,2]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>PERCEPTRON is a next-generation freely available web-based proteoform identification and characterization platform for top-down proteomics (TDP). PERCEPTRON search pipeline brings together algorithms for (i) intact protein mass tuning, (ii) de novo sequence tags-based filtering, (iii) characterization of terminal as well as post-translational modifications, (iv) identification of truncated proteoforms, (v) in silico spectral comparison, and (vi) weight-based candidate protein scoring. High-throughput performance is achieved through the execution of optimized code via multiple threads in parallel, on graphics processing units (GPUs) using NVidia Compute Unified Device Architecture (CUDA) framework. An intuitive graphical web interface allows for setting up of search parameters as well as for visualization of results. The accuracy and performance of the tool have been validated on several TDP datasets and against available TDP software. Specifically, results obtained from searching two published TDP datasets demonstrate that PERCEPTRON outperforms all other tools by up to 135% in terms of reported proteins and 10-fold in terms of runtime. In conclusion, the proposed tool significantly enhances the state-of-the-art in TDP search software and is publicly available at https:\/\/perceptron.lums.edu.pk. Users can also create in-house deployments of the tool by building code available on the GitHub repository (http:\/\/github.com\/BIRL\/Perceptron).<\/jats:p>","DOI":"10.1093\/nar\/gkab368","type":"journal-article","created":{"date-parts":[[2021,4,25]],"date-time":"2021-04-25T11:09:02Z","timestamp":1619348942000},"page":"W510-W515","source":"Crossref","is-referenced-by-count":3,"title":["PERCEPTRON: an open-source GPU-accelerated proteoform identification pipeline for top-down proteomics"],"prefix":"10.1093","volume":"49","author":[{"given":"Muhammad Farhan","family":"Khalid","sequence":"first","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kanzal","family":"Iman","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Amna","family":"Ghafoor","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mujtaba","family":"Saboor","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ahsan","family":"Ali","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Urwa","family":"Muaz","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4675-5375","authenticated-orcid":false,"given":"Abdul Rehman","family":"Basharat","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Taha","family":"Tahir","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Muhammad","family":"Abubakar","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Momina Amer","family":"Akhter","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Waqar","family":"Nabi","sequence":"additional","affiliation":[{"name":"School of Computing Science, University of Glasgow, Glasgow, G12 8QQ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wim","family":"Vanderbauwhede","sequence":"additional","affiliation":[{"name":"School of Computing Science, University of Glasgow, Glasgow, G12 8QQ, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fayyaz","family":"Ahmad","sequence":"additional","affiliation":[{"name":"Department of Statistics, University of Gujrat, Gujrat, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bilal","family":"Wajid","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, University of Engineering and Technology, Lahore,\u00a0Pakistan"},{"name":"Department of Computer Science, University of Management and Technology, Lahore, Pakistan"},{"name":"Division of Research and Development, Sabz-Qalam, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3758-6581","authenticated-orcid":false,"given":"Safee Ullah","family":"Chaudhary","sequence":"additional","affiliation":[{"name":"Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,5,17]]},"reference":[{"key":"2021070812074984200_B1","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1038\/nmeth.2369","article-title":"Proteoform: a single term describing protein complexity","volume":"10","author":"Smith","year":"2013","journal-title":"Nat. Methods"},{"key":"2021070812074984200_B2","doi-asserted-by":"crossref","first-page":"3465","DOI":"10.1074\/mcp.M113.030114","article-title":"Large-scale top-down proteomics of the human proteome: membrane proteins, mitochondria, and senescence","volume":"12","author":"Catherman","year":"2013","journal-title":"Mol. Cell. Proteomics"},{"key":"2021070812074984200_B3","doi-asserted-by":"crossref","first-page":"1880","DOI":"10.1021\/ac3031527","article-title":"Top down proteomics of human membrane proteins from enriched mitochondrial fractions","volume":"85","author":"Catherman","year":"2013","journal-title":"Anal. Chem."},{"key":"2021070812074984200_B4","doi-asserted-by":"crossref","first-page":"10153","DOI":"10.1073\/pnas.1221210110","article-title":"Top-down proteomics reveals a unique protein S-thiolation switch in Salmonella Typhimurium in response to infection-like conditions","volume":"110","author":"Ansong","year":"2013","journal-title":"Proc. Natl. Acad. Sci. U.S.A."},{"key":"2021070812074984200_B5","doi-asserted-by":"crossref","first-page":"4054","DOI":"10.1021\/pr200258m","article-title":"Top-down quantitative proteomics identified phosphorylation of cardiac troponin I as a candidate biomarker for chronic heart failure","volume":"10","author":"Zhang","year":"2011","journal-title":"J. Proteome Res."},{"key":"2021070812074984200_B6","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1016\/j.yjmcc.2015.08.007","article-title":"Comprehensive assessment of chamber-specific and transmural heterogeneity in myofilament protein phosphorylation by top-down mass spectrometry","volume":"87","author":"Gregorich","year":"2015","journal-title":"J. Mol. Cell. Cardiol."},{"key":"2021070812074984200_B7","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1038\/nbt.2839","article-title":"ProteomeXchange provides globally coordinated proteomics data submission and dissemination","volume":"32","author":"Vizca\u00edno","year":"2014","journal-title":"Nat. Biotechnol."},{"key":"2021070812074984200_B8","doi-asserted-by":"crossref","DOI":"10.1074\/mcp.M111.008524","article-title":"Protein identification using top-down spectra","volume":"11","author":"Liu","year":"2012","journal-title":"Mol. Cell. Proteomics"},{"key":"2021070812074984200_B9","doi-asserted-by":"crossref","first-page":"3082","DOI":"10.1021\/acs.analchem.5b03963","article-title":"pTop 1.0: a high-accuracy and high-efficiency search engine for intact protein identification","volume":"88","author":"Sun","year":"2016","journal-title":"Anal. Chem."},{"key":"2021070812074984200_B10","doi-asserted-by":"crossref","first-page":"3495","DOI":"10.1093\/bioinformatics\/btw398","article-title":"TopPIC: a software tool for top-down mass spectrometry-based proteoform identification and characterization","volume":"32","author":"Kou","year":"2016","journal-title":"Bioinformatics"},{"key":"2021070812074984200_B11","doi-asserted-by":"crossref","first-page":"909","DOI":"10.1038\/nmeth.4388","article-title":"Informed-Proteomics: open-source software package for top-down proteomics","volume":"14","author":"Park","year":"2017","journal-title":"Nat. Methods"},{"key":"2021070812074984200_B12","doi-asserted-by":"crossref","first-page":"W340","DOI":"10.1093\/nar\/gkh447","article-title":"ProSight PTM: an integrated environment for protein identification and characterization by top-down mass spectrometry","volume":"32","author":"LeDuc","year":"2004","journal-title":"Nucleic. Acids. Res."},{"key":"2021070812074984200_B13","doi-asserted-by":"crossref","first-page":"W701","DOI":"10.1093\/nar\/gkm371","article-title":"ProSight PTM 2.0: improved protein identification and characterization for top down mass spectrometry","volume":"35","author":"Zamdborg","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2021070812074984200_B14","doi-asserted-by":"crossref","first-page":"2118","DOI":"10.1002\/pro.3247","article-title":"Crowd sourcing difficult problems in protein science","volume":"26","author":"Alexander","year":"2017","journal-title":"Protein Sci."},{"key":"2021070812074984200_B15","doi-asserted-by":"crossref","first-page":"11267","DOI":"10.1038\/s41598-019-47724-1","article-title":"SPECTRUM\u2013A MATLAB toolbox for proteoform identification from top-down proteomics data","volume":"9","author":"Basharat","year":"2019","journal-title":"Sci. Rep."},{"key":"2021070812074984200_B16","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.1038\/nbt1031","article-title":"A common open representation of mass spectrometry data and its application to proteomics research","volume":"22","author":"Pedrioli","year":"2004","journal-title":"Nat. Biotechnol."},{"key":"2021070812074984200_B17","doi-asserted-by":"crossref","first-page":"3551","DOI":"10.1002\/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2","article-title":"Probability-based protein identification by searching sequence databases using mass spectrometry data","volume":"20","author":"Perkins","year":"1999","journal-title":"Electrophoresis"},{"key":"2021070812074984200_B18","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1007\/978-1-60761-987-1_11","article-title":"Spectra, chromatograms, metadata: mzML-the standard data format for mass spectrometer output","volume-title":"Data Mining in Proteomics","author":"Turewicz","year":"2011"},{"key":"2021070812074984200_B19","doi-asserted-by":"crossref","first-page":"R110. 000133","DOI":"10.1074\/mcp.R110.000133","article-title":"mzML\u2014a community standard for mass spectrometry data","volume":"10","author":"Martens","year":"2011","journal-title":"Mol. Cell. Proteomics"},{"key":"2021070812074984200_B20","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1021\/ac702324u","article-title":"Interpreting top-down mass spectra using spectral alignment","volume":"80","author":"Frank","year":"2008","journal-title":"Anal. Chem."}],"container-title":["Nucleic Acids Research"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/nar\/article-pdf\/49\/W1\/W510\/38841731\/gkab368.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/nar\/article-pdf\/49\/W1\/W510\/38841731\/gkab368.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,2]],"date-time":"2023-11-02T18:18:13Z","timestamp":1698949093000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/nar\/article\/49\/W1\/W510\/6276909"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,17]]},"references-count":20,"journal-issue":{"issue":"W1","published-online":{"date-parts":[[2021,5,17]]},"published-print":{"date-parts":[[2021,7,2]]}},"URL":"https:\/\/doi.org\/10.1093\/nar\/gkab368","relation":{},"ISSN":["0305-1048","1362-4962"],"issn-type":[{"value":"0305-1048","type":"print"},{"value":"1362-4962","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,7,2]]},"published":{"date-parts":[[2021,5,17]]}}}