{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,5]],"date-time":"2026-04-05T09:52:26Z","timestamp":1775382746050,"version":"3.50.1"},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2023,8,1]],"date-time":"2023-08-01T00:00:00Z","timestamp":1690848000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100009708","name":"Novo Nordisk Foundation","doi-asserted-by":"publisher","award":["NNF14CC0001"],"award-info":[{"award-number":["NNF14CC0001"]}],"id":[{"id":"10.13039\/501100009708","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Summary<\/jats:title>\n                  <jats:p>The widespread application of mass spectrometry (MS)-based proteomics in biomedical research increasingly requires robust, transparent, and streamlined solutions to extract statistically reliable insights. We have designed and implemented AlphaPeptStats, an inclusive Python package with currently with broad functionalities for normalization, imputation, visualization, and statistical analysis of label-free proteomics data. It modularly builds on the established stack of Python scientific libraries and is accompanied by a rigorous testing framework with 98% test coverage. It imports the output of a range of popular search engines. Data can be filtered and normalized according to user specifications. At its heart, AlphaPeptStats provides a wide range of robust statistical algorithms such as t-tests, analysis of variance, principal component analysis, hierarchical clustering, and multiple covariate analysis\u2014all in an automatable manner. Data visualization capabilities include heat maps, volcano plots, and scatter plots in publication-ready format. AlphaPeptStats advances proteomic research through its robust tools that enable researchers to manually or automatically explore complex datasets to identify interesting patterns and outliers.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>AlphaPeptStats is implemented in Python and part of the AlphaPept framework. It is released under a permissive Apache license. The source code and one-click installers are freely available and on GitHub at\u00a0https:\/\/github.com\/MannLabs\/alphapeptstats.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad461","type":"journal-article","created":{"date-parts":[[2023,8,1]],"date-time":"2023-08-01T16:24:11Z","timestamp":1690907051000},"source":"Crossref","is-referenced-by-count":22,"title":["AlphaPeptStats: an open-source Python package for automated and scalable statistical analysis of mass spectrometry-based proteomics"],"prefix":"10.1093","volume":"39","author":[{"given":"Elena","family":"Krismer","sequence":"first","affiliation":[{"name":"Department of Clinical Proteomics, Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen , 2200 Copenhagen, Denmark"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2601-238X","authenticated-orcid":false,"given":"Isabell","family":"Bludau","sequence":"additional","affiliation":[{"name":"Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry , 82152 Martinsried, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3320-6833","authenticated-orcid":false,"given":"Maximilian T","family":"Strauss","sequence":"additional","affiliation":[{"name":"Department of Clinical Proteomics, Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen , 2200 Copenhagen, Denmark"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1292-4799","authenticated-orcid":false,"given":"Matthias","family":"Mann","sequence":"additional","affiliation":[{"name":"Department of Clinical Proteomics, Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen , 2200 Copenhagen, Denmark"},{"name":"Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry , 82152 Martinsried, Germany"}]}],"member":"286","published-online":{"date-parts":[[2023,8,1]]},"reference":[{"key":"2023081102032995200_btad461-B1","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1038\/nature19949","article-title":"Mass-spectrometric exploration of proteome structure and function","volume":"537","author":"Aebersold","year":"2016","journal-title":"Nature"},{"key":"2023081102032995200_btad461-B2","doi-asserted-by":"crossref","first-page":"3148","DOI":"10.1093\/bioinformatics\/btaa118","article-title":"Proline: an efficient and user-friendly software suite for large-scale proteomics","volume":"36","author":"Bouyssi\u00e9","year":"2020","journal-title":"Bioinformatics"},{"key":"2023081102032995200_btad461-B3","doi-asserted-by":"crossref","first-page":"1400","DOI":"10.1074\/mcp.M114.044305","article-title":"Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues","volume":"14","author":"Bruderer","year":"2015","journal-title":"Mol Cell Proteomics"},{"key":"2023081102032995200_btad461-B4","doi-asserted-by":"crossref","first-page":"2524","DOI":"10.1093\/bioinformatics\/btu305","article-title":"MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments","volume":"30","author":"Choi","year":"2014","journal-title":"Bioinformatics"},{"key":"2023081102032995200_btad461-B5","doi-asserted-by":"crossref","first-page":"1367","DOI":"10.1038\/nbt.1511","article-title":"MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification","volume":"26","author":"Cox","year":"2008","journal-title":"Nat Biotechnol"},{"key":"2023081102032995200_btad461-B6","doi-asserted-by":"crossref","first-page":"869","DOI":"10.1038\/s41592-020-0912-y","article-title":"Philosopher: a versatile toolkit for shotgun proteomics data analysis","volume":"17","author":"da Veiga Leprevost","year":"2020","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B7","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/s41592-019-0638-x","article-title":"DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput","volume":"17","author":"Demichev","year":"2020","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B8","doi-asserted-by":"crossref","first-page":"1150","DOI":"10.1002\/pmic.200900375","article-title":"A guided tour of the trans-proteomic pipeline","volume":"10","author":"Deutsch","year":"2010","journal-title":"Proteomics"},{"key":"2023081102032995200_btad461-B9","doi-asserted-by":"crossref","first-page":"2104","DOI":"10.1021\/acs.jproteome.2c00145","article-title":"Protein contaminants matter: building universal protein contaminant libraries for DDA and DIA proteomics","volume":"21","author":"Frankenfield","year":"2022","journal-title":"J Proteome Res"},{"key":"2023081102032995200_btad461-B10","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1007\/s13361-012-0516-6","article-title":"Pyteomics\u2014a Python framework for exploratory data analysis and rapid software prototyping in proteomics","volume":"24","author":"Goloborodko","year":"2013","journal-title":"J Am Soc Mass Spectrom"},{"key":"2023081102032995200_btad461-B11","doi-asserted-by":"crossref","first-page":"2765","DOI":"10.1074\/mcp.O113.036681","article-title":"The mzTab data exchange format: communicating mass-spectromtry-based proteomics and metabolomics experimental results to a wider audience","volume":"13","author":"Griss","year":"2014","journal-title":"Mol Cell Proteomics"},{"key":"2023081102032995200_btad461-B12","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","article-title":"Array programming with NumPy","volume":"585","author":"Harris","year":"2020","journal-title":"Nature"},{"key":"2023081102032995200_btad461-B13","doi-asserted-by":"crossref","first-page":"vbac004","DOI":"10.1093\/bioadv\/vbac004","article-title":"MSPypeline: a python package for streamlined data analysis of mass spectrometry-based proteomics","volume":"2","author":"Heming","year":"2022","journal-title":"Bioinform Adv"},{"key":"2023081102032995200_btad461-B14","doi-asserted-by":"crossref","first-page":"1760","DOI":"10.1038\/s41598-021-81279-4","article-title":"A comparative study of evaluating missing value imputation methods in label-free proteomics","volume":"11","author":"Jin","year":"2021","journal-title":"Sci Rep"},{"key":"2023081102032995200_btad461-B15","doi-asserted-by":"crossref","first-page":"492","DOI":"10.1186\/s12859-019-3110-0","article-title":"Random forest-based imputation outperforms other methods for imputing LC-MS metabolomics data: a comparative study","volume":"20","author":"Kokla","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2023081102032995200_btad461-B16","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1038\/nmeth.4256","article-title":"MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry\u2013based proteomics","volume":"14","author":"Kong","year":"2017","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B17","first-page":"709","article-title":"Pyteomics 4.0: five years of development of a Python proteomics framework","volume":"2","author":"Levitsky","year":"2018","journal-title":"J Proteome Res"},{"key":"2023081102032995200_btad461-B18","doi-asserted-by":"crossref","first-page":"966","DOI":"10.1093\/bioinformatics\/btq054","article-title":"Skyline: an open source document editor for creating and analyzing targeted proteomics experiments","volume":"26","author":"MacLean","year":"2010","journal-title":"Bioinformatics"},{"key":"2023081102032995200_btad461-B19","first-page":"56","author":"McKinney","year":"2010"},{"key":"2023081102032995200_btad461-B20","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1021\/acs.jproteome.7b00608","article-title":"Ultrafast peptide label-free quantification with FlashLFQ","volume":"17","author":"Millikin","year":"2018","journal-title":"J Proteome Res"},{"key":"2023081102032995200_btad461-B21","doi-asserted-by":"crossref","first-page":"e8793","DOI":"10.15252\/msb.20188793","article-title":"Plasma proteome profiling discovers novel proteins associated with non-alcoholic fatty liver disease","volume":"15","author":"Niu","year":"2019","journal-title":"Mol Syst Biol"},{"key":"2023081102032995200_btad461-B22","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1016\/j.dib.2015.11.063","article-title":"Spiked proteomic standard dataset for testing label-free quantitative software and statistical methods","volume":"6","author":"Ramus","year":"2016","journal-title":"Data Brief"},{"key":"2023081102032995200_btad461-B23","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1038\/nmeth.3959","article-title":"OpenMS: a flexible open-source software platform for mass spectrometry data analysis","volume":"13","author":"R\u00f6st","year":"2016","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B24","doi-asserted-by":"crossref","first-page":"1003","DOI":"10.1038\/nmeth.3621","article-title":"Avoiding abundance bias in the functional annotation of post-translationally modified proteins","volume":"12","author":"Sch\u00f6lz","year":"2015","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B25","author":"Strauss","year":"2021"},{"key":"2023081102032995200_btad461-B26","doi-asserted-by":"crossref","first-page":"498","DOI":"10.1021\/acs.jproteome.0c00544","article-title":"Fast deisotoping algorithm and its implementation in the MSFragger search engine","volume":"20","author":"Teo","year":"2021","journal-title":"J Proteome Res"},{"key":"2023081102032995200_btad461-B27","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023081102032995200_btad461-B28","doi-asserted-by":"crossref","first-page":"731","DOI":"10.1038\/nmeth.3901","article-title":"The Perseus computational platform for comprehensive analysis of (prote)omics data","volume":"13","author":"Tyanova","year":"2016","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B29","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1038\/s41592-019-0686-2","article-title":"SciPy 1.0: fundamental algorithms for scientific computing in Python","volume":"17","author":"Virtanen","year":"2020","journal-title":"Nat Methods"},{"key":"2023081102032995200_btad461-B30","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1093\/bioinformatics\/btab674","article-title":"AlphaMap: an open-source Python package for the visual annotation of proteomics data with sequence-specific knowledge","volume":"38","author":"Voytik","year":"2022","journal-title":"Bioinformatics"},{"key":"2023081102032995200_btad461-B31","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2023081102032995200_btad461-B32","doi-asserted-by":"crossref","first-page":"100077","DOI":"10.1016\/j.mcpro.2021.100077","article-title":"IonQuant enables accurate and sensitive Label-Free quantification with FDR-controlled match-between-runs","volume":"20","author":"Yu","year":"2021","journal-title":"Mol Cell Proteomics"},{"key":"2023081102032995200_btad461-B33","doi-asserted-by":"crossref","first-page":"1575","DOI":"10.1074\/mcp.TIR120.002048","article-title":"Fast quantitative analysis of timsTOF PASEF data with MSFragger and IonQuant","volume":"19","author":"Yu","year":"2020","journal-title":"Mol Cell Proteomics"},{"key":"2023081102032995200_btad461-B34","doi-asserted-by":"crossref","first-page":"7238","DOI":"10.1038\/s41467-022-34904-3","article-title":"AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics","volume":"13","author":"Zeng","year":"2022","journal-title":"Nat Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad461\/51016571\/btad461.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad461\/51092061\/btad461.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad461\/51092061\/btad461.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T02:03:53Z","timestamp":1691719433000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad461\/7234607"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,8,1]]},"references-count":34,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2023,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad461","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,8,1]]},"published":{"date-parts":[[2023,8,1]]},"article-number":"btad461"}}