{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T01:41:28Z","timestamp":1769737288736,"version":"3.49.0"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2024,4,18]],"date-time":"2024-04-18T00:00:00Z","timestamp":1713398400000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,5,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The clinical translation of mass spectrometry-based proteomics has been challenging due to limited statistical power caused by large technical variability and inter-patient heterogeneity. Bottom-up proteomics provides an indirect measurement of proteins through digested peptides. This raises the question whether peptide measurements can be used directly to better distinguish differentially expressed proteins.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present a novel method called the peptide set test, which detects coordinated changes in the expression of peptides originating from the same protein and compares them to the rest of the peptidome. Applying our method to data from a published spike-in experiment and simulations demonstrates improved sensitivity without compromising precision, compared to aggregation-based approaches. Additionally, applying the peptide set test to compare the tumor proteomes of tamoxifen-sensitive and tamoxifen-resistant breast cancer patients reveals significant alterations in peptide levels of collagen XII, suggesting an association between collagen XII-mediated matrix reassembly and tamoxifen resistance. Our study establishes the peptide set test as a powerful peptide-centric strategy to infer differential expression in proteomics studies.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Peptide set test (PepSetTest) is publicly available at https:\/\/github.com\/JmWangBio\/PepSetTest.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae270","type":"journal-article","created":{"date-parts":[[2024,4,18]],"date-time":"2024-04-18T02:13:30Z","timestamp":1713406410000},"source":"Crossref","is-referenced-by-count":3,"title":["Peptide set test: a peptide-centric strategy to infer differentially expressed proteins"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-3728-9538","authenticated-orcid":false,"given":"Junmin","family":"Wang","sequence":"first","affiliation":[{"name":"Data Sciences and Quantitative Biology, Discovery Sciences, Biopharmaceuticals R&D, AstraZeneca , Gaithersburg, MD 20878, United States"}]},{"given":"Steven","family":"Novick","sequence":"additional","affiliation":[{"name":"Global Statistical Sciences , Eli Lilly, Indianapolis, IN 46285, United States"}]}],"member":"286","published-online":{"date-parts":[[2024,4,17]]},"reference":[{"key":"2024050702562460300_btae270-B1","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1214\/07-AOAS146","article-title":"A statistical framework for testing functional categories in microarray data","volume":"2","author":"Barry","year":"2008","journal-title":"Ann Appl Stat"},{"key":"2024050702562460300_btae270-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v067.i01","article-title":"Fitting linear Mixed-Effects models using lme4","volume":"67","author":"Bates","year":"2015","journal-title":"J Stat Soft"},{"key":"2024050702562460300_btae270-B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J R Statist Soc"},{"key":"2024050702562460300_btae270-B4","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Ann Statist"},{"key":"2024050702562460300_btae270-B5","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1038\/nrm3904","article-title":"Remodelling the extracellular matrix in development and disease","volume":"15","author":"Bonnans","year":"2014","journal-title":"Nat Rev Mol Cell Biol"},{"key":"2024050702562460300_btae270-B6","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1021\/acs.jproteome.1c00028","article-title":"Characterization of citrullination sites in neutrophils and mast cells activated by ionomycin via integration of mass spectrometry and machine learning","volume":"20","author":"Chaerkady","year":"2021","journal-title":"J Proteome Res"},{"key":"2024050702562460300_btae270-B7","doi-asserted-by":"crossref","first-page":"2020081","DOI":"10.1080\/19420862.2021.2020081","article-title":"Mitochondrial membrane potential-enriched CHO host: a novel and powerful tool for improving biomanufacturing capability","volume":"14","author":"Chakrabarti","year":"2022","journal-title":"MAbs"},{"key":"2024050702562460300_btae270-B8","first-page":"239204","article-title":"Proteomics: challenges, techniques and possibilities to overcome biological sample complexity","volume":"2009","author":"Chandramouli","year":"2009","journal-title":"Hum Genomics Proteomics"},{"key":"2024050702562460300_btae270-B9","doi-asserted-by":"crossref","first-page":"2524","DOI":"10.1093\/bioinformatics\/btu305","article-title":"MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments","volume":"30","author":"Choi","year":"2014","journal-title":"Bioinformatics"},{"key":"2024050702562460300_btae270-B10","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1186\/s10194-015-0528-2","article-title":"Specific and number of comorbidities are associated with increased levels of temporomandibular pain intensity and duration","volume":"16","author":"Dahan","year":"2015","journal-title":"J Headache Pain"},{"key":"2024050702562460300_btae270-B11","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.molonc.2015.07.004","article-title":"4-protein signature predicting tamoxifen treatment outcome in recurrent breast cancer","volume":"10","author":"De Marchi","year":"2016","journal-title":"Mol Oncol"},{"key":"2024050702562460300_btae270-B12","doi-asserted-by":"crossref","DOI":"10.1007\/b97673","volume-title":"Design and Analysis of Experiments","author":"Dean","year":"1999"},{"key":"2024050702562460300_btae270-B13","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/s12943-022-01526-8","article-title":"Proteomics technologies for cancer liquid biopsies","volume":"21","author":"Ding","year":"2022","journal-title":"Mol Cancer"},{"key":"2024050702562460300_btae270-B14","author":"Gatto","year":"2021"},{"key":"2024050702562460300_btae270-B15","doi-asserted-by":"crossref","first-page":"00378","DOI":"10.1183\/23120541.00378-2022","article-title":"Multiomics links global surfactant dysregulation with airflow obstruction and emphysema in COPD","volume":"9","author":"Hristova","year":"2023","journal-title":"ERJ Open Res"},{"key":"2024050702562460300_btae270-B16","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1186\/s40635-023-00528-0","article-title":"Serum proteome profiles in patients treated with targeted temperature management after out-of-hospital cardiac arrest","volume":"11","author":"Lileikyte","year":"2023","journal-title":"Intensive Care Med Exp"},{"key":"2024050702562460300_btae270-B17","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.cels.2020.05.012","article-title":"Ultra-high-throughput clinical proteomics reveals classifiers of COVID-19 infection","volume":"11","author":"Messner","year":"2020","journal-title":"Cell Syst"},{"key":"2024050702562460300_btae270-B18","doi-asserted-by":"crossref","first-page":"e0000090","DOI":"10.1371\/journal.pgph.0000090","article-title":"Assessment of independent comorbidities and comorbidity measures in predicting healthcare facility-onset Clostridioides difficile infection in Kenya","volume":"2","author":"Mutai","year":"2022","journal-title":"PLOS Glob Public Health"},{"key":"2024050702562460300_btae270-B19","doi-asserted-by":"crossref","first-page":"4646","DOI":"10.1021\/ac0341261","article-title":"A statistical model for identifying proteins by tandem mass spectrometry","volume":"75","author":"Nesvizhskii","year":"2003","journal-title":"Anal Chem"},{"key":"2024050702562460300_btae270-B20","author":"Neumann"},{"key":"2024050702562460300_btae270-B21","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1002\/ana.25920","article-title":"A stearoyl-coenzyme a desaturase inhibitor prevents multiple Parkinson disease phenotypes in alpha-synuclein mice","volume":"89","author":"Nuber","year":"2021","journal-title":"Ann Neurol"},{"key":"2024050702562460300_btae270-B22","doi-asserted-by":"crossref","first-page":"4587","DOI":"10.1038\/s41467-022-32255-7","article-title":"Temporal profiling of the breast tumour microenvironment reveals collagen XII as a driver of metastasis","volume":"13","author":"Papanicolaou","year":"2022","journal-title":"Nat Commun"},{"key":"2024050702562460300_btae270-B23","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1074\/mcp.M900222-MCP200","article-title":"Interlaboratory study characterizing a yeast performance standard for benchmarking LC-MS platform performance","volume":"9","author":"Paulovich","year":"2010","journal-title":"Mol Cell Proteomics"},{"key":"2024050702562460300_btae270-B24","doi-asserted-by":"crossref","first-page":"2128","DOI":"10.1021\/pr301146m","article-title":"Sources of technical variability in quantitative LC-MS proteomics: human brain tissue sample analysis","volume":"12","author":"Piehowski","year":"2013","journal-title":"J Proteome Res"},{"key":"2024050702562460300_btae270-B25","doi-asserted-by":"crossref","first-page":"2430","DOI":"10.1021\/acsbiomaterials.0c01803","article-title":"Collagen I fibrous substrates modulate the proliferation and secretome of estrogen receptor-positive breast tumor cells in a hormone-restricted microenvironment","volume":"7","author":"Reyes-Ramos","year":"2021","journal-title":"ACS Biomater Sci Eng"},{"key":"2024050702562460300_btae270-B26","doi-asserted-by":"crossref","first-page":"e47","DOI":"10.1093\/nar\/gkv007","article-title":"Limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2024050702562460300_btae270-B27","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1016\/j.immuni.2020.10.008","article-title":"Plasma proteomics identify biomarkers and pathogenesis of COVID-19","volume":"53","author":"Shu","year":"2020","journal-title":"Immunity"},{"key":"2024050702562460300_btae270-B28","doi-asserted-by":"crossref","first-page":"2398","DOI":"10.3390\/cells10092398","article-title":"Inhibition of metabolism as a therapeutic option for Tamoxifen-Resistant breast cancer cells","volume":"10","author":"Steifensand","year":"2021","journal-title":"Cells"},{"key":"2024050702562460300_btae270-B29","doi-asserted-by":"crossref","first-page":"1209","DOI":"10.1074\/mcp.RA119.001624","article-title":"Robust summarization and inference in proteome-wide label-free quantification","volume":"19","author":"Sticker","year":"2020","journal-title":"Mol Cell Proteomics"},{"key":"2024050702562460300_btae270-B30","doi-asserted-by":"crossref","first-page":"3123","DOI":"10.1021\/acs.jproteome.5b00052","article-title":"Quantitative differences in the urinary proteome of siblings discordant for type 1 diabetes include lysosomal enzymes","volume":"14","author":"Suh","year":"2015","journal-title":"J Proteome Res"},{"key":"2024050702562460300_btae270-B31","doi-asserted-by":"crossref","first-page":"2301","DOI":"10.1038\/nprot.2016.136","article-title":"The MaxQuant computational platform for mass spectrometry-based shotgun proteomics","volume":"11","author":"Tyanova","year":"2016","journal-title":"Nat Protoc"},{"key":"2024050702562460300_btae270-B32","doi-asserted-by":"crossref","first-page":"D1063","DOI":"10.1093\/nar\/gks1262","article-title":"The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013","volume":"41","author":"Vizcaino","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2024050702562460300_btae270-B33","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1016\/j.ymben.2021.11.006","article-title":"Elucidating tumor-stromal metabolic crosstalk in colorectal cancer through integration of constraint-based models and LC-MS metabolomics","volume":"69","author":"Wang","year":"2022","journal-title":"Metab Eng"},{"key":"2024050702562460300_btae270-B34","doi-asserted-by":"crossref","first-page":"e0132285","DOI":"10.1371\/journal.pone.0132285","article-title":"Inhibition of aerobic glycolysis represses akt\/mTOR\/HIF-1alpha axis and restores tamoxifen sensitivity in Antiestrogen-Resistant breast cancer cells","volume":"10","author":"Woo","year":"2015","journal-title":"PLoS One"},{"key":"2024050702562460300_btae270-B35","doi-asserted-by":"crossref","first-page":"2176","DOI":"10.1093\/bioinformatics\/btq401","article-title":"ROAST: rotation gene set tests for complex microarray experiments","volume":"26","author":"Wu","year":"2010","journal-title":"Bioinformatics"},{"key":"2024050702562460300_btae270-B36","doi-asserted-by":"crossref","first-page":"e133","DOI":"10.1093\/nar\/gks461","article-title":"Camera: a competitive gene set test accounting for inter-gene correlation","volume":"40","author":"Wu","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2024050702562460300_btae270-B37","first-page":"100141","article-title":"clusterProfiler 4.0: a universal enrichment tool for interpreting omics data","volume":"2","author":"Wu","year":"2021","journal-title":"Innovation (Camb)"},{"key":"2024050702562460300_btae270-B38","doi-asserted-by":"crossref","first-page":"4","DOI":"10.3389\/fmolb.2015.00004","article-title":"A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis","volume":"2","author":"Yang","year":"2015","journal-title":"Front Mol Biosci"},{"key":"2024050702562460300_btae270-B39","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1089\/omi.2011.0118","article-title":"clusterProfiler: an R package for comparing biological themes among gene clusters","volume":"16","author":"Yu","year":"2012","journal-title":"OMICS"},{"key":"2024050702562460300_btae270-B40","doi-asserted-by":"crossref","first-page":"2103","DOI":"10.1021\/pr200636x","article-title":"Statistical considerations of optimal study design for human plasma proteomics and biomarker discovery","volume":"11","author":"Zhou","year":"2012","journal-title":"J Proteome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae270\/57262450\/btae270.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/5\/btae270\/57421184\/btae270.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/5\/btae270\/57421184\/btae270.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,7]],"date-time":"2024-05-07T02:57:03Z","timestamp":1715050623000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae270\/7649320"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,4,17]]},"references-count":40,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,5,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae270","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,5,1]]},"published":{"date-parts":[[2024,4,17]]},"article-number":"btae270"}}