{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T07:22:57Z","timestamp":1775892177084,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010603","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,11,2]],"date-time":"2022-11-02T00:00:00Z","timestamp":1667347200000}}],"reference-count":56,"publisher":"Public Library of Science (PLoS)","issue":"10","license":[{"start":{"date-parts":[[2022,10,21]],"date-time":"2022-10-21T00:00:00Z","timestamp":1666310400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000092","name":"U.S. National Library of Medicine","doi-asserted-by":"publisher","award":["R15LM013460"],"award-info":[{"award-number":["R15LM013460"]}],"id":[{"id":"10.13039\/100000092","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100008460","name":"National Center for Complementary and Integrative Health","doi-asserted-by":"publisher","award":["R01AT011618"],"award-info":[{"award-number":["R01AT011618"]}],"id":[{"id":"10.13039\/100008460","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000050","name":"National Heart, Lung, and Blood Institute","doi-asserted-by":"publisher","award":["R01HL134828"],"award-info":[{"award-number":["R01HL134828"]}],"id":[{"id":"10.13039\/100000050","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Metaproteomics based on high-throughput tandem mass spectrometry (MS\/MS) plays a crucial role in characterizing microbiome functions. The acquired MS\/MS data is searched against a protein sequence database to identify peptides, which are then used to infer a list of proteins present in a metaproteome sample. 
While the problem of protein inference has been well-studied for proteomics of single organisms, it remains a major challenge for metaproteomics of complex microbial communities because of the large number of degenerate peptides shared among homologous proteins in different organisms. This challenge calls for improved discrimination of true protein identifications from false protein identifications given a set of unique and degenerate peptides identified in metaproteomics. MetaLP was developed here for protein inference in metaproteomics using an integrative linear programming method. Taxonomic abundance information extracted from metagenomics shotgun sequencing or 16S rRNA gene amplicon sequencing was incorporated as prior information in MetaLP. Benchmarking with mock, human gut, soil, and marine microbial communities demonstrated significantly higher numbers of protein identifications by MetaLP than ProteinLP, PeptideProphet, DeepPep, PIPQ, and Sipros Ensemble. In conclusion, MetaLP could substantially improve protein inference for complex metaproteomes by incorporating taxonomic abundance information in a linear programming model.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010603","type":"journal-article","created":{"date-parts":[[2022,10,21]],"date-time":"2022-10-21T17:40:37Z","timestamp":1666374037000},"page":"e1010603","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":7,"title":["MetaLP: An integrative linear programming method for protein inference in 
metaproteomics"],"prefix":"10.1371","volume":"18","author":[{"given":"Shichao","family":"Feng","sequence":"first","affiliation":[]},{"given":"Hong-Long","family":"Ji","sequence":"additional","affiliation":[]},{"given":"Huan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Bailu","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Ryan","family":"Sterzenbach","sequence":"additional","affiliation":[]},{"given":"Chongle","family":"Pan","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2777-4482","authenticated-orcid":true,"given":"Xuan","family":"Guo","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,10,21]]},"reference":[{"issue":"3","key":"pcbi.1010603.ref001","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1128\/MMBR.00014-10","article-title":"From structure to function: the ecology of host-associated microbial communities","volume":"74","author":"CJ Robinson","year":"2010","journal-title":"Microbiology and Molecular Biology Reviews"},{"issue":"5","key":"pcbi.1010603.ref002","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1111\/1574-6976.12022","article-title":"Quantifying the metabolic activities of human-associated microbial communities across multiple ecological scales","volume":"37","author":"CF Maurice","year":"2013","journal-title":"FEMS microbiology reviews"},{"issue":"6","key":"pcbi.1010603.ref003","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1038\/s41579-018-0004-5","article-title":"The social network of microorganisms\u2014how auxotrophies shape complex communities","volume":"16","author":"K Zengler","year":"2018","journal-title":"Nature Reviews Microbiology"},{"key":"pcbi.1010603.ref004","doi-asserted-by":"crossref","first-page":"2706","DOI":"10.3389\/fmicb.2019.02706","article-title":"Genome-resolved proteomic stable isotope probing of soil microbial communities using 13CO2 and 
13C-methanol","volume":"10","author":"Z Li","year":"2019","journal-title":"Frontiers in microbiology"},{"issue":"1","key":"pcbi.1010603.ref005","first-page":"1","article-title":"Islet autoantibody seroconversion in type-1 diabetes is associated with metagenome-assembled genomes in infant gut microbiomes","volume":"13","author":"L Zhang","year":"2022","journal-title":"Nature communications"},{"issue":"1","key":"pcbi.1010603.ref006","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40168-016-0176-z","article-title":"MetaPro-IQ: a universal metaproteomic approach to studying human and mouse gut microbiota","volume":"4","author":"X Zhang","year":"2016","journal-title":"Microbiome"},{"issue":"3","key":"pcbi.1010603.ref007","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1038\/s41559-017-0463-5","article-title":"Community proteogenomics reveals the systemic impact of phosphorus availability on microbial functions in tropical soil","volume":"2","author":"Q Yao","year":"2018","journal-title":"Nature ecology & evolution"},{"issue":"20","key":"pcbi.1010603.ref008","doi-asserted-by":"crossref","first-page":"3424","DOI":"10.1002\/pmic.201400571","article-title":"Microbial metaproteomics for characterizing the range of metabolic functions and activities of human gut microbiota","volume":"15","author":"W Xiong","year":"2015","journal-title":"Proteomics"},{"issue":"17","key":"pcbi.1010603.ref009","doi-asserted-by":"crossref","first-page":"4646","DOI":"10.1021\/ac0341261","article-title":"A statistical model for identifying proteins by tandem mass spectrometry","volume":"75","author":"AI Nesvizhskii","year":"2003","journal-title":"Analytical chemistry"},{"issue":"22","key":"pcbi.1010603.ref010","doi-asserted-by":"crossref","first-page":"2956","DOI":"10.1093\/bioinformatics\/bts540","article-title":"A linear programming model for protein inference problem in shotgun proteomics","volume":"28","author":"T 
Huang","year":"2012","journal-title":"Bioinformatics"},{"issue":"8","key":"pcbi.1010603.ref011","doi-asserted-by":"crossref","first-page":"1183","DOI":"10.1089\/cmb.2009.0018","article-title":"A Bayesian approach to protein inference problem in shotgun proteomics","volume":"16","author":"YF Li","year":"2009","journal-title":"Journal of Computational Biology"},{"issue":"10","key":"pcbi.1010603.ref012","doi-asserted-by":"crossref","first-page":"5346","DOI":"10.1021\/pr100594k","article-title":"Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data","volume":"9","author":"O Serang","year":"2010","journal-title":"Journal of proteome research"},{"issue":"3","key":"pcbi.1010603.ref013","doi-asserted-by":"crossref","first-page":"e91507","DOI":"10.1371\/journal.pone.0091507","article-title":"The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS\/MS protein inference","volume":"9","author":"O Serang","year":"2014","journal-title":"PloS one"},{"issue":"3","key":"pcbi.1010603.ref014","doi-asserted-by":"crossref","first-page":"1060","DOI":"10.1021\/acs.jproteome.9b00566","article-title":"EPIFANY: A Method for Efficient High-Confidence Protein Inference","volume":"19","author":"J Pfeuffer","year":"2020","journal-title":"Journal of proteome research"},{"key":"pcbi.1010603.ref015","doi-asserted-by":"crossref","first-page":"36166","DOI":"10.1109\/ACCESS.2022.3163257","article-title":"LINA: A linearizing neural network architecture for accurate first-order and second-order interpretations","volume":"10","author":"A Badr\u00e9","year":"2022","journal-title":"IEEE Access"},{"key":"pcbi.1010603.ref016","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.compbiolchem.2015.02.009","article-title":"BagReg: Protein inference through machine learning","volume":"57","author":"C Zhao","year":"2015","journal-title":"Computational biology and 
chemistry"},{"issue":"9","key":"pcbi.1010603.ref017","doi-asserted-by":"crossref","first-page":"e1005661","DOI":"10.1371\/journal.pcbi.1005661","article-title":"DeepPep: Deep proteome inference from peptide profiles","volume":"13","author":"M Kim","year":"2017","journal-title":"PLoS computational biology"},{"issue":"11","key":"pcbi.1010603.ref018","doi-asserted-by":"crossref","first-page":"1397","DOI":"10.1093\/bioinformatics\/btp168","article-title":"Integrating shotgun proteomics and mRNA expression data to improve protein identification","volume":"25","author":"SR Ramakrishnan","year":"2009","journal-title":"Bioinformatics"},{"issue":"22","key":"pcbi.1010603.ref019","doi-asserted-by":"crossref","first-page":"2955","DOI":"10.1093\/bioinformatics\/btp461","article-title":"Mining gene functional networks to improve mass-spectrometry-based protein identification","volume":"25","author":"SR Ramakrishnan","year":"2009","journal-title":"Bioinformatics"},{"issue":"1","key":"pcbi.1010603.ref020","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1038\/msb.2009.54","article-title":"Network-assisted protein identification and data interpretation in shotgun proteomics","volume":"5","author":"J Li","year":"2009","journal-title":"Molecular systems biology"},{"issue":"6","key":"pcbi.1010603.ref021","doi-asserted-by":"crossref","first-page":"1399","DOI":"10.1109\/TCBB.2016.2601618","article-title":"Protein inference from the integration of tandem ms data and interactome networks","volume":"14","author":"J Zhong","year":"2016","journal-title":"IEEE\/ACM transactions on computational biology and bioinformatics"},{"key":"pcbi.1010603.ref022","unstructured":"Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual; 2021. Available from: https:\/\/www.gurobi.com."},{"key":"pcbi.1010603.ref023","unstructured":"Achterberg T. What\u2019s new in Gurobi 9.0. Webinar Talk url: https:\/\/www.gurobi.com\/wp-content\/uploads\/2019\/12\/Gurobi-90-Overview-Webinar-Slides-1.pdf. 
2019."},{"key":"pcbi.1010603.ref024","unstructured":"Bushnell B. BBMap: a fast, accurate, splice-aware aligner. Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States); 2014."},{"issue":"5","key":"pcbi.1010603.ref025","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1101\/gr.213959.116","article-title":"metaSPAdes: a new versatile metagenomic assembler","volume":"27","author":"S Nurk","year":"2017","journal-title":"Genome research"},{"key":"pcbi.1010603.ref026","doi-asserted-by":"crossref","first-page":"e7359","DOI":"10.7717\/peerj.7359","article-title":"MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies","volume":"7","author":"DD Kang","year":"2019","journal-title":"PeerJ"},{"issue":"1","key":"pcbi.1010603.ref027","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40168-018-0541-1","article-title":"MetaWRAP\u2014a flexible pipeline for genome-resolved metagenomic data analysis","volume":"6","author":"GV Uritskiy","year":"2018","journal-title":"Microbiome"},{"issue":"4","key":"pcbi.1010603.ref028","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with Bowtie 2","volume":"9","author":"B Langmead","year":"2012","journal-title":"Nature methods"},{"key":"pcbi.1010603.ref029","doi-asserted-by":"crossref","first-page":"e2584","DOI":"10.7717\/peerj.2584","article-title":"VSEARCH: a versatile open source tool for metagenomics","volume":"4","author":"T Rognes","year":"2016","journal-title":"PeerJ"},{"issue":"D1","key":"pcbi.1010603.ref030","doi-asserted-by":"crossref","first-page":"D633","DOI":"10.1093\/nar\/gkt1244","article-title":"Ribosomal Database Project: data and tools for high throughput rRNA analysis","volume":"42","author":"JR Cole","year":"2014","journal-title":"Nucleic acids 
research"},{"issue":"10","key":"pcbi.1010603.ref031","doi-asserted-by":"crossref","first-page":"918","DOI":"10.1038\/nbt.2377","article-title":"A cross-platform toolkit for mass spectrometry and proteomics","volume":"30","author":"MC Chambers","year":"2012","journal-title":"Nature biotechnology"},{"issue":"1","key":"pcbi.1010603.ref032","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1002\/pmic.201200439","article-title":"Comet: an open-source MS\/MS sequence database search tool","volume":"13","author":"JK Eng","year":"2013","journal-title":"Proteomics"},{"issue":"5","key":"pcbi.1010603.ref033","doi-asserted-by":"crossref","first-page":"795","DOI":"10.1093\/bioinformatics\/btx601","article-title":"Sipros ensemble improves database searching and filtering for complex metaproteomics","volume":"34","author":"X Guo","year":"2018","journal-title":"Bioinformatics"},{"issue":"16","key":"pcbi.1010603.ref034","doi-asserted-by":"crossref","first-page":"2064","DOI":"10.1093\/bioinformatics\/btt329","article-title":"Sipros\/ProRata: a versatile informatics system for quantitative community proteomics","volume":"29","author":"Y Wang","year":"2013","journal-title":"Bioinformatics"},{"issue":"20","key":"pcbi.1010603.ref035","doi-asserted-by":"crossref","first-page":"5383","DOI":"10.1021\/ac025747h","article-title":"Empirical statistical model to estimate the accuracy of peptide identifications made by MS\/MS and database search","volume":"74","author":"A Keller","year":"2002","journal-title":"Analytical chemistry"},{"key":"pcbi.1010603.ref036","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.compbiolchem.2016.02.006","article-title":"Protein inference: A protein quantification perspective","volume":"63","author":"Z He","year":"2016","journal-title":"Computational biology and chemistry"},{"issue":"1","key":"pcbi.1010603.ref037","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-017-01544-x","article-title":"Assessing species biomass 
contributions in microbial communities via metaproteomics","volume":"8","author":"M Kleiner","year":"2017","journal-title":"Nature communications"},{"issue":"2","key":"pcbi.1010603.ref038","doi-asserted-by":"crossref","first-page":"e00027","DOI":"10.1128\/mSystems.00027-15","article-title":"Proteomic stable isotope probing reveals taxonomically distinct patterns in amino acid assimilation by coastal marine bacterioplankton","volume":"1","author":"S Bryson","year":"2016","journal-title":"Msystems"},{"key":"pcbi.1010603.ref039","doi-asserted-by":"crossref","first-page":"e2687","DOI":"10.7717\/peerj.2687","article-title":"Proteogenomic analyses indicate bacterial methylotrophy and archaeal heterotrophy are prevalent below the grass root zone","volume":"4","author":"CN Butterfield","year":"2016","journal-title":"PeerJ"},{"issue":"1","key":"pcbi.1010603.ref040","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.cell.2019.08.011","article-title":"Interspecies competition impacts targeted manipulation of human gut bacteria by fiber-derived glycans","volume":"179","author":"ML Patnode","year":"2019","journal-title":"Cell"},{"issue":"3","key":"pcbi.1010603.ref041","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1038\/85686","article-title":"Large-scale analysis of the yeast proteome by multidimensional protein identification technology","volume":"19","author":"MP Washburn","year":"2001","journal-title":"Nature biotechnology"},{"issue":"3","key":"pcbi.1010603.ref042","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/nmeth1019","article-title":"Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry","volume":"4","author":"JE Elias","year":"2007","journal-title":"Nature methods"},{"issue":"9","key":"pcbi.1010603.ref043","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1074\/mcp.M114.046995","article-title":"A Scalable Approach for Protein False Discovery Rate Estimation 
in Large Proteomic Data Sets [S]","volume":"14","author":"MM Savitski","year":"2015","journal-title":"Molecular & Cellular Proteomics"},{"issue":"suppl_1","key":"pcbi.1010603.ref044","doi-asserted-by":"crossref","first-page":"D13","DOI":"10.1093\/nar\/gkm1000","article-title":"Database resources of the national center for biotechnology information","volume":"36","author":"DL Wheeler","year":"2007","journal-title":"Nucleic acids research"},{"issue":"1","key":"pcbi.1010603.ref045","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"C Camacho","year":"2009","journal-title":"BMC bioinformatics"},{"issue":"D1","key":"pcbi.1010603.ref046","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","author":"U Consortium","year":"2019","journal-title":"Nucleic acids research"},{"issue":"1","key":"pcbi.1010603.ref047","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"M Kanehisa","year":"2000","journal-title":"Nucleic acids research"},{"key":"pcbi.1010603.ref048","doi-asserted-by":"crossref","first-page":"117851","DOI":"10.1016\/j.envpol.2021.117851","article-title":"Ecological network analysis reveals distinctive microbial modules associated with heavy metal contamination of abandoned mine soils in Korea","volume":"289","author":"SJ Chun","year":"2021","journal-title":"Environmental Pollution"},{"key":"pcbi.1010603.ref049","doi-asserted-by":"crossref","unstructured":"Saranraj P, Sivasakthivelan P, Al-Tawaha A, Sudha A, Al-Tawaha A, Sirajuddin S, et al. Diversity and evolution of Bradyrhizobium communities relating to Soybean cultivation: A review. In: IOP Conference Series: Earth and Environmental Science. vol. 788. IOP Publishing; 2021. p. 
012208.","DOI":"10.1088\/1755-1315\/788\/1\/012208"},{"key":"pcbi.1010603.ref050","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1016\/j.scitotenv.2019.01.052","article-title":"The complex interactions between novel DEHP-metabolising bacteria and the microbes in agricultural soils","volume":"660","author":"M Song","year":"2019","journal-title":"Science of the Total Environment"},{"key":"pcbi.1010603.ref051","doi-asserted-by":"crossref","first-page":"11","DOI":"10.3389\/fcimb.2017.00011","article-title":"Oral multiple sclerosis drugs inhibit the in vitro growth of epsilon toxin producing gut bacterium, Clostridium perfringens","volume":"7","author":"KR Rumah","year":"2017","journal-title":"Frontiers in cellular and infection microbiology"},{"key":"pcbi.1010603.ref052","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1016\/j.jprot.2017.05.015","article-title":"Mucin-and carbohydrate-stimulated adhesion and subproteome changes of the probiotic bacterium Lactobacillus acidophilus NCFM","volume":"163","author":"HU Celebioglu","year":"2017","journal-title":"Journal of proteomics"},{"issue":"4","key":"pcbi.1010603.ref053","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1111\/mmi.14445","article-title":"Fatty acid activation and utilization by Alistipes finegoldii, a representative Bacteroidetes resident of the human gut microbiome","volume":"113","author":"CD Radka","year":"2020","journal-title":"Molecular microbiology"},{"key":"pcbi.1010603.ref054","doi-asserted-by":"crossref","first-page":"117874","DOI":"10.1016\/j.carbpol.2021.117874","article-title":"Extraction, characterization of aloe polysaccharides and the in-depth analysis of its prebiotic effects on mice gut microbiota","volume":"261","author":"C Liu","year":"2021","journal-title":"Carbohydrate Polymers"},{"key":"pcbi.1010603.ref055","doi-asserted-by":"crossref","first-page":"76","DOI":"10.3389\/fvets.2020.00076","article-title":"A novel thioredoxin-dependent peroxiredoxin (TPx-Q) 
plays an important role in defense against oxidative stress and is a possible drug target in Babesia microti","volume":"7","author":"H Zhang","year":"2020","journal-title":"Frontiers in Veterinary Science"},{"issue":"27","key":"pcbi.1010603.ref056","doi-asserted-by":"crossref","first-page":"12101","DOI":"10.1073\/pnas.0907654107","article-title":"Protein and gene model inference based on statistical modeling in k-partite graphs","volume":"107","author":"S Gerster","year":"2010","journal-title":"Proceedings of the national academy of sciences"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010603","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,11,2]],"date-time":"2022-11-02T00:00:00Z","timestamp":1667347200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010603","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,2]],"date-time":"2022-11-02T18:05:52Z","timestamp":1667412352000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010603"}},"subtitle":[],"editor":[{"given":"Jacquelyn S.","family":"Fetrow","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,10,21]]},"references-count":56,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2022,10,21]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010603","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1010603","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,21]]}}}