{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:47:14Z","timestamp":1760147234337,"version":"build-2065373602"},"reference-count":33,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2023,1,16]],"date-time":"2023-01-16T00:00:00Z","timestamp":1673827200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012190","name":"Ministry of Science and Higher Education of the Russian Federation","doi-asserted-by":"publisher","award":["0714-2020-0006","075-00337-20-02"],"award-info":[{"award-number":["0714-2020-0006","075-00337-20-02"]}],"id":[{"id":"10.13039\/501100012190","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>The automatic processing of high-dimensional mass spectrometry data is required for the clinical implementation of ambient ionization molecular profiling methods. However, complex algorithms required for the analysis of peak-rich spectra are sensitive to the quality of the input data. Therefore, an objective and quantitative indicator, insensitive to the conditions of the experiment, is currently in high demand for the automated treatment of mass spectrometric data. In this work, we demonstrate the utility of the Shapley value as an indicator of the quality of the individual mass spectrum in the classification task for human brain tumor tissue discrimination. The Shapley values are calculated on the training set of glioblastoma and nontumor pathological tissues spectra and used as feedback to create a random forest regression model to estimate the contributions for all spectra of each specimen. As a result, it is shown that the implementation of Shapley values significantly accelerates the data analysis of negative mode mass spectrometry data alongside simultaneous improving the regression models\u2019 accuracy.<\/jats:p>","DOI":"10.3390\/data8010021","type":"journal-article","created":{"date-parts":[[2023,1,16]],"date-time":"2023-01-16T04:31:32Z","timestamp":1673843492000},"page":"21","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Shapley Value as a Quality Control for Mass Spectra of Human Glioblastoma Tissues"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5469-216X","authenticated-orcid":false,"given":"Denis S.","family":"Zavorotnyuk","sequence":"first","affiliation":[{"name":"The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia"}]},{"given":"Anatoly A.","family":"Sorokin","sequence":"additional","affiliation":[{"name":"The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9622-3457","authenticated-orcid":false,"given":"Stanislav I.","family":"Pekov","sequence":"additional","affiliation":[{"name":"Skolkovo Institute of Science and Technology, 121205 Moscow, Russia"},{"name":"Siberian State Medical University, 634050 Tomsk, Russia"}]},{"given":"Denis S.","family":"Bormotov","sequence":"additional","affiliation":[{"name":"The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia"}]},{"given":"Vasiliy A.","family":"Eliferov","sequence":"additional","affiliation":[{"name":"The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia"}]},{"given":"Konstantin V.","family":"Bocharov","sequence":"additional","affiliation":[{"name":"V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Science, 119334 Moscow, Russia"}]},{"given":"Eugene N.","family":"Nikolaev","sequence":"additional","affiliation":[{"name":"Skolkovo Institute of Science and Technology, 121205 Moscow, Russia"}]},{"given":"Igor A.","family":"Popov","sequence":"additional","affiliation":[{"name":"The Moscow Institute of Physics and Technology, National Research University, 141701 Dolgoprudny, Russia"}]}],"member":"1968","published-online":{"date-parts":[[2023,1,16]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"S0060","DOI":"10.5702\/massspectrometry.S0060","article-title":"Clinical Application of Ambient Ionization Mass Spectrometry","volume":"6","author":"Li","year":"2017","journal-title":"Mass Spectrom."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1002\/ansa.202100067","article-title":"Applications of Ambient Ionization Mass Spectrometry in 2021: An Annual Review","volume":"3","author":"Reynolds","year":"2022","journal-title":"Anal. Sci. Adv."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Pekov, S.I., Zhvansky, E.S., Eliferov, V.A., Sorokin, A.A., Ivanov, D.G., Nikolaev, E.N., and Popov, I.A. (2022). Determination of Brain Tissue Samples Storage Conditions for Reproducible Intraoperative Lipid Profiling. Molecules, 27.","DOI":"10.3390\/molecules27082587"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2913","DOI":"10.1007\/s00216-021-03220-y","article-title":"Rapid Estimation of Tumor Cell Percentage in Brain Tissue Biopsy Samples Using Inline Cartridge Extraction Mass Spectrometry","volume":"413","author":"Pekov","year":"2021","journal-title":"Anal. Bioanal. Chem."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"632","DOI":"10.1002\/bjs.11613","article-title":"Breast Cancer Diagnosis Based on Lipid Profiling by Probe Electrospray Ionization Mass Spectrometry","volume":"107","author":"Iwano","year":"2020","journal-title":"Br. J. Surg."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Giordano, S., Siciliano, A.M., Donadon, M., Soldani, C., Franceschini, B., Lleo, A., di Tommaso, L., Cimino, M., Torzilli, G., and Saiki, H. (2022). Versatile Mass Spectrometry-Based Intraoperative Diagnosis of Liver Tumor in a Multiethnic Cohort. Appl. Sci., 12.","DOI":"10.3390\/app12094244"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"4058","DOI":"10.1039\/C7AN01334E","article-title":"Analysis of Human Gliomas by Swab Touch Spray-Mass Spectrometry: Applications to Intraoperative Assessment of Surgical Margins and Presence of Oncometabolites","volume":"142","author":"Pirro","year":"2017","journal-title":"Analyst"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Shamraeva, M.A., Bormotov, D.S., Shamarina, E.V., Bocharov, K.V., Peregudova, O.V., Pekov, S.I., Nikolaev, E.N., and Popov, I.A. (2022). Spherical Sampler Probes Enhance the Robustness of Ambient Ionization Mass Spectrometry for Rapid Drugs Screening. Molecules, 27.","DOI":"10.3390\/molecules27030945"},{"key":"ref_9","first-page":"3","article-title":"Ambient Ionization Mass Spectrometry Applied to New Psychoactive Substance Analysis","volume":"42","author":"Cowan","year":"2021","journal-title":"Mass. Spectrom. Rev."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"827360","DOI":"10.3389\/froh.2022.827360","article-title":"Mass Spectrometry-Based Differentiation of Oral Tongue Squamous Cell Carcinoma and Nontumor Regions With the SpiderMass Technology","volume":"3","author":"Ogrinc","year":"2022","journal-title":"Front. Oral Health"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"e2104411118","DOI":"10.1073\/pnas.2104411118","article-title":"Rapid Diagnosis and Tumor Margin Assessment during Pancreatic Cancer Surgery with the MasSpec Pen Technology","volume":"118","author":"King","year":"2021","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"9338","DOI":"10.1021\/acs.analchem.0c01660","article-title":"Single-Cell Classification Using Mass Spectrometry through Interpretable Machine Learning","volume":"92","author":"Xie","year":"2020","journal-title":"Anal. Chem."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"14590","DOI":"10.1021\/jacs.2c03631","article-title":"Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning","volume":"144","author":"Boiko","year":"2022","journal-title":"J. Am. Chem. Soc."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1746","DOI":"10.1039\/D1SC05171G","article-title":"LAP-MALDI MS Coupled with Machine Learning: An Ambient Mass Spectrometry Approach for High-Throughput Diagnostics","volume":"13","author":"Piras","year":"2022","journal-title":"Chem. Sci."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liebal, U.W., Phan, A.N.T., Sudhakar, M., Raman, K., and Blank, L.M. (2020). Machine Learning Applications for Mass Spectrometry-Based Metabolomics. Metabolites, 10.","DOI":"10.3390\/metabo10060243"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zavorotnyuk, D.S., Pekov, S.I., Sorokin, A.A., Bormotov, D.S., Levin, N., Zhvansky, E., Semenov, S., Strelnikova, P., Bocharov, K.V., and Vorobiev, A. (2021). Lipid Profiles of Human Brain Tumors Obtained by High-Resolution Negative Mode Ambient Mass Spectrometry. Data, 6.","DOI":"10.3390\/data6120132"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"18960","DOI":"10.1038\/s41598-019-55597-7","article-title":"Inline Cartridge Extraction for Rapid Brain Tumor Tissue Identification by Molecular Profiling","volume":"9","author":"Pekov","year":"2019","journal-title":"Sci. Rep."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Thomas, S.A., Race, A.M., Steven, R.T., Gilmore, I.S., and Bunch, J. (2016). Dimensionality Reduction of Mass Spectrometry Imaging Data Using Autoencoders. 2016 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE.","DOI":"10.1109\/SSCI.2016.7849863"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"A0094","DOI":"10.5702\/massspectrometry.A0094","article-title":"Comparison of Dimensionality Reduction Methods in Mass Spectra of Astrocytoma and Glioblastoma Tissues","volume":"10","author":"Zhvansky","year":"2021","journal-title":"Mass Spectrom."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"e4640","DOI":"10.1002\/jms.4640","article-title":"Assessment of Variation of Inline Cartridge Extraction Mass Spectra","volume":"56","author":"Zhvansky","year":"2021","journal-title":"J. Mass Spectrom."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"914","DOI":"10.1038\/s41598-018-37560-0","article-title":"Metrics for Evaluating the Stability and Reproducibility of Mass Spectra","volume":"9","author":"Zhvansky","year":"2019","journal-title":"Sci. Rep."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2270","DOI":"10.1093\/bioinformatics\/bts447","article-title":"MALDIquant: A Versatile R Package for the Analysis of Mass Spectrometry Data","volume":"28","author":"Gibb","year":"2012","journal-title":"Bioinformatics"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Pluskal, T., Castillo, S., Villar-Briones, A., and Ore\u0161i\u010d, M. (2010). MZmine 2: Modular Framework for Processing, Visualizing, and Analyzing Mass Spectrometry-Based Molecular Profile Data. BMC Bioinform., 11.","DOI":"10.1186\/1471-2105-11-395"},{"key":"ref_24","unstructured":"Koh, P.W., and Liang, P. (2017, January 6\u201311). Understanding Black-Box Predictions via Influence Functions. Proceedings of the 34th International Conference on Machine Learning, Sydney, NSW, Australia."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","article-title":"Prediction Error Estimation: A Comparison of Resampling Methods","volume":"21","author":"Molinaro","year":"2005","journal-title":"Bioinformatics"},{"key":"ref_26","unstructured":"Ghorbani, A., and Zou, J. (2019, January 9\u201315). Data Shapley: Equitable Valuation of Data for Machine Learning. Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA."},{"key":"ref_27","first-page":"307","article-title":"A value for n-person games","volume":"2","author":"Shapley","year":"1953","journal-title":"Contrib. Theory Games"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1521","DOI":"10.2174\/1568026619666190729154543","article-title":"Untangling the Metabolic Reprogramming in Brain Cancer: Discovering Key Molecular Players Using Mass Spectrometry","volume":"19","author":"Sorokin","year":"2019","journal-title":"Curr. Top. Med. Chem."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1134\/S1990750821030070","article-title":"Analysis of Phosphatidylcholines Alterations in Human Glioblastomas Ex Vivo","volume":"15","author":"Pekov","year":"2021","journal-title":"Biochem. Moscow Suppl. Ser. B Biomed. Chem."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v033.i01","article-title":"Regularization Paths for Generalized Linear Models via Coordinate Descent","volume":"33","author":"Friedman","year":"2010","journal-title":"J. Stat. Softw."},{"key":"ref_31","unstructured":"Microsoft Corporation and Steve Weston (2022, July 10). doParallel: Foreach Parallel Adaptor for the \u2018parallel\u2019 Package. R package version 1.0.17. Available online: https:\/\/CRAN.R-project.org\/package=doParallel."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v028.i05","article-title":"Building Predictive Models in R Using the Caret Package","volume":"28","author":"Kuhn","year":"2008","journal-title":"J. Stat. Softw."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Wickham, H. (2016). ggplot2: Elegant Graphics for Data Analysis, Springer.","DOI":"10.1007\/978-3-319-24277-4_9"}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/1\/21\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:07:15Z","timestamp":1760119635000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/1\/21"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,16]]},"references-count":33,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["data8010021"],"URL":"https:\/\/doi.org\/10.3390\/data8010021","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2023,1,16]]}}}