{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T16:22:24Z","timestamp":1770740544222,"version":"3.49.0"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,7,10]],"date-time":"2022-07-10T00:00:00Z","timestamp":1657411200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,10]],"date-time":"2022-07-10T00:00:00Z","timestamp":1657411200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Metabolomics is a primary omics topic, which occupies an important position in both clinical applications and basic researches for metabolic signatures and biomarkers. Unfortunately, the relevant studies are challenged by the batch effect caused by many external factors. In last decade, the technique of deep learning has become a dominant tool in data science, such that one may train a diagnosis network from a known batch and then generalize it to a new batch. However, the batch effect inevitably hinders such efforts, as the two batches under consideration can be highly mismatched.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We propose an end-to-end deep learning framework, for joint batch effect removal and then classification upon metabolomics data. We firstly validate the proposed deep learning framework on a public CyTOF dataset as a simulated experiment. We also visually compare the t-SNE distribution and demonstrate that our method effectively removes the batch effects in latent space. Then, for a private MALDI MS dataset, we have achieved the highest diagnostic accuracy, with about 5.1\u2009~\u20097.9% increase on average over state-of-the-art methods.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Both experiments conclude that our method performs significantly better in classification than conventional methods benefitting from the effective removal of batch effect.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-04758-z","type":"journal-article","created":{"date-parts":[[2022,7,11]],"date-time":"2022-07-11T14:02:44Z","timestamp":1657548164000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Joint deep learning for batch effect removal and classification toward MALDI MS based metabolomics"],"prefix":"10.1186","volume":"23","author":[{"given":"Jingyang","family":"Niu","sequence":"first","affiliation":[]},{"given":"Jing","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Yuyu","family":"Guo","sequence":"additional","affiliation":[]},{"given":"Kun","family":"Qian","sequence":"additional","affiliation":[]},{"given":"Qian","family":"Wang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,7,10]]},"reference":[{"issue":"7216","key":"4758_CR1","doi-asserted-by":"publisher","first-page":"1054","DOI":"10.1038\/4551054a","volume":"455","author":"JK Nicholson","year":"2008","unstructured":"Nicholson JK, Lindon JC. Metabonomics. Nature. 2008;455(7216):1054\u20136.","journal-title":"Nature"},{"issue":"6163","key":"4758_CR2","doi-asserted-by":"publisher","first-page":"1243259","DOI":"10.1126\/science.1243259","volume":"342","author":"R Zenobi","year":"2013","unstructured":"Zenobi R. Single-cell metabolomics: analytical and biological perspectives. Science. 2013;342(6163):1243259.","journal-title":"Science"},{"issue":"1","key":"4758_CR3","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ac9507956","volume":"68","author":"SL Cohen","year":"1996","unstructured":"Cohen SL, Chait BT. Influence of matrix solution conditions on the MALDI-MS analysis of peptides and proteins. Anal Chem. 1996;68(1):31\u20137.","journal-title":"Anal Chem"},{"key":"4758_CR4","doi-asserted-by":"publisher","first-page":"2367","DOI":"10.3389\/fimmu.2019.02367","volume":"10","author":"RP Schuyler","year":"2019","unstructured":"Schuyler RP, Jackson C, Garcia-Perez JE, Baxter RM, Ogolla S, Rochford R, Ghosh D, Rudra P, Hsieh EWY. Minimizing batch effects in mass cytometry data. Front Immunol. 2019;10:2367.","journal-title":"Front Immunol"},{"key":"4758_CR5","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1007\/978-1-4939-3684-7_2","volume-title":"Natural killer cells: methods and protocols","author":"AW Kay","year":"2016","unstructured":"Kay AW, Strauss-Albee DM, Blish CA. Application of mass cytometry (CyTOF) for functional and phenotypic analysis of natural killer cells. In: Somanchi SS, editor. Natural killer cells: methods and protocols. Springer: New York; 2016. p. 13\u201326."},{"key":"4758_CR6","doi-asserted-by":"crossref","unstructured":"Yamada R, Okada D, Wang J, Basak T, Koyama S: Interpretation of omics data analyses. J Hum Genet 2020.","DOI":"10.1038\/s10038-020-0763-5"},{"key":"4758_CR7","unstructured":"Md. Mohaiminul Islam YWaPH: Deep learning models for predicting phenotypic traits and diseases from omics data. Artificial Intelligence - Emerging Trends and Applications 2018."},{"issue":"1","key":"4758_CR8","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1093\/bfgp\/ely030","volume":"18","author":"Z Zhang","year":"2019","unstructured":"Zhang Z, Zhao Y, Liao X, Shi W, Li K, Zou Q, Peng S. Deep learning in omics: a survey and guideline. Brief Funct Genomics. 2019;18(1):41\u201357.","journal-title":"Brief Funct Genomics"},{"key":"4758_CR9","doi-asserted-by":"crossref","unstructured":"Martorell-Marugan J, Tabik S, Benhammou Y, del Val C, Zwir I, Herrera F, Carmona-Saez P: Deep learning in omics data analysis and precision medicine. In: Husi H (ed) Computational Biology, Brisbane (AU); 2019.","DOI":"10.15586\/computationalbiology.2019.ch3"},{"issue":"10","key":"4758_CR10","doi-asserted-by":"publisher","first-page":"733","DOI":"10.1038\/nrg2825","volume":"11","author":"JT Leek","year":"2010","unstructured":"Leek JT, Scharpf RB, Bravo HC, Simcha D, Langmead B, Johnson WE, Geman D, Baggerly K, Irizarry RA. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet. 2010;11(10):733\u20139.","journal-title":"Nat Rev Genet"},{"issue":"4","key":"4758_CR11","doi-asserted-by":"publisher","first-page":"469","DOI":"10.1093\/bib\/bbs037","volume":"14","author":"C Lazar","year":"2013","unstructured":"Lazar C, Meganck S, Taminau J, Steenhoff D, Coletta A, Molter C, Weiss-Solis DY, Duque R, Bersini H, Nowe A. Batch effect removal methods for microarray gene expression data integration: a survey. Brief Bioinform. 2013;14(4):469\u201390.","journal-title":"Brief Bioinform"},{"issue":"1","key":"4758_CR12","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1093\/biostatistics\/kxj037","volume":"8","author":"WE Johnson","year":"2007","unstructured":"Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8(1):118\u201327.","journal-title":"Biostatistics"},{"key":"4758_CR13","doi-asserted-by":"publisher","DOI":"10.7717\/peerj.561","volume":"2","author":"HS Parker","year":"2014","unstructured":"Parker HS, Corrada Bravo H, Leek JT. Removing batch effects for prediction problems with frozen surrogate variable analysis. PeerJ. 2014;2: e561.","journal-title":"PeerJ"},{"issue":"1","key":"4758_CR14","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1093\/bioinformatics\/btg385","volume":"20","author":"M Benito","year":"2004","unstructured":"Benito M, Parker J, Du Q, Wu J, Xiang D, Perou CM, Marron JS. Adjustment of systematic microarray data biases. Bioinformatics. 2004;20(1):105\u201314.","journal-title":"Bioinformatics"},{"issue":"1","key":"4758_CR15","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1186\/1755-8794-1-42","volume":"1","author":"AH Sims","year":"2008","unstructured":"Sims AH, Smethurst GJ, Hey Y, Okoniewski MJ, Pepper SD, Howell A, Miller CJ, Clarke RB. The removal of multiplicative, systematic bias allows integration of breast cancer gene expression datasets \u2013 improving meta-analysis and prediction of prognosis. BMC Med Genomics. 2008;1(1):42.","journal-title":"BMC Med Genomics"},{"issue":"4","key":"4758_CR16","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1038\/tpj.2010.57","volume":"10","author":"J Luo","year":"2010","unstructured":"Luo J, Schumacher M, Scherer A, Sanoudou D, Megherbi D, Davison T, Shi T, Tong W, Shi L, Hong H, et al. A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data. Pharmacogenomics J. 2010;10(4):278\u201391.","journal-title":"Pharmacogenomics J"},{"issue":"9","key":"4758_CR17","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pgen.0030161","volume":"3","author":"JT Leek","year":"2007","unstructured":"Leek JT, Storey JD. Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 2007;3(9): e161.","journal-title":"PLoS Genet"},{"key":"4758_CR18","unstructured":"O'Shea K, Nash R: An introduction to convolutional neural networks. ArXiv e-prints 2015."},{"key":"4758_CR19","unstructured":"Zaremba W, Sutskever I, Vinyals O: Recurrent neural network regularization. 2014."},{"key":"4758_CR20","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735\u201380.","journal-title":"Neural Comput"},{"key":"4758_CR21","unstructured":"Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y: Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2; Montreal, Canada. MIT Press 2014: 2672\u20132680."},{"issue":"3","key":"4758_CR22","doi-asserted-by":"publisher","first-page":"807","DOI":"10.3390\/s22030807","volume":"22","author":"K Jabeen","year":"2022","unstructured":"Jabeen K, Khan MA, Alhaisoni M, Tariq U, Zhang Y-D, Hamza A, Mickus A, Dama\u0161evi\u010dius R. Breast cancer classification from ultrasound images using probability-based optimal deep learning feature fusion. Sensors. 2022;22(3):807.","journal-title":"Sensors"},{"issue":"2","key":"4758_CR23","doi-asserted-by":"publisher","first-page":"434","DOI":"10.3390\/s22020434","volume":"22","author":"M Nawaz","year":"2022","unstructured":"Nawaz M, Nazir T, Javed A, Tariq U, Yong H-S, Khan MA, Cha J. An efficient deep learning approach to automatic glaucoma detection using optic disc and optic cup localization. Sensors. 2022;22(2):434.","journal-title":"Sensors"},{"issue":"3","key":"4758_CR24","doi-asserted-by":"publisher","first-page":"334","DOI":"10.3390\/electronics11030334","volume":"11","author":"MI Sharif","year":"2022","unstructured":"Sharif MI, Khan MA, Alqahtani A, Nazir M, Alsubai S, Binbusayyis A, Dama\u0161evi\u010dius R. Deep learning and kurtosis-controlled, entropy-based framework for human gait recognition using video sequences. Electronics. 2022;11(3):334.","journal-title":"Electronics"},{"key":"4758_CR25","doi-asserted-by":"crossref","unstructured":"Khan MA, Muhammad K, Sharif M, Akram T, Kadry S: Intelligent fusion-assisted skin lesion localization and classification for smart healthcare. Neural Computing and Applications 2021.","DOI":"10.1007\/s00521-021-06490-w"},{"issue":"16","key":"4758_CR26","doi-asserted-by":"publisher","first-page":"2539","DOI":"10.1093\/bioinformatics\/btx196","volume":"33","author":"U Shaham","year":"2017","unstructured":"Shaham U, Stanton KP, Zhao J, Li H, Raddassi K, Montgomery R, Kluger Y. Removal of batch effects using distribution-matching residual networks. Bioinformatics. 2017;33(16):2539\u201346.","journal-title":"Bioinformatics"},{"issue":"7","key":"4758_CR27","doi-asserted-by":"publisher","first-page":"5082","DOI":"10.1021\/acs.analchem.9b05460","volume":"92","author":"Z Rong","year":"2020","unstructured":"Rong Z, Tan Q, Cao L, Zhang L, Deng K, Huang Y, Zhu ZJ, Li Z, Li K. NormAE: deep adversarial learning model to remove batch effects in liquid chromatography mass spectrometry-based metabolomics data. Anal Chem. 2020;92(7):5082\u201390.","journal-title":"Anal Chem"},{"issue":"8","key":"4758_CR28","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1038\/nbt.1665","volume":"28","author":"L Shi","year":"2010","unstructured":"Shi L, Campbell G, Jones WD, Campagne F, Wen Z, Walker SJ, Su Z, Chu T-M, Goodsaid FM, Pusztai L, et al. The microarray quality control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. Nat Biotechnol. 2010;28(8):827\u201338.","journal-title":"Nat Biotechnol"},{"issue":"4","key":"4758_CR29","doi-asserted-by":"publisher","first-page":"780","DOI":"10.1016\/j.cell.2016.04.019","volume":"165","author":"MH Spitzer","year":"2016","unstructured":"Spitzer MH, Nolan GP. Mass cytometry: single cells, many features. Cell. 2016;165(4):780\u201391.","journal-title":"Cell"},{"issue":"5","key":"4758_CR30","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1002\/cyto.a.22271","volume":"83","author":"R Finck","year":"2013","unstructured":"Finck R, Simonds EF, Jager A, Krishnaswamy S, Sachs K, Fantl W, Pe\u2019er D, Nolan GP, Bendall SC. Normalization of mass cytometry data with bead standards. Cytom Part A J Int Soc Anal Cytol. 2013;83(5):483\u201394.","journal-title":"Cytom Part A J Int Soc Anal Cytol"},{"issue":"8","key":"4758_CR31","doi-asserted-by":"publisher","first-page":"2677","DOI":"10.1002\/art.34473","volume":"64","author":"M Petri","year":"2012","unstructured":"Petri M, Orbai AM, Alarc\u00f3n GS, Gordon C, Merrill JT, Fortin PR, Bruce IN, Isenberg D, Wallace DJ, Nived O, et al. Derivation and validation of the systemic lupus international collaborating clinics classification criteria for systemic lupus erythematosus. Arthritis Rheum. 2012;64(8):2677\u201386.","journal-title":"Arthritis Rheum"},{"issue":"1","key":"4758_CR32","doi-asserted-by":"publisher","first-page":"3556","DOI":"10.1038\/s41467-020-17347-6","volume":"11","author":"L Huang","year":"2020","unstructured":"Huang L, Wang L, Hu X, Chen S, Tao Y, Su H, Yang J, Xu W, Vedarethinam V, Wu S, et al. Machine learning of serum metabolic patterns encodes early-stage lung adenocarcinoma. Nat Commun. 2020;11(1):3556.","journal-title":"Nat Commun"},{"key":"4758_CR33","unstructured":"Kingma D, Ba J: Adam: a method for stochastic optimization. In: International Conference on Learning Representations 2014."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04758-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04758-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04758-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,11]],"date-time":"2022-07-11T14:02:50Z","timestamp":1657548170000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04758-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,10]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4758"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04758-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,10]]},"assertion":[{"value":"15 July 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 May 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 July 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"Not applicable.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"270"}}