{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,14]],"date-time":"2026-05-14T18:21:22Z","timestamp":1778782882793,"version":"3.51.4"},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2023,5,30]],"date-time":"2023-05-30T00:00:00Z","timestamp":1685404800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Center for Bioanalytical Metrology","award":["NSF IIP-1916645"],"award-info":[{"award-number":["NSF IIP-1916645"]}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DBI-2011271"],"award-info":[{"award-number":["DBI-2011271"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Tandem mass spectrometry is an essential technology for characterizing chemical compounds at high sensitivity and throughput, and is commonly adopted in many fields. However, computational methods for automated compound identification from their MS\/MS spectra are still limited, especially for novel compounds that have not been previously characterized. In recent years, in silico methods were proposed to predict the MS\/MS spectra of compounds, which can then be used to expand the reference spectral libraries for compound identification. However, these methods did not consider the compounds\u2019 3D conformations, and thus neglected critical structural information.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present the 3D Molecular Network for Mass Spectra Prediction (3DMolMS), a deep neural network model to predict the MS\/MS spectra of compounds from their 3D conformations. We evaluated the model on the experimental spectra collected in several spectral libraries. The results showed that 3DMolMS predicted the spectra with the average cosine similarity of 0.691 and 0.478 with the experimental MS\/MS spectra acquired in positive and negative ion modes, respectively. Furthermore, 3DMolMS model can be generalized to the prediction of MS\/MS spectra acquired by different labs on different instruments through minor fine-tuning on a small set of spectra. Finally, we demonstrate that the molecular representation learned by 3DMolMS from MS\/MS spectra prediction can be adapted to enhance the prediction of chemical properties such as the elution time in the liquid chromatography and the collisional cross section measured by ion mobility spectrometry, both of which are often used to improve compound identification.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The codes of 3DMolMS are available at https:\/\/github.com\/JosieHong\/3DMolMS and the web service is at https:\/\/spectrumprediction.gnps2.org.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad354","type":"journal-article","created":{"date-parts":[[2023,5,30]],"date-time":"2023-05-30T12:24:41Z","timestamp":1685449481000},"source":"Crossref","is-referenced-by-count":32,"title":["3DMolMS: prediction of tandem mass spectra from 3D molecular conformations"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5647-9714","authenticated-orcid":false,"given":"Yuhui","family":"Hong","sequence":"first","affiliation":[{"name":"Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington , Bloomington, IN 47408, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sujun","family":"Li","sequence":"additional","affiliation":[{"name":"Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington , Bloomington, IN 47408, United States"},{"name":"Shanghai Dengding BioAI Co., Ltd, Building 2, Lane 500, Furonghua Road, Pudong New District , Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christopher J","family":"Welch","sequence":"additional","affiliation":[{"name":"Indiana Consortium for Analytical Science & Engineering (ICASE) , Indianapolis, IN 46202, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shane","family":"Tichy","sequence":"additional","affiliation":[{"name":"Agilent Technologies , Santa Clara, CA 95051, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3707-3185","authenticated-orcid":false,"given":"Yuzhen","family":"Ye","sequence":"additional","affiliation":[{"name":"Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington , Bloomington, IN 47408, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haixu","family":"Tang","sequence":"additional","affiliation":[{"name":"Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington , Bloomington, IN 47408, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,5,30]]},"reference":[{"key":"2023062101001395500_btad354-B1","author":"Adams","year":"2021"},{"key":"2023062101001395500_btad354-B2","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/s41592-021-01197-1","article-title":"Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices","volume":"18","author":"Alseekh","year":"2021","journal-title":"Nat Methods"},{"key":"2023062101001395500_btad354-B3","doi-asserted-by":"crossref","first-page":"31","DOI":"10.3390\/metabo8020031","article-title":"Software tools and approaches for compound identification of LC-MS\/MS data in metabolomics","volume":"8","author":"Bla\u017eenovi\u0107","year":"2018","journal-title":"Metabolites"},{"key":"2023062101001395500_btad354-B4","doi-asserted-by":"crossref","first-page":"9557","DOI":"10.1021\/ac1022953","article-title":"Collision cross sections of proteins and their complexes: a calibration framework and database for gas-phase structural biology","volume":"82","author":"Bush","year":"2010","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-13680-7","article-title":"The metlin small molecule dataset for machine learning-based retention time prediction","volume":"10","author":"Domingo-Almenara","year":"2019","journal-title":"Nat Commun"},{"key":"2023062101001395500_btad354-B6","author":"Gasteiger","year":"2020"},{"key":"2023062101001395500_btad354-B7","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1038\/s41592-019-0426-7","article-title":"Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning","volume":"16","author":"Gessulat","year":"2019","journal-title":"Nat Methods"},{"key":"2023062101001395500_btad354-B8","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1038\/nmeth.3902","article-title":"Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets","volume":"13","author":"Griss","year":"2016","journal-title":"Nat Methods"},{"key":"2023062101001395500_btad354-B9","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1186\/s12859-018-2523-5","article-title":"Convolutional neural network based on smiles representation of compounds for detecting chemical motif","volume":"19","author":"Hirohara","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023062101001395500_btad354-B10","doi-asserted-by":"crossref","first-page":"10780","DOI":"10.1021\/ac502805w","article-title":"Improving natural products identification through targeted LC-MS\/MS in an untargeted secondary metabolomics workflow","volume":"86","author":"Hoffmann","year":"2014","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B11","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1002\/jms.1777","article-title":"Massbank: a public repository for sharing mass spectral data for life sciences","volume":"45","author":"Horai","year":"2010","journal-title":"J Mass Spectrom"},{"key":"2023062101001395500_btad354-B12","first-page":"1","article-title":"A merged molecular representation learning for molecular properties prediction with a web-based service","volume":"11","author":"Kim","year":"2021","journal-title":"Sci Rep"},{"key":"2023062101001395500_btad354-B13","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1038\/nmeth.2551","article-title":"Lipidblast in silico tandem mass spectrometry database for lipid identification","volume":"10","author":"Kind","year":"2013","journal-title":"Nat Methods"},{"key":"2023062101001395500_btad354-B14","author":"Klicpera","year":"2020"},{"key":"2023062101001395500_btad354-B15","first-page":"655","author":"Lam","year":"2006"},{"key":"2023062101001395500_btad354-B16","doi-asserted-by":"crossref","first-page":"4275","DOI":"10.1021\/acs.analchem.9b04867","article-title":"Full-spectrum prediction of peptides tandem mass spectra using deep neural network","volume":"92","author":"Liu","year":"2020","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B17","doi-asserted-by":"crossref","first-page":"107000","DOI":"10.1016\/j.patcog.2019.107000","article-title":"Dynamic graph convolutional networks","volume":"97","author":"Manessi","year":"2020","journal-title":"Pattern Recogn"},{"key":"2023062101001395500_btad354-B18","doi-asserted-by":"crossref","first-page":"227","DOI":"10.2174\/2213235X113019990005","article-title":"Biomarker discovery and translation in metabolomics","volume":"1","author":"Nagana Gowda","year":"2013","journal-title":"Curr Metab"},{"key":"2023062101001395500_btad354-B19","first-page":"8024","volume-title":"Advances in Neural Information Processing Systems","author":"Paszke","year":"2019"},{"key":"2023062101001395500_btad354-B20","doi-asserted-by":"crossref","first-page":"5191","DOI":"10.1021\/acs.analchem.8b05821","article-title":"Predicting ion mobility collision cross-sections using a deep neural network: DeepCCS","volume":"91","author":"Plante","year":"2019","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B21","first-page":"652","author":"Qi","year":"2017"},{"key":"2023062101001395500_btad354-B22","doi-asserted-by":"crossref","first-page":"4373","DOI":"10.1021\/ac800660d","article-title":"Environmental mass spectrometry: emerging contaminants and current issues","volume":"80","author":"Richardson","year":"2008","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B23","doi-asserted-by":"crossref","first-page":"2562","DOI":"10.1021\/acs.jcim.5b00654","article-title":"Better informed distance geometry: using what we know to improve conformation generation","volume":"55","author":"Riniker","year":"2015","journal-title":"J Chem Inf Model"},{"key":"2023062101001395500_btad354-B24","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1021\/ci100050t","article-title":"Extended-connectivity fingerprints","volume":"50","author":"Rogers","year":"2010","journal-title":"J Chem Inf Model"},{"key":"2023062101001395500_btad354-B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-016-0115-9","article-title":"Metfrag relaunched: incorporating strategies beyond in silico fragmentation","volume":"8","author":"Ruttkies","year":"2016","journal-title":"J Cheminform"},{"key":"2023062101001395500_btad354-B26","doi-asserted-by":"crossref","first-page":"241722","DOI":"10.1063\/1.5019779","article-title":"Schnet\u2014a deep learning architecture for molecules and materials","volume":"148","author":"Sch\u00fctt","year":"2018","journal-title":"J Chem Phys"},{"key":"2023062101001395500_btad354-B27","first-page":"7274","author":"Stein","year":"2012"},{"key":"2023062101001395500_btad354-B28","first-page":"1","author":"Tanimoto","year":"1958"},{"key":"2023062101001395500_btad354-B29","doi-asserted-by":"crossref","first-page":"5815","DOI":"10.1021\/acs.analchem.0c05427","article-title":"pDeep3: toward more accurate spectrum prediction with fast few-shot learning","volume":"93","author":"Tarn","year":"2021","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B30","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1038\/s41592-019-0427-6","article-title":"High-quality MS\/MS spectrum prediction for data-dependent and data-independent acquisition data analysis","volume":"16","author":"Tiwary","year":"2019","journal-title":"Nat Methods"},{"key":"2023062101001395500_btad354-B31","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.trac.2015.09.005","article-title":"Mass spectral databases for LC\/MS- and GC\/MS-based metabolomics: state of the field and future prospects","volume":"78","author":"Vinaixa","year":"2016","journal-title":"TrAC Trends Anal Chem"},{"key":"2023062101001395500_btad354-B32","doi-asserted-by":"crossref","first-page":"11692","DOI":"10.1021\/acs.analchem.1c01465","article-title":"CFM-ID 4.0: more accurate ESI-MS\/MS spectral prediction and compound identification","volume":"93","author":"Wang","year":"2021","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B33","doi-asserted-by":"crossref","first-page":"828","DOI":"10.1038\/nbt.3597","article-title":"Sharing and community curation of mass spectrometry data with global natural products social molecular networking","volume":"34","author":"Wang","year":"2016","journal-title":"Nat Biotechnol"},{"key":"2023062101001395500_btad354-B34","doi-asserted-by":"crossref","first-page":"9496","DOI":"10.1021\/ac5014783","article-title":"MIDAS: a database-searching algorithm for metabolite identification in metabolomics","volume":"86","author":"Wang","year":"2014","journal-title":"Anal Chem"},{"key":"2023062101001395500_btad354-B35","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1038\/s42256-022-00447-x","article-title":"Molecular contrastive learning of representations via graph neural networks","volume":"4","author":"Wang","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2023062101001395500_btad354-B36","doi-asserted-by":"crossref","first-page":"700","DOI":"10.1021\/acscentsci.9b00085","article-title":"Rapid prediction of electron\u2013ionization mass spectrometry using neural networks","volume":"5","author":"Wei","year":"2019","journal-title":"ACS Cent Sci"},{"key":"2023062101001395500_btad354-B37","doi-asserted-by":"crossref","first-page":"D622","DOI":"10.1093\/nar\/gkab1062","article-title":"HMDB 5.0: the human metabolome database for 2022","volume":"50","author":"Wishart","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023062101001395500_btad354-B38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.trac.2011.08.009","article-title":"Metabolite identification and quantitation in LC-MS\/MS-based metabolomics","volume":"32","author":"Xiao","year":"2012","journal-title":"Trends Analyt Chem"},{"key":"2023062101001395500_btad354-B39","doi-asserted-by":"crossref","first-page":"2280","DOI":"10.1007\/s13361-017-1748-2","article-title":"Extending a tandem mass spectral library to include MS2 spectra of fragment ions produced in-source and msn spectra","volume":"28","author":"Yang","year":"2017","journal-title":"J Am Soc Mass Spectrom"},{"key":"2023062101001395500_btad354-B40","first-page":"1","article-title":"In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics","volume":"11","author":"Yang","year":"2020","journal-title":"Nat Commun"},{"key":"2023062101001395500_btad354-B41","author":"Young","year":"2021"},{"key":"2023062101001395500_btad354-B42","article-title":"Graph transformer networks","volume":"32","author":"Yun","year":"2019","journal-title":"Adv Neural Inf Process Syst"},{"key":"2023062101001395500_btad354-B43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-18171-8","article-title":"Ion mobility collision cross-section atlas for known and unknown metabolite annotation in untargeted metabolomics","volume":"11","author":"Zhou","year":"2020","journal-title":"Nat Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad354\/50489048\/btad354.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/6\/btad354\/50661428\/btad354.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/6\/btad354\/50661428\/btad354.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,21]],"date-time":"2023-06-21T01:39:50Z","timestamp":1687311590000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad354\/7186501"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2023,5,30]]},"references-count":43,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad354","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.03.15.532823","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,6,1]]},"published":{"date-parts":[[2023,5,30]]},"article-number":"btad354"}}