{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T19:41:15Z","timestamp":1776886875158,"version":"3.51.2"},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,9,17]],"date-time":"2012-09-17T00:00:00Z","timestamp":1347840000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Computer Assisted Structure Elucidation has been used for decades to discover the chemical structure of unknown compounds. In this work we introduce the first open source structure generator, Open Molecule Generator (OMG), which for a given elemental composition produces all non-isomorphic chemical structures that match that elemental composition. Furthermore, this structure generator can accept as additional input one or multiple non-overlapping prescribed substructures to drastically reduce the number of possible chemical structures. Being open source allows for customization and future extension of its functionality. OMG relies on a modified version of the Canonical Augmentation Path, which grows intermediate chemical structures by adding bonds and checks that at each step only unique molecules are produced. In order to benchmark the tool, we generated chemical structures for the elemental formulas and substructures of different metabolites and compared the results with a commercially available structure generator. The results obtained, i.e. the number of molecules generated, were identical for elemental compositions having only C, O and H. For elemental compositions containing C, O, H, N, P and S, OMG produces all the chemically valid molecules while the other generator produces more, yet chemically impossible, molecules. The chemical completeness of the OMG results comes at the expense of being slower than the commercial generator. In addition to being open source, OMG clearly showed the added value of constraining the solution space by using multiple prescribed substructures as input. We expect this structure generator to be useful in many fields, but to be especially of great importance for metabolomics, where identifying unknown metabolites is still a major bottleneck.<\/jats:p>","DOI":"10.1186\/1758-2946-4-21","type":"journal-article","created":{"date-parts":[[2012,9,17]],"date-time":"2012-09-17T10:14:18Z","timestamp":1347876858000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":75,"title":["OMG: Open Molecule Generator"],"prefix":"10.1186","volume":"4","author":[{"given":"Julio E","family":"Peironcely","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Miguel","family":"Rojas-Chert\u00f3","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Davide","family":"Fichera","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Theo","family":"Reijmers","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leon","family":"Coulier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jean-Loup","family":"Faulon","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Hankemeier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2012,9,17]]},"reference":[{"key":"365_CR1","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1007\/s12566-010-0015-9","volume":"2","author":"T Kind","year":"2010","unstructured":"Kind T, Fiehn O: Advances in structure elucidation of small molecules using mass spectrometry. Bioanal Rev. 2010, 2: 23-60. 10.1007\/s12566-010-0015-9.","journal-title":"Bioanal Rev"},{"key":"365_CR2","volume-title":"Applications of Artificial Intelligence for Organic Chemistry: The DENDRAL Project","author":"RK Lindsay","year":"1980","unstructured":"Lindsay RK, Buchanan BG, Feigenbaum EA, Lederberg J: Applications of Artificial Intelligence for Organic Chemistry: The DENDRAL Project. 1980, New York: McGraw-Hill Book"},{"key":"365_CR3","doi-asserted-by":"publisher","first-page":"1708","DOI":"10.1021\/jo00321a037","volume":"46","author":"RE Carhart","year":"1981","unstructured":"Carhart RE, Smith DH, Gray NAB, Nourse JG, Djerassi C: GENOA: A computer program for structure elucidation utilizing overlapping and alternative substructures. J Org Chem. 1981, 46: 1708-1718. 10.1021\/jo00321a037.","journal-title":"J Org Chem"},{"key":"365_CR4","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1021\/ci00057a003","volume":"28","author":"K Funatsu","year":"1988","unstructured":"Funatsu K, Miyabayaski N, Sasaki S: Further development of structure generation in the automated structure elucidation system CHEMICS. J Chem Inf Model. 1988, 28: 18-28. 10.1021\/ci00057a003.","journal-title":"J Chem Inf Model"},{"key":"365_CR5","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1016\/S0169-7439(00)00056-3","volume":"51","author":"M Badertscher","year":"2000","unstructured":"Badertscher M, Korytko A, Schulz K-P, Madison M, Munk ME, Portmann P, Junghans M, Fontana P, Pretsch E: Assemble 2.0: a structure generator. Chemom Intell Lab Syst. 2000, 51: 73-79. 10.1016\/S0169-7439(00)00056-3.","journal-title":"Chemom Intell Lab Syst"},{"key":"365_CR6","doi-asserted-by":"publisher","first-page":"888","DOI":"10.1021\/ci950393z","volume":"36","author":"MS Molchanova","year":"1996","unstructured":"Molchanova MS, Shcherbukhin VV, Zefirov NS: Computer Generation of Molecular Structures by the SMOG Program. J Chem Inf Model. 1996, 36: 888-899. 10.1021\/ci950393z.","journal-title":"J Chem Inf Model"},{"key":"365_CR7","first-page":"205","volume":"37","author":"A Kerber","year":"1998","unstructured":"Kerber A, Laue R, Gr\u00fcner T, Meringer M: MOLGEN 4.0. Match Communications. Math Comput Chem. 1998, 37: 205-208.","journal-title":"Math Comput Chem"},{"key":"365_CR8","doi-asserted-by":"publisher","first-page":"12267","DOI":"10.1016\/S0040-4020(01)89576-4","volume":"50","author":"SV Ley","year":"1994","unstructured":"Ley SV, Doherty K, Massiot G, Nuzillard JM: Connectivist approach to organic structure determination. LSD-program assisted NMR analysis of the insect antifeedant azadirachtin. Tetrahedron. 1994, 50: 12267-12280. 10.1016\/S0040-4020(01)89576-4.","journal-title":"Tetrahedron"},{"key":"365_CR9","doi-asserted-by":"publisher","first-page":"1500","DOI":"10.1021\/ci000407n","volume":"41","author":"C Steinbeck","year":"2001","unstructured":"Steinbeck C: SENECA: A platform-independent, distributed, and parallel system for computer-assisted structure elucidation in organic chemistry. J Chem Inf Comput Sci. 2001, 41: 1500-7. 10.1021\/ci000407n.","journal-title":"J Chem Inf Comput Sci"},{"key":"365_CR10","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1021\/ci025584y","volume":"43","author":"C Steinbeck","year":"2003","unstructured":"Steinbeck C, Han Y, Kuhn S, Horlacher O, Luttmann E, Willighagen E: The Chemistry Development Kit (CDK): an open-source Java library for Chemo- and Bioinformatics. J Chem Inform Comput Sci. 2003, 43: 493-500.","journal-title":"J Chem Inform Comput Sci"},{"key":"365_CR11","doi-asserted-by":"publisher","first-page":"2111","DOI":"10.2174\/138161206777585274","volume":"12","author":"C Steinbeck","year":"2006","unstructured":"Steinbeck C, Hoppe C, Kuhn S, Floris M, Guha R, Willighagen EL: Recent Developments of the Chemistry Development Kit (CDK) - An Open-Source Java Library for Chemo- and Bioinformatics. Curr Pharm Des. 2006, 12: 2111-2120. 10.2174\/138161206777585274.","journal-title":"Curr Pharm Des"},{"key":"365_CR12","doi-asserted-by":"publisher","first-page":"544","DOI":"10.1016\/j.tibtech.2005.08.005","volume":"23","author":"J Nielsen","year":"2005","unstructured":"Nielsen J, Oliver S: The next wave in metabolome analysis. Trends Biotechnol. 2005, 23: 544-6. 10.1016\/j.tibtech.2005.08.005.","journal-title":"Trends Biotechnol"},{"key":"365_CR13","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1007\/s11306-009-0168-0","volume":"5","author":"A Scalbert","year":"2009","unstructured":"Scalbert A, Brennan L, Fiehn O, Hankemeier T, Kristal BS, van Ommen B, Pujos-Guillot E, Verheij E, Wishart D, Wopereis S: Mass-spectrometry-based metabolomics: limitations and recommendations for future progress with particular focus on nutrition research. Metabolomics. 2009, 5: 435-458. 10.1007\/s11306-009-0168-0.","journal-title":"Metabolomics"},{"key":"365_CR14","doi-asserted-by":"publisher","first-page":"1060","DOI":"10.1038\/nprot.2011.335","volume":"6","author":"WB Dunn","year":"2011","unstructured":"Dunn WB, Broadhurst D, Begley P, Zelena E, Francis-McIntyre S, Anderson N, Brown M, Knowles JD, Halsall A, Haselden JN, Nicholls AW, Wilson ID, Kell DB, Goodacre R: Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry. Nat Protoc. 2011, 6: 1060-1083. 10.1038\/nprot.2011.335.","journal-title":"Nat Protoc"},{"key":"365_CR15","doi-asserted-by":"publisher","first-page":"7677","DOI":"10.1021\/ac901087t","volume":"81","author":"R Mohamed","year":"2009","unstructured":"Mohamed R, Varesio E, Ivosev G, Burton L, Bonner R, Hopfgartner G: Comprehensive analytical strategy for biomarker identification based on liquid chromatography coupled to mass spectrometry and new candidate confirmation tools. Anal Chem. 2009, 81: 7677-94. 10.1021\/ac901087t.","journal-title":"Anal Chem"},{"key":"365_CR16","doi-asserted-by":"publisher","first-page":"1994","DOI":"10.1021\/ac2030738","volume":"84","author":"T Zhang","year":"2012","unstructured":"Zhang T, Creek DJ, Barrett MP, Blackburn G, Watson DG: Evaluation of Coupling Reversed Phase, Aqueous Normal Phase, and Hydrophilic Interaction Liquid Chromatography with Orbitrap Mass Spectrometry for Metabolomic Studies of Human Urine. Analytical Chemistry. 2012, 84: 1994-2001. 10.1021\/ac2030738.","journal-title":"Analytical Chemistry"},{"key":"365_CR17","doi-asserted-by":"publisher","first-page":"136","DOI":"10.1016\/j.aca.2008.03.060","volume":"615","author":"EL Schymanski","year":"2008","unstructured":"Schymanski EL, Meinert C, Meringer M, Brack W: The use of MS classifiers and structure generation to assist in the identification of unknowns in effect-directed analysis. Anal Chim Acta. 2008, 615: 136-147. 10.1016\/j.aca.2008.03.060.","journal-title":"Anal Chim Acta"},{"key":"365_CR18","doi-asserted-by":"publisher","first-page":"903","DOI":"10.1021\/ac102574h","volume":"83","author":"EL Schymanski","year":"2011","unstructured":"Schymanski EL, Meringer M, Brack W: Automated Strategies To Identify Compounds on the Basis of GC\/EI-MS and Calculated Properties. Anal Chem. 2011, 83: 903-912. 10.1021\/ac102574h.","journal-title":"Anal Chem"},{"key":"365_CR19","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1080\/00207167908803169","volume":"7","author":"C Colbourn","year":"1979","unstructured":"Colbourn C, Read R: Orderly algorithms for graph generation. Int J Comput Math. 1979, 7: 167-172. 10.1080\/00207167908803169.","journal-title":"Int J Comput Math"},{"key":"365_CR20","first-page":"131","volume-title":"Probl\u00e8mes combinatoires et th\u00e9orie des graphes","author":"IA Faradzev","year":"1978","unstructured":"Faradzev IA: Constructive Enumeration of Combinatorial Objects. Probl\u00e8mes combinatoires et th\u00e9orie des graphes. 1978, Orsay: University of Paris, 131-135."},{"key":"365_CR21","doi-asserted-by":"publisher","first-page":"1204","DOI":"10.1021\/ci00021a031","volume":"34","author":"J-L Faulon","year":"1994","unstructured":"Faulon J-L: Stochastic Generator of Chemical Structure. 1. Application to the Structure Elucidation of Large Molecules. J Chem Inf Model. 1994, 34: 1204-1218. 10.1021\/ci00021a031.","journal-title":"J Chem Inf Model"},{"key":"365_CR22","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1023\/A:1005998722658","volume":"52","author":"A Kerber","year":"1998","unstructured":"Kerber A, Laue R: Group Actions, Double Cosets, and Homomorphisms: Unifying Concepts for the Constructive Theory of Discrete Structures. Acta Applicandae Mathematicae. 1998, 52: 63-90. 10.1023\/A:1005998722658.","journal-title":"Acta Applicandae Mathematicae"},{"key":"365_CR23","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1006\/jagm.1997.0898","volume":"26","author":"B McKay","year":"1998","unstructured":"McKay B: Isomorph-Free Exhaustive Generation. J Algorithm. 1998, 26: 306-324. 10.1006\/jagm.1997.0898.","journal-title":"J Algorithm"},{"key":"365_CR24","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1021\/ci600423u","volume":"47","author":"T Fink","year":"2007","unstructured":"Fink T, Reymond J-L: Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discove. J Chem Inf Model. 2007, 47: 342-53. 10.1021\/ci600423u.","journal-title":"J Chem Inf Model"},{"key":"365_CR25","doi-asserted-by":"publisher","first-page":"8732","DOI":"10.1021\/ja902302h","volume":"131","author":"LC Blum","year":"2009","unstructured":"Blum LC, Reymond J-L: 970 Million Druglike Small Molecules for Virtual Screening in the Chemical Universe Database GDB-13. J Am Chem Soc. 2009, 131: 8732-8733. 10.1021\/ja902302h.","journal-title":"J Am Chem Soc"},{"key":"365_CR26","doi-asserted-by":"publisher","first-page":"2230","DOI":"10.1021\/ci060016u","volume":"46","author":"WL Chen","year":"2006","unstructured":"Chen WL: Chemoinformatics: past, present, and future. J Chem Inf Model. 2006, 46: 2230-55. 10.1021\/ci060016u.","journal-title":"J Chem Inf Model"},{"key":"365_CR27","unstructured":"Rojas-Chert\u00f3 M, Peironcely JE, Kasper PT, van der Hooft JJJ, de Vos RCH, Vreeken R, Hankemeier T, Reijmers T: Metabolite identification using automated comparison of high resolution MSn spectral trees. submitted"},{"key":"365_CR28","doi-asserted-by":"publisher","first-page":"e28966","DOI":"10.1371\/journal.pone.0028966","volume":"6","author":"JE Peironcely","year":"2011","unstructured":"Peironcely JE, Reijmers T, Coulier L, Bender A, Hankemeier T: Understanding and Classifying Metabolite Space and Metabolite-Likeness. PLoS One. 2011, 6: e28966-10.1371\/journal.pone.0028966.","journal-title":"PLoS One"},{"key":"365_CR29","volume-title":"Nauty User\u2019s Guide ( Version 2.4 )","author":"BD Mckay","year":"2009","unstructured":"Mckay BD: Nauty User\u2019s Guide ( Version 2.4 ). 2009"},{"key":"365_CR30","doi-asserted-by":"publisher","first-page":"542","DOI":"10.1021\/ci030404l","volume":"44","author":"J Braun","year":"2004","unstructured":"Braun J, Gugisch R, Kerber A, Laue R, Meringer M, R\u00fccker C: MOLGEN-CID\u2013A canonizer for molecules and graphs accessible through the Internet. J Chem Inform Comput Sci. 2004, 44: 542-8.","journal-title":"J Chem Inform Comput Sci"},{"key":"365_CR31","doi-asserted-by":"publisher","first-page":"427","DOI":"10.1021\/ci0341823","volume":"44","author":"J-L Faulon","year":"2004","unstructured":"Faulon J-L, Collins MJ, Carr RD: The signature molecular descriptor. 4. Canonizing molecules using extended valence sequences. J Chem Inf Comput Sci. 2004, 44: 427-36. 10.1021\/ci0341823.","journal-title":"J Chem Inf Comput Sci"},{"key":"365_CR32","unstructured":"IUPAC International Chemical Identifier (InChI): Technical Manual. http:\/\/www.inchi-trust.org\/sites\/default\/files\/inchi-1.04\/InChI_TechMan.pdf,"},{"key":"365_CR33","first-page":"188","volume-title":"3rd IAPR TC-15 Workshop on Graph-based Representations in Pattern Recognition","author":"P Foggia","year":"2001","unstructured":"Foggia P, Sansone C, Vento M: A Performance Comparison of Five Algorithms for Graph Isomorphism. 3rd IAPR TC-15 Workshop on Graph-based Representations in Pattern Recognition. 2001, 188-199."},{"key":"365_CR34","doi-asserted-by":"publisher","first-page":"D603","DOI":"10.1093\/nar\/gkn810","volume":"37","author":"DS Wishart","year":"2009","unstructured":"Wishart DS, Knox C, Guo AC, Eisner R, Young N, Gautam B, Hau DD, Psychogios N, Dong E, Bouatra S, Mandal R, Sinelnikov I, Xia J, Jia L, Cruz JA, Lim E, Sobsey CA, Shrivastava S, Huang P, Liu P, Fang L, Peng J, Fradette R, Cheng D, Tzur D, Clements M, Lewis A, Souza AD, Zuniga A, Dawe M, Xiong Y, Clive D, Greiner R, Nazyrova A, Shaykhutdinov R, Li L, Vogel HJ, Forsythe I: HMDB: a knowledgebase for the human metabolome. Nucleic Acids Res. 2009, 37: D603-610. 10.1093\/nar\/gkn810.","journal-title":"Nucleic Acids Res"},{"key":"365_CR35","doi-asserted-by":"publisher","first-page":"1345","DOI":"10.1021\/ci700385a","volume":"48","author":"H Fujiwara","year":"2008","unstructured":"Fujiwara H, Wang J, Zhao L, Nagamochi H, Akutsu T: Enumerating treelike chemical graphs with given path frequency. J Chem Inf Model. 2008, 48: 1345-57. 10.1021\/ci700385a.","journal-title":"J Chem Inf Model"},{"key":"365_CR36","doi-asserted-by":"publisher","first-page":"910","DOI":"10.1007\/s10910-010-9789-9","volume":"49","author":"T Imada","year":"2011","unstructured":"Imada T, Ota S, Nagamochi H, Akutsu T: Efficient enumeration of stereoisomers of tree structured molecules using dynamic programming. J Math Chem. 2011, 49: 910-970. 10.1007\/s10910-010-9789-9.","journal-title":"J Math Chem"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-4-21.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1758-2946-4-21\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-4-21.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T21:37:51Z","timestamp":1630532271000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/1758-2946-4-21"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,17]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["365"],"URL":"https:\/\/doi.org\/10.1186\/1758-2946-4-21","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,9,17]]},"assertion":[{"value":"18 May 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 September 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"21"}}