{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,11]],"date-time":"2025-04-11T04:11:13Z","timestamp":1744344673373,"version":"3.40.4"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T00:00:00Z","timestamp":1744243200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T00:00:00Z","timestamp":1744243200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100014553","name":"Samsung Advanced Institute of Technology","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100014553","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["RS-2023-00207903"],"award-info":[{"award-number":["RS-2023-00207903"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>In synthesis planning, identifying and optimizing chemical reactions are important for the successful design of synthetic pathways to target substances. Chemical reaction databases assist chemists in gaining insights into this process. Traditionally, searching for relevant records from a reaction database has relied on the manual formulation of queries by chemists based on their search purposes, which is challenging without explicit knowledge of what they are searching for. In this study, we propose an intelligent chemical reaction search system that simplifies the process of enhancing the search results. When a user submits a query, a list of relevant records is retrieved from the reaction database. Users can express their preferences and requirements by providing binary ratings for the individual retrieved records. The search results are refined based on the user feedback. To implement this system effectively, we incorporate and adapt contrastive representation learning, dimensionality reduction, and human-in-the-loop techniques. Contrastive learning is used to train a representation model that embeds records in the reaction database as numerical vectors suitable for chemical reaction searches. Dimensionality reduction is applied to compress these vectors, thereby enhancing the search efficiency. Human-in-the-loop is integrated to iteratively update the representation model by reflecting user feedback. Through experimental investigations, we demonstrate that the proposed method effectively improves the chemical reaction search towards better alignment with user preferences and requirements. <\/jats:p>\n          <jats:p>\n            <jats:bold>Scientific contribution<\/jats:bold> This study seeks to enhance the search functionality of chemical reaction databases by drawing inspiration from recommender systems. The proposed method simplifies the search process, offering an alternative to the complexity of formulating explicit query rules. We believe that the proposed method can assist users in efficiently discovering records relevant to target reactions, especially when they encounter difficulties in crafting detailed queries due to limited knowledge.<\/jats:p>","DOI":"10.1186\/s13321-025-00987-5","type":"journal-article","created":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T12:57:41Z","timestamp":1744289861000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop"],"prefix":"10.1186","volume":"17","author":[{"given":"Youngchun","family":"Kwon","sequence":"first","affiliation":[]},{"given":"Hyunjeong","family":"Jeon","sequence":"additional","affiliation":[]},{"given":"Joonhyuk","family":"Choi","sequence":"additional","affiliation":[]},{"given":"Youn-Suk","family":"Choi","sequence":"additional","affiliation":[]},{"given":"Seokho","family":"Kang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,4,10]]},"reference":[{"issue":"6","key":"987_CR1","doi-asserted-by":"publisher","first-page":"3089","DOI":"10.1021\/acs.chemrev.2c00798","volume":"123","author":"CJ Taylor","year":"2023","unstructured":"Taylor CJ, Pomberger A, Felton KC, Grainger R, Barecka M, Chamberlain TW et al (2023) A brief introduction to chemical reaction optimization. Chem Rev 123(6):3089\u20133126","journal-title":"Chem Rev"},{"issue":"6453","key":"987_CR2","doi-asserted-by":"publisher","first-page":"eaax1566","DOI":"10.1126\/science.aax1566","volume":"365","author":"CW Coley","year":"2019","unstructured":"Coley CW, Thomas DA, Lummiss J, Jaworski JN, Breen CP, Schultz V et al (2019) A robotic platform for flow synthesis of organic compounds informed by AI planning. Science 365(6453):eaax1566","journal-title":"Science"},{"issue":"44","key":"987_CR3","doi-asserted-by":"publisher","first-page":"eadj0461","DOI":"10.1126\/sciadv.adj0461","volume":"9","author":"T Ha","year":"2023","unstructured":"Ha T, Lee D, Kwon Y, Park MS, Lee S, Jang J et al (2023) AI-driven robotic chemist for autonomous synthesis of organic molecules. Sci Adv 9(44):eadj0461","journal-title":"Sci Adv"},{"issue":"23","key":"987_CR4","doi-asserted-by":"publisher","first-page":"5952","DOI":"10.1021\/acs.jcim.2c01085","volume":"62","author":"Y Kwon","year":"2022","unstructured":"Kwon Y, Kim S, Choi YS, Kang S (2022) Generative modeling to predict multiple suitable conditions for chemical reactions. J Chem Inf Model 62(23):5952\u20135960","journal-title":"J Chem Inf Model"},{"key":"987_CR5","doi-asserted-by":"crossref","unstructured":"Kwon Y, Lee D, Choi YS, Kang S (2022) uncertainty-aware prediction of chemical reaction yields with graph neural networks. J Cheminformatics 14(2)","DOI":"10.1186\/s13321-021-00579-z"},{"issue":"12","key":"987_CR6","doi-asserted-by":"publisher","first-page":"2897","DOI":"10.1021\/ci900437n","volume":"49","author":"J Goodman","year":"2009","unstructured":"Goodman J (2009) Computer software review: Reaxys. J Chem Inf Model 49(12):2897\u20132898","journal-title":"J Chem Inf Model"},{"issue":"4","key":"987_CR7","doi-asserted-by":"publisher","first-page":"588","DOI":"10.5195\/jmla.2018.515","volume":"106","author":"SW Gabrielson","year":"2018","unstructured":"Gabrielson SW (2018) SciFinder. J Med Libr Assoc 106(4):588\u2013590","journal-title":"J Med Libr Assoc"},{"issue":"45","key":"987_CR8","doi-asserted-by":"publisher","first-page":"18820","DOI":"10.1021\/jacs.1c09820","volume":"143","author":"SM Kearnes","year":"2021","unstructured":"Kearnes SM, Maser MR, Wleklinski M, Kast A, Doyle AG, Dreher SD et al (2021) The open reaction database. J Am Chem Soc 143(45):18820\u201318826","journal-title":"J Am Chem Soc"},{"key":"987_CR9","unstructured":"Lowe D (2017) Chemical Reactions From US Patents (1976-Sep2016). Available from: https:\/\/figshare.com\/articles\/dataset\/Chemical_reactions_from_US_patents_1976-Sep2016_\/5104873"},{"issue":"17","key":"987_CR10","doi-asserted-by":"publisher","first-page":"2465","DOI":"10.1093\/bioinformatics\/btr413","volume":"27","author":"QN Hu","year":"2011","unstructured":"Hu QN, Deng Z, Hu H, Cao DS, Liang YZ (2011) RxnFinder: biochemical reaction search engines using molecular structures, molecular fragments and reaction similarity. Bioinformatics 27(17):2465\u20132467","journal-title":"Bioinformatics"},{"issue":"8","key":"987_CR11","doi-asserted-by":"publisher","first-page":"5079","DOI":"10.3390\/molecules15085079","volume":"15","author":"N Haider","year":"2010","unstructured":"Haider N (2010) Functionality pattern matching as an efficient complementary structure\/reaction search tool: an open-source approach. Molecules 15(8):5079\u20135092","journal-title":"Molecules"},{"issue":"1","key":"987_CR12","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1186\/s13321-018-0295-6","volume":"10","author":"N Kochev","year":"2018","unstructured":"Kochev N, Avramova S, Jeliazkova N (2018) Ambit-SMIRKS: a software module for reaction representation, reaction search and structure transformation. J Cheminformatics 10(1):42","journal-title":"J Cheminformatics"},{"issue":"5","key":"987_CR13","doi-asserted-by":"publisher","DOI":"10.1002\/wcms.1604","volume":"12","author":"P Schwaller","year":"2022","unstructured":"Schwaller P, Vaucher AC, Laplaza R, Bunne C, Krause A, Corminboeuf C et al (2022) Machine intelligence for chemical reaction space. Wiley Interdiscip Rev Comput Mol Sci 12(5):e1604","journal-title":"Wiley Interdiscip Rev Comput Mol Sci"},{"issue":"1","key":"987_CR14","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1007\/s40747-020-00212-w","volume":"7","author":"Q Zhang","year":"2021","unstructured":"Zhang Q, Lu J, Jin Y (2021) Artificial intelligence in recommender systems. Complex Intell Syst 7(1):439\u2013457","journal-title":"Complex Intell Syst"},{"key":"987_CR15","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1186\/s13321-020-00460-5","volume":"12","author":"L David","year":"2020","unstructured":"David L, Thakkar A, Mercado R, Engkvist O (2020) Molecular representations in AI-driven drug discovery: a review and practical guide. J Cheminformatics 12:56","journal-title":"J Cheminformatics"},{"key":"987_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.compchemeng.2023.108202","volume":"172","author":"AM Schweidtmann","year":"2023","unstructured":"Schweidtmann AM, Rittig JG, Weber JM, Grohe M, Dahmen M, Leonhard K et al (2023) Physical pooling functions in graph neural networks for molecular property prediction. Comput Chem Eng 172:108202","journal-title":"Comput Chem Eng"},{"issue":"11","key":"987_CR17","doi-asserted-by":"publisher","first-page":"1465","DOI":"10.1021\/acscentsci.8b00357","volume":"4","author":"H Gao","year":"2018","unstructured":"Gao H, Struble TJ, Coley CW, Wang Y, Green WH, Jensen KF (2018) Using machine learning to predict suitable conditions for organic reactions. ACS Cent Sci 4(11):1465\u20131476","journal-title":"ACS Cent Sci"},{"issue":"1","key":"987_CR18","doi-asserted-by":"publisher","first-page":"156","DOI":"10.1021\/acs.jcim.0c01234","volume":"61","author":"MR Maser","year":"2021","unstructured":"Maser MR, Cui AY, Ryou S, DeLano TJ, Yue Y, Reisman SE (2021) Multilabel classification models for the prediction of cross-coupling reaction conditions. J Chem Inf Model 61(1):156\u2013166","journal-title":"J Chem Inf Model"},{"key":"987_CR19","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1186\/s13321-024-00805-4","volume":"16","author":"LY Chen","year":"2024","unstructured":"Chen LY, Li YP (2024) Enhancing chemical synthesis: a two-stage deep neural network for predicting feasible reaction conditions. J Cheminformatics 16:11","journal-title":"J Cheminformatics"},{"issue":"1","key":"987_CR20","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1021\/ci5006614","volume":"55","author":"N Schneider","year":"2015","unstructured":"Schneider N, Lowe DM, Sayle RA, Landrum GA (2015) Development of a novel fingerprint for chemical reactions and its application to large-scale reaction classification and similarity. J Chem Inf Model 55(1):39\u201353","journal-title":"J Chem Inf Model"},{"issue":"6","key":"987_CR21","doi-asserted-by":"publisher","first-page":"1379","DOI":"10.1016\/j.chempr.2020.02.017","volume":"6","author":"F Sandfort","year":"2020","unstructured":"Sandfort F, Strieth-Kalthoff F, K\u00fchnemund M, Beecks C, Glorius F (2020) A structure-based platform for predicting chemical reactivity. Chem 6(6):1379\u20131390","journal-title":"Chem"},{"issue":"2","key":"987_CR22","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1039\/D1DD00006C","volume":"1","author":"D Probst","year":"2022","unstructured":"Probst D, Schwaller P, Reymond JL (2022) Reaction classification and yield prediction using the differential reaction fingerprint DRFP. Digital Discov 1(2):91\u201397","journal-title":"Digital Discov"},{"issue":"2","key":"987_CR23","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1038\/s42256-020-00284-w","volume":"3","author":"P Schwaller","year":"2021","unstructured":"Schwaller P, Probst D, Vaucher AC, Nair VH, Kreutter D, Laino T et al (2021) Mapping the space of chemical reactions using attention-based neural networks. Nat Mach Intell 3(2):144\u2013152","journal-title":"Nat Mach Intell"},{"key":"987_CR24","unstructured":"Wang H, Li W, Jin X, Cho K, Ji H, Han J, et al (2022) Chemical-reaction-aware molecule representation learning. In: Proceedings of International Conference on Learning Representations"},{"issue":"5","key":"987_CR25","doi-asserted-by":"publisher","first-page":"1446","DOI":"10.1039\/D1SC06515G","volume":"13","author":"M Wen","year":"2022","unstructured":"Wen M, Blau SM, Xie X, Dwaraknath S, Persson KA (2022) Improving machine learning performance on small chemical reaction data with unsupervised contrastive pretraining. Chem Sci 13(5):1446\u20131458","journal-title":"Chem Sci"},{"issue":"6","key":"987_CR26","doi-asserted-by":"publisher","first-page":"1945","DOI":"10.1021\/acs.jcim.4c00157","volume":"64","author":"J Xie","year":"2024","unstructured":"Xie J, Wang Y, Rao J, Zheng S, Yang Y (2024) Self-supervised contrastive molecular representation learning with a chemical synthesis knowledge graph. J Chem Inf Model 64(6):1945\u20131954","journal-title":"J Chem Inf Model"},{"key":"987_CR27","unstructured":"Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: Proceedings of International Conference on Machine Learning; pp 1597\u20131607"},{"key":"987_CR28","doi-asserted-by":"crossref","unstructured":"Hu H, Wang X, Zhang Y, Chen Q, Guan Q (2024) A comprehensive survey on contrastive learning. Neurocomputing 128645","DOI":"10.1016\/j.neucom.2024.128645"},{"issue":"14","key":"987_CR29","doi-asserted-by":"publisher","first-page":"5143","DOI":"10.1039\/D3SC04928K","volume":"15","author":"E King-Smith","year":"2024","unstructured":"King-Smith E (2024) Transfer learning for a foundational chemistry model. Chem Sci 15(14):5143\u20135151","journal-title":"Chem Sci"},{"key":"987_CR30","doi-asserted-by":"crossref","unstructured":"Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition; pp 815\u2013823","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"987_CR31","doi-asserted-by":"crossref","unstructured":"Trivedi P, Lubana ES, Yan Y, Yang Y, Koutra D (2022) Augmentations in graph contrastive learning: current methodological flaws & towards better practices. In: Proceedings of the ACM Web Conference; pp 1538\u20131549","DOI":"10.1145\/3485447.3512200"},{"key":"987_CR32","first-page":"5812","volume":"33","author":"Y You","year":"2020","unstructured":"You Y, Chen T, Sui Y, Chen T, Wang Z, Shen Y (2020) Graph contrastive learning with augmentations. Adv Neural Inf Process Syst 33:5812\u20135823","journal-title":"Adv Neural Inf Process Syst"},{"key":"987_CR33","first-page":"7372","volume":"36","author":"N Lee","year":"2022","unstructured":"Lee N, Lee J, Park C (2022) Augmentation-free self-supervised learning on graphs. In: Proceedings of AAAI Conference on Artificial Intelligence; pp 7372\u20137380","journal-title":"Proc AAAI Conf Artif Intell"},{"issue":"4","key":"987_CR34","doi-asserted-by":"publisher","first-page":"3005","DOI":"10.1007\/s10462-022-10246-w","volume":"56","author":"E Mosqueira-Rey","year":"2023","unstructured":"Mosqueira-Rey E, Hern\u00e1ndez-Pereira E, Alonso-R\u00edos D, Bobes-Bascar\u00e1n J, Fern\u00e1ndez-Leal \u00c1 (2023) Human-in-the-loop machine learning: a state of the art. Artif Intell Rev 56(4):3005\u20133054","journal-title":"Artif Intell Rev"},{"issue":"1","key":"987_CR35","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1021\/acs.jcim.9b00949","volume":"60","author":"S Zheng","year":"2019","unstructured":"Zheng S, Rao J, Zhang Z, Xu J, Yang Y (2019) Predicting retrosynthetic reactions using self-corrected transformer neural networks. J Chem Inf Model 60(1):47\u201355","journal-title":"J Chem Inf Model"},{"key":"987_CR36","unstructured":"Xu K, Hu W, Leskovec J, Jegelka S (2019) How powerful are graph neural networks? In: Proceedings of International Conference on Learning Representations"},{"key":"987_CR37","unstructured":"Hu W, Liu B, Gomes J, Zitnik M, Liang P, Pande V, et\u00a0al (2020) Strategies for pre-training graph neural networks. In: Proceedings of International Conference on Learning Representations"},{"issue":"1","key":"987_CR38","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1021\/acs.jcim.7b00616","volume":"58","author":"S Jaeger","year":"2018","unstructured":"Jaeger S, Fulle S, Turk S (2018) Mol2vec: unsupervised machine learning approach with chemical intuition. J Chem Inf Model 58(1):27\u201335","journal-title":"J Chem Inf Model"},{"key":"987_CR39","unstructured":"Fabian B, Edlich T, Gaspar H, Segler M, Meyers J, Fiscato M, et\u00a0al (2020) Molecular representation learning with language models and domain-relevant auxiliary tasks. In: Proceedings of NeurIPS Workshop on Machine Learning for Molecules"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-00987-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-025-00987-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-025-00987-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T12:57:54Z","timestamp":1744289874000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-025-00987-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,10]]},"references-count":39,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["987"],"URL":"https:\/\/doi.org\/10.1186\/s13321-025-00987-5","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,10]]},"assertion":[{"value":"21 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 March 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 April 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"51"}}