{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T16:54:32Z","timestamp":1781715272037,"version":"3.54.5"},"reference-count":125,"publisher":"IOP Publishing","issue":"3","license":[{"start":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T00:00:00Z","timestamp":1759104000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T00:00:00Z","timestamp":1759104000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/iopscience.iop.org\/info\/page\/text-and-data-mining"}],"funder":[{"DOI":"10.13039\/100007767","name":"Material Measurement Laboratory","doi-asserted-by":"crossref","award":["70NANB24H049 \/ MML24-1001"],"award-info":[{"award-number":["70NANB24H049 \/ MML24-1001"]}],"id":[{"id":"10.13039\/100007767","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000105","name":"Office of Advanced Cyberinfrastructure","doi-asserted-by":"crossref","award":["#2209892"],"award-info":[{"award-number":["#2209892"]}],"id":[{"id":"10.13039\/100000105","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["iopscience.iop.org"],"crossmark-restriction":false},"short-container-title":["Mach. Learn.: Sci. Technol."],"published-print":{"date-parts":[[2025,9,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Large language models (LLMs) are reshaping many aspects of materials science and chemistry research, enabling advances in molecular property prediction, materials design, scientific automation, knowledge extraction, and more. Recent developments demonstrate that the latest class of models are able to integrate structured and unstructured data, assist in hypothesis generation, and streamline research workflows. To explore the frontier of LLM capabilities across the research lifecycle, we review applications of LLMs through 32 total projects developed during the second annual LLM hackathon for applications in materials science and chemistry, a global hybrid event. These projects spanned seven key research areas: (1) molecular and material property prediction, (2) molecular and material design, (3) automation and novel interfaces, (4) scientific communication and education, (5) research data management and automation, (6) hypothesis generation and evaluation, and (7) knowledge extraction and reasoning from the scientific literature. Collectively, these applications illustrate how LLMs serve as versatile predictive models, platforms for rapid prototyping of domain-specific tools, and much more. In particular, improvements in both open source and proprietary LLM performance through the addition of reasoning, additional training data, and new techniques have expanded effectiveness, particularly in low-data environments and interdisciplinary research. As LLMs continue to improve, their integration into scientific workflows presents both new opportunities and new challenges, requiring ongoing exploration, continued refinement, and further research to address reliability, interpretability, and reproducibility.<\/jats:p>","DOI":"10.1088\/2632-2153\/ae011a","type":"journal-article","created":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T22:53:49Z","timestamp":1756508029000},"page":"030701","update-policy":"https:\/\/doi.org\/10.1088\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["32 examples of LLM applications in materials science and chemistry: towards automation, assistants, agents, and accelerated scientific discovery"],"prefix":"10.1088","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-1720-4368","authenticated-orcid":true,"given":"Yoel","family":"Zimmermann","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6475-8505","authenticated-orcid":false,"given":"Adib","family":"Bazgir","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-8377-7049","authenticated-orcid":false,"given":"Alexander","family":"Al-Feghali","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5696-9193","authenticated-orcid":false,"given":"Mehrad","family":"Ansari","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7523-152X","authenticated-orcid":false,"given":"Joshua","family":"Bocarsly","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2551-1563","authenticated-orcid":false,"given":"L Catherine","family":"Brinson","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4017-7084","authenticated-orcid":false,"given":"Yuan","family":"Chiang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5761-0198","authenticated-orcid":false,"given":"Defne","family":"Circi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0637-7856","authenticated-orcid":false,"given":"Min-Hsueh","family":"Chiu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7647-1816","authenticated-orcid":true,"given":"Nathan","family":"Daelman","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1182-9098","authenticated-orcid":true,"given":"Matthew L","family":"Evans","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Abhijeet S","family":"Gangan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8907-0336","authenticated-orcid":false,"given":"Janine","family":"George","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6016-3122","authenticated-orcid":true,"given":"Hassan","family":"Harb","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-2476-8043","authenticated-orcid":false,"given":"Ghazal","family":"Khalighinejad","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-2131-9700","authenticated-orcid":true,"given":"Sartaaj","family":"Takrim Khan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4850-776X","authenticated-orcid":true,"given":"Sascha","family":"Klawohn","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0665-1839","authenticated-orcid":true,"given":"Magdalena","family":"Lederbauer","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8879-5431","authenticated-orcid":false,"given":"Soroush","family":"Mahjoubi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0903-0073","authenticated-orcid":false,"given":"Bernadette","family":"Mohr","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0357-5729","authenticated-orcid":true,"given":"Seyed","family":"Mohamad Moosavi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6071-6786","authenticated-orcid":true,"given":"Aakash","family":"Naik","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0281-3860","authenticated-orcid":true,"given":"Aleyna","family":"Beste Ozhan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8906-8447","authenticated-orcid":true,"given":"Dieter","family":"Plessers","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0243-9124","authenticated-orcid":true,"given":"Aritra","family":"Roy","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Fabian","family":"Sch\u00f6ppach","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3046-6576","authenticated-orcid":false,"given":"Philippe","family":"Schwaller","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-3786-0773","authenticated-orcid":true,"given":"Carla","family":"Terboven","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2967-1182","authenticated-orcid":true,"given":"Katharina","family":"Ueltzen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2874-8267","authenticated-orcid":true,"given":"Yue","family":"Wu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8433-8599","authenticated-orcid":false,"given":"Shang","family":"Zhu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9948-7119","authenticated-orcid":false,"given":"Jan","family":"Janssen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Calvin","family":"Li","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2129-5269","authenticated-orcid":true,"given":"Ian","family":"Foster","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5326-4902","authenticated-orcid":true,"given":"Ben","family":"Blaiszik","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"266","published-online":{"date-parts":[[2025,9,29]]},"reference":[{"key":"mlstae011abib1","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1038\/s42256-023-00788-1","type":"journal-article","article-title":"Leveraging large language models for predictive chemistry","volume":"6","author":"Jablonka","year":"2024","journal-title":"Nat. Mach. Intell."},{"key":"mlstae011abib2","doi-asserted-by":"publisher","first-page":"7086","DOI":"10.1021\/acs.jcim.4c01396","type":"journal-article","article-title":"Large language models as molecular design engines","volume":"64","author":"Bhattacharya","year":"2024","journal-title":"J. Chem. Inf. Mod."},{"key":"mlstae011abib3","doi-asserted-by":"publisher","first-page":"6909","DOI":"10.1021\/acs.jpclett.4c01126","type":"journal-article","article-title":"AtomGPT: atomistic generative pretrained transformer for forward and inverse materials design","volume":"15","author":"Choudhary","year":"2024","journal-title":"J. Phys. Chem. Lett."},{"key":"mlstae011abib4","first-page":"pp 556","type":"conference-proceedings","article-title":"Comparative study of large language model architectures on frontier","author":"Yin","year":"2024"},{"key":"mlstae011abib5","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1038\/s42256-024-00832-8","type":"journal-article","article-title":"Augmenting large language models with chemistry tools","volume":"6","author":"Bran","year":"2024","journal-title":"Nat. Mach. Intell."},{"key":"mlstae011abib6","article-title":"A review of large language models and autonomous agents in chemistry","author":"Ramos","year":"2024","type":"preprint"},{"key":"mlstae011abib7","doi-asserted-by":"publisher","first-page":"9633","DOI":"10.1021\/acs.chemrev.4c00055","type":"journal-article","article-title":"Self-driving laboratories for chemistry and materials science","volume":"124","author":"Tom","year":"2024","journal-title":"Chem. Rev."},{"key":"mlstae011abib8","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1021\/acs.jcim.4c01653","type":"journal-article","article-title":"Toward automated simulation research workflow through LLM prompt engineering design","volume":"65","author":"Liu","year":"2024","journal-title":"J. Chem. Inf. Model."},{"key":"mlstae011abib9","article-title":"Advancing the scientific method with large language models: from hypothesis to discovery","author":"Zhang","year":"2025","type":"preprint"},{"key":"mlstae011abib10","article-title":"A survey on hypothesis generation for scientific discovery in the era of large language models","author":"Alkan","year":"2025","type":"preprint"},{"key":"mlstae011abib11","first-page":"pp 117","type":"conference-proceedings","article-title":"Hypothesis generation with large language models","author":"Zhou","year":"2024"},{"key":"mlstae011abib12","article-title":"Training a scientific reasoning model for chemistry","author":"Narayanan","year":"2025","type":"preprint"},{"key":"mlstae011abib13","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1038\/s41524-022-00784-w","type":"journal-article","article-title":"Matscibert: a materials domain language model for text mining and information extraction","volume":"8","author":"Gupta","year":"2022","journal-title":"npj Comput. Mater."},{"key":"mlstae011abib14","article-title":"Alchembert: exploring lightweight language models for materials informatics","author":"Liu","year":"2025","type":"other"},{"key":"mlstae011abib15","article-title":"Llm-prop: predicting physical and electronic properties of crystalline solids from their text descriptions","author":"Rubungo","year":"2023","type":"preprint"},{"key":"mlstae011abib16","doi-asserted-by":"publisher","first-page":"0","DOI":"10.1002\/anie.202423950","type":"journal-article","article-title":"Explainable synthesizability prediction of inorganic crystal polymorphs using large language models","volume":"64","author":"Kim","year":"2024","journal-title":"Angew. Chem., Int. Ed."},{"key":"mlstae011abib17","doi-asserted-by":"publisher","first-page":"874","DOI":"10.1557\/mrc.2019.94","type":"journal-article","article-title":"Robocrystallographer: automated crystal structure text descriptions and analysis","volume":"9","author":"Ganose","year":"2019","journal-title":"MRS Commun."},{"key":"mlstae011abib18","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1038\/s41524-020-00406-3","type":"journal-article","article-title":"Benchmarking materials property prediction methods: the matbench test set and automatminer reference algorithm","volume":"6","author":"Dunn","year":"2020","journal-title":"npj Comput. Mater."},{"key":"mlstae011abib19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/sdata.2018.65","type":"journal-article","article-title":"High-throughput density-functional perturbation theory phonons for inorganic materials","volume":"5","author":"Petretto","year":"2018","journal-title":"Sci. Data"},{"key":"mlstae011abib20","article-title":"Matagent: a human-in-the-loop multi-agent llm framework for accelerating the material science discovery cycle","author":"Bazgir","year":"2025","type":"other"},{"key":"mlstae011abib21","article-title":"Regression with large language models for materials and molecular property prediction","author":"Jacobs","year":"2024","type":"other"},{"key":"mlstae011abib22","article-title":"Llm4mat-bench: benchmarking large language models for materials property prediction","author":"Rubungo","year":"2024","type":"preprint"},{"key":"mlstae011abib23","article-title":"Can large language models empower molecular property prediction?","author":"Qian","year":"2023","type":"other"},{"key":"mlstae011abib24","doi-asserted-by":"publisher","first-page":"570","DOI":"10.1038\/s41586-023-06792-0","type":"journal-article","article-title":"Autonomous chemical research with large language models","volume":"624","author":"Boiko","year":"2023","journal-title":"Nature"},{"key":"mlstae011abib25","first-page":"2505","type":"other","article-title":"Robin: a multi-agent system for automating scientific discovery","author":"Ghareeb","year":"2025"},{"key":"mlstae011abib26","article-title":"Language agents achieve superhuman synthesis of scientific knowledge","author":"Skarlinski","year":"2024","type":"preprint"},{"key":"mlstae011abib27","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2024.findings-emnlp.192","type":"other","article-title":"Honeycomb: a flexible llm-based agent system for materials science","author":"Zhang","year":"2024"},{"key":"mlstae011abib28","article-title":"The llama 3 herd of models","author":"Grattafiori","year":"2024","type":"other"},{"key":"mlstae011abib29","article-title":"Phi-3 technical report: a highly capable language model locally on your phone","author":"Abdin","year":"2024","type":"other"},{"key":"mlstae011abib30","article-title":"The claude 3 model family: opus, sonnet, haiku","author":"Anthropic","year":"2024","type":"other"},{"key":"mlstae011abib31","article-title":"Mixtral of experts","author":"Jiang","year":"2024","type":"other"},{"key":"mlstae011abib32","article-title":"Gpt-4-turbo and gpt-4","author":"OpenAI","year":"2023","type":"other"},{"key":"mlstae011abib33","article-title":"Gpt-3.5-turbo","author":"OpenAI","year":"2023","type":"other"},{"key":"mlstae011abib34","article-title":"OpenAI","author":"Achiam","year":"2024","type":"other"},{"key":"mlstae011abib35","first-page":"5","type":"conference-proceedings","article-title":"How to support newcomers in scientific hackathons - an action research study on expert mentoring","volume":"vol 4","author":"Nolte","year":"2020"},{"key":"mlstae011abib36","first-page":"pp 27","type":"book","article-title":"Understanding hackathons for science: collaboration, affordances and outcomes","author":"Pe-Than","year":"2019"},{"key":"mlstae011abib37","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1186\/s13731-023-00269-0","type":"journal-article","article-title":"Hack your organizational innovation: literature review and integrative model for running hackathons","volume":"12","author":"Heller","year":"2023","journal-title":"J. Innov. Entrepreneurship"},{"key":"mlstae011abib38","doi-asserted-by":"publisher","first-page":"1233","DOI":"10.1039\/D3DD00113J","type":"journal-article","article-title":"14 examples of how llms can transform materials science and chemistry: a reflection on a large language model hackathon","volume":"2","author":"Jablonka","year":"2023","journal-title":"Dig. Discovery"},{"key":"mlstae011abib39","article-title":"Reflections from the 2024 large language model (LLM) hackathon for applications in materials science and chemistry","author":"Zimmermann","year":"2025","type":"other"},{"key":"mlstae011abib40","article-title":"Language models are few-shot learners","author":"Brown","year":"2020","type":"other"},{"key":"mlstae011abib41","article-title":"From words to numbers: Your large language model is secretly a capable regressor when given in-context examples","author":"Vacareanu","year":"2024","type":"conference-proceedings"},{"key":"mlstae011abib42","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1038\/s41597-023-02477-5","type":"journal-article","article-title":"A quantum-chemical bonding database for solid-state materials","volume":"10","author":"Naik","year":"2023","journal-title":"Sci. Data"},{"key":"mlstae011abib43","doi-asserted-by":"publisher","first-page":"6286","DOI":"10.21105\/joss.06286","type":"journal-article","article-title":"Lobsterpy: A package to automatically analyze lobster runs","volume":"9","author":"Naik","year":"2024","journal-title":"J. Open Source Softw."},{"key":"mlstae011abib44","article-title":"A quantum-chemical bonding database for solid-state materials (JSONS: Part 1)","author":"Naik","year":"2023","type":"other"},{"key":"mlstae011abib45","article-title":"The matbench test suite, phonon dataset","author":"Matbench","year":"2024","type":"other"},{"key":"mlstae011abib46","first-page":"1","type":"journal-article","article-title":"Exploring the limits of transfer learning with a unified text-to-text transformer","volume":"21","author":"Raffel","year":"2020","journal-title":"J. Mach. Learn. Res."},{"key":"mlstae011abib47","article-title":"The unsloth package","author":"Han","year":"2024","type":"other"},{"key":"mlstae011abib48","article-title":"Multimodal large language models for inverse molecular design with retrosynthetic planning","author":"Liu","year":"2024","type":"other"},{"key":"mlstae011abib49","article-title":"Llmatdesign: Autonomous materials discovery with large language models","author":"Jia","year":"2024","type":"other"},{"key":"mlstae011abib50","article-title":"Can llms generate diverse molecules? towards alignment with structural diversity","author":"Jang","year":"2025","type":"other"},{"key":"mlstae011abib51","doi-asserted-by":"crossref","DOI":"10.26434\/chemrxiv-2024-z29m3","type":"other","article-title":"Generative design of functional metal complexes utilizing the internal knowledge of large language models","author":"Lu","year":"2024"},{"key":"mlstae011abib52","first-page":"pp 25603","type":"conference-proceedings","article-title":"A sober look at LLMs for material discovery: are they actually good for Bayesian optimization over molecules?","volume":"vol 235","author":"Kristiadi","year":"2024"},{"key":"mlstae011abib53","article-title":"Are llms ready for real-world materials discovery?","author":"Miret","year":"2024","type":"other"},{"key":"mlstae011abib54","doi-asserted-by":"publisher","DOI":"10.1016\/j.rser.2022.112441","type":"journal-article","article-title":"Review on applications of metal\u2013organic frameworks for co2 capture and the performance enhancement mechanisms","volume":"162","author":"Li","year":"2022","journal-title":"Renew. Sustain. Energy Rev."},{"key":"mlstae011abib55","article-title":"ReAct: synergizing reasoning and acting in language models","author":"Yao","year":"2023","type":"conference-proceedings"},{"key":"mlstae011abib56","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","type":"journal-article","article-title":"Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules","volume":"28","author":"Weininger","year":"1988","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"mlstae011abib57","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1021\/ci00062a008","type":"journal-article","article-title":"Smiles. 2. algorithm for generation of unique smiles notation","volume":"29","author":"Weininger","year":"1989","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"mlstae011abib58","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1021\/ci00067a005","type":"journal-article","article-title":"Smiles. 3. depict. graphical depiction of chemical structures","volume":"30","author":"Weininger","year":"1990","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"mlstae011abib59","article-title":"dziner: rational inverse design of materials with ai agents","author":"Ansari","year":"2024","type":"preprint"},{"key":"mlstae011abib60","doi-asserted-by":"publisher","DOI":"10.1002\/adma.201605071","type":"journal-article","article-title":"Semiconductor metal\u2013organic frameworks: future low-bandgap materials","volume":"29","author":"Usman","year":"2017","journal-title":"Adv. Mater."},{"key":"mlstae011abib61","doi-asserted-by":"publisher","first-page":"20610","DOI":"10.1021\/jp405335q","type":"journal-article","article-title":"Band gap modulations in uio metal\u2013organic frameworks","volume":"117","author":"Flage-Larsen","year":"2013","journal-title":"J. Phys. Chem. C"},{"key":"mlstae011abib62","doi-asserted-by":"publisher","first-page":"2532","DOI":"10.1021\/cg500243s","type":"journal-article","article-title":"Band gap engineering of paradigm mof-5","volume":"14","author":"Yang","year":"2014","journal-title":"Cryst. Growth Des."},{"key":"mlstae011abib63","doi-asserted-by":"publisher","first-page":"10283","DOI":"10.1021\/ic100694w","type":"journal-article","article-title":"Theoretical investigations on the chemical bonding, electronic structure and optical properties of the metal- organic framework mof-5","volume":"49","author":"Yang","year":"2010","journal-title":"Inorg. Chem."},{"key":"mlstae011abib64","doi-asserted-by":"publisher","first-page":"1190","DOI":"10.1002\/er.5807","type":"journal-article","article-title":"Recent advancements in mof-based catalysts for applications in electrochemical and photoelectrochemical water splitting: a review","volume":"45","author":"Ali","year":"2021","journal-title":"Int. J. Energy Res."},{"key":"mlstae011abib65","doi-asserted-by":"publisher","first-page":"42845","DOI":"10.1021\/acsami.3c08470","type":"journal-article","article-title":"Tuning electrical and mechanical properties of metal\u2013organic frameworks by metal substitution","volume":"15","author":"Yan","year":"2023","journal-title":"ACS Appl. Mater. Interfaces"},{"key":"mlstae011abib66","doi-asserted-by":"publisher","first-page":"9039","DOI":"10.1021\/ic301189m","type":"journal-article","article-title":"Tunability of band gaps in metal\u2013organic frameworks","volume":"51","author":"Lin","year":"2012","journal-title":"Inorg. Chem."},{"key":"mlstae011abib67","article-title":"New and improved embedding model","author":"Greene","year":"2022","type":"other"},{"key":"mlstae011abib68","doi-asserted-by":"publisher","first-page":"2607","DOI":"10.1039\/D4DD00252K","type":"journal-article","article-title":"Agent-based learning of materials datasets from the scientific literature","volume":"3","author":"Ansari","year":"2024","journal-title":"Digital Discovery"},{"key":"mlstae011abib69","doi-asserted-by":"publisher","first-page":"2958","DOI":"10.1021\/jacs.2c11420","type":"journal-article","article-title":"Moformer: self-supervised transformer model for metal\u2013organic framework property prediction","volume":"145","author":"Cao","year":"2023","journal-title":"J. Am. Chem. Soc."},{"key":"mlstae011abib70","first-page":"pp 12310","type":"conference-proceedings","article-title":"Barlow twins: self-supervised learning via redundancy reduction","author":"Zbontar","year":"2021"},{"key":"mlstae011abib71","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.120.145301","type":"journal-article","article-title":"Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties","volume":"120","author":"Xie","year":"2018","journal-title":"Phys. Rev. Lett."},{"key":"mlstae011abib72","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-020-17755-8","type":"journal-article","article-title":"Understanding the diversity of the metal-organic framework ecosystem","volume":"11","author":"Moosavi","year":"2020","journal-title":"Nat. Commun."},{"key":"mlstae011abib73","doi-asserted-by":"publisher","first-page":"1578","DOI":"10.1016\/j.matt.2021.02.015","type":"journal-article","article-title":"Machine learning the quantum-chemical properties of metal\u2013organic frameworks for accelerated materials discovery","volume":"4","author":"Rosen","year":"2021","journal-title":"Matter"},{"key":"mlstae011abib74","first-page":"4","type":"journal-article","article-title":"Rdkit documentation","volume":"1","author":"Landrum","year":"2013","journal-title":"Release"},{"key":"mlstae011abib75","first-page":"10","type":"other","article-title":"Langchain","author":"Chase","year":"2022"},{"key":"mlstae011abib76","article-title":"Restgpt: connecting large language models with real-world restful apis","author":"Song","year":"2023","type":"other"},{"key":"mlstae011abib77","doi-asserted-by":"publisher","DOI":"10.1016\/j.matt.2024.10.015","type":"journal-article","article-title":"Organa: a robotic assistant for automated chemistry experimentation and characterization","volume":"8","author":"Darvish","year":"2025","journal-title":"Matter"},{"key":"mlstae011abib78","article-title":"Langsim","author":"Project","year":"2024","type":"other"},{"key":"mlstae011abib79","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1038\/s41524-018-0122-7","type":"journal-article","article-title":"A machine learning approach to model solute grain boundary segregation","volume":"4","author":"Huber","year":"2018","journal-title":"npj Comput. Mater."},{"key":"mlstae011abib80","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1016\/j.commatsci.2018.07.043","type":"journal-article","article-title":"pyiron: an integrated development environment for computational materials science","volume":"163","author":"Janssen","year":"2019","journal-title":"Comput. Mater. Sci."},{"key":"mlstae011abib81","article-title":"A foundation model for atomistic materials chemistry","author":"Batatia","year":"2023","type":"preprint"},{"key":"mlstae011abib82","doi-asserted-by":"publisher","DOI":"10.1088\/1361-651X\/ad4d0d","type":"journal-article","article-title":"Roadmap on data-centric materials science","volume":"32","author":"Bauer","year":"2024","journal-title":"Modelling Simul. Mater. Sci. Eng."},{"key":"mlstae011abib83","article-title":"Leveraging large language models and social media for automation in scanning probe microscopy","author":"Diao","year":"2024","type":"preprint"},{"key":"mlstae011abib84","doi-asserted-by":"publisher","first-page":"02LT01","DOI":"10.1088\/2632-2153\/ad52e9","type":"journal-article","article-title":"Synergizing human expertise and ai efficiency with language model for microscopy operation and automated experiment design","volume":"5","author":"Liu","year":"2024","journal-title":"Mach. Learn.: Sci. Technol."},{"key":"mlstae011abib85","doi-asserted-by":"publisher","first-page":"24","DOI":"10.12688\/openreseurope.13015.1","type":"journal-article","article-title":"The abtem code: transmission electron microscopy from first principles","volume":"1","author":"Madsen","year":"2021","journal-title":"Open Res. Eur."},{"key":"mlstae011abib86","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1017\/S143192761900134X","type":"journal-article","article-title":"Nion swift: open source image processing software for instrument control, data acquisition, organization, visualization and analysis using python","volume":"25","author":"Meyer","year":"2019","journal-title":"Microsc. Microanal."},{"key":"mlstae011abib87","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1111\/bjet.13370","type":"journal-article","article-title":"Practical and ethical challenges of large language models in education: a systematic scoping review","volume":"55","author":"Yan","year":"2023","journal-title":"Br. J. Educ. Technol."},{"key":"mlstae011abib88","article-title":"Large language models for education: a survey and outlook","author":"Wang","year":"2024","type":"other"},{"key":"mlstae011abib89","doi-asserted-by":"publisher","DOI":"10.1016\/j.lindif.2023.102274","type":"journal-article","article-title":"Chatgpt for good? on opportunities and challenges of large language models for education","volume":"103","author":"Kasneci","year":"2023","journal-title":"Learn. Ind. Diff."},{"key":"mlstae011abib90","doi-asserted-by":"publisher","first-page":"Y02","DOI":"10.5167\/uzh-237847","type":"journal-article","article-title":"The notorious gpt: science communication in the age of artificial intelligence","volume":"22","author":"Sch\u00e4fer","year":"2023","journal-title":"J. Sci. Commun."},{"key":"mlstae011abib91","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1039\/D3DD00188A","type":"journal-article","article-title":"Mascqa: investigating materials science knowledge of large language models","volume":"3","author":"Zaki","year":"2024","journal-title":"Dig. Discovery"},{"key":"mlstae011abib92","article-title":"Benchmarking large language models for math reasoning tasks","author":"Se\u00dfler","year":"2024","type":"preprint"},{"key":"mlstae011abib93","article-title":"Self-consistency improves chain of thought reasoning in language models","author":"Wang","year":"2022","type":"conference-proceedings"},{"key":"mlstae011abib94","first-page":"9459","type":"conference-proceedings","article-title":"Retrieval-augmented generation for knowledge-intensive nlp tasks","volume":"vol 33","author":"Lewis","year":"2020"},{"key":"mlstae011abib95","article-title":"Foundational large language models for materials research","author":"Mishra","year":"2024","type":"preprint"},{"key":"mlstae011abib96","doi-asserted-by":"publisher","DOI":"10.1088\/2515-7639\/ab13bb","type":"journal-article","article-title":"The nomad laboratory: from data sharing to artificial intelligence","volume":"2","author":"Draxl","year":"2019","journal-title":"J. Phys. Mater."},{"key":"mlstae011abib97","article-title":"datalab","author":"Evans","year":"2024","type":"other"},{"key":"mlstae011abib98","doi-asserted-by":"publisher","first-page":"5388","DOI":"10.21105\/joss.05388","type":"journal-article","article-title":"Nomad: a distributed web-based platform for managing materials science research data","volume":"8","author":"Scheidgen","year":"2023","journal-title":"J. Open Source Softw."},{"key":"mlstae011abib99","article-title":"Retrieval-augmented generation for large language models: a survey","author":"Gao","year":"2023","type":"preprint"},{"key":"mlstae011abib100","article-title":"Llama: open and efficient foundation language models","author":"Touvron","year":"2023","type":"preprint"},{"key":"mlstae011abib101","article-title":"Scientific hypothesis generation by a large language model: laboratory validation in breast cancer treatment","author":"Abdel-Rehim","year":"2024","type":"other"},{"key":"mlstae011abib102","doi-asserted-by":"publisher","first-page":"896","DOI":"10.1057\/s41599-024-03407-5","type":"journal-article","article-title":"Automating psychological hypothesis generation with ai: when large language models meet causal graph","volume":"11","author":"Tong","year":"2024","journal-title":"Human. Soc. Sci. Commun."},{"key":"mlstae011abib103","article-title":"Harnessing the power of adversarial prompting and large language models for robust hypothesis generation in astronomy","author":"Ciuc\u0103","year":"2023","type":"other"},{"key":"mlstae011abib104","article-title":"Proteinhypothesis: a physics-aware chain of multi-agent rag llm for hypothesis generation in protein science","author":"Bazgir","year":"2025","type":"conference-proceedings"},{"key":"mlstae011abib105","article-title":"Beyond designer\u2019s knowledge: generating materials design hypotheses via large language models","author":"Liu","year":"2024","type":"other"},{"key":"mlstae011abib106","doi-asserted-by":"crossref","DOI":"10.26434\/chemrxiv-2024-lf2xx","type":"other","article-title":"Towards ai research agents in the chemical sciences","author":"Shir","year":"2024"},{"key":"mlstae011abib107","article-title":"Agentichypothesis: a survey on hypothesis generation using llm systems","author":"Bazgir","year":"2025","type":"conference-proceedings"},{"key":"mlstae011abib108","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2024.findings-acl.804","type":"other","article-title":"Large language models for automated open-domain scientific hypotheses discovery","author":"Yang","year":"2024"},{"key":"mlstae011abib109","first-page":"pp 11809","type":"book","article-title":"Tree of thoughts: Deliberate problem solving with large language models","volume":"vol 36","author":"Yao","year":"2023"},{"key":"mlstae011abib110","article-title":"Large language models for scientific information extraction: an empirical study for virology","author":"Shamsabadi","year":"2024","type":"other"},{"key":"mlstae011abib111","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1038\/s41467-024-45563-x","type":"journal-article","article-title":"Structured information extraction from scientific text with large language models","volume":"15","author":"Dagdelen","year":"2024","journal-title":"Nat. Commun."},{"key":"mlstae011abib112","doi-asserted-by":"crossref","DOI":"10.1007\/s11704-024-40555-y","type":"other","article-title":"Large language models for generative information extraction: a survey","author":"Xu","year":"2024"},{"key":"mlstae011abib113","first-page":"1","type":"journal-article","article-title":"Generative ai for self-adaptive systems: state of the art and research roadmap","volume":"19","author":"Li","year":"2024","journal-title":"ACM Trans. Auto. Adaptive Syst."},{"key":"mlstae011abib114","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/2024.emnlp-main.880","type":"other","article-title":"Sciagent: tool-augmented language models for scientific reasoning","author":"Ma","year":"2024"},{"key":"mlstae011abib115","article-title":"arxiv api","type":"other"},{"key":"mlstae011abib116","article-title":"Neo4j","year":"2024","type":"other"},{"key":"mlstae011abib117","article-title":"Dspy: Compiling declarative language model calls into self-improving pipelines","author":"Khattab","year":"2023","type":"other"},{"key":"mlstae011abib118","article-title":"Chain-of-thought prompting elicits reasoning in large language models","author":"Wei","year":"2023","type":"other"},{"key":"mlstae011abib119","article-title":"Graph maker","author":"Maker","year":"2024","type":"other"},{"key":"mlstae011abib120","article-title":"Gradio","author":"Gradio","year":"2024","type":"other"},{"key":"mlstae011abib121","article-title":"Isobench: benchmarking multimodal foundation models on isomorphic representations","author":"Fu","year":"2024","type":"other"},{"key":"mlstae011abib122","article-title":"What can large language models do in chemistry? a comprehensive benchmark on eight tasks","author":"Guo","year":"2023","type":"other"},{"key":"mlstae011abib123","article-title":"Chemqa: a multimodal question-and-answering dataset on chemistry reasoning","author":"Zhu","year":"2024","type":"other"},{"key":"mlstae011abib124","article-title":"Gemini: a family of highly capable multimodal models","author":"Team","year":"2024","type":"other"},{"key":"mlstae011abib125","article-title":"The claude 3 model family: opus, sonnet, haiku","type":"other"}],"container-title":["Machine Learning: Science and Technology"],"original-title":[],"link":[{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a","content-type":"text\/html","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"am","intended-application":"similarity-checking"},{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a\/pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,3]],"date-time":"2025-10-03T12:21:58Z","timestamp":1759494118000},"score":1,"resource":{"primary":{"URL":"https:\/\/iopscience.iop.org\/article\/10.1088\/2632-2153\/ae011a"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,29]]},"references-count":125,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2025,9,29]]},"published-print":{"date-parts":[[2025,9,30]]}},"URL":"https:\/\/doi.org\/10.1088\/2632-2153\/ae011a","relation":{},"ISSN":["2632-2153"],"issn-type":[{"value":"2632-2153","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,29]]},"assertion":[{"value":"32 examples of LLM applications in materials science and chemistry: towards automation, assistants, agents, and accelerated scientific discovery","name":"article_title","label":"Article Title"},{"value":"Machine Learning: Science and Technology","name":"journal_title","label":"Journal Title"},{"value":"paper","name":"article_type","label":"Article Type"},{"value":"\u00a9 2025 The Author(s). Published by IOP Publishing Ltd","name":"copyright_information","label":"Copyright Information"},{"value":"2025-05-12","name":"date_received","label":"Date Received","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-08-29","name":"date_accepted","label":"Date Accepted","group":{"name":"publication_dates","label":"Publication dates"}},{"value":"2025-09-29","name":"date_epub","label":"Online publication date","group":{"name":"publication_dates","label":"Publication dates"}}]}}