{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T12:01:39Z","timestamp":1772884899540,"version":"3.50.1"},"reference-count":43,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2025,2,1]],"date-time":"2025-02-01T00:00:00Z","timestamp":1738368000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Agencia Nacional de Investigaci\u00f3n y Desarrollo","award":["Fondef IT20I0127"],"award-info":[{"award-number":["Fondef IT20I0127"]}]},{"name":"Agencia Nacional de Investigaci\u00f3n y Desarrollo","award":["1221420"],"award-info":[{"award-number":["1221420"]}]},{"name":"Agencia Nacional de Investigaci\u00f3n y Desarrollo","award":["ICN17_012"],"award-info":[{"award-number":["ICN17_012"]}]},{"name":"Fondecyt Regular","award":["Fondef IT20I0127"],"award-info":[{"award-number":["Fondef IT20I0127"]}]},{"name":"Fondecyt Regular","award":["1221420"],"award-info":[{"award-number":["1221420"]}]},{"name":"Fondecyt Regular","award":["ICN17_012"],"award-info":[{"award-number":["ICN17_012"]}]},{"name":"Millennium Science Initiative Program","award":["Fondef IT20I0127"],"award-info":[{"award-number":["Fondef IT20I0127"]}]},{"name":"Millennium Science Initiative Program","award":["1221420"],"award-info":[{"award-number":["1221420"]}]},{"name":"Millennium Science Initiative Program","award":["ICN17_012"],"award-info":[{"award-number":["ICN17_012"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Automated analysis of the scientific literature using natural language processing (NLP) can accelerate the identification of potentially unexplored formulations that enable innovations in materials engineering with fewer experimentation and testing cycles. 
This strategy has been successful for specific classes of inorganic materials, but its general application in broader material domains such as bioplastics remains challenging. To begin addressing this gap, we explore correlations between the ingredients and physicochemical properties of seaweed-based biofilms from a corpus of 2000 article abstracts from the scientific literature since 1958, using a supervised word co-occurrence analysis and an unsupervised approach based on the language model MatBERT without fine-tuning. Using known relations between ingredients and properties for test scenarios, we discuss the potential and limitations of these NLP approaches for identifying novel combinations of polysaccharides, plasticizers, and additives that are related to the functionality of seaweed biofilms. The model demonstrates a valuable predictive ability to identify ingredients associated with increased water vapor permeability, suggesting its potential utility in optimizing formulations for future research. Using the model further revealed alternative combinations that are underrepresented in the literature. This automated method facilitates the mapping of relationships between ingredients and properties, guiding the development of seaweed bioplastic formulations. 
The unstructured and heterogeneous nature of the literature on bioplastics represents a particular challenge that demands ad hoc fine-tuning strategies for state-of-the-art language models for advancing the field of seaweed bioplastics.<\/jats:p>","DOI":"10.3390\/data10020020","type":"journal-article","created":{"date-parts":[[2025,2,3]],"date-time":"2025-02-03T05:36:32Z","timestamp":1738560992000},"page":"20","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Seaweed-Based Bioplastics: Data Mining Ingredient\u2013Property Relations from the Scientific Literature"],"prefix":"10.3390","volume":"10","author":[{"given":"Fernanda","family":"V\u00e9liz","sequence":"first","affiliation":[{"name":"Department of Physics, Universidad de Santiago de Chile, Av Victor Jara 3493, Santiago 9170124, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6202-1438","authenticated-orcid":false,"given":"Thulasi","family":"Bikku","sequence":"additional","affiliation":[{"name":"Department of Physics, Universidad de Santiago de Chile, Av Victor Jara 3493, Santiago 9170124, Chile"},{"name":"Computer Science and Engineering, Amrita School of Computing Amaravati, Amrita Vishwa Vidyapeetham, Amaravati 522503, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1972-6085","authenticated-orcid":false,"given":"Davor","family":"Ibarra-P\u00e9rez","sequence":"additional","affiliation":[{"name":"Department of Mechanical Engineering, University of Santiago of Chile (USACH), Avenida Libertador Bernardo O\u2019Higgins 3363, Santiago 9170022, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0100-8322","authenticated-orcid":false,"given":"Valentina","family":"Hern\u00e1ndez-Mu\u00f1oz","sequence":"additional","affiliation":[{"name":"Department of Industrial Engineering, University 
of Santiago of Chile (USACH), Avenida Libertador Bernardo O\u2019Higgins 3363, Santiago 9170022, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8468-5809","authenticated-orcid":false,"given":"Alysia","family":"Garmulewicz","sequence":"additional","affiliation":[{"name":"Department of Management, Faculty of Management and Economics, University of Santiago of Chile (USACH), Avenida Libertador Bernardo O\u2019Higgins 3363, Estaci\u00f3n Central 9170022, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8121-1931","authenticated-orcid":false,"given":"Felipe","family":"Herrera","sequence":"additional","affiliation":[{"name":"Department of Physics, Universidad de Santiago de Chile, Av Victor Jara 3493, Santiago 9170124, Chile"},{"name":"Millennium Institute for Research in Optics, Concepci\u00f3n 4030000, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,2,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"2153","DOI":"10.1098\/rstb.2009.0053","article-title":"Plastics, the environment and human health: Current consensus and future trends","volume":"364","author":"Thompson","year":"2009","journal-title":"Philos. Trans. R. Soc. Lond. B Biol. Sci."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1016\/j.tifs.2008.07.003","article-title":"Biodegradable polymers for food packaging: A review","volume":"19","author":"Siracusa","year":"2008","journal-title":"Trends Food Sci. Technol."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2082","DOI":"10.1021\/cr200162d","article-title":"Plastics Derived from Biological Sources: Present and Future: A Technical and Environmental Review","volume":"112","author":"Chen","year":"2012","journal-title":"Chem. Rev."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Aleksanyan, K.V. (2023). 
Polysaccharides for Biodegradable Packaging Materials: Past, Present, and Future (Brief Review). Polymers, 15.","DOI":"10.3390\/polym15020451"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1007\/s10068-021-00935-7","article-title":"Improved extraction of carrageenan from red seaweed (Chondracantus canaliculatus) using ultrasound-assisted methods and evaluation of the yield, physicochemical properties and functional groups","volume":"30","year":"2021","journal-title":"Food Sci. Biotechnol."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Lomartire, S., Marques, J.C., and Gon\u00e7alves, A.M.M. (2022). An Overview of the Alternative Use of Seaweeds to Produce Safe and Sustainable Bio-Packaging. Appl. Sci., 12.","DOI":"10.3390\/app12063123"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.enceco.2020.07.003","article-title":"Fabrication of improved cellulose acetate-based biodegradable films for food packaging applications","volume":"2","author":"Rajeswari","year":"2020","journal-title":"Environ. Chem. Ecotoxicol."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Escamilla-Garc\u00eda, M., Calder\u00f3n-Dom\u00ednguez, G., Chanona-P\u00e9rez, J.J., Mendoza-Madrigal, A.G., Di Pierro, P., Garc\u00eda-Almend\u00e1rez, B.E., Amaro-Reyes, A., and Regalado-Gonz\u00e1lez, C. (2017). Physical, Structural, Barrier, and Antifungal Characterization of Chitosan\u2013Zein Edible Films with Added Essential Oils. Int. J. Mol. Sci., 18.","DOI":"10.3390\/ijms18112370"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"5118","DOI":"10.15376\/biores.16.3.5118-5132","article-title":"Reinforcing Effects of Seaweed Nanoparticles in Agar-Based Biopolymer Composite: Physical, Water Vapor Barrier, Mechanical, and Biodegradable Properties","volume":"16","author":"Dungani","year":"2021","journal-title":"BioResources"},{"key":"ref_10","unstructured":"Cameron, J.J., and Leung, C.K. (2024, September 28). 
Mining Frequent Patterns from Precise and Uncertain Data, 2011, UNIFACS. Available online: http:\/\/hdl.handle.net\/1993\/32123."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Hern\u00e1ndez, V., Ibarra, D., Triana, J.F., Mart\u00ednez-Soto, B., Fa\u00fandez, M., Vasco, D.A., Gordillo, L., Herrera, F., Garc\u00eda-Herrera, C., and Garmulewicz, A. (2022). Agar Biopolymer Films for Biodegradable Packaging: A Reference Dataset for Exploring the Limits of Mechanical Performance. Materials, 15.","DOI":"10.3390\/ma15113954"},{"key":"ref_12","first-page":"100488","article-title":"The Impact of Domain-Specific Pre-Training on Named Entity Recognition Tasks in Materials Science","volume":"3","author":"Trewartha","year":"2021","journal-title":"SSRN Electron. J."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Liu, R.-L. (2019). Identification of conclusive association entities in biomedical articles. J. Biomed. Semant., 10.","DOI":"10.1186\/s13326-018-0194-9"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Salloum, S.A., Al-Emran, M., Monem, A.A., and Shaalan, K. (2017). Using Text Mining Techniques for Extracting Information from Research Articles. Intelligent Natural Language Processing: Trends and Applications, Springer.","DOI":"10.1007\/978-3-319-67056-0_18"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Neumann, M., King, D., Beltagy, I., and Ammar, W. (2019, January 1). ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing. Proceedings of the 18th BioNLP Workshop and Shared Task, Florence, Italy.","DOI":"10.18653\/v1\/W19-5034"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1038\/s42256-023-00788-1","article-title":"Leveraging large language models for predictive chemistry","volume":"6","author":"Jablonka","year":"2024","journal-title":"Nat. Mach. 
Intell."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1038\/s41524-022-00784-w","article-title":"MatSciBERT: A materials domain language model for text mining and information extraction","volume":"8","author":"Gupta","year":"2022","journal-title":"npj Comput. Mater."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/s41597-019-0224-1","article-title":"Text-mined dataset of inorganic materials synthesis recipes","volume":"6","author":"Kononova","year":"2019","journal-title":"Sci. Data"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"V\u00e9liz, F.A.V., Bikku, T., Ibarra, D., Hern\u00e1ndez, V., Garmulewicz, A., and Herrera, F. (2024). fherreralab\/-Seaweed-Based-Bioplastics-Data-Mining-Ingredient-Property-Relations-from-the-Scientific-Literature: Supplementary Material, v1.0, Zenodo.","DOI":"10.3390\/data10020020"},{"key":"ref_20","unstructured":"Tunstall, L., von Werra, L., and Wolf, T. (2022). Natural Language Processing with Transformers Building Language Applications with Hugging Face, O\u2019Reilly Media, Inc."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"100488","DOI":"10.1016\/j.patter.2022.100488","article-title":"Quantifying the advantage of domain-specific pre-training on named entity recognition tasks in materials science","volume":"3","author":"Trewartha","year":"2022","journal-title":"Patterns"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"128576","DOI":"10.1016\/j.neucom.2024.128576","article-title":"Mask-guided BERT for Few Shot Text Classification","volume":"610","author":"Liao","year":"2023","journal-title":"Neurocomputing"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"105","DOI":"10.22159\/ijap.2018v10i2.24460","article-title":"Design and evaluation of buccal patches containing combination of hydrochlorothiazide and atenolol","volume":"10","author":"Roda","year":"2018","journal-title":"Int. J. Appl. 
Pharm."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"309","DOI":"10.22159\/ijap.2018v10i6.28495","article-title":"Fabrication of bioadhesive ocusert with different polymers: Once a day dose","volume":"10","author":"Dawaba","year":"2018","journal-title":"Int. J. Appl. Pharm."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"101578","DOI":"10.1016\/j.jddst.2020.101578","article-title":"Effect of composition on mechanical and physicochemical properties of mucoadhesive buccal films containing buprenorphine hydrochloride: From design of experiments to optimal formulation","volume":"56","author":"Kazemi","year":"2020","journal-title":"J. Drug Deliv. Sci. Technol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1007\/s40005-016-0298-0","article-title":"Preparation and evaluation of oral dissolving film containing local anesthetic agent, lidocaine","volume":"47","author":"Kim","year":"2017","journal-title":"J. Pharm. Investig."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"133342","DOI":"10.1016\/j.foodchem.2022.133342","article-title":"Preparation and application of a colorimetric film based on sodium alginate\/sodium carboxymethyl cellulose incorporated with rose anthocyanins","volume":"393","author":"Yang","year":"2022","journal-title":"Food Chem."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1111\/j.1365-2621.1999.tb09880.x","article-title":"Physical characteristics of a composite film of soy protein isolate and propyleneglycol alginate","volume":"64","author":"Rhim","year":"1999","journal-title":"J. 
Food Sci."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1016\/j.foodhyd.2018.02.026","article-title":"Antiviral and antioxidant properties of active alginate edible films containing phenolic extracts","volume":"81","author":"Fabra","year":"2018","journal-title":"Food Hydrocoll."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1016\/j.foodres.2012.07.023","article-title":"Fortification of dietary biopolymers-based packaging material with bioactive plant extracts","volume":"49","author":"Wang","year":"2012","journal-title":"Food Res. Int."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"012008","DOI":"10.1088\/1755-1315\/483\/1\/012008","article-title":"The effect of organic powdered cottonii concentration and types of plasticizers on the characteristics of edible film","volume":"483","author":"Fransiska","year":"2020","journal-title":"IOP Conf. Ser. Earth Environ. Sci."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"\u0141opusiewicz, \u0141., Macieja, S., \u015aliwi\u0144ski, M., Bartkowiak, A., Roy, S., and Sobolewski, P. (2022). Alginate Biofunctional Films Modified with Melanin from Watermelon Seeds and Zinc Oxide\/Silver Nanoparticles. Materials, 15.","DOI":"10.3390\/ma15072381"},{"key":"ref_33","first-page":"1091","article-title":"Preparation and properties analysis of edible watermelon rind based film","volume":"37","author":"Wu","year":"2018","journal-title":"J. Food Sci. Biotechnol."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"12832","DOI":"10.1111\/jfs.12832","article-title":"Preparation of a bilayer edible film incorporated with lysozyme and its effect on fish spoilage bacteria","volume":"40","author":"Li","year":"2020","journal-title":"J. 
Food Saf."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2317","DOI":"10.4315\/0362-028X-68.11.2317","article-title":"Listeria monocytogenes Inhibition by Whey Protein Films and Coatings Incorporating Lysozyme","volume":"68","author":"Min","year":"2005","journal-title":"J. Food Prot."},{"key":"ref_36","first-page":"M560","article-title":"Antimicrobial Activity and Hydrophobicity of Edible Whey Protein Isolate Films Formulated with Nisin and\/or Glucose Oxidase","volume":"78","year":"2013","journal-title":"J. Food Sci."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.lwt.2013.05.012","article-title":"Antimicrobial edible defatted soybean meal-based films incorporating the lactoperoxidase system","volume":"54","author":"Lee","year":"2013","journal-title":"LWT-Food Sci. Technol."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"103036","DOI":"10.1016\/j.algal.2023.103036","article-title":"Life Cycle Assessment of pilot scale production of seaweed-based bioplastic","volume":"71","author":"Ayala","year":"2023","journal-title":"Algal Res."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"El-Sheekh, M.M., Alwaleed, E.A., Ibrahim, A., and Saber, H. (2024). Preparation and characterization of bioplastic film from the green seaweed Halimeda opuntia. Int. J. Biol. Macromol., 259.","DOI":"10.1016\/j.ijbiomac.2024.129307"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1108\/PRT-09-2021-0119","article-title":"Development and characterization of biodegradable film from marine red seaweed (Kappaphycus alvarezii)","volume":"52","author":"Rajasekar","year":"2023","journal-title":"Pigment. 
Resin Technol."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"113776","DOI":"10.1016\/j.lwt.2022.113776","article-title":"Biodegradable packaging films with \u03b5-polylysine\/ZIF-L composites","volume":"166","author":"Zhang","year":"2022","journal-title":"LWT"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"16632","DOI":"10.1111\/jfpp.16632","article-title":"Evaluation of physicochemical, mechanical, and antimicrobial properties of gelatin-sodium alginate-yarrow (Achillea millefolium L.) essential oil film","volume":"46","author":"Karami","year":"2022","journal-title":"J. Food Process. Preserv."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"18048","DOI":"10.1021\/jacs.3c05819","article-title":"ChatGPT Chemistry Assistant for Text Mining and the Prediction of MOF Synthesis","volume":"145","author":"Zheng","year":"2023","journal-title":"J. Am. Chem. Soc."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/2\/20\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T16:25:25Z","timestamp":1760027125000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/2\/20"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,1]]},"references-count":43,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2025,2]]}},"alternative-id":["data10020020"],"URL":"https:\/\/doi.org\/10.3390\/data10020020","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,1]]}}}