{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T17:02:14Z","timestamp":1775581334469,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T00:00:00Z","timestamp":1714089600000},"content-version":"vor","delay-in-days":30,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,3,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Despite the widespread use of ionizable lipid nanoparticles (LNPs) in clinical applications for messenger RNA (mRNA) delivery, the mRNA drug delivery system faces an efficient challenge in the screening of LNPs. Traditional screening methods often require a substantial amount of experimental time and incur high research and development costs. To accelerate the early development stage of LNPs, we propose TransLNP, a transformer-based transfection prediction model designed to aid in the selection of LNPs for mRNA drug delivery systems. TransLNP uses two types of molecular information to perceive the relationship between structure and transfection efficiency: coarse-grained atomic sequence information and fine-grained atomic spatial relationship information. Due to the scarcity of existing LNPs experimental data, we find that pretraining the molecular model is crucial for better understanding the task of predicting LNPs properties, which is achieved through reconstructing atomic 3D coordinates and masking atom predictions. In addition, the issue of data imbalance is particularly prominent in the real-world exploration of LNPs. We introduce the BalMol block to solve this problem by smoothing the distribution of labels and molecular features. Our approach outperforms state-of-the-art works in transfection property prediction under both random and scaffold data splitting. Additionally, we establish a relationship between molecular structural similarity and transfection differences, selecting 4267 pairs of molecular transfection cliffs, which are pairs of molecules that exhibit high structural similarity but significant differences in transfection efficiency, thereby revealing the primary source of prediction errors. The code, model and data are made publicly available at https:\/\/github.com\/wklix\/TransLNP.<\/jats:p>","DOI":"10.1093\/bib\/bbae186","type":"journal-article","created":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T21:45:20Z","timestamp":1714167920000},"source":"Crossref","is-referenced-by-count":30,"title":["Data-balanced transformer for accelerated ionizable lipid nanoparticles screening in mRNA delivery"],"prefix":"10.1093","volume":"25","author":[{"given":"Kun","family":"Wu","sequence":"first","affiliation":[{"name":"Shanghai Advanced Research Institute, Chinese Academy of Sciences , Shanghai 201210 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]},{"given":"Xiulong","family":"Yang","sequence":"additional","affiliation":[{"name":"Shanghai Advanced Research Institute, Chinese Academy of Sciences , Shanghai 201210 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]},{"given":"Zixu","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Tsukuba , Tsukuba 3058577 , Japan"}]},{"given":"Na","family":"Li","sequence":"additional","affiliation":[{"name":"National Facility for Protein Science in Shanghai, Zhangjiang Laboratory , Shanghai Advanced Research Institute,"},{"name":"Chinese Academy of Sciences , Shanghai Advanced Research Institute,"}]},{"given":"Jialu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Shanghai Advanced Research Institute, Chinese Academy of Sciences , Shanghai 201210 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]},{"given":"Lizhuang","family":"Liu","sequence":"additional","affiliation":[{"name":"Shanghai Advanced Research Institute, Chinese Academy of Sciences , Shanghai 201210 , China"},{"name":"University of Chinese Academy of Sciences , Beijing 100049 , China"}]}],"member":"286","published-online":{"date-parts":[[2024,4,25]]},"reference":[{"issue":"1","key":"2024042621450213200_ref1","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1038\/s41392-022-01007-w","article-title":"Mrna-based therapeutics: powerful and versatile tools to combat diseases","volume":"7","author":"Qin","year":"2022","journal-title":"Signal Transduct Target Ther"},{"issue":"11","key":"2024042621450213200_ref2","doi-asserted-by":"crossref","first-page":"16982","DOI":"10.1021\/acsnano.1c04996","article-title":"Lipid nanoparticles-from liposomes to mrna vaccine delivery, a landscape of research diversity and advancement","volume":"15","author":"Tenchov","year":"2021","journal-title":"ACS Nano"},{"key":"2024042621450213200_ref3","doi-asserted-by":"crossref","first-page":"3321","DOI":"10.1016\/j.radcr.2021.07.082","article-title":"Acute myocarditis following comirnaty vaccination in a healthy man with previous sars-cov-2 infection","volume":"16","author":"Patrignani","year":"2021","journal-title":"Radiol Case Rep"},{"key":"2024042621450213200_ref4","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1038\/s41562-023-01733-3","article-title":"A systematic review and meta-analysis of the global prevalence and determinants of covid-19 vaccine acceptance and uptake in people living with hiv","volume":"8","author":"Sulaiman","year":"2024","journal-title":"Nat Hum Behav"},{"issue":"12","key":"2024042621450213200_ref5","doi-asserted-by":"crossref","first-page":"2118","DOI":"10.1038\/mt.2014.133","article-title":"A cationic nanoemulsion for the delivery of next-generation rna vaccines","volume":"22","author":"Brito","year":"2014","journal-title":"Mol Ther"},{"issue":"8","key":"2024042621450213200_ref6","doi-asserted-by":"crossref","first-page":"1415","DOI":"10.1016\/j.ymthe.2019.05.012","article-title":"Mrna delivery for therapeutic anti-her2 antibody expression in\u00c2 vivo","volume":"27","author":"Rybakova","year":"2019","journal-title":"Mol Ther"},{"issue":"4","key":"2024042621450213200_ref7","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1016\/j.ymthe.2018.12.013","article-title":"Self-amplifying rna vaccines for venezuelan equine encephalitis virus induce robust protective immunogenicity in mice","volume":"27","author":"Samsa","year":"2019","journal-title":"Mol Ther"},{"key":"2024042621450213200_ref8","first-page":"702","article-title":"Modified mrna-lnp vaccines confer protection against experimental denv-2 infection in mice","volume":"18","author":"Zhang","year":"2020","journal-title":"Mol Ther"},{"issue":"3","key":"2024042621450213200_ref9","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1093\/infdis\/jix592","article-title":"Modified mrna-based vaccines elicit robust immune responses and protect Guinea pigs from ebola virus disease","volume":"217","author":"Meyer","year":"2018","journal-title":"J Infect Dis"},{"key":"2024042621450213200_ref10","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1016\/j.omtn.2020.03.015","article-title":"Treatment of hemophilia a using factor viii messenger rna lipid nanoparticles","volume":"20","author":"Chen","year":"2020","journal-title":"Mol Ther Nucleic Acids"},{"issue":"10","key":"2024042621450213200_ref11","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1038\/gt.2016.46","article-title":"Therapeutic efficacy in a hemophilia b model using a biosynthetic mrna liver depot system","volume":"23","author":"DeRosa","year":"2016","journal-title":"Gene Ther"},{"issue":"1","key":"2024042621450213200_ref12","doi-asserted-by":"crossref","first-page":"7859","DOI":"10.1038\/s41598-018-26298-4","article-title":"Mrna treatment produces sustained expression of enzymatically active human adamts13 in mice","volume":"8","author":"Liu-Chen","year":"2018","journal-title":"Sci Rep"},{"issue":"1","key":"2024042621450213200_ref13","doi-asserted-by":"crossref","first-page":"20019","DOI":"10.1038\/srep20019","article-title":"Intrathecal delivery of frataxin mrna encapsulated in lipid nanoparticles to dorsal root ganglia as a potential therapeutic for friedreich\u2019s ataxia","volume":"6","author":"Nabhan","year":"2016","journal-title":"Sci Rep"},{"key":"2024042621450213200_ref14","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1016\/j.omtn.2018.03.013","article-title":"De novo synthesis of elastin by exogenous delivery of synthetic modified mrna into skin and elastin-deficient cells","volume":"11","author":"Lescan","year":"2018","journal-title":"Mol Ther Nucleic Acids"},{"issue":"2","key":"2024042621450213200_ref15","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1007\/s00221-020-05970-0","article-title":"Lipid nanoparticles-encapsulated brain-derived neurotrophic factor mrna delivered through the round window niche in the cochleae of Guinea pigs","volume":"239","author":"Miwa","year":"2021","journal-title":"Exp Brain Res"},{"issue":"1","key":"2024042621450213200_ref16","doi-asserted-by":"crossref","first-page":"7233","DOI":"10.1038\/s41467-021-27493-0","article-title":"An ionizable lipid toolbox for rna delivery","volume":"12","author":"Han","year":"2021","journal-title":"Nat Commun"},{"issue":"15","key":"2024042621450213200_ref17","doi-asserted-by":"crossref","first-page":"2209624","DOI":"10.1002\/adma.202209624","article-title":"Unsaturated, trialkyl ionizable lipids are versatile lipid-nanoparticle components for therapeutic and vaccine applications","volume":"35","author":"Lam","year":"2023","journal-title":"Adv Mater"},{"issue":"9","key":"2024042621450213200_ref18","doi-asserted-by":"crossref","first-page":"eabf4398","DOI":"10.1126\/sciadv.abf4398","article-title":"Engineered ionizable lipid nanoparticles for targeted delivery of rna therapeutics into different types of cells in the liver","volume":"7","author":"Kim","year":"2021","journal-title":"Sci Adv"},{"issue":"12","key":"2024042621450213200_ref19","doi-asserted-by":"crossref","first-page":"1078","DOI":"10.1038\/s41578-021-00358-0","article-title":"Lipid nanoparticles for mrna delivery","volume":"6","author":"Hou","year":"2021","journal-title":"Nat Rev Mater"},{"issue":"10","key":"2024042621450213200_ref20","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1038\/s41587-019-0247-3","article-title":"Delivery of mrna vaccines with heterocyclic lipids increases anti-tumor efficacy by sting-mediated immune cell activation","volume":"37","author":"Miao","year":"2019","journal-title":"Nat Biotechnol"},{"issue":"10","key":"2024042621450213200_ref21","doi-asserted-by":"crossref","first-page":"1410","DOI":"10.1038\/s41587-023-01679-x","article-title":"Combinatorial design of nanoparticles for pulmonary mrna delivery and genome editing","volume":"41","author":"Li","year":"2023","journal-title":"Nat Biotechnol"},{"key":"2024042621450213200_ref22","doi-asserted-by":"crossref","first-page":"2950","DOI":"10.1016\/j.apsb.2021.11.021","article-title":"Prediction of lipid nanoparticles for mrna vaccines by the machine learning algorithm","volume":"12","author":"Wang","year":"2021","journal-title":"Acta Pharm Sin B"},{"key":"2024042621450213200_ref23","doi-asserted-by":"crossref","article-title":"Agile platform: a deep learning-powered approach to accelerate lnp development for mrna delivery","author":"Xu","DOI":"10.1101\/2023.06.01.543345"},{"issue":"4","key":"2024042621450213200_ref24","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1021\/acscentsci.6b00367","article-title":"Low data drug discovery with one-shot learning","volume":"3","author":"Altae-Tran","year":"2017","journal-title":"ACS Cent Sci"},{"key":"2024042621450213200_ref25","doi-asserted-by":"crossref","first-page":"2559","DOI":"10.1145\/3442381.3450112","article-title":"Few-shot graph learning for molecular property prediction","volume-title":"Proceedings of the Web Conference 2021, WWW'21","author":"Guo","year":"2021"},{"issue":"1","key":"2024042621450213200_ref26","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1021\/acs.jcim.2c00779","article-title":"Few-shot learning for low-data drug discovery","volume":"63","author":"Vella","year":"2023","journal-title":"J Chem Inf Model"},{"key":"2024042621450213200_ref27","first-page":"7774","article-title":"Data augmentation for imbalanced regression","volume-title":"Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, volume 206 of Proceedings of Machine Learning Research","author":"Stocksieker","year":"2023"},{"key":"2024042621450213200_ref28","article-title":"Delving into deep imbalanced regression","volume-title":"International Conference on Machine Learning (ICML)","author":"Yang","year":"2021"},{"key":"2024042621450213200_ref29","first-page":"36","article-title":"SMOGN: a pre-processing approach for imbalanced regression","volume-title":"Proceedings of the First International Workshop on Learning with Imbalanced Domains: Theory and Applications, volume 74 of Proceedings of Machine Learning Research","author":"Branco","year":"2017"},{"key":"2024042621450213200_ref30","first-page":"67","article-title":"Rebagg: Resampled bagging for imbalanced regression","volume-title":"Proceedings of the Second International Workshop on Learning with Imbalanced Domains: Theory and Applications, volume 94 of Proceedings of Machine Learning Research","author":"Branco","year":"2018"},{"issue":"2","key":"2024042621450213200_ref31","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1038\/s42256-021-00438-4","article-title":"Geometry-enhanced molecular representation learning for property prediction","volume":"4","author":"Fang","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2024042621450213200_ref32","article-title":"CheMixNet: mixed DNN architectures for predicting chemical properties using multiple molecular representations","volume-title":"Proceedings of the Thirty-second Conference on Neural Information Processing Systems (NeurIPS)","author":"Paul","year":"2018"},{"issue":"23","key":"2024042621450213200_ref33","doi-asserted-by":"crossref","first-page":"4979","DOI":"10.1093\/bioinformatics\/btz307","article-title":"Fp2vec: a new molecular featurizer for learning molecular properties","volume":"35","author":"Jeon","year":"2019","journal-title":"Bioinformatics"},{"key":"2024042621450213200_ref34","volume-title":"Three-dimensionally embedded graph convolutional network (3dgcn) for molecule interpretation","author":"Cho","year":"2018"},{"issue":"1","key":"2024042621450213200_ref35","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1109\/TNN.2008.2005605","article-title":"The graph neural network model","volume":"20","author":"Scarselli","year":"2009","journal-title":"IEEE Trans Neural Netw"},{"key":"2024042621450213200_ref36","first-page":"6000","article-title":"Attention is all you need","volume-title":"Proceedings of the 31st international conference on neural information processing systems, NIPS\u201917","author":"Vaswani","year":"2017"},{"issue":"9","key":"2024042621450213200_ref37","doi-asserted-by":"crossref","first-page":"1572","DOI":"10.1021\/acscentsci.9b00576","article-title":"Molecular transformer: a model for uncertainty-calibrated chemical reaction prediction","volume":"5","author":"Schwaller","year":"2019","journal-title":"ACS Cent Sci"},{"key":"2024042621450213200_ref38","doi-asserted-by":"crossref","first-page":"09","DOI":"10.1093\/bib\/bbac408","article-title":"FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction","volume":"23","author":"Cai","year":"2022","journal-title":"Brief Bioinform"},{"issue":"5","key":"2024042621450213200_ref39","doi-asserted-by":"crossref","first-page":"bbab051","DOI":"10.1093\/bib\/bbab051","article-title":"NPI-GNN: predicting ncRNA-protein interactions with deep graph neural networks","volume":"22","author":"Shen","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024042621450213200_ref40","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac231","article-title":"Multiphysical graph neural network (MP-GNN) for COVID-19 drug design","volume":"23","author":"Li","year":"2022","journal-title":"Brief Bioinform"},{"issue":"9","key":"2024042621450213200_ref41","doi-asserted-by":"crossref","first-page":"2064","DOI":"10.1021\/acs.jcim.1c00600","article-title":"Molgpt: molecular generation using a transformer-decoder model","volume":"62","author":"Bagal","year":"2022","journal-title":"J Chem Inf Model"},{"key":"2024042621450213200_ref42","article-title":"Transformer-based molecular generative model for antiviral drug design","volume":"64","author":"Mao","year":"2023","journal-title":"J Chem Inf Model"},{"key":"2024042621450213200_ref43","article-title":"3d-transformer: molecular representation with transformer in 3d space","author":"Fang","year":"2021"},{"key":"2024042621450213200_ref44","doi-asserted-by":"crossref","DOI":"10.26434\/chemrxiv-2022-jjm0j-v4","article-title":"Uni-mol: A universal 3d molecular representation learning framework","volume-title":"The Eleventh International Conference on Learning Representations","author":"Zhou","year":"2023"},{"key":"2024042621450213200_ref45","first-page":"2058","article-title":"Return of frustratingly easy domain adaptation","volume-title":"Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI\u201916","author":"Sun","year":"2016"},{"key":"2024042621450213200_ref46","article-title":"Hierarchical grammar-induced geometry for data-efficient molecular property prediction","volume-title":"Proceedings of the 40th International Conference on Machine Learning ICML\u201923","author":"Guo","year":"2023"},{"key":"2024042621450213200_ref47","first-page":"15870","article-title":"Motif-based graph self-supervised learning for molecular property prediction","volume-title":"Advances in Neural Information Processing Systems","year":"2021"},{"issue":"23","key":"2024042621450213200_ref48","doi-asserted-by":"crossref","first-page":"5938","DOI":"10.1021\/acs.jcim.2c01073","article-title":"Exposing the limitations of molecular machine learning with activity cliffs","volume":"62","author":"van Tilborg","year":"2022","journal-title":"J Chem Inf Model"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/3\/bbae186\/57330692\/bbae186.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/3\/bbae186\/57330692\/bbae186.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,26]],"date-time":"2024-04-26T21:45:52Z","timestamp":1714167952000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae186\/7658017"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,27]]},"references-count":48,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,3,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae186","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,5]]},"published":{"date-parts":[[2024,3,27]]},"article-number":"bbae186"}}