{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T01:46:03Z","timestamp":1772847963408,"version":"3.50.1"},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,3,28]],"date-time":"2023-03-28T00:00:00Z","timestamp":1679961600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,28]],"date-time":"2023-03-28T00:00:00Z","timestamp":1679961600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2021YFF1201400"],"award-info":[{"award-number":["2021YFF1201400"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["22173118"],"award-info":[{"award-number":["22173118"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Hunan Provincial Science Fund for Distinguished Young Scholars","award":["2021JJ10068"],"award-info":[{"award-number":["2021JJ10068"]}]},{"name":"the science and technology innovation Program of Hunan Province","award":["2021RC4011"],"award-info":[{"award-number":["2021RC4011"]}]},{"DOI":"10.13039\/501100003051","name":"the New Energy and Industrial Technology Development Organization (NEDO);","doi-asserted-by":"crossref","award":["JP22K12144;JPMJPF2017,"],"award-info":[{"award-number":["JP22K12144;JPMJPF2017,"]}],"id":[{"id":"10.13039\/501100003051","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Drug discovery for a protein target is a laborious and costly process. Deep learning (DL) methods have been applied to drug discovery and successfully generated novel molecular structures, and they can substantially reduce development time and costs. However, most of them rely on prior knowledge, either by drawing on the structure and properties of known molecules to generate similar candidate molecules or extracting information on the binding sites of protein pockets to obtain molecules that can bind to them. In this paper, DeepTarget, an end-to-end DL model, was proposed to generate novel molecules solely relying on the amino acid sequence of the target protein to reduce the heavy reliance on prior knowledge. DeepTarget includes three modules: Amino Acid Sequence Embedding (AASE), Structural Feature Inference (SFI), and Molecule Generation (MG). AASE generates embeddings from the amino acid sequence of the target protein. SFI inferences the potential structural features of the synthesized molecule, and MG seeks to construct the eventual molecule. The validity of the generated molecules was demonstrated by a benchmark platform of molecular generation models. The interaction between the generated molecules and the target proteins was also verified on the basis of two metrics, drug\u2013target affinity and molecular docking. The results of the experiments indicated the efficacy of the model for direct molecule generation solely conditioned on amino acid sequence.<\/jats:p>","DOI":"10.1186\/s13321-023-00702-2","type":"journal-article","created":{"date-parts":[[2023,3,28]],"date-time":"2023-03-28T13:02:56Z","timestamp":1680008576000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":55,"title":["Deep generative model for drug design from protein target sequence"],"prefix":"10.1186","volume":"15","author":[{"given":"Yangyang","family":"Chen","sequence":"first","affiliation":[]},{"given":"Zixu","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Lei","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jianmin","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Pengyong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Dongsheng","family":"Cao","sequence":"additional","affiliation":[]},{"given":"Xiangxiang","family":"Zeng","sequence":"additional","affiliation":[]},{"given":"Xiucai","family":"Ye","sequence":"additional","affiliation":[]},{"given":"Tetsuya","family":"Sakurai","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,28]]},"reference":[{"key":"702_CR1","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1016\/j.jhealeco.2016.01.012","volume":"47","author":"JA DiMasi","year":"2016","unstructured":"DiMasi JA, Grabowski HG, Hansen RW (2016) Innovation in the pharmaceutical industry: new estimates of R&D costs. J Health Econ 47:20\u201333","journal-title":"J Health Econ"},{"issue":"9","key":"702_CR2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patter.2020.100142","volume":"1","author":"W Gao","year":"2020","unstructured":"Gao W et al (2020) Deep learning in protein structural modeling and design. Patterns 1(9):100142","journal-title":"Patterns"},{"key":"702_CR3","first-page":"2314","volume":"39","author":"L Chen","year":"2022","unstructured":"Chen L et al (2022) Drug design and repurposing with a sequence-to-drug paradigm. bbioRxiv 39:2314","journal-title":"bbioRxiv"},{"issue":"7553","key":"702_CR4","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436\u2013444","journal-title":"Nature"},{"issue":"19","key":"702_CR5","doi-asserted-by":"publisher","first-page":"14011","DOI":"10.1021\/acs.jmedchem.1c00927","volume":"64","author":"X Tong","year":"2021","unstructured":"Tong X et al (2021) Generative models for De Novo drug design. J Med Chem 64(19):14011\u201314027","journal-title":"J Med Chem"},{"issue":"14","key":"702_CR6","doi-asserted-by":"publisher","first-page":"3250","DOI":"10.3390\/molecules25143250","volume":"25","author":"E Lin","year":"2020","unstructured":"Lin E, Lin C-H, Lane H-Y (2020) Relevant applications of generative adversarial networks in drug design and discovery: molecular de novo design, dimensionality reduction, and de novo peptide and protein design. Molecules 25(14):3250","journal-title":"Molecules"},{"issue":"7","key":"702_CR7","doi-asserted-by":"publisher","first-page":"10883","DOI":"10.18632\/oncotarget.14073","volume":"8","author":"A Kadurin","year":"2017","unstructured":"Kadurin A et al (2017) The cornucopia of meaningful leads: applying deep adversarial autoencoders for new molecule development in oncology. Oncotarget 8(7):10883","journal-title":"Oncotarget"},{"issue":"1","key":"702_CR8","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1021\/acscentsci.7b00512","volume":"4","author":"MH Segler","year":"2018","unstructured":"Segler MH et al (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent Sci 4(1):120\u2013131","journal-title":"ACS Cent Sci"},{"key":"702_CR9","unstructured":"You J, Liu B, Ying Z, et al. Graph convolutional policy network for goal-directed molecular graph generation[J]. Advances in neural information processing systems, 2018, 31."},{"key":"702_CR10","unstructured":"Luo, Y., K. Yan, and S. Ji. GraphDF: A discrete flow model for molecular graph generation. In International Conference on Machine Learning. 2021. PMLR."},{"key":"702_CR11","first-page":"6229","volume":"34","author":"S Luo","year":"2021","unstructured":"Luo S et al (2021) A 3D generative model for structure-based drug design. Adv Neural Inf Proc Sys 34:6229\u201339","journal-title":"Adv Neural Inf Proc Sys"},{"key":"702_CR12","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1613\/jair.301","volume":"4","author":"LP Kaelbling","year":"1996","unstructured":"Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237\u2013285","journal-title":"J Artif Intell Res"},{"issue":"1","key":"702_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-019-0404-1","volume":"12","author":"\u0141 Maziarka","year":"2020","unstructured":"Maziarka \u0141 et al (2020) Mol-CycleGAN: a generative model for molecular optimization. J Cheminf 12(1):1\u201318","journal-title":"J Cheminf"},{"issue":"10","key":"702_CR14","doi-asserted-by":"publisher","first-page":"914","DOI":"10.1038\/s42256-021-00403-1","volume":"3","author":"J Wang","year":"2021","unstructured":"Wang J et al (2021) Multi-constraint molecular generation based on conditional transformer, knowledge distillation and reinforcement learning. Nat Machine Intell 3(10):914\u2013922","journal-title":"Nat Machine Intell"},{"issue":"2","key":"702_CR15","doi-asserted-by":"publisher","first-page":"621","DOI":"10.1021\/acs.jcim.0c01060","volume":"61","author":"SR Krishnan","year":"2021","unstructured":"Krishnan SR et al (2021) Accelerating de novo drug design against novel proteins using deep learning. J Chem Inf Model 61(2):621\u2013630","journal-title":"J Chem Inf Model"},{"issue":"1\u20132","key":"702_CR16","doi-asserted-by":"publisher","first-page":"1700153","DOI":"10.1002\/minf.201700153","volume":"37","author":"D Merk","year":"2018","unstructured":"Merk D et al (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inf 37(1\u20132):1700153","journal-title":"Mol Inf"},{"issue":"9","key":"702_CR17","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1038\/s41587-019-0224-x","volume":"37","author":"A Zhavoronkov","year":"2019","unstructured":"Zhavoronkov A et al (2019) Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol 37(9):1038\u20131040","journal-title":"Nat Biotechnol"},{"issue":"3","key":"702_CR18","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1093\/bib\/bbaa161","volume":"22","author":"Q Bai","year":"2021","unstructured":"Bai Q et al (2021) MolAICal: a soft tool for 3D drug design of protein targets by artificial intelligence and classical algorithm. Brief Bioinform 22(3):161","journal-title":"Brief Bioinform"},{"issue":"2","key":"702_CR19","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1002\/jcc.21334","volume":"31","author":"O Trott","year":"2010","unstructured":"Trott O, Olson AJ (2010) AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 31(2):455\u2013461","journal-title":"J Comput Chem"},{"issue":"10","key":"702_CR20","doi-asserted-by":"publisher","first-page":"4282","DOI":"10.1021\/acs.molpharmaceut.9b00634","volume":"16","author":"M Skalic","year":"2019","unstructured":"Skalic M et al (2019) From target to drug: generative modeling for the multimodal structure-based ligand design. Mol Pharm 16(10):4282\u20134291","journal-title":"Mol Pharm"},{"key":"702_CR21","unstructured":"Masuda T, Ragoza M, Koes DR. (2020) Generating 3d molecular structures conditional on a receptor binding site with deep generative models. arXiv preprint arXiv:2010.14442."},{"key":"702_CR22","unstructured":"Ragoza, M, Masuda T, Koes DR. (2020). Learning a continuous representation of 3D molecular structures with deep generative models. arXiv preprint arXiv:2010.08687."},{"issue":"7","key":"702_CR23","doi-asserted-by":"publisher","first-page":"3240","DOI":"10.1021\/acs.jcim.0c01494","volume":"61","author":"M Xu","year":"2021","unstructured":"Xu M, Ran T, Chen H (2021) De novo molecule design through the molecular generative model conditioned by 3D information of protein binding sites. J Chem Inf Model 61(7):3240\u20133254","journal-title":"J Chem Inf Model"},{"issue":"3","key":"702_CR24","doi-asserted-by":"publisher","first-page":"1205","DOI":"10.1021\/acs.jcim.8b00706","volume":"59","author":"M Skalic","year":"2019","unstructured":"Skalic M et al (2019) Shape-based generative modeling for de novo drug design. J Chem Inf Model 59(3):1205\u20131214","journal-title":"J Chem Inf Model"},{"issue":"7873","key":"702_CR25","doi-asserted-by":"publisher","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","volume":"596","author":"J Jumper","year":"2021","unstructured":"Jumper J et al (2021) Highly accurate protein structure prediction with AlphaFold. Nature 596(7873):583\u2013589","journal-title":"Nature"},{"key":"702_CR26","unstructured":"Goodfellow I et al. (2014). Generative adversarial nets. Advances in neural information processing systems. 27."},{"key":"702_CR27","doi-asserted-by":"crossref","unstructured":"Wang J et al. (2022). De novo molecular design with deep molecular generative models for PPI inhibitors. Briefings in Bioinformatics. 23(4).","DOI":"10.1093\/bib\/bbac285"},{"key":"702_CR28","unstructured":"Gunel B et al. (2020). Supervised contrastive learning for pre-trained language model fine-tuning. arXiv preprint arXiv:2011.01403."},{"issue":"D1","key":"702_CR29","doi-asserted-by":"publisher","first-page":"D1100","DOI":"10.1093\/nar\/gkr777","volume":"40","author":"A Gaulton","year":"2012","unstructured":"Gaulton A et al (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40(D1):D1100\u2013D1107","journal-title":"Nucleic Acids Res"},{"key":"702_CR30","unstructured":"Vaswani A et al. (2017). Attention is all you need. Advances in neural information processing systems. 30."},{"issue":"56","key":"702_CR31","first-page":"595","volume":"1984","author":"HB GDR","year":"1984","unstructured":"GDR HB, Sharon N, Australia EW (1984) Nomenclature and symbolism for amino acids and peptides. Pure Appl Chem 1984(56):595\u2013624","journal-title":"Pure Appl Chem"},{"key":"702_CR32","unstructured":"Gulrajani, I et al. (2017). Improved training of wasserstein gans. Adv Neural Inform Proc Syst. 30."},{"issue":"24","key":"702_CR33","doi-asserted-by":"publisher","first-page":"8362","DOI":"10.1039\/D1SC01050F","volume":"12","author":"O Dollar","year":"2021","unstructured":"Dollar O et al (2021) Attention-based generative models for de novo molecular design. Chem Sci 12(24):8362\u20138372","journal-title":"Chem Sci"},{"issue":"1","key":"702_CR34","doi-asserted-by":"publisher","first-page":"3293","DOI":"10.1038\/s41467-022-30839-x","volume":"13","author":"D Flam-Shepherd","year":"2022","unstructured":"Flam-Shepherd D, Zhu K, Aspuru-Guzik A (2022) Language models can learn complex molecular distributions. Nat Commun 13(1):3293","journal-title":"Nat Commun"},{"key":"702_CR35","unstructured":"Tschannen M, Bachem O, Lucic M. (2018). Recent advances in autoencoder-based representation learning. arXiv preprint arXiv:1812.05069."},{"key":"702_CR36","unstructured":"Bjerrum EJ. (2017). SMILES enumeration as data augmentation for neural network modeling of molecules. arXiv preprint arXiv:1703.07076."},{"issue":"4","key":"702_CR37","first-page":"1029","volume":"3","author":"KK Chaudhary","year":"2016","unstructured":"Chaudhary KK, Mishra N (2016) A review on molecular docking: novel tool for drug discovery. Databases 3(4):1029","journal-title":"Databases"},{"issue":"22\u201323","key":"702_CR38","first-page":"5545","volume":"36","author":"K Huang","year":"2020","unstructured":"Huang K et al (2020) DeepPurpose: a deep learning library for drug\u2013target interaction prediction. Bioinformatics 36(22\u201323):5545\u20135547","journal-title":"Bioinformatics"},{"issue":"1","key":"702_CR39","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1093\/nar\/gkl999","volume":"35","author":"T Liu","year":"2007","unstructured":"Liu T et al (2007) BindingDB: a web-accessible database of experimentally determined protein\u2013ligand binding affinities. Nucleic Acids Res 35(1):198\u2013201","journal-title":"Nucleic Acids Res"},{"issue":"7","key":"702_CR40","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1021\/jm0306430","volume":"47","author":"RA Friesner","year":"2004","unstructured":"Friesner RA et al (2004) Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem 47(7):1739\u20131749","journal-title":"J Med Chem"},{"key":"702_CR41","doi-asserted-by":"publisher","first-page":"1931","DOI":"10.3389\/fphar.2020.565644","volume":"11","author":"D Polykovskiy","year":"2020","unstructured":"Polykovskiy D et al (2020) Molecular sets (MOSES): a benchmarking platform for molecular generation models. Front Pharmacol 11:1931","journal-title":"Front Pharmacol"},{"issue":"1","key":"702_CR42","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1097\/00008571-199602000-00006","volume":"6","author":"DE Comings","year":"1996","unstructured":"Comings DE et al (1996) The dopamine D2 receptor (DRD2) gene: a genetic risk factor in smoking. Pharmacogenetics 6(1):73\u201379","journal-title":"Pharmacogenetics"},{"issue":"4","key":"702_CR43","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1038\/nrc2812","volume":"10","author":"M Rouleau","year":"2010","unstructured":"Rouleau M et al (2010) PARP inhibition: PARP1 and beyond. Nat Rev Cancer 10(4):293\u2013301","journal-title":"Nat Rev Cancer"},{"issue":"1","key":"702_CR44","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-020-79682-4","volume":"11","author":"D Grechishnikova","year":"2021","unstructured":"Grechishnikova D (2021) Transformer neural network for protein-specific de novo drug generation as a machine translation problem. Sci Rep 11(1):1\u201313","journal-title":"Sci Rep"},{"key":"702_CR45","doi-asserted-by":"crossref","unstructured":"Freitag M, Al-Onaizan Y. (2017). Beam search strategies for neural machine translation. arXiv preprint arXiv:1702.01806.","DOI":"10.18653\/v1\/W17-3207"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00702-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00702-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00702-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,9]],"date-time":"2023-12-09T20:10:36Z","timestamp":1702152636000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00702-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,28]]},"references-count":45,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["702"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00702-2","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,28]]},"assertion":[{"value":"31 July 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 February 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 March 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"We declare no conflict of interest.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"38"}}