{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T03:49:37Z","timestamp":1761709777723,"version":"3.37.3"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,8,31]],"date-time":"2020-08-31T00:00:00Z","timestamp":1598832000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,8,31]],"date-time":"2020-08-31T00:00:00Z","timestamp":1598832000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100010909","name":"Young Scientists Fund","doi-asserted-by":"publisher","award":["21805303"],"award-info":[{"award-number":["21805303"]}],"id":[{"id":"10.13039\/501100010909","id-type":"DOI","asserted-by":"publisher"}]},{"name":"CSDB","award":["XXH135"],"award-info":[{"award-number":["XXH135"]}]},{"name":"SGST","award":["18DZ2294000"],"award-info":[{"award-number":["18DZ2294000"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Machine translation of chemical nomenclature has considerable application\u00a0prospect in chemical text data processing between languages. However, rule based machine translation tools have to face significant complication in rule sets building, especially in translation of chemical names between English and Chinese, which are the two most used languages\u00a0of chemical nomenclature in the world. We applied two types of neural networks in the task of chemical nomenclature translation between English and Chinese, and made a comparison\u00a0with an existing rule based machine translation tool. The result shows that deep learning based approaches have a great chance to precede rule based translation tools in machine translation of chemical nomenclature between English and Chinese.<\/jats:p>","DOI":"10.1186\/s13321-020-00457-0","type":"journal-article","created":{"date-parts":[[2020,8,31]],"date-time":"2020-08-31T12:03:04Z","timestamp":1598875384000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Neural machine translation of chemical nomenclature between English and Chinese"],"prefix":"10.1186","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5529-1875","authenticated-orcid":false,"given":"Tingjun","family":"Xu","sequence":"first","affiliation":[]},{"given":"Weiming","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Junhong","family":"Zhou","sequence":"additional","affiliation":[]},{"given":"Jingfang","family":"Dai","sequence":"additional","affiliation":[]},{"given":"Yingyong","family":"Li","sequence":"additional","affiliation":[]},{"given":"Yingli","family":"Zhao","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,8,31]]},"reference":[{"key":"457_CR1","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1515\/ci.2002.24.2.12b","volume":"24","author":"A McNaught","year":"2002","unstructured":"McNaught A (2002) Chemical nomenclature and structure representation. Chem Int 24:12\u201314. https:\/\/doi.org\/10.1515\/ci.2002.24.2.12b","journal-title":"Chem Int"},{"key":"457_CR2","unstructured":"Chemical Abstracts Service (2007) Naming and indexing of chemical substances for chemical abstracts. Appendix IV of CA Index Guide"},{"key":"457_CR3","unstructured":"Ikutoshi, Matsuura (2005) Development of a system for translation of chemical name into 2D-structure. 28th symposium on chemical information and computer science, 29\u201332"},{"key":"457_CR4","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1021\/ci100384d","volume":"51","author":"DM Lowe","year":"2011","unstructured":"Lowe DM, Corbett PT, Murray-Rust P, Glen RC (2011) Chemical name to structure: OPSIN, an open source solution. J Chem Inf Model 51:739\u2013753. https:\/\/doi.org\/10.1021\/ci100384d","journal-title":"J Chem Inf Model"},{"key":"457_CR5","unstructured":"Google Inc (2020) Google. https:\/\/www.google.com\/"},{"key":"457_CR6","volume-title":"Nomenclature of organic compounds","author":"China Chemical Society","year":"2018","unstructured":"China Chemical Society (2018) Nomenclature of organic compounds. SciencePress, Beijing"},{"key":"457_CR7","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1021\/c160055a009","volume":"14","author":"GG Vander Stouw","year":"1974","unstructured":"Vander Stouw GG, Elliott PM, Isenberg AC (1974) Automated conversion of chemical substance names to atom-bond connection tables. J Chem Doc 14:185\u2013193. https:\/\/doi.org\/10.1021\/c160055a009","journal-title":"J Chem Doc"},{"key":"457_CR8","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1021\/ci00062a009","volume":"29","author":"DI Cooke-Fox","year":"1989","unstructured":"Cooke-Fox DI, Kirby GH, Rayner JD (1989) Computer translation of IUPAC systematic organic chemical nomenclature. 1. Introduction and background to a grammar-based approach. J Chem Inf Comput Sci 29:101\u2013105. https:\/\/doi.org\/10.1021\/ci00062a009","journal-title":"J Chem Inf Comput Sci"},{"key":"457_CR9","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1021\/ci00066a004","volume":"30","author":"DI Cooke-Fox","year":"1990","unstructured":"Cooke-Fox DI, Kirby GH, Lord MR, Rayner JD (1990) Computer translation of IUPAC systematic organic chemical nomenclature. 4. Concise connection tables to structure diagrams. J Chem Inf Comput Sci 30:122\u2013127. https:\/\/doi.org\/10.1021\/ci00066a004","journal-title":"J Chem Inf Comput Sci"},{"key":"457_CR10","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1021\/ci800243w","volume":"49","author":"R Sayle","year":"2009","unstructured":"Sayle R (2009) Foreign language translation of chemical nomenclature by computer. J Chem Inf Model 49:519\u2013530. https:\/\/doi.org\/10.1021\/ci800243w","journal-title":"J Chem Inf Model"},{"key":"457_CR11","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1021\/c160005a012","volume":"2","author":"L Summers","year":"1962","unstructured":"Summers L (1962) Machine translation of Russian organic chemical names into English by analysis and resynthesis of the component fragments. J Chem Doc 2:83\u201386. https:\/\/doi.org\/10.1021\/c160005a012","journal-title":"J Chem Doc"},{"key":"457_CR12","doi-asserted-by":"publisher","first-page":"192","DOI":"10.1038\/192192a0","volume":"192","author":"E Garfield","year":"1961","unstructured":"Garfield E (1961) Chemico-linguistics: computer translation of chemical nomenclature. Nature 192:192. https:\/\/doi.org\/10.1038\/192192a0","journal-title":"Nature"},{"key":"457_CR13","unstructured":"Chen B Chen W (2006) Study on machine translation of English compound name to Chinese. The 8th symposium on scientific database and information technology. Changsha, 2006."},{"key":"457_CR14","unstructured":"Xu T, Chen W (2008) Study on machine translation of Chinese compound name to English. The 9th symposium on scientific database and information technology. Guilin, 2008."},{"key":"457_CR15","doi-asserted-by":"publisher","unstructured":"Cho K, van Merrienboer B, Gulcehre G, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2017) Learning phrase representations using RNN encoder-decoder for statistical machine translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), p 1724\u20131734. https:\/\/doi.org\/10.3115\/v1\/D14-1179","DOI":"10.3115\/v1\/D14-1179"},{"key":"457_CR16","unstructured":"Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. arXiv: 1409.0473. https:\/\/arxiv.org\/abs\/1409.0473"},{"key":"457_CR17","doi-asserted-by":"publisher","unstructured":"Luong T, Hieu P, Christopher DM (2015) Effective approaches to attention-based neural machine translation. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, p 1412\u20131421. https:\/\/doi.org\/10.18653\/v1\/D15-1166","DOI":"10.18653\/v1\/D15-1166"},{"key":"457_CR18","unstructured":"Tanakitrungruang W (2017) Attention-based sequence-to-sequence in keras. https:\/\/wanasit.github.io\/attention-based-sequence-to-sequence-in-keras.html"},{"key":"457_CR19","unstructured":"Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. arXiv: 1409.3215. https:\/\/arxiv.org\/abs\/1409.3215"},{"key":"457_CR20","unstructured":"keras (2019) Sequence-to-sequence example in Keras (character-level). https:\/\/github.com\/keras-team\/keras\/blob\/master\/examples\/cnn_seq2seq.py"},{"key":"457_CR21","doi-asserted-by":"publisher","DOI":"10.1016\/0165-6074(89)90269-X","author":"M Sundermeyer","year":"2012","unstructured":"Sundermeyer M, Schl\u00fcter R, Ney H (2012) LSTM Neural Networks for Language Modeling. Interspeech. https:\/\/doi.org\/10.1016\/0165-6074(89)90269-X","journal-title":"Interspeech"},{"key":"457_CR22","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1039\/C7ME00107J","volume":"3","author":"D Fooshee","year":"2018","unstructured":"Fooshee D, Mood A, Gutman E (2018) Deep learning for chemical reaction prediction. Mol Syst Des Eng 3:442\u2013452. https:\/\/doi.org\/10.1039\/C7ME00107J","journal-title":"Mol Syst Des Eng"},{"key":"457_CR23","unstructured":"keras (2019) Trains a basic character-level sequence-to-sequence model. https:\/\/github.com\/keras-team\/keras\/blob\/master\/examples\/lstm_seq2seq.py"},{"key":"457_CR24","unstructured":"Python Software Foundation (2020) Python 3. https:\/\/www.python.org"},{"key":"457_CR25","unstructured":"Chollet F et al (2015) Keras. https:\/\/keras.io"},{"key":"457_CR26","unstructured":"Google Inc (2019) Tensorflow. https:\/\/github.com\/tensorftow\/tensorftow"},{"key":"457_CR27","unstructured":"Shanghai Institute of Organic Chemistry (2020) Machine translation tool for chemical nomenclature. https:\/\/www.organchem.csdb.cn\/translate"},{"key":"457_CR28","unstructured":"Andrej K (2015) The unreasonable effectiveness of recurrent neural networks. https:\/\/karpathy.github.io\/2015\/05\/21\/rnn-effectiveness"},{"key":"457_CR29","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1351\/pac199971040587","volume":"71","author":"PM Giles Jr","year":"1999","unstructured":"Giles PM Jr (1999) Revised section F: natural products and related compound (IUPAC Recommendations 1999). Pure Appl Chem 71:587","journal-title":"Pure Appl Chem"},{"key":"457_CR30","doi-asserted-by":"publisher","DOI":"10.1039\/9781849733069","author":"H Favre","year":"2014","unstructured":"Favre H, Powell W (2014) Nomenclature of organic chemistry: IUPAC recommendations and preferred names 2013. R Soc Chem. https:\/\/doi.org\/10.1039\/9781849733069","journal-title":"R Soc Chem"},{"key":"457_CR31","unstructured":"Christopher O (2015) Understanding LSTM Networks. https:\/\/colah.github.io\/posts\/2015-08-Understanding-LSTMs"},{"key":"457_CR32","doi-asserted-by":"publisher","unstructured":"Sainath TN, Vinyals O, Senior A, Sak H (2015) Convolutional, long short-term memory, fully connected deep neural networks. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https:\/\/doi.org\/10.1109\/icassp.2015.7178838","DOI":"10.1109\/icassp.2015.7178838"},{"key":"457_CR33","unstructured":"Yoon K, Yacine J, David S, Alexander MR (2015) Character-aware neural language models. arXiv: 1508.06615. https:\/\/arxiv.org\/abs\/1508.06615"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-020-00457-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-020-00457-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-020-00457-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T00:03:10Z","timestamp":1630368190000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-020-00457-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,31]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["457"],"URL":"https:\/\/doi.org\/10.1186\/s13321-020-00457-0","relation":{},"ISSN":["1758-2946"],"issn-type":[{"type":"electronic","value":"1758-2946"}],"subject":[],"published":{"date-parts":[[2020,8,31]]},"assertion":[{"value":"5 August 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 August 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 August 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"50"}}