{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T14:51:18Z","timestamp":1769611878639,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,9,3]],"date-time":"2023-09-03T00:00:00Z","timestamp":1693699200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,9,3]]},"DOI":"10.1145\/3584371.3613013","type":"proceedings-article","created":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T18:52:30Z","timestamp":1696445550000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["CodonBERT: Using BERT for Sentiment Analysis to Better Predict Genes with Low Expression"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0991-7726","authenticated-orcid":false,"given":"Ashley Nicole","family":"Babjac","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Tennessee, Knoxville, Tennessee, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-9904-9741","authenticated-orcid":false,"given":"Zhixiu","family":"Lu","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Tennessee, Knoxville, Tennessee, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5741-4517","authenticated-orcid":false,"given":"Scott J","family":"Emrich","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Tennessee, Knoxville, Tennessee, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,10,4]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2187--2193","author":"Babjac Ashley","year":"2021","unstructured":"Ashley Babjac , Jun Li , and Scott Emrich . 2021 . Fine-Grained Synonymous Codon Usage Patterns and their Potential Role in Functional Protein Production . In 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2187--2193 . Ashley Babjac, Jun Li, and Scott Emrich. 2021. Fine-Grained Synonymous Codon Usage Patterns and their Potential Role in Functional Protein Production. In 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2187--2193."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acllong.269"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"J.L. Chaney A. Steele R. Carmichael A. Rodriguez A.T. Specht K. Ngo J. Li S. Emrich and P.L. Clark. 2017. Widespread position-specific conservation of synonymous rare codons within coding sequences. PLOS Computational Biology 13 5 (05 2017) 1--19.  J.L. Chaney A. Steele R. Carmichael A. Rodriguez A.T. Specht K. Ngo J. Li S. Emrich and P.L. Clark. 2017. Widespread position-specific conservation of synonymous rare codons within coding sequences. PLOS Computational Biology 13 5 (05 2017) 1--19.","DOI":"10.1371\/journal.pcbi.1005531"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts149"},{"key":"e_1_3_2_1_5_1","volume-title":"Natural language processing. Fundamentals of artificial intelligence","year":"2020","unstructured":"KR1442 Chowdhary. 2020. Natural language processing. Fundamentals of artificial intelligence ( 2020 ), 603--649. KR1442 Chowdhary. 2020. Natural language processing. Fundamentals of artificial intelligence (2020), 603--649."},{"key":"e_1_3_2_1_6_1","volume-title":"AlphaFold2 and the future of structural biology. Nature structural & molecular biology 28, 9","author":"Cramer Patrick","year":"2021","unstructured":"Patrick Cramer . 2021. AlphaFold2 and the future of structural biology. Nature structural & molecular biology 28, 9 ( 2021 ), 704--705. Patrick Cramer. 2021. AlphaFold2 and the future of structural biology. Nature structural & molecular biology 28, 9 (2021), 704--705."},{"key":"e_1_3_2_1_7_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)."},{"key":"e_1_3_2_1_8_1","volume-title":"A Comparison of LSTM and BERT for Small Corpus. arXiv preprint arXiv:2009.05451","author":"Ezen-Can Aysu","year":"2020","unstructured":"Aysu Ezen-Can . 2020. A Comparison of LSTM and BERT for Small Corpus. arXiv preprint arXiv:2009.05451 ( 2020 ). Aysu Ezen-Can. 2020. A Comparison of LSTM and BERT for Small Corpus. arXiv preprint arXiv:2009.05451 (2020)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/BIBE.2017.00-63"},{"key":"e_1_3_2_1_10_1","volume-title":"Measurement of average decoding rates of the 61 sense codons in vivo. eLife 3","author":"Gardin Justin","year":"2014","unstructured":"Justin Gardin , Rukhsana Yeasmin , Alisa Yurovsky , Ying Cai , Steve Skiena , Bruce Futcher , and Nahum Sonenberg . 2014. Measurement of average decoding rates of the 61 sense codons in vivo. eLife 3 ( 2014 ), e03735. Justin Gardin, Rukhsana Yeasmin, Alisa Yurovsky, Ying Cai, Steve Skiena, Bruce Futcher, and Nahum Sonenberg. 2014. Measurement of average decoding rates of the 61 sense codons in vivo. eLife 3 (2014), e03735."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"M.A. Gilchrist W.-C. Chen P. Shah C. L. Landerer and R. Zaretzki. 2015. Estimating gene expression and codon-specific translational efficiencies mutation biases and selection coefficients from genomic data alone. Genome Biology and Evolution 7 6 (05 2015) 1559--1579.  M.A. Gilchrist W.-C. Chen P. Shah C. L. Landerer and R. Zaretzki. 2015. Estimating gene expression and codon-specific translational efficiencies mutation biases and selection coefficients from genomic data alone. Genome Biology and Evolution 7 6 (05 2015) 1559--1579.","DOI":"10.1093\/gbe\/evv087"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btab083"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33014039"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","first-page":"100847","DOI":"10.1016\/j.rineng.2022.100847","article-title":"Codon-mRNA prediction using deep optimal neurocomputing technique (DLSTM-DSN-WOA) and multivariate analysis","volume":"17","author":"Kadhuim Zena A","year":"2023","unstructured":"Zena A Kadhuim and Samaher Al-Janabi . 2023 . Codon-mRNA prediction using deep optimal neurocomputing technique (DLSTM-DSN-WOA) and multivariate analysis . Results in Engineering 17 (2023), 100847 . Zena A Kadhuim and Samaher Al-Janabi. 2023. Codon-mRNA prediction using deep optimal neurocomputing technique (DLSTM-DSN-WOA) and multivariate analysis. Results in Engineering 17 (2023), 100847.","journal-title":"Results in Engineering"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation. 16--21","author":"Kokalj Enja","year":"2021","unstructured":"Enja Kokalj , Bla\u017e \u0160krlj , Nada Lavra\u010d , Senja Pollak , and Marko Robnik-\u0160ikonja . 2021 . BERT meets shapley: Extending SHAP explanations to transformer-based classifiers . In Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation. 16--21 . Enja Kokalj, Bla\u017e \u0160krlj, Nada Lavra\u010d, Senja Pollak, and Marko Robnik-\u0160ikonja. 2021. BERT meets shapley: Extending SHAP explanations to transformer-based classifiers. In Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation. 16--21."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Zeming Lin Halil Akin Roshan Rao Brian Hie Zhongkai Zhu Wenting Lu Nikita Smetanin Robert Verkuil Ori Kabeli Yaniv Shmueli etal 2023. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379 6637 (2023) 1123--1130.  Zeming Lin Halil Akin Roshan Rao Brian Hie Zhongkai Zhu Wenting Lu Nikita Smetanin Robert Verkuil Ori Kabeli Yaniv Shmueli et al. 2023. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379 6637 (2023) 1123--1130.","DOI":"10.1126\/science.ade2574"},{"key":"e_1_3_2_1_17_1","volume-title":"EPiC Series in Computing","volume":"70","author":"Lu Zhixiu","year":"2020","unstructured":"Zhixiu Lu , Michael Gilchrist , and Scott Emrich . 2020 . Analysis of mutation bias in shaping codon usage bias and its association with gene expression across species . In EPiC Series in Computing , Vol. 70 . EasyChair, 139--148. Zhixiu Lu, Michael Gilchrist, and Scott Emrich. 2020. Analysis of mutation bias in shaping codon usage bias and its association with gene expression across species. In EPiC Series in Computing, Vol. 70. EasyChair, 139--148."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN48605.2020.9206892"},{"key":"e_1_3_2_1_19_1","volume-title":"Simon Liu, and Sergei Maslov.","author":"Nambiar Ananthan","year":"2023","unstructured":"Ananthan Nambiar , John Malcolm Forsyth , Simon Liu, and Sergei Maslov. 2023 . DR-BERT: A Protein Language Model to Annotate Disordered Regions . bioRxiv (2023), 2023--02. Ananthan Nambiar, John Malcolm Forsyth, Simon Liu, and Sergei Maslov. 2023. DR-BERT: A Protein Language Model to Annotate Disordered Regions. bioRxiv (2023), 2023--02."},{"key":"e_1_3_2_1_20_1","first-page":"220","article-title":"Understanding auc-roc curve","volume":"26","author":"Narkhede Sarang","year":"2018","unstructured":"Sarang Narkhede . 2018 . Understanding auc-roc curve . Towards Data Science 26 , 1 (2018), 220 -- 227 . Sarang Narkhede. 2018. Understanding auc-roc curve. Towards Data Science 26, 1 (2018), 220--227.","journal-title":"Towards Data Science"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1080\/15476286.2021.1901185"},{"key":"e_1_3_2_1_22_1","volume-title":"Danyeong Lee, Nayeon Kim, Sangyeup Kim, Georgy Meshcheryakov, Andrey Lando, et al.","author":"Rafi Abdul Muntakim","year":"2023","unstructured":"Abdul Muntakim Rafi , Dmitry Penzar , Daria Nogina , Dohoon Lee , Eeshit Dhaval Vaishnav , Danyeong Lee, Nayeon Kim, Sangyeup Kim, Georgy Meshcheryakov, Andrey Lando, et al. 2023 . Evaluation and optimization of sequence-based gene regulatory deep learning models. bioRxiv (2023), 2023--04. Abdul Muntakim Rafi, Dmitry Penzar, Daria Nogina, Dohoon Lee, Eeshit Dhaval Vaishnav, Danyeong Lee, Nayeon Kim, Sangyeup Kim, Georgy Meshcheryakov, Andrey Lando, et al. 2023. Evaluation and optimization of sequence-based gene regulatory deep learning models. bioRxiv (2023), 2023--04."},{"key":"e_1_3_2_1_23_1","volume-title":"Oliver Dutton, Falk Hoffmann, Louie Henderson, Benjamin MJ Owens, Matthew Heberling, Emanuele Paci, and Kamil Tamiola.","author":"Redl Istvan","year":"2022","unstructured":"Istvan Redl , Carlo Fisicaro , Oliver Dutton, Falk Hoffmann, Louie Henderson, Benjamin MJ Owens, Matthew Heberling, Emanuele Paci, and Kamil Tamiola. 2022 . ADOPT : intrinsic protein disorder prediction through deep bidirectional transformers. bioRxiv (2022), 2022--05. Istvan Redl, Carlo Fisicaro, Oliver Dutton, Falk Hoffmann, Louie Henderson, Benjamin MJ Owens, Matthew Heberling, Emanuele Paci, and Kamil Tamiola. 2022. ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers. bioRxiv (2022), 2022--05."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1817299116"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","first-page":"356","DOI":"10.1002\/pro.3336","article-title":"% MinMax: A versatile tool for calculating and comparing synonymous codon usage and its impact on protein folding","volume":"27","author":"Rodriguez Anabel","year":"2018","unstructured":"Anabel Rodriguez , Gabriel Wright , Scott Emrich , and Patricia L Clark . 2018 . % MinMax: A versatile tool for calculating and comparing synonymous codon usage and its impact on protein folding . Protein Science 27 , 1 (2018), 356 -- 362 . Anabel Rodriguez, Gabriel Wright, Scott Emrich, and Patricia L Clark. 2018. % MinMax: A versatile tool for calculating and comparing synonymous codon usage and its impact on protein folding. Protein Science 27, 1 (2018), 356--362.","journal-title":"Protein Science"},{"key":"e_1_3_2_1_26_1","volume-title":"The shapley value in machine learning. arXiv preprint arXiv:2202.05594","author":"Rozemberczki Benedek","year":"2022","unstructured":"Benedek Rozemberczki , Lauren Watson , P\u00e9ter Bayer , Hao-Tsung Yang , Oliv\u00e9r Kiss , Sebastian Nilsson , and Rik Sarkar . 2022. The shapley value in machine learning. arXiv preprint arXiv:2202.05594 ( 2022 ). Benedek Rozemberczki, Lauren Watson, P\u00e9ter Bayer, Hao-Tsung Yang, Oliv\u00e9r Kiss, Sebastian Nilsson, and Rik Sarkar. 2022. The shapley value in machine learning. arXiv preprint arXiv:2202.05594 (2022)."},{"key":"e_1_3_2_1_27_1","volume-title":"D1","author":"Sayers Eric W","year":"2022","unstructured":"Eric W Sayers , Mark Cavanaugh , Karen Clark , Kim D Pruitt , Conrad L Schoch , Stephen T Sherry , and Ilene Karsch-Mizrachi . 2022. GenBank. Nucleic acids research 50 , D1 ( 2022 ), D161. Eric W Sayers, Mark Cavanaugh, Karen Clark, Kim D Pruitt, Conrad L Schoch, Stephen T Sherry, and Ilene Karsch-Mizrachi. 2022. GenBank. Nucleic acids research 50, D1 (2022), D161."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/15.3.1281"},{"key":"e_1_3_2_1_29_1","volume-title":"Attention is all you need. Advances in neural information processing systems 30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , \u0141ukasz Kaiser , and Illia Polosukhin . 2017. Attention is all you need. Advances in neural information processing systems 30 ( 2017 ). Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 (2017)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","first-page":"e0232003","DOI":"10.1371\/journal.pone.0232003","article-title":"Analysis of computational codon usage models and their association with translationally slow codons","volume":"15","author":"Wright Gabriel","year":"2020","unstructured":"Gabriel Wright , Anabel Rodriguez , Jun Li , Patricia L Clark , Tijana Milenkovi\u0107 , and Scott J Emrich . 2020 . Analysis of computational codon usage models and their association with translationally slow codons . PloS one 15 , 4 (2020), e0232003 . Gabriel Wright, Anabel Rodriguez, Jun Li, Patricia L Clark, Tijana Milenkovi\u0107, and Scott J Emrich. 2020. Analysis of computational codon usage models and their association with translationally slow codons. PloS one 15, 4 (2020), e0232003.","journal-title":"PloS one"},{"key":"e_1_3_2_1_31_1","volume-title":"Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity. Nature","author":"Zhang He","year":"2023","unstructured":"He Zhang , Liang Zhang , Ang Lin , Congcong Xu , Ziyu Li , Kaibo Liu , Boxiang Liu , Xiaopin Ma , Fanfan Zhao , Huiling Jiang , Chunxiu Chen , Haifa Shen , Hangwen Li , David H. Mathews , Yujian Zhang , and Liang Huang . 2023. Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity. Nature ( 2023 ). He Zhang, Liang Zhang, Ang Lin, Congcong Xu, Ziyu Li, Kaibo Liu, Boxiang Liu, Xiaopin Ma, Fanfan Zhao, Huiling Jiang, Chunxiu Chen, Haifa Shen, Hangwen Li, David H. Mathews, Yujian Zhang, and Liang Huang. 2023. Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity. Nature (2023)."}],"event":{"name":"BCB '23: 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics","location":"Houston TX USA","acronym":"BCB '23","sponsor":["SIGBio ACM Special Interest Group on Bioinformatics"]},"container-title":["Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3584371.3613013","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3584371.3613013","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:26Z","timestamp":1750178786000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3584371.3613013"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,3]]},"references-count":31,"alternative-id":["10.1145\/3584371.3613013","10.1145\/3584371"],"URL":"https:\/\/doi.org\/10.1145\/3584371.3613013","relation":{},"subject":[],"published":{"date-parts":[[2023,9,3]]},"assertion":[{"value":"2023-10-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}