{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T23:13:38Z","timestamp":1771715618971,"version":"3.50.1"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2022,3,24]],"date-time":"2022-03-24T00:00:00Z","timestamp":1648080000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"National Key Research and Development Projects","award":["2019YFE0103800"],"award-info":[{"award-number":["2019YFE0103800"]}]},{"name":"National Key Research and Development Projects","award":["2020YFB0704502"],"award-info":[{"award-number":["2020YFB0704502"]}]},{"name":"Science and Technology Program of Sichuan Province","award":["2021YFH0060"],"award-info":[{"award-number":["2021YFH0060"]}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["2018CDPZH-9"],"award-info":[{"award-number":["2018CDPZH-9"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["2019CDPZH-23"],"award-info":[{"award-number":["2019CDPZH-23"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Chinese-Hungarian Bilateral Project","award":["2018-2.1.14-T\u00c9T-CN-2018-00011"],"award-info":[{"award-number":["2018-2.1.14-T\u00c9T-CN-2018-00011"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Prediction of antimicrobial resistance based on whole-genome sequencing data has attracted greater attention due to its rapidity and convenience. Numerous machine learning\u2013based studies have used genetic variants to predict drug resistance in Mycobacterium tuberculosis (MTB), assuming that variants are homogeneous, and most of these studies, however, have ignored the essential correlation between variants and corresponding genes when encoding variants, and used a limited number of variants as prediction input. In this study, taking advantage of genome-wide variants for drug-resistance prediction and inspired by natural language processing, we summarize drug resistance prediction into document classification, in which variants are considered as words, mutated genes in an isolate as sentences, and an isolate as a document. We propose a novel hierarchical attentive neural network model (HANN) that helps discover drug resistance-related genes and variants and acquire more interpretable biological results. It captures the interaction among variants in a mutated gene as well as among mutated genes in an isolate. Our results show that for the four first-line drugs of isoniazid (INH), rifampicin (RIF), ethambutol (EMB) and pyrazinamide (PZA), the HANN achieves the optimal area under the ROC curve of 97.90, 99.05, 96.44 and 95.14% and the optimal sensitivity of 94.63, 96.31, 92.56 and 87.05%, respectively. In addition, without any domain knowledge, the model identifies drug resistance-related genes and variants consistent with those confirmed by previous studies, and more importantly, it discovers one more potential drug-resistance-related gene.<\/jats:p>","DOI":"10.1093\/bib\/bbac041","type":"journal-article","created":{"date-parts":[[2022,2,20]],"date-time":"2022-02-20T12:06:41Z","timestamp":1645358801000},"source":"Crossref","is-referenced-by-count":14,"title":["Drug resistance prediction and resistance genes identification in <i>Mycobacterium tuberculosis<\/i> based on a hierarchical attentive neural network utilizing genome-wide variants"],"prefix":"10.1093","volume":"23","author":[{"given":"Zhonghua","family":"Jiang","sequence":"first","affiliation":[{"name":"Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China"}]},{"given":"Yongmei","family":"Lu","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China"}]},{"given":"Zhuochong","family":"Liu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China"}]},{"given":"Wei","family":"Wu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China"}]},{"given":"Xinyi","family":"Xu","sequence":"additional","affiliation":[{"name":"Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China"}]},{"given":"Andr\u00e1s","family":"Dinny\u00e9s","sequence":"additional","affiliation":[{"name":"BioTalentum Ltd. Aulich Lajos str. 26. 2100 G\u00f6d\u00f6ll\u00f5, Hungary"}]},{"given":"Zhonghua","family":"Yu","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China"}]},{"given":"Li","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4372-8865","authenticated-orcid":false,"given":"Qun","family":"Sun","sequence":"additional","affiliation":[{"name":"Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China"}]}],"member":"286","published-online":{"date-parts":[[2022,3,24]]},"reference":[{"key":"2022051813062648100_ref1","volume-title":"Global Tuberculosis Report","author":"World Health Organization","year":"2021"},{"key":"2022051813062648100_ref2","doi-asserted-by":"crossref","first-page":"aad3292","DOI":"10.1126\/science.aad3292","article-title":"Multidrug evolutionary strategies to reverse antibiotic resistance","volume":"351","author":"Baym","year":"2016","journal-title":"Science"},{"key":"2022051813062648100_ref3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13073-019-0650-x","article-title":"Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs","volume":"11","author":"Phelan","year":"2019","journal-title":"Genome Med"},{"key":"2022051813062648100_ref4","doi-asserted-by":"crossref","first-page":"356","DOI":"10.1016\/j.ebiom.2019.04.016","article-title":"Beyond multidrug resistance: leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction","volume":"43","author":"Chen","year":"2019","journal-title":"EBioMedicine"},{"key":"2022051813062648100_ref5","doi-asserted-by":"crossref","first-page":"3240","DOI":"10.1093\/bioinformatics\/btz067","article-title":"DeepAMR for predicting co-occurrent resistance of Mycobacterium tuberculosis","volume":"35","author":"Yang","year":"2019","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref6","first-page":"1","volume-title":"Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics","author":"Safari","year":"2021"},{"key":"2022051813062648100_ref7","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab299","article-title":"An end-to-end heterogeneous graph attention network for Mycobacterium tuberculosis drug-resistance prediction","volume":"22","author":"Yang","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022051813062648100_ref8","doi-asserted-by":"crossref","first-page":"667","DOI":"10.3389\/fmicb.2020.00667","article-title":"Multi-label random forest model for tuberculosis drug resistance classification and mutation ranking","volume":"11","author":"Kouchaki","year":"2020","journal-title":"Front Microbiol"},{"key":"2022051813062648100_ref9","doi-asserted-by":"crossref","first-page":"1666","DOI":"10.1093\/bioinformatics\/btx801","article-title":"Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data","volume":"34","author":"Yang","year":"2018","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref10","doi-asserted-by":"crossref","first-page":"2276","DOI":"10.1093\/bioinformatics\/bty949","article-title":"Application of machine learning techniques to tuberculosis drug resistance analysis","volume":"35","author":"Kouchaki","year":"2019","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref11","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1145\/3411408.3411463","volume-title":"11th Hellenic Conference on Artificial Intelligence","author":"Gialitsis","year":"2020"},{"key":"2022051813062648100_ref12","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab005","article-title":"A transformer architecture based on BERT and 2D convolutional neural network to identify DNA enhancers from sequence information","volume":"22","author":"Le","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022051813062648100_ref13","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1021\/acs.jproteome.1c00848","article-title":"Improved prediction model of protein lysine Crotonylation sites using bidirectional recurrent neural networks","volume":"21","author":"Tng","year":"2022","journal-title":"J Proteome Res"},{"key":"2022051813062648100_ref14","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbaa128","article-title":"Using deep neural networks and biological subwords to detect protein S-sulfenylation sites","volume":"22","author":"Do","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022051813062648100_ref15","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab117","article-title":"Deep drug-target binding affinity prediction with multiple attention blocks","volume":"22","author":"Zeng","year":"2021","journal-title":"Brief Bioinform"},{"key":"2022051813062648100_ref16","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2022051813062648100_ref17","doi-asserted-by":"crossref","first-page":"1403","DOI":"10.1056\/NEJMoa1800474","article-title":"Prediction of susceptibility to first-line tuberculosis drugs by DNA sequencing","volume":"379","author":"Allix-B\u00e9guec","year":"2018","journal-title":"N Engl J Med"},{"key":"2022051813062648100_ref18","doi-asserted-by":"crossref","first-page":"1193","DOI":"10.1016\/S1473-3099(15)00062-6","article-title":"Whole-genome sequencing for prediction of Mycobacterium tuberculosis drug susceptibility and resistance: a retrospective cohort study","volume":"15","author":"Walker","year":"2015","journal-title":"Lancet Infect Dis"},{"key":"2022051813062648100_ref19","doi-asserted-by":"crossref","first-page":"i884","DOI":"10.1093\/bioinformatics\/bty560","article-title":"Fastp: an ultra-fast all-in-one FASTQ preprocessor","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref20","article-title":"Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM","author":"Li","year":"2013","journal-title":"arXiv"},{"key":"2022051813062648100_ref21","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref22","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome Res"},{"key":"2022051813062648100_ref23","doi-asserted-by":"crossref","first-page":"104152","DOI":"10.1016\/j.meegid.2019.104152","article-title":"An optimized genomic VCF workflow for precise identification of Mycobacterium tuberculosis cluster from cross-platform whole genome sequencing data","volume":"79","author":"Disratthakit","year":"2020","journal-title":"Infect Genet Evol"},{"key":"2022051813062648100_ref24","doi-asserted-by":"crossref","first-page":"2156","DOI":"10.1093\/bioinformatics\/btr330","article-title":"The variant call format and VCFtools","volume":"27","author":"Danecek","year":"2011","journal-title":"Bioinformatics"},{"key":"2022051813062648100_ref25","doi-asserted-by":"crossref","first-page":"80","DOI":"10.4161\/fly.19695","article-title":"A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3","volume":"6","author":"Cingolani","year":"2012","journal-title":"Fly"},{"key":"2022051813062648100_ref26","article-title":"Multilingual hierarchical attention networks for document classification","author":"Pappas","journal-title":"In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)."},{"key":"2022051813062648100_ref27","doi-asserted-by":"crossref","first-page":"e0248299","DOI":"10.1371\/journal.pone.0248299","article-title":"\u201cWhen they say weed causes depression, but it\u2019s your fav antidepressant\u201d: knowledge-aware attention framework for relationship extraction","volume":"16","author":"Yadav","year":"2021","journal-title":"PLoS One"},{"key":"2022051813062648100_ref28","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani","year":"2017"},{"key":"2022051813062648100_ref29","doi-asserted-by":"crossref","first-page":"922","DOI":"10.3389\/fgene.2019.00922","article-title":"Machine learning predicts accurately Mycobacterium tuberculosis drug resistance from whole genome sequencing data","volume":"10","author":"Deelder","year":"2019","journal-title":"Front Genet"},{"key":"2022051813062648100_ref30","doi-asserted-by":"crossref","first-page":"e01819","DOI":"10.1128\/mBio.01819-14","article-title":"Mycobacterium tuberculosis pyrazinamide resistance determinants: a multicenter study","volume":"5","author":"Miotto","year":"2014","journal-title":"MBio"},{"key":"2022051813062648100_ref31","doi-asserted-by":"crossref","first-page":"e1004294","DOI":"10.1371\/journal.pcbi.1004294","article-title":"The opponent channel population code of sound location is an efficient representation of natural binaural sounds","volume":"11","author":"M\u0142ynarski","year":"2015","journal-title":"PLoS Comput Biol"},{"key":"2022051813062648100_ref32","doi-asserted-by":"crossref","DOI":"10.1007\/s11431-020-1647-3","article-title":"Pre-trained models for natural language processing: a survey","volume":"63","author":"Qiu","year":"2020","journal-title":"Sci China Technol Sc"},{"key":"2022051813062648100_ref33","doi-asserted-by":"crossref","first-page":"4068","DOI":"10.1128\/AAC.49.10.4068-4074.2005","article-title":"Molecular characterization of isoniazid-resistant Mycobacterium tuberculosis isolates collected in Australia","volume":"49","author":"Lavender","year":"2005","journal-title":"Antimicrob Agents Chemother"},{"key":"2022051813062648100_ref34","doi-asserted-by":"crossref","first-page":"3686","DOI":"10.1128\/JB.00628-15","article-title":"GtrA protein Rv3789 is required for arabinosylation of arabinogalactan in Mycobacterium tuberculosis","volume":"197","author":"Kolly","year":"2015","journal-title":"J Bacteriol"},{"key":"2022051813062648100_ref35","doi-asserted-by":"crossref","first-page":"361","DOI":"10.3109\/10409238.2014.925420","article-title":"The cell envelope glycoconjugates of Mycobacterium tuberculosis","volume":"49","author":"Angala","year":"2014","journal-title":"Crit Rev Biochem Mol Biol"},{"key":"2022051813062648100_ref36","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1038\/ng.3767","article-title":"Genomic analysis of globally diverse Mycobacterium tuberculosis strains provides insights into the emergence and spread of multidrug resistance","volume":"49","author":"Manson","year":"2017","journal-title":"Nat Genet"},{"key":"2022051813062648100_ref37","doi-asserted-by":"crossref","first-page":"4800","DOI":"10.1128\/AAC.00150-15","article-title":"Molecular analysis of the embCAB locus and embR gene involved in ethambutol resistance in clinical isolates of Mycobacterium tuberculosis in France","volume":"59","author":"Brossier","year":"2015","journal-title":"Antimicrob Agents Chemother"},{"key":"2022051813062648100_ref38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-10110-6","article-title":"GWAS for quantitative resistance phenotypes in Mycobacterium tuberculosis reveals resistance genes and regulatory regions","volume":"10","author":"Farhat","year":"2019","journal-title":"Nat Commun"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac041\/43744929\/bbac041.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac041\/43744929\/bbac041.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,18]],"date-time":"2022-05-18T13:07:32Z","timestamp":1652879252000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac041\/6553603"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,24]]},"references-count":38,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,5,13]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac041","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5]]},"published":{"date-parts":[[2022,3,24]]},"article-number":"bbac041"}}