{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T10:31:22Z","timestamp":1774953082772,"version":"3.50.1"},"reference-count":87,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2022,3,24]],"date-time":"2022-03-24T00:00:00Z","timestamp":1648080000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100013375","name":"USU","doi-asserted-by":"publisher","award":["A45112"],"award-info":[{"award-number":["A45112"]}],"id":[{"id":"10.13039\/501100013375","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Nitrogen is essential for life and its transformations are an important part of the global biogeochemical cycle. Being an essential nutrient, nitrogen exists in a range of oxidation states from +5 (nitrate) to \u22123 (ammonium and amino-nitrogen), and its oxidation and reduction reactions catalyzed by microbial enzymes determine its environmental fate. The functional annotation of the genes encoding the core nitrogen network enzymes has a broad range of applications in metagenomics, agriculture, wastewater treatment and industrial biotechnology. This study developed an alignment-free computational approach to determine the predicted nitrogen biochemical network-related enzymes from the sequence itself. We propose deepNEC, a novel end-to-end feature selection and classification model training approach for nitrogen biochemical network-related enzyme prediction. The algorithm was developed using Deep Learning, a class of machine learning algorithms that uses multiple layers to extract higher-level features from the raw input data. The derived protein sequence is used as an input, extracting sequential and convolutional features from raw encoded protein sequences based on classification rather than traditional alignment-based methods for enzyme prediction. Two large datasets of protein sequences, enzymes and non-enzymes were used to train the models with protein sequence features like amino acid composition, dipeptide composition (DPC), conformation transition and distribution, normalized Moreau\u2013Broto (NMBroto), conjoint and quasi order, etc. The k-fold cross-validation and independent testing were performed to validate our model training. deepNEC uses a four-tier approach for prediction; in the first phase, it will predict a query sequence as enzyme or non-enzyme; in the second phase, it will further predict and classify enzymes into nitrogen biochemical network-related enzymes or non-nitrogen metabolism enzymes; in the third phase, it classifies predicted enzymes into nine nitrogen metabolism classes; and in the fourth phase, it predicts the enzyme commission number out of 20 classes for nitrogen metabolism. Among all, the DPC\u2009+\u2009NMBroto hybrid feature gave the best prediction performance (accuracy of 96.15% in k-fold training and 93.43% in independent testing) with an Matthews correlation coefficient (0.92 training and 0.87 independent testing) in phase I; phase II (accuracy of 99.71% in k-fold training and 98.30% in independent testing); phase III (overall accuracy of 99.03% in k-fold training and 98.98% in independent testing); phase IV (overall accuracy of 99.05% in k-fold training and 98.18% in independent testing), the DPC feature gave the best prediction performance. We have also implemented a homology-based method to remove false negatives. All the models have been implemented on a web server (prediction tool), which is freely available at http:\/\/bioinfo.usu.edu\/deepNEC\/.<\/jats:p>","DOI":"10.1093\/bib\/bbac071","type":"journal-article","created":{"date-parts":[[2022,3,7]],"date-time":"2022-03-07T20:11:51Z","timestamp":1646683911000},"source":"Crossref","is-referenced-by-count":9,"title":["deepNEC: a novel alignment-free tool for the identification and classification of nitrogen biochemical network-related enzymes using deep learning"],"prefix":"10.1093","volume":"23","author":[{"given":"Naveen","family":"Duhan","sequence":"first","affiliation":[{"name":"Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, UT 84322 USA"}]},{"given":"Jeanette M","family":"Norton","sequence":"additional","affiliation":[{"name":"Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, UT 84322 USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8683-1240","authenticated-orcid":false,"given":"Rakesh","family":"Kaundal","sequence":"additional","affiliation":[{"name":"Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, UT 84322 USA"},{"name":"Bioinformatics Facility, Center for Integrated BioSystems, UT 84322 USA"},{"name":"Department of Computer Science, College of Science; Utah State University, Logan, UT 84322 USA"}]}],"member":"286","published-online":{"date-parts":[[2022,3,24]]},"reference":[{"key":"2022051813175439400_ref1","doi-asserted-by":"crossref","first-page":"20130164","DOI":"10.1098\/rstb.2013.0164","article-title":"The global nitrogen cycle in the Twentyfirst century","volume":"368","author":"Fowler","year":"2013","journal-title":"Philos Trans R Soc B Biol Sci"},{"key":"2022051813175439400_ref2","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1007\/s10533-004-0370-0","article-title":"Nitrogen cycles: past, present, and future","volume":"70","author":"Galloway","year":"2004","journal-title":"Biogeochemistry"},{"key":"2022051813175439400_ref3","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1126\/science.1153213","article-title":"The microbial engines that drive earth\u2019s biogeochemical cycles","volume":"320","author":"Falkowski","year":"2008","journal-title":"Science"},{"key":"2022051813175439400_ref4","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1038\/nature06592","article-title":"An earth-system perspective of the global nitrogen cycle","volume":"451","author":"Gruber","year":"2008","journal-title":"Nature"},{"key":"2022051813175439400_ref5","doi-asserted-by":"crossref","first-page":"1879","DOI":"10.1073\/pnas.1313713111","article-title":"Gene-centric approach to integrating environmental genomics and biogeochemical models","volume":"111","author":"Reed","year":"2014","journal-title":"Proc Natl Acad Sci"},{"key":"2022051813175439400_ref6","doi-asserted-by":"crossref","first-page":"1351","DOI":"10.5194\/bg-10-1351-2013","article-title":"Overlooked runaway feedback in the marine nitrogen cycle: the vicious cycle","volume":"10","author":"Landolfi","year":"2013","journal-title":"Biogeosciences"},{"key":"2022051813175439400_ref7","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/BF00002772","article-title":"Nitrogen limitation on land and in the sea: how can it occur?","volume":"13","author":"Vitousek","year":"1991","journal-title":"Biogeochem"},{"key":"2022051813175439400_ref8","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1016\/S1369-5274(00)00208-3","article-title":"Microbial nitrogen cycles: physiology, genomics and applications","volume":"4","author":"Ye","year":"2001","journal-title":"Curr Opin Microbiol"},{"key":"2022051813175439400_ref9","doi-asserted-by":"crossref","first-page":"2903","DOI":"10.1111\/j.1462-2920.2008.01786.x","article-title":"The microbial nitrogen cycle","volume":"10","author":"Jetten","year":"2008","journal-title":"Environ Microbiol"},{"key":"2022051813175439400_ref10","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1038\/nrmicro.2018.9","article-title":"The microbial nitrogen-cycling network","volume":"16","author":"Kuypers","year":"2018","journal-title":"Nat Rev Microbiol"},{"key":"2022051813175439400_ref11","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1016\/j.copbio.2004.06.008","article-title":"Enzyme assays for high-throughput screening","volume":"15","author":"Goddard","year":"2004","journal-title":"Curr Opin Biotechnol"},{"key":"2022051813175439400_ref12","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1093\/nar\/28.1.45","article-title":"The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000","volume":"28","author":"Bairoch","year":"2000","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref13","doi-asserted-by":"crossref","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","article-title":"UniProt: the universal protein knowledgebase in 2021","volume":"49","author":"Consortium","year":"2021","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref14","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/j.pisc.2014.02.006","article-title":"Current IUBMB recommendations on enzyme nomenclature and kinetics","volume":"1","author":"Cornish-Bowden","year":"2014","journal-title":"Perspect Sci"},{"key":"2022051813175439400_ref15","first-page":"92","article-title":"Prediction of enzyme classification from protein sequence without the use of sequence similarity","volume":"5","author":"des Jardins","year":"1997","journal-title":"Proc Int Conf Intell Syst Mol Biol"},{"key":"2022051813175439400_ref16","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.jmb.2004.10.024","article-title":"Predicting enzyme class from protein structure without alignments","volume":"345","author":"Dobson","year":"2005","journal-title":"J Mol Biol"},{"key":"2022051813175439400_ref17","doi-asserted-by":"crossref","first-page":"e84623","DOI":"10.1371\/journal.pone.0084623","article-title":"Prediction of detailed enzyme functions and identification of specificity determining residues by random forests","volume":"9","author":"Nagao","year":"2014","journal-title":"PLoS One"},{"key":"2022051813175439400_ref18","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gks372","article-title":"COFACTOR: an accurate comparative algorithm for structure-based protein function annotation","volume":"40","author":"Roy","year":"2012","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref19","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1038\/nmeth.3213","article-title":"The I-TASSER suite: protein structure and function prediction","volume":"12","author":"Yang","year":"2014","journal-title":"Nat Method"},{"key":"2022051813175439400_ref20","doi-asserted-by":"crossref","first-page":"W291","DOI":"10.1093\/nar\/gkx366","article-title":"COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information","volume":"45","author":"Zhang","year":"2017","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref21","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1186\/1471-2105-10-107","article-title":"EFICAz2: enzyme function inference by a combined approach enhanced by machine learning","volume":"10","author":"Arakaki","year":"2009","journal-title":"BMC Bioinform"},{"key":"2022051813175439400_ref22","doi-asserted-by":"crossref","first-page":"2687","DOI":"10.1093\/bioinformatics\/bts510","article-title":"EFICAz2.5: application of a high-precision enzyme function predictor to 396 proteomes","volume":"28","author":"Kumar","year":"2012","journal-title":"Bioinformatics"},{"key":"2022051813175439400_ref23","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1186\/1471-2105-12-376","article-title":"EnzymeDetector: an integrated enzyme function prediction tool and database","volume":"12","author":"Quester","year":"2011","journal-title":"BMC Bioinform"},{"key":"2022051813175439400_ref24","doi-asserted-by":"crossref","first-page":"6226","DOI":"10.1093\/nar\/gkh956","article-title":"EFICAz: a comprehensive approach for accurate genome-scale enzyme function inference","volume":"32","author":"Tian","year":"2004","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref25","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1002\/prot.22167","article-title":"Genome-wide enzyme annotation with precision control: catalytic families (CatFam) databases","volume":"74","author":"Yu","year":"2009","journal-title":"Proteins Struct Funct Bioinform"},{"key":"2022051813175439400_ref26","doi-asserted-by":"crossref","first-page":"3692","DOI":"10.1093\/nar\/gkg600","article-title":"SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence","volume":"31","author":"Cai","year":"2003","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref27","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1002\/prot.20045","article-title":"Enzyme family classification by support vector machines","volume":"55","author":"Cai","year":"2004","journal-title":"Protein Struct Funct Genet"},{"key":"2022051813175439400_ref28","doi-asserted-by":"crossref","first-page":"967","DOI":"10.1021\/pr0500399","article-title":"Predicting enzyme subclass by functional domain composition and pseudo amino acid composition","volume":"4","author":"Cai","year":"2005","journal-title":"J Proteome Res"},{"key":"2022051813175439400_ref29","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1093\/bioinformatics\/bth466","article-title":"Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes","volume":"21","author":"Chou","year":"2005","journal-title":"Bioinformatics"},{"key":"2022051813175439400_ref30","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1021\/pr0255710","article-title":"Prediction of enzyme family classes","volume":"2","author":"Chou","year":"2003","journal-title":"J Proteome Res"},{"key":"2022051813175439400_ref31","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1186\/1471-2105-13-61","article-title":"EnzML: multi-label prediction of enzyme classes using InterPro signatures","volume":"13","author":"De Ferrari","year":"2012","journal-title":"BMC Bioinform"},{"key":"2022051813175439400_ref32","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1016\/j.biosystems.2006.10.004","article-title":"Accurate prediction of enzyme subfamily class using an adaptive fuzzy k-nearest neighbor method","volume":"90","author":"Huang","year":"2007","journal-title":"Biosystems"},{"key":"2022051813175439400_ref33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1687-4153-2012-1","article-title":"A top-down approach to classify enzyme functional classes and sub-classes using random forest","volume":"2012","author":"Kumar","year":"2012","journal-title":"Eurasip J Bioinform Syst Biol"},{"key":"2022051813175439400_ref34","doi-asserted-by":"crossref","first-page":"e0155290","DOI":"10.1371\/journal.pone.0155290","article-title":"SVM-prot 2016: a web-server for machine learning prediction of protein functional families from sequence irrespective of similarity","volume":"11","author":"Li","year":"2016","journal-title":"PLoS One"},{"key":"2022051813175439400_ref35","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1016\/j.compbiolchem.2007.03.008","article-title":"ECS: an automatic enzyme classifier based on functional domain composition","volume":"31","author":"Lu","year":"2007","journal-title":"Comput Biol Chem"},{"key":"2022051813175439400_ref36","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1016\/j.compbiolchem.2009.09.002","article-title":"Efficiency analysis of KNN and minimum distance-based classifiers in enzyme family prediction","volume":"33","author":"Nasibov","year":"2009","journal-title":"Comput Biol Chem"},{"key":"2022051813175439400_ref37","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1016\/j.jtbi.2008.10.026","article-title":"Using support vector machines to distinguish enzymes: approached by incorporating wavelet transform","volume":"256","author":"Qiu","year":"2009","journal-title":"J Theor Biol"},{"key":"2022051813175439400_ref38","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/j.bbrc.2007.09.098","article-title":"EzyPred: a top-down approach for predicting enzyme functional classes and subclasses","volume":"364","author":"Bin","year":"2007","journal-title":"Biochem Biophys Res Commun"},{"key":"2022051813175439400_ref39","doi-asserted-by":"crossref","first-page":"S11","DOI":"10.1186\/1471-2105-14-S1-S11","article-title":"Accurate prediction of protein enzymatic class by N-to-1 neural networks","volume":"14","author":"Volpato","year":"2013","journal-title":"BMC Bioinform"},{"key":"2022051813175439400_ref40","doi-asserted-by":"crossref","first-page":"e200","DOI":"10.1093\/nar\/gkq873","article-title":"Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions","volume":"38","author":"Claesson","year":"2010","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref41","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1752-0509-5-S1-S6","article-title":"Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context","volume":"5","author":"Wang","year":"2011","journal-title":"BMC Syst Biol"},{"key":"2022051813175439400_ref42","doi-asserted-by":"crossref","first-page":"1441","DOI":"10.2174\/0929866511009011441","article-title":"Prediction of enzyme subfamily class via pseudo amino acid composition by incorporating the conjoint triad feature","volume":"17","author":"Wang","year":"2012","journal-title":"Protein Pept Lett"},{"key":"2022051813175439400_ref43","doi-asserted-by":"crossref","first-page":"546","DOI":"10.1016\/j.jtbi.2007.06.001","article-title":"Using Chou\u2019s amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes","volume":"248","author":"Bin","year":"2007","journal-title":"J Theor Biol"},{"key":"2022051813175439400_ref44","doi-asserted-by":"crossref","first-page":"760","DOI":"10.1093\/bioinformatics\/btx680","article-title":"DEEPre: sequence-based enzyme EC number prediction by deep learning","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2022051813175439400_ref45","doi-asserted-by":"crossref","first-page":"3150","DOI":"10.1093\/bioinformatics\/bts565","article-title":"CD-HIT: accelerated for clustering the next-generation sequencing data","volume":"28","author":"Fu","year":"2012","journal-title":"Bioinformatics"},{"key":"2022051813175439400_ref46","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.ab.2013.05.024","article-title":"IHSP-PseRAAAC: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition","volume":"442","author":"Feng","year":"2013","journal-title":"Anal Biochem"},{"key":"2022051813175439400_ref47","doi-asserted-by":"crossref","first-page":"W65","DOI":"10.1093\/nar\/gkv458","article-title":"Pse-in-one: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences","volume":"43","author":"Liu","year":"2015","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref48","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1104\/pp.110.156851","article-title":"Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis","volume":"154","author":"Kaundal","year":"2010","journal-title":"Plant Physiol"},{"key":"2022051813175439400_ref49","author":"National Center for Biotechnology Information"},{"key":"2022051813175439400_ref50","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1093\/nar\/30.1.47","article-title":"BRENDA, enzyme data and metabolic information","volume":"30","author":"Schomburg","year":"2002","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref51","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1093\/nar\/27.1.29","article-title":"KEGG: Kyoto Encyclopedia of genes and genomes","volume":"27","author":"Ogata","year":"1999","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref52","doi-asserted-by":"crossref","first-page":"D353","DOI":"10.1093\/nar\/gkw1092","article-title":"KEGG: new perspectives on genomes, pathways, diseases and drugs","volume":"45","author":"Kanehisa","year":"2017","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref53","doi-asserted-by":"crossref","first-page":"D751","DOI":"10.1093\/nar\/gkaa939","article-title":"The IMG\/M data management and analysis system v.6.0: new tools and advanced capabilities","volume":"49","author":"Chen","year":"2021","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref54","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/bioinformatics\/16.4.404","article-title":"The PSIPRED protein structure prediction server","volume":"16","author":"McGuffin","year":"2000","journal-title":"Bioinformatics"},{"key":"2022051813175439400_ref55","doi-asserted-by":"crossref","first-page":"W402","DOI":"10.1093\/nar\/gkz297","article-title":"The PSIPRED protein analysis workbench: 20 years on","volume":"47","author":"Buchan","year":"2019","journal-title":"Nucleic Acid Res"},{"key":"2022051813175439400_ref56","first-page":"265","volume-title":"12th USENIX Symp. Oper. Syst. Des. Implement. (OSDI 16)","author":"Abadi","year":"2016"},{"key":"2022051813175439400_ref57","article-title":"Scikit-learn: machine learning in python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J Mach Learn Res"},{"key":"2022051813175439400_ref58","volume-title":"Evaluation of requirements management tools with support for traceability-based change impact analysis","author":"Abma","year":"2009"},{"key":"2022051813175439400_ref59","first-page":"41","article-title":"A proposal for new evaluation metrics and result visualization technique for sentiment analysis tasks","volume":"8138 LNCS","author":"Valverde-Albacete","year":"2013","journal-title":"Lect Note Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics)"},{"key":"2022051813175439400_ref60","doi-asserted-by":"crossref","first-page":"e0177678","DOI":"10.1371\/journal.pone.0177678","article-title":"Optimal classifier for imbalanced data using Matthews correlation coefficient metric","volume":"12","author":"Boughorbel","year":"2017","journal-title":"PLoS One"},{"key":"2022051813175439400_ref61","first-page":"1","article-title":"Pharmadoop: a tool for pharmacophore searching using Hadoop framework","volume":"6","author":"Semwal","year":"2017","journal-title":"Netw Model Anal Heal Inform Bioinform"},{"key":"2022051813175439400_ref62","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1093\/clinchem\/39.4.561","article-title":"Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine","volume":"39","author":"Zweig","year":"1993","journal-title":"Clin Chem"},{"key":"2022051813175439400_ref63","first-page":"1285","article-title":"Measuring the accuracy of diagnostic systems","volume":"240","author":"Swets","year":"1988","journal-title":"Sci Sci"},{"key":"2022051813175439400_ref64","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1186\/s12859-018-2368-y","article-title":"ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature","volume":"19","author":"Dalkiran","year":"2018","journal-title":"BMC Bioinform"},{"key":"2022051813175439400_ref65","doi-asserted-by":"crossref","first-page":"13996","DOI":"10.1073\/pnas.1821905116","article-title":"Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers","volume":"116","author":"Ryu","year":"2019","journal-title":"Proc Natl Acad Sci"},{"key":"2022051813175439400_ref66","doi-asserted-by":"crossref","first-page":"2733","DOI":"10.1080\/07391102.2020.1754292","article-title":"DeEPn: a deep neural network based tool for enzyme functional annotation","volume":"39","author":"Semwal","year":"2020","journal-title":"J Biomol Struct Dyn"},{"key":"2022051813175439400_ref67","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40793-016-0168-4","article-title":"Complete genome of Nitrosospira briensis C-128, an ammonia-oxidizing bacterium from agricultural soil","volume":"11","author":"Rice","year":"2016","journal-title":"Stand Genomic Sci"},{"key":"2022051813175439400_ref68","doi-asserted-by":"crossref","first-page":"3559","DOI":"10.1128\/AEM.02722-07","article-title":"Complete genome sequence of Nitrosospira multiformis, an ammonia-oxidizing bacterium from the soil environment","volume":"74","author":"Norton","year":"2008","journal-title":"Appl Environ Microbiol"},{"key":"2022051813175439400_ref69","doi-asserted-by":"crossref","first-page":"985","DOI":"10.1007\/s00248-019-01378-8","article-title":"Physiological and genomic comparison of Nitrosomonas cluster 6a and 7 ammonia-oxidizing bacteria","volume":"78","author":"Sedlacek","year":"2019","journal-title":"Microb Ecol"},{"key":"2022051813175439400_ref70","doi-asserted-by":"crossref","first-page":"2759","DOI":"10.1128\/JB.185.9.2759-2773.2003","article-title":"Complete genome sequence of the ammonia-oxidizing bacterium and obligate chemolithoautotroph Nitrosomonas europaea","volume":"185","author":"Chain","year":"2003","journal-title":"J Bacteriol"},{"key":"2022051813175439400_ref71","doi-asserted-by":"crossref","first-page":"2993","DOI":"10.1111\/j.1462-2920.2007.01409.x","article-title":"Whole-genome analysis of the ammonia-oxidizing bacterium, Nitrosomonas eutropha C91: implications for niche adaptation","volume":"9","author":"Stein","year":"2007","journal-title":"Environ Microbiol"},{"key":"2022051813175439400_ref72","author":"IMG-taxon 2675903041 annotated assembly - Genome - Assembly - NCBI"},{"key":"2022051813175439400_ref73","doi-asserted-by":"crossref","first-page":"1130","DOI":"10.1038\/ismej.2016.191","article-title":"An acid-tolerant ammonia-oxidizing \u03b3-proteobacterium from soil","volume":"11","author":"Hayatsu","year":"2017","journal-title":"ISME J"},{"issue":"Pt 8","key":"2022051813175439400_ref74","first-page":"2738\u201352","article-title":"Nitrososphaera viennensis gen. Nov., sp. nov., an aerobic and mesophilic, ammonia-oxidizing archaeon from soil and a member of the archaeal phylum Thaumarchaeota","volume":"64","author":"Stieglmeier","year":"2014","journal-title":"Int J Syst Evol Microbiol"},{"issue":"5","key":"2022051813175439400_ref75","doi-asserted-by":"crossref","first-page":"fiw057","DOI":"10.1093\/femsec\/fiw057","article-title":"Isolation of \u2018Candidatus Nitrosocosmicus franklandus\u2019, a novel ureolytic soil archaeal ammonia oxidiser with tolerance to high ammonia concentration","volume":"92","author":"Lehtovirta-Morley","year":"2016","journal-title":"FEMS Microbiol Ecol"},{"key":"2022051813175439400_ref76","doi-asserted-by":"crossref","first-page":"2050","DOI":"10.1128\/AEM.72.3.2050-2063.2006","article-title":"Genome sequence of the chemolithoautotrophic nitrite-oxidizing bacterium Nitrobacter winogradskyi Nb-255","volume":"72","author":"Starkenburg","year":"2006","journal-title":"Appl Environ Microbiol"},{"key":"2022051813175439400_ref77","doi-asserted-by":"crossref","first-page":"27","DOI":"10.3389\/fmicb.2013.00027","article-title":"The genome of Nitrospina gracilis illuminates the metabolism and evolution of the major marine nitrite oxidizer","volume":"4","author":"L\u00fccker","year":"2013","journal-title":"Front Microbiol"},{"key":"2022051813175439400_ref78","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1038\/nature16461","article-title":"Complete nitrification by Nitrospira bacteria","volume":"528","author":"Daims","year":"2015","journal-title":"Nature"},{"key":"2022051813175439400_ref79","doi-asserted-by":"crossref","first-page":"2172","DOI":"10.1111\/1462-2920.12674","article-title":"Physiological characterization of anaerobic ammonium oxidizing bacterium \u2018Candidatus Jettenia caeni\u2019","volume":"17","author":"Ali","year":"2015","journal-title":"Environ Microbiol"},{"key":"2022051813175439400_ref80","doi-asserted-by":"crossref","first-page":"3133","DOI":"10.1111\/1462-2920.13355","article-title":"Hydroxylamine-dependent anaerobic ammonium oxidation (anammox) by \u201cCandidatus Brocadia sinica\u201d","volume":"18","author":"Oshiki","year":"2016","journal-title":"Environ Microbiol"},{"key":"2022051813175439400_ref81","doi-asserted-by":"crossref","first-page":"1472","DOI":"10.1101\/gr.076448.108","article-title":"Genome sequence of the beta-rhizobium Cupriavidus taiwanensis and comparative genomics of rhizobia","volume":"18","author":"Amadou","year":"2008","journal-title":"Genome Res"},{"key":"2022051813175439400_ref82","author":"ASM31769v1 - Genome - Assembly - NCBI"},{"key":"2022051813175439400_ref83","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1101\/gr.5798407","article-title":"Genome characteristics of facultatively symbiotic Frankia sp. strains reflect host range and host plant biogeography","volume":"17","author":"Normand","year":"2007","journal-title":"Genome Res"},{"key":"2022051813175439400_ref84","author":"ASM16719v1 - Genome - Assembly - NCBI"},{"key":"2022051813175439400_ref85","author":"ASM1462266v1 - Genome - Assembly - NCBI"},{"key":"2022051813175439400_ref86","doi-asserted-by":"crossref","first-page":"1130","DOI":"10.1038\/ismej.2016.191","article-title":"An acid-tolerant ammonia-oxidizing & gamma-proteobacterium from soil","volume":"11","author":"Hayatsu","year":"2017","journal-title":"ISME J"},{"key":"2022051813175439400_ref87","doi-asserted-by":"crossref","first-page":"11371","DOI":"10.1073\/pnas.1506533112","article-title":"Expanded metabolic versatility of ubiquitous nitrite-oxidizing bacteria from the genus Nitrospira","volume":"112","author":"Koch","year":"2015","journal-title":"Proc Natl Acad Sci"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac071\/43745326\/bbac071.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac071\/43745326\/bbac071.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,18]],"date-time":"2022-05-18T13:25:12Z","timestamp":1652880312000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac071\/6553605"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,24]]},"references-count":87,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,5,13]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac071","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5]]},"published":{"date-parts":[[2022,3,24]]},"article-number":"bbac071"}}