{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T11:16:43Z","timestamp":1767871003980,"version":"3.49.0"},"reference-count":34,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T00:00:00Z","timestamp":1767830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["309165\/2019-9"],"award-info":[{"award-number":["309165\/2019-9"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["409963\/2023-2"],"award-info":[{"award-number":["409963\/2023-2"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["302910\/2025-5"],"award-info":[{"award-number":["302910\/2025-5"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["311231\/2022-5"],"award-info":[{"award-number":["311231\/2022-5"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004586","name":"Funda\u00e7\u00e3o Carlos Chagas Filho de Amparo \u00e0 Pesquisa do Estado do Rio de Janeiro","doi-asserted-by":"publisher","award":["E-26\/201.446\/2022"],"award-info":[{"award-number":["E-26\/201.446\/2022"]}],"id":[{"id":"10.13039\/501100004586","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>\n                    Despite the growing availability of nuclear genomic data, mitochondrial genes remain the most widely used molecular markers in mammalian systematics. However, a quantitative assessment of the phylogenetic information content of mitochondrial loci compared to nuclear loci has never been carried out. Here, we apply a sparse learning approach based on Lasso regression to evaluate the contribution of alignment sites to phylogenetic likelihoods, providing the first estimates of phylogenetically effective lengths for markers commonly used in mammalian systematics. Analyzing more than 30,000 complete mammalian mitochondrial genomes and nuclear panels composed of either 100 randomly selected complete coding sequences or of partial gene segments from conventional markers, we examined phylogenetic informativeness at two taxonomic levels: within-species and among-species. On average, \u223c32% of mitochondrial sites and \u223c38% of nuclear sites were classified as phylogenetically informative. We found that the number of phylogenetically informative sites were positively correlated with total gene length. Therefore, longer mitochondrial genes, particularly\n                    <jats:italic>ND5<\/jats:italic>\n                    ,\n                    <jats:italic>COX1<\/jats:italic>\n                    , and\n                    <jats:italic>CYTB<\/jats:italic>\n                    , harbored the largest numbers of informative sites. Although nuclear coding sequences contained, on average, more informative sites, mitochondrial genes also yielded consistent resolution of among-species relationships. Overall, our results provide the first large-scale, quantitative comparison of phylogenetic information content across mammalian mitochondrial and nuclear genes, offering a principled framework for marker selection in future systematics studies that can be broadly applied to any lineage.\n                  <\/jats:p>","DOI":"10.3389\/fbinf.2025.1704212","type":"journal-article","created":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T06:36:11Z","timestamp":1767854171000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Assessment of phylogenetic informativeness in mitochondrial and nuclear genes for mammalian systematics using sparse learning"],"prefix":"10.3389","volume":"5","author":[{"given":"Carlos G.","family":"Schrago","sequence":"first","affiliation":[]},{"given":"Beatriz","family":"Mello","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2026,1,8]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1080\/10635150600755453","article-title":"Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative","volume":"55","author":"Anisimova","year":"2006","journal-title":"Syst. Biol."},{"key":"B2","doi-asserted-by":"publisher","first-page":"960","DOI":"10.1644\/1545-1542(2001)082%3C0960:ATOTGS%3E2.0.CO;2","article-title":"A test of the genetic species concept: Cytochrome-b sequences and mammals","volume":"82","author":"Bradley","year":"2001","journal-title":"J. Mammal."},{"key":"B3","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1007\/BF01734101","article-title":"Mitochondrial DNA sequences of primates: tempo and mode of evolution","volume":"18","author":"Brown","year":"1982","journal-title":"J. Mol. Evol."},{"key":"B4","doi-asserted-by":"publisher","first-page":"3","DOI":"10.32614\/CRAN.package.boot","article-title":"Boot: bootstrap functions","volume":"1","author":"Canty","year":"2025"},{"key":"B5","doi-asserted-by":"publisher","first-page":"e19648","DOI":"10.7717\/peerj.19648","article-title":"Cryptic diversity on the genus caenolestes (caenolestidae: paucituberculata) in the ecuadorian Andes","volume":"13","author":"Carri\u00f3n-Olmedo","year":"2025","journal-title":"PeerJ"},{"key":"B6","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1093\/oxfordjournals.molbev.a026302","article-title":"Mitochondrial genes and Mammalian phylogenies: increasing the reliability of branch length estimation","volume":"17","author":"Corneli","year":"2000","journal-title":"Mol. Biol. Evol."},{"key":"B7","doi-asserted-by":"publisher","first-page":"1240","DOI":"10.1093\/molbev\/msg136","article-title":"Assessing the phylogenetic utility of four mitochondrial genes and a nuclear intron in the Asian pit viper genus, trimeresurus: separate, simultaneous, and conditional data combination analyses","volume":"20","author":"Creer","year":"2003","journal-title":"Mol. Biol. Evol."},{"key":"B8","doi-asserted-by":"publisher","first-page":"e27138","DOI":"10.1371\/journal.pone.0027138","article-title":"Mitogenome phylogenetics: the impact of using single regions and partitioning schemes on topology, substitution rate and divergence time estimation","volume":"6","author":"Duch\u00eane","year":"2011","journal-title":"PLoS ONE"},{"key":"B9","doi-asserted-by":"publisher","first-page":"i118","DOI":"10.1093\/bioinformatics\/btac252","article-title":"A LASSO-based approach to sample sites for phylogenetic tree search","volume":"38","author":"Ecker","year":"2022","journal-title":"Bioinformatics"},{"key":"B10","first-page":"294","article-title":"DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates","volume":"3","author":"Folmer","year":"1994","journal-title":"Mol. Mar. Biol. Biotechnol."},{"key":"B11","doi-asserted-by":"publisher","DOI":"10.18637\/jss.v033.i01","article-title":"Regularization paths for generalized linear models via coordinate descent","volume":"33","author":"Friedman","year":"2010","journal-title":"J. Stat. Soft."},{"key":"B12","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1007\/BF02515385","article-title":"Evolution of the cytochromeb gene of mammals","volume":"32","author":"Irwin","year":"1991","journal-title":"J. Mol. Evol."},{"key":"B13","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1038\/nmeth.4285","article-title":"ModelFinder: fast model selection for accurate phylogenetic estimates","volume":"14","author":"Kalyaanamoorthy","year":"2017","journal-title":"Nat. Methods"},{"key":"B14","doi-asserted-by":"publisher","first-page":"772","DOI":"10.1093\/molbev\/mst010","article-title":"MAFFT multiple sequence alignment software version 7: improvements in performance and usability","volume":"30","author":"Katoh","year":"2013","journal-title":"Mol. Biol. Evol."},{"key":"B15","doi-asserted-by":"publisher","first-page":"eabn3107","DOI":"10.1126\/science.abn3107","article-title":"Integrating gene annotation with orthology inference at scale","volume":"380","author":"Kirilenko","year":"2023","journal-title":"Science"},{"key":"B16","doi-asserted-by":"publisher","first-page":"170","DOI":"10.1007\/BF02100115","article-title":"Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea","volume":"29","author":"Kishino","year":"1989","journal-title":"J. Mol. Evol."},{"key":"B17","doi-asserted-by":"publisher","first-page":"4674","DOI":"10.1093\/molbev\/msab227","article-title":"Evolutionary sparse learning for phylogenomics","volume":"38","author":"Kumar","year":"2021","journal-title":"Mol. Biol. Evol."},{"key":"B18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2013\/621604","article-title":"Evaluating phylogenetic informativeness as a predictor of phylogenetic signal for metazoan, fungal, and mammalian phylogenomic data sets","volume":"2013","author":"L\u00f3pez-Gir\u00e1ldez","year":"2013","journal-title":"BioMed Res. Int."},{"key":"B19","doi-asserted-by":"publisher","first-page":"2824","DOI":"10.1093\/gbe\/evz209","article-title":"Resolving phylogenetic relationships within Passeriformes based on mitochondrial genes and inferring the evolution of their mitogenomes in terms of duplications","volume":"11","author":"Mackiewicz","year":"2019","journal-title":"Genome Biol. Evol."},{"key":"B20","doi-asserted-by":"publisher","first-page":"e17076","DOI":"10.7717\/peerj.17076","article-title":"The efficacy of single mitochondrial genes at reconciling the complete mitogenome phylogeny\u2014a case study on dwarf chameleons","volume":"12","author":"Main","year":"2024","journal-title":"PeerJ"},{"key":"B21","doi-asserted-by":"publisher","first-page":"1530","DOI":"10.1093\/molbev\/msaa015","article-title":"IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era","volume":"37","author":"Minh","year":"2020","journal-title":"Mol. Biol. Evol."},{"key":"B22","doi-asserted-by":"publisher","first-page":"e0273330","DOI":"10.1371\/journal.pone.0273330","article-title":"Intraspecies variation of the mitochondrial genome: an evaluation for phylogenetic approaches based on the conventional choices of genes and segments on mitogenome","volume":"17","author":"Mor\u00f3n-L\u00f3pez","year":"2022","journal-title":"PLoS ONE"},{"key":"B23","doi-asserted-by":"publisher","first-page":"107328","DOI":"10.1016\/j.ympev.2021.107328","article-title":"The value of updating GenBank accessions for supermatrix phylogeny: the case of the new Guinean marsupial carnivore genus myoictis","volume":"166","author":"Phillips","year":"2022","journal-title":"Mol. Phylogenetics Evol."},{"key":"B24","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/s10914-024-09742-3","article-title":"Phylogenetic relationships of marmosops matschie, 1916 in the Atlantic Forest and topography effects on the diversity of M. paulensis tate, 1931 (didelphidae: thylamyini)","volume":"32","author":"Ribeiro","year":"2025","journal-title":"J. Mammal. Evol."},{"key":"B25","doi-asserted-by":"publisher","DOI":"10.1101\/2025.07.24.666198","article-title":"Efficient identification of phylogenetically informative alignment sites via sparse learning","author":"Schrago","year":"2025"},{"key":"B26","doi-asserted-by":"publisher","first-page":"240","DOI":"10.1111\/mam.12192","article-title":"Employing statistical learning to derive species-level genetic diversity for mammalian species","volume":"50","author":"Schrago","year":"2020","journal-title":"Mammal. Rev."},{"key":"B27","doi-asserted-by":"publisher","first-page":"492","DOI":"10.1080\/10635150290069913","article-title":"An approximately unbiased test of phylogenetic tree selection","volume":"51","author":"Shimoidara","year":"2002","journal-title":"Syst. Biol."},{"key":"B28","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection Via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J. R. Stat. Soc. Ser. B Methodol."},{"key":"B29","doi-asserted-by":"publisher","first-page":"e14156","DOI":"10.1371\/journal.pone.0014156","article-title":"Reconstructing mammalian phylogenies: a detailed comparison of the cytochrome b and cytochrome oxidase subunit I mitochondrial genes","volume":"5","author":"Tobe","year":"2010","journal-title":"PLoS ONE"},{"key":"B30","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1080\/10635150701311362","article-title":"Profiling phylogenetic informativeness","volume":"56","author":"Townsend","year":"2007","journal-title":"Syst. Biol."},{"key":"B31","doi-asserted-by":"publisher","first-page":"705","DOI":"10.1080\/10635150390235557","article-title":"Comparison of likelihood and bayesian methods for estimating divergence times using multiple gene loci and calibration points, with application to a radiation of cute-looking mouse lemur species","volume":"52","author":"Yang","year":"2003","journal-title":"Syst. Biol."},{"key":"B32","doi-asserted-by":"publisher","first-page":"933","DOI":"10.1093\/oxfordjournals.molbev.a025661","article-title":"Phylogenetic performance of mitochondrial protein-coding genes in resolving relationships among vertebrates","volume":"13","author":"Zardoya","year":"1996","journal-title":"Mol. Biol. Evol."},{"key":"B33","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1186\/s12862-021-01926-2","article-title":"Complete mitochondrial genomes reveal robust phylogenetic signals and evidence of positive selection in horseshoe bats","volume":"21","author":"Zhang","year":"2021","journal-title":"BMC Ecol. Evo"},{"key":"B34","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1093\/zoolinnean\/zlac012","article-title":"New data on the mitochondrial genome of nematocera (lower diptera): features, structures and phylogenetic implications","volume":"197","author":"Zhang","year":"2023","journal-title":"Zoological J. Linn. Soc."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1704212\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T06:36:13Z","timestamp":1767854173000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1704212\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,8]]},"references-count":34,"alternative-id":["10.3389\/fbinf.2025.1704212"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1704212","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,8]]},"article-number":"1704212"}}