{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:39:09Z","timestamp":1740184749195,"version":"3.37.3"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2020,1,23]],"date-time":"2020-01-23T00:00:00Z","timestamp":1579737600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100003150","name":"Le Fonds Qu\u00e9b\u00e9cois de la Recherche sur la Nature et les Technologies","doi-asserted-by":"crossref","award":["173878"],"award-info":[{"award-number":["173878"]}],"id":[{"id":"10.13039\/501100003150","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","award":["249644"],"award-info":[{"award-number":["249644"]}],"id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Phylogenetic trees and the methods for their analysis have played a key role in many evolutionary, ecological and bioinformatics studies. Alternatively, phylogenetic networks have been widely used to analyze and represent complex reticulate evolutionary processes which cannot be adequately studied using traditional phylogenetic methods. These processes include, among others, hybridization, horizontal gene transfer, and genetic recombination. Nowadays, sequence similarity and genome similarity networks have become an efficient tool for community analysis of large molecular datasets in comparative studies. These networks can be used for tackling a variety of complex evolutionary problems such as the identification of horizontal gene transfer events, the recovery of mosaic genes and genomes, and the study of holobionts.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>The shortest path in a phylogenetic tree is used to estimate evolutionary distances between species. We show how the shortest path concept can be extended to sequence similarity networks by defining five new distances, NetUniFrac, Spp, Spep, Spelp and Spinp, and the Transfer index, between species communities present in the network. These new distances can be seen as network analogs of the traditional UniFrac distance used to assess dissimilarity between species communities in a phylogenetic tree, whereas the Transfer index is intended for estimating the rate and direction of gene transfers, or species dispersal, between different phylogenetic, or ecological, species communities. Moreover, NetUniFrac and the Transfer index can be computed in linear time with respect to the number of edges in the network. We show how these new measures can be used to analyze microbiota and antibiotic resistance gene similarity networks.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Our NetFrac program, implemented in R and C, along with its source code, is freely available on Github at the following URL address: https:\/\/github.com\/XPHenry\/Netfrac.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa043","type":"journal-article","created":{"date-parts":[[2020,1,17]],"date-time":"2020-01-17T12:11:30Z","timestamp":1579263090000},"page":"2740-2749","source":"Crossref","is-referenced-by-count":1,"title":["Transfer index, NetUniFrac and some useful shortest path-based distances for community analysis in sequence similarity networks"],"prefix":"10.1093","volume":"36","author":[{"given":"Henry","family":"Xing","sequence":"first","affiliation":[{"name":"Department of Computer Sciences"}]},{"given":"Steven W","family":"Kembel","sequence":"additional","affiliation":[{"name":"Department of Biology , Universit\u00e9 du Qu\u00e9bec \u00e0 Montr\u00e9al, Montreal, Canada"}]},{"given":"Vladimir","family":"Makarenkov","sequence":"additional","affiliation":[{"name":"Department of Computer Sciences"}]}],"member":"286","published-online":{"date-parts":[[2020,1,23]]},"reference":[{"key":"2023013110294341700_btaa043-B1","doi-asserted-by":"crossref","first-page":"e4345","DOI":"10.1371\/journal.pone.0004345","article-title":"Using sequence similarity networks for visualization of relationships across diverse protein superfamilies","volume":"4","author":"Atkinson","year":"2009","journal-title":"PLoS One"},{"key":"2023013110294341700_btaa043-B2","doi-asserted-by":"crossref","first-page":"18266","DOI":"10.1073\/pnas.1206541109","article-title":"Evolutionary analyses of non-genealogical bonds produced by introgressive descent","volume":"109","author":"Bapteste","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013110294341700_btaa043-B3","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1016\/j.tig.2013.05.007","article-title":"Networks: expanding evolutionary thinking","volume":"29","author":"Bapteste","year":"2013","journal-title":"Trends Genet"},{"key":"2023013110294341700_btaa043-B4","doi-asserted-by":"crossref","first-page":"fux053","DOI":"10.1093\/femsre\/fux053","article-title":"Environmental factors influencing the development and spread of antibiotic resistance","volume":"42","author":"Bengtsson-Palme","year":"2018","journal-title":"FEMS Microbiol. Rev"},{"key":"2023013110294341700_btaa043-B5","doi-asserted-by":"crossref","first-page":"e144","DOI":"10.1093\/nar\/gkr735","article-title":"Towards an accurate identification of mosaic genes and partial horizontal gene transfers","volume":"39","author":"Boc","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023013110294341700_btaa043-B6","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1093\/sysbio\/syp103","article-title":"Inferring and validating horizontal gene transfer events using bipartition dissimilarity","volume":"59","author":"Boc","year":"2010","journal-title":"Syst. Biol"},{"key":"2023013110294341700_btaa043-B7","doi-asserted-by":"crossref","first-page":"W573","DOI":"10.1093\/nar\/gks485","article-title":"T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks","volume":"40","author":"Boc","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023013110294341700_btaa043-B8","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res"},{"volume-title":"Inferring Phylogenies","year":"2004","author":"Felsenstein","key":"2023013110294341700_btaa043-B9"},{"key":"2023013110294341700_btaa043-B10","doi-asserted-by":"crossref","first-page":"3228","DOI":"10.1111\/j.1462-2920.2010.02295.x","article-title":"The horizontal flow of the plasmid resistome: clues from inter-generic similarity networks","volume":"12","author":"Fondi","year":"2010","journal-title":"Environ. Microbiol"},{"key":"2023013110294341700_btaa043-B11","doi-asserted-by":"crossref","first-page":"7821","DOI":"10.1073\/pnas.122653799","article-title":"Community structure in social and biological networks","volume":"99","author":"Girvan","year":"2002","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013110294341700_btaa043-B12","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1093\/molbev\/msj030","article-title":"Application of phylogenetic networks in evolutionary studies","volume":"23","author":"Huson","year":"2006","journal-title":"Mol. Biol. Evol"},{"key":"2023013110294341700_btaa043-B13","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1093\/bioinformatics\/btt049","article-title":"MosaicFinder: identification of fused gene families in sequence similarity networks","volume":"29","author":"Jachiet","year":"2013","journal-title":"Bioinformatics"},{"key":"2023013110294341700_btaa043-B14","first-page":"275","article-title":"The rapid generation of mutation data matrices from protein sequences","volume":"8","author":"Jones","year":"1992","journal-title":"Comput. Appl. Biosci"},{"key":"2023013110294341700_btaa043-B15","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1146\/annurev.micro.55.1.709","article-title":"Horizontal gene transfer in prokaryotes: quantification and classification","volume":"55","author":"Koonin","year":"2001","journal-title":"Annu. Rev. Microbiol"},{"key":"2023013110294341700_btaa043-B16","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1038\/ismej.2010.133","article-title":"UniFrac: an effective distance metric for microbial community comparison","volume":"5","author":"Lozupone","year":"2011","journal-title":"ISME J"},{"key":"2023013110294341700_btaa043-B17","doi-asserted-by":"crossref","first-page":"026126","DOI":"10.1103\/PhysRevE.67.026126","article-title":"Mixing patterns in networks","volume":"67","author":"Newman","year":"2003","journal-title":"Phys. Rev. E"},{"key":"2023013110294341700_btaa043-B19","doi-asserted-by":"crossref","first-page":"3947","DOI":"10.1093\/molbev\/mss200","article-title":"Measuring community similarity with phylogenetic networks","volume":"29","author":"Parks","year":"2012","journal-title":"Mol. Biol. Evol"},{"key":"2023013110294341700_btaa043-B20","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1093\/molbev\/msx283","article-title":"CompositeSearch: a generalized network approach for composite gene families detection","volume":"35","author":"Pathmanathan","year":"2018","journal-title":"Mol. Biol. Evol"},{"key":"2023013110294341700_btaa043-B21","doi-asserted-by":"crossref","first-page":"4250","DOI":"10.1073\/pnas.0306398101","article-title":"Bacterial biota in the human distal esophagus","volume":"101","author":"Pei","year":"2004","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013110294341700_btaa043-B22","first-page":"406","article-title":"The neighbor-joining method: a new method for reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol. Biol. Evol"},{"key":"2023013110294341700_btaa043-B23","doi-asserted-by":"crossref","first-page":"7537","DOI":"10.1128\/AEM.01541-09","article-title":"Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities","volume":"75","author":"Schloss","year":"2009","journal-title":"Appl. Environ. Microb"},{"key":"2023013110294341700_btaa043-B24","doi-asserted-by":"crossref","first-page":"735","DOI":"10.1093\/sysbio\/syy015","article-title":"Inferring phylogenetic networks using PhyloNet","volume":"67","author":"Wen","year":"2018","journal-title":"Syst. Biol"},{"key":"2023013110294341700_btaa043-B25","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1016\/j.mbs.2018.02.005","article-title":"Phylogenetic diversity and biodiversity indices on phylogenetic networks","volume":"298","author":"Wicke","year":"2018","journal-title":"Math. Biosci"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa043\/32569034\/btaa043.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/9\/2740\/48984330\/bioinformatics_36_9_2740.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/9\/2740\/48984330\/bioinformatics_36_9_2740.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T20:38:39Z","timestamp":1675197519000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/9\/2740\/5714743"}},"subtitle":[],"editor":[{"given":"Pier","family":"Luigi Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,1,23]]},"references-count":24,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2020,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa043","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,5,1]]},"published":{"date-parts":[[2020,1,23]]}}}