{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,6]],"date-time":"2025-12-06T16:47:16Z","timestamp":1765039636417,"version":"3.41.2"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2022,8,30]],"date-time":"2022-08-30T00:00:00Z","timestamp":1661817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001407","name":"Department of Biotechnology, Ministry of Science and Technology, India","doi-asserted-by":"publisher","award":["BT\/Ag\/ Network\/Wheat\/2019-20"],"award-info":[{"award-number":["BT\/Ag\/ Network\/Wheat\/2019-20"]}],"id":[{"id":"10.13039\/501100001407","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Maintaining duplicate germplasms in genebanks hampers effective conservation and utilization of genebank resources. The redundant germplasm adds to the cost of germplasm conservation by requiring a large proportion of the genebank financial resources towards conservation rather than enriching the diversity. Besides, genome-wide-association analysis using an association panel with over-represented germplasms can be biased resulting in spurious marker-trait associations. The conventional methods of germplasm duplicate removal using passport information suffer from incomplete or missing passport information and data handling errors at various stages of germplasm enrichment. This limitation is less likely in the case of genotypic data. Therefore, we developed a web-based tool, Germplasm Duplicate Identification and Removal Tool (G-DIRT), which allows germplasm duplicate identification based on identity-by-state analysis using single-nucleotide polymorphism genotyping information along with pre-processing of genotypic data. A homozygous genotypic difference threshold of 0.1% for germplasm duplicates has been determined using tetraploid wheat genotypic data with 94.97% of accuracy. Based on the genotypic difference, the tool also builds a dendrogram that can visually depict the relationship between genotypes. To overcome the constraint of high-dimensional genotypic data, an offline version of G-DIRT in the interface of R has also been developed. The G-DIRT is expected to help genebank curators, breeders and other researchers across the world in identifying germplasm duplicates from the global genebank collections by only using the easily sharable genotypic data instead of physically exchanging the seeds or propagating materials. The web server will complement the existing methods of germplasm duplicate identification based on passport or phenotypic information being freely accessible at http:\/\/webtools.nbpgr.ernet.in\/gdirt\/.<\/jats:p>","DOI":"10.1093\/bib\/bbac348","type":"journal-article","created":{"date-parts":[[2022,8,30]],"date-time":"2022-08-30T12:59:03Z","timestamp":1661864343000},"source":"Crossref","is-referenced-by-count":6,"title":["G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data"],"prefix":"10.1093","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0799-0660","authenticated-orcid":false,"given":"Tanmaya Kumar","family":"Sahu","sequence":"first","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8579-6519","authenticated-orcid":false,"given":"Amit Kumar","family":"Singh","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shikha","family":"Mittal","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shailendra Kumar","family":"Jha","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sundeep","family":"Kumar","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sherry Rachel","family":"Jacob","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kuldeep","family":"Singh","sequence":"additional","affiliation":[{"name":"ICAR-National Bureau of Plant Genetic Resources (ICAR-NBPGR) , New Delhi, India"},{"name":"ICAR- Indian Agricultural Research Institute (ICAR-IARI) , New Delhi, India"},{"name":"International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) , Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,8,30]]},"reference":[{"volume-title":"The Second Report on the State of the World\u2019s Plant Genetic Resources for Food and Agriculture","year":"2010","author":"FAO","key":"2022092013225875200_ref1"},{"volume-title":"The Vegetable Garden","year":"1885","author":"Vilmorin-Andrieux","key":"2022092013225875200_ref2"},{"key":"2022092013225875200_ref3","doi-asserted-by":"crossref","first-page":"925","DOI":"10.3390\/plants9080925","article-title":"SNP markers and evaluation of duplicate holdings of Brassica oleracea in two European genebanks","volume":"9","author":"Palm\u00e9","year":"2020","journal-title":"Plants"},{"key":"2022092013225875200_ref4","doi-asserted-by":"crossref","first-page":"e1001595","DOI":"10.1371\/journal.pbio.1001595","article-title":"Where have all the crop phenotypes gone?","volume":"11","author":"Zamir","year":"2013","journal-title":"PLoS Biol"},{"key":"2022092013225875200_ref5","doi-asserted-by":"crossref","first-page":"e102448","DOI":"10.1371\/journal.pone.0102448","article-title":"Using genotyping-by-sequencing (GBS) for genomic discovery in cultivated oat","volume":"9","author":"Huang","year":"2014","journal-title":"PLoS One"},{"key":"2022092013225875200_ref6","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1089\/bio.2018.0033","article-title":"A case of need: linking traits to Genebank accessions","volume":"16","author":"Anglin","year":"2018","journal-title":"Biopreserv Biobank"},{"key":"2022092013225875200_ref7","doi-asserted-by":"crossref","first-page":"407","DOI":"10.3732\/ajb.1100385","article-title":"Genomics of gene banks: a case study in rice","volume":"99","author":"McCouch","year":"2012","journal-title":"Am J Bot"},{"key":"2022092013225875200_ref8","first-page":"92","article-title":"Genotyping-by-sequencing for plant breeding and genetics","volume":"5","author":"Poland","year":"2012","journal-title":"Plant Genome"},{"key":"2022092013225875200_ref9","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1038\/s41598-018-37269-0","article-title":"Efficient curation of genebanks using next generation sequencing reveals substantial duplication of germplasm accessions","volume":"9","author":"Singh","year":"2019","journal-title":"Sci Rep"},{"issue":"6","key":"2022092013225875200_ref10","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1016\/j.tplants.2021.03.010","article-title":"Designing future crops: genomics-assisted breeding comes of age","volume":"26","author":"Varshney","year":"2021","journal-title":"Trends Plant Sci"},{"issue":"2","key":"2022092013225875200_ref11","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1038\/s41588-018-0266-x","article-title":"Genebank genomics highlights the diversity of a global barley collection","volume":"51","author":"Milner","year":"2019","journal-title":"Nat Genet"},{"key":"2022092013225875200_ref12","doi-asserted-by":"crossref","first-page":"1049","DOI":"10.1007\/BF00222920","article-title":"The identification of duplicate accessions within a rice germplasm collection using RAPD analysis","volume":"90","author":"Virk","year":"1995","journal-title":"Theoret Appl Genetics"},{"key":"2022092013225875200_ref13","doi-asserted-by":"crossref","first-page":"1211","DOI":"10.1007\/s10531-004-7847-y","article-title":"Identification of duplicates for the optimization of carrot collection management","volume":"14","author":"Le Clerc","year":"2005","journal-title":"Biodivers Conserv"},{"issue":"1","key":"2022092013225875200_ref14","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1016\/j.indcrop.2011.09.004","article-title":"Determination of duplicates of accessions in a germplasm collection of flax\/linseed by means of digital image analysis","volume":"36","author":"Iva","year":"2012","journal-title":"Ind Crops Prod"},{"issue":"5","key":"2022092013225875200_ref15","doi-asserted-by":"crossref","first-page":"333","DOI":"10.21273\/JASHS.137.5.333","article-title":"Identification of \u201cduplicate\u201d accessions within the USDA-ARS National Plant Germplasm System Malus Collection","volume":"137","author":"Gross","year":"2012","journal-title":"J Am Soc Hort Sci"},{"issue":"3","key":"2022092013225875200_ref16","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1017\/S1479262117000156","article-title":"Duplication assessments in Brassica vegetable accessions","volume":"16","author":"Solberg","year":"2017","journal-title":"Plant Genet Resour"},{"key":"2022092013225875200_ref17","doi-asserted-by":"crossref","first-page":"110","DOI":"10.17221\/68\/2018-CJGPB","article-title":"Phenotyping and SSR markers as a tool for identification of duplicates in lettuce germplasm","volume":"55","author":"Sochor","year":"2019","journal-title":"Czech J Genet Plant Breed"},{"key":"2022092013225875200_ref18","doi-asserted-by":"crossref","first-page":"1057","DOI":"10.1007\/s40011-020-01178-y","article-title":"Identification of duplicates in ginger germplasm collection from Odisha using morphological and molecular characterization","volume":"90","author":"Das","year":"2020","journal-title":"Proc Natl Acad Sci India Sect B Biol Sci"},{"key":"2022092013225875200_ref19","doi-asserted-by":"crossref","first-page":"623736","DOI":"10.3389\/fgene.2020.623736","article-title":"Technological innovations for improving cassava production in sub-Saharan Africa","volume":"11","author":"Mbanjo","year":"2021","journal-title":"Front Genet"},{"key":"2022092013225875200_ref20","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1007\/s11103-021-01124-0","article-title":"DNA fingerprinting reveals varietal composition of Vietnamese cassava germplasm (Manihot esculenta Crantz) from farmers\u2019 field and genebank collections","volume":"109","author":"Ocampo","year":"2021","journal-title":"Plant Mol Biol"},{"issue":"4","key":"2022092013225875200_ref21","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1590\/S0044-59672013000400008","article-title":"Identification of duplicates of cassava accessions sampled on the north region of Brazil using microsatellite markers","volume":"43","author":"Moura","year":"2013","journal-title":"Acta Ama"},{"issue":"1","key":"2022092013225875200_ref22","doi-asserted-by":"crossref","first-page":"e03154","DOI":"10.1016\/j.heliyon.2019.e03154","article-title":"Genetic diversity and population structure analysis of Ghanaian and exotic cassava accessions using simple sequence repeat (SSR) markers","volume":"6","author":"Adjebeng-Danquah","year":"2020","journal-title":"Heliyon"},{"issue":"24","key":"2022092013225875200_ref23","doi-asserted-by":"crossref","first-page":"3326","DOI":"10.1093\/bioinformatics\/bts606","article-title":"A high-performance computing toolset for relatedness and principal component analysis of SNP data","volume":"28","author":"Zheng","year":"2012","journal-title":"Bioinformatics"},{"issue":"3","key":"2022092013225875200_ref24","doi-asserted-by":"crossref","first-page":"639","DOI":"10.1111\/1755-0998.12995","article-title":"Minor allele frequency thresholds strongly affect population structure inference with genomic data sets","volume":"19","author":"Linck","year":"2019","journal-title":"Mol Ecol Resour"},{"issue":"2","key":"2022092013225875200_ref25","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1016\/j.ajhg.2007.11.004","article-title":"Simple and efficient analysis of disease association with missing genotype data","volume":"82","author":"Lin","year":"2008","journal-title":"Am J Hum Genet"},{"author":"Gusareva","key":"2022092013225875200_ref26","article-title":"Epistasis genome-wide association interaction analysis (GWAI)"},{"issue":"5","key":"2022092013225875200_ref27","doi-asserted-by":"crossref","first-page":"887","DOI":"10.1086\/429864","article-title":"A note on exact tests of Hardy\u2013Weinberg equilibrium","volume":"76","author":"Wigginton","year":"2005","journal-title":"Am J Hum Genet"},{"issue":"2","key":"2022092013225875200_ref28","doi-asserted-by":"crossref","first-page":"e1608","DOI":"10.1002\/mpr.1608","article-title":"A tutorial on conducting genome-wide association studies: quality control and statistical analysis","volume":"27","author":"Marees","year":"2018","journal-title":"Int J Methods Psychiatr Res"},{"issue":"6","key":"2022092013225875200_ref29","doi-asserted-by":"crossref","first-page":"1071","DOI":"10.1086\/510257","article-title":"Exact tests of Hardy\u2013Weinberg equilibrium and homogeneity of disequilibrium across strata","volume":"79","author":"Schaid","year":"2006","journal-title":"Am J Hum Genet"},{"key":"2022092013225875200_ref30","doi-asserted-by":"crossref","first-page":"e90346","DOI":"10.1371\/journal.pone.0090346","article-title":"TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline","volume":"9","author":"Glaubitz","year":"2014","journal-title":"PLoS One"},{"issue":"15","key":"2022092013225875200_ref31","doi-asserted-by":"crossref","first-page":"2251","DOI":"10.1093\/bioinformatics\/btx145","article-title":"SeqArray-a storage-efficient high-performance data format for WGS variant calls","volume":"33","author":"Zheng","year":"2017","journal-title":"Bioinformatics"},{"issue":"3","key":"2022092013225875200_ref32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v064.i03","article-title":"Exploring diallelic genetic markers: the HardyWeinberg package","volume":"64","author":"Graffelman","year":"2015","journal-title":"J Stat Softw"},{"issue":"19","key":"2022092013225875200_ref33","doi-asserted-by":"crossref","first-page":"2811","DOI":"10.1093\/bioinformatics\/btu393","article-title":"Circlize implements and enhances circular visualization in R","volume":"30","author":"Gu","year":"2014","journal-title":"Bioinformatics"},{"key":"2022092013225875200_ref34","doi-asserted-by":"crossref","DOI":"10.3389\/fpls.2020.569905","article-title":"The global durum wheat panel (GDP): an international platform to identify and exchange beneficial alleles","volume":"11","author":"Mazzucotelli","year":"2020","journal-title":"Front Plant Sci"},{"key":"2022092013225875200_ref35","doi-asserted-by":"crossref","first-page":"4572","DOI":"10.1038\/s41467-020-18404-w","article-title":"Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints","volume":"11","author":"Sansaloni","year":"2020","journal-title":"Nat Commun"},{"key":"2022092013225875200_ref36","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/s41586-018-0063-9","article-title":"Genomic variation in 3,010 diverse accessions of Asian cultivated rice","volume":"557","author":"Wang","year":"2018","journal-title":"Nature"},{"issue":"1","key":"2022092013225875200_ref37","doi-asserted-by":"crossref","DOI":"10.3835\/plantgenome2018.06.0044","article-title":"An integrated genotyping-by-sequencing polymorphism map for over 10,000 sorghum genotypes","volume":"12","author":"Hu","year":"2019","journal-title":"Plant Genome"},{"key":"2022092013225875200_ref38","doi-asserted-by":"crossref","first-page":"16308","DOI":"10.1038\/s41598-020-73321-8","article-title":"Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding","volume":"10","author":"Wang","year":"2020","journal-title":"Sci Rep"},{"key":"2022092013225875200_ref39","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1038\/s41586-021-04066-1","article-title":"A chickpea genetic variation map based on the sequencing of 3,366 genomes","volume":"599","author":"Varshney","year":"2021","journal-title":"Nature"},{"issue":"2","key":"2022092013225875200_ref40","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1111\/pbi.13466","article-title":"Soybean (Glycine max) haplotype map (GmHapMap): a universal resource for soybean translational and functional genomics","volume":"19","author":"Torkamaneh","year":"2020","journal-title":"Plant Biotechnol J"},{"issue":"4","key":"2022092013225875200_ref41","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1590\/1678-992x-2017-0389","article-title":"Identification of duplicates in cassava germplasm banks based on single-nucleotide polymorphisms (SNPs)","volume":"76","author":"Albuquerque","year":"2019","journal-title":"Sci Agric"},{"issue":"11","key":"2022092013225875200_ref42","doi-asserted-by":"crossref","first-page":"447","DOI":"10.3389\/fgene.2020.00447","article-title":"Recommendations for choosing the genotyping method and best practices for quality control in crop genome-wide association studies","volume":"5","author":"Pavan","year":"2020","journal-title":"Front Genet"},{"key":"2022092013225875200_ref43","doi-asserted-by":"crossref","DOI":"10.1186\/s12864-019-5824-9","article-title":"Evaluation of linkage disequilibrium, population structure, and genetic diversity in the U.S. peanut mini core collection","volume":"20","author":"Otyama","year":"2019","journal-title":"BMC Genomics"},{"key":"2022092013225875200_ref44","article-title":"PGRdup: discover probable duplicates in plant genetic resources collections","author":"Aravind","year":"2021","journal-title":"R package version 0.2.3.7"},{"issue":"3","key":"2022092013225875200_ref45","first-page":"306","article-title":"Plant genetic resources in India: management and utilization","volume":"24","author":"Singh","year":"2020","journal-title":"Vavilovskii Zhurnal Genet Selektsii"},{"key":"2022092013225875200_ref46","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1111\/pbr.12252","article-title":"Identification of a diverse mini-core panel of Indian rice germplasm based on genotyping using microsatellite markers","volume":"134","author":"Tiwari","year":"2015","journal-title":"Plant Breed"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac348\/45936334\/bbac348.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac348\/45936334\/bbac348.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,20]],"date-time":"2022-09-20T18:01:04Z","timestamp":1663696864000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac348\/6678959"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,30]]},"references-count":46,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac348","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2022,9]]},"published":{"date-parts":[[2022,8,30]]},"article-number":"bbac348"}}