{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T03:07:47Z","timestamp":1774494467563,"version":"3.50.1"},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2017,11,7]],"date-time":"2017-11-07T00:00:00Z","timestamp":1510012800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/501100001412","name":"Council of Scientific and Industrial Research","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001412","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001332","name":"CSIR","doi-asserted-by":"publisher","award":["BSC0118"],"award-info":[{"award-number":["BSC0118"]}],"id":[{"id":"10.13039\/501100001332","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001332","name":"CSIR","doi-asserted-by":"publisher","award":["BSC0121"],"award-info":[{"award-number":["BSC0121"]}],"id":[{"id":"10.13039\/501100001332","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https:\/\/github.com\/rkmlab\/perf. The source code of PERF is deposited in GitHub at https:\/\/github.com\/rkmlab\/perf under an MIT license.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx721","type":"journal-article","created":{"date-parts":[[2017,11,6]],"date-time":"2017-11-06T07:09:22Z","timestamp":1509952162000},"page":"943-948","source":"Crossref","is-referenced-by-count":45,"title":["PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences"],"prefix":"10.1093","volume":"34","author":[{"given":"Akshay Kumar","family":"Avvaru","sequence":"first","affiliation":[{"name":"CSIR \u2013 Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India"}]},{"given":"Divya Tej","family":"Sowpati","sequence":"additional","affiliation":[{"name":"CSIR \u2013 Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India"}]},{"given":"Rakesh Kumar","family":"Mishra","sequence":"additional","affiliation":[{"name":"CSIR \u2013 Centre for Cellular and Molecular Biology, Hyderabad, Telangana, India"}]}],"member":"286","published-online":{"date-parts":[[2017,11,7]]},"reference":[{"key":"2023012712472810200_btx721-B1","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1093\/nar\/27.2.573","article-title":"Tandem repeats finder: a program to analyze DNA sequences","volume":"27","author":"Benson","year":"1999","journal-title":"Nucleic Acids Res"},{"key":"2023012712472810200_btx721-B2","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1093\/bioinformatics\/btp163","article-title":"Biopython: freely available Python tools for computational molecular biology and bioinformatics","volume":"25","author":"Cock","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012712472810200_btx721-B3","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1038\/nrg1348","article-title":"Microsatellites: Simple sequences with complex evolution","volume":"5","author":"Ellegren","year":"2004","journal-title":"Nat. Rev. Genet"},{"key":"2023012712472810200_btx721-B4","doi-asserted-by":"crossref","first-page":"e22.","DOI":"10.1093\/nar\/gks881","article-title":"MsDetector: toward a standard computational tool for DNA microsatellites detection","volume":"41","author":"Girgis","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023012712472810200_btx721-B5","doi-asserted-by":"crossref","first-page":"3383","DOI":"10.1093\/nar\/gkm271","article-title":"Repeat-induced epigenetic changes in intron 1 of the frataxin gene and its consequences in Friedreich ataxia","volume":"35","author":"Greene","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023012712472810200_btx721-B6","doi-asserted-by":"crossref","first-page":"288","DOI":"10.1016\/0168-9525(92)90137-S","article-title":"Microsatellites for linkage analysis of genetic-traits","volume":"8","author":"Hearne","year":"1992","journal-title":"Trends Genet"},{"key":"2023012712472810200_btx721-B7","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/S0168-9525(97)01008-1","article-title":"Simple sequence repeats as a source of quantitative genetic variation","volume":"13","author":"Kashi","year":"1997","journal-title":"Trends Genet"},{"key":"2023012712472810200_btx721-B8","doi-asserted-by":"crossref","first-page":"3672","DOI":"10.1093\/nar\/gkg617","article-title":"mreps: efficient and flexible detection of tandem repeats in DNA","volume":"31","author":"Kolpakov","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023012712472810200_btx721-B9","doi-asserted-by":"crossref","first-page":"1844","DOI":"10.1038\/ncomms2872","article-title":"GATA simple sequence repeats function as enhancer blocker boundaries","volume":"4","author":"Kumar","year":"2013","journal-title":"Nat. Commun"},{"key":"2023012712472810200_btx721-B10","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1002\/bies.200900111","article-title":"Repeat performance: how do genome packaging and regulation depend on simple sequence repeats?","volume":"32","author":"Kumar","year":"2010","journal-title":"Bioessays"},{"key":"2023012712472810200_btx721-B11","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1093\/bib\/bbs023","article-title":"Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance","volume":"14","author":"Lim","year":"2013","journal-title":"Brief. Bioinform"},{"key":"2023012712472810200_btx721-B12","doi-asserted-by":"crossref","first-page":"564","DOI":"10.4161\/rna.24326","article-title":"AAGAG repeat RNA is an essential component of nuclear matrix in Drosophila","volume":"10","author":"Pathak","year":"2013","journal-title":"RNA. Biol"},{"key":"2023012712472810200_btx721-B13","doi-asserted-by":"crossref","first-page":"2707","DOI":"10.1093\/bioinformatics\/btw298","article-title":"SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences","volume":"32","author":"Pickett","year":"2016","journal-title":"Bioinformatics"},{"key":"2023012712472810200_btx721-B14","doi-asserted-by":"crossref","first-page":"3922","DOI":"10.1093\/bioinformatics\/btx538","article-title":"Kmer-SSR: a fast and exhaustive SSR Search Algorithm","volume":"33","author":"Pickett","year":"2017","journal-title":"Bioinformatics"},{"key":"2023012712472810200_btx721-B15","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1093\/hmg\/ddi024","article-title":"Molecular dissection of the events leading to inactivation of the FMR1 gene","volume":"14","author":"Pietrobono","year":"2005","journal-title":"Hum. Mol. Genet"},{"key":"2023012712472810200_btx721-B16","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"BEDTools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012712472810200_btx721-B17","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1016\/j.gene.2014.08.052","article-title":"Length and sequence dependent accumulation of simple sequence repeats in vertebrates: potential role in genome organization and regulation","volume":"551","author":"Ramamoorthy","year":"2014","journal-title":"Gene"},{"key":"2023012712472810200_btx721-B18","doi-asserted-by":"crossref","first-page":"1441","DOI":"10.1101\/gr.184001","article-title":"Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential","volume":"11","author":"Temnykh","year":"2001","journal-title":"Genome Res"},{"key":"2023012712472810200_btx721-B19","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1007\/s00122-002-1031-0","article-title":"Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.)","volume":"106","author":"Thiel","year":"2003","journal-title":"Theor Appl Genet"},{"key":"2023012712472810200_btx721-B20","doi-asserted-by":"crossref","first-page":"1011","DOI":"10.1101\/gr.070409.107","article-title":"The biological effects of simple tandem repeats: Lessons from the repeat expansion diseases","volume":"18","author":"Usdin","year":"2008","journal-title":"Genome Res"},{"key":"2023012712472810200_btx721-B21","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1006\/geno.1994.1151","article-title":"Genome fingerprinting by simple sequence repeat (Ssr)-anchored polymerase chain-reaction amplification","volume":"20","author":"Zietkiewicz","year":"1994","journal-title":"Genomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/6\/943\/48914467\/bioinformatics_34_6_943.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/6\/943\/48914467\/bioinformatics_34_6_943.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T08:40:12Z","timestamp":1674808812000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/6\/943\/4600186"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,11,7]]},"references-count":21,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx721","relation":{"is-referenced-by":[{"id-type":"doi","id":"10.1007\/s12686-025-01401-w","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,3,15]]},"published":{"date-parts":[[2017,11,7]]}}}