{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T08:21:54Z","timestamp":1769156514794,"version":"3.49.0"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2016,12,9]],"date-time":"2016-12-09T00:00:00Z","timestamp":1481241600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The local score of a biological sequence analysis is a mathematical tool largely used to analyse biological sequences. Consequently, determining an accurate estimation of its distribution is crucial.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>First, we study the accuracy of classical results on the local score distribution in independent and identically distributed model using a Kolmogorov-Smirnov goodness of fit test. Second, we highlight how the length of the segment that realizes the local score improves the classical setting based on local score only. Finally, we study which part of the sequence contributes to the local score.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btw699","type":"journal-article","created":{"date-parts":[[2016,11,8]],"date-time":"2016-11-08T20:07:04Z","timestamp":1478635624000},"page":"654-660","source":"Crossref","is-referenced-by-count":9,"title":["Statistical significance based on length and position of the local score in a model of i.i.d. 
sequences"],"prefix":"10.1093","volume":"33","author":[{"given":"Agn\u00e8s","family":"Lagnoux","sequence":"first","affiliation":[{"name":"Institut de Math\u00e9matiques de Toulouse, UMR5219, Universit\u00e9 de Toulouse 2 Jean Jaur\u00e8s, 5 all\u00e9es Antonio Machado, Toulouse, Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sabine","family":"Mercier","sequence":"additional","affiliation":[{"name":"Institut de Math\u00e9matiques de Toulouse, UMR5219, Universit\u00e9 de Toulouse 2 Jean Jaur\u00e8s, 5 all\u00e9es Antonio Machado, Toulouse, Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pierre","family":"Vallois","sequence":"additional","affiliation":[{"name":"Institut Elie Cartan, UMR7502 CNRS, INRIA-BIGS, Universit\u00e9 de Lorraine, Vandoeuvre-l\u00e8s-Nancy Cedex, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2016,12,9]]},"reference":[{"key":"2023020204432299000_btw699-B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol"},{"key":"2023020204432299000_btw699-B2","doi-asserted-by":"crossref","first-page":"1157","DOI":"10.1214\/aop\/1176991262","article-title":"The Erdos-Renyi strong law for pattern matching with a given proportion of mismatches","volume":"17","author":"Arratia","year":"1989","journal-title":"Ann. Prob"},{"key":"2023020204432299000_btw699-B3","first-page":"200","article-title":"A phase transition for the score in matching random sequences allowing deletions","volume":"4","author":"Arratia","year":"1994","journal-title":"Adv. Appl. 
Prob"},{"key":"2023020204432299000_btw699-B4","volume-title":"Problems and Solutions in Biological Sequence Analysis","author":"Borodovsky","year":"2006"},{"key":"2023020204432299000_btw699-B5","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1239\/jap\/1053003554","article-title":"An improved approximation for assessing the statistical significance of molecular sequence features","volume":"40","author":"Cellier","year":"2003","journal-title":"J. Appl. Prob"},{"key":"2023020204432299000_btw699-B6","doi-asserted-by":"crossref","DOI":"10.1016\/j.spa.2014.07.003","article-title":"Elements related to the largest complete excursion of a reflected Brownian motion stopped at a fixed time. Application to local score","volume":"124","author":"Chabriac","year":"2014","journal-title":"Stoch. Proc. Appl"},{"key":"2023020204432299000_btw699-B7","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1007\/s00574-006-0023-0","article-title":"Random projections and goodness-of-fit tests in infinite-dimensional spaces","volume":"37","author":"Cuesta-Albertos","year":"2006","journal-title":"Bull. Braz. Math. Soc. (N.S.)"},{"key":"2023020204432299000_btw699-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0304-4149(03)00061-9","article-title":"Asymptotic behavior of the local score of independent and identically distributed random sequences","volume":"107","author":"Daudin","year":"2003","journal-title":"Stoch. Proc. Appl"},{"key":"2023020204432299000_btw699-B9","first-page":"1737","article-title":"Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d. variables","volume":"19","author":"Dembo","year":"1991","journal-title":"Ann. Prob"},{"key":"2023020204432299000_btw699-B10","first-page":"1756","article-title":"Strong limit theorems of empirical distributions for large segmental exceedances of partial sums of Markov variables","volume":"19","author":"Dembo","year":"1991","journal-title":"Ann. 
Prob"},{"key":"2023020204432299000_btw699-B11","author":"Etienne","year":"2002"},{"key":"2023020204432299000_btw699-B12","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1023\/B:MCAP.0000026559.87023.ec","article-title":"Approximation of the distribution of the supremum of a centered random walk. Application to the local score","volume":"6","author":"Etienne","year":"2004","journal-title":"Methodol. Comput. Appl. Prob"},{"key":"2023020204432299000_btw699-B13","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1016\/S0167-7152(97)00020-5","article-title":"A multivariate Kolmogorov-Smirnov test of goodness of fit","volume":"35","author":"Justel","year":"1997","journal-title":"Stat. Prob. Lett"},{"key":"2023020204432299000_btw699-B14","doi-asserted-by":"crossref","first-page":"13355","DOI":"10.1073\/pnas.0501804102","article-title":"Statistical signals in bioinformatics","volume":"102","author":"Karlin","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020204432299000_btw699-B15","doi-asserted-by":"crossref","first-page":"2264","DOI":"10.1073\/pnas.87.6.2264","article-title":"Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes","volume":"87","author":"Karlin","year":"1990","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020204432299000_btw699-B16","doi-asserted-by":"crossref","first-page":"113","DOI":"10.2307\/1427732","article-title":"Limit distributions of maximal segmental score among Markov-dependent partial sums","volume":"24","author":"Karlin","year":"1992","journal-title":"Adv. Appl. Prob"},{"key":"2023020204432299000_btw699-B17","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1214\/aop\/1176991772","article-title":"Maximal length of common words among random letter sequences","volume":"16","author":"Karlin","year":"1988","journal-title":"Ann. 
Prob"},{"key":"2023020204432299000_btw699-B19","first-page":"461\u2013463","article-title":"Confidence limits for an unknown distribution function","volume":"12","author":"Kolmogorov","year":"1941","journal-title":"Ann. Math. Stat"},{"key":"2023020204432299000_btw699-B20","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","article-title":"A simple method for displaying the hydropathic character of a protein","volume":"157","author":"Kyte","year":"1982","journal-title":"J. Mol. Biol"},{"key":"2023020204432299000_btw699-B21","first-page":"1","article-title":"Probability that the maximum of the reflected Brownian motion over a finite interval [0,t] is achieved by its last zero before t","volume":"20","author":"Lagnoux","year":"2015","journal-title":"Electron. Commun. Prob"},{"key":"2023020204432299000_btw699-B22","volume-title":"An Introduction to Bioinformatics","author":"Lesk","year":"2005"},{"key":"2023020204432299000_btw699-B23","author":"Lopes","year":"2007"},{"key":"2023020204432299000_btw699-B24","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbk001","article-title":"Statistical significance in biological sequence analysis","author":"Mitrophanov","year":"2006","journal-title":"Brief. Bioinformatics"},{"key":"2023020204432299000_btw699-B25","author":"Mercier","year":"1999"},{"key":"2023020204432299000_btw699-B26","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1089\/106652701752236197","article-title":"Exact distribution for the local score of one i.i.d. random sequence","volume":"8","author":"Mercier","year":"2001","journal-title":"J. Comp. Biol"},{"key":"2023020204432299000_btw699-B27","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1089\/cmb.2009.0198","article-title":"Alignment-free sequence comparison (I): statistics and power","volume":"16","author":"Reinert","year":"2009","journal-title":"J. Comput. 
Biol"},{"key":"2023020204432299000_btw699-B28","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1109\/TCBB.2007.1023","article-title":"On the length of the longest exact position match in a random sequence","volume":"4","author":"Reinert","year":"2007","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform"},{"key":"2023020204432299000_btw699-B29","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-6846-3","volume-title":"Introduction to Computational Biology: Maps, Sequences and Genomes","author":"Waterman","year":"1995"},{"key":"2023020204432299000_btw699-B30","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2105-12-47","article-title":"Accurate statistics for local sequence alignment with position-dependent scoring by rare-event sampling","volume":"12","author":"Wolfsheimer","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020204432299000_btw699-B31","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1186\/s12859-015-0732-8","article-title":"Statistical significance approximation in local trend analysis of high-throughput timeseries data using the theory of Markov chains","volume":"16","author":"Xia","year":"2015","journal-title":"BMC 
Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/5\/654\/49037970\/bioinformatics_33_5_654.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/5\/654\/49037970\/bioinformatics_33_5_654.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T04:48:41Z","timestamp":1675313321000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/5\/654\/3056040"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2016,12,9]]},"references-count":30,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2017,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw699","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,3,1]]},"published":{"date-parts":[[2016,12,9]]}}}