{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,15]],"date-time":"2025-11-15T10:15:24Z","timestamp":1763201724815},"reference-count":48,"publisher":"World Scientific Pub Co Pte Ltd","issue":"06","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:p>Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.<\/jats:p>","DOI":"10.1142\/s0219720012500163","type":"journal-article","created":{"date-parts":[[2012,6,15]],"date-time":"2012-06-15T02:13:44Z","timestamp":1339726424000},"page":"1250016","source":"Crossref","is-referenced-by-count":7,"title":["SUITE OF TOOLS FOR STATISTICAL N-GRAM LANGUAGE MODELING FOR PATTERN MINING IN WHOLE GENOME SEQUENCES"],"prefix":"10.1142","volume":"10","author":[{"given":"MADHAVI K.","family":"GANAPATHIRAJU","sequence":"first","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Boulevard, Suite BAUM 423, Pittsburgh, PA 15206-3701, USA"}]},{"given":"ASIA D.","family":"MITCHELL","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Boulevard, Suite BAUM 423, Pittsburgh, PA 15206-3701, USA"}]},{"given":"MOHAMED","family":"THAHIR","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Boulevard, Suite BAUM 423, Pittsburgh, PA 15206-3701, USA"},{"name":"Intelligent Systems Program, University of Pittsburgh, USA"}]},{"given":"KAMIYA","family":"MOTWANI","sequence":"additional","affiliation":[{"name":"Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore 560012, India"}]},{"given":"SESHAN","family":"ANANTHASUBRAMANIAN","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, University of Pittsburgh, 5607 Baum Boulevard, Suite BAUM 423, Pittsburgh, PA 15206-3701, USA"},{"name":"Intelligent Systems Program, University of Pittsburgh, USA"}]}],"member":"219","published-online":{"date-parts":[[2012,10,18]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.2165\/00822942-200403020-00013"},{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-32263-4_2"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1007\/s12038-007-0087-z"},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-12-12"},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-9-S1-S4"},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1002\/prot.20373"},{"key":"rf10","doi-asserted-by":"publisher","DOI":"10.2165\/00822942-200403020-00008"},{"key":"rf11","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh580"},{"key":"rf12","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2008.03.007"},{"key":"rf13","doi-asserted-by":"publisher","DOI":"10.1016\/j.cmpb.2008.10.014"},{"key":"rf14","doi-asserted-by":"publisher","DOI":"10.1002\/prot.21480"},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-9-510"},{"key":"rf16","doi-asserted-by":"publisher","DOI":"10.1016\/j.cmpb.2005.11.007"},{"key":"rf17","doi-asserted-by":"publisher","DOI":"10.1016\/j.joi.2008.11.005"},{"key":"rf18","doi-asserted-by":"publisher","DOI":"10.1038\/35057062"},{"key":"rf19","doi-asserted-by":"publisher","DOI":"10.1038\/35057039"},{"key":"rf20","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2164-9-175"},{"key":"rf21","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/27.2.573"},{"volume-title":"Genomes","year":"2002","author":"Brown T. A.","key":"rf22"},{"key":"rf23","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2180-1-2"},{"key":"rf24","doi-asserted-by":"publisher","DOI":"10.1136\/jmg.2008.058909"},{"key":"rf25","doi-asserted-by":"publisher","DOI":"10.1111\/j.1399-0004.2007.00903.x"},{"key":"rf26","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/29.1.320"},{"key":"rf27","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.132275999"},{"key":"rf28","doi-asserted-by":"publisher","DOI":"10.1101\/gr.079244.108"},{"key":"rf29","doi-asserted-by":"publisher","DOI":"10.1016\/j.mrfmmm.2006.01.014"},{"key":"rf30","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2202-7-65"},{"key":"rf31","doi-asserted-by":"publisher","DOI":"10.1086\/512129"},{"key":"rf32","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkm181"},{"key":"rf33","doi-asserted-by":"publisher","DOI":"10.1038\/ng1515"},{"key":"rf34","doi-asserted-by":"publisher","DOI":"10.1373\/clinchem.2007.072629"},{"key":"rf35","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(87)90689-9"},{"key":"rf36","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511574931"},{"key":"rf37","first-page":"1149","volume":"29","author":"Stefan K.","journal-title":"Software: Practice and Experience"},{"key":"rf40","first-page":"307","volume":"11","author":"Goto N.","journal-title":"Genome. Inform."},{"key":"rf42","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-3-8"},{"key":"rf43","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/16.5.482"},{"key":"rf44","doi-asserted-by":"publisher","DOI":"10.1016\/S0168-9525(00)02024-2"},{"key":"rf45","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkg573"},{"key":"rf48","doi-asserted-by":"publisher","DOI":"10.1101\/gr.229202. Article published online before March 2002"},{"key":"rf49","doi-asserted-by":"publisher","DOI":"10.1016\/S0960-9822(02)01220-4"},{"key":"rf50","doi-asserted-by":"publisher","DOI":"10.1101\/gr.194201"},{"key":"rf53","first-page":"167","volume":"4","author":"SW K.","journal-title":"Genom. Inform."},{"key":"rf54","doi-asserted-by":"publisher","DOI":"10.2165\/00822942-200403020-00013"},{"key":"rf56","doi-asserted-by":"publisher","DOI":"10.1016\/j.jda.2004.08.002"},{"key":"rf57","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-32263-4_2"},{"key":"rf58","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.192319099"},{"key":"rf59","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.3233\/ISB-2009-0388","volume":"9","author":"Rani T. S.","journal-title":"In. Silico. Biol."}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720012500163","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,7,9]],"date-time":"2020-07-09T03:46:02Z","timestamp":1594266362000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720012500163"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,18]]},"references-count":48,"journal-issue":{"issue":"06","published-online":{"date-parts":[[2012,10,18]]},"published-print":{"date-parts":[[2012,12]]}},"alternative-id":["10.1142\/S0219720012500163"],"URL":"https:\/\/doi.org\/10.1142\/s0219720012500163","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"type":"print","value":"0219-7200"},{"type":"electronic","value":"1757-6334"}],"subject":[],"published":{"date-parts":[[2012,10,18]]}}}