{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,8]],"date-time":"2025-04-08T06:42:34Z","timestamp":1744094554348},"reference-count":23,"publisher":"World Scientific Pub Co Pte Lt","issue":"03","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2011,6]]},"abstract":"<jats:p> Hypersensitive (HS) sites in genomic sequences are reliable markers of DNA regulatory regions that control gene expression. Annotation of regulatory regions is important in understanding phenotypical differences among cells and diseases linked to pathologies in protein expression. Several computational techniques are devoted to mapping out regulatory regions in DNA by initially identifying HS sequences. Statistical learning techniques like Support Vector Machines (SVM), for instance, are employed to classify DNA sequences as HS or non-HS. This paper proposes a method to automate the basic steps in designing an SVM that improves the accuracy of such classification. The method proceeds in two stages and makes use of evolutionary algorithms. An evolutionary algorithm first designs optimal sequence motifs to associate explicit discriminating feature vectors with input DNA sequences. A second evolutionary algorithm then designs SVM kernel functions and parameters that optimally separate the HS and non-HS classes. Results show that this two-stage method significantly improves SVM classification accuracy. The method promises to be generally useful in automating the analysis of biological sequences, and we post its source code on our website. <\/jats:p>","DOI":"10.1142\/s0219720011005586","type":"journal-article","created":{"date-parts":[[2011,4,25]],"date-time":"2011-04-25T22:07:03Z","timestamp":1303769223000},"page":"399-413","source":"Crossref","is-referenced-by-count":6,"title":["A TWO-STAGE EVOLUTIONARY APPROACH FOR EFFECTIVE CLASSIFICATION OF HYPERSENSITIVE DNA SEQUENCES"],"prefix":"10.1142","volume":"09","author":[{"given":"UDAY","family":"KAMATH","sequence":"first","affiliation":[{"name":"Department of Computer Science, George Mason University, Fairfax, Virginia 20123, USA"}]},{"given":"AMARDA","family":"SHEHU","sequence":"additional","affiliation":[{"name":"Department of Computer Science, George Mason University, Fairfax, Virginia 20123, USA"}]},{"given":"KENNETH A.","family":"DE JONG","sequence":"additional","affiliation":[{"name":"Department of Computer Science, George Mason University, Fairfax, Virginia 20123, USA"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.genom.7.080505.115623"},{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.genom.8.080706.092323"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1038\/286854a0"},{"key":"rf5","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.bi.57.070188.001111"},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.89.3.1143"},{"key":"rf7","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.162342499"},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0400678101"},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1038\/nmeth721"},{"key":"rf10","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bti1047"},{"key":"rf12","volume-title":"Statistical Learning Theory","author":"Vapnik V. N.","year":"1998"},{"key":"rf13","first-page":"S1","volume":"9","author":"Habib T.","journal-title":"BMC Genom."},{"key":"rf14","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/16.9.799"},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1101\/gr.1679003"},{"key":"rf16","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-8-410"},{"key":"rf17","volume-title":"Computational Methods of Feature Selection","author":"Islamaj-Dogan R.","year":"2007"},{"key":"rf18","volume-title":"Kernel Methods in Computational Biology","author":"Noble W. S.","year":"2004"},{"key":"rf19","first-page":"1435","volume":"5","author":"Leslie C.","journal-title":"J. Mach. Learn. Res."},{"key":"rf25","first-page":"231","author":"Huang C.-L.","journal-title":"Expert Systems with Applications"},{"key":"rf30","volume-title":"Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond","author":"Sch\u00f6lkopf B.","year":"2002"},{"key":"rf31","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511809682"},{"key":"rf32","volume-title":"On the Programming of Computers by Means of Natural Selection","author":"Koza J.","year":"1992"},{"key":"rf36","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btn397"},{"key":"rf37","first-page":"1889","volume":"6","author":"Fan R.-E.","journal-title":"J. Mach. Learn. Res."}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720011005586","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T22:39:47Z","timestamp":1565131187000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720011005586"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6]]},"references-count":23,"journal-issue":{"issue":"03","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2011,6]]}},"alternative-id":["10.1142\/S0219720011005586"],"URL":"https:\/\/doi.org\/10.1142\/s0219720011005586","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,6]]}}}