{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T12:39:48Z","timestamp":1722688788469},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"18","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Array comparative genomic hybridization (arrayCGH) is widely used to measure DNA copy numbers in cancer research. ArrayCGH data report log-ratio intensities of thousands of probes sampled along the chromosomes. Typically, the choices of the locations and the lengths of the probes vary in different experiments. This discrepancy in choosing probes poses a challenge in integrated classification or analysis across multiple arrayCGH datasets. We propose an alignment-based framework to integrate arrayCGH samples generated from different probe sets. The alignment framework seeks an optimal alignment between the probe series of one arrayCGH sample and the probe series of another sample, intended to find the maximum possible overlap of DNA copy number variations between the two measured chromosomes. An alignment kernel is introduced for integrative patient sample classification and a multiple alignment algorithm is also introduced for identifying common regions with copy number aberrations.<\/jats:p>\n               <jats:p>Results: The probe alignment kernel and the MPA algorithm were experimented to integrate three bladder cancer datasets as well as artificial datasets. In the experiments, by integrating arrayCGH samples from multiple datasets, the probe alignment kernel used with support vector machines significantly improved patient sample classification accuracy over other baseline kernels. The experiments also demonstrated that the multiple probe alignment (MPA) algorithm can find common DNA aberrations that cannot be identified with the standard interpolation method. Furthermore, the MPA algorithm also identified many known bladder cancer DNA aberrations containing four known bladder cancer genes, three of which cannot be detected by interpolation.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/www.cs.umn.edu\/compbio\/ProbeAlign<\/jats:p>\n               <jats:p>Contact: \u00a0kuang@cs.umn.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq428","type":"journal-article","created":{"date-parts":[[2010,7,22]],"date-time":"2010-07-22T03:12:05Z","timestamp":1279768325000},"page":"2313-2320","source":"Crossref","is-referenced-by-count":7,"title":["Integrative classification and analysis of multiple arrayCGH datasets with probe alignment"],"prefix":"10.1093","volume":"26","author":[{"given":"Ze","family":"Tian","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, University of Minnesota Twin Cities, Minneapolis, MN, USA"}]},{"given":"Rui","family":"Kuang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of Minnesota Twin Cities, Minneapolis, MN, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,7,21]]},"reference":[{"key":"2023012508212455800_B1","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1093\/bioinformatics\/17.6.495","article-title":"Aligning gene expression time series with time warping algorithms","volume":"17","author":"Aach","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012508212455800_B2","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"issue":"Pt 1","key":"2023012508212455800_B3","doi-asserted-by":"crossref","first-page":"7012","DOI":"10.1158\/1078-0432.CCR-05-0177","article-title":"Bladder cancer stage and outcome by array-based comparative genomic hybridization","volume":"11","author":"Blaveri","year":"2005","journal-title":"Clin. Cancer Res"},{"issue":"Suppl. 7","key":"2023012508212455800_B4","doi-asserted-by":"crossref","first-page":"S16","DOI":"10.1038\/ng2028","article-title":"Methods and strategies for analyzing copy number variation using DNA microarrays","volume":"39","author":"Carter","year":"2007","journal-title":"Nat. Genet"},{"key":"2023012508212455800_B5","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790492","volume-title":"Biological Sequence Analysis: Probabilistic models of proteins and nucleic acids","author":"Durbin","year":"1998"},{"key":"2023012508212455800_B6","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1038\/nrg1767","article-title":"Structural variation in the human genome","volume":"7","author":"Feuk","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2023012508212455800_B7","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1198\/016214507000000923","article-title":"Bayesian hidden Markov modeling of array CGH data","volume":"103","author":"Guha","year":"2008","journal-title":"J. Am. Stat. Assoc"},{"key":"2023012508212455800_B8","doi-asserted-by":"crossref","DOI":"10.1186\/1755-8794-1-3","article-title":"Tiling resolution array CGH and high density expression profiling of urothelial carcinomas delineate genomic amplicons and candidate target genes specific for advanced tumors","volume":"1","author":"Heidenblad","year":"2008","journal-title":"BMC Med. Genomics"},{"key":"2023012508212455800_B9","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/0024-3795(88)90223-6","article-title":"Computing a nearest symmetric positive semidefinite matrix","volume":"103","author":"Higham","year":"1988","journal-title":"Linear Algebra Appl"},{"key":"2023012508212455800_B10","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1089\/106652703322756113","article-title":"Combining pairwise-sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships","volume":"10","author":"Liao","year":"2003","journal-title":"J. Comput. Biol"},{"key":"2023012508212455800_B11","doi-asserted-by":"crossref","first-page":"I86","DOI":"10.1093\/bioinformatics\/btn145","article-title":"Classification and feature selection algorithms for multi-class CGH data","volume":"24","author":"Liu","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508212455800_B12","doi-asserted-by":"crossref","first-page":"6538","DOI":"10.1038\/sj.onc.1209946","article-title":"E2F3 is the main target gene of the 6p22 amplicon with high specificity for human bladder cancer","volume":"25","author":"Oeggerli","year":"2006","journal-title":"Oncogene"},{"key":"2023012508212455800_B13","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","article-title":"Circular binary segmentation for the analysis of array-based DNA copy number data","volume":"5","author":"Olshen","year":"2004","journal-title":"Biostatistics"},{"key":"2023012508212455800_B14","doi-asserted-by":"crossref","first-page":"I375","DOI":"10.1093\/bioinformatics\/btn188","article-title":"Classification of arrayCGH data using fused SVM","volume":"24","author":"Rapaport","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508212455800_B15","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"key":"2023012508212455800_B16","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1186\/gm62","article-title":"Copy number variations and cancer","volume":"1","author":"Shlien","year":"2009","journal-title":"Genome Med"},{"key":"2023012508212455800_B17","doi-asserted-by":"crossref","first-page":"1386","DOI":"10.1038\/ng1923","article-title":"Regional copy number-independent deregulation of transcription in cancer","volume":"38","author":"Stransky","year":"2006","journal-title":"Nat. Genet"},{"key":"2023012508212455800_B18","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/ejhg.2009.47","article-title":"Copy number variation and association analysis of SHANK3 as a candidate gene for autism in the IMGSAC collection","volume":"17","author":"Sykes","year":"2009","journal-title":"Eur. J. Hum. Genet"},{"key":"2023012508212455800_B19","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","article-title":"CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice","volume":"22","author":"Thompson","year":"1994","journal-title":"Nucleic Acids Res"},{"key":"2023012508212455800_B20","doi-asserted-by":"crossref","first-page":"2831","DOI":"10.1093\/bioinformatics\/btp467","article-title":"A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge","volume":"25","author":"Tian","year":"2009","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/2313\/48856783\/bioinformatics_26_18_2313.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/2313\/48856783\/bioinformatics_26_18_2313.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:22:05Z","timestamp":1674634925000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/18\/2313\/209213"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,7,21]]},"references-count":20,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2010,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq428","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,9,15]]},"published":{"date-parts":[[2010,7,21]]}}}