{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,1,2]],"date-time":"2024-01-02T10:00:57Z","timestamp":1704189657788},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Gene expression profiling technologies have gradually become a community standard tool for clinical applications. For example, gene expression data has been analyzed to reveal novel disease subtypes (class discovery) and assign particular samples to well-defined classes (class prediction). In the past decade, many effective methods have been proposed for individual applications. However, there is still a pressing need for a unified framework that can reveal the complicated relationships between samples.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We propose a novel convex optimization model to perform class discovery and class prediction in a unified framework. An efficient algorithm is designed and software named OTCC (Optimization Tool for Clustering and Classification) is developed. Comparison in a simulated dataset shows that our method outperforms the existing methods. We then applied OTCC to acute leukemia and breast cancer datasets. The results demonstrate that our method not only can reveal the subtle structures underlying those cancer gene expression data but also can accurately predict the class labels of unknown cancer samples. Therefore, our method holds the promise to identify novel cancer subtypes and improve diagnosis.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>We propose a unified computational framework for class discovery and class prediction to facilitate the discovery and prediction of subtle subtypes of cancers. Our method can be generally applied to multiple types of measurements, e.g., gene expression profiling, proteomic measuring, and recent next-generation sequencing, since it only requires the similarities among samples as input.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-13-70","type":"journal-article","created":{"date-parts":[[2012,5,1]],"date-time":"2012-05-01T18:14:38Z","timestamp":1335896078000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["A unified computational model for revealing and predicting subtle subtypes of cancers"],"prefix":"10.1186","volume":"13","author":[{"given":"Xianwen","family":"Ren","sequence":"first","affiliation":[]},{"given":"Yong","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jiguang","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xiang-Sun","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,5,1]]},"reference":[{"issue":"5","key":"5304_CR1","doi-asserted-by":"publisher","first-page":"882","DOI":"10.1183\/09031936.01.00106601","volume":"18","author":"R Bals","year":"2001","unstructured":"Bals R, Jany B: Identification of disease genes by expression profiling. Eur Respir J 2001, 18(5):882\u2013889. 10.1183\/09031936.01.00106601","journal-title":"Eur Respir J"},{"issue":"5","key":"5304_CR2","doi-asserted-by":"publisher","first-page":"755","DOI":"10.1212\/WNL.57.5.755","volume":"57","author":"SA Greenberg","year":"2001","unstructured":"Greenberg SA: DNA microarray gene expression analysis technology and its application to neurological disorders. Neurology 2001, 57(5):755\u2013761. 10.1212\/WNL.57.5.755","journal-title":"Neurology"},{"issue":"1","key":"5304_CR3","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1016\/S0008-6363(01)00516-8","volume":"54","author":"PA Henriksen","year":"2002","unstructured":"Henriksen PA, Kotelevtsev Y: Application of gene expression profiling to cardiovascular disease. Cardiovasc Res 2002, 54(1):16\u201324. 10.1016\/S0008-6363(01)00516-8","journal-title":"Cardiovasc Res"},{"issue":"5","key":"5304_CR4","doi-asserted-by":"publisher","first-page":"405","DOI":"10.1016\/j.jala.2010.06.011","volume":"15","author":"A Lagraulet","year":"2010","unstructured":"Lagraulet A: Current Clinical and Pharmaceutical Applications of Microarrays: From Disease Biomarkers Discovery to Automated Diagnostics. J Assoc Lab Autom 2010, 15(5):405\u2013413. 10.1016\/j.jala.2010.06.011","journal-title":"J Assoc Lab Autom"},{"issue":"5439","key":"5304_CR5","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1126\/science.286.5439.531","volume":"286","author":"TR Golub","year":"1999","unstructured":"Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, et al.: Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science 1999, 286(5439):531\u2013537. 10.1126\/science.286.5439.531","journal-title":"Science"},{"issue":"12","key":"5304_CR6","doi-asserted-by":"publisher","first-page":"4164","DOI":"10.1073\/pnas.0308531101","volume":"101","author":"J-P Brunet","year":"2004","unstructured":"Brunet J-P, Tamayo P, Golub TR, Mesirov JP: Metagenes and molecular pattern discovery using matrix factorization. Proc Nat Acad Sci USA 2004, 101(12):4164\u20134169. 10.1073\/pnas.0308531101","journal-title":"Proc Nat Acad Sci USA"},{"issue":"21","key":"5304_CR7","doi-asserted-by":"publisher","first-page":"3970","DOI":"10.1093\/bioinformatics\/bti653","volume":"21","author":"Y Gao","year":"2005","unstructured":"Gao Y, Church G: Improving molecular cancer class discovery through sparse non-negative matrix factorization. Bioinformatics 2005, 21(21):3970\u20133975. 10.1093\/bioinformatics\/bti653","journal-title":"Bioinformatics"},{"issue":"16","key":"5304_CR8","doi-asserted-by":"publisher","first-page":"2131","DOI":"10.1093\/bioinformatics\/btg296","volume":"19","author":"AL Hsu","year":"2003","unstructured":"Hsu AL, Tang S-L, Halgamuge SK: An unsupervised hierarchical dynamic self-organizing approach to cancer class discovery and marker gene identification in microarray data. Bioinformatics 2003, 19(16):2131\u20132140. 10.1093\/bioinformatics\/btg296","journal-title":"Bioinformatics"},{"issue":"12","key":"5304_CR9","doi-asserted-by":"publisher","first-page":"1495","DOI":"10.1093\/bioinformatics\/btm134","volume":"23","author":"H Kim","year":"2007","unstructured":"Kim H, Park H: Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis. Bioinformatics 2007, 23(12):1495\u20131502. 10.1093\/bioinformatics\/btm134","journal-title":"Bioinformatics"},{"issue":"7","key":"5304_CR10","doi-asserted-by":"publisher","first-page":"811","DOI":"10.1093\/bioinformatics\/btg095","volume":"19","author":"W Li","year":"2003","unstructured":"Li W, Fan M, Xiong M: SamCluster: an integrated scheme for automatic discovery of sample classes using gene expression profile. Bioinformatics 2003, 19(7):811\u2013817. 10.1093\/bioinformatics\/btg095","journal-title":"Bioinformatics"},{"issue":"16","key":"5304_CR11","doi-asserted-by":"publisher","first-page":"i90","DOI":"10.1093\/bioinformatics\/btn279","volume":"24","author":"I Steinfeld","year":"2008","unstructured":"Steinfeld I, Navon R, Ardigo D, Zavaroni I, Yakhini Z: Clinically driven semi-supervised class discovery in gene expression data. Bioinformatics 2008, 24(16):i90-i97. 10.1093\/bioinformatics\/btn279","journal-title":"Bioinformatics"},{"key":"5304_CR12","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1186\/1471-2105-5-126","volume":"5","author":"S Varma","year":"2004","unstructured":"Varma S, Simon R: Iterative class discovery and feature selection using Minimal Spanning Trees. BMC Bioinforma 2004, 5: 126. 10.1186\/1471-2105-5-126","journal-title":"BMC Bioinforma"},{"issue":"suppl 1","key":"5304_CR13","doi-asserted-by":"publisher","first-page":"S107","DOI":"10.1093\/bioinformatics\/17.suppl_1.S107","volume":"17","author":"A von Heydebreck","year":"2001","unstructured":"von Heydebreck A, Huber W, Poustka A, Vingron M: Identifying splits with clear separation: a new class discovery method for gene expression data. Bioinformatics 2001, 17(suppl 1):S107-S114. 10.1093\/bioinformatics\/17.suppl_1.S107","journal-title":"Bioinformatics"},{"issue":"21","key":"5304_CR14","doi-asserted-by":"publisher","first-page":"2888","DOI":"10.1093\/bioinformatics\/btm463","volume":"23","author":"Z Yu","year":"2007","unstructured":"Yu Z, Wong H-S, Wang H: Graph-based consensus clustering for class discovery from gene expression data. Bioinformatics 2007, 23(21):2888\u20132896. 10.1093\/bioinformatics\/btm463","journal-title":"Bioinformatics"},{"issue":"1","key":"5304_CR15","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1073\/pnas.97.1.262","volume":"97","author":"MPS Brown","year":"2000","unstructured":"Brown MPS, Grundy WN, Lin D, Cristianini N, Sugnet CW, Furey TS, Ares M, Haussler D: Knowledge-based analysis of microarray gene expression data by using support vector machines. ProcNat Acad Sci USA 2000, 97(1):262\u2013267. 10.1073\/pnas.97.1.262","journal-title":"ProcNat Acad Sci USA"},{"issue":"10","key":"5304_CR16","doi-asserted-by":"publisher","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","volume":"16","author":"TS Furey","year":"2000","unstructured":"Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 2000, 16(10):906\u2013914. 10.1093\/bioinformatics\/16.10.906","journal-title":"Bioinformatics"},{"issue":"7","key":"5304_CR17","doi-asserted-by":"publisher","first-page":"1055","DOI":"10.1093\/bioinformatics\/bti092","volume":"21","author":"Y Ji","year":"2005","unstructured":"Ji Y, Tsui K-W, Kim K: A novel means of using gene clusters in a two-step empirical Bayes method for predicting classes of samples. Bioinformatics 2005, 21(7):1055\u20131061. 10.1093\/bioinformatics\/bti092","journal-title":"Bioinformatics"},{"issue":"9","key":"5304_CR18","doi-asserted-by":"publisher","first-page":"1132","DOI":"10.1093\/bioinformatics\/btg102","volume":"19","author":"Y Lee","year":"2003","unstructured":"Lee Y, Lee C-K: Classification of multiple cancer types by multicategory support vector machines using gene expression data. Bioinformatics 2003, 19(9):1132\u20131139. 10.1093\/bioinformatics\/btg102","journal-title":"Bioinformatics"},{"issue":"20","key":"5304_CR19","doi-asserted-by":"publisher","first-page":"3896","DOI":"10.1093\/bioinformatics\/bti631","volume":"21","author":"AC Tan","year":"2005","unstructured":"Tan AC, Naiman DQ, Xu L, Winslow RL, Geman D: Simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics 2005, 21(20):3896\u20133904. 10.1093\/bioinformatics\/bti631","journal-title":"Bioinformatics"},{"issue":"16","key":"5304_CR20","doi-asserted-by":"publisher","first-page":"2545","DOI":"10.1093\/bioinformatics\/bth281","volume":"20","author":"R Alexandridis","year":"2004","unstructured":"Alexandridis R, Lin S, Irwin M: Class discovery and classification of tumor samples using mixture modeling of gene expression data}a unified approach. Bioinformatics 2004, 20(16):2545\u20132552. 10.1093\/bioinformatics\/bth281","journal-title":"Bioinformatics"},{"key":"5304_CR21","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1016\/j.patcog.2007.05.018","volume":"41","author":"M Filippone","year":"2007","unstructured":"Filippone M, Camastra F, Masulli F, Rovetta S: Asurvey of kernel and spectral methods for clustering. Pattern Recognit 2007, 41: 176\u2013190.","journal-title":"Pattern Recognit"},{"key":"5304_CR22","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","volume":"17","author":"U von Luxburg","year":"2007","unstructured":"von Luxburg U: A Tutorial on Spectral Clustering. Stat Comput 2007, 17: 395\u2013416. 10.1007\/s11222-007-9033-z","journal-title":"Stat Comput"},{"issue":"18","key":"5304_CR23","doi-asserted-by":"publisher","first-page":"2023","DOI":"10.1093\/bioinformatics\/btn383","volume":"24","author":"T Hwang","year":"2008","unstructured":"Hwang T, Sicotte H, Tian Z, Wu B, Kocher J-P, Wigle DA, Kumar V, Kuang R: Robust and efficient identification of biomarkers by classifying features on graphs. Bioinformatics 2008, 24(18):2023\u20132029. 10.1093\/bioinformatics\/btn383","journal-title":"Bioinformatics"},{"issue":"5814","key":"5304_CR24","doi-asserted-by":"publisher","first-page":"972","DOI":"10.1126\/science.1136800","volume":"315","author":"BJ Frey","year":"2007","unstructured":"Frey BJ, Dueck D: Clustering by Passing Messages Between Data Points. Science 2007, 315(5814):972\u2013976. 10.1126\/science.1136800","journal-title":"Science"},{"issue":"1","key":"5304_CR25","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1007\/s10549-008-9982-8","volume":"114","author":"T Casey","year":"2009","unstructured":"Casey T, Bond J, Tighe S, Hunter T, Lintault L, Patel O, Eneman J, Crocker A, White J, Tessitore J, et al.: Molecular signatures suggest a major role for stromal cells in development of invasive breast cancer. Breast Cancer Res Treat 2009, 114(1):47\u201362. 10.1007\/s10549-008-9982-8","journal-title":"Breast Cancer Res Treat"},{"issue":"11","key":"5304_CR26","doi-asserted-by":"publisher","first-page":"4083","DOI":"10.1073\/pnas.0708598105","volume":"105","author":"C Kim","year":"2008","unstructured":"Kim C, Cheon M, Kang M, Chang I: A simple and exact Laplacian clustering of complex networking phenomena: Application to gene expression profiles. Proc Nat Acad Sci USA 2008, 105(11):4083\u20134087. 10.1073\/pnas.0708598105","journal-title":"Proc Nat Acad Sci USA"},{"key":"5304_CR27","first-page":"281","volume-title":"Some Methods for classification and analysis of multivariate observations. In: 1967","author":"JB Macqueen","year":"1967","unstructured":"Macqueen JB: Some Methods for classification and analysis of multivariate observations. In: 1967. University of California Press, Berkeley; 1967:281\u2013297."},{"issue":"2","key":"5304_CR28","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1109\/TIT.1982.1056489","volume":"28","author":"S Lloyd","year":"1982","unstructured":"Lloyd S: Least squares quantization in PCM. Inf Theory, IEEE Trans on 1982, 28(2):129\u2013137. 10.1109\/TIT.1982.1056489","journal-title":"Inf Theory, IEEE Trans on"},{"issue":"15","key":"5304_CR29","doi-asserted-by":"publisher","first-page":"1994","DOI":"10.1093\/bioinformatics\/btp330","volume":"25","author":"GA Pavlopoulos","year":"2009","unstructured":"Pavlopoulos GA, Moschopoulos CN, Hooper SD, Schneider R, Kossida S: jClust: A clustering and visualization toolbox. Bioinformatics 2009, 25(15):1994\u20131996. 10.1093\/bioinformatics\/btp330","journal-title":"Bioinformatics"},{"key":"5304_CR30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/18756891.2008.9727601","volume":"1","author":"C Yang","year":"2008","unstructured":"Yang C, Zhang X, Jiao L, Wang G: Self-Tuning Semi-Supervised Spectral Clustering. Comput Intell Secur, Int Conf on 2008, 1: 1\u20135.","journal-title":"Comput Intell Secur, Int Conf on"},{"key":"5304_CR31","doi-asserted-by":"publisher","first-page":"192","DOI":"10.1007\/978-3-540-69828-9_19","volume-title":"Data Integration in the Life Sciences","author":"A Mishra","year":"2008","unstructured":"Mishra A, Gillies D: Semi Supervised Spectral Clustering for Regulatory Module Discovery. In Data Integration in the Life Sciences. Edited by: Bairoch A, Cohen-Boulakia S, Froidevaux C. Berlin\/Heidelberg, Springer-Verlag; 2008:192\u2013203. vol. 5109 vol. 5109"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-13-70.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T19:32:05Z","timestamp":1630524725000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-13-70"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,5,1]]},"references-count":31,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["5304"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-13-70","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,5,1]]},"assertion":[{"value":"10 December 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 April 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 May 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"70"}}