{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,22]],"date-time":"2025-07-22T11:18:01Z","timestamp":1753183081416},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"S12","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2009,10]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Mass spectrometry spectra, widely used in proteomics studies as a screening tool for protein profiling and to detect discriminatory signals, are high dimensional data. A large number of local maxima (a.k.a. <jats:italic>peaks<\/jats:italic>) have to be analyzed as part of computational pipelines aimed at the realization of efficient predictive and screening protocols. With this kind of data dimensions and samples size the risk of over-fitting and selection bias is pervasive. Therefore the development of bio-informatics methods based on unsupervised feature extraction can lead to general tools which can be applied to several fields of predictive proteomics.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We propose a method for feature selection and extraction grounded on the theory of multi-scale spaces for high resolution spectra derived from analysis of serum. Then we use support vector machines for classification. In particular we use a database containing 216 samples spectra divided in 115 cancer and 91 control samples. The overall accuracy averaged over a large cross validation study is 98.18. The area under the ROC curve of the best selected model is 0.9962.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>We improved previous known results on the problem on the same data, with the advantage that the proposed method has an unsupervised feature selection phase. All the developed code, as MATLAB scripts, can be downloaded from <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/medeaserver.isa.cnr.it\/dacierno\/spectracode.htm\" ext-link-type=\"uri\">http:\/\/medeaserver.isa.cnr.it\/dacierno\/spectracode.htm<\/jats:ext-link>\n            <\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-10-s12-s9","type":"journal-article","created":{"date-parts":[[2009,10,15]],"date-time":"2009-10-15T18:16:13Z","timestamp":1255630573000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["A scale space approach for unsupervised feature selection in mass spectra classification for ovarian cancer detection"],"prefix":"10.1186","volume":"10","author":[{"given":"Michele","family":"Ceccarelli","sequence":"first","affiliation":[]},{"given":"Antonio","family":"d'Acierno","sequence":"additional","affiliation":[]},{"given":"Angelo","family":"Facchiano","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2009,10,15]]},"reference":[{"key":"3412_CR1","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1038\/nature01511","volume":"422","author":"R Aebersold","year":"2003","unstructured":"Aebersold R, Mann M: Mass spectrometry-based proteomics. Nature 2003, 422: 198\u2013207. 10.1038\/nature01511","journal-title":"Nature"},{"key":"3412_CR2","doi-asserted-by":"publisher","first-page":"572","DOI":"10.1016\/S0140-6736(02)07746-2","volume":"359","author":"EF Petricoin III","year":"2002","unstructured":"Petricoin EF III, Ardekani AM, Hitt BA, Levine PJ, Fusaro VA, Steinberg SM, Mills GB, Simone C, Fishman DA, Kohn EC, Liotta LA: Use of proteomic patterns in serum to identify ovarian cancer. Lancet 2002, 359: 572\u2013577. 10.1016\/S0140-6736(02)07746-2","journal-title":"Lancet"},{"key":"3412_CR3","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1677\/erc.0.0110163","volume":"11","author":"P Conrads","year":"2004","unstructured":"Conrads P, Fusaro VA, Ross S, Johann D, Rajapakse V, Hitt BA, Steinberg SM, Kohn EC, Fishman DA, Whiteley G, Barrett JC, Liotta LA, III EFP, Veenstra TD: High-resolution serum proteomic features for ovarian cancer detection. Endocrine-Related Cancer 2004, 11: 163\u2013178. 10.1677\/erc.0.0110163","journal-title":"Endocrine-Related Cancer"},{"key":"3412_CR4","doi-asserted-by":"publisher","first-page":"1157","DOI":"10.1162\/153244303322753616","volume":"3","author":"I Guyon","year":"2003","unstructured":"Guyon I, Elisseeff A: An Introduction to Variable and Feature Selection. Journal of machine learning research 2003, 3: 1157\u20131182. 10.1162\/153244303322753616","journal-title":"Journal of machine learning research"},{"issue":"2","key":"3412_CR5","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1093\/bib\/bbn008","volume":"9","author":"A Barla","year":"2008","unstructured":"Barla A, Jurman G, Riccadonna S, Merler S, Chierici M, Furlanello C: Machine learning methods for predictive proteomics. Briefings in Bioinformatics 2008, 9(2):119\u201328. 10.1093\/bib\/bbn008","journal-title":"Briefings in Bioinformatics"},{"key":"3412_CR6","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S, Vapnik V: Gene selection for cancer classification using support vector machines. Machine Learning 2002 2002, 46: 389\u2013422. 10.1023\/A:1012487302797","journal-title":"Machine Learning 2002"},{"key":"3412_CR7","doi-asserted-by":"publisher","first-page":"6730","DOI":"10.1073\/pnas.111153698","volume":"98","author":"H Zhang","year":"2001","unstructured":"Zhang H, Yu C, Singer B, M MX: Recursive partitioning for tumor classification with gene expression microarray data. PNAS 2001, 98: 6730\u20136735. 10.1073\/pnas.111153698","journal-title":"PNAS"},{"key":"3412_CR8","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1186\/1471-2105-7-197","volume":"7","author":"X Zhang","year":"2006","unstructured":"Zhang X, Lu X, Shi Q, Xu XQ, Leung HC, Harris L, Iglehart J, Miron A, Liu J, Wong W: Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data. BMC Bioinformatics 2006, 7: 197. 10.1186\/1471-2105-7-197","journal-title":"BMC Bioinformatics"},{"key":"3412_CR9","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1186\/1471-2105-4-54","volume":"4","author":"C Furlanello","year":"2003","unstructured":"Furlanello C, Serafini M, Merler S, Jurman G: Entropy-based gene ranking without selection bias for the predictive classification of microarray data. BMC Bioinformatics 2003, 4: 54\u201373. 10.1186\/1471-2105-4-54","journal-title":"BMC Bioinformatics"},{"key":"3412_CR10","first-page":"2200","volume-title":"Bioinformatics","author":"J Yu","year":"2005","unstructured":"Yu J, Ongarello S, Fiedler R, Chen X, Toffolo G: Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data. Bioinformatics 2005, 2200\u20132209. 10.1093\/bioinformatics\/bti370"},{"issue":"13","key":"3412_CR11","doi-asserted-by":"publisher","first-page":"1636","DOI":"10.1093\/bioinformatics\/btg210","volume":"19","author":"B Wu","year":"2003","unstructured":"Wu B, Abbott T, Fishman D, McMurray W, Mor G, Stone K, Ward D, Williams K, Zhao H: Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data. Bioinformatics 2003, 19(13):1636\u20131643. 10.1093\/bioinformatics\/btg210","journal-title":"Bioinformatics"},{"key":"3412_CR12","volume-title":"Journal of Computational Biology","author":"R Lilien","year":"2003","unstructured":"Lilien R, Farid H, Donald B: Probabilistic Disease Classification of Expression-Dependent Proteomic Data from Mass Spectrometry of Humn Serum. Journal of Computational Biology 2003."},{"issue":"8","key":"3412_CR13","doi-asserted-by":"publisher","first-page":"861","DOI":"10.1016\/j.patrec.2005.10.010","volume":"27","author":"T Fawcett","year":"2006","unstructured":"Fawcett T: An introduction to ROC analysis. Pattern Recogn Lett 2006, 27(8):861\u2013874. 10.1016\/j.patrec.2005.10.010","journal-title":"Pattern Recogn Lett"},{"key":"3412_CR14","doi-asserted-by":"publisher","first-page":"777","DOI":"10.1093\/bioinformatics\/btg484","volume":"20","author":"K Baggerly","year":"2004","unstructured":"Baggerly K, et al.: Reproducibility of SELDI-TOF protein patterns in serum: comparing datases from different experiments. Bioinformatics 2004, 20: 777\u2013785. 10.1093\/bioinformatics\/btg484","journal-title":"Bioinformatics"},{"key":"3412_CR15","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/1471-2105-4-24","volume":"4","author":"J Sorace","year":"2003","unstructured":"Sorace J, Zhan M: A data review and reassessment of ovarian cancer serum proteomics profiling. BMC Bioinformatics 2003, 4: 24\u201332. 10.1186\/1471-2105-4-24","journal-title":"BMC Bioinformatics"},{"key":"3412_CR16","doi-asserted-by":"publisher","first-page":"3034","DOI":"10.1093\/bioinformatics\/bth357","volume":"20","author":"R Tibshirani","year":"2004","unstructured":"Tibshirani R, et al.: Sample classification from protein mass spectrometry, by peack probability contrasts. Bioinformatics 2004, 20: 3034\u20133044. 10.1093\/bioinformatics\/bth357","journal-title":"Bioinformatics"},{"issue":"19","key":"3412_CR17","doi-asserted-by":"publisher","first-page":"2528","DOI":"10.1093\/bioinformatics\/btm385","volume":"23","author":"K Noy","year":"2007","unstructured":"Noy K, Fasulo D: Improved model based, platform independent feature extraction for mass spectrometry. Bioinformatics 2007, 23(19):2528\u20132535. 10.1093\/bioinformatics\/btm385","journal-title":"Bioinformatics"},{"key":"3412_CR18","first-page":"133","volume-title":"International Journal of Computer Vision","author":"A Witkin","year":"1987","unstructured":"Witkin A, Terzopoulos D, Kass M: Signal matching through scale space. International Journal of Computer Vision 1987, 133\u2013144. 10.1007\/BF00123162"},{"key":"3412_CR19","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-6465-9","volume-title":"Scale-Space Theory in Computer Vision","author":"T Lindeberg","year":"1994","unstructured":"Lindeberg T: Scale-Space Theory in Computer Vision. Kluwer Academic Publisher; 1994."},{"issue":"9","key":"3412_CR20","first-page":"200","volume":"16","author":"L Alvarez","year":"1993","unstructured":"Alvarez L, Lions PL, Guichard F, Morel JM: Axioms and Fundamental equations of Image Processing. Archives for Rational Mechanics and Analysis 1993, 16(9):200\u2013257.","journal-title":"Archives for Rational Mechanics and Analysis"},{"key":"3412_CR21","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The Nature Of Statistical Learning Theory","author":"V Vapnik","year":"1995","unstructured":"Vapnik V: The Nature Of Statistical Learning Theory. New York: Springer-Verlag; 1995."},{"key":"3412_CR22","volume-title":"Proceedings of the Fifth Annual workshop on Computational Learning Theory","author":"B Boser","year":"1992","unstructured":"Boser B, Guyon I, Vapnik V: a training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual workshop on Computational Learning Theory 1992."},{"key":"3412_CR23","doi-asserted-by":"publisher","first-page":"2758","DOI":"10.1109\/78.650102","volume":"45\u201311","author":"B Schoelkopf","year":"1997","unstructured":"Schoelkopf B, Sung K, Burges C, Girosi F, Niyogi P, Poggio T, Vapnik V: Comparing Support Vector Machines with Gaussian Kernels to Radial Basis Function Classifiers. IEEE Transactions on Signal Processing 1997, 45\u201311: 2758\u20132765. 10.1109\/78.650102","journal-title":"IEEE Transactions on Signal Processing"},{"key":"3412_CR24","volume-title":"Kernel Methods for Pattern Analysis","author":"N Cristianini","year":"2004","unstructured":"Cristianini N, Taylor JS: Kernel Methods for Pattern Analysis. Cambridge University Press; 2004."},{"issue":"7","key":"3412_CR25","doi-asserted-by":"publisher","first-page":"1667","DOI":"10.1162\/089976603321891855","volume":"15","author":"SS Keerthi","year":"2003","unstructured":"Keerthi SS, Lin CJ: Asymptotic behaviors of support vector machines with gaussian kernel. Neural Computation 2003, 15(7):1667\u20131689. 10.1162\/089976603321891855","journal-title":"Neural Computation"},{"issue":"4","key":"3412_CR26","doi-asserted-by":"publisher","first-page":"955","DOI":"10.1162\/089976698300017575","volume":"10","author":"A Verri","year":"1998","unstructured":"Verri A, Pontil M: Properties of support vector machines. Neural Computation 1998, 10(4):955\u2013974. 10.1162\/089976698300017575","journal-title":"Neural Computation"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-10-S12-S9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:39:04Z","timestamp":1630445944000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-10-S12-S9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,10]]},"references-count":26,"journal-issue":{"issue":"S12","published-print":{"date-parts":[[2009,10]]}},"alternative-id":["3412"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-10-s12-s9","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2009,10]]},"assertion":[{"value":"15 October 2009","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S9"}}