{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,6]],"date-time":"2026-06-06T14:04:02Z","timestamp":1780754642710,"version":"3.54.1"},"reference-count":44,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2025,2,6]],"date-time":"2025-02-06T00:00:00Z","timestamp":1738800000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,2,6]],"date-time":"2025-02-06T00:00:00Z","timestamp":1738800000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"crossref","award":["18\/CRT\/6049"],"award-info":[{"award-number":["18\/CRT\/6049"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"publisher","award":["18\/CRT\/6049"],"award-info":[{"award-number":["18\/CRT\/6049"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001631","name":"University College Dublin","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001631","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2025,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The use of hyperspectral imaging to investigate food samples has grown due to the improved performance and lower cost of instrumentation. Food engineers use hyperspectral images to classify the type and quality of a food sample, typically using classification methods. In order to train these methods, every pixel in each training image needs to be labelled. 
Typically, computationally cheap threshold-based approaches are used to label the pixels, and classification methods are trained based on those labels. However, threshold-based approaches are subjective and cannot be generalized across hyperspectral images taken in different conditions and of different foods. Here, a consensus-constrained parsimonious Gaussian mixture model (ccPGMM) is proposed to label pixels in hyperspectral images using a model-based clustering approach. The ccPGMM utilizes information that is available on some pixels and specifies constraints on those pixels belonging to the same or different clusters while clustering the rest of the pixels in the image. A latent variable model is used to represent the high-dimensional data in terms of a small number of underlying latent factors. To ensure computational feasibility, a consensus clustering approach is employed, where the data are divided into multiple randomly selected subsets of variables and constrained clustering is applied to each data subset; the clustering results are then consolidated across all data subsets to provide a consensus clustering solution. The ccPGMM approach is applied to simulated datasets and real hyperspectral images of three types of puffed cereal: corn, rice, and wheat. 
Improved clustering performance and computational efficiency are demonstrated when compared to other current state-of-the-art approaches.<\/jats:p>","DOI":"10.1007\/s11634-025-00623-y","type":"journal-article","created":{"date-parts":[[2025,2,5]],"date-time":"2025-02-05T23:54:32Z","timestamp":1738799672000},"page":"323-359","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A consensus-constrained parsimonious Gaussian mixture model for clustering hyperspectral images"],"prefix":"10.1007","volume":"19","author":[{"given":"Ganesh","family":"Babu","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Aoife","family":"Gowen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Michael","family":"Fop","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3936-2757","authenticated-orcid":false,"given":"Isobel Claire","family":"Gormley","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,2,6]]},"reference":[{"issue":"4","key":"623_CR1","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1002\/wics.101","volume":"2","author":"H Abdi","year":"2010","unstructured":"Abdi H, Williams LJ (2010) Principal component analysis. Wiley Interdiscip Rev Comput Stat 2(4):433\u2013459","journal-title":"Wiley Interdiscip Rev Comput Stat"},{"issue":"1","key":"623_CR2","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1007\/s00216-010-3828-z","volume":"398","author":"JM Amigo","year":"2010","unstructured":"Amigo JM (2010) Practical issues of hyperspectral imaging analysis of solid dosage forms. 
Anal Bioanal Chem 398(1):93\u2013109","journal-title":"Anal Bioanal Chem"},{"key":"623_CR3","doi-asserted-by":"crossref","unstructured":"Amigo JM (2020) Hyperspectral and multispectral imaging: setting the scene. In: Data handling in science and technology, vol\u00a032. Elsevier, pp 3\u201316","DOI":"10.1016\/B978-0-444-63977-6.00001-8"},{"key":"623_CR4","doi-asserted-by":"crossref","unstructured":"Amigo JM, Santos C (2020) Preprocessing of hyperspectral and multispectral images. In: Data handling in science and technology, vol 32. Elsevier, pp 37\u201353","DOI":"10.1016\/B978-0-444-63977-6.00003-1"},{"key":"623_CR5","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1016\/j.aca.2015.09.030","volume":"896","author":"JM Amigo","year":"2015","unstructured":"Amigo JM, Babamoradi H, Elcoroaristizabal S (2015) Hyperspectral image analysis. A tutorial. Anal Chim Acta 896:34\u201351","journal-title":"Anal Chim Acta"},{"key":"623_CR6","doi-asserted-by":"crossref","unstructured":"Arefin AS, Riveros C, Berretta R, Moscato P (2012) Computing large-scale distance matrices on GPU. In: The 7th international conference on computer science & education (ICCSE 2012). IEEE, pp 576\u2013580","DOI":"10.1109\/ICCSE.2012.6295141"},{"issue":"2","key":"623_CR7","doi-asserted-by":"publisher","first-page":"291","DOI":"10.1093\/biomet\/asr013","volume":"98","author":"A Bhattacharya","year":"2011","unstructured":"Bhattacharya A, Dunson DB (2011) Sparse Bayesian infinite factor models. Biometrika 98(2):291\u2013306","journal-title":"Biometrika"},{"issue":"1\u20132","key":"623_CR8","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1016\/j.aca.2011.06.043","volume":"704","author":"T Bocklitz","year":"2011","unstructured":"Bocklitz T, Walter A, Hartmann K, R\u00f6sch P, Popp J (2011) How to pre-process Raman spectra for reliable and stable models? 
Anal Chim Acta 704(1\u20132):47\u201356","journal-title":"Anal Chim Acta"},{"issue":"3","key":"623_CR9","doi-asserted-by":"publisher","first-page":"489","DOI":"10.1007\/s00180-013-0433-6","volume":"29","author":"C Bouveyron","year":"2014","unstructured":"Bouveyron C, Brunet-Saumard C (2014) Discriminative variable selection for clustering with the sparse Fisher-EM algorithm. Comput Stat 29(3):489\u2013513","journal-title":"Comput Stat"},{"key":"623_CR10","doi-asserted-by":"publisher","DOI":"10.1017\/9781108644181","volume-title":"Model-based clustering and classification for data science: with applications in R,","author":"C Bouveyron","year":"2019","unstructured":"Bouveyron C, Celeux G, Murphy TB, Raftery AE (2019) Model-based clustering and classification for data science: with applications in R, vol 50. Cambridge University Press, Cambridge"},{"issue":"11","key":"623_CR11","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1080\/10408398.2011.651542","volume":"52","author":"Y-Z Feng","year":"2012","unstructured":"Feng Y-Z, Sun D-W (2012) Application of hyperspectral imaging in food safety inspection and control: a review. Crit Rev Food Sci Nutr 52(11):1039\u20131058","journal-title":"Crit Rev Food Sci Nutr"},{"key":"623_CR12","unstructured":"Fern XZ, Brodley CE (2003) Random projection for high dimensional data clustering: a cluster ensemble approach. In: Proceedings of the twentieth international conference on machine learning (ICML-2003), pp 186\u2013193"},{"key":"623_CR13","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1214\/18-SS119","volume":"12","author":"M Fop","year":"2018","unstructured":"Fop M, Murphy TB (2018) Variable selection methods for model-based clustering. 
Stat Surv 12:18\u201365","journal-title":"Stat Surv"},{"issue":"3","key":"623_CR14","doi-asserted-by":"publisher","first-page":"491","DOI":"10.1366\/0003702854248656","volume":"39","author":"P Geladi","year":"1985","unstructured":"Geladi P, MacDougall D, Martens H (1985) Linearization and scatter-correction for near-infrared reflectance spectra of meat. Appl Spectrosc 39(3):491\u2013500","journal-title":"Appl Spectrosc"},{"key":"623_CR15","unstructured":"Ghahramani Z, Hinton GE et al (1996) The EM algorithm for mixtures of factor analyzers. Technical report, Technical Report CRG-TR-96-1, University of Toronto"},{"key":"623_CR16","doi-asserted-by":"publisher","first-page":"a4","DOI":"10.1255\/jsi.2019.a4","volume":"8","author":"AA Gowen","year":"2019","unstructured":"Gowen AA, Xu J-L, Herrero-Langreo A (2019) Comparison of spectral selection methods in the development of classification models from visible near infrared hyperspectral imaging data. J Spectr Imaging 8:a4","journal-title":"J Spectr Imaging"},{"key":"623_CR17","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4419-7970-4","volume-title":"Entropy and information theory","author":"RM Gray","year":"2011","unstructured":"Gray RM (2011) Entropy and information theory. Springer, Berlin"},{"key":"623_CR18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v091.i01","volume":"91","author":"M Hahsler","year":"2019","unstructured":"Hahsler M, Piekenbrock M, Doran D (2019) dbscan: fast density-based clustering with R. J Stat Softw 91:1\u201330","journal-title":"J Stat Softw"},{"key":"623_CR19","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"L Hubert","year":"1985","unstructured":"Hubert L, Arabie P (1985) Comparing partitions. 
J Classif 2:193\u2013218","journal-title":"J Classif"},{"key":"623_CR20","doi-asserted-by":"publisher","first-page":"a3","DOI":"10.1255\/jsi.2016.a3","volume":"5","author":"J Jacques","year":"2016","unstructured":"Jacques J, Ruckebusch C (2016) Model-based co-clustering for hyperspectral images. J Spectr Imaging 5:a3","journal-title":"J Spectr Imaging"},{"key":"623_CR21","doi-asserted-by":"crossref","unstructured":"Khan K, Rehman SU, Aziz K, Fong S, Sarasvady S (2014) dbscan: past, present and future. In: The fifth international conference on the applications of digital information and web technologies (ICADIWT 2014). IEEE, pp 232\u2013238","DOI":"10.1109\/ICADIWT.2014.6814687"},{"key":"623_CR22","unstructured":"McLachlan G, Peel D (2000) Mixtures of factor analyzers. In: Proceedings of the seventeenth international conference on machine learning. Citeseer"},{"key":"623_CR23","volume-title":"Finite mixture models","author":"GJ McLachlan","year":"2004","unstructured":"McLachlan GJ, Peel D (2004) Finite mixture models. Wiley, London"},{"issue":"3\u20134","key":"623_CR24","doi-asserted-by":"publisher","first-page":"379","DOI":"10.1016\/S0167-9473(02)00183-4","volume":"41","author":"GJ McLachlan","year":"2003","unstructured":"McLachlan GJ, Peel D, Bean RW (2003) Modelling high-dimensional data by mixtures of factor analyzers. Comput Stat Data Anal 41(3\u20134):379\u2013388","journal-title":"Comput Stat Data Anal"},{"issue":"3","key":"623_CR25","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1007\/s00357-016-9211-9","volume":"33","author":"PD McNicholas","year":"2016","unstructured":"McNicholas PD (2016) Model-based clustering. J Classif 33(3):331\u2013373","journal-title":"J Classif"},{"issue":"3","key":"623_CR26","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1007\/s11222-008-9056-0","volume":"18","author":"PD McNicholas","year":"2008","unstructured":"McNicholas PD, Murphy TB (2008) Parsimonious Gaussian mixture models. 
Stat Comput 18(3):285\u2013296","journal-title":"Stat Comput"},{"issue":"3","key":"623_CR27","doi-asserted-by":"publisher","first-page":"711","DOI":"10.1016\/j.csda.2009.02.011","volume":"54","author":"PD McNicholas","year":"2010","unstructured":"McNicholas PD, Murphy TB, McDaid AF, Frost D (2010) Serial and parallel implementations of model-based clustering via parsimonious Gaussian mixture models. Comput Stat Data Anal 54(3):711\u2013723","journal-title":"Comput Stat Data Anal"},{"issue":"2","key":"623_CR28","first-page":"7","volume":"1","author":"PD McNicholas","year":"2023","unstructured":"McNicholas PD, ElSherbiny A, McDaid AF, Murphy TB (2023) pgmm: parsimonious Gaussian mixture models. R package version 1(2):7","journal-title":"R package version"},{"issue":"28","key":"623_CR29","doi-asserted-by":"publisher","first-page":"4548","DOI":"10.1002\/sim.7371","volume":"36","author":"D McParland","year":"2017","unstructured":"McParland D, Phillips CM, Brennan L, Roche HM, Gormley IC (2017) Clustering high-dimensional mixed data to uncover sub-phenotypes: joint analysis of phenotypic and genotypic data. Stat Med 36(28):4548\u20134569","journal-title":"Stat Med"},{"issue":"3","key":"623_CR30","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1007\/s11634-015-0200-3","volume":"10","author":"V Melnykov","year":"2016","unstructured":"Melnykov V, Melnykov I, Michael S (2016) Semi-supervised model-based clustering with positive and negative constraints. Adv Data Anal Classif 10(3):327\u2013349","journal-title":"Adv Data Anal Classif"},{"issue":"2","key":"623_CR31","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1002\/wics.199","volume":"4","author":"AA Neath","year":"2012","unstructured":"Neath AA, Cavanaugh JE (2012) The Bayesian information criterion: background, derivation, and applications. 
Wiley Interdiscip Rev Comput Stat 4(2):199\u2013203","journal-title":"Wiley Interdiscip Rev Comput Stat"},{"issue":"9","key":"623_CR32","doi-asserted-by":"publisher","first-page":"1277","DOI":"10.1016\/0031-3203(93)90135-J","volume":"26","author":"NR Pal","year":"1993","unstructured":"Pal NR, Pal SK (1993) A review on image segmentation techniques. Pattern Recogn 26(9):1277\u20131294","journal-title":"Pattern Recogn"},{"issue":"2","key":"623_CR33","doi-asserted-by":"publisher","first-page":"1883","DOI":"10.4249\/scholarpedia.1883","volume":"4","author":"LE Peterson","year":"2009","unstructured":"Peterson LE (2009) K-nearest neighbor. Scholarpedia 4(2):1883","journal-title":"Scholarpedia"},{"issue":"7\u20138","key":"623_CR34","doi-asserted-by":"publisher","first-page":"780","DOI":"10.1080\/08839510802170546","volume":"22","author":"K Punera","year":"2008","unstructured":"Punera K, Ghosh J (2008) Consensus-based ensembles of soft clusterings. Appl Artif Intell 22(7\u20138):780\u2013810","journal-title":"Appl Artif Intell"},{"key":"623_CR35","volume-title":"R: a language and environment for statistical computing","author":"R Core Team","year":"2023","unstructured":"R Core Team (2023) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna"},{"issue":"473","key":"623_CR36","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1198\/016214506000000113","volume":"101","author":"AE Raftery","year":"2006","unstructured":"Raftery AE, Dean N (2006) Variable selection for model-based clustering. J Am Stat Assoc 101(473):168\u2013178","journal-title":"J Am Stat Assoc"},{"issue":"10","key":"623_CR37","doi-asserted-by":"publisher","first-page":"1201","DOI":"10.1016\/j.trac.2009.07.007","volume":"28","author":"\u00c5 Rinnan","year":"2009","unstructured":"Rinnan \u00c5, Van Den Berg F, Engelsen SB (2009) Review of the most common pre-processing techniques for near-infrared spectra. 
TrAC Trends Anal Chem 28(10):1201\u20131222","journal-title":"TrAC Trends Anal Chem"},{"key":"623_CR38","unstructured":"Russell N, Murphy TB, Raftery AE (2015) Bayesian model averaging in model-based clustering and density estimation. arXiv preprint arXiv:1506.09035"},{"issue":"2","key":"623_CR39","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461\u2013464","journal-title":"Ann Stat"},{"issue":"1","key":"623_CR40","doi-asserted-by":"publisher","first-page":"289","DOI":"10.32614\/RJ-2016-021","volume":"8","author":"L Scrucca","year":"2016","unstructured":"Scrucca L, Fop M, Murphy TB, Raftery AE (2016) mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. The R Journal 8(1):289\u2013317","journal-title":"The R Journal"},{"key":"623_CR41","doi-asserted-by":"publisher","first-page":"190","DOI":"10.2307\/1217562","volume":"12","author":"RR Sokal","year":"1963","unstructured":"Sokal RR (1963) The principles and practice of numerical taxonomy. Taxon 12:190\u2013199","journal-title":"Taxon"},{"issue":"Dec","key":"623_CR42","first-page":"583","volume":"3","author":"A Strehl","year":"2002","unstructured":"Strehl A, Ghosh J (2002) Cluster ensembles: a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3(Dec):583\u2013617","journal-title":"J Mach Learn Res"},{"key":"623_CR43","doi-asserted-by":"publisher","first-page":"a19","DOI":"10.1255\/jsi.2020.a19","volume":"9","author":"J-L Xu","year":"2020","unstructured":"Xu J-L, Riccioli C, Herrero-Langreo A, Gowen AA (2020) Deep learning classifiers for near infrared spectral imaging: a tutorial. 
J Spectr Imaging 9:a19","journal-title":"J Spectr Imaging"},{"issue":"4","key":"623_CR44","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1109\/MGRS.2020.3032575","volume":"9","author":"H Zhai","year":"2021","unstructured":"Zhai H, Zhang H, Li P, Zhang L (2021) Hyperspectral image clustering: current achievements and future lines. IEEE Geosci Remote Sens Mag 9(4):35\u201367","journal-title":"IEEE Geosci Remote Sens Mag"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00623-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-025-00623-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00623-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T04:39:53Z","timestamp":1757133593000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-025-00623-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,6]]},"references-count":44,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["623"],"URL":"https:\/\/doi.org\/10.1007\/s11634-025-00623-y","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"value":"1862-5347","type":"print"},{"value":"1862-5355","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,6]]},"assertion":[{"value":"1 March 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 September 
2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 December 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 February 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}