{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,28]],"date-time":"2025-11-28T14:50:13Z","timestamp":1764341413549,"version":"3.46.0"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2025,3,10]],"date-time":"2025-03-10T00:00:00Z","timestamp":1741564800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,3,10]],"date-time":"2025-03-10T00:00:00Z","timestamp":1741564800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Open access funding provided by FCT|FCCN (b-on)."},{"DOI":"10.13039\/501100001871","name":"Funda\u00e7\u00e3o para a Ci\u00eancia e a Tecnologia","doi-asserted-by":"publisher","award":["LA\/P\/0063\/2020","UID\/GES\/00731\/2019"],"award-info":[{"award-number":["LA\/P\/0063\/2020","UID\/GES\/00731\/2019"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>We present parametric probabilistic models for numerical distributional variables. The proposed models are based on the representation of each distribution by a location measure and inter-quantile ranges, for given quantiles, thereby characterizing the underlying empirical distributions in a flexible way. Multivariate Normal distributions are assumed for the whole set of indicators, considering alternative structures of the variance\u2013covariance matrix. For all cases, maximum likelihood estimators of the corresponding parameters are derived. This modelling allows for hypothesis testing and multivariate parametric analysis. The proposed framework is applied to Analysis of Variance and parametric Discriminant Analysis of distributional data. A simulation study examines the performance of the proposed models in classification problems under different data conditions. Applications to Internet traffic data and Portuguese official data illustrate the relevance of the proposed approach.<\/jats:p>","DOI":"10.1007\/s11634-025-00624-x","type":"journal-article","created":{"date-parts":[[2025,3,10]],"date-time":"2025-03-10T03:42:12Z","timestamp":1741578132000},"page":"1119-1146","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Parametric models for distributional data"],"prefix":"10.1007","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2593-8818","authenticated-orcid":false,"given":"Paula","family":"Brito","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1378-2403","authenticated-orcid":false,"given":"A. Pedro","family":"Duarte Silva","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,3,10]]},"reference":[{"key":"624_CR1","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1016\/B978-0-12-714250-0.50016-4","volume-title":"Classification and clustering","author":"SW Ahmed","year":"1977","unstructured":"Ahmed SW, Lachenbruch PA (1977) Discriminant analysis when scale contamination is present in the initial sample. In: Van Ryzin J (ed) Classification and clustering. Academic Press, University of Wisconsin-Madison, Madison, pp 331\u2013353"},{"issue":"1","key":"624_CR2","doi-asserted-by":"publisher","first-page":"192","DOI":"10.1016\/j.ijforecast.2008.07.003","volume":"25","author":"J Arroyo","year":"2009","unstructured":"Arroyo J, Mat\u00e9 C (2009) Forecasting histogram time series with k-nearest neighbours methods. Int J Forecast 25(1):192\u2013207","journal-title":"Int J Forecast"},{"issue":"2","key":"624_CR3","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1002\/sam.10114","volume":"4","author":"J Arroyo","year":"2011","unstructured":"Arroyo J, Gonz\u00e1lez-Rivera G, Mat\u00e9 C, San Roque AM (2011) Smoothing methods for histogram-valued time series: an application to value-at-risk. Stat Anal Data Mining ASA Data Sci J 4(2):216\u2013228","journal-title":"Stat Anal Data Mining ASA Data Sci J"},{"issue":"375","key":"624_CR4","doi-asserted-by":"publisher","first-page":"676","DOI":"10.1080\/01621459.1981.10477703","volume":"76","author":"T Ashikaga","year":"1981","unstructured":"Ashikaga T, Chang P (1981) Robustness of Fisher\u2019s linear discriminant function under two-component mixed normal models. J Am Stat Assoc 76(375):676\u2013680","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"624_CR5","doi-asserted-by":"publisher","first-page":"465","DOI":"10.1080\/03610928508828925","volume":"14","author":"N Balakrishnan","year":"1985","unstructured":"Balakrishnan N, Kocherlakota S (1985) Robustness to nonnormality of the linear discriminant function: mixtures of normal distributions. Commun Stat Theory Methods 14(2):465\u2013478","journal-title":"Commun Stat Theory Methods"},{"issue":"22","key":"624_CR6","doi-asserted-by":"publisher","first-page":"2315","DOI":"10.1080\/03610928108828190","volume":"10","author":"C Bayne","year":"1981","unstructured":"Bayne C, Tan W (1981) QDF misclassification probabilities for known population parameters. Commun Stat Theory Methods 10(22):2315\u20132326","journal-title":"Commun Stat Theory Methods"},{"issue":"462","key":"624_CR7","doi-asserted-by":"publisher","first-page":"470","DOI":"10.1198\/016214503000242","volume":"98","author":"L Billard","year":"2003","unstructured":"Billard L, Diday E (2003) From the statistics of data to the statistics of knowledge: symbolic data analysis. J Am Stat Assoc 98(462):470\u2013487","journal-title":"J Am Stat Assoc"},{"key":"624_CR8","doi-asserted-by":"publisher","DOI":"10.1002\/9780470090183","volume-title":"Symbolic Data Analysis: conceptual statistics and data mining","author":"L Billard","year":"2006","unstructured":"Billard L, Diday E (2006) Symbolic Data Analysis: conceptual statistics and data mining. Wiley, Chichester"},{"issue":"5","key":"624_CR9","doi-asserted-by":"publisher","first-page":"1405","DOI":"10.1002\/wics.1405","volume":"9","author":"L Billard","year":"2017","unstructured":"Billard L, Kim J (2017) Hierarchical clustering for histogram data. Wiley Interdiscip Rev Comput Stat 9(5):1405","journal-title":"Wiley Interdiscip Rev Comput Stat"},{"key":"624_CR10","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-57155-8","volume-title":"Analysis of symbolic data: exploratory methods for extracting statistical information from complex data","author":"H-H Bock","year":"2000","unstructured":"Bock H-H, Diday E (2000) Analysis of symbolic data: exploratory methods for extracting statistical information from complex data. Springer, Berlin"},{"key":"624_CR11","doi-asserted-by":"crossref","unstructured":"Brito PM, Chavent M (2012) Divisive monothetic clustering for interval and histogram-valued data. In: ICPRAM 2012-1st international conference on pattern recognition applications and methods, pp 229\u2013234","DOI":"10.5220\/0003793502290234"},{"issue":"4","key":"624_CR12","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1002\/widm.1133","volume":"4","author":"P Brito","year":"2014","unstructured":"Brito P (2014) Symbolic Data Analysis: another look at the interaction of data mining and statistics. Wiley Interdiscip Rev Data Mining Knowl Discov 4(4):281\u2013295","journal-title":"Wiley Interdiscip Rev Data Mining Knowl Discov"},{"key":"624_CR13","doi-asserted-by":"publisher","DOI":"10.1201\/9781315370545","volume-title":"Analysis of distributional data","author":"P Brito","year":"2022","unstructured":"Brito P, Dias S (2022) Analysis of distributional data. CRC Press, Boca Raton"},{"issue":"1","key":"624_CR14","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1080\/02664763.2011.575125","volume":"39","author":"P Brito","year":"2012","unstructured":"Brito P, Duarte Silva AP (2012) Modelling interval data with Normal and Skew-Normal distributions. J Appl Stat 39(1):3\u201320","journal-title":"J Appl Stat"},{"issue":"1","key":"624_CR15","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1016\/0378-3758(79)90042-9","volume":"3","author":"EF Chinganda","year":"1979","unstructured":"Chinganda EF, Subrahmaniam K (1979) Robustness of the linear discriminant function to nonnormality: Johnson\u2019s system. J Stat Plan Inference 3(1):69\u201377","journal-title":"J Stat Plan Inference"},{"issue":"13","key":"624_CR16","doi-asserted-by":"publisher","first-page":"1285","DOI":"10.1080\/03610927908827830","volume":"8","author":"WR Clarke","year":"1979","unstructured":"Clarke WR, Lachenbruch PA, Broffitt B (1979) How non-normality affects the quadratic discriminant function. Commun Stat Theory Methods 8(13):1285\u20131301","journal-title":"Commun Stat Theory Methods"},{"key":"624_CR17","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.ins.2020.11.018","volume":"549","author":"FdA De Carvalho","year":"2021","unstructured":"De Carvalho FdA, Balzanella A, Irpino A, Verde R (2021) Co-clustering algorithms for distributional data with automated variable weighting. Inf Sci 549:87\u2013115","journal-title":"Inf Sci"},{"issue":"1","key":"624_CR18","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1016\/j.csda.2010.05.008","volume":"55","author":"P Delicado","year":"2011","unstructured":"Delicado P (2011) Dimensionality reduction when data are density functions. Comput Stat Data Anal 55(1):401\u2013420","journal-title":"Comput Stat Data Anal"},{"issue":"2","key":"624_CR19","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1002\/sam.11260","volume":"8","author":"S Dias","year":"2015","unstructured":"Dias S, Brito P (2015) Linear regression model with histogram-valued variables. Stat Anal Data Min ASA Data Sci J 8(2):75\u2013113","journal-title":"Stat Anal Data Min ASA Data Sci J"},{"issue":"1","key":"624_CR20","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1016\/j.ejor.2021.01.025","volume":"294","author":"S Dias","year":"2021","unstructured":"Dias S, Brito P, Amaral P (2021) Discriminant analysis of distributional data via fractional programming. Eur J Oper Res 294(1):206\u2013218","journal-title":"Eur J Oper Res"},{"key":"624_CR21","doi-asserted-by":"publisher","first-page":"516","DOI":"10.1007\/s00357-015-9189-8","volume":"32","author":"AP Duarte Silva","year":"2015","unstructured":"Duarte Silva AP, Brito P (2015) Discriminant analysis of interval data: an assessment of parametric and distance-based approaches. J Classif 32:516\u2013541","journal-title":"J Classif"},{"issue":"3","key":"624_CR22","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1081\/SAC-120003849","volume":"31","author":"AP Duarte Silva","year":"2002","unstructured":"Duarte Silva AP, Stam A, Neter J (2002) The effects of misclassification costs and skewed distributions in two-group classification. Commun Stat Simul Comput 31(3):401\u2013423","journal-title":"Commun Stat Simul Comput"},{"issue":"2","key":"624_CR23","doi-asserted-by":"crossref","first-page":"336","DOI":"10.32614\/RJ-2021-090","volume":"13","author":"AP Duarte Silva","year":"2021","unstructured":"Duarte Silva AP, Brito P, Filzmoser P, Dias JG (2021) MAINT. Data: modelling and analysing interval data in R. R J 13(2):336\u2013364","journal-title":"R J"},{"issue":"1","key":"624_CR24","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1016\/j.ijforecast.2011.02.007","volume":"28","author":"G Gonzalez-Rivera","year":"2012","unstructured":"Gonzalez-Rivera G, Arroyo J (2012) Time series modeling of histogram-valued data: the daily histogram time series of S &P500 intradaily returns. Int J Forecast 28(1):20\u201333","journal-title":"Int J Forecast"},{"key":"624_CR25","doi-asserted-by":"publisher","DOI":"10.1016\/j.gexplo.2024.107416","volume":"259","author":"TM Grygar","year":"2024","unstructured":"Grygar TM, Radoji\u010di\u0107 U, Pavl\u016d I, Greven S, Ne\u0161lehov\u00e1 JG, T\u016dmov\u00e1 \u0160, Hron K (2024) Exploratory functional data analysis of multivariate densities for the identification of agricultural soil contamination by risk elements. J Geochem Explor 259:107416","journal-title":"J Geochem Explor"},{"issue":"2","key":"624_CR26","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1002\/sam.10111","volume":"4","author":"M Ichino","year":"2011","unstructured":"Ichino M (2011) The quantile method for symbolic principal component analysis. Stat Anal Data Mining ASA Data Sci J 4(2):184\u2013198","journal-title":"Stat Anal Data Mining ASA Data Sci J"},{"issue":"4","key":"624_CR27","doi-asserted-by":"publisher","first-page":"1271","DOI":"10.3390\/stats5040077","volume":"5","author":"M Ichino","year":"2022","unstructured":"Ichino M (2022) The lookup table regression model for histogram-valued symbolic data. Stats 5(4):1271\u20131293","journal-title":"Stats"},{"issue":"2","key":"624_CR28","doi-asserted-by":"publisher","first-page":"359","DOI":"10.3390\/stats4020024","volume":"4","author":"M Ichino","year":"2021","unstructured":"Ichino M, Umbleja K, Yaguchi H (2021) Unsupervised feature selection for histogram-valued symbolic data using hierarchical conceptual clustering. Stats 4(2):359\u2013384","journal-title":"Stats"},{"key":"624_CR29","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1007\/s11634-015-0197-7","volume":"9","author":"A Irpino","year":"2015","unstructured":"Irpino A, Verde R (2015) Linear regression for numeric symbolic variables: a least squares approach based on Wasserstein distance. Adv Data Anal Classif 9:81\u2013106","journal-title":"Adv Data Anal Classif"},{"issue":"7","key":"624_CR30","doi-asserted-by":"publisher","first-page":"3351","DOI":"10.1016\/j.eswa.2013.12.001","volume":"41","author":"A Irpino","year":"2014","unstructured":"Irpino A, Verde R, De Carvalho FdA (2014) Dynamic clustering of histogram data based on adaptive squared Wasserstein distances. Expert Syst Appl 41(7):3351\u20133366","journal-title":"Expert Syst Appl"},{"key":"624_CR31","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1016\/j.ins.2017.04.040","volume":"406","author":"A Irpino","year":"2017","unstructured":"Irpino A, Verde R, Carvalho FdA (2017) Fuzzy clustering of distributional data with automatic weighting of variable components. Inf Sci 406:248\u2013268","journal-title":"Inf Sci"},{"key":"624_CR32","doi-asserted-by":"crossref","unstructured":"Irpino A, Verde R (2006) A new Wasserstein based distance for the hierarchical clustering of histogram symbolic data. In: Data science and classification. Springer, Berlin, Heidelberg, pp 185\u2013192","DOI":"10.1007\/3-540-34416-0_20"},{"key":"624_CR33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s11222-021-10060-4","volume":"32","author":"H Jin","year":"2022","unstructured":"Jin H, Billard L (2022) Copulas and histogram-valued data. J Comput Graph Stat 32:1\u201328","journal-title":"J Comput Graph Stat"},{"issue":"7","key":"624_CR34","doi-asserted-by":"publisher","first-page":"2250","DOI":"10.1016\/j.csda.2011.01.011","volume":"55","author":"J Kim","year":"2011","unstructured":"Kim J, Billard L (2011) A polythetic clustering process and cluster validity indexes for histogram-valued objects. Comput Stat Data Anal 55(7):2250\u20132262","journal-title":"Comput Stat Data Anal"},{"key":"624_CR35","doi-asserted-by":"publisher","DOI":"10.1002\/0471725293","volume-title":"Discriminant analysis and statistical pattern recognition","author":"G McLachlan","year":"1992","unstructured":"McLachlan G (1992) Discriminant analysis and statistical pattern recognition. Wiley, Chichester"},{"issue":"5","key":"624_CR36","doi-asserted-by":"publisher","first-page":"1181","DOI":"10.1080\/03610928508828970","volume":"14","author":"H Nakanishi","year":"1985","unstructured":"Nakanishi H, Sato Y (1985) The performance of the linear and quadratic discriminant functions for three types of non-normal distribution. Commun Stat Theory Methods 14(5):1181\u20131200","journal-title":"Commun Stat Theory Methods"},{"issue":"1","key":"624_CR37","doi-asserted-by":"publisher","first-page":"405","DOI":"10.1146\/annurev-statistics-030718-104938","volume":"6","author":"VM Panaretos","year":"2019","unstructured":"Panaretos VM, Zemel Y (2019) Statistical aspects of Wasserstein distances. Annu Rev Stat Appl 6(1):405\u2013431","journal-title":"Annu Rev Stat Appl"},{"key":"624_CR38","doi-asserted-by":"crossref","unstructured":"Petersen A, M\u00fcller H-G (2016) Functional data analysis for density functions by transformation to a Hilbert space","DOI":"10.32614\/CRAN.package.fdadensity"},{"issue":"1","key":"624_CR39","doi-asserted-by":"publisher","first-page":"85","DOI":"10.3758\/BRM.41.1.85","volume":"41","author":"JR Rausch","year":"2009","unstructured":"Rausch JR, Kelley K (2009) A comparison of linear and mixture models for discriminant analysis under nonnormality. Behav Res Methods 41(1):85\u201398","journal-title":"Behav Res Methods"},{"issue":"2","key":"624_CR40","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461\u2013464","journal-title":"Ann Stat"},{"key":"624_CR41","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316641","volume-title":"Multivariate observations","author":"GAF Seber","year":"1984","unstructured":"Seber GAF (1984) Multivariate observations. Wiley, Chichester"},{"key":"624_CR42","unstructured":"Statistics Portugal (2009) Recenceamento Agr\u00edcola 2009. http:\/\/ra09.ine.pt\/xportal\/xmain?xpid=RA2009&xpgid=ine_ra_sabermais. Accessed: 6 Dec 2024"},{"key":"624_CR43","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1007\/978-3-030-17065-3_37","volume-title":"Proceedings of the tenth international conference on soft computing and pattern recognition (SoCPaR 2018)","author":"A Subtil","year":"2020","unstructured":"Subtil A, Oliveira MR, Valadas R, Pacheco A, Salvador P (2020) Detecting internet-scale traffic redirection attacks using latent class models. In: Madureira AM, Abraham A, Gandhi N, Silva C, Antunes M (eds) Proceedings of the tenth international conference on soft computing and pattern recognition (SoCPaR 2018). Springer, Berlin, pp 370\u2013380"},{"key":"624_CR44","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1007\/s11634-020-00411-w","volume":"15","author":"K Umbleja","year":"2021","unstructured":"Umbleja K, Ichino M, Yaguchi H (2021) Hierarchical conceptual clustering based on quantile method for identifying microscopic details in distributional data. Adv Data Anal Classif 15:407\u2013436","journal-title":"Adv Data Anal Classif"},{"issue":"2","key":"624_CR45","doi-asserted-by":"publisher","first-page":"344","DOI":"10.1109\/TCYB.2015.2389653","volume":"46","author":"R Verde","year":"2015","unstructured":"Verde R, Irpino A, Balzanella A (2015) Dimension reduction techniques for distributional symbolic data. IEEE Trans Cybern 46(2):344\u2013355","journal-title":"IEEE Trans Cybern"},{"key":"624_CR46","doi-asserted-by":"crossref","unstructured":"Verde R, Irpino A (2007) Dynamic clustering of histogram data: using the right metric. In: Selected contributions in data analysis and classification. Springer, Berlin, pp 123\u2013134","DOI":"10.1007\/978-3-540-73560-1_12"},{"key":"624_CR47","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.ins.2022.07.064","volume":"609","author":"Q Zhao","year":"2022","unstructured":"Zhao Q, Wang H, Lu S (2022) M-LDQ feature embedding and regression modeling for distribution-valued data. Inf Sci 609:121\u2013152","journal-title":"Inf Sci"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00624-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-025-00624-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-025-00624-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,28]],"date-time":"2025-11-28T14:46:42Z","timestamp":1764341202000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-025-00624-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,10]]},"references-count":47,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["624"],"URL":"https:\/\/doi.org\/10.1007\/s11634-025-00624-x","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"type":"print","value":"1862-5347"},{"type":"electronic","value":"1862-5355"}],"subject":[],"published":{"date-parts":[[2025,3,10]]},"assertion":[{"value":"6 March 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 December 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 January 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 March 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}