{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,4]],"date-time":"2025-12-04T07:37:41Z","timestamp":1764833861490,"version":"3.46.0"},"reference-count":41,"publisher":"World Scientific Pub Co Pte Ltd","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Neur. Syst."],"published-print":{"date-parts":[[2025,12,15]]},"abstract":"<jats:p>This paper introduces a comprehensive framework for clustering analysis, centered on a novel incremental silhouette score calculation designed specifically for hierarchical clustering. This innovative method significantly reduces the computational complexity of silhouette evaluation, transforming the process from O(K N) to effectively O(N) for K hierarchical configurations (demonstrated by an over 100-fold speedup in our tests) making it feasible for large-scale datasets and enabling efficient cluster number estimation within hierarchical clustering scenarios. Building on this, we revisit and enhance the Principal Direction Divisive Partitioning (IPDDP) algorithm, proposing principal component analysis-maximum margin divisive clustering (PCA-MMDC), which utilizes multiple principal components for more accurate data partitioning, and PCA-MMDC-sc, which incorporates a scatter-based cluster selection for improved balance. These are integrated into a hybrid clustering strategy that combines the strengths of incremental silhouette calculation and the enhanced algorithms, allowing for robust cluster identification and effective management of noise and outliers. Experimental results on synthetic and real-world datasets demonstrate notable improvements in clustering accuracy (achieving an average Adjusted Rand Index (ARI) increase of over 10 percentage points on custom noisy synthetic datasets compared to K-Means) and computational efficiency. While the choice of principal components in PCA-MMDC presents a parameter, the overall framework offers a scalable and robust solution for complex clustering tasks, with future work aimed at adaptive parameter selection and extending incremental calculations to other validation metrics.<\/jats:p>","DOI":"10.1142\/s0129065725500765","type":"journal-article","created":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:54:36Z","timestamp":1760162076000},"source":"Crossref","is-referenced-by-count":0,"title":["Efficient Hybrid Hierarchical Clustering with Incremental Silhouette Score for Large, Noisy Datasets"],"prefix":"10.1142","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6507-8672","authenticated-orcid":false,"given":"Petros","family":"Barmpas","sequence":"first","affiliation":[{"name":"Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou, Lamia 35131, Greece"}]},{"given":"Panagiotis","family":"Anagnostou","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou, Lamia 35131, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9536-4090","authenticated-orcid":false,"given":"Sotiris","family":"Tasoulis","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou, Lamia 35131, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4266-701X","authenticated-orcid":false,"given":"Vassilis","family":"Plagianakos","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou, Lamia 35131, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3374-0422","authenticated-orcid":false,"given":"Spiros","family":"Georgakopoulos","sequence":"additional","affiliation":[{"name":"Department of Mathematics, University of Thessaly, Papasiopoulou, Lamia 35131, Greece"}]}],"member":"219","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"key":"S0129065725500765BIB001","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065724500503"},{"key":"S0129065725500765BIB002","doi-asserted-by":"publisher","DOI":"10.1016\/0377-0427(87)90125-7"},{"key":"S0129065725500765BIB003","doi-asserted-by":"publisher","DOI":"10.1080\/0094965031000136012"},{"key":"S0129065725500765BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2022.11.001"},{"key":"S0129065725500765BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.05.025"},{"key":"S0129065725500765BIB006","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009740529316"},{"key":"S0129065725500765BIB007","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-011-9210-5"},{"key":"S0129065725500765BIB008","doi-asserted-by":"publisher","DOI":"10.1142\/S012906572450014X"},{"key":"S0129065725500765BIB009","doi-asserted-by":"publisher","DOI":"10.1002\/widm.53"},{"volume-title":"HCER: Hierarchical Clustering-Ensemble Regressor","year":"2024","author":"Barmpas P.","key":"S0129065725500765BIB010"},{"key":"S0129065725500765BIB011","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107508"},{"key":"S0129065725500765BIB012","doi-asserted-by":"publisher","DOI":"10.1007\/s00357-014-9161-z"},{"key":"S0129065725500765BIB013","doi-asserted-by":"publisher","DOI":"10.3390\/sym15091679"},{"key":"S0129065725500765BIB014","doi-asserted-by":"publisher","DOI":"10.1080\/03610927408827101"},{"key":"S0129065725500765BIB015","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-021-98126-1"},{"key":"S0129065725500765BIB016","doi-asserted-by":"publisher","DOI":"10.1147\/rd.81.0022"},{"key":"S0129065725500765BIB017","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065724500692"},{"issue":"4","key":"S0129065725500765BIB018","first-page":"6485","volume":"45","author":"Javeed M.","year":"2023","journal-title":"J. Intell. Fuzzy Syst."},{"key":"S0129065725500765BIB019","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-019-04359-7"},{"issue":"127","key":"S0129065725500765BIB020","first-page":"1","volume":"24","author":"Tsitsulin A.","year":"2023","journal-title":"J. Mach. Learn. Res."},{"key":"S0129065725500765BIB021","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065725500133"},{"key":"S0129065725500765BIB022","doi-asserted-by":"publisher","DOI":"10.1007\/s10489-018-1238-7"},{"key":"S0129065725500765BIB023","unstructured":"K. Nakai, Ecoli UCI Machine Learning Repository (1996), doi: https:\/\/doi.org\/10.24432\/C5388M."},{"key":"S0129065725500765BIB024","unstructured":"A. Srinivasan, Statlog (Landsat Satellite) UCI Machine Learning Repository (1993), doi: https:\/\/doi.org\/10.24432\/C55887."},{"key":"S0129065725500765BIB025","unstructured":"L. Breiman and C. Stone, Waveform Database Generator (Version 1) UCI Machine Learning Repository (1984), doi: https:\/\/doi.org\/10.24432\/C5CS3C."},{"key":"S0129065725500765BIB026","doi-asserted-by":"publisher","DOI":"10.1109\/UEMCON53757.2021.9666609"},{"volume-title":"A Survey of Image Clustering: Taxonomy and Recent Methods","year":"2021","author":"Liu J.","key":"S0129065725500765BIB027"},{"key":"S0129065725500765BIB028","doi-asserted-by":"publisher","DOI":"10.1007\/s00357-024-09473-3"},{"key":"S0129065725500765BIB029","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0310991"},{"key":"S0129065725500765BIB030","doi-asserted-by":"publisher","DOI":"10.1007\/s11831-023-09939-4"},{"key":"S0129065725500765BIB031","doi-asserted-by":"publisher","DOI":"10.1037\/1082-989X.9.3.386"},{"key":"S0129065725500765BIB032","doi-asserted-by":"publisher","DOI":"10.1111\/coin.12100"},{"key":"S0129065725500765BIB033","doi-asserted-by":"publisher","DOI":"10.1016\/j.physa.2021.126433"},{"key":"S0129065725500765BIB034","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2022.11.139"},{"key":"S0129065725500765BIB035","doi-asserted-by":"publisher","DOI":"10.21105\/joss.00205"},{"key":"S0129065725500765BIB036","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-55187-2_34"},{"key":"S0129065725500765BIB037","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065724500631"},{"key":"S0129065725500765BIB038","unstructured":"J. Xie, R. Girshick and A. Farhadi, Unsupervised deep embedding for clustering analysis, arXiv:1511.06335."},{"key":"S0129065725500765BIB039","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065725500078"},{"key":"S0129065725500765BIB040","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2017.2682102"},{"key":"S0129065725500765BIB041","doi-asserted-by":"publisher","DOI":"10.1007\/s10916-024-02122-7"}],"container-title":["International Journal of Neural Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0129065725500765","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,4]],"date-time":"2025-12-04T07:06:04Z","timestamp":1764831964000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0129065725500765"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":41,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2025,12,15]]}},"alternative-id":["10.1142\/S0129065725500765"],"URL":"https:\/\/doi.org\/10.1142\/s0129065725500765","relation":{},"ISSN":["0129-0657","1793-6462"],"issn-type":[{"type":"print","value":"0129-0657"},{"type":"electronic","value":"1793-6462"}],"subject":[],"published":{"date-parts":[[2025,11,17]]},"article-number":"2550076"}}