{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T02:09:54Z","timestamp":1773972594648,"version":"3.50.1"},"posted":{"date-parts":[[2026,3,17]]},"group-title":"Computer Science and Mathematics","reference-count":0,"publisher":"MDPI AG","license":[{"start":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T00:00:00Z","timestamp":1773705600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"accepted":{"date-parts":[[2026,3,16]]},"abstract":"<jats:p>The marble industry relies on proprietary commercial names rather than objective visual categories, creating market inefficiencies for stakeholders who select stones based on appearance. Supervised classification methods perpetuate this problem by replicating inconsistent commercial labels instead of discovering intrinsic visual structure. We propose an unsupervised pipeline combining a two-stage training strategy, pure self-supervised pretraining followed by cluster-aware fine-tuning of a DINO Vision Transformer, with UMAP dimensionality reduction and Ward's agglomerative hierarchical clustering. Systematic ablation studies on 1,540 marble images spanning 10 commercial varieties validate each design choice: cluster-aware training at k=10 yields superior embeddings over the self-supervised baseline (Silhouette Score 0.778 vs. 0.761; Davies\u2013Bouldin Index 0.293 vs. 0.364), UMAP compression to five dimensions resolves high-dimensional noise pathologies, and Ward's linkage produces the most compact partitions. The resulting taxonomy reveals three phenomena invisible to commercial classification: cross-category merging of visually indistinguishable stones carrying different market names, intra-category splitting of heterogeneous sub-populations within single varieties, and coherent grouping where commercial and visual boundaries coincide. We further demonstrate that standard extrinsic metrics are misaligned with unsupervised taxonomy objectives when reference labels encode the inconsistencies the method aims to resolve. This work provides a validated methodology for data-driven visual classification in the natural stone industry and a transferable template for domains with unreliable labelling conventions.<\/jats:p>","DOI":"10.20944\/preprints202603.1344.v1","type":"posted-content","created":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T01:18:11Z","timestamp":1773969491000},"source":"Crossref","is-referenced-by-count":0,"title":["Unsupervised Hierarchical Visual Taxonomy of Marble Natural Stone Using Cluster-Aware Self-Supervised Vision Transformers"],"prefix":"10.20944","author":[{"given":"Margarida T\u00e2nger de Oliveira","family":"Figueiredo","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5200-2310","authenticated-orcid":false,"given":"Carlos M. A.","family":"Diogo","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9492-7207","authenticated-orcid":false,"given":"Gustavo","family":"Paneiro","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6135-8263","authenticated-orcid":false,"given":"Pedro","family":"Amaral","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0898-8810","authenticated-orcid":false,"given":"Ant\u00f3nio Alves de","family":"Campos","sequence":"additional","affiliation":[]}],"member":"1968","container-title":[],"original-title":[],"deposited":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T01:18:52Z","timestamp":1773969532000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.preprints.org\/manuscript\/202603.1344\/v1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,17]]},"references-count":0,"URL":"https:\/\/doi.org\/10.20944\/preprints202603.1344.v1","relation":{},"subject":[],"published":{"date-parts":[[2026,3,17]]},"subtype":"preprint"}}