{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,3]],"date-time":"2026-05-03T03:20:41Z","timestamp":1777778441788,"version":"3.51.4"},"reference-count":34,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2004,12,1]],"date-time":"2004-12-01T00:00:00Z","timestamp":1101859200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information Visualization"],"published-print":{"date-parts":[[2004,12]]},"abstract":"<jats:p>Clustering is an important technique for understanding of large multidimensional datasets. Most of clustering research to date has been focused on developing automatic clustering algorithms and cluster validation methods. The automatic algorithms are known to work well in dealing with clusters of regular shapes, for example, compact spherical shapes, but may incur higher error rates when dealing with arbitrarily shaped clusters. Although some efforts have been devoted to addressing the problem of skewed datasets, the problem of handling clusters with irregular shapes is still in its infancy, especially in terms of dimensionality of the datasets and the precision of the clustering results considered. Not surprisingly, the statistical indices works ineffective in validating clusters of irregular shapes, too. In this paper, we address the problem of clustering and validating arbitrarily shaped clusters with a visual framework (VISTA). The main idea of the VISTA approach is to capitalize on the power of visualization and interactive feedbacks to encourage domain experts to participate in the clustering revision and clustering validation process. The VISTA system has two unique features. First, it implements a linear and reliable visualization model to interactively visualize multi-dimensional datasets in a 2D star-coordinate space. Second, it provides a rich set of user-friendly interactive rendering operations, allowing users to validate and refine the cluster structure based on their visual experience as well as their domain knowledge.<\/jats:p>","DOI":"10.1057\/palgrave.ivs.9500076","type":"journal-article","created":{"date-parts":[[2004,7,22]],"date-time":"2004-07-22T05:15:30Z","timestamp":1090473330000},"page":"257-270","source":"Crossref","is-referenced-by-count":54,"title":["VISTA: Validating and Refining Clusters Via Visualization"],"prefix":"10.1177","volume":"3","author":[{"given":"Keke","family":"Chen","sequence":"first","affiliation":[{"name":"College of Computing, Georgia Institute of Technology, GA, U.S.A."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ling","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Computing, Georgia Institute of Technology, GA, U.S.A."}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2004,12,1]]},"reference":[{"key":"bibr1-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1145\/565117.565124"},{"key":"bibr2-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Guha S, Rastogi R, Shim K. CURE: an efficient clustering algorithm for large databases. Proceeding of ACM SIGMOD Conference. Seattle, Washington, USA, 1998.","DOI":"10.1145\/276304.276312"},{"key":"bibr3-palgrave.ivs.9500076","unstructured":"Sheikholeslami G, Chatterjee S, Zhang A. Wavecluster: a multi-resolution clustering approach for very large spatial databases. Proceeding of Very Large Databases Conference (VLDB). New York City, NY, USA, 1998."},{"key":"bibr4-palgrave.ivs.9500076","unstructured":"Ester M, Kriegel HP, Sander J, Xu X. A density-based algorithm for discovering clusters in large spatial databases with noise. Portland, Oregon, USA, Second International Conference on Knowledge Discovery and Data Mining 1996."},{"key":"bibr5-palgrave.ivs.9500076","volume-title":"Algorithms for Clustering Data","author":"Jain AK","year":"1988"},{"key":"bibr6-palgrave.ivs.9500076","volume-title":"Applied Multivariate Techniques","author":"Sharma S.","year":"1995"},{"key":"bibr7-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1145\/331499.331504"},{"key":"bibr8-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1111\/j.1551-6708.1987.tb00863.x"},{"key":"bibr9-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1145\/381641.381656"},{"key":"bibr10-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1057\/palgrave.ivs.9500006"},{"key":"bibr11-palgrave.ivs.9500076","volume-title":"Geometric Methods and Applications for Computer Science and Engineering","author":"Gallier J.","year":"2000"},{"key":"bibr12-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Kandogan E. Visualizing multi-dimensional clusters, trends, and outliers using star coordinates. Proceedings of ACM SIGKDD Conference. San Francisco, CA, USA, 2001.","DOI":"10.1145\/502512.502530"},{"key":"bibr13-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-5725-8"},{"key":"bibr14-palgrave.ivs.9500076","volume-title":"Mixture Models: Inference and Application to Clustering","author":"McLachlan G","year":"1988"},{"key":"bibr15-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Chen K, Liu L. Cluster rendering of skewed datasets via visualization. Proceedings of ACM Symposium on Applied Computing (SAC), Melbourne, Florida, USA, 2003.","DOI":"10.1145\/952710.952712"},{"key":"bibr16-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Zhang T, Ramakrishnan R, Livny M. BIRCH: an efficient data clustering method for very large databases. Proceedings of ACM SIGMOD Conference. Montreal, Canada, 1996.","DOI":"10.1145\/235968.233324"},{"key":"bibr17-palgrave.ivs.9500076","volume-title":"Artificial Intelligence: A Modern Approach","author":"Russel S","year":"1995"},{"key":"bibr18-palgrave.ivs.9500076","volume-title":"Machine Learning","author":"Mitchell T.","year":"1997"},{"key":"bibr19-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Blum A, Mitchell T. Combining labeled and unlabeled data with co-training. Proceedings of 8th Annual Conference on Computational Learning Theory, Madison, Wisconsin, USA, 1998.","DOI":"10.1145\/279943.279962"},{"key":"bibr20-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1109\/2.781637"},{"key":"bibr21-palgrave.ivs.9500076","unstructured":"Xu X, Ester M, Kriegel HP, Sander J. A distribution-based clustering algorithm for mining in large spatial databases. Proceedings of IEEE International Conference on Data Engineering (ICDE). Orlando, Florida, USA, 1998."},{"key":"bibr22-palgrave.ivs.9500076","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1080\/10618600.1995.10474674","volume":"23","author":"Cook D","year":"1995","journal-title":"Journal of Computational and Graphical Statistics"},{"key":"bibr23-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Yang L. Interactive exploration of very large relational datasets through 3d dynamic projections. Proceedings of ACM SIGKDD Conference. Boston, MA, USA, 2000.","DOI":"10.1145\/347090.347134"},{"key":"bibr24-palgrave.ivs.9500076","unstructured":"Dhillon IS, Modha DS, Spangler WS. Visualizing class structure of multidimensional data. The 30th Symposium on the Interface: Computing Science and Statistics, Minneapolis, Minnesota, USA, 1998."},{"key":"bibr25-palgrave.ivs.9500076","volume-title":"Visualizing data","author":"Cleveland WS","year":"1993"},{"key":"bibr26-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Faloutsos C, Lin KID. FastMap: a fast algorithm for indexing, datamining and visualization of traditional and multimedia datasets. Proceedings of ACM SIGMOD Conference, San Jose, CA, USA, 1995.","DOI":"10.1145\/568271.223812"},{"key":"bibr27-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1109\/DASFAA.2001.916368"},{"key":"bibr28-palgrave.ivs.9500076","unstructured":"Grinstein G, Ankerst M, Keim DA. Visual data mining: background, applications, and drug discovery applications. Proceeding of ACM SIGMOD Conference. Philadelphia, PA, USA, 1999."},{"key":"bibr29-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Hinneburg A, Keim DA, Wawryniuk M. Visual mining of high-dimensional data. IEEE Computer Graphics and Applications, 1999; 1\u20138.","DOI":"10.1109\/38.788795"},{"key":"bibr30-palgrave.ivs.9500076","doi-asserted-by":"crossref","unstructured":"Ankerst M, Breunig MM, Kriegel HP, Sander J. OPTICS: ordering points to identify the clustering structure. Proceeding of ACM SIGMOD Conference, Philadelphia, PA, USA, 1999.","DOI":"10.1145\/304182.304187"},{"key":"bibr31-palgrave.ivs.9500076","unstructured":"Chen K, Liu L. Validating and refining clusters via visual rendering. Proceedings of International Conference on Data Mining (1CDM), Melbourne, Florida, USA, 2003."},{"key":"bibr32-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-1904-8"},{"key":"bibr33-palgrave.ivs.9500076","first-page":"580","volume":"5","author":"DeMers D","year":"1993","journal-title":"Advances in Neural Information Processing Systems"},{"key":"bibr34-palgrave.ivs.9500076","doi-asserted-by":"publisher","DOI":"10.2307\/2347949"}],"container-title":["Information Visualization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1057\/palgrave.ivs.9500076","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1057\/palgrave.ivs.9500076","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T19:19:32Z","timestamp":1777490372000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1057\/palgrave.ivs.9500076"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,12]]},"references-count":34,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2004,12]]}},"alternative-id":["10.1057\/palgrave.ivs.9500076"],"URL":"https:\/\/doi.org\/10.1057\/palgrave.ivs.9500076","relation":{},"ISSN":["1473-8716","1473-8724"],"issn-type":[{"value":"1473-8716","type":"print"},{"value":"1473-8724","type":"electronic"}],"subject":[],"published":{"date-parts":[[2004,12]]}}}