{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T14:37:31Z","timestamp":1774535851013,"version":"3.50.1"},"reference-count":48,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2017,10,14]],"date-time":"2017-10-14T00:00:00Z","timestamp":1507939200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Information Visualization"],"published-print":{"date-parts":[[2019,1]]},"abstract":"<jats:p> Due to the intricate relationship between different dimensions of high-dimensional data, subspace analysis is often conducted to decompose dimensions and give prominence to certain subsets of dimensions, i.e. subspaces. Exploring and comparing subspaces are important to reveal the underlying features of subspaces, as well as to portray the characteristics of individual dimensions. To date, most of the existing high-dimensional data exploration and analysis approaches rely on dimensionality reduction algorithms (e.g. principal component analysis and multi-dimensional scaling) to project high-dimensional data, or their subspaces, to two-dimensional space and employ scatterplots for visualization. However, the dimensionality reduction algorithms are sometimes difficult to fine-tune and scatterplots are not effective for comparative visualization, making subspace comparison hard to perform. In this article, we aggregate high-dimensional data or their subspaces by computing pair-wise distances between all data items and showing the distances with matrix visualizations to present the original high-dimensional data or subspaces. Our approach enables effective visual comparisons among subspaces, which allows users to further investigate the characteristics of individual dimensions by studying their behaviors in similar subspaces. Through subspace comparisons, we identify dominant, similar, and conforming dimensions in different subspace contexts of synthetic and real-world high-dimensional data sets. Additionally, we present a prototype that integrates parallel coordinates plot and matrix visualization for high-dimensional data exploration and incremental dimensionality analysis, which also allows users to further validate the dimension characterization results derived from the subspace comparisons. <\/jats:p>","DOI":"10.1177\/1473871617733996","type":"journal-article","created":{"date-parts":[[2017,10,14]],"date-time":"2017-10-14T12:04:38Z","timestamp":1507982678000},"page":"94-109","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":12,"title":["High-dimensional data analysis with subspace comparison using matrix visualization"],"prefix":"10.1177","volume":"18","author":[{"given":"Junpeng","family":"Wang","sequence":"first","affiliation":[{"name":"The Ohio State University, Columbus, OH, USA"}]},{"given":"Xiaotong","family":"Liu","sequence":"additional","affiliation":[{"name":"IBM Research\u2013Almaden, San Jose, CA, USA"}]},{"given":"Han-Wei","family":"Shen","sequence":"additional","affiliation":[{"name":"The Ohio State University, Columbus, OH, USA"}]}],"member":"179","published-online":{"date-parts":[[2017,10,14]]},"reference":[{"key":"bibr1-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1177\/1473871612460526"},{"key":"bibr2-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2015.2467431"},{"key":"bibr3-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1145\/1497577.1497578"},{"key":"bibr4-1473871617733996","first-page":"217","volume-title":"Proceedings of the international conference on database theory","author":"Beyer K"},{"key":"bibr5-1473871617733996","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687770"},{"key":"bibr6-1473871617733996","first-page":"35","volume-title":"Proceedings of the IEEE symposium on visual analytics science and technology (VAST)","author":"Ferdosi BJ"},{"key":"bibr7-1473871617733996","first-page":"63","volume-title":"Proceedings of the IEEE conference on visual analytics science and technology (VAST)","author":"Tatu A"},{"key":"bibr8-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2013.150"},{"key":"bibr9-1473871617733996","first-page":"128","volume-title":"Proceedings of the IEEE Pacific visualization symposium (PacificVis)","author":"Zhou F"},{"key":"bibr10-1473871617733996","volume-title":"Principal component analysis","author":"Jolliffe I.","year":"2002"},{"key":"bibr11-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1201\/9780367801700"},{"key":"bibr12-1473871617733996","first-page":"2579","volume":"9","author":"Van der Maaten L","year":"2008","journal-title":"J Mach Learn Res"},{"key":"bibr13-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-68628-8"},{"key":"bibr14-1473871617733996","first-page":"57","volume-title":"Proceedings of the 2014 IEEE Pacific visualization symposium (PacificVis)","author":"Palmas G"},{"key":"bibr15-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1080\/00401706.1987.10488204"},{"key":"bibr16-1473871617733996","first-page":"318","volume-title":"Proceedings of the SIGCHI conference on human factors in computing systems","author":"Rao R"},{"key":"bibr17-1473871617733996","first-page":"1242","volume-title":"Proceedings of the 2004 ACM symposium on applied computing","author":"Tominski C"},{"key":"bibr18-1473871617733996","first-page":"57","volume-title":"Proceedings of the IEEE symposium on information visualization (INFOVIS 2004)","author":"Williams M"},{"key":"bibr19-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2011.178"},{"key":"bibr20-1473871617733996","first-page":"11","volume-title":"Proceedings of the 2016 IEEE 6th symposium on large data analysis and visualization (LDAV)","author":"Krause J"},{"key":"bibr21-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2013.133"},{"key":"bibr22-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346321"},{"key":"bibr23-1473871617733996","first-page":"287","volume-title":"Proceedings of the 2015 IEEE Pacific visualization symposium (PacificVis)","author":"Watanabe K"},{"key":"bibr24-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12639"},{"key":"bibr25-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1145\/276305.276314"},{"key":"bibr26-1473871617733996","first-page":"84","volume-title":"Proceedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining","author":"Cheng CH"},{"key":"bibr27-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1145\/304181.304188"},{"key":"bibr28-1473871617733996","first-page":"418","volume-title":"Proceedings of the 2002 ACM SIGMOD international conference on management of data","author":"Procopiuc CM"},{"key":"bibr29-1473871617733996","first-page":"11","volume-title":"Proceedings of the 4th IEEE international conference on data mining","author":"Baumgartner C"},{"key":"bibr30-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12935"},{"key":"bibr31-1473871617733996","volume-title":"Semiology of graphics: diagrams, networks, maps","author":"Bertin J.","year":"1983"},{"key":"bibr32-1473871617733996","first-page":"483","volume-title":"Proceedings of the SIGCHI conference on human factors in computing systems (CHI\u201913)","author":"Alper B"},{"key":"bibr33-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1057\/palgrave.ivs.9500092"},{"key":"bibr34-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1057\/palgrave.ivs.9500116"},{"key":"bibr35-1473871617733996","first-page":"269","volume-title":"Proceedings of the 33rd annual ACM conference on human factors in computing systems (CHI\u201915)","author":"Liu X"},{"key":"bibr36-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2015.2467199"},{"key":"bibr37-1473871617733996","first-page":"580","volume-title":"Proceedings of the international conference on scientific and statistical database management","author":"Achtert E"},{"key":"bibr38-1473871617733996","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824115"},{"issue":"2","key":"bibr39-1473871617733996","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1002\/sam.10071","volume":"3","author":"Liiv I.","year":"2010","journal-title":"Stat Anal Data Min"},{"key":"bibr40-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2009.09.011"},{"key":"bibr41-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1111\/1467-9868.00293"},{"key":"bibr42-1473871617733996","volume-title":"Beautiful evidence","author":"Tufte ER.","year":"2006"},{"key":"bibr43-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1126\/science.132.3434.1115"},{"key":"bibr44-1473871617733996","unstructured":"Bilmes JA. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models, vol. 4. Berkeley, CA: International Computer Science Institute, 1998, p. 126."},{"key":"bibr45-1473871617733996","first-page":"89","volume-title":"Proceedings of the IEEE symposium on information visualization","author":"Peng W"},{"key":"bibr46-1473871617733996","volume-title":"Computer solution of large sparse positive definite","author":"George A","year":"1981"},{"key":"bibr47-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1137\/S0895479894278952"},{"key":"bibr48-1473871617733996","doi-asserted-by":"publisher","DOI":"10.1137\/0613024"}],"container-title":["Information Visualization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1473871617733996","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1473871617733996","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1473871617733996","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T23:56:44Z","timestamp":1740873404000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1473871617733996"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,14]]},"references-count":48,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,1]]}},"alternative-id":["10.1177\/1473871617733996"],"URL":"https:\/\/doi.org\/10.1177\/1473871617733996","relation":{},"ISSN":["1473-8716","1473-8724"],"issn-type":[{"value":"1473-8716","type":"print"},{"value":"1473-8724","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,10,14]]}}}