{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,15]],"date-time":"2026-05-15T13:50:49Z","timestamp":1778853049916,"version":"3.51.4"},"reference-count":52,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2020,11,18]],"date-time":"2020-11-18T00:00:00Z","timestamp":1605657600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","award":["307411\/2016-8, 310299\/2018-7,"],"award-info":[{"award-number":["307411\/2016-8, 310299\/2018-7,"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002322","name":"Coordena\u00e7\u00e3o de Aperfei\u00e7oamento de Pessoal de N\u00edvel Superior","doi-asserted-by":"publisher","award":["001"],"award-info":[{"award-number":["001"]}],"id":[{"id":"10.13039\/501100002322","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Feature Analysis has become a very critical task in data analysis and visualization. Graph structures are very flexible in terms of representation and may encode important information on features but are challenging in regards to layout being adequate for analysis tasks. In this study, we propose and develop similarity-based graph layouts with the purpose of locating relevant patterns in sets of features, thus supporting feature analysis and selection. We apply a tree layout in the first step of the strategy, to accomplish node placement and overview based on feature similarity. By drawing the remainder of the graph edges on demand, further grouping and relationships among features are revealed. We evaluate those groups and relationships in terms of their effectiveness in exploring feature sets for data analysis. Correlation of features with a target categorical attribute and feature ranking are added to support the task. Multidimensional projections are employed to plot the dataset based on selected attributes to reveal the effectiveness of the feature set. Our results have shown that the tree-graph layout framework allows for a number of observations that are very important in user-centric feature selection, and not easy to observe by any other available tool. They provide a way of finding relevant and irrelevant features, spurious sets of noisy features, groups of similar features, and opposite features, all of which are essential tasks in different scenarios of data analysis. Case studies in application areas centered on documents, images and sound data demonstrate the ability of the framework to quickly reach a satisfactory compact representation from a larger feature set.<\/jats:p>","DOI":"10.3390\/a13110302","type":"journal-article","created":{"date-parts":[[2020,11,18]],"date-time":"2020-11-18T07:41:00Z","timestamp":1605685260000},"page":"302","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Graphs from Features: Tree-Based Graph Layout for Feature Analysis"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4799-8774","authenticated-orcid":false,"given":"Rosane","family":"Minghim","sequence":"first","affiliation":[{"name":"School of Computer Science and Information Technology, University College Cork, Western Road, T12 CY82 Cork, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0345-2075","authenticated-orcid":false,"given":"Liz","family":"Huancapaza","sequence":"additional","affiliation":[{"name":"Institute of Mathematical and Computer Sciences, University of S\u00e3o Paulo, S\u00e3o Paulo 03178-200, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erasmo","family":"Artur","sequence":"additional","affiliation":[{"name":"Superintend\u00eancia de Tecnologia da Informa\u00e7\u00e3o, Federal University of Piau\u00ed, Teresina 64049-550, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2608-4807","authenticated-orcid":false,"given":"Guilherme P.","family":"Telles","sequence":"additional","affiliation":[{"name":"Instituto de Computa\u00e7\u00e3o, University of Campinas, Campinas 13083-852, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5970-2283","authenticated-orcid":false,"given":"Ivar V.","family":"Belizario","sequence":"additional","affiliation":[{"name":"Institute of Mathematical and Computer Sciences, University of S\u00e3o Paulo, S\u00e3o Paulo 03178-200, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,11,18]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1007\/s11704-013-3903-7","article-title":"Big Data Challenge: A Data Management Perspective","volume":"7","author":"Chen","year":"2013","journal-title":"Front. Comput. Sci."},{"key":"ref_2","unstructured":"Liu, H., and Motoda, H. (2007). Computational Methods of Feature Selection (Chapman & Hall\/Crc Data Mining and Knowledge Discovery Series), Chapman & Hall\/CRC."},{"key":"ref_3","unstructured":"Guyon, I., Gunn, S., Nikravesh, M., and Zadeh, L.A. (2008). Feature Extraction: Foundations and Applications, Springer."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"917","DOI":"10.1109\/TC.1977.1674939","article-title":"A Branch and Bound Algorithm for Feature Subset Selection","volume":"C-26","author":"Narendra","year":"1977","journal-title":"IEEE Trans. Comput."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1119","DOI":"10.1016\/0167-8655(94)90127-9","article-title":"Floating search methods in feature selection","volume":"15","author":"Pudil","year":"1994","journal-title":"Pattern Recognit. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1016\/j.patrec.2010.12.016","article-title":"A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets","volume":"32","author":"Bermejo","year":"2011","journal-title":"Pattern Recognit. Lett."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1016\/0167-8655(89)90037-8","article-title":"A note on genetic algorithms for large-scale feature selection","volume":"10","author":"Siedlecki","year":"1989","journal-title":"Pattern Recognit. Lett."},{"key":"ref_8","unstructured":"Vafaie, H., and Jong, K.D. (1992, January 10\u201313). Genetic algorithms as a tool for feature selection in machine learning. Proceedings of the Fourth International Conference on Tools with Artificial Intelligence, Arlington, VA, USA."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yang, C., Chuang, L., Chen, Y., and Yang, C. (2008, January 11\u201313). Feature Selection Using Memetic Algorithms. Proceedings of the Third Int. Conf. on Convergence and Hybrid Information Technology, Busan, Korea.","DOI":"10.1109\/ICCIT.2008.81"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1145\/1809400.1809403","article-title":"Visual Analytics: How Much Visualization and How Much Analytics?","volume":"11","author":"Keim","year":"2010","journal-title":"SIGKDD Explor. Newsl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1002\/jgt.21630","article-title":"Minimal Obstructions for 1-Immersions and Hardness of 1-Planarity Testing","volume":"72","author":"Korzhik","year":"2013","journal-title":"J. Graph Theory"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2459","DOI":"10.1109\/TVCG.2011.212","article-title":"Improved Similarity Trees and their Application to Visual Data Classification","volume":"17","author":"Paiva","year":"2011","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1016\/S0031-3203(01)00084-X","article-title":"A hybrid filter\/wrapper approach of feature selection using information theory","volume":"35","author":"Sebban","year":"2002","journal-title":"Pattern Recognit."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1613\/jair.346","article-title":"Improved Heterogeneous Distance Functions","volume":"6","author":"Wilson","year":"1997","journal-title":"J. Artif. Intell. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1109\/MSP.2002.1028355","article-title":"Applications of entropic spanning graphs","volume":"19","author":"Hero","year":"2002","journal-title":"IEEE Signal Process. Mag."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/s10044-008-0107-0","article-title":"Feature selection, mutual information, and the classification of high-dimensional patterns","volume":"11","author":"Bonev","year":"2008","journal-title":"Pattern Anal. Appl. Vol."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhong, E., Xie, S., Fan, W., Ren, J., Peng, J., and Zhang, K. (2008, January 15\u201319). Graph-Based Iterative Hybrid Feature Selection. Proceedings of the IEEE International Conference on Data Mining, Pisa, Italy.","DOI":"10.1109\/ICDM.2008.63"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Berretta, R., Costa, W., and Moscato, P. (2008). Combinatorial Optimization Models for Finding Genetic Signatures from Gene Expression Datasets. Bioinformatics: Structure, Function and Applications, Humana Press.","DOI":"10.1007\/978-1-60327-429-6_19"},{"key":"ref_19","first-page":"246","article-title":"Graphical Feature Selection for Multilabel Classification Tasks","volume":"Volume 7014","author":"Lastra","year":"2011","journal-title":"Proceedings International Symposium of Advances in Intelligent Data Analysis"},{"key":"ref_20","first-page":"205","article-title":"A Graph-Based Approach to Feature Selection","volume":"Volume 6658","author":"Jiang","year":"2011","journal-title":"Proceedings of the International Workshop on Graph-Based Representations in Pattern Recognition"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1109\/TPAMI.2007.250608","article-title":"Dominant Sets and Pairwise Clustering","volume":"29","author":"Pavan","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1991","DOI":"10.1016\/j.patrec.2012.03.021","article-title":"Hypergraph based information-theoretic feature selection","volume":"33","author":"Zhang","year":"2012","journal-title":"Pattern Recognit. Lett."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Mandal, M., and Mukhopadhyay, A. (2013). Unsupervised Non-redundant Feature Selection: A Graph-Theoretic Approach. Proceedings of the International Conference on Frontiers of Intelligent Computing: Theory and Applications, Springer.","DOI":"10.1007\/978-3-642-35314-7_43"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1109\/TKDE.2015.2493537","article-title":"Graph Regularized Feature Selection with Data Reconstruction","volume":"28","author":"Zhao","year":"2016","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/s12046-019-1238-2","article-title":"An information-theoretic graph-based approach for feature selection","volume":"45","author":"Das","year":"2020","journal-title":"Sadhana"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Roffo, G., Melzi, S., Castellani, U., Vinciarelli, A., and Cristani, M. (2020). Infinite Feature Selection: A Graph-based Feature Filtering Approach. IEEE Trans. Pattern Anal. Mach. Intell., 1.","DOI":"10.1109\/TPAMI.2020.3002843"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1614","DOI":"10.1109\/TVCG.2014.2346482","article-title":"INFUSE: Interactive Feature Selection for Predictive Modeling of High Dimensional Data","volume":"20","author":"Krause","year":"2014","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1111\/cgf.12385","article-title":"Visual-interactive Exploration of Interesting Multivariate Relations in Mixed Research Data Sets","volume":"33","author":"Bernard","year":"2014","journal-title":"Comput. Graph. Forum"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"May, T., Bannach, A., Davey, J., Ruppert, T., and Kohlhammer, J. (2011, January 23\u201328). Guiding feature subset selection with an interactive visualization. Proceedings of the 2011 IEEE Conference on Visual Analytics Science and Technology (VAST), Providence, RI, USA.","DOI":"10.1109\/VAST.2011.6102448"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1111\/cgf.13197","article-title":"Linear Discriminative Star Coordinates for Exploring Class and Cluster Separation of High Dimensional Data","volume":"36","author":"Wang","year":"2017","journal-title":"Comput. Graph. Forum"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1016\/j.eswa.2018.01.054","article-title":"Scaled radial axes for interactive visual feature selection: A case study for analyzing chronic conditions","volume":"100","author":"Sanchez","year":"2018","journal-title":"Expert Syst. Appl."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1016\/j.cag.2019.08.015","article-title":"A novel visual approach for enhanced attribute analysis and selection","volume":"84","author":"Artur","year":"2019","journal-title":"Comput. Graph."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2591","DOI":"10.1109\/TVCG.2011.178","article-title":"Brushing Dimensions\u2014A Dual Visual Analysis Model for High-Dimensional Data","volume":"17","author":"Turkay","year":"2011","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2625","DOI":"10.1109\/TVCG.2013.150","article-title":"Dimension Projection Matrix\/Tree: Interactive Subspace Visual Exploration and Analysis of High Dimensional Data","volume":"19","author":"Yuan","year":"2013","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1177\/1473871617713337","article-title":"Projections as Visual Aids for Classification System Design","volume":"17","author":"Rauber","year":"2017","journal-title":"Inf. Vis."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1962","DOI":"10.1109\/TVCG.2013.125","article-title":"A Partition-Based Framework for Building and Validating Regression Models","volume":"19","author":"Piringer","year":"2013","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1109\/TVCG.2015.2468291","article-title":"3D Regression Heat Map Analysis of Population Study Data","volume":"22","author":"Klemm","year":"2016","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhang, C., Yang, J., Zhan, F.B., Gong, X., Brender, J.D., Langlois, P.H., Barlowe, S., and Zhao, Y. (2016, January 19\u201322). A visual analytics approach to high-dimensional logistic regression modeling and its application to an environmental health study. Proceedings of the 2016 IEEE Pacific Visualization Symposium (PacificVis), Taipei, Taiwan.","DOI":"10.1109\/PACIFICVIS.2016.7465261"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1109\/TVCG.2018.2865043","article-title":"RegressionExplorer: Interactive Exploration of Logistic Regression Models with Subgroup Analysis","volume":"25","author":"Dingen","year":"2019","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1111\/cgf.13728","article-title":"The State of the Art in Visualizing Multivariate Networks","volume":"38","author":"Nobre","year":"2019","journal-title":"Comput. Graph. Forum (EuroVis)"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Wang, C., Yu, H., Grout, R.W., Ma, K., and Chen, J.H. (2011, January 1\u20134). Analyzing information transfer in time-varying multivariate data. Proceedings of the 2011 IEEE Pacific Visualization Symposium, Hong Kong, China.","DOI":"10.1109\/PACIFICVIS.2011.5742378"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zhang, Z., McDonnell, K.T., and Mueller, K. (March, January 28). A network-based interface for the exploration of high-dimensional data spaces. Proceedings of the 2012 IEEE Pacific Visualization Symposium, Songdo, Korea.","DOI":"10.1109\/PacificVis.2012.6183569"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"2683","DOI":"10.1109\/TVCG.2013.133","article-title":"An Information-Aware Framework for Exploring Multivariate Data Sets","volume":"19","author":"Biswas","year":"2013","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-006-6226-1","article-title":"Extremely Randomized Trees","volume":"36","author":"Geurts","year":"2006","journal-title":"Mach. Learn."},{"key":"ref_45","unstructured":"Cuadros, A.M., Paulovich, F.V., Minghim, R., and Telles, G.P. (November, January 30). Point placement by phylogenetic trees and its application to visual analysis of document collections. Proceedings of the IEEE Symposium on Visual Analytics Science and Technology, Sacramento, CA, USA."},{"key":"ref_46","first-page":"406","article-title":"The neighbor-joining method: A new method for reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol. Biol. Evol."},{"key":"ref_47","first-page":"2579","article-title":"Visualizing High-Dimensional Data Using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1109\/TVCG.2007.70443","article-title":"Least Square Projection: A Fast High-Precision Multidimensional Projection Technique and Its Application to Document Mapping","volume":"14","author":"Paulovich","year":"2008","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_49","unstructured":"McInnes, L., Healy, J., and Melville, J. (2020, November 16). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. Available online: https:\/\/arxiv.org\/abs\/1802.03426."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: A graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J. Comput. Appl. Math."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Dias, F.F., Pedrini, H., and Minghim, R. (2020). Soundscape segregation based on visual analysis and discriminating features. Ecol. Inform., 101184.","DOI":"10.1016\/j.ecoinf.2020.101184"},{"key":"ref_52","unstructured":"Hilasaca, L.M.H. (2020). Visual Active Learning para Rotulagem por Caracter\u00edsticas Discriminantes em Paisagens Ac\u00fasticas. [Master\u2019s Thesis, Instituto de Ci\u00eancias Matem\u00e1ticas e de Computa\u00e7\u00e3o, Universidade de S\u00e3o Paulo]."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/13\/11\/302\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:33:43Z","timestamp":1760178823000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/13\/11\/302"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,18]]},"references-count":52,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2020,11]]}},"alternative-id":["a13110302"],"URL":"https:\/\/doi.org\/10.3390\/a13110302","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,11,18]]}}}