{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:20:16Z","timestamp":1760242816971,"version":"build-2065373602"},"reference-count":50,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2016,8,8]],"date-time":"2016-08-08T00:00:00Z","timestamp":1470614400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Recent research into improving the effectiveness of forest inventory management using airborne LiDAR data has focused on developing advanced theories in data analytics. Furthermore, supervised learning as a predictive model for classifying tree genera (and species, where possible) has been gaining popularity in order to minimize this labor-intensive task. However, bottlenecks remain that hinder the immediate adoption of supervised learning methods. With supervised classification, training samples are required for learning the parameters that govern the performance of a classifier, yet the selection of training data is often subjective and the quality of such samples is critically important. For LiDAR scanning in forest environments, the quantification of data quality is somewhat abstract, normally referring to some metric related to the completeness of individual tree crowns; however, this is not an issue that has received much attention in the literature. Intuitively the choice of training samples having varying quality will affect classification accuracy. In this paper a Diversity Index (DI) is proposed that characterizes the diversity of data quality (Qi) among selected training samples required for constructing a classification model of tree genera. The training sample is diversified in terms of data quality as opposed to the number of samples per class. The diversified training sample allows the classifier to better learn the positive and negative instances and; therefore; has a higher classification accuracy in discriminating the \u201cunknown\u201d class samples from the \u201cknown\u201d samples. Our algorithm is implemented within the Random Forests base classifiers with six derived geometric features from LiDAR data. The training sample contains three tree genera (pine; poplar; and maple) and the validation samples contains four labels (pine; poplar; maple; and \u201cunknown\u201d). Classification accuracy improved from 72.8%; when training samples were selected randomly (with stratified sample size); to 93.8%; when samples were selected with additional criteria; and from 88.4% to 93.8% when an ensemble method was used.<\/jats:p>","DOI":"10.3390\/rs8080646","type":"journal-article","created":{"date-parts":[[2016,8,8]],"date-time":"2016-08-08T10:14:38Z","timestamp":1470651278000},"page":"646","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Maximizing the Diversity of Ensemble Random Forests for Tree Genera Classification Using High Density LiDAR Data"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2700-192X","authenticated-orcid":false,"given":"Connie","family":"Ko","sequence":"first","affiliation":[{"name":"Department of Geography, York University, 4700 Keele Street, Ross North 430, Toronto, ON M3J 1P3, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gunho","family":"Sohn","sequence":"additional","affiliation":[{"name":"Department of Earth and Space Science and Engineering, York University, 4700 Keele Street, Petrie Building 149, Toronto, ON M3J 1P3, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tarmo","family":"Remmel","sequence":"additional","affiliation":[{"name":"Department of Geography, York University, 4700 Keele Street, Ross North 430, Toronto, ON M3J 1P3, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John","family":"Miller","sequence":"additional","affiliation":[{"name":"Department of Earth and Space Science and Engineering, York University, 4700 Keele Street, Petrie Building 149, Toronto, ON M3J 1P3, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2016,8,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Maltamo, M., N\u00e6sset, E., and Vauhkonen, J. (2014). Applications of Airborne Laser Scanning: Concepts and Case Studies Forestry, Springer.","DOI":"10.1007\/978-94-017-8663-8"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1016\/S0034-4257(03)00140-8","article-title":"Identifying species of individual trees using airborne laser scanning","volume":"90","author":"Holmgren","year":"2004","journal-title":"Remote Sens. Environ."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1016\/j.isprsjprs.2006.10.006","article-title":"Classifying individual tree species under leaf-off and leaf-on conditions using airborne LiDAR","volume":"61","author":"Brandtberg","year":"2007","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1537","DOI":"10.1080\/01431160701736471","article-title":"Species identification of individual trees by combining high resolution LiDAR data with multi-spectral images","volume":"29","author":"Holmgren","year":"2008","journal-title":"Int. J. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1148","DOI":"10.1016\/j.rse.2009.02.010","article-title":"Capturing tree crown formation through implicit surface reconstruction using airborne LiDAR data","volume":"113","author":"Kato","year":"2009","journal-title":"Remote Sens. Environ."},{"key":"ref_6","unstructured":"\u00d8rka, H.O., N\u00e6sset, E., and Bollands\u00e5s, O.M. Utilizing Airborne Laser Intensity for Tree Species Classification. Available online: http:\/\/www.isprs.org\/proceedings\/XXXVI\/3-W52\/final_papers\/Oerka_2007.pdf."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1163","DOI":"10.1016\/j.rse.2009.02.002","article-title":"Classifying species of individual trees by intensity and structure features derived from airborne laser scanner data","volume":"113","year":"2009","journal-title":"Remote Sens. Environ."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"S441","DOI":"10.5589\/m08-052","article-title":"Effects of pulse density on predicting characteristics of individual trees of Scandinavian commercial species using alpha shape metrics based on ALS data","volume":"34","author":"Vauhkonen","year":"2008","journal-title":"Can. J. Remote Sens."},{"key":"ref_9","first-page":"37","article-title":"Identification of Scandinavian commercial species of individual trees from airborne laser scanning data using alpha shape metrics","volume":"55","author":"Vauhkonen","year":"2009","journal-title":"For. Sci."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1016\/j.rse.2010.01.016","article-title":"Imputation of single-tree attributes using airborne laser scanning-based height, intensity, and alpha shape metrics","volume":"114","author":"Vauhkonen","year":"2010","journal-title":"Remote Sens. Environ."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"319","DOI":"10.14214\/sf.156","article-title":"Tree species classification using airborne LiDAR\u2014Effects of stand and tree parameters, downsizing of training set, intensity normalization and sensor type","volume":"44","author":"Korpela","year":"2010","journal-title":"Silva Fenn."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1575","DOI":"10.1016\/j.rse.2009.03.017","article-title":"Tree species differentiation using intensity data derived from leaf-on and leaf-off airborne laser scanner data","volume":"113","author":"Kim","year":"2009","journal-title":"Remote Sens. Environ."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"3329","DOI":"10.1016\/j.rse.2011.07.016","article-title":"Classifying individual tree genera using stepwise cluster analysis based on height and intensity metrics derived from airborne laser scanner data","volume":"115","author":"Kim","year":"2011","journal-title":"Remote Sens. Environ."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1109\/TGRS.2004.842022","article-title":"Partially supervised classification of remote sensing images through SVM-based probability density estimation","volume":"43","author":"Mantero","year":"2005","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"2683","DOI":"10.1109\/TGRS.2007.897425","article-title":"A support vector domain description approach to supervised classification of remote sensing images","volume":"45","author":"Bruzzone","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","first-page":"1","article-title":"An overview of classifier fusion methods","volume":"7","author":"Ruta","year":"2000","journal-title":"Compt. Inf. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1007\/978-3-642-12127-2_26","article-title":"A multiple classifier system for classification of LIDAR remote sensing data using multi-class SVM","volume":"5997","author":"Samadzadegan","year":"2010","journal-title":"Multi. Classif. Syst."},{"key":"ref_18","unstructured":"Zhou, Z.H. (2012). Ensemble Methods: Foundations and Algorithms (Chapman & Hall\/Crc Machine Learning & Pattern Recognition), CRC Press."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.inffus.2007.07.002","article-title":"Classifier ensembles: Select real-world applications","volume":"9","author":"Oza","year":"2008","journal-title":"Inf. Fusion"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1109\/TKDE.2005.167","article-title":"On combining classifiers mass functions for text categorization","volume":"17","author":"Bell","year":"2005","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1509","DOI":"10.1109\/TPAMI.2005.207","article-title":"Recognition and verification of unconstrained handwritten words","volume":"27","author":"Koerich","year":"2005","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Lodha, S.K., Kreps, E.J., Helmbold, D.P., and Fitzpatrick, D. (2006, January 14\u201316). Aerial LiDAR data classification using support vector machines (SVM). Proceedings of the IEEE International Symposium on 3D Data Processing, Visualization, and Transmission, Chapel Hill, NC, USA.","DOI":"10.1109\/3DPVT.2006.23"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1007\/s100440200019","article-title":"Hierarchical fusion of multiple classifiers for hyperspectral data analysis","volume":"5","author":"Kumar","year":"2002","journal-title":"Pattern Anal. Appl."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"4224","DOI":"10.1080\/01431161.2013.774099","article-title":"An assessment of the effectiveness of a rotation forest ensemble for land-use and land-cover mapping","volume":"34","author":"Kavzoglu","year":"2013","journal-title":"Int. J. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1761","DOI":"10.1016\/j.patcog.2011.01.017","article-title":"An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs.-one and one-vs.-all schemes","volume":"44","author":"Galar","year":"2011","journal-title":"Pattern Recognit."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1214\/aos\/1028144844","article-title":"Classification by pairwise coupling","volume":"26","author":"Hastie","year":"1998","journal-title":"Ann. Stat."},{"key":"ref_27","first-page":"101","article-title":"In defense of one-vs.-all classification","volume":"5","author":"Rifkin","year":"2004","journal-title":"J. Mach. Learn. Res."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1023\/A:1007607513941","article-title":"An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting and randomization","volume":"40","author":"Dietterich","year":"2000","journal-title":"Mach. Learn."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"624","DOI":"10.1109\/TKDE.2008.181","article-title":"Adapted one-vs.-all decision trees for data stream classification","volume":"21","author":"Hashemi","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1109\/72.991427","article-title":"A comparison of methods for multi-class support vector machines","volume":"13","author":"Hsu","year":"2002","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_31","first-page":"47","article-title":"One-vs.-one and one-vs.-all multiclass SVM-RFE for gene selection in cancer classification","volume":"Volume 4447","author":"Marchiori","year":"2007","journal-title":"EvoBIO 2007"},{"key":"ref_32","unstructured":"Milgram, J., Cheriet, M., and Sabourin, R. (2006, January 5). One against \u201cone\u201d or \u201cone against all\u201d: Which one is better for handwriting recognition with SVMs?. Proceedings of 10th International Workshop on Frontiers in Handwriting Recognition, La Baule, France."},{"key":"ref_33","unstructured":"Yi, L., and Zheng, Y.F. (August, January 31). One-against-all multi-class SVM classification using reliability measures. Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN), Montr\u00e9al, QC, Canada."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1007730.1007733","article-title":"Editorial: Special issue on learning from imbalanced data sets","volume":"6","author":"Chawla","year":"2004","journal-title":"ACM SIGKDD Explor. Newsl."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/j.knosys.2013.01.018","article-title":"Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches","volume":"42","author":"Galar","year":"2013","journal-title":"Knowl. Based Syst."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"806","DOI":"10.1007\/978-3-540-27868-9_88","article-title":"The imbalanced training sample problem: Under or over sampling?","volume":"Volume 3138","author":"Fred","year":"2004","journal-title":"Structural, Syntactic, and Statistical Pattern Recognition"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1145\/1007730.1007734","article-title":"Mining with rarity: A unifying framework","volume":"6","author":"Weiss","year":"2004","journal-title":"ACM SIGKDD Explor. Newsl."},{"key":"ref_38","unstructured":"Japkowicz, N. (2000). Learning from Imbalanced Data Sets: A Comparison of Various Strategies, AAAI. AAAI Technical Report, WS-00-05."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1023\/A:1022859003006","article-title":"Measures of diversity in classifier ensembles","volume":"51","author":"Kuncheva","year":"2003","journal-title":"Mach. Learn."},{"key":"ref_40","unstructured":"Chen, C., Liaw, A., and Breiman, L. (2004). Using Random Forest to Learn Imbalanced Data, University of California. Technical Report 666."},{"key":"ref_41","unstructured":"Fawagreh, K., Gaber, M.M., and Elyan, E. (2014). Intelligent Data Engineering and Automated Learning\u2013IDEAL 2014, Springer."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/BF00058611","article-title":"Error reduction through learning multiple descriptions","volume":"24","author":"Ali","year":"1996","journal-title":"Mach. Learn."},{"key":"ref_43","first-page":"801","article-title":"Arcing classifiers","volume":"26","author":"Breiman","year":"1998","journal-title":"Ann. Stat."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1613\/jair.105","article-title":"Solving multiclass learning problems via error-correcting output codes","volume":"2","author":"Dietterich","year":"1995","journal-title":"J. Artif. Intell. Res."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1291","DOI":"10.1016\/S0031-3203(02)00121-8","article-title":"Attribute bagging: Improving accuracy of classifier ensembles by using random feature subsets","volume":"36","author":"Bryll","year":"2003","journal-title":"Pattern Recognit."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_47","first-page":"18","article-title":"Classification and regression by random Forest","volume":"2","author":"Liaw","year":"2002","journal-title":"R. News."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"S73","DOI":"10.5589\/m13-024","article-title":"Tree genera classification with geometric features from high-density airborne LiDAR","volume":"39","author":"Ko","year":"2013","journal-title":"Can. J. Remote Sens."},{"key":"ref_49","unstructured":"R Development Core Team The R Project for Statistical Computing. Available online: http:\/\/www.R-project.org\/."},{"key":"ref_50","unstructured":"Schwing, A., Zach, C., Zheng, Y., and Pollefeys, M. (2011, January 20\u201325). Adaptive random forest\u2014How many \u201cexperts\u201d to ask before making a decision?. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/8\/8\/646\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T19:27:58Z","timestamp":1760210878000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/8\/8\/646"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,8]]},"references-count":50,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2016,8]]}},"alternative-id":["rs8080646"],"URL":"https:\/\/doi.org\/10.3390\/rs8080646","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2016,8,8]]}}}