{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T09:33:13Z","timestamp":1770975193682,"version":"3.50.1"},"reference-count":53,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2024,1,31]],"date-time":"2024-01-31T00:00:00Z","timestamp":1706659200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the Project of Introducing Talents of Jilin Agricultural University","award":["202020010"],"award-info":[{"award-number":["202020010"]}]},{"name":"the Project of Introducing Talents of Jilin Agricultural University","award":["2021C044-10"],"award-info":[{"award-number":["2021C044-10"]}]},{"name":"the Project of Introducing Talents of Jilin Agricultural University","award":["2021YFD1500100"],"award-info":[{"award-number":["2021YFD1500100"]}]},{"name":"the Jilin Provincial Development and Reform Commission Innovation Capacity Building Project","award":["202020010"],"award-info":[{"award-number":["202020010"]}]},{"name":"the Jilin Provincial Development and Reform Commission Innovation Capacity Building Project","award":["2021C044-10"],"award-info":[{"award-number":["2021C044-10"]}]},{"name":"the Jilin Provincial Development and Reform Commission Innovation Capacity Building Project","award":["2021YFD1500100"],"award-info":[{"award-number":["2021YFD1500100"]}]},{"name":"the National Key R&amp;D Program of China","award":["202020010"],"award-info":[{"award-number":["202020010"]}]},{"name":"the National Key R&amp;D Program of China","award":["2021C044-10"],"award-info":[{"award-number":["2021C044-10"]}]},{"name":"the National Key R&amp;D Program of China","award":["2021YFD1500100"],"award-info":[{"award-number":["2021YFD1500100"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Soil organic matter (SOM) is an essential 
component of soil and is crucial for increasing agricultural production and soil fertility. The combination of hyperspectral remote sensing and deep learning can be used to predict the SOM content efficiently, rapidly, and cost-effectively on various scales. However, determining the optimal groups, inputs, and models for reducing the spatial heterogeneity of soil nutrients in large regions and improving the accuracy of SOM prediction remains a challenge. Hyperspectral reflectance data from 1477 surface soil samples in Northeast China were utilized to evaluate three grouping methods (no groups (NG), traditional grouping (TG), and spectral grouping (SG)) and four inputs (raw reflectance (RR), continuum removal (CR), fractional-order differentiation (FOD), and spectral characteristic parameters (SCPs)). The SOM prediction accuracies of random forest (RF), convolutional neural network (CNN), and long short-term memory (LSTM) models were assessed. The results were as follows: (1) The highest accuracy was achieved using SG, SCPs, and the LSTM model, with a coefficient of determination (R2) of 0.82 and a root mean squared error (RMSE) of 0.69%. (2) The LSTM model exhibited the highest accuracy in SOM prediction (R2 = 0.82, RMSE = 0.89%), followed by the CNN model (R2 = 0.72, RMSE = 0.85%) and the RF model (R2 = 0.69, RMSE = 0.91%). (3) The SG provided higher SOM prediction accuracy than TG and NG. (4) The SCP-based prediction results were significantly better than those of the other inputs. The R2 of the SCP-based model was 0.27 higher and the RMSE was 0.40% lower than those of the RR-based model with NG. In addition, the LSTM model had higher prediction errors at low (0\u20132%) and high (8\u201310%) SOM contents, whereas the error was minimal at intermediate SOM contents (2\u20138%). 
The study results provide guidance for selecting grouping methods and approaches to improve the prediction accuracy of the SOM content and reduce the spatial heterogeneity of the SOM content in large regions.<\/jats:p>","DOI":"10.3390\/rs16030565","type":"journal-article","created":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T09:43:22Z","timestamp":1706780602000},"page":"565","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Methodology for Regional Soil Organic Matter Prediction with Spectroscopy: Optimal Sample Grouping, Input Variables, and Prediction Model"],"prefix":"10.3390","volume":"16","author":[{"given":"Xinle","family":"Zhang","sequence":"first","affiliation":[{"name":"College of Information Technology, Jilin Agricultural University, Changchun 130118, China"}]},{"given":"Chang","family":"Dong","sequence":"additional","affiliation":[{"name":"College of Information Technology, Jilin Agricultural University, Changchun 130118, China"}]},{"given":"Huanjun","family":"Liu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China"}]},{"given":"Xiangtian","family":"Meng","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6650-1022","authenticated-orcid":false,"given":"Chong","family":"Luo","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Black Soils Conservation and Utilization, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China"}]},{"given":"Yongqi","family":"Han","sequence":"additional","affiliation":[{"name":"College of Information Technology, 
Jilin Agricultural University, Changchun 130118, China"}]},{"given":"Hongfu","family":"Ai","sequence":"additional","affiliation":[{"name":"College of Information Technology, Jilin Agricultural University, Changchun 130118, China"}]}],"member":"1968","published-online":{"date-parts":[[2024,1,31]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Ge, X., Ding, J., Jin, X., Wang, J., Chen, X., Li, X., Liu, J., and Xie, B. (2021). Estimating Agricultural Soil Moisture Content through UAV-Based Hyperspectral Images in the Arid Region. Remote Sens., 13.","DOI":"10.3390\/rs13081562"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1016\/j.earscirev.2016.01.012","article-title":"A Global Spectral Library to Characterize the World\u2019s Soil","volume":"155","author":"Rossel","year":"2016","journal-title":"Earth-Sci. Rev."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1007\/s10812-009-9219-6","article-title":"Estimation and Analysis of the Parameters of a Field Spectroradiometer Covering the Spectral Range 350\u20132500 Nm","volume":"76","author":"Belyaev","year":"2009","journal-title":"J. Appl. Spectrosc."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/bs.agron.2015.02.002","article-title":"Chapter Four\u2014Soil Spectroscopy: An Alternative to Wet Chemistry for Soil Monitoring","volume":"Volume 132","author":"Sparks","year":"2015","journal-title":"Advances in Agronomy"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"12653","DOI":"10.1029\/JB095iB08p12653","article-title":"High Spectral Resolution Reflectance Spectroscopy of Minerals","volume":"95","author":"Clark","year":"1990","journal-title":"J. Geophys. Res. 
Solid Earth"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1007\/s10533-021-00755-1","article-title":"Soil Organic Carbon Fractions in the Great Plains of the United States: An Application of Mid-Infrared Spectroscopy","volume":"156","author":"Sanderman","year":"2021","journal-title":"Biogeochemistry"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1007\/s00340-021-07737-z","article-title":"Theoretical Investigation of Applicability and Limitations of Advanced Noise Reduction Methods for Wavelength Modulation Spectroscopy","volume":"128","author":"Fischer","year":"2022","journal-title":"Appl. Phys. B"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"218","DOI":"10.17221\/113\/2015-SWR","article-title":"Comparing Different Data Preprocessing Methods for Monitoring Soil Heavy Metals Based on Soil Spectral Features","volume":"10","author":"Gholizadeh","year":"2015","journal-title":"Soil Water Res."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Chu, X., Huang, Y., Yun, Y.-H., and Bian, X. (2022). Chemometric Methods in Analytical Spectroscopy Technology, Springer Nature.","DOI":"10.1007\/978-981-19-1625-0"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.rse.2012.12.015","article-title":"Retrieval of Spruce Leaf Chlorophyll Content from Airborne Image Data Using Continuum Removal and Radiative Transfer","volume":"131","author":"Kaplan","year":"2013","journal-title":"Remote Sens. 
Environ."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"115696","DOI":"10.1016\/j.geoderma.2022.115696","article-title":"Prediction of Soil Organic Matter Using Different Soil Classification Hierarchical Level Stratification Strategies and Spectral Characteristic Parameters","volume":"411","author":"Meng","year":"2022","journal-title":"Geoderma"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"e971252","DOI":"10.1155\/2012\/971252","article-title":"A Comparison of Feature-Based MLR and PLS Regression Techniques for the Prediction of Three Soil Constituents in a Degraded South African Ecosystem","volume":"2012","author":"Bayer","year":"2012","journal-title":"Appl. Environ. Soil Sci."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Laukamp, C., Rodger, A., LeGras, M., Lampinen, H., Lau, I.C., Pejcic, B., Stromberg, J., Francis, N., and Ramanaidou, E. (2021). Mineral Physicochemistry Underlying Feature-Based Extraction of Mineral Abundance and Composition from Shortwave, Mid and Thermal Infrared Reflectance Spectra. Minerals, 11.","DOI":"10.3390\/min11040347"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1080\/00387010.2017.1297958","article-title":"Hyperspectral Estimation of Soil Organic Matter Based on Different Spectral Preprocessing Techniques","volume":"50","author":"Qiao","year":"2017","journal-title":"Spectrosc. 
Lett."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.catena.2018.10.051","article-title":"Combination of Fractional Order Derivative and Memory-Based Learning Algorithm to Improve the Estimation Accuracy of Soil Organic Matter by Visible and near-Infrared Spectroscopy","volume":"174","author":"Hong","year":"2019","journal-title":"Catena"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1016\/S0065-2113(10)07005-7","article-title":"Chapter Five\u2014Visible and Near Infrared Spectroscopy in Soil Science","volume":"Volume 107","author":"Sparks","year":"2010","journal-title":"Advances in Agronomy"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Shi, Y., Zhao, J., Song, X., Qin, Z., Wu, L., Wang, H., and Tang, J. (2021). Hyperspectral Band Selection and Modeling of Soil Organic Matter Content in a Forest Using the Ranger Algorithm. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0253385"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1111\/ejss.12485","article-title":"Prediction of Soil Organic Carbon at the Country Scale: Stratification Strategies for near-Infrared Data","volume":"68","author":"Jaconi","year":"2017","journal-title":"Eur. J. Soil Sci."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1255\/jnirs.923","article-title":"Near Infrared Reflectance Spectroscopy for Estimating Soil Characteristics Valuable in the Diagnosis of Soil Fertility","volume":"19","author":"Genot","year":"2011","journal-title":"J. Infrared Spectrosc."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1007\/s12665-021-09582-x","article-title":"Improving the Accuracy of Soil Organic Carbon Content Prediction Based on Visible and Near-Infrared Spectroscopy and Machine Learning","volume":"80","author":"Xu","year":"2021","journal-title":"Environ. 
Earth Sci."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.chemolab.2011.11.003","article-title":"Optimization Criteria in Sample Selection Step of Local Regression for Quantitative Analysis of Large Soil NIRS Database","volume":"110","author":"Joffre","year":"2012","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Sun, W., Zhang, X., Zou, B., and Wu, T. (2017). Exploring the Potential of Spectral Classification in Estimation of Soil Contaminant Elements. Remote Sens., 9.","DOI":"10.3390\/rs9060632"},{"key":"ref_23","first-page":"268","article-title":"The Spectrum-Based Learner: A New Local Approach for Modeling Soil Vis\u2013NIR Spectra of Complex Datasets","volume":"195\u2013196","author":"Behrens","year":"2013","journal-title":"Geoderma"},{"key":"ref_24","first-page":"1","article-title":"Deep Learning Application for Predicting Soil Organic Matter Content by VIS-NIR Spectroscopy","volume":"2019","author":"Xu","year":"2019","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Zhang, L., Cai, Y., Huang, H., Li, A., Yang, L., and Zhou, C. (2022). A CNN-LSTM Model for Soil Organic Carbon Content Prediction with Long Time Series of MODIS-Based Phenological Variables. 
Remote Sens., 14.","DOI":"10.3390\/rs14184441"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"107","DOI":"10.5194\/soil-5-107-2019","article-title":"Multi-Source Data Integration for Soil Mapping Using Deep Learning","volume":"5","author":"Wadoux","year":"2019","journal-title":"Soil"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"565","DOI":"10.5194\/soil-6-565-2020","article-title":"The Influence of Training Sample Size on the Accuracy of Deep Learning Models for the Prediction of Soil Properties with Near-Infrared Spectroscopy Data","volume":"6","author":"Ng","year":"2020","journal-title":"Soil"},{"key":"ref_28","first-page":"456","article-title":"A Rapid and Accurate Procedure for Estimation of Organic Carbon in Soils","volume":"84","author":"Nelson","year":"1974","journal-title":"Proc. Indiana Acad. Sci."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.geoderma.2010.02.003","article-title":"A Critical Review of the Conventional SOC to SOM Conversion Factor","volume":"156","author":"Pribyl","year":"2010","journal-title":"Geoderma"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"104703","DOI":"10.1016\/j.catena.2020.104703","article-title":"Vis-SWIR Spectral Prediction Model for Soil Organic Matter with Different Grouping Strategies","volume":"195","author":"Bao","year":"2020","journal-title":"Catena"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1366\/000370210790572007","article-title":"A Graphical Method to Evaluate Spectral Preprocessing in Multivariate Regression Calibrations: Example with Savitzky\u2013Golay Filters and Partial Least Squares Regression","volume":"64","author":"Delwiche","year":"2010","journal-title":"Appl. 
Spectrosc."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1021\/ac60214a047","article-title":"Smoothing and Differentiation of Data by Simplified Least Squares Procedures","volume":"36","author":"Savitzky","year":"1964","journal-title":"Anal. Chem."},{"key":"ref_33","unstructured":"Ting, H. (2006). Study on Spectral Features of Soil Fe2O3. Geogr. Geo-Inf. Sci."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1111\/j.1469-8137.2011.03955.x","article-title":"Image Spectroscopy and Stable Isotopes Elucidate Functional Dissimilarity between Native and Nonnative Plant Species in the Aquatic Environment","volume":"193","author":"Santos","year":"2012","journal-title":"New Phytol."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1016\/j.compag.2016.03.016","article-title":"Soil Nitrogen Content Forecasting Based on Real-Time NIR Spectroscopy","volume":"124","author":"Zhang","year":"2016","journal-title":"Comput. Electron. Agric."},{"key":"ref_36","first-page":"3214","article-title":"Review of Soil Classification and Revision of China Soil Classification System","volume":"47","author":"Zhang","year":"2014","journal-title":"Sci. Agric. Sin."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Shang, X., Li, X., Morales-Esteban, A., Asencio-Cort\u00e9s, G., and Wang, Z. (2018). Data Field-Based K-Means Clustering for Spatio-Temporal Seismicity Analysis and Hazard Assessment. Remote Sens., 10.","DOI":"10.3390\/rs10030461"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"44416","DOI":"10.1109\/ACCESS.2019.2908975","article-title":"High Spatial Resolution PM2.5 Retrieval Using MODIS and Ground Observation Station Data Based on Ensemble Random Forest","volume":"7","author":"Chen","year":"2019","journal-title":"IEEE Access"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"D\u00edaz-Uriarte, R., and Alvarez de Andr\u00e9s, S. (2006). 
Gene Selection and Classification of Microarray Data Using Random Forest. BMC Bioinform., 7.","DOI":"10.1186\/1471-2105-7-3"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1002\/int.22505","article-title":"Optimal Feature Selection Based Speech Emotion Recognition Using Two-Stream Deep Convolutional Neural Network","volume":"36","author":"Mustaqeem","year":"2021","journal-title":"Int. J. Intell. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"5929","DOI":"10.1007\/s10462-020-09838-1","article-title":"A Review on the Long Short-Term Memory Model","volume":"53","author":"Mosquera","year":"2020","journal-title":"Artif. Intell. Rev."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1016\/j.still.2015.07.008","article-title":"Do We Really Need Large Spectral Libraries for Local Scale SOC Assessment with NIR Spectroscopy?","volume":"155","author":"Guerrero","year":"2016","journal-title":"Soil Tillage Res."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1671","DOI":"10.1007\/s11430-013-4808-x","article-title":"Development of a National VNIR Soil-Spectral Library for Soil Classification and Prediction of Organic Matter Concentrations","volume":"57","author":"Shi","year":"2014","journal-title":"Sci. China Earth Sci."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1007\/s42791-022-00046-2","article-title":"The Mathematical Analysis and Review of Noise in Industrial Valves","volume":"4","author":"Sotoodeh","year":"2022","journal-title":"JMST Adv."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1007\/s40010-017-0433-y","article-title":"A Research Review on Hyperspectral Data Processing and Analysis Algorithms","volume":"87","author":"Kale","year":"2017","journal-title":"Proc. Natl. Acad. Sci. India Sect. Phys. 
Sci."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.still.2017.05.008","article-title":"Two Preprocessing Techniques to Reduce Model Covariables in Soil Property Predictions by Vis-NIR Spectroscopy","volume":"172","author":"Dotto","year":"2017","journal-title":"Soil Tillage Res."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"2125","DOI":"10.1016\/j.saa.2003.11.013","article-title":"A Simple Method to Extract Spectral Parameters Using Fractional Derivative Spectrometry","volume":"60","author":"Kharintsev","year":"2004","journal-title":"Spectrochim. Acta. A. Mol. Biomol. Spectrosc."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"2663","DOI":"10.1007\/s40747-021-00637-x","article-title":"Feature Dimensionality Reduction: A Review","volume":"8","author":"Jia","year":"2022","journal-title":"Complex Intell. Syst."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Migenda, N., M\u00f6ller, R., and Schenck, W. (2021). Adaptive Dimensionality Reduction for Neural Network-Based Online Principal Component Analysis. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0248896"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/j.geoderma.2005.04.025","article-title":"Global Soil Characterization with VNIR Diffuse Reflectance Spectroscopy","volume":"132","author":"Brown","year":"2006","journal-title":"Geoderma"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"106031","DOI":"10.1016\/j.compag.2021.106031","article-title":"Predicting the Contents of Soil Salt and Major Water-Soluble Ions with Fractional-Order Derivative Spectral Indices and Variable Selection","volume":"182","author":"Lao","year":"2021","journal-title":"Comput. Electron. Agric."},{"key":"ref_52","unstructured":"Tanaka, Y., Kojima, R., Ishida, S., Yamashita, F., and Okuno, Y. (2021). Complex Network Prediction Using Deep Learning. 
arXiv."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"112353","DOI":"10.1016\/j.rse.2021.112353","article-title":"Field Spectroscopy of Canopy Nitrogen Concentration in Temperate Grasslands Using a Convolutional Neural Network","volume":"257","author":"Pullanagari","year":"2021","journal-title":"Remote Sens. Environ."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/3\/565\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T13:52:46Z","timestamp":1760104366000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/3\/565"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,31]]},"references-count":53,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2024,2]]}},"alternative-id":["rs16030565"],"URL":"https:\/\/doi.org\/10.3390\/rs16030565","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,31]]}}}