{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T21:13:21Z","timestamp":1775250801634,"version":"3.50.1"},"reference-count":62,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,8,19]],"date-time":"2023-08-19T00:00:00Z","timestamp":1692403200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,8,19]],"date-time":"2023-08-19T00:00:00Z","timestamp":1692403200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003093","name":"Ministry of Higher Education Malaysia","doi-asserted-by":"crossref","award":["FRGS\/1\/2022\/STG06\/USM\/02\/13"],"award-info":[{"award-number":["FRGS\/1\/2022\/STG06\/USM\/02\/13"]}],"id":[{"id":"10.13039\/501100003093","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Precision farming (PF) plays a crucial role in the field of agriculture to solve the challenges of food shortages in society. Heterogeneity, multicollinearity, and outliers are problems in PF because they can cause bias and lead to incorrect inferences. However, traditional methods typically assume it to be a homogenous model, and in machine learning, data scientists ignore heterogeneity. In this study, the aim is to identify the heterogeneity parameters and develop hybrid models before and after heterogeneity. Data on seaweed is collected using sensor smart farming technology attached to v-Groove Hybrid Solar Drier (v-GHSD). There are 29 drying parameters, and each parameter has 1914 observations. We considered the highest order up to the second order interaction, and the parameters increased to 435 parameters from 29 parameters. In high-dimensional data, the number of observations is less than the number of parameters. The authors proposed a method using the variance inflation factor to identify the heterogeneity parameters. Seven predictive models such as ridge, random forest, support vector machine, bagging, boosting, LASSO and elastic net are used to select the 15, 25, 35 and 45 significant drying parameters for the moisture content removal of the seaweed, and hybrid models are developed using robust statistical methods. For before heterogeneity, the hybrid model random forest M Hampel with 19 outliers is the best, because it performs better when compared to other models. For after heterogeneity, the hybrid model boosting M Hampel with 19 outliers is the best, because it performs better when compared to other models. These results are vital to seaweed precision farming. The study of heterogeneity will not only help us to comprehend the dynamics of the large number of the drying parameters, but also gives a way to leverage the data for efficient predictive modelling.<\/jats:p>","DOI":"10.1186\/s40537-023-00810-8","type":"journal-article","created":{"date-parts":[[2023,8,19]],"date-time":"2023-08-19T12:01:29Z","timestamp":1692446489000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Detecting heterogeneity parameters and hybrid models for precision farming"],"prefix":"10.1186","volume":"10","author":[{"given":"Olayemi Joshua","family":"Ibidoja","sequence":"first","affiliation":[]},{"given":"Fam Pei","family":"Shan","sequence":"additional","affiliation":[]},{"given":"Jumat","family":"Sulaiman","sequence":"additional","affiliation":[]},{"given":"Majid Khan Majahar","family":"Ali","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,8,19]]},"reference":[{"key":"810_CR1","doi-asserted-by":"publisher","DOI":"10.1016\/j.dajour.2022.100041","volume":"3","author":"SKS Durai","year":"2022","unstructured":"Durai SKS, Shamili MD. Smart farming using machine learning and deep learning techniques. Decis Anal J. 2022;3: 100041.","journal-title":"Decis Anal J"},{"key":"810_CR2","doi-asserted-by":"publisher","unstructured":"Moysiadis V, Sarigiannidis P, Vitsas V, Khelifi A. Smart Farming in Europe. Computer Science Review, 2021;39. https:\/\/doi.org\/10.1016\/j.cosrev.2020.100345.","DOI":"10.1016\/j.cosrev.2020.100345"},{"key":"810_CR3","doi-asserted-by":"publisher","DOI":"10.1016\/j.njas.2019.100315","author":"L Klerkx","year":"2019","unstructured":"Klerkx L, Jakku E, Labarthe P. A review of social science on digital agriculture, smart farming and agriculture 4.0: new contributions and a future research agenda. NJAS Wageningen J Life Sci. 2019;90\u201391. https:\/\/doi.org\/10.1016\/j.njas.2019.100315.","journal-title":"NJAS Wageningen J Life Sci."},{"key":"810_CR4","doi-asserted-by":"publisher","DOI":"10.3389\/fsufs.2018.00087","author":"DC Rose","year":"2018","unstructured":"Rose DC, Chilvers J. Agriculture 4.0: broadening responsible innovation in an era of smart farming. Front Sustain Food Syst. 2018. https:\/\/doi.org\/10.3389\/fsufs.2018.00087.","journal-title":"Front Sustain Food Syst."},{"key":"810_CR5","doi-asserted-by":"publisher","DOI":"10.3390\/agronomy10050743","author":"AT Balafoutis","year":"2020","unstructured":"Balafoutis AT, van Evert FK, Fountas S. Smart farming technology trends: Economic and environmental effects, labor impact, and adoption readiness. Agronomy. 2020;10(5). https:\/\/doi.org\/10.3390\/agronomy10050743.","journal-title":"Agronomy"},{"key":"810_CR6","doi-asserted-by":"publisher","first-page":"4843","DOI":"10.1109\/ACCESS.2020.3048415","volume":"9","author":"A Sharma","year":"2021","unstructured":"Sharma A, Jain A, Gupta P, Chowdary V. Machine learning applications for precision agriculture: a comprehensive review. IEEE Access. 2021;9:4843\u201373.","journal-title":"IEEE Access."},{"key":"810_CR7","unstructured":"National Oceanic and Atmospheric Administration. What is seaweed? National Ocean Service. 2017. https:\/\/oceanservice.noaa.gov\/facts\/seaweed.html#:~:text=%22Seaweed%22%20is%20the%20common%20name,Marine%20Sanctuary%20and%20National%20Park."},{"key":"810_CR8","unstructured":"Guiry MD. What are seaweeds? The Seaweed Site. 2014. https:\/\/www.seaweed.ie\/algae\/seaweeds.php."},{"issue":"SpecialIssue 1","key":"810_CR9","first-page":"1","volume":"37","author":"S Suwati","year":"2021","unstructured":"Suwati S, Romansyah E, Syarifudin S, Jani Y, Purnomo AH, Damat D, et al. Comparison between natural and cabinet drying on weight loss of seaweed Euchuema cottonii Weber-van Bosse. Sarhad J Agric. 2021;37(SpecialIssue 1):1\u20138.","journal-title":"Sarhad J Agric."},{"issue":"4","key":"810_CR10","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1080\/09670262.2017.1365175","volume":"52","author":"AH Buschmann","year":"2017","unstructured":"Buschmann AH, Camus C, Infante J, Neori A, Israel \u00c1, Hern\u00e1ndez-Gonz\u00e1lez MC, et al. Seaweed production: overview of the global state of exploitation, farming and emerging research activity. Eur J Phycol. 2017;52(4):391\u2013406.","journal-title":"Eur J Phycol"},{"key":"810_CR11","doi-asserted-by":"publisher","DOI":"10.1088\/1755-1315\/292\/1\/012070","author":"GB Pradana","year":"2019","unstructured":"Pradana GB, Prabowo KB, Hastuti RP, Djaeni M, Prasetyaningrum A. Seaweed drying process using tray dryer with dehumidified air system to increase efficiency of energy and quality product. IOP Conf Ser Earth Environ Sci. 2019. https:\/\/doi.org\/10.1088\/1755-1315\/292\/1\/012070.","journal-title":"IOP Conf Ser Earth Environ Sci."},{"key":"810_CR12","first-page":"159","volume":"11","author":"MKM Ali","year":"2017","unstructured":"Ali MKM, Sulaiman J, Md Yasir S, Ruslan M. Cubic spline as a powerful tools for processing experimental drying rate data of seaweed using solar drier. Malay J Math Sci. 2017;11:159\u201372.","journal-title":"Malay J Math Sci"},{"issue":"27","key":"810_CR13","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/j.algal.2017.07.025","volume":"1","author":"R van Oirschot","year":"2017","unstructured":"van Oirschot R, Thomas JBE, Gr\u00f6ndahl F, Fortuin KPJ, Brandenburg W, Potting J. Explorative environmental life cycle assessment for system design of seaweed cultivation and drying. Algal Res. 2017;1(27):43\u201354.","journal-title":"Algal Res"},{"issue":"12","key":"810_CR14","doi-asserted-by":"publisher","first-page":"1542","DOI":"10.1080\/07373937.2019.1686476","volume":"38","author":"HW Xiao","year":"2020","unstructured":"Xiao HW, Mujumdar AS. Importance of drying in support of human welfare. Drying Technol. 2020;38(12):1542\u20133.","journal-title":"Drying Technol"},{"key":"810_CR15","doi-asserted-by":"publisher","DOI":"10.1051\/matecconf\/201815605007","author":"S Suherman","year":"2018","unstructured":"Suherman S, Djaeni M, Kumoro AC, Prabowo RA, Rahayu S, Khasanah S. Comparison drying behavior of seaweed in solar, sun and oven tray dryers. MATEC Web Conf. 2018. https:\/\/doi.org\/10.1051\/matecconf\/201815605007.","journal-title":"MATEC Web Conf."},{"key":"810_CR16","doi-asserted-by":"crossref","unstructured":"Ali MKM, Fudholi A, Sulaiman J, Muthuvalu MS, Ruslan MH, Yasir SMd, et al. Post-harvest handling of eucheumatoid seaweeds. In: Tropical seaweed farming trends, problems and opportunities. Springer International Publishing, Cham; 2017. p. 131\u201345.","DOI":"10.1007\/978-3-319-63498-2_8"},{"key":"810_CR17","unstructured":"Ali MKM, Sulaiman J, Md Yasir S, Ruslan M. Cubic Spline as a Powerful Tools for Processing Experimental Drying Rate Data of Seaweed Using Solar Drier. Malaysian Journal of Mathematical Sciences, 2017;11:159\u2013172."},{"key":"810_CR18","doi-asserted-by":"publisher","DOI":"10.1016\/j.csite.2020.100745","author":"P Nimnuan","year":"2020","unstructured":"Nimnuan P, Nabnean S. Experimental and simulated investigations of the performance of the solar greenhouse dryer for drying cassumunar ginger (Zingiber cassumunar Roxb.). Case Stud Thermal Eng. 2020;22. https:\/\/doi.org\/10.1016\/j.csite.2020.100745.","journal-title":"Case Stud Thermal Eng."},{"key":"810_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/j.renene.2017.12.053","author":"DVN Lakshmi","year":"2018","unstructured":"Lakshmi DVN, Muthukumar P, Layek A, Nayak PK. Drying kinetics and quality analysis of black turmeric (Curcuma caesia) drying in a mixed mode forced convection solar dryer integrated with thermal energy storage. Renew Energy. 2018;120. https:\/\/doi.org\/10.1016\/j.renene.2017.12.053.","journal-title":"Renew Energy"},{"key":"810_CR20","doi-asserted-by":"publisher","DOI":"10.1080\/15435075.2020.1779074","author":"P Pankaew","year":"2020","unstructured":"Pankaew P, Aumporn O, Janjai S, Pattarapanitchai S, Sangsan M, Bala BK. Performance of a large-scale greenhouse solar dryer integrated with phase change material thermal storage system for drying of chili. Int J Green Energy. 2020;17(11). https:\/\/doi.org\/10.1080\/15435075.2020.1779074.","journal-title":"Int J Green Energy"},{"key":"810_CR21","doi-asserted-by":"publisher","DOI":"10.1016\/j.renene.2019.08.066","author":"S Vijayan","year":"2020","unstructured":"Vijayan S, Arjunan TV, Kumar A. Exergo-environmental analysis of an indirect forced convection solar dryer for drying bitter gourd slices. Renew Energy. 2020;146. https:\/\/doi.org\/10.1016\/j.renene.2019.08.066.","journal-title":"Renew Energy"},{"key":"810_CR22","doi-asserted-by":"publisher","DOI":"10.3390\/en13020350","author":"W Hao","year":"2020","unstructured":"Hao W, Liu S, Mi B, Lai Y. Mathematical modeling and performance analysis of a new hybrid solar dryer of lemon slices for controlling drying temperature. Energies (Basel). 2020;13(2). https:\/\/doi.org\/10.3390\/en13020350.","journal-title":"Energies (Basel)."},{"key":"810_CR23","doi-asserted-by":"publisher","DOI":"10.1016\/j.csite.2020.100787","author":"S Nabnean","year":"2020","unstructured":"Nabnean S, Nimnuan P. Experimental performance of direct forced convection household solar dryer for drying banana. Case Stud Thermal Eng. 2020;22. https:\/\/doi.org\/10.1016\/j.csite.2020.100787.","journal-title":"Case Stud Thermal Eng."},{"key":"810_CR24","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-017-0077-4","author":"J Majumdar","year":"2017","unstructured":"Majumdar J, Naraseeyappa S, Ankalaki S. Analysis of agriculture data using data mining techniques: application of big data. J Big Data. 2017;4(1). https:\/\/doi.org\/10.1186\/s40537-017-0077-4.","journal-title":"J Big Data."},{"key":"810_CR25","doi-asserted-by":"publisher","DOI":"10.1007\/s10811-020-02117-0","author":"MKM Ali","year":"2020","unstructured":"Ali MKM, Critchley AT, Hurtado AQ. The impacts of AMPEP K+ (Ascophyllum marine plant extract, enhanced with potassium) on the growth rate, carrageenan quality, and percentage incidence of the damaging epiphyte Neosiphonia apiculata on four strains of the commercially important carrageenophyte Kappaphycus, as developed by micropropagation techniques. J Appl Phycol. 2020;32(3). https:\/\/doi.org\/10.1007\/s10811-020-02117-0.","journal-title":"J Appl Phycol."},{"issue":"4","key":"810_CR26","doi-asserted-by":"publisher","first-page":"1179","DOI":"10.47836\/pjst.28.4.04","volume":"28","author":"HY Lim","year":"2020","unstructured":"Lim HY, Fam PS, Javaid A, Ali MKM. Ridge regression as efficient model selection and forecasting of fish drying using v-groove hybrid solar drier. Pertanika J Sci Technol. 2020;28(4):1179\u2013202.","journal-title":"Pertanika J Sci Technol"},{"issue":"4","key":"810_CR27","first-page":"4457","volume":"12","author":"MKM Majahar Ali","year":"2022","unstructured":"Majahar Ali MKM, Tahir Ismail M, Hamundu FM, Akhtar NA, et al. Hybrid model in machine learning\u2013robust regression applied for sustainability agriculture and food security. Int J Electric Comput Eng. 2022;12(4):4457\u201368.","journal-title":"Int J Electric Comput Eng."},{"key":"810_CR28","doi-asserted-by":"publisher","unstructured":"El-Din AMG, Senousy MB. A Solution for Handling Big Data Heterogeneity Problem. In: Lecture Notes in Networks and Systems. Springer, Singapore. 2022;224.\nhttps:\/\/doi.org\/10.1007\/978-981-16-2275-5_11.","DOI":"10.1007\/978-981-16-2275-5_11"},{"key":"810_CR29","doi-asserted-by":"publisher","DOI":"10.1108\/JADEE-03-2022-0044","author":"P Gouraram","year":"2022","unstructured":"Gouraram P, Goyari P, Paltasingh KR. Rice ecosystem heterogeneity and determinants of climate risk adaptation in Indian agriculture: farm-level evidence. J Agribus Dev Emerg Econ. 2022. https:\/\/doi.org\/10.1108\/JADEE-03-2022-0044.","journal-title":"J Agribus Dev Emerg Econ."},{"issue":"78","key":"810_CR30","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1016\/j.landusepol.2018.05.026","volume":"1","author":"Y Kanchanaroek","year":"2018","unstructured":"Kanchanaroek Y, Aslam U. Policy schemes for the transition to sustainable agriculture\u2014farmer preferences and spatial heterogeneity in northern Thailand. Land Use Policy. 2018;1(78):227\u201335.","journal-title":"Land Use Policy"},{"issue":"12","key":"810_CR31","doi-asserted-by":"publisher","first-page":"3779","DOI":"10.1007\/s11269-020-02630-4","volume":"34","author":"A Srivastava","year":"2020","unstructured":"Srivastava A, Kumari N, Maza M. Hydrological response to agricultural land use heterogeneity using variable infiltration capacity model. Water Resour Manage. 2020;34(12):3779\u201394.","journal-title":"Water Resour Manage"},{"issue":"8","key":"810_CR32","doi-asserted-by":"publisher","first-page":"1078","DOI":"10.3390\/agriculture12081078","volume":"12","author":"K Li","year":"2022","unstructured":"Li K, Liu J, Xue Y, Rahman S, Sriboonchitta S. Consequences of ignoring dependent error components and heterogeneity in a stochastic frontier model: an application to rice producers in northern Thailand. Agriculture. 2022;12(8):1078.","journal-title":"Agriculture"},{"key":"810_CR33","first-page":"322","volume":"1","author":"J Botzas-Coluni","year":"2021","unstructured":"Botzas-Coluni J, Crockett ETH, Rieb JT, Bennett EM. Farmland heterogeneity is associated with gains in some ecosystem services but also potential trade-offs. Agric Ecosyst Environ. 2021;1:322.","journal-title":"Agric Ecosyst Environ"},{"key":"810_CR34","doi-asserted-by":"crossref","unstructured":"Keane M, Neal T. Climate change and U.S. agriculture: accounting for multi-dimensional slope heterogeneity in production functions. Quantitative Economics, 2000;11:1391\u20131429","DOI":"10.3982\/QE1319"},{"key":"810_CR35","first-page":"24","volume":"1","author":"J Liao","year":"2020","unstructured":"Liao J, Liao T, He X, Zhang T, Li D, Luo X, et al. The effects of agricultural landscape composition and heterogeneity on bird diversity and community structure in the Chengdu Plain. China Glob Ecol Conserv. 2020;1:24.","journal-title":"China Glob Ecol Conserv"},{"key":"810_CR36","doi-asserted-by":"publisher","DOI":"10.3390\/electronics9050761","author":"F Drobni\u010d","year":"2020","unstructured":"Drobni\u010d F, Kos A, Pusti\u0161ek M. On the interpretability of machine learning models and experimental feature selection in case of multicollinear data. Electronics (Switzerland). 2020;9(5) https:\/\/doi.org\/10.3390\/electronics9050761.","journal-title":"Electronics (Switzerland)."},{"issue":"9","key":"810_CR37","first-page":"409","volume":"6","author":"\u00d6G Alma","year":"2011","unstructured":"Alma \u00d6G. Comparison of robust regression methods in linear regression. Int J Contemp Math Sci. 2011;6(9):409\u201321.","journal-title":"Int J Contemp Math Sci"},{"issue":"1","key":"810_CR38","first-page":"193","volume":"28","author":"A Javaid","year":"2020","unstructured":"Javaid A, Ismail MT, Ali MKM. Efficient model selection of collector efficiency in solar dryer using hybrid of LASSO and robust regression. Pertanika J Sci Technol. 2020;28(1):193\u2013210.","journal-title":"Pertanika J Sci Technol"},{"issue":"11","key":"810_CR39","first-page":"55","volume":"9","author":"AE Mohamed","year":"2018","unstructured":"Mohamed AE, Almongy HM, Mohamed AH. Comparison between M-estimation, S-estimation, and MM estimation methods of robust estimation with application and simulation. Int J Math Arch. 2018;9(11):55.","journal-title":"Int J Math Arch"},{"issue":"4","key":"810_CR40","doi-asserted-by":"publisher","first-page":"547","DOI":"10.18280\/mmep.080407","volume":"8","author":"MKM Mukhtar Ali","year":"2021","unstructured":"Mukhtar Ali MKM, Javaid A, Ismail MT, Fudholi A. Accurate and hybrid regularization\u2014robust regression model in handling multicollinearity and outlier using 8SC for big data. Math Model Eng Probl. 2021;8(4):547\u201356.","journal-title":"Math Model Eng Probl."},{"issue":"2","key":"810_CR41","doi-asserted-by":"publisher","first-page":"125","DOI":"10.33330\/jurteksi.v8i2.1514","volume":"8","author":"IMS Wijaya","year":"2022","unstructured":"Wijaya IMS, Sari DI. Quality control of optical fiber disruption with big data using the six sigma method. JURTEKSI (J Teknol Sist Inform). 2022;8(2):125\u201332.","journal-title":"JURTEKSI (J Teknol Sist Inform)"},{"issue":"2","key":"810_CR42","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1093\/rfs\/hht047","volume":"27","author":"TA Gormley","year":"2014","unstructured":"Gormley TA, Matsa DA. Common errors: how to (and not to) control for unobserved heterogeneity. Rev Financ Stud. 2014;27(2):617\u201361.","journal-title":"Rev Financ Stud"},{"key":"810_CR43","first-page":"268","volume":"5","author":"J Cheng","year":"2022","unstructured":"Cheng J, Sun J, Yao K, Xu M, Cao Y. A variable selection method based on mutual information and variance inflation factor. Spectrochim Acta A Mol Biomol Spectrosc. 2022;5:268.","journal-title":"Spectrochim Acta A Mol Biomol Spectrosc"},{"issue":"3","key":"810_CR44","doi-asserted-by":"publisher","first-page":"669","DOI":"10.1016\/j.ijforecast.2015.12.003","volume":"32","author":"S Kim","year":"2016","unstructured":"Kim S, Kim H. A new metric of absolute percentage error for intermittent demand forecasts. Int J Forecast. 2016;32(3):669\u201379.","journal-title":"Int J Forecast"},{"key":"810_CR45","doi-asserted-by":"publisher","first-page":"1","DOI":"10.7717\/peerj-cs.623","volume":"7","author":"D Chicco","year":"2021","unstructured":"Chicco D, Warrens MJ, Jurman G. The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput Sci. 2021;7:1\u201324.","journal-title":"PeerJ Comput Sci"},{"issue":"221","key":"810_CR46","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1016\/j.jclepro.2019.02.211","volume":"1","author":"SG Gouda","year":"2019","unstructured":"Gouda SG, Hussein Z, Luo S, Yuan Q. Model selection for accurate daily global solar radiation prediction in China. J Clean Prod. 2019;1(221):132\u201344.","journal-title":"J Clean Prod"},{"key":"810_CR47","unstructured":"Stridbeck R, Zhang L, Han K. How to analyze change from baseline: absolute or percentage change? D-level Essay in Statistics. 2009;1\u201318."},{"key":"810_CR48","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-1-6","author":"AJ Vickers","year":"2001","unstructured":"Vickers AJ. The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study. BMC Med Res Methodol. 2001. https:\/\/doi.org\/10.1186\/1471-2288-1-6.","journal-title":"BMC Med Res Methodol."},{"issue":"4","key":"810_CR49","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1016\/j.orcp.2007.07.005","volume":"1","author":"OA Waleekhachonloet","year":"2007","unstructured":"Waleekhachonloet OA, Limwattananon C, Limwattananon S, Gross CR. Group behavior therapy versus individual behavior therapy for healthy dieting and weight control management in overweight and obese women living in rural community. Obes Res Clin Pract. 2007;1(4):223\u201332.","journal-title":"Obes Res Clin Pract"},{"issue":"3","key":"810_CR50","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1016\/j.orcp.2007.04.001","volume":"1","author":"M Neovius","year":"2007","unstructured":"Neovius M, R\u00f6ssner S. Results from a randomized controlled trial comparing two low-calorie diet formulae. Obes Res Clin Pract. 2007;1(3):165\u201371.","journal-title":"Obes Res Clin Pract"},{"key":"810_CR51","doi-asserted-by":"publisher","DOI":"10.1016\/j.numecd.2009.01.010","author":"MK Kim","year":"2009","unstructured":"Kim MK, Tanaka K, Kim MJ, Matuso T, Endo T, Tomita T, et al. Comparison of epicardial, abdominal and regional fat compartments in response to weight loss. Nutr Metab Cardiovasc Dis. 2009;19(11). https:\/\/doi.org\/10.1016\/j.numecd.2009.01.010.","journal-title":"Nutr Metab Cardiovasc Dis."},{"key":"810_CR52","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4780081002","author":"L Kaiser","year":"1989","unstructured":"Kaiser L. Adjusting for baseline: change or percentage change? Stat Med. 1989. https:\/\/doi.org\/10.1002\/sim.4780081002.","journal-title":"Stat Med"},{"issue":"1","key":"810_CR53","first-page":"43","volume":"39","author":"L T\u00f6rnqvist","year":"1985","unstructured":"T\u00f6rnqvist L, Vartia P, Vartia YO. How should relative changes be measured? Am Stat. 1985;39(1):43\u20136.","journal-title":"Am Stat"},{"issue":"1","key":"810_CR54","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-020-00327-4","volume":"7","author":"RC Chen","year":"2020","unstructured":"Chen RC, Dewi C, Huang SW, Caraka RE. Selecting critical features for data classification based on machine learning methods. J Big Data. 2020;7(1):1\u201326.","journal-title":"J Big Data"},{"key":"810_CR55","unstructured":"Han Y. Stable feature selection: theory and algorithms. State University of New York at Binghamton. 2012."},{"key":"810_CR56","doi-asserted-by":"crossref","unstructured":"Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, et al. Feature selection: a data perspective. ACM computing surveys (CSUR). 2017;50(6):1\u201345.","DOI":"10.1145\/3136625"},{"key":"810_CR57","unstructured":"Gupta C. Feature selection and analysis for standard machine learning classification of audio beehive samples. (Doctoral dissertation, Utah State University). 2019."},{"key":"810_CR58","unstructured":"Ali MKM, Mukhtar, Ismail MT, Ferdinand MH, Alimuddin. Machine learning-based variable selection: An evaluation of Bagging and Boosting. Turk J Comput Math Educ. 2021;12(13):4343\u20139."},{"key":"810_CR59","first-page":"186","volume":"1","author":"GW Roell","year":"2022","unstructured":"Roell GW, Sathish A, Wan N, Cheng Q, Wen Z, Tang YJ, et al. A comparative evaluation of machine learning algorithms for predicting syngas fermentation outcomes. Biochem Eng J. 2022;1:186.","journal-title":"Biochem Eng J"},{"key":"810_CR60","doi-asserted-by":"publisher","DOI":"10.3390\/rs14030574","author":"T Adugna","year":"2022","unstructured":"Adugna T, Xu W, Fan J. Comparison of random forest and support vector machine classifiers for regional land cover mapping using coarse resolution FY-3C images. Remote Sens (Basel). 2022;14(3). https:\/\/doi.org\/10.3390\/rs14030574.","journal-title":"Remote Sens (Basel)."},{"key":"810_CR61","doi-asserted-by":"crossref","unstructured":"Sumari ADW, Charlinawati DS, Ariyanto Y. A simple approach using statistical-based machine learning to predict the weapon system operational readiness. In: The 1st International Conference on Data Science and Official Statistics. 2021. p. 343\u201351.","DOI":"10.34123\/icdsos.v2021i1.58"},{"issue":"11","key":"810_CR62","doi-asserted-by":"publisher","first-page":"1156","DOI":"10.1016\/j.idairyj.2004.12.005","volume":"15","author":"SA Jimenez-Marquez","year":"2005","unstructured":"Jimenez-Marquez SA, Thibault J, Lacroix C. Prediction of moisture in cheese of commercial production using neural networks. Int Dairy J. 2005;15(11):1156\u201374.","journal-title":"Int Dairy J"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00810-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-023-00810-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00810-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T22:34:00Z","timestamp":1700519640000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-023-00810-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,19]]},"references-count":62,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["810"],"URL":"https:\/\/doi.org\/10.1186\/s40537-023-00810-8","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,19]]},"assertion":[{"value":"12 November 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 August 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"130"}}