{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T19:55:55Z","timestamp":1770494155920,"version":"3.49.0"},"reference-count":30,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2024,10,22]],"date-time":"2024-10-22T00:00:00Z","timestamp":1729555200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>The large amount of available data, generated every second via sensors, social networks, organizations, and so on, has generated new lines of research that involve novel methods, techniques, resources, and\/or technologies. The development of big data systems (BDSs) can be approached from different perspectives, all of them useful, depending on the objectives pursued. In particular, in this work, we address BDSs in the area of software engineering, contributing to the generation of novel methodologies and techniques for software reuse. In this article, we propose a methodology to develop reusable BDSs by mirroring activities from software product line engineering. This means that the process of building BDSs is approached by analyzing the variety of domain features and modeling them as a family of related assets. The contextual perspective of the proposal, along with its supporting tool, is introduced through a case study in the agrometeorology domain. The characterization of variables for frost analysis exemplifies the importance of identifying variety, as well as the possibility of reusing previous analyses adjusted to the profile of each case. In addition to showing interesting findings from the case, we also exemplify our concept of context variety, which is a core element in modeling reusable BDSs.<\/jats:p>","DOI":"10.3390\/info15110661","type":"journal-article","created":{"date-parts":[[2024,10,22]],"date-time":"2024-10-22T04:10:14Z","timestamp":1729570214000},"page":"661","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["A Context-Based Perspective on Frost Analysis in Reuse-Oriented Big Data-System Developments"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8516-7453","authenticated-orcid":false,"given":"Agustina","family":"Buccella","sequence":"first","affiliation":[{"name":"GIISCO Research Group, Departamento de Ingenier\u00eda de Sistemas, Facultad de Inform\u00e1tica, Universidad Nacional del Comahue, Neuquen 8300, Argentina"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4804-6270","authenticated-orcid":false,"given":"Alejandra","family":"Cechich","sequence":"additional","affiliation":[{"name":"GIISCO Research Group, Departamento de Ingenier\u00eda de Sistemas, Facultad de Inform\u00e1tica, Universidad Nacional del Comahue, Neuquen 8300, Argentina"}]},{"given":"Federico","family":"Saurin","sequence":"additional","affiliation":[{"name":"GIISCO Research Group, Departamento de Ingenier\u00eda de Sistemas, Facultad de Inform\u00e1tica, Universidad Nacional del Comahue, Neuquen 8300, Argentina"}]},{"given":"Ayel\u00e9n","family":"Montenegro","sequence":"additional","affiliation":[{"name":"Instituto Nacional de Tecnolog\u00eda Agropecuaria (INTA), Alto Valle de R\u00edo Negro y Neuqu\u00e9n, Allen 8328, Argentina"}]},{"given":"Andrea","family":"Rodr\u00edguez","sequence":"additional","affiliation":[{"name":"Instituto Nacional de Tecnolog\u00eda Agropecuaria (INTA), Alto Valle de R\u00edo Negro y Neuqu\u00e9n, Allen 8328, Argentina"}]},{"given":"Angel","family":"Mu\u00f1oz","sequence":"additional","affiliation":[{"name":"Instituto Nacional de Tecnolog\u00eda Agropecuaria (INTA), Alto Valle de R\u00edo Negro y Neuqu\u00e9n, Allen 8328, Argentina"}]}],"member":"1968","published-online":{"date-parts":[[2024,10,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/52.914734","article-title":"Tactical approaches for alleviating distance in global software development","volume":"18","author":"Carmel","year":"2001","journal-title":"IEEE Softw."},{"key":"ref_2","unstructured":"Clements, P.C., and Northrop, L. (2001). Software Product Lines: Practices and Patterns, Addison-Wesley Longman Publishing Co., Inc."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Pohl, K., B\u00f6ckle, G., and Linden, F.J.v.d. (2005). Software Product Line Engineering: Foundations, Principles and Techniques, Springer.","DOI":"10.1007\/3-540-28901-1"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"van der Linden, F., Schmid, K., and Rommes, E. (2007). Software Product Lines in Action: The Best Industrial Practice in Product Line Engineering, Springer.","DOI":"10.1007\/978-3-540-71437-8"},{"key":"ref_5","unstructured":"K\u00e4k\u00f6l\u00e4, T. (November, January 31). ISO Initiatives on Software Product Line Engineering: Vision and Current Status\u2014Invited Talk for Variability. Proceedings of the ER Workshops, Brussels, Belgium."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"8","DOI":"10.5334\/dsj-2017-008","article-title":"On the Reuse of Scientific Data","volume":"16","author":"Pasquetto","year":"2017","journal-title":"Data Sci. J."},{"key":"ref_7","first-page":"4","article-title":"Big data and data reuse: A taxonomy of data reuse for balancing big data benefits and personal data protection","volume":"6","author":"Custers","year":"2016","journal-title":"Int. Data Priv. Law"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Borrison, R., Kl\u00f6pper, B., Chioua, M., Dix, M., and Sprick, B. (2018, January 21\u201323). Reusable Big Data System for Industrial Data Mining\u2014A Case Study on Anomaly Detection in Chemical Plants. Proceedings of the Intelligent Data Engineering and Automated Learning\u2014IDEAL 2018, Madrid, Spain.","DOI":"10.1007\/978-3-030-03493-1_64"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Xie, Z., Chen, Y., Speer, J., Walters, T., Tarazaga, P.A., and Kasarda, M. (2015, January 21\u201325). Towards Use And Reuse Driven Big Data Management. Proceedings of the 15th ACM\/IEEE-CS Joint Conference on Digital Libraries, Knoxville, TN, USA.","DOI":"10.1145\/2756406.2756924"},{"key":"ref_10","unstructured":"Klein, J. (2021, June 09). Reference Architectures for Big Data Systems, Carnegie Mellon University\u2019s Software Engineering Institute Blog. Available online: http:\/\/insights.sei.cmu.edu\/blog\/reference-architectures-for-big-data-systems\/."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mavi, H., and Tupper, G. (2004). Agrometeorology: Principles and Applications of Climate Studies in Agriculture, CRC Press. [1st ed.].","DOI":"10.1201\/9781482277999"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"100158","DOI":"10.1016\/j.array.2022.100158","article-title":"Minute-wise frost prediction: An approach of recurrent neural networks","volume":"14","author":"Zhou","year":"2022","journal-title":"Array"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"100176","DOI":"10.1016\/j.wace.2018.06.003","article-title":"Modelling frost generates insights for managing risk of minimum temperature extremes","volume":"27","author":"Gobbett","year":"2020","journal-title":"Weather. Clim. Extrem."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"4589","DOI":"10.1109\/JIOT.2018.2867333","article-title":"Prediction of Frost Events Using Machine Learning and IoT Sensing Devices","volume":"5","author":"Diedrichs","year":"2018","journal-title":"IEEE Internet Things J."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Talsma, C.J., Solander, K.C., Mudunuru, M.K., Crawford, B., and Powell, M.R. (2023). Frost prediction using machine learning and deep neural network models. Front. Artif. Intell., 5.","DOI":"10.3389\/frai.2022.963781"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/MS.2022.3199621","article-title":"AI Engineering: Realizing the Potential of AI","volume":"39","author":"Bosch","year":"2022","journal-title":"IEEE Softw."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Osycka, L., Cechich, A., Buccella, A., Montenegro, A., and Mu\u00f1oz, A. (2023, January 27\u201329). CoVaMaT: Functionality for Variety Reuse through a Supporting Tool. Proceedings of the XI Conference on Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET), La Plata, Argentina.","DOI":"10.1007\/978-3-031-40942-4_5"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"e13","DOI":"10.24215\/16666038.23.e13","article-title":"A Model of Reusable Assets in AIE Software Systems","volume":"23","author":"Buccella","year":"2023","journal-title":"J. Comput. Sci. Technol."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.jhydrol.2003.11.029","article-title":"Relation between climate variability and groundwater levels in the upper carbonate aquifer, southern Manitoba, Canada","volume":"290","author":"Chen","year":"2003","journal-title":"Hydrology"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Delle Rose, M., and Martano, P. (2023). Datasets of Groundwater Level and Surface Water Budget in a Central Mediterranean Site (21 June 2017\u20131 October 2022). Data, 8.","DOI":"10.3390\/data8020038"},{"key":"ref_21","unstructured":"Verdes, P.F., Granitto, P.M., Navone, H.D., and Ceccatto, H.A. (2000, January 4\u20138). Frost prediction with machine learning techniques. Proceedings of the VI Congreso Argentino de Ciencias de la Computacion, Buenos Aires, Argentina."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1016\/j.procs.2019.09.267","article-title":"Frost Forecast using Machine Learning\u2014From association to causality","volume":"159","author":"Shibuya","year":"2019","journal-title":"Procedia Comput. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"2264","DOI":"10.1016\/j.procs.2020.09.285","article-title":"Modelling and learning cause-effect\u2014Application in frost forecast","volume":"176","author":"Ding","year":"2020","journal-title":"Procedia Comput. Sci."},{"key":"ref_24","first-page":"101971","article-title":"Statistical characterization of frost zones: Case of tea freeze damage in the Kenyan highlands","volume":"84","author":"Kotikot","year":"2020","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_25","first-page":"36","article-title":"A Comparison of the Pearson, Spearman Rank and Kendall Tau Correlation Coefficients Using Quantitative Variables","volume":"20","author":"Shiekh","year":"2022","journal-title":"Asian J. Probab. Stat."},{"key":"ref_26","first-page":"25","article-title":"Calculating and Interpreting Percentage Changes for Economic Analysis","volume":"1","author":"Anderson","year":"2019","journal-title":"Appl. Econ. Teach. Resour. (Aetr) Agric. Appl. Econ. Assoc."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1016\/j.jhydrol.2015.01.007","article-title":"Determining the importance of model calibration for forecasting absolute\/relative changes in streamflow from LULC and climate changes","volume":"522","author":"Niraula","year":"2015","journal-title":"J. Hydrol."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Buccella, A., Cechich, A., Porfiri, J., and Diniz Dos Santos, D. (2019). Taxonomy-Oriented Domain Analysis of GIS: A Case Study for Paleontological Software Systems. ISPRS Int. J. -Geo-Inf., 8.","DOI":"10.3390\/ijgi8060270"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Buccella, A., Pol\u2019la, M., and Cechich, A. (2022). Improving Variabilty Analysis through Scenario-Based Incompatibility Detection. Information, 13.","DOI":"10.3390\/info13030149"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1109\/TSE.2004.50","article-title":"Evaluating software reuse alternatives: A model and its application to an industrial case study","volume":"30","author":"Tomer","year":"2004","journal-title":"IEEE Trans. Softw. Eng."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/15\/11\/661\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:17:46Z","timestamp":1760113066000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/15\/11\/661"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,22]]},"references-count":30,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2024,11]]}},"alternative-id":["info15110661"],"URL":"https:\/\/doi.org\/10.3390\/info15110661","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,22]]}}}