{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,14]],"date-time":"2025-05-14T04:50:17Z","timestamp":1747198217649,"version":"3.40.5"},"reference-count":27,"publisher":"Walter de Gruyter GmbH","issue":"4","license":[{"start":{"date-parts":[[2021,7,15]],"date-time":"2021-07-15T00:00:00Z","timestamp":1626307200000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002946","name":"Deutsches Zentrum f\u00fcr Luft- und Raumfahrt","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002946","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,27]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Machine Learning (ML) is ubiquitously on the advance. Like many domains, Earth Observation (EO) also increasingly relies on ML applications, where ML methods are applied to process vast amounts of heterogeneous and continuous data streams to answer socially and environmentally relevant questions. However, developing such ML- based EO systems remains challenging: Development processes and employed workflows are often barely structured and poorly reported. The application of ML methods and techniques is considered to be opaque and the lack of transparency is contradictory to the responsible development of ML-based EO applications. To improve this situation a better understanding of the current practices and engineering-related challenges in developing ML-based EO applications is required. In this paper, we report observations from an exploratory study where five experts shared their view on ML engineering in semi-structured interviews. We analysed these interviews with coding techniques as often applied in the domain of empirical software engineering. The interviews provide informative insights into the practical development of ML applications and reveal several engineering challenges. In addition, interviewees participated in a novel workflow sketching task, which provided a tangible reflection of implicit processes. Overall, the results confirm a gap between theoretical conceptions and real practices in ML development even though workflows were sketched abstractly as textbook-like. The results pave the way for a large-scale investigation on requirements for ML engineering in EO.<\/jats:p>","DOI":"10.1515\/itit-2020-0045","type":"journal-article","created":{"date-parts":[[2021,7,14]],"date-time":"2021-07-14T22:21:13Z","timestamp":1626301273000},"page":"235-247","source":"Crossref","is-referenced-by-count":3,"title":["A qualitative study of Machine Learning practices and engineering challenges in Earth Observation"],"prefix":"10.1515","volume":"63","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6217-8814","authenticated-orcid":false,"given":"Sophie","family":"Jentzsch","sequence":"first","affiliation":[{"name":"DLR Institute for Software Technology , Cologne , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1306-7880","authenticated-orcid":false,"given":"Nico","family":"Hochgeschwender","sequence":"additional","affiliation":[{"name":"DLR Institute for Software Technology , Cologne , Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"374","published-online":{"date-parts":[[2021,7,15]]},"reference":[{"key":"2023033120575691092_j_itit-2020-0045_ref_001","doi-asserted-by":"crossref","unstructured":"Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E., Nagappan, N., Nushi, B., and Zimmermann, T. Software engineering for machine learning: A case study. In 2019 IEEE\/ACM 41st International Conference on Software Engineering IEEE, pp.\u2009291\u2013300.","DOI":"10.1109\/ICSE-SEIP.2019.00042"},{"key":"2023033120575691092_j_itit-2020-0045_ref_002","doi-asserted-by":"crossref","unstructured":"Cachia, M., and Millward, L. The telephone medium and semi-structured interviews: a complementary fit. Qualitative Research in Organizations and Management: An International Journal (2011).","DOI":"10.1108\/17465641111188420"},{"key":"2023033120575691092_j_itit-2020-0045_ref_003","doi-asserted-by":"crossref","unstructured":"de Souza Nascimento, E., Ahmed, I., Oliveira, E., Palheta, M.\u2009P., Steinmacher, I., and Conte, T. Understanding development process of machine learning systems: Challenges and solutions. In 2019 ACM\/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) (2019), IEEE, pp.\u20091\u20136.","DOI":"10.1109\/ESEM.2019.8870157"},{"key":"2023033120575691092_j_itit-2020-0045_ref_004","unstructured":"Fischer, G. Modeling of Subsurface Scattering from Ice Sheets for Pol-InSAR Applications. PhD thesis, ETH Zurich, 2019."},{"key":"2023033120575691092_j_itit-2020-0045_ref_005","doi-asserted-by":"crossref","unstructured":"Ge, Z., Song, Z., Ding, S.\u2009X., and Huang, B. Data mining and analytics in the process industry: The role of machine learning. Ieee Access 5 (2017), 20590\u201320616.","DOI":"10.1109\/ACCESS.2017.2756872"},{"key":"2023033120575691092_j_itit-2020-0045_ref_006","doi-asserted-by":"crossref","unstructured":"Hassan, M.\u2009M., Smith, A.\u2009C., Walker, K., Rahman, M.\u2009K., and Southworth, J. Rohingya refugee crisis and forest cover change in teknaf, bangladesh. Remote Sensing 10, 5 (2018), 689.","DOI":"10.3390\/rs10050689"},{"key":"2023033120575691092_j_itit-2020-0045_ref_007","doi-asserted-by":"crossref","unstructured":"Hill, C., Bellamy, R., Erickson, T., and Burnett, M. Trials and tribulations of developers of intelligent systems: A field study. In 2016 IEEE Symposium on Visual Languages and Human-Centric Computing (VL\/HCC) (2016), IEEE, pp.\u2009162\u2013170.","DOI":"10.1109\/VLHCC.2016.7739680"},{"key":"2023033120575691092_j_itit-2020-0045_ref_008","doi-asserted-by":"crossref","unstructured":"Jentzsch, S.\u2009F., and Hochgeschwender, N. Don\u2019t forget your roots! using provenance data for transparent and explainable development of machine learning models. In 2019 34th IEEE\/ACM International Conference on Automated Software Engineering Workshop (ASEW) (2019), IEEE, pp.\u200937\u201340.","DOI":"10.1109\/ASEW.2019.00025"},{"key":"2023033120575691092_j_itit-2020-0045_ref_009","doi-asserted-by":"crossref","unstructured":"Kim, M., Zimmermann, T., DeLine, R., and Begel, A. The emerging role of data scientists on software development teams. In 2016 IEEE\/ACM 38th International Conference on Software Engineering (ICSE) (2016), IEEE, pp.\u200996\u2013107.","DOI":"10.1145\/2884781.2884783"},{"key":"2023033120575691092_j_itit-2020-0045_ref_010","doi-asserted-by":"crossref","unstructured":"Kim, M., Zimmermann, T., DeLine, R., and Begel, A. Data scientists in software teams: State of the art and challenges. IEEE Transactions on Software Engineering 44, 11 (2017), 1024\u20131038.","DOI":"10.1109\/TSE.2017.2754374"},{"key":"2023033120575691092_j_itit-2020-0045_ref_011","unstructured":"Koubarakis, M., Bereta, K., Bilidas, D., Giannousis, K., Ioannidis, T., Pantazi, D.-A., Stamoulis, G., Haridi, S., Vlassov, V., Bruzzone, L., et al. From copernicus big data to extreme earth analytics. Open Proceedings (2019), 690\u2013693."},{"key":"2023033120575691092_j_itit-2020-0045_ref_012","doi-asserted-by":"crossref","unstructured":"Lapuschkin, S., W\u00e4ldchen, S., Binder, A., Montavon, G., Samek, W., and M\u00fcller, K.-R. Unmasking clever hans predictors and assessing what machines really learn. Nature communications 10, 1 (2019), 1\u20138.","DOI":"10.1038\/s41467-019-08987-4"},{"key":"2023033120575691092_j_itit-2020-0045_ref_013","unstructured":"Parrella, G., Hajnsek, I., and Papathanassiou, K. Estimation of snow and firn properties by means of multi-angular polarimetric sar measurements."},{"key":"2023033120575691092_j_itit-2020-0045_ref_014","doi-asserted-by":"crossref","unstructured":"Phillips, S., Zimmermann, T., and Bird, C. Understanding and improving software build teams. In Proceedings of the 36th international conference on software engineering (2014), pp.\u2009735\u2013744.","DOI":"10.1145\/2568225.2568274"},{"key":"2023033120575691092_j_itit-2020-0045_ref_015","doi-asserted-by":"crossref","unstructured":"Quinn, J.\u2009A., Nyhan, M.\u2009M., Navarro, C., Coluccia, D., Bromley, L., and Luengo-Oroz, M. Humanitarian applications of machine learning with remote-sensing data: review and case study in refugee settlement mapping. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 376, 2128 (2018), 20170363.","DOI":"10.1098\/rsta.2017.0363"},{"key":"2023033120575691092_j_itit-2020-0045_ref_016","unstructured":"Schelter, S., Biessmann, F., Januschowski, T., Salinas, D., Seufert, S., Szarvas, G., Vartak, M., Madden, S., Miao, H., Deshpande, A., et al. On challenges in machine learning model management. IEEE Data Eng. Bull. 41, 4 (2018), 5\u201315."},{"key":"2023033120575691092_j_itit-2020-0045_ref_017","unstructured":"Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., and Young, M. Machine learning: The high interest credit card of technical debt."},{"key":"2023033120575691092_j_itit-2020-0045_ref_018","doi-asserted-by":"crossref","unstructured":"Strandberg, P.\u2009E. Ethical interviews in software engineering. In 2019 ACM\/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) (2019), IEEE, pp.\u20091\u201311.","DOI":"10.1109\/ESEM.2019.8870192"},{"key":"2023033120575691092_j_itit-2020-0045_ref_019","unstructured":"Van Someren, M., Barnard, Y., and Sandberg, J. The think aloud method: a practical approach to modelling cognitive. Citeseer, 1994."},{"key":"2023033120575691092_j_itit-2020-0045_ref_020","doi-asserted-by":"crossref","unstructured":"Vianna, A., Ferreira, W., and Gama, K. An exploratory study of how specialists deal with testing in data stream processing applications. In 2019 ACM\/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) (2019), IEEE, pp.\u20091\u20136.","DOI":"10.1109\/ESEM.2019.8870186"},{"key":"2023033120575691092_j_itit-2020-0045_ref_021","doi-asserted-by":"crossref","unstructured":"Vogelsang, A., and Borg, M. Requirements engineering for machine learning: Perspectives from data scientists. In 2019 IEEE 27th International Requirements Engineering Conference Workshops (REW) (2019), IEEE, pp.\u2009245\u2013251.","DOI":"10.1109\/REW.2019.00050"},{"key":"2023033120575691092_j_itit-2020-0045_ref_022","doi-asserted-by":"crossref","unstructured":"Wang, M., Cui, Y., Wang, X., Xiao, S., and Jiang, J. Machine learning for networking: Workflow, advances and opportunities. Ieee Network 32, 2 (2017), 92\u201399.","DOI":"10.1109\/MNET.2017.1700200"},{"key":"2023033120575691092_j_itit-2020-0045_ref_023","unstructured":"Wulder, M.\u2009A., Loveland, T.\u2009R., Roy, D.\u2009P., Crawford, C.\u2009J., Masek, J.\u2009G., Woodcock, C.\u2009E., Allen, R.\u2009G., Anderson, M.\u2009C., Belward, A.\u2009S., Cohen, W.\u2009B., et\u2009al. Remote sensing of environment: Current status of landsat program, science, and applications."},{"key":"2023033120575691092_j_itit-2020-0045_ref_024","doi-asserted-by":"crossref","unstructured":"Yang, J., Gong, P., Fu, R., Zhang, M., Chen, J., Liang, S., Xu, B., Shi, J., and Dickinson, R. The role of satellite remote sensing in climate change studies. Nature climate change 3, 10 (2013), 875\u2013883.","DOI":"10.1038\/nclimate1908"},{"key":"2023033120575691092_j_itit-2020-0045_ref_025","unstructured":"Zaharia, M., Chen, A., Davidson, A., Ghodsi, A., Hong, S.\u2009A., Konwinski, A., Murching, S., Nykodym, T., Ogilvie, P., Parkhe, M., et al. Accelerating the machine learning lifecycle with mlflow. IEEE Data Eng. Bull. 41, 4 (2018), 39\u201345."},{"key":"2023033120575691092_j_itit-2020-0045_ref_026","doi-asserted-by":"crossref","unstructured":"Zhang, L., Zhang, L., and Du, B. Deep learning for remote sensing data: A technical tutorial on the state of the art. IEEE Geoscience and Remote Sensing Magazine 4, 2 (2016), 22\u201340.","DOI":"10.1109\/MGRS.2016.2540798"},{"key":"2023033120575691092_j_itit-2020-0045_ref_027","doi-asserted-by":"crossref","unstructured":"Zhu, X.\u2009X., Tuia, D., Mou, L., Xia, G.-S., Zhang, L., Xu, F., and Fraundorfer, F. Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geoscience and Remote Sensing Magazine 5, 4 (2017), 8\u201336.","DOI":"10.1109\/MGRS.2017.2762307"}],"container-title":["it - Information Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/itit-2020-0045\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/itit-2020-0045\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,1]],"date-time":"2023-04-01T10:29:50Z","timestamp":1680344990000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1515\/itit-2020-0045\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,15]]},"references-count":27,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,8,20]]},"published-print":{"date-parts":[[2021,9,27]]}},"alternative-id":["10.1515\/itit-2020-0045"],"URL":"https:\/\/doi.org\/10.1515\/itit-2020-0045","relation":{},"ISSN":["2196-7032","1611-2776"],"issn-type":[{"type":"electronic","value":"2196-7032"},{"type":"print","value":"1611-2776"}],"subject":[],"published":{"date-parts":[[2021,7,15]]}}}