{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T11:54:26Z","timestamp":1780401266575,"version":"3.54.1"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"14","license":[{"start":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T00:00:00Z","timestamp":1759104000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T00:00:00Z","timestamp":1759104000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100006382","name":"Universidad de Oviedo","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100006382","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Cluster Comput"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Machine learning (ML) transitioned from a purely academic discipline to an applied field, gaining strategic importance in various industries. Meanwhile, Machine Learning Operations (MLOps) has been widely adopted by enterprises as a comprehensive approach for developing and managing machine learning applications. Despite its advantages, challenges remain. The rising demand for flexibility and scalability has led organizations to embrace multi-cloud and hybrid cloud architectures as preferred solutions. However, the autonomous and distributed nature of modern application development, combined with the complexity of training and deploying machine learning models, makes unified operational management impractical, and this will further affect application quality and efficiency. To address these challenges, this paper proposes a framework to manage model training and deployment in a multi-cloud environment. This framework uses a policy-based resource provisioning approach, agent-based application topology reconstruction, and a visualization dashboard. It aims to provide a cloud provider-neutral solution that enhances the quality of application operations. The framework design is introduced, followed by the implementation of a proof-of-concept prototype. Experiments conducted in various empirical scenarios demonstrate that the proposed framework effectively manages deployment resources while providing clear visibility and control across multiple clouds. The results confirm that this framework enhances control over deployment resources and optimizes model deployment efficiency in multi-cloud infrastructure.<\/jats:p>","DOI":"10.1007\/s10586-025-05584-7","type":"journal-article","created":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T20:21:27Z","timestamp":1759177287000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Optimizing machine learning operations in multi-cloud infrastructure: a framework for unified deployment management and topology discovery"],"prefix":"10.1007","volume":"28","author":[{"given":"Hao","family":"Wei","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xabiel Garc\u00eda","family":"Pa\u00f1eda","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Joaqu\u00edn Salvach\u00faa","family":"Rodriguez","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,9,29]]},"reference":[{"issue":"3","key":"5584_CR1","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1007\/s42979-021-00592-x","volume":"2","author":"IH Sarker","year":"2021","unstructured":"Sarker, I.H.: Machine learning: Algorithms, real-world applications and research directions. SN computer science 2(3), 160 (2021)","journal-title":"SN computer science"},{"key":"5584_CR2","doi-asserted-by":"publisher","first-page":"106368","DOI":"10.1016\/j.infsof.2020.106368","volume":"127","author":"LE Lwakatare","year":"2020","unstructured":"Lwakatare, L.E., Raj, A., Crnkovic, I., Bosch, J., Olsson, H.H.: Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions. Inf. Softw. Technol. 127, 106368 (2020)","journal-title":"Inf. Softw. Technol."},{"key":"5584_CR3","unstructured":"Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., Young, M., Crespo, J.-F., Dennison, D.: Hidden technical debt in machine learning systems. In: Cortes, C., Lawrence, N., Lee, D., Sugiyama, M., Garnett, R. (eds.), Advances in Neural Information Processing Systems, vol. 28. Curran Associates, Inc (2015)"},{"key":"5584_CR4","doi-asserted-by":"publisher","unstructured":"Zhengxin, F., Yi, Y., Jingyu, Z., Liu Yue, M., Qinghua, Y.L., Xiwei, X., Jeff, W., Chen, W., Shuai, Z. Shiping, C.: Mlops spanning whole machine learning life cycle:\u00a0A survey.\u00a0https:\/\/doi.org\/10.48550\/arXiv.2304.07296\u00a0(2023)","DOI":"10.48550\/arXiv.2304.07296"},{"key":"5584_CR5","doi-asserted-by":"publisher","first-page":"31866","DOI":"10.1109\/ACCESS.2023.3262138","volume":"11","author":"D Kreuzberger","year":"2023","unstructured":"Kreuzberger, D., K\u00fchl, N., Hirschl, S.: Machine learning operations (mlops): Overview, definition, and architecture. IEEE Access 11, 31866\u201331879 (2023)","journal-title":"IEEE Access"},{"key":"5584_CR6","unstructured":"S &P Global Market Intelligency. Multicloud in the mainstream. techreport, S &P Global Market Intelligency, February (2023)"},{"issue":"4","key":"5584_CR7","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1007\/s10664-023-10308-9","volume":"28","author":"HM Ayas","year":"2023","unstructured":"Ayas, H.M., Leitner, P., Hebig, R.: An empirical study of the systemic and technical migration towards microservices. Empir. Softw. Eng. 28(4), 85 (2023)","journal-title":"Empir. Softw. Eng."},{"issue":"10","key":"5584_CR8","doi-asserted-by":"publisher","first-page":"2061","DOI":"10.1007\/s00607-023-01173-x","volume":"105","author":"G Fragiadakis","year":"2023","unstructured":"Fragiadakis, G., Liagkou, V., Filiopoulou, E., Fragkakis, D., Michalakelis, C., Nikolaidou, M.: Cloud services cost comparison: a clustering analysis framework. Computing 105(10), 2061\u20132088 (2023)","journal-title":"Computing"},{"issue":"4","key":"5584_CR9","doi-asserted-by":"publisher","first-page":"2469","DOI":"10.1007\/s10664-019-09681-1","volume":"24","author":"C Laaber","year":"2019","unstructured":"Laaber, C., Scheuner, J., Leitner, P.: Software microbenchmarking in the cloud. how bad is it really? Empir. Softw. Eng. 24(4), 2469\u20132508 (2019)","journal-title":"Empir. Softw. Eng."},{"key":"5584_CR10","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2021.111031","volume":"180","author":"G Giray","year":"2021","unstructured":"Giray, G.: A software engineering perspective on engineering machine learning systems: State of the art and challenges. J. Syst. Softw. 180,(2021)","journal-title":"J. Syst. Softw."},{"issue":"6","key":"5584_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3533378","volume":"55","author":"A Paleyes","year":"2022","unstructured":"Paleyes, A., Urma, R.-G., Lawrence, N.D.: Challenges in deploying machine learning: A survey of case studies. ACM Comput. Surv. 55(6), 1\u201329 (2022)","journal-title":"ACM Comput. Surv."},{"key":"5584_CR12","doi-asserted-by":"crossref","unstructured":"Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E., Nagappan, N., Nushi, B., Zimmermann, T.: Software engineering for machine learning: A case study. In 2019 IEEE\/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pages 291\u2013300 (2019)","DOI":"10.1109\/ICSE-SEIP.2019.00042"},{"issue":"5","key":"5584_CR13","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1007\/s10664-021-09967-3","volume":"26","author":"NB Moe","year":"2021","unstructured":"Moe, N.B., \u0160mite, D., Paasivaara, M., Lassenius, C.: Finding the sweet spot for organizational control and team autonomy in large-scale agile software development. Empir. Softw. Eng. 26(5), 101 (2021)","journal-title":"Empir. Softw. Eng."},{"key":"5584_CR14","doi-asserted-by":"crossref","unstructured":"Shridhar, A., Nadig, D.: Heuristic-based resource allocation for cloud-native machine learning workloads. In 2022 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS), pages 415\u2013418 (2022)","DOI":"10.1109\/ANTS56424.2022.10227727"},{"issue":"5","key":"5584_CR15","doi-asserted-by":"publisher","first-page":"665","DOI":"10.1007\/s42979-023-02101-8","volume":"4","author":"M-C Chiang","year":"2023","unstructured":"Chiang, M.-C., Zhang, L.-W., Chou, Y.-M., Chou, J.: Dynamic resource management for machine learning pipeline workloads. SN Computer Science 4(5), 665 (2023)","journal-title":"SN Computer Science"},{"key":"5584_CR16","doi-asserted-by":"crossref","unstructured":"Chiang, M.-C., Chou, J.: Dynamoml: Dynamic resource management operators for machine learning workloads. In Proceedings of the 11th International Conference on Cloud Computing and Services Science-Volume 1: CLOSER, pages 122\u2013132. INSTICC, SciTePress (2021)","DOI":"10.5220\/0010483401220132"},{"issue":"12","key":"5584_CR17","doi-asserted-by":"publisher","first-page":"4425","DOI":"10.3390\/s22124425","volume":"22","author":"R Mi\u00f1\u00f3n","year":"2022","unstructured":"Mi\u00f1\u00f3n, R., Arcaya, J.D., Torre-Bastida, A.I., Hartlieb, P.: Pangea: An mlops tool for automatically generating infrastructure and deploying analytic pipelines in edge, fog and cloud layers. Sensors 22(12), 4425 (2022)","journal-title":"Sensors"},{"key":"5584_CR18","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Hua, W., Zhou, Z., Suh, G.E., Delimitrou, C.: Sinan: Ml-based and qos-aware resource management for cloud microservices. In Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS \u201921, page 167-181, New York, NY, USA. Association for Computing Machinery (2021)","DOI":"10.1145\/3445814.3446693"},{"key":"5584_CR19","doi-asserted-by":"publisher","first-page":"102127","DOI":"10.1016\/j.simpat.2020.102127","volume":"104","author":"N Gholipour","year":"2020","unstructured":"Gholipour, N., Arianyan, E., Buyya, R.: A novel energy-aware resource management technique using joint vm and container consolidation approach for green computing in cloud data centers. Simul. Model. Pract. Theory 104, 102127 (2020)","journal-title":"Simul. Model. Pract. Theory"},{"issue":"4","key":"5584_CR20","doi-asserted-by":"publisher","first-page":"848","DOI":"10.1109\/JAS.2021.1003934","volume":"8","author":"Q-H Zhu","year":"2021","unstructured":"Zhu, Q.-H., Tang, H., Huang, J.-J., Hou, Y.: Task scheduling for multi-cloud computing subject to security and reliability constraints. IEEE\/CAA Journal of Automatica Sinica 8(4), 848\u2013865 (2021)","journal-title":"IEEE\/CAA Journal of Automatica Sinica"},{"issue":"1","key":"5584_CR21","doi-asserted-by":"publisher","first-page":"21850","DOI":"10.1038\/s41598-024-72774-5","volume":"14","author":"S Mangalampalli","year":"2024","unstructured":"Mangalampalli, S., Karri, G.R., Ratnamani, M.V., Mohanty, S.N., Jabr, B.A., Ali, Y.A., Ali, S., Abdullaeva, B.S.: Efficient deep reinforcement learning based task scheduler in multi cloud environment. Sci. Rep. 14(1), 21850 (2024)","journal-title":"Sci. Rep."},{"issue":"9","key":"5584_CR22","doi-asserted-by":"publisher","first-page":"2897","DOI":"10.1007\/s00607-024-01311-z","volume":"106","author":"A Ghasemi","year":"2024","unstructured":"Ghasemi, A., Haghighat, A.T., Keshavarzi, A.: Enhancing virtual machine placement efficiency in cloud data centers: a hybrid approach using multi-objective reinforcement learning and clustering strategies. Computing 106(9), 2897\u20132922 (2024)","journal-title":"Computing"},{"key":"5584_CR23","unstructured":"Li, Z., Cheng, Q., Hsieh, K., Dang, Y., Huang, P., Singh, P., Yang, X., L, Qin, Wu, Y., Levy, S., Chintalapati, M.: Gandalf: An intelligent, End-To-End analytics service for safe deployment in Large-Scale cloud infrastructure. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20), pages 389\u2013402, Santa Clara, CA, February. USENIX Association (2020)"},{"issue":"1","key":"5584_CR24","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1109\/COMST.2016.2597193","volume":"19","author":"S Khan","year":"2017","unstructured":"Khan, S., Gani, A., Wahab, A.W.A., Guizani, M., Khan, M.K.: Topology Discovery in Software Defined Networks: Threats, Taxonomy, and State-of-the-Art. IEEE Communications Surveys & Tutorials 19(1), 303\u2013324 (2017)","journal-title":"IEEE Communications Surveys & Tutorials"},{"key":"5584_CR25","doi-asserted-by":"crossref","unstructured":"Hwang, J., Liu, G., Zeng, S., Wu, F.Y., Wood, T.: Topology Discovery and Service Classification for Distributed-Aware Clouds. In 2014 IEEE International Conference on Cloud Engineering, pages 385\u2013390. IEEE (2014)","DOI":"10.1109\/IC2E.2014.86"},{"key":"5584_CR26","doi-asserted-by":"crossref","unstructured":"Sangpetch, A., Kim, H.S.: VDEP: VM Dependency Discovery in Multi-tier Cloud Applications. In 2015 IEEE 8th International Conference on Cloud Computing, pages 694\u2013701. IEEE (2015)","DOI":"10.1109\/CLOUD.2015.97"},{"key":"5584_CR27","first-page":"69","volume":"2","author":"T Lutellier","year":"2015","unstructured":"Lutellier, T., Chollak, D., Garciam, J., Tan, L., Derek, R., Medvidovic, N., Kroeger, R.: Comparing software architecture recovery techniques using accurate dependencies. 2015 IEEE\/ACM 37th IEEE International Conference on Software Engineering 2, 69\u201378 (2015)","journal-title":"2015 IEEE\/ACM 37th IEEE International Conference on Software Engineering"},{"key":"5584_CR28","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Xu, Z., Liu, C., Chen, H., Sun, J., Qiu, D., Liu, Y.: Software architecture recovery with information fusion. In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC\/FSE 2023, page 1535\u20131547, New York, NY, USA. Association for Computing Machinery (2023)","DOI":"10.1145\/3611643.3616285"},{"key":"5584_CR29","doi-asserted-by":"crossref","unstructured":"Link, D., Behnamghader, P., Moazeni, R., Boehm, B.: Recover and relax: Concern-oriented software architecture recovery for systems development and maintenance. In 2019 IEEE\/ACM International Conference on Software and System Processes (ICSSP), pages 64\u201373, May (2019)","DOI":"10.1109\/ICSSP.2019.00018"},{"key":"5584_CR30","doi-asserted-by":"crossref","unstructured":"Link, D., Behnamghader, P., Moazeni, R., Boehm, B.: The value of software architecture recovery for maintenance. In Proceedings of the 12th Innovations in Software Engineering Conference (Formerly Known as India Software Engineering Conference), ISEC \u201919, New York, NY, USA. Association for Computing Machinery (2019)","DOI":"10.1145\/3299771.3299787"},{"key":"5584_CR31","doi-asserted-by":"crossref","unstructured":"Ullmann, G.C., Gu\u00e9h\u00e9neuc, Y.-G., Petrillo, F., Anquetil, N., Politowski, C.: Visualising game engine subsystem coupling patterns. In Paolo Ciancarini, Angelo Di Iorio, Helmut Hlavacs, and Francesco Poggi, editors, Entertainment Computing\u2013 ICEC, pages 263\u2013274, Singapore, 2023. Springer Nature Singapore (2023)","DOI":"10.1007\/978-981-99-8248-6_22"},{"issue":"2","key":"5584_CR32","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1109\/TSE.2017.2671865","volume":"44","author":"T Lutellier","year":"2018","unstructured":"Lutellier, T., Chollak, D., Garcia, J., Tan, L., Rayside, D., Medvidovi\u0107, N., Kroeger, R.: Measuring the impact of code dependencies on software architecture recovery techniques. IEEE Trans. Software Eng. 44(2), 159\u2013181 (2018)","journal-title":"IEEE Trans. Software Eng."},{"key":"5584_CR33","unstructured":"Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2015)"}],"container-title":["Cluster Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-025-05584-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10586-025-05584-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-025-05584-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,27]],"date-time":"2025-11-27T10:05:51Z","timestamp":1764237951000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10586-025-05584-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,29]]},"references-count":33,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["5584"],"URL":"https:\/\/doi.org\/10.1007\/s10586-025-05584-7","relation":{},"ISSN":["1386-7857","1573-7543"],"issn-type":[{"value":"1386-7857","type":"print"},{"value":"1573-7543","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,29]]},"assertion":[{"value":"22 November 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 May 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 June 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 September 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter discussed in this manuscript.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of Interest"}}],"article-number":"933"}}