{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T12:24:04Z","timestamp":1781785444043,"version":"3.54.5"},"reference-count":65,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T00:00:00Z","timestamp":1760659200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>Artificial intelligence (AI) is increasingly embedded within organizational infrastructures, yet the foundational role of data in shaping AI outcomes remains underexplored. This study positions data at the center of complexity, uncertainty, and strategic decision-making in AI development, aligning with the emerging paradigm of data-centric AI (DCAI). Based on in-depth interviews with 74 senior AI and data professionals, the research examines how experts conceptualize and operationalize data throughout the AI lifecycle. A thematic analysis reveals five interconnected domains reflecting sociotechnical and organizational challenges\u2014such as data quality, governance, contextualization, and alignment with business objectives. The study proposes a conceptual model depicting data as a dynamic infrastructure underpinning all AI phases, from collection to deployment and monitoring. Findings indicate that data-related issues, more than model sophistication, are the primary bottlenecks undermining system reliability, fairness, and accountability. Practically, this research advocates for increased investment in the development of intelligent systems designed to ensure high-quality data management. Theoretically, it reframes data as a site of labor and negotiation, challenging dominant model-centric narratives. By integrating empirical insights with normative concerns, this study contributes to the design of more trustworthy and ethically grounded AI systems within the DCAI framework.<\/jats:p>","DOI":"10.3390\/make7040122","type":"journal-article","created":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T11:39:34Z","timestamp":1760701174000},"page":"122","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Behind the Algorithm: International Insights into Data-Driven AI Model Development"],"prefix":"10.3390","volume":"7","author":[{"given":"Limor","family":"Ziv","sequence":"first","affiliation":[{"name":"School of Communication, Bar-Ilan University, Ramat Gan 5290002, Israel"},{"name":"Department of Management, Bar-Ilan University, Ramat Gan 5290002, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6791-1624","authenticated-orcid":false,"given":"Maayan","family":"Nakash","sequence":"additional","affiliation":[{"name":"Department of Management, Bar-Ilan University, Ramat Gan 5290002, Israel"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,10,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1007\/s00146-025-02267-0","article-title":"The end AI innocence: Genie is out of the bottle","volume":"40","author":"Gill","year":"2025","journal-title":"AI Soc."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3457607","article-title":"A survey on bias and fairness in machine learning","volume":"54","author":"Mehrabi","year":"2021","journal-title":"ACM Comput. Surv. (CSUR)"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Nakash, M., and Bolisani, E. (2024). Knowledge management meets artificial intelligence: A systematic review and future research agenda. European Conference on Knowledge Management, Academic Conferences International Limited.","DOI":"10.34190\/eckm.25.1.2443"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1108\/BPMJ-11-2024-1137","article-title":"The transformative impact of AI on knowledge management processes","volume":"31","author":"Nakash","year":"2025","journal-title":"Bus. Process Manag. J."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Sambasivan, N., Kapania, S., Highfill, H., Akrong, D., Paritosh, P., and Aroyo, L.M. (2021, January 8\u201313). \u201cEveryone wants to do the model work, not the data work\u201d: Data Cascades in High-Stakes AI. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, Yokohama, Japan.","DOI":"10.1145\/3411764.3445518"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Goyal, M., and Mahmoud, Q.H. (2024). A systematic review of synthetic data generation techniques using generative AI. Electronics, 13.","DOI":"10.3390\/electronics13173509"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Patel, K. (2025, February 04). Ethical Reflections on Data-Centric AI: Balancing Benefits and Risks. Available at SSRN 4993089. Available online: https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=4993089.","DOI":"10.2139\/ssrn.4993089"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zha, D., Bhat, Z.P., Lai, K.H., Yang, F., and Hu, X. (2023, January 27\u201329). Data-centric AI: Perspectives and challenges. Proceedings of the 2023 SIAM International Conference on Data Mining (SDM), Minneapolis, MN, USA.","DOI":"10.1137\/1.9781611977653.ch106"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"33173","DOI":"10.1109\/ACCESS.2024.3369417","article-title":"Opportunities and challenges in data-centric AI","volume":"12","author":"Kumar","year":"2024","journal-title":"IEEE Access"},{"key":"ref_10","first-page":"3","article-title":"Machine learning and big data: What is important?","volume":"42","author":"Stonebraker","year":"2019","journal-title":"IEEE Data Eng. Bull."},{"key":"ref_11","first-page":"5320","article-title":"Dataperf: Benchmarks for data-centric ai development","volume":"36","author":"Mazumder","year":"2023","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_12","first-page":"2","article-title":"Only 3% of companies\u2019 data meets basic quality standards","volume":"95","author":"Nagle","year":"2017","journal-title":"Harv. Bus. Rev."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1038\/d41586-025-01067-2","article-title":"Why an overreliance on AI-driven modelling is bad for science","volume":"640","author":"Narayanan","year":"2025","journal-title":"Nature"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"e275","DOI":"10.1002\/pra2.275","article-title":"Good systems, bad data?: Interpretations of AI hype and failures","volume":"57","author":"Slota","year":"2020","journal-title":"Proc. Assoc. Inf. Sci. Technol."},{"key":"ref_15","unstructured":"Wagstaff, K. (2012). Machine learning that matters. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1588","DOI":"10.1177\/23998083221105496","article-title":"Planning data","volume":"49","author":"Batty","year":"2022","journal-title":"Environ. Plan. B Urban Anal. City Sci."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Jarrahi, M.H., Memariani, A., and Guha, S. (2022). The principles of data-centric AI (DCAI). arXiv.","DOI":"10.1145\/3571724"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"791","DOI":"10.1007\/s00778-022-00775-9","article-title":"Data collection and quality challenges in deep learning: A data-centric ai perspective","volume":"32","author":"Whang","year":"2023","journal-title":"VLDB J."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"e13406","DOI":"10.1111\/exsy.13406","article-title":"Artificial intelligence governance: Ethical considerations and implications for social responsibility","volume":"41","author":"Camilleri","year":"2024","journal-title":"Expert Syst."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"2463722","DOI":"10.1080\/08839514.2025.2463722","article-title":"AI ethics: Integrating transparency, fairness, and privacy in AI development","volume":"39","author":"Radanliev","year":"2025","journal-title":"Appl. Artif. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1007\/s10676-022-09624-3","article-title":"A sociotechnical perspective for the future of AI: Narratives, inequalities, and human control","volume":"24","author":"Sartori","year":"2022","journal-title":"Ethics Inf. Technol."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"100336","DOI":"10.1016\/j.patter.2021.100336","article-title":"Data and its (dis) contents: A survey of dataset development and use in machine learning research","volume":"2","author":"Paullada","year":"2021","journal-title":"Patterns"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3711118","article-title":"Data-centric artificial intelligence: A survey","volume":"57","author":"Zha","year":"2025","journal-title":"ACM Comput. Surv."},{"key":"ref_24","unstructured":"Russell, S., and Norvig, P. (2020). Artificial Intelligence: A Modern Approach, Pearson Education. [4th ed]. Available online: http:\/\/lib.ysu.am\/disciplines_bk\/efdd4d1d4c2087fe1cbe03d9ced67f34.pdf."},{"key":"ref_25","unstructured":"Domingos, P. (2015). The Master Algorithm: How the Quest for the Ultimate Learning Machine will Remake Our World, Basic Books. Available online: https:\/\/www.redalyc.org\/pdf\/6380\/638067264018.pdf."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Domingos, P. (2018, January 10\u201315). Machine learning for data management: Problems and solutions. Proceedings of the 2018 International Conference on Management of Data, Houston, TX, USA. Available online: https:\/\/doi.org\/10.1145\/3183713.3199515.","DOI":"10.1145\/3183713.3199515"},{"key":"ref_27","first-page":"1","article-title":"Introduction to machine learning & knowledge extraction (make)","volume":"1","author":"Holzinger","year":"2019","journal-title":"Mach. Learn. Knowl. Extr."},{"key":"ref_28","unstructured":"Bommasani, R., Hudson, D.A., Adeli, E., Altman, R., Arora, S., von Arx, S., and Liang, P. (2021). On the opportunities and risks of foundation models. arXiv."},{"key":"ref_29","first-page":"277","article-title":"Generative AI and ChatGPT: Applications, challenges, and AI-human collaboration","volume":"25","author":"Zheng","year":"2023","journal-title":"J. Inf. Technol. Case Appl. Res."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"5873","DOI":"10.1109\/TAI.2024.3444742","article-title":"Recent advances in generative ai and large language models: Current status, challenges, and perspectives","volume":"5","author":"Hagos","year":"2024","journal-title":"IEEE Trans. Artif. Intell."},{"key":"ref_31","first-page":"37","article-title":"AI for knowledge creation, curation, and consumption in context","volume":"25","author":"Schwartz","year":"2024","journal-title":"J. Assoc. Inf. Syst."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/s44163-024-00151-2","article-title":"Challenges with developing and deploying AI models and applications in industrial systems","volume":"4","author":"Sinha","year":"2024","journal-title":"Discov. Artif. Intell."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Kirchner, K., Bolisani, E., Kassaneh, T.C., Scarso, E., and Taraghi, N. (2025). Generative AI Meets Knowledge Management: Insights From Software Development Practices. Knowl. Process Manag., 1\u201313.","DOI":"10.1002\/kpm.70004"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Peretz, O., and Nakash, M. (2025, January 5\u20137). From Junior to Senior: Skill Requirements for AI Professionals Across Career Stages. Proceedings of the International Conference on Research in Business, Management and Finance, Rome, Italy.","DOI":"10.33422\/icrbmf.v2i1.1178"},{"key":"ref_35","unstructured":"McKinsey (2025, May 22). The State of AI in 2023: Generative AI\u2019s Breakout Year. Available online: https:\/\/www.mckinsey.com\/capabilities\/quantumblack\/our-insights\/the-state-of-ai-in-2023-generative-ais-breakout-year."},{"key":"ref_36","unstructured":"Deloitte (2025, May 22). 2024 Year-End Generative AI Report. Available online: https:\/\/www.deloitte.com\/us\/en\/what-we-do\/capabilities\/applied-artificial-intelligence\/content\/state-of-generative-ai-in-enterprise.html."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1177\/09726225221124035","article-title":"Factors influencing the AI adoption in organizations","volume":"21","author":"Kurup","year":"2022","journal-title":"Metamorphosis"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1111\/jems.12576","article-title":"AI adoption in America: Who, what, and where","volume":"33","author":"McElheran","year":"2024","journal-title":"J. Econ. Manag. Strategy"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"e661","DOI":"10.7717\/peerj-cs.661","article-title":"Artificial intelligence maturity model: A systematic literature review","volume":"7","author":"Sadiq","year":"2021","journal-title":"PeerJ Comput. Sci."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Romeo, E., and Lacko, J. (2025). Adoption and integration of AI in organizations: A systematic review of challenges and drivers towards future directions of research. Kybernetes, 1\u201322.","DOI":"10.1108\/K-07-2024-2002"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1395","DOI":"10.1007\/s10869-024-09963-6","article-title":"The buffering role of workplace mindfulness: How job insecurity of human-artificial intelligence collaboration impacts employees\u2019 work\u2013life-related outcomes","volume":"39","author":"Wu","year":"2024","journal-title":"J. Bus. Psychol."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1007\/s00146-019-00931-w","article-title":"In AI we trust? Perceptions about automated decision-making by artificial intelligence","volume":"35","author":"Araujo","year":"2020","journal-title":"AI Soc."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1267","DOI":"10.1007\/s00146-023-01704-2","article-title":"Embedding AI in society: Ethics, policy, governance, and impacts","volume":"38","author":"Pflanzer","year":"2023","journal-title":"AI Soc."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"230","DOI":"10.1016\/j.procs.2025.02.116","article-title":"The Artificial Intelligence Act: Insights regarding its application and implications","volume":"256","author":"Cabrera","year":"2025","journal-title":"Procedia Comput. Sci."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1961","DOI":"10.1007\/s00146-023-01650-z","article-title":"The regulation of artificial intelligence","volume":"39","author":"Finocchiaro","year":"2024","journal-title":"Ai Soc."},{"key":"ref_46","unstructured":"Redman, T.C. (2018). If your data is bad, your machine learning tools are useless. Harv. Bus. Rev., 2, Available online: https:\/\/hbr.org\/2018\/04\/if-your-data-is-bad-your-machine-learning-tools-are-useless."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1177\/0890334417698693","article-title":"About research: Qualitative methodologies","volume":"33","author":"Dodgson","year":"2017","journal-title":"J. Hum. Lact."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Islam, M.R., Khan, N.A., and Baikady, R. (2022). Sampling techniques for qualitative research. Principles of Social Research Methodology, Springer.","DOI":"10.1007\/978-981-19-5441-2"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"23","DOI":"10.26458\/jedep.v7i1.571","article-title":"Qualitative research methodology in social sciences and related subjects","volume":"7","author":"Mohajan","year":"2018","journal-title":"J. Econ. Dev. Environ. People"},{"key":"ref_50","unstructured":"Gummesson, E. (2000). Qualitative Methods in Management Research, Sage. Available online: https:\/\/www.researchgate.net\/publication\/215915855_Qualitative_Research_Methods_in_Management_Research."},{"key":"ref_51","unstructured":"Creswell, J.W., and Poth, C.N. (2016). Qualitative Inquiry and Research Design: Choosing Among Five Approaches, Sage Publications."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"192","DOI":"10.4103\/2229-3485.115389","article-title":"Qualitative research","volume":"4","author":"Pathak","year":"2013","journal-title":"Perspect. Clin. Res."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Scanlan, C.L. (2020). Preparing for the Unanticipated: Challenges in Conducting Semi-Structured, In-Depth Interviews, Sage Publications Limited.","DOI":"10.4135\/9781529719208"},{"key":"ref_54","first-page":"1","article-title":"Purposive sampling in qualitative research: A framework for the entire journey","volume":"59","author":"Ahmad","year":"2024","journal-title":"Qual. Quant."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1177\/1525822X05279903","article-title":"How many interviews are enough? An experiment with data saturation and variability","volume":"18","author":"Guest","year":"2006","journal-title":"Field Methods"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1177\/1525822X16640447","article-title":"How many interviews are enough to identify metathemes in multisited and cross-cultural research? Another perspective on Guest, Bunce, and Johnson\u2019s (2006) landmark study","volume":"29","author":"Hagaman","year":"2017","journal-title":"Field Methods"},{"key":"ref_57","first-page":"11","article-title":"Does sample size matter in qualitative research?: A review of qualitative interviews in IS research","volume":"54","author":"Marshall","year":"2013","journal-title":"J. Comput. Inf. Syst."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"426","DOI":"10.1108\/QMR-06-2016-0053","article-title":"Sample size for qualitative research","volume":"19","author":"Boddy","year":"2016","journal-title":"Qual. Mark. Res. Int. J."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Bouncken, R.B., Czakon, W., and Schmitt, F. (2025). Purposeful sampling and saturation in qualitative research methodologies: Recommendations and review. Rev. Manag. Sci., 1\u201337.","DOI":"10.1007\/s11846-025-00881-2"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"1319","DOI":"10.1007\/s10508-012-0016-6","article-title":"Sample size policy for qualitative studies using in-depth interviews","volume":"41","author":"Dworkin","year":"2012","journal-title":"Arch. Sex. Behav."},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Braun, V., and Clarke, V. (2024). Thematic analysis. Encyclopedia of Quality of Life and Well-Being Research, Springer International Publishing.","DOI":"10.1007\/978-3-031-17299-1_3470"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Guest, G., MacQueen, K.M., and Namey, E.E. (2011). Applied Thematic Analysis, Sage publications. Available online: https:\/\/antle.iat.sfu.ca\/wp-content\/uploads\/Guest_2012_AppliedThematicAnlaysis_Ch1.pdf.","DOI":"10.4135\/9781483384436"},{"key":"ref_63","first-page":"1","article-title":"Intercoder reliability in qualitative research: Debates and practical guidelines","volume":"19","author":"Joffe","year":"2020","journal-title":"Int. J. Qual. Methods"},{"key":"ref_64","unstructured":"D\u2019Ignazio, C., and Klein, L.F. (2023). Data Feminism, MIT Press."},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Brandao, P.R. (2025). The Impact of Artificial Intelligence on Modern Society. AI, 6.","DOI":"10.3390\/ai6080190"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/7\/4\/122\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T04:20:58Z","timestamp":1761020458000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/7\/4\/122"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,17]]},"references-count":65,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["make7040122"],"URL":"https:\/\/doi.org\/10.3390\/make7040122","relation":{},"ISSN":["2504-4990"],"issn-type":[{"value":"2504-4990","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,17]]}}}