{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T12:33:49Z","timestamp":1769085229356,"version":"3.49.0"},"reference-count":33,"publisher":"China Science Publishing & Media Ltd.","issue":"4","license":[{"start":{"date-parts":[[2022,8,31]],"date-time":"2022-08-31T00:00:00Z","timestamp":1661904000000},"content-version":"vor","delay-in-days":242,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines (that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and stewardship has the potential to remarkably enhance the framework for the reuse of research data. In this way, FAIR is aiding digital transformation. The \u2018FAIRification\u2019 of data increases the interoperability and (re)usability of data, so that new and robust analytical tools, such as machine learning (ML) models, can access the data to deduce meaningful insights, extract actionable information, and identify hidden patterns. This article aims to build a FAIR ML model pipeline using the generic FAIRification workflow to make the whole ML analytics process FAIR. Accordingly, FAIR input data was modelled using a FAIR ML model. The output data from the FAIR ML model was also made FAIR. For this, a hybrid hierarchical k-means (HHK) clustering ML algorithm was applied to group the data into homogeneous subgroups and ascertain the underlying structure of the data using a Nigerian-based FAIR dataset that contains data on economic factors, healthcare facilities, and coronavirus occurrences in all the 36 states of Nigeria. The model showed that research data and the ML pipeline can be FAIRified, shared, and reused by following the proposed FAIRification workflow and implementing technical architecture.<\/jats:p>","DOI":"10.1162\/dint_a_00182","type":"journal-article","created":{"date-parts":[[2022,8,31]],"date-time":"2022-08-31T15:27:18Z","timestamp":1661959638000},"page":"971-990","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":9,"title":["FAIR Machine Learning Model Pipeline Implementation of COVID-19 Data"],"prefix":"10.3724","volume":"4","author":[{"given":"Sakinat","family":"Folorunso","sequence":"first","affiliation":[{"name":"Department of Mathematical Sciences, Olabisi Onabanjo University, P.M.B 2002, Ago-Iwoye, Ogun State, Nigeria 120005, Nigeria"}]},{"given":"Ezekiel","family":"Ogundepo","sequence":"additional","affiliation":[{"name":"Data Science Nigeria, Lagos 105102, Nigeria"}]},{"given":"Mariam","family":"Basajja","sequence":"additional","affiliation":[{"name":"Leiden University, 1011NC, Amsterdam, the Netherlands"}]},{"given":"Joseph","family":"Awotunde","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Ilorin, Ilorin, Kwara State, 240103, Nigeria"}]},{"given":"Abdullahi","family":"Kawu","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Ibrahim Badamosi University, Lapai, Niger State, 911101, Nigeria"}]},{"given":"Francisca","family":"Oladipo","sequence":"additional","affiliation":[{"name":"Kampala International University, 260101, Uganda"},{"name":"Federal University Lokoja, Nigeria"},{"name":"Virus Outbreak Data Network-Africa"}]},{"given":"Abdullahi","family":"Ibrahim","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Ibrahim Badamosi University, Lapai, Niger State, 911101, Nigeria"}]}],"member":"2026","published-online":{"date-parts":[[2022,10,1]]},"reference":[{"key":"2022122202562622400_ref1","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1162\/dint_a_00028","article-title":"A generic workflow for the data FAIRification process","volume":"2","author":"Jacobsen","year":"2020","journal-title":"Data Intelligence"},{"key":"2022122202562622400_ref2","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1162\/dint_a_00026","article-title":"Making data and workflows findable for machines","volume":"2","author":"Weigel","year":"2020","journal-title":"Data Intelligence"},{"issue":"1","key":"2022122202562622400_ref3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR Guiding Principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Scientific Data"},{"issue":"4","key":"2022122202562622400_ref4","doi-asserted-by":"crossref","first-page":"933","DOI":"10.1016\/j.drudis.2019.01.008","article-title":"Implementation and relevance of FAIR data principles in biopharmaceutical R & D","volume":"24","author":"Wise","year":"2019","journal-title":"Drug Discovery Today"},{"key":"2022122202562622400_ref5","volume-title":"Machine learning pipelines: Provenance, reproducibility and FAIR Data Principles [Online]","author":"Samuel","year":"22  2020"},{"key":"2022122202562622400_ref6","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1162\/dint_a_00033","article-title":"FAIR computational workflow","volume":"2","author":"Goble","year":"2020","journal-title":"Data Intelligence"},{"key":"2022122202562622400_ref7","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1016\/j.asoc.2019.01.042","article-title":"A new soft computing model for estimating and controlling blast-produced ground vibration based on hierarchical K-means clustering and cubist algorithms","volume":"77","author":"Nguyen","year":"2019","journal-title":"Applied Soft Computing Journal"},{"key":"2022122202562622400_ref8","volume-title":"Novel hybrid hierarchical-k-means clustering method (H-K-means) for microarray analysis","author":"Chen","year":"2005"},{"key":"2022122202562622400_ref9","volume-title":"Hybrid hierarchical clustering: An experimental analysis","author":"Murugesan","year":"2011"},{"key":"2022122202562622400_ref10","volume-title":"ProvBook [Online]","author":"GitHub","year":"2021"},{"key":"2022122202562622400_ref11","first-page":"17","volume-title":"REPRODUCE-ME: Ontology-based data access for reproducibility of microscopy experiments","author":"Samuel","year":"2017"},{"key":"2022122202562622400_ref12","first-page":"51","volume-title":"Hierarchical k-means: A hybrid clustering algorithm and its application to study gene expression in lung adenocarcinoma","author":"Hasan","year":"2015"},{"key":"2022122202562622400_ref13","doi-asserted-by":"crossref","first-page":"106424","DOI":"10.1016\/j.dib.2020.106424","article-title":"An exploratory assessment of a multidimensional healthcare and economic data on COVID-19 in Nigeria","volume":"33","author":"Ogundepo","year":"2020","journal-title":"Data in Brief"},{"key":"2022122202562622400_ref14","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1162\/dint_a_00051","article-title":"Considerations for the conduction and interpretation of FAIRness evaluations","volume":"2","author":"De Miranda Azevedo","year":"2020","journal-title":"Data Intelligence"},{"key":"2022122202562622400_ref15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/sdata.2018.118","article-title":"Comment: A design framework and exemplar metrics for FAIRness","volume":"5","author":"Wilkinson","year":"2018","journal-title":"Scientific Data"},{"key":"2022122202562622400_ref16","volume-title":"From raw data to FAIR Data: The FAIRification workflow for health research","author":"Sinaci","year":"2020"},{"key":"2022122202562622400_ref17","volume-title":"ML-schema: Exposing the semantics of machine learning with schemas and ontologies","author":"Publio","year":"2018"},{"key":"2022122202562622400_ref18","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1162\/dint_a_00031","article-title":"Making FAIR easy with FAIR tools: From creolization to convergence","volume":"2","author":"Thompson","year":"2020","journal-title":"Data Intelligence"},{"issue":"1","key":"2022122202562622400_ref19","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1038\/s41597-019-0184-5","article-title":"Evaluating FAIR maturity through a scalable, automated, community-governed framework","volume":"6","author":"Wilkinson","year":"2019","journal-title":"Scientific Data"},{"key":"2022122202562622400_ref20","volume-title":"REPRODUCE-ME [Online]","author":"GitHub","year":"2021"},{"key":"2022122202562622400_ref21","volume-title":"Library of ontologies [Online]","author":"BioPortal","year":"2021"},{"key":"2022122202562622400_ref22","volume-title":"ProvBook: Provenance-based semantic enrichment of interactive notebooks for reproducibility","author":"Samuel","year":"2018"},{"key":"2022122202562622400_ref23","volume-title":"The P-Plan ontology [Online]","author":"Garijo","year":"2014"},{"key":"2022122202562622400_ref24","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-58451-5_19","article-title":"Integrative data management for reproducibility of microscopy experiments","volume-title":"The Semantic Web. ESWC 2017. Lecture Notes in Computer Science","author":"Samuel","year":"2017"},{"key":"2022122202562622400_ref25","volume-title":"Clustering-analysis [Online]","author":"GitHub","year":"2021"},{"key":"2022122202562622400_ref26","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1016\/S0378-3758(02)00388-9","article-title":"A new algorithm for hybrid hierarchical clustering with visualization and the bootstrap","volume":"117","author":"Van der Laan","year":"2003","journal-title":"Journal of Statistical Planning and Inference"},{"issue":"3","key":"2022122202562622400_ref27","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1504\/IJDMB.2010.033523","article-title":"Using hybrid hierarchical k-means (HHK) clustering algorithm for protein sequence motif super-rule-tree (SRT) structure construction","volume":"4","author":"Chen","year":"2010","journal-title":"International Journal of Data Mining and Bioinformatics"},{"key":"2022122202562622400_ref28","first-page":"90","article-title":"Review on determining number of cluster in K-means clustering","volume":"1","author":"Kodinariya","year":"2013","journal-title":"International Journal of Advanced Research in Computer Science and Management Studies"},{"key":"2022122202562622400_ref29","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1016\/S0167-9473(98)00102-9","article-title":"Displaying a clustering with CLUSPLOT","volume":"30","author":"Pison","year":"1999","journal-title":"Computational Statistics & Data Analysis"},{"key":"2022122202562622400_ref30","volume-title":"These cities are the hubs of Africa's economic boom [Online]","author":"Jacobs","year":"4  2018"},{"key":"2022122202562622400_ref31","volume-title":"Africa's biggest shipping ports [Online]","author":"Businesstech","year":"8  2015"},{"key":"2022122202562622400_ref32","volume-title":"Kaduna [Online]","author":"Wikipedia","year":"2021"},{"key":"2022122202562622400_ref33","volume-title":"Abia State [Online]","author":"Wikipedia","year":"2021"}],"container-title":["Data Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/4\/971\/2063789\/dint_a_00182.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/dint\/article-pdf\/4\/4\/971\/2063789\/dint_a_00182.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,14]],"date-time":"2025-03-14T07:42:41Z","timestamp":1741938161000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.sciengine.com\/doi\/10.1162\/dint_a_00182"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"references-count":33,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,10,1]]}},"URL":"https:\/\/doi.org\/10.1162\/dint_a_00182","relation":{},"ISSN":["2641-435X"],"issn-type":[{"value":"2641-435X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022]]},"published":{"date-parts":[[2022]]}}}