{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T21:54:20Z","timestamp":1773352460986,"version":"3.50.1"},"reference-count":63,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,6,15]],"date-time":"2023-06-15T00:00:00Z","timestamp":1686787200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Big Data"],"abstract":"<jats:p>Collaborations between scientists from the global north and global south (N-S collaborations) are a key driver of the \u201cfourth paradigm of science\u201d and have proven crucial to addressing global crises like COVID-19 and climate change. However, despite their critical role, N-S collaborations on datasets are not well understood. Science of science studies tend to rely on publications and patents to examine N-S collaboration patterns. To this end, the rise of global crises requiring N-S collaborations to produce and share data presents an urgent need to understand the prevalence, dynamics, and political economy of N-S collaborations on research datasets. In this paper, we employ a mixed methods case study research approach to analyze the frequency of and division of labor in N-S collaborations on datasets submitted to GenBank over 29 years (1992\u20132021). We find: (1) there is a low representation of N-S collaborations over the 29-year period. 
When they do occur, N-S collaborations display \u201cburstiness\u201d patterns, suggesting that N-S collaborations on datasets are formed and maintained reactively in the wake of global health crises such as infectious disease outbreaks; (2) The division of labor between datasets and publications is disproportionate to the global south in the early years, but becomes more overlapping after 2003. An exception is the case of countries with lower S&amp;T capacity but high income, where these countries have a higher prevalence on datasets (e.g., United Arab Emirates). We qualitatively inspect a sample of N-S dataset collaborations to identify leadership patterns in dataset and publication authorship. The findings lead us to argue there is a need to include N-S dataset collaborations in measures of research outputs to nuance the current models and assessment tools of equity in N-S collaborations. The paper contributes to the SDGs objectives to develop data-driven metrics that can inform scientific collaborations on research datasets.<\/jats:p>","DOI":"10.3389\/fdata.2023.1054655","type":"journal-article","created":{"date-parts":[[2023,6,15]],"date-time":"2023-06-15T05:50:08Z","timestamp":1686808208000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["North-south scientific collaborations on research datasets: a longitudinal analysis of the division of labor on genomic datasets 
(1992\u20132021)"],"prefix":"10.3389","volume":"6","author":[{"given":"Sarah","family":"Bratt","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mrudang","family":"Langalia","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Abhishek","family":"Nanoti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2023,6,15]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"1111","DOI":"10.1111\/1467-6486.00326","article-title":"Formal mentoring systems: an examination of the effects of mentor\/prot\u00e9g\u00e9 cognitive styles on the mentoring process","volume":"39","author":"Armstrong","year":"2002","journal-title":"J. Manage. Stud."},{"key":"B2","doi-asserted-by":"publisher","first-page":"30524","DOI":"10.3402\/gha.v9.30524","article-title":"North\u2013south collaboration and capacity development in global health research in low- and middle-income countries \u2013 the ARCADE projects","volume":"9","author":"Atkins","year":"2016","journal-title":"Glob. Health Act."},{"key":"B3","doi-asserted-by":"publisher","first-page":"330","DOI":"10.1038\/d41586-019-00350-3","article-title":"Small research teams \u201cdisrupt\u201d science more radically than large ones","volume":"566","author":"Azoulay","year":"2019","journal-title":"Nature"},{"key":"B4","doi-asserted-by":"publisher","first-page":"e299","DOI":"10.1371\/journal.pmed.0030299","article-title":"Evaluating health research capacity building: an evidence-based tool","volume":"3","author":"Bates","year":"2006","journal-title":"PLoS Med."},{"key":"B5","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1007\/BF02016308","article-title":"Studies in scientific collaboration Part III. 
Professionalization and the natural history of modern scientific co-authorship","volume":"1","author":"Beaver","year":"2005","journal-title":"Scientometrics"},{"key":"B6","doi-asserted-by":"publisher","first-page":"D41","DOI":"10.1093\/nar\/gkx1094","article-title":"GenBank","volume":"46","author":"Benson","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"B7","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/978-1-84882-854-4_15","article-title":"\u201cCollaboration in metagenomics: Sequence databases and the organization of scientific work,\u201d","volume-title":"ECSCW 2009","author":"Bietz","year":"2009"},{"key":"B8","doi-asserted-by":"publisher","first-page":"716","DOI":"10.1504\/IJTM.2001.002988","article-title":"Scientific and technical human capital: an alternative model for research evaluation","volume":"22","author":"Bozeman","year":"2001","journal-title":"Int. J. Technol. Manage."},{"key":"B9","volume-title":"Research data management practices and impacts on long-term data sustainability: an institutional exploration","author":"Bratt","year":"2022"},{"key":"B10","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1002\/pra2.2017.14505401005","article-title":"Big data, big metadata and quantitative study of science: a workflow model for big scientometrics","volume":"54","author":"Bratt","year":"2017","journal-title":"Proc. Assoc. Inform. Sci. Technol."},{"key":"B11","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1186\/s12961-015-0048-1","article-title":"SDH-NET: a South\u2013North-South collaboration to build sustainable research capacities on social determinants of health in low- and middle-income countries","volume":"13","author":"Cash-Gibson","year":"2015","journal-title":"Health Res. 
Policy Syst."},{"key":"B12","doi-asserted-by":"publisher","first-page":"642","DOI":"10.1002\/pra2.676","article-title":"A preliminary analysis of geography of collaboration in data papers by S&T capacity index","volume":"59","author":"Chen","year":"2022","journal-title":"Proc. Assoc. Inform. Sci. Technol."},{"key":"B13","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1525\/hsps.2003.33.2.261","article-title":"LIGO becomes big science","volume":"33","author":"Collins","year":"2003","journal-title":"Hist. Stud. Phys. Biol. Sci."},{"key":"B14","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/s11192-016-1954-x","article-title":"Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank","volume":"108","author":"Costa","year":"2016","journal-title":"Scientometrics"},{"key":"B15","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1109\/JCDL.2014.6970197","article-title":"\u201cResearch networks in data repositories,\u201d","volume-title":"IEEE\/ACM Joint Conference on Digital Libraries","author":"Costa","year":"2014"},{"key":"B16","author":"Crane","year":"1972","journal-title":"Invisible colleges: Diffusion of knowledge in scientific communities"},{"key":"B17","volume-title":"Qualitative Inquiry and Research Design: Choosing Among Five Approaches","author":"Creswell","year":"2016"},{"key":"B18","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1038\/d41586-021-00065-4","article-title":"Alarming COVID variants show vital role of genomic surveillance","volume":"589","author":"Cyranoski","year":"2021","journal-title":"Nature"},{"key":"B19","unstructured":"2023"},{"key":"B20","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1038\/s41588-022-01071-6","article-title":"Promoting the genomic revolution in Africa through the Nigerian 100K Genome Project","volume":"54","author":"Fatumo","year":"2022","journal-title":"Nat. 
Genet."},{"key":"B21","doi-asserted-by":"publisher","first-page":"e0258286","DOI":"10.1371\/journal.pone.0258286","article-title":"Considering equity in global health collaborations: a qualitative study on experiences of equity","volume":"16","author":"Faure","year":"2021","journal-title":"PLoS ONE"},{"key":"B22","doi-asserted-by":"publisher","DOI":"10.1126\/science.aao0185","article-title":"Science of science","author":"Fortunato","year":"2018","journal-title":"Science"},{"key":"B23","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1002\/asi.21688","article-title":"Mapping world scientific collaboration: authors, institutions, and countries","volume":"63","author":"Gazni","year":"2012","journal-title":"J. Am. Soc. Inform. Sci. Technol."},{"key":"B24","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1007\/1-4020-2755-9_12","article-title":"\u201cAnalysing scientific networks through co-authorship,\u201d","volume-title":"Handbook of quantitative science and technology research: The use of publication and patent statistics in studies of S&T systems","author":"Gl\u00e4nzel","year":"2005"},{"key":"B25","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1038\/s41562-022-01351-5","article-title":"Leading countries in global science increasingly receive more citations than other countries doing similar research","volume":"6","author":"Gomez","year":"2022","journal-title":"Nat. Hum. Behav."},{"key":"B26","doi-asserted-by":"publisher","first-page":"e1009277","DOI":"10.1371\/journal.pcbi.1009277","article-title":"Ten simple rules for Global North researchers to stop perpetuating helicopter research in the Global South","volume":"17","author":"Haelewaters","year":"2021","journal-title":"PLoS Comput. 
Biol."},{"key":"B27","doi-asserted-by":"publisher","first-page":"385","DOI":"10.1080\/01972240290108195","article-title":"Strong, weak, and latent ties and the impact of new media","volume":"18","author":"Haythornthwaite","year":"2002","journal-title":"Inform. Soc."},{"key":"B28","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1002\/pra2.608","article-title":"Collaboration networks and career trajectories: what do metadata from data repositories tell us?","volume":"59","author":"Hemsley","year":"2022","journal-title":"Proc. Assoc. Inf. Sci. Technol."},{"key":"B29","doi-asserted-by":"crossref","first-page":"793","DOI":"10.1177\/1403494818812637","article-title":"Achieving the SDGs through interdisciplinary research in global health","volume":"47","author":"Herzig Van Wees","year":"2019","journal-title":"Scand. J. Public Health"},{"key":"B30","volume-title":"The Fourth Paradigm: Data-Intensive Scientific Discovery","author":"Hey","year":"2009"},{"key":"B31","doi-asserted-by":"publisher","first-page":"595299","DOI":"10.3389\/frma.2020.595299","article-title":"Real-time bibliometrics: dimensions as a resource for analyzing aspects of COVID-19","volume":"5","author":"Hook","year":"2021","journal-title":"Front. Res. Metrics Anal."},{"key":"B32","doi-asserted-by":"publisher","first-page":"912","DOI":"10.1162\/qss_a_00228","article-title":"Recalibrating the scope of scholarly publishing: a modest step in a vast decolonization process","volume":"3","author":"Khanna","year":"2022","journal-title":"Quant. Sci. Stud."},{"key":"B33","first-page":"146","volume-title":"Resource sharing: the invisible service. 
State library services and issues: facing future challenges","author":"Krueger","year":"1986"},{"key":"B34","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1057\/s41271-016-0002-7","article-title":"Transforming our world: implementing the 2030 agenda through sustainable development goal indicators","volume":"37","author":"Lee","year":"2016","journal-title":"J. Public Health Policy"},{"key":"B35","doi-asserted-by":"crossref","first-page":"e009067","DOI":"10.1136\/bmjgh-2022-009067","article-title":"The use, misuse and overuse of the \u2018low-income and middle-income countries' category","volume":"7","author":"Lencucha","year":"2022","journal-title":"BMJ Global Health"},{"key":"B36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/pra2.2016.14505301072","article-title":"Software citation, reuse and metadata considerations: an exploratory study examining LAMMPS","volume":"53","author":"Li","year":"2016","journal-title":"Proc. Assoc. Inform. Sci. Technol."},{"key":"B37","doi-asserted-by":"publisher","DOI":"10.1038\/d41586-021-02549-9","article-title":"Researchers from global south under-represented in development research","author":"Liverpool","year":"2021","journal-title":"Nature"},{"key":"B38","doi-asserted-by":"publisher","first-page":"4975","DOI":"10.1007\/s11192-021-03971-6","article-title":"The sharing of research data facing the COVID-19 pandemic","volume":"126","author":"Lucas-Dominguez","year":"2021","journal-title":"Scientometrics"},{"key":"B39","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1007\/978-0-387-34872-8_20","article-title":"\u201cSocial shaping of information infrastructure: on being specific about the technology,\u201d","volume-title":"Information Technology and Changes in Organizational Work: Proceedings of the IFIP WG8. 
2 Working Conference on Information Technology and Changes in Organizational Work","author":"Monteiro","year":"1996"},{"key":"B40","year":"2022"},{"key":"B41","doi-asserted-by":"publisher","first-page":"016131","DOI":"10.1103\/PhysRevE.64.016131","article-title":"Scientific collaboration networks. I. Network construction and fundamental results","volume":"64","author":"Newman","year":"2001","journal-title":"Phys. Rev. E"},{"key":"B42","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1186\/s12992-022-00898-2","article-title":"Bridging the genomic data gap in Africa: implications for global disease burdens","volume":"18","author":"Omotoso","year":"2022","journal-title":"Global. Health"},{"key":"B43","doi-asserted-by":"crossref","first-page":"119","DOI":"10.7312\/pric91844","volume-title":"Big Science, Little Science","author":"Price","year":"1963"},{"key":"B44","article-title":"OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts","author":"Priem","year":"2022","journal-title":"arXiv [Preprint]."},{"key":"B45","first-page":"1235","article-title":"Methodological and technical challenges in big scientometric data analytics","volume":"26","author":"Qin","year":"2009","journal-title":"Mol. Biol. Evol."},{"key":"B46","doi-asserted-by":"publisher","first-page":"100412","DOI":"10.1016\/j.patter.2021.100412","article-title":"African genomic data sharing and the struggle for equitable benefit","volume":"3","author":"Ramsay","year":"2022","journal-title":"Patterns"},{"key":"B47","unstructured":"RCSB PDB: Homepage, 2023"},{"key":"B48","unstructured":"Rogers, E. M. Diffusion of Innovations. Simon and Schuster, 2010"},{"key":"B49","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1016\/S2055-6640(20)31093-1","article-title":"The impact of Thailand's public health response to the HIV epidemic 1984\u20132015: understanding the ingredients of success","volume":"2","author":"Siraprapasiri","year":"2016","journal-title":"J. 
Virus Erad."},{"key":"B50","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1016\/j.cell.2019.02.048","article-title":"The missing diversity in human genetic studies","volume":"177","author":"Sirugo","year":"2019","journal-title":"Cell"},{"key":"B51","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1080\/21624887.2020.1760587","article-title":"Algorithmic warfare and the reinvention of accuracy","volume":"8","author":"Suchman","year":"2020","journal-title":"Crit. Stud. Security"},{"key":"B52","doi-asserted-by":"publisher","first-page":"03063127221104938","DOI":"10.1177\/03063127221104938","article-title":"Imaginaries of omniscience: automating intelligence in the US Department of Defense","volume":"23","author":"Suchman","year":"2022","journal-title":"Soc. Stud. Sci."},{"key":"B53","article-title":"\u201cGray's laws: database-centric computing in science Published in: Tansley, S., & Tolle, K. M. (2009),\u201d","volume-title":"The Fourth Paradigm: Data-Intensive Scientific Discovery (Vol. 1)","author":"Szalay","year":"2009"},{"key":"B54","doi-asserted-by":"publisher","first-page":"104318","DOI":"10.1016\/j.marpol.2020.104318","article-title":"The usual suspects? 
Distribution of collaboration capital in marine biodiversity research","volume":"124","author":"Tolochko","year":"2021","journal-title":"Marine Policy"},{"key":"B55","doi-asserted-by":"publisher","first-page":"375","DOI":"10.1126\/science.aaa3581","article-title":"Big science is hard but worth it","volume":"348","author":"Turner","year":"2015","journal-title":"Science"},{"key":"B56","doi-asserted-by":"publisher","first-page":"e93376","DOI":"10.1371\/journal.pone.0093376","article-title":"International scientific collaboration in HIV and HPV: a network analysis","volume":"9","author":"Vanni","year":"2014","journal-title":"PLoS ONE"},{"key":"B57","doi-asserted-by":"publisher","first-page":"1079","DOI":"10.1007\/s10734-020-00600-8","article-title":"The emergence of the higher education research field (1976\u20132018): preferential attachment, smallworldness and fragmentation in its collaboration networks","volume":"81","author":"Vlegels","year":"2021","journal-title":"Higher Educ."},{"key":"B58","volume-title":"Science and Technology Collaboration: Building Capability in Developing Countries","author":"Wagner","year":"2001"},{"key":"B59","author":"Wagner","year":"2015"},{"key":"B60","article-title":"Measuring the globalization of knowledge networks","author":"Wagner","year":"2009","journal-title":"arXiv [Preprint]"},{"key":"B61","doi-asserted-by":"crossref","DOI":"10.1017\/9781108610834","volume-title":"The Science of Science","author":"Wang","year":"2021"},{"key":"B62","doi-asserted-by":"publisher","first-page":"e2200927119","DOI":"10.1073\/pnas.2200927119","article-title":"Flat teams drive scientific innovation","volume":"119","author":"Xu","year":"2022","journal-title":"Proc. Nat. Acad. 
Sci."},{"key":"B63","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1007\/s11192-020-03531-4","article-title":"How scientific research reacts to international public health emergencies: a global analysis of response patterns","volume":"124","author":"Zhang","year":"2020","journal-title":"Scientometrics"}],"container-title":["Frontiers in Big Data"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdata.2023.1054655\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,15]],"date-time":"2023-06-15T05:50:28Z","timestamp":1686808228000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdata.2023.1054655\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,15]]},"references-count":63,"alternative-id":["10.3389\/fdata.2023.1054655"],"URL":"https:\/\/doi.org\/10.3389\/fdata.2023.1054655","relation":{},"ISSN":["2624-909X"],"issn-type":[{"value":"2624-909X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,15]]},"article-number":"1054655"}}