{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T22:10:12Z","timestamp":1773785412209,"version":"3.50.1"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"D1","license":[{"start":{"date-parts":[[2021,11,8]],"date-time":"2021-11-08T00:00:00Z","timestamp":1636329600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100012116","name":"EMBL-EBI","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100012116","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010269","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["201535\/Z\/16\/Z"],"award-info":[{"award-number":["201535\/Z\/16\/Z"]}],"id":[{"id":"10.13039\/100010269","id-type":"DOI","asserted-by":"publisher"}]},{"name":"FAIRplus","award":["802750"],"award-info":[{"award-number":["802750"]}]},{"name":"ELIXIR"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,1,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http:\/\/www.ebi.ac.uk\/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https:\/\/www.ebi.ac.uk\/about\/terms-of-use. The BioSamples code is available at https:\/\/github.com\/EBIBioSamples\/biosamples-v4 and distributed under the Apache\u00a02.0 license.<\/jats:p>","DOI":"10.1093\/nar\/gkab1046","type":"journal-article","created":{"date-parts":[[2021,10,16]],"date-time":"2021-10-16T10:07:08Z","timestamp":1634378828000},"page":"D1500-D1507","source":"Crossref","is-referenced-by-count":57,"title":["BioSamples database: FAIRer samples metadata to accelerate research data management"],"prefix":"10.1093","volume":"50","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9551-6370","authenticated-orcid":false,"given":"M\u00e9lanie","family":"Courtot","sequence":"first","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8753-7369","authenticated-orcid":false,"given":"Dipayan","family":"Gupta","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4839-5158","authenticated-orcid":false,"given":"Isuru","family":"Liyanage","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5923-3859","authenticated-orcid":false,"given":"Fuqi","family":"Xu","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2513-5396","authenticated-orcid":false,"given":"Tony","family":"Burdett","sequence":"additional","affiliation":[{"name":"European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK"}]}],"member":"286","published-online":{"date-parts":[[2021,11,8]]},"reference":[{"key":"2022010507325595000_B1","doi-asserted-by":"crossref","first-page":"D121","DOI":"10.1093\/nar\/gkaa967","article-title":"The international nucleotide sequence database collaboration","volume":"49","author":"Arita","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B2","doi-asserted-by":"crossref","first-page":"D1172","DOI":"10.1093\/nar\/gky1061","article-title":"BioSamples database: an updated sample metadata hub","volume":"47","author":"Courtot","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B3","doi-asserted-by":"crossref","first-page":"2422","DOI":"10.12688\/f1000research.9656.2","article-title":"Identifying ELIXIR core data resources","volume":"5","author":"Durinx","year":"2017","journal-title":"F1000Research"},{"key":"2022010507325595000_B4","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1089\/big.2014.0068","article-title":"Data integration for heterogenous datasets","volume":"2","author":"Hendler","year":"2014","journal-title":"Big Data"},{"key":"2022010507325595000_B5","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1186\/s12874-020-01057-0","article-title":"The challenges in data integration \u2013 heterogeneity and complexity in clinical trials and patient registries of Systemic Lupus Erythematosus","volume":"20","author":"Le\u00a0Sueur","year":"2020","journal-title":"BMC Med. Res. Methodol."},{"key":"2022010507325595000_B6","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1038\/nrd4626","article-title":"SLC transporters as therapeutic targets: emerging opportunities","volume":"14","author":"Lin","year":"2015","journal-title":"Nat. Rev. Drug Discov."},{"key":"2022010507325595000_B7","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1100\/tsw.2009.57","article-title":"Minimum Information About a Microarray Experiment (MIAME)\u2014successes, failures, challenges","volume":"9","author":"Brazma","year":"2009","journal-title":"ScientificWorldJournal"},{"key":"2022010507325595000_B8","doi-asserted-by":"crossref","first-page":"D991","DOI":"10.1093\/nar\/gks1193","article-title":"NCBI GEO: archive for functional genomics data sets\u2014update","volume":"41","author":"Barrett","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B9","doi-asserted-by":"crossref","first-page":"D711","DOI":"10.1093\/nar\/gky964","article-title":"ArrayExpress update \u2013 from bulk to single-cell expression data","volume":"47","author":"Athar","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B10","doi-asserted-by":"crossref","first-page":"25","DOI":"10.7171\/jbt.18-2902-002","article-title":"The cellosaurus, a cell-line knowledge resource","volume":"29","author":"Bairoch","year":"2018","journal-title":"J. Biomol. Tech."},{"key":"2022010507325595000_B11","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1111\/nph.16544","article-title":"Enabling reusability of plant phenomic datasets with MIAPPE 1.1","volume":"227","author":"Papoutsoglou","year":"2020","journal-title":"New Phytol."},{"key":"2022010507325595000_B12","doi-asserted-by":"crossref","first-page":"1671403","DOI":"10.34133\/2019\/1671403","article-title":"Applying FAIR principles to plant phenotypic data management in GnpIS","volume":"2019","author":"Pommier","year":"2019","journal-title":"Plant Phenomics"},{"key":"2022010507325595000_B13","doi-asserted-by":"crossref","first-page":"D28","DOI":"10.1093\/nar\/gkq967","article-title":"The european nucleotide archive","volume":"39","author":"Leinonen","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B14","doi-asserted-by":"crossref","first-page":"S28","DOI":"10.1097\/01.PAT.0000461407.88852.73","article-title":"The global alliance for genomics and health: towards international sharing of genomic and clinical data","volume":"47","author":"North","year":"2015","journal-title":"Pathology"},{"key":"2022010507325595000_B15","doi-asserted-by":"crossref","first-page":"giab060","DOI":"10.1093\/gigascience\/giab060","article-title":"ISA API: An open platform for interoperable life science experimental metadata","volume":"10","author":"Johnson","year":"2021","journal-title":"GigaScience"},{"key":"2022010507325595000_B16","doi-asserted-by":"crossref","first-page":"D82","DOI":"10.1093\/nar\/gkaa1028","article-title":"The european nucleotide archive in 2020","volume":"49","author":"Harrison","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B17","doi-asserted-by":"crossref","DOI":"10.1038\/s41586-021-03767-x","article-title":"Mapping the human genetic architecture of COVID-19","author":"Covid- Host Genetics Initiative","year":"2021","journal-title":"Nature"},{"key":"2022010507325595000_B18","doi-asserted-by":"crossref","DOI":"10.1101\/2020.11.20.20227355","article-title":"Single cell profiling of COVID-19 patients: an international data resource from multiple tissues","author":"Chan Zuckerberg Initiative Single-Cell Covid Consortia","year":"2020"},{"key":"2022010507325595000_B19","doi-asserted-by":"crossref","first-page":"D246","DOI":"10.1093\/nar\/gkx1158","article-title":"Expression Atlas: gene and protein expression across multiple studies and organisms","volume":"46","author":"Papatheodorou","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B20","doi-asserted-by":"crossref","first-page":"D57","DOI":"10.1093\/nar\/gkr1163","article-title":"BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata","volume":"40","author":"Barrett","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B21","doi-asserted-by":"crossref","first-page":"W619","DOI":"10.1093\/nar\/gkab417","article-title":"The COVID-19 Data Portal: accelerating SARS-CoV-2 and COVID-19 research through rapid open access data sharing","volume":"49","author":"Harrison","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"2022010507325595000_B22","doi-asserted-by":"crossref","first-page":"baaa062","DOI":"10.1093\/database\/baaa062","article-title":"NCBI Taxonomy: a comprehensive update on curation, resources and tools","volume":"2020","author":"Schoch","year":"2020","journal-title":"Database"},{"key":"2022010507325595000_B23","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1038\/s41579-020-0364-5","article-title":"Tara Oceans: towards global ocean ecosystems biology","volume":"18","author":"Sunagawa","year":"2020","journal-title":"Nat. Rev. Microbiol."},{"key":"2022010507325595000_B24","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1089\/bio.2012.0003","article-title":"A minimum data set for sharing biobank samples, information, and data: MIABIS","volume":"10","author":"Norlin","year":"2012","journal-title":"Biopreserv. Biobank."},{"key":"2022010507325595000_B25","doi-asserted-by":"crossref","first-page":"e27041","DOI":"10.7554\/eLife.27041","article-title":"The human cell atlas","volume":"6","author":"Regev","year":"2017","journal-title":"Elife"},{"key":"2022010507325595000_B26","doi-asserted-by":"crossref","first-page":"692","DOI":"10.1038\/ng.3312","article-title":"The European Genome-phenome Archive of human data consented for biomedical research","volume":"47","author":"Lappalainen","year":"2015","journal-title":"Nat. Genet."},{"key":"2022010507325595000_B27","doi-asserted-by":"crossref","DOI":"10.20944\/preprints202008.0220.v1","article-title":"The PHA4GE SARS-CoV-2 contextual data specification for open genomic epidemiology","author":"Griffiths","year":"2020"},{"key":"2022010507325595000_B28","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkab960","article-title":"The European Variation Archive: a FAIR resource of genomic variation for all species","author":"Cezard","year":"2021","journal-title":"Nucleic Acids Res."}],"container-title":["Nucleic Acids Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/nar\/article-pdf\/50\/D1\/D1500\/42057802\/gkab1046.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/nar\/article-pdf\/50\/D1\/D1500\/42057802\/gkab1046.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,1,5]],"date-time":"2022-01-05T07:51:34Z","timestamp":1641369094000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/nar\/article\/50\/D1\/D1500\/6423179"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,8]]},"references-count":28,"journal-issue":{"issue":"D1","published-online":{"date-parts":[[2021,11,8]]},"published-print":{"date-parts":[[2022,1,7]]}},"URL":"https:\/\/doi.org\/10.1093\/nar\/gkab1046","relation":{},"ISSN":["0305-1048","1362-4962"],"issn-type":[{"value":"0305-1048","type":"print"},{"value":"1362-4962","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,1,7]]},"published":{"date-parts":[[2021,11,8]]}}}