{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:46:59Z","timestamp":1753876019153,"version":"3.41.2"},"reference-count":18,"publisher":"Oxford University Press (OUP)","funder":[{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1444573"],"award-info":[{"award-number":["1444573"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"name":"US Department of Agriculture\u2019s National Agricultural Library"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Data and metadata interoperability between data storage systems is a critical component of the FAIR data principles. Programmatic and consistent means of reconciling metadata models between databases promote data exchange and thus increase access to data for the scientific community. This process requires (i) metadata mapping between the models and (ii) software to perform the mapping. Here, we describe our efforts to map metadata associated with genome assemblies between the National Center for Biotechnology Information (NCBI) data resources and the Chado biological database schema. We present mappings for multiple NCBI data structures and introduce a Tripal software module, Tripal EUtils, to pull metadata from NCBI into a Tripal\/Chado database. 
We discuss potential mapping challenges and solutions and provide suggestions for future development to further increase interoperability between these platforms.<\/jats:p><jats:p>Database URL: https:\/\/github.com\/NAL-i5K\/tripal_eutils<\/jats:p>","DOI":"10.1093\/database\/baz143","type":"journal-article","created":{"date-parts":[[2019,11,22]],"date-time":"2019-11-22T20:09:30Z","timestamp":1574453370000},"source":"Crossref","is-referenced-by-count":1,"title":["Tripal EUtils: a Tripal module to increase exchange and reuse of genome assembly metadata"],"prefix":"10.1093","volume":"2020","author":[{"given":"B","family":"Condon","sequence":"first","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"A","family":"Almsaeed","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"S","family":"Buehler","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"C P","family":"Childers","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"S P","family":"Ficklin","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2971-9353","authenticated-orcid":false,"given":"M E","family":"Staton","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4584-6056","authenticated-orcid":false,"given":"M F","family":"Poelchau","sequence":"additional","affiliation":[{"name":"United States Department of Agriculture, Agricultural Research Service, National Agricultural Library, 10301 Baltimore Avenue, Beltsville, MD 20705, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,1,21]]},"reference":[{"key":"2020013000364830400_ref1","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci. 
Data."},{"key":"2020013000364830400_ref2","doi-asserted-by":"crossref","first-page":"D23","DOI":"10.1093\/nar\/gky1069","article-title":"Database resources of the National Center for Biotechnology Information","volume":"47","author":"Sayers","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2020013000364830400_ref3","first-page":"D714","article-title":"The i5k Workspace@NAL\u2014enabling genomic data access, visualization and curation of arthropod genomes","volume":"43.D1","author":"Poelchau","year":"2014","journal-title":"Nucleic Acids Res."},{"year":"2018","author":"FAIRsharing Team","key":"2020013000364830400_ref4"},{"key":"2020013000364830400_ref5","doi-asserted-by":"crossref","DOI":"10.1093\/database\/baz077","article-title":"Tripal v3: an ontology-based toolkit for construction of FAIR biological community databases","volume":"2019","author":"Spoor","year":"2019","journal-title":"Database"},{"key":"2020013000364830400_ref6","doi-asserted-by":"crossref","first-page":"i337","DOI":"10.1093\/bioinformatics\/btm189","article-title":"A Chado case study: an ontology-based modular schema for representing genome-associated biological information","volume":"23","author":"Mungall","year":"2007","journal-title":"Bioinformatics"},{"year":"2010","author":"Sayers","article-title":"A general introduction to the E-utilities","key":"2020013000364830400_ref7"},{"key":"2020013000364830400_ref8","doi-asserted-by":"crossref","DOI":"10.1002\/0471250953.bi0906s12","article-title":"Using Chado to store genome annotation data","author":"Zhou","year":"2006","journal-title":"Curr. Protoc. 
Bioinformatics"},{"key":"2020013000364830400_ref9","doi-asserted-by":"crossref","first-page":"D759","DOI":"10.1093\/nar\/gky1003","article-title":"FlyBase 2.0: the next generation","volume":"47","author":"Thurmond","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2020013000364830400_ref10","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2020013000364830400_ref11","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1186\/2041-1480-5-14","article-title":"The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery","volume":"5","author":"Dumontier","year":"2014","journal-title":"J. Biomed. Semantics"},{"key":"2020013000364830400_ref12","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1093\/bioinformatics\/btt113","article-title":"EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats","volume":"29","author":"Ison","year":"2013","journal-title":"Bioinformatics"},{"key":"2020013000364830400_ref13","doi-asserted-by":"crossref","first-page":"D57","DOI":"10.1093\/nar\/gkr1163","article-title":"BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata","volume":"40","author":"Barrett","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2020013000364830400_ref14","doi-asserted-by":"crossref","first-page":"D73","DOI":"10.1093\/nar\/gkv1226","article-title":"Assembly: a resource for assembled genomes at NCBI","volume":"44","author":"Kitts","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"2020013000364830400_ref15","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nbt.1823","article-title":"Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) 
specifications","volume":"29","author":"Yilmaz","year":"2011","journal-title":"Nat. Biotechnol."},{"key":"2020013000364830400_ref16","first-page":"118","article-title":"A new ontology lookup service at EMBL-EBI","volume-title":"Proceedings of the 8th International Conference on Semantic Web Applications and Tools for Life Sciences","author":"Jupp","year":"2015"},{"year":"2008","author":"Sayers","article-title":"E-utilities quick start","key":"2020013000364830400_ref17"},{"key":"2020013000364830400_ref18","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1038\/ng.1054","article-title":"Toward interoperable bioscience data","volume":"44","author":"Sansone","year":"2012","journal-title":"Nat. Genet."}],"container-title":["Database"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baz143\/31953627\/baz143.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baz143\/31953627\/baz143.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,6]],"date-time":"2022-10-06T19:43:03Z","timestamp":1665085383000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baz143\/5709695"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,1,1]]},"references-count":18,"URL":"https:\/\/doi.org\/10.1093\/database\/baz143","relation":{},"ISSN":["1758-0463"],"issn-type":[{"type":"electronic","value":"1758-0463"}],"subject":[],"published-other":{"date-parts":[[2020]]},"published":{"date-parts":[[2020,1,1]]},"article-number":"baz143"}}