{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T01:12:14Z","timestamp":1771031534614,"version":"3.50.1"},"reference-count":52,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T00:00:00Z","timestamp":1660262400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["OIA-1849227"],"award-info":[{"award-number":["OIA-1849227"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Maine EPSCoR at the University of Maine"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,30]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Environmental DNA (eDNA), as a rapidly expanding research field, stands to benefit from shared resources including sampling protocols, study designs, discovered sequences, and taxonomic assignments to sequences. High-quality community shareable eDNA resources rely heavily on comprehensive metadata documentation that captures the complex workflows covering field sampling, molecular biology lab work, and bioinformatic analyses. There are limited sources that provide documentation of database development on comprehensive metadata for eDNA and these workflows and no open-source software.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present medna-metadata, an open-source, modular system that aligns with Findable, Accessible, Interoperable, and Reusable guiding principles that support scholarly data reuse and the database and application development of a standardized metadata collection structure that encapsulates critical aspects of field data collection, wet lab processing, and bioinformatic analysis. Medna-metadata is showcased with metabarcoding data from the Gulf of Maine (Polinski et al., 2019).<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The source code of the medna-metadata web application is hosted on GitHub (https:\/\/github.com\/Maine-eDNA\/medna-metadata). Medna-metadata is a docker-compose installable package. Documentation can be found at https:\/\/medna-metadata.readthedocs.io\/en\/latest\/?badge=latest. The application is implemented in Python, PostgreSQL and PostGIS, RabbitMQ, and NGINX, with all major browsers supported. A demo can be found at https:\/\/demo.metadata.maine-edna.org\/.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac556","type":"journal-article","created":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T13:29:39Z","timestamp":1660310979000},"page":"4589-4597","source":"Crossref","is-referenced-by-count":11,"title":["medna-metadata: an open-source data management system for tracking environmental DNA samples and metadata"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3787-5121","authenticated-orcid":false,"given":"M","family":"Kimble","sequence":"first","affiliation":[{"name":"School of Computing and Information Science, University of Maine , Orono, ME 04469, USA"}]},{"given":"S","family":"Allers","sequence":"additional","affiliation":[{"name":"Department of Molecular and Biomedical Sciences, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2971-4328","authenticated-orcid":false,"given":"K","family":"Campbell","sequence":"additional","affiliation":[{"name":"School of Computing and Information Science, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9250-5887","authenticated-orcid":false,"given":"C","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computing and Information Science, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5378-0931","authenticated-orcid":false,"given":"L M","family":"Jackson","sequence":"additional","affiliation":[{"name":"Advanced Research Computing, Security and Information Management, University of Maine , Orono, ME 04469, USA"},{"name":"Maine EPSCoR, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6463-1336","authenticated-orcid":false,"given":"B L","family":"King","sequence":"additional","affiliation":[{"name":"Department of Molecular and Biomedical Sciences, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5497-3295","authenticated-orcid":false,"given":"S","family":"Silverbrand","sequence":"additional","affiliation":[{"name":"School of Marine Sciences, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9952-1268","authenticated-orcid":false,"given":"G","family":"York","sequence":"additional","affiliation":[{"name":"Environmental DNA Laboratory, Coordinated Operating Research Entities, University of Maine , Orono, ME 04469, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7703-0270","authenticated-orcid":false,"given":"K","family":"Beard","sequence":"additional","affiliation":[{"name":"School of Computing and Information Science, University of Maine , Orono, ME 04469, USA"}]}],"member":"286","published-online":{"date-parts":[[2022,8,12]]},"reference":[{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1093\/bioinformatics\/bty820","article-title":"Parkour LIMS: high-quality sample preparation in next generation sequencing","volume":"35","author":"Anatskiy","year":"2019","journal-title":"Bioinformatics"},{"key":"2023041408242839500_","first-page":"49","volume-title":"Publishing DNA-Derived Data through Biodiversity Data Platforms, Version 1.0.","author":"Andersson","year":"2020"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"C143","DOI":"10.1107\/S0108767311096486","article-title":"MyTARDIS: managing the lifecycle of crystallography data","volume":"67","author":"Androulakis","year":"2011","journal-title":"Acta Crystallogr. A Found. Crystallogr"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"e1005755","DOI":"10.1371\/journal.pcbi.1005755","article-title":"Unmet needs for analyzing biological big data: a survey of 704 NSF principal investigators","volume":"13","author":"Barone","year":"2017","journal-title":"PLoS Comput. Biol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1016\/j.tim.2015.08.009","article-title":"Microbial malaise: how can we classify the microbiome?","volume":"23","author":"Beiko","year":"2015","journal-title":"Trends Microbiol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1935","DOI":"10.1098\/rstb.2005.1725","article-title":"Defining operational taxonomic units using DNA barcode data","volume":"360","author":"Blaxter","year":"2005","journal-title":"Philos. Trans. R Soc. B Biol. Sci"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1186\/s40168-018-0470-z","article-title":"Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2\u2019s q2-feature-classifier plugin","volume":"6","author":"Bokulich","year":"2018","journal-title":"Microbiome"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","DOI":"10.3897\/ab.e68634","volume-title":"A Practical Guide to DNA-Based Methods for Biodiversity Assessment","author":"Bruce","year":"2021"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1373\/clinchem.2008.112797","article-title":"The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments","volume":"55","author":"Bustin","year":"2009","journal-title":"Clin. Chem"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"6408","DOI":"10.1021\/acs.est.8b01071","article-title":"Does size matter? An experimental evaluation of the relative abundance and decay rates of aquatic environmental DNA","volume":"52","author":"Bylemans","year":"2018","journal-title":"Environ. Sci. Technol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1038\/nmeth.3869","article-title":"DADA2: high-resolution sample inference from Illumina amplicon data","volume":"13","author":"Callahan","year":"2016","journal-title":"Nat. Methods"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"2639","DOI":"10.1038\/ismej.2017.119","article-title":"Exact sequence variants should replace operational taxonomic units in marker-gene data analysis","volume":"11","author":"Callahan","year":"2017","journal-title":"ISME J"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1834","DOI":"10.1111\/j.1365-294X.2012.05550.x","article-title":"Bioinformatic challenges for DNA metabarcoding of plants and animals: bioinformatic for DNA metabarcoding","volume":"21","author":"Coissac","year":"2012","journal-title":"Mol. Ecol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1985","DOI":"10.1111\/2041-210X.13276","article-title":"Non-specific amplification compromises environmental DNA metabarcoding with COI","volume":"10","author":"Collins","year":"2019","journal-title":"Methods Ecol. Evol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1186\/s12859-022-04584-3","article-title":"A data management infrastructure for the integration of imaging and omics data in life sciences","volume":"23","author":"Cuellar","year":"2022","journal-title":"BMC Bioinformatics"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"D1442","DOI":"10.1093\/nar\/gkab1014","article-title":"GreeNC 2.0: a comprehensive database of plant long non-coding RNAs","volume":"50","author":"Di Marsico","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1038\/nbt1360","article-title":"The minimum information about a genome sequence (MIGS) specification","volume":"26","author":"Field","year":"2008","journal-title":"Nat. Biotechnol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"D597","DOI":"10.1093\/nar\/gks1160","article-title":"The protist ribosomal reference database (PR2): a catalog of unicellular eukaryote small sub-unit rRNA sequences with curated taxonomy","volume":"41","author":"Guillou","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023041408242839500_","author":"Haendel","year":"2016"},{"key":"2023041408242839500_","first-page":"20191409","article-title":"Predicting the fate of eDNA in the environment and implications for studying biodiversity. Proceedings of the royal society","volume":"286","author":"Harrison","year":"2019","journal-title":"Proc. Biol. Sci"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1111\/1755-0998.13107","article-title":"A practical guide to sample preservation and pre-PCR processing of aquatic environmental DNA","volume":"20","author":"Kumar","year":"2020","journal-title":"Mol. Ecol. Resour"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"135","DOI":"10.3389\/fevo.2020.00135","article-title":"A systematic review of sources of variability and uncertainty in eDNA data for environmental monitoring","volume":"8","author":"Mathieu","year":"2020","journal-title":"Front. Ecol. Evol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1126\/science.1179653","article-title":"Accessible reproducible research","volume":"327","author":"Mesirov","year":"2010","journal-title":"Science"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1002\/edn3.121","article-title":"An illustrated manual for environmental DNA research: water sampling guidelines and experimental protocols","volume":"3","author":"Minamoto","year":"2021","journal-title":"Environ. DNA"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1002\/edn3.81","article-title":"An analysis of metadata reporting in freshwater environmental DNA research calls for the development of best practice guidelines","volume":"2","author":"Nicholson","year":"2020","journal-title":"Environ. DNA"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"9721","DOI":"10.1002\/ece3.6594","article-title":"A total crapshoot? Evaluating bioinformatic decisions in animal diet metabarcoding analyses","volume":"10","author":"O\u2019Rourke","year":"2020","journal-title":"Ecol. Evol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"3397","DOI":"10.1093\/icesjms\/fsab082","article-title":"The role of taxonomic expertise in interpretation of metabarcoding studies","volume":"78","author":"Pappalardo","year":"2021","journal-title":"ICES J. Mar. Sci"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"814","DOI":"10.1038\/sj.embor.7401061","article-title":"There shall be order: the legacy of Linnaeus in the age of molecular biology","volume":"8","author":"Paterlini","year":"2007","journal-title":"EMBO Rep"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"151783","DOI":"10.1016\/j.scitotenv.2021.151783","article-title":"Environmental DNA metabarcoding for benthic monitoring: a review of sediment sampling and DNA extraction methods","volume":"818","author":"Pawlowski","year":"2022","journal-title":"Sci. Total Environ"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"giz092","DOI":"10.1093\/gigascience\/giz092","article-title":"Prospects and challenges of implementing DNA metabarcoding for high-throughput insect surveillance","volume":"8","author":"Piper","year":"2019","journal-title":"GigaScience"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"14820","DOI":"10.1038\/s41598-019-51341-3","article-title":"Metabarcoding assessment of prokaryotic and eukaryotic taxa in sediments from Stellwagen bank national marine sanctuary","volume":"9","author":"Polinski","year":"2019","journal-title":"Sci. Rep"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1111\/j.1471-8286.2007.01678.x","article-title":"BOLD: the barcode of life data system: BARCODING","volume":"7","author":"Ratnasingham","year":"2007","journal-title":"Mol. Ecol. Notes"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1890\/1051-0761(2002)012[0618:ATATOU]2.0.CO;2","article-title":"A taxonomy and treatment of uncertainty for ecology and conservation biology","volume":"12","author":"Regan","year":"2002","journal-title":"Ecol. Appl"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"giaa140","DOI":"10.1093\/gigascience\/giaa140","article-title":"Streamlining data-intensive biology with workflow systems","volume":"10","author":"Reiter","year":"2021","journal-title":"GigaScience"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/s43705-021-00033-z","article-title":"Handling of spurious sequences affects the outcome of high-throughput 16S rRNA gene amplicon profiling","volume":"1","author":"Reitmeier","year":"2021","journal-title":"ISME Commun"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"641","DOI":"10.3389\/fpls.2016.00641","article-title":"Publishing FAIR data: an exemplar methodology utilizing PHI-Base","volume":"7","author":"Rodr\u00edguez-Iglesias","year":"2016","journal-title":"Front. Plant Sci"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1004","DOI":"10.1002\/rra.3610","article-title":"Reference databases, primer choice, and assay sensitivity for environmental metabarcoding: lessons learnt from a re-evaluation of an eDNA fish assessment in the Volga headwaters","volume":"36","author":"Schenekar","year":"2020","journal-title":"River Res. Applic"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1111\/1755-0998.12402","article-title":"Tag jumps illuminated\u2014reducing sequence-to-sample misidentifications in metabarcoding studies","volume":"15","author":"Schnell","year":"2015","journal-title":"Mol. Ecol. Resour"},{"key":"2023041408242839500_","volume-title":"Use of the New England Aquarium to Evaluate Environmental DNA Metabarcoding of Gulf of Maine Vertebrates and Invertebrates [Master of Science in Marine Biology]","author":"Silverbrand","year":"2021"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","DOI":"10.3133\/fs20093054","article-title":"The national map\u2014hydrography","volume":"3054","author":"Simley","year":"2009","journal-title":"US Geological Survey Fact Sheet"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"226","DOI":"10.3389\/fmars.2020.00226","article-title":"Improved environmental DNA reference library detects overlooked marine fishes in New Jersey, United States","volume":"7","author":"Stoeckle","year":"2020","journal-title":"Front. Mar. Sci"},{"key":"2023041408242839500_","volume-title":"Database Modeling and Design: Logical Design.","author":"Teorey","year":"2011","edition":"5th edn"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1093\/bib\/bbx086","article-title":"Big data management challenges in health research\u2014a literature review","volume":"20","author":"Wang","year":"2019","journal-title":"Brief. Bioinformatics"},{"key":"2023041408242839500_","first-page":"17","article-title":"The Douglas-Peucker line simplification algorithm","volume":"22","author":"Whyatt","year":"1988","journal-title":"Bull. Soc. Univ. Cartogr"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci. Data"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1038\/s41592-021-01254-9","article-title":"Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers","volume":"18","author":"Wratten","year":"2021","journal-title":"Nat. Methods"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nbt.1823","article-title":"Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications","volume":"29","author":"Yilmaz","year":"2011","journal-title":"Nat. Biotechnol"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"D643","DOI":"10.1093\/nar\/gkt1209","article-title":"The SILVA and \u201call-species living tree project (LTP)\u201d taxonomic frameworks","volume":"42","author":"Yilmaz","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023041408242839500_","first-page":"204","volume-title":"Species Occurrence Data from the Aquatic eDNAtlas Database","author":"Young","year":"2018"},{"key":"2023041408242839500_","doi-asserted-by":"crossref","first-page":"e108793","DOI":"10.1371\/journal.pone.0108793","article-title":"Taxonomic reference libraries for environmental barcoding: a best practice example from diatom research","volume":"9","author":"Zimmermann","year":"2014","journal-title":"PLoS One"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac556\/45477487\/btac556.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/19\/4589\/49885340\/btac556.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/19\/4589\/49885340\/btac556.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,25]],"date-time":"2023-11-25T15:24:06Z","timestamp":1700925846000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/19\/4589\/6663773"}},"subtitle":[],"editor":[{"given":"Peter","family":"Robinson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,8,12]]},"references-count":52,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2022,9,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac556","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,10,1]]},"published":{"date-parts":[[2022,8,12]]}}}