{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,12]],"date-time":"2026-04-12T19:24:23Z","timestamp":1776021863300,"version":"3.50.1"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,21]],"date-time":"2023-09-21T00:00:00Z","timestamp":1695254400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,21]],"date-time":"2023-09-21T00:00:00Z","timestamp":1695254400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004070","name":"Khalifa University of Science, Technology and Research","doi-asserted-by":"publisher","award":["CIRA-2019-076"],"award-info":[{"award-number":["CIRA-2019-076"]}],"id":[{"id":"10.13039\/501100004070","id-type":"DOI","asserted-by":"publisher"}]}],
"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Plummeting DNA sequencing cost in recent years has enabled genome sequencing projects to scale up by several orders of magnitude, which is transforming genomics into a highly data-intensive field of research. This development provides the much needed statistical power required for genotype\u2013phenotype predictions in complex diseases.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Methods<\/jats:title>\n                <jats:p>In order to efficiently leverage the wealth of information, we here assessed several genomic data science tools. The rationale to focus on on-premise installations is to cope with situations where data confidentiality and compliance regulations etc.\u00a0rule out cloud based solutions. We established a comprehensive qualitative and quantitative comparison between BCFtools, SnpSift, Hail, GEMINI, and OpenCGA. The tools were compared in terms of data storage technology, query speed, scalability, annotation, data manipulation, visualization, data output representation, and availability.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>Tools that leverage sophisticated data structures are noted as the most suitable for large-scale projects in varying degrees of scalability in comparison to flat-file manipulation (e.g., BCFtools, and SnpSift). 
Remarkably, for small to mid-size projects, even lightweight relational database.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>The assessment criteria provide insights into the typical questions posed in scalable genomics and serve as guidance for the development of scalable computational infrastructure in genomics.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-023-05470-2","type":"journal-article","created":{"date-parts":[[2023,9,21]],"date-time":"2023-09-21T09:02:24Z","timestamp":1695286944000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Critical assessment of on-premise approaches to scalable genome analysis"],"prefix":"10.1186","volume":"24","author":[{"given":"Amira","family":"Al-Aamri","sequence":"first","affiliation":[]},{"given":"Syafiq","family":"Kamarul Azman","sequence":"additional","affiliation":[]},{"given":"Gihan","family":"Daw Elbait","sequence":"additional","affiliation":[]},{"given":"Habiba","family":"Alsafar","sequence":"additional","affiliation":[]},{"given":"Andreas","family":"Henschel","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,21]]},"reference":[{"key":"5470_CR1","doi-asserted-by":"crossref","first-page":"5","DOI":"10.3389\/fdata.2018.00005","volume":"1","author":"T Hartung","year":"2018","unstructured":"Hartung T. Making big sense from big data. Front Big Data. 2018;1:5.","journal-title":"Front Big Data"},{"issue":"7","key":"5470_CR2","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1038\/jhg.2010.55","volume":"55","author":"CS Ku","year":"2010","unstructured":"Ku CS, Loy EY, Salim A, Pawitan Y, Chia KS. The discovery of human genetic variations and their use as disease markers: past, present and future. J Hum Genet. 
2010;55(7):403\u201315.","journal-title":"J Hum Genet"},{"issue":"9","key":"5470_CR3","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0216838","volume":"14","author":"MO Adetunji","year":"2019","unstructured":"Adetunji MO, Lamont SJ, Abasht B, Schmidt CJ. Variant analysis pipeline for accurate detection of genomic variants from transcriptome sequencing data. PLoS ONE. 2019;14(9): e0216838.","journal-title":"PLoS ONE"},{"issue":"7","key":"5470_CR4","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003153","volume":"9","author":"U Paila","year":"2013","unstructured":"Paila U, Chapman BA, Kirchner R, Quinlan AR. GEMINI: integrative exploration of genetic variation and genome annotations. PLoS Comput Biol. 2013;9(7): e1003153.","journal-title":"PLoS Comput Biol"},{"issue":"2","key":"5470_CR5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s12041-019-1101-6","volume":"98","author":"SA Chellappa","year":"2019","unstructured":"Chellappa SA, Pathak AK, Sinha P, Jainarayanan AK, Jain S, Brahmachari SK. Meta-analysis of genomic variants and gene expression data in schizophrenia suggests the potential need for adjunctive therapeutic interventions for neuropsychiatric disorders. J Genet. 2019;98(2):1\u201313.","journal-title":"J Genet"},{"issue":"7","key":"5470_CR6","doi-asserted-by":"crossref","first-page":"2185","DOI":"10.1534\/g3.120.401279","volume":"10","author":"X Chang","year":"2020","unstructured":"Chang X, Zhong D, Wang X, Bonizzoni M, Li Y, Zhou G, et al. Genomic variant analyses in pyrethroid resistant and susceptible malaria vector, Anopheles sinensis. G3 Genes Genomes Genet. 2020;10(7):2185\u201393.","journal-title":"G3 Genes Genomes Genet"},{"issue":"5","key":"5470_CR7","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1038\/nrg3706","volume":"15","author":"PC Sham","year":"2014","unstructured":"Sham PC, Purcell SM. Statistical power and significance testing in large-scale genetic studies. Nat Rev Genet. 
2014;15(5):335\u201346.","journal-title":"Nat Rev Genet"},{"issue":"7933","key":"5470_CR8","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1038\/s41586-022-05275-y","volume":"610","author":"L Yengo","year":"2022","unstructured":"Yengo L, Vedantam S, Marouli E, Sidorenko J, Bartell E, Sakaue S, et al. A saturated map of common genetic variants associated with human height. Nature. 2022;610(7933):704\u201312.","journal-title":"Nature"},{"key":"5470_CR9","unstructured":"Massie M, Nothaft F, Hartl C, Kozanitis C, Schumacher A, Joseph AD, et\u00a0al. Adam: genomics formats and processing patterns for cloud scale computing. University of California, Berkeley technical report, No UCB\/EECS-2013. 2013;207:2013."},{"issue":"8","key":"5470_CR10","doi-asserted-by":"crossref","first-page":"761","DOI":"10.2217\/pme.13.80","volume":"10","author":"SB Haga","year":"2013","unstructured":"Haga SB. 100k genome project: sequencing and much more. Pers Med. 2013;10(8):761\u20134.","journal-title":"Pers Med"},{"issue":"Supplement-III","key":"5470_CR11","doi-asserted-by":"crossref","first-page":"S2","DOI":"10.1016\/j.je.2016.12.005","volume":"27","author":"A Nagai","year":"2017","unstructured":"Nagai A, Hirata M, Kamatani Y, Muto K, Matsuda K, Kiyohara Y, et al. Overview of the BioBank Japan Project: study design and profile. J Epidemiol. 2017;27(Supplement-III):S2\u20138.","journal-title":"J Epidemiol"},{"key":"5470_CR12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41591-023-02211-z","volume":"29","author":"D Greene","year":"2023","unstructured":"Greene D, Consortium GER, Pirri D, Frudd K, Sackey E, Al-Owain M, et al. Genetic association analysis of 77539 genomes reveals rare disease etiologies. Nat Med. 2023;29:1\u201310.","journal-title":"Nat Med"},{"key":"5470_CR13","volume-title":"Genomics in the cloud: using Docker, GATK, and WDL in Terra","author":"GA Van der Auwera","year":"2020","unstructured":"Van der Auwera GA, O\u2019Connor BD. 
Genomics in the cloud: using Docker, GATK, and WDL in Terra. Sebastopol: O\u2019Reilly Media, Inc; 2020."},{"issue":"21","key":"5470_CR14","doi-asserted-by":"crossref","first-page":"2987","DOI":"10.1093\/bioinformatics\/btr509","volume":"27","author":"H Li","year":"2011","unstructured":"Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27(21):2987\u201393.","journal-title":"Bioinformatics"},{"issue":"2","key":"5470_CR15","doi-asserted-by":"crossref","first-page":"80","DOI":"10.4161\/fly.19695","volume":"6","author":"P Cingolani","year":"2012","unstructured":"Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6(2):80\u201392.","journal-title":"Fly"},{"issue":"15","key":"5470_CR16","doi-asserted-by":"crossref","first-page":"2156","DOI":"10.1093\/bioinformatics\/btr330","volume":"27","author":"P Danecek","year":"2011","unstructured":"Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156\u20138.","journal-title":"Bioinformatics"},{"issue":"1","key":"5470_CR17","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1038\/s41525-021-00227-3","volume":"6","author":"BS Pedersen","year":"2021","unstructured":"Pedersen BS, Brown JM, Dashnow H, Wallace AD, Velinder M, Tristani-Firouzi M, et al. Effective variant filtering and expected candidate variant yield in studies of rare human disease. NPJ Genom Med. 2021;6(1):60.","journal-title":"NPJ Genom Med"},{"key":"5470_CR18","unstructured":"Team H.: Hail 0.2. https:\/\/github.com\/hail-is\/hail\/commit\/13190f0b6103. 
Accessed 18 Aug 2021"},{"issue":"W1","key":"5470_CR19","doi-asserted-by":"crossref","first-page":"W189","DOI":"10.1093\/nar\/gkx445","volume":"45","author":"J Lopez","year":"2017","unstructured":"Lopez J, Coll J, Haimel M, Kandasamy S, Tarraga J, Furio-Tari P, et al. HGVA: the human genome variation archive. Nucleic Acids Res. 2017;45(W1):W189\u201394.","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"5470_CR20","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1093\/bib\/bbv051","volume":"17","author":"SN Hart","year":"2016","unstructured":"Hart SN, Duffy P, Quest DJ, Hossain A, Meiners MA, Kocher JP. VCF-Miner: GUI-based application for mining variants and annotations stored in VCF files. Brief Bioinform. 2016;17(2):346\u201351.","journal-title":"Brief Bioinform"},{"issue":"16","key":"5470_CR21","doi-asserted-by":"crossref","first-page":"e164","DOI":"10.1093\/nar\/gkq603","volume":"38","author":"K Wang","year":"2010","unstructured":"Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164\u2013e164.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"5470_CR22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-016-0974-4","volume":"17","author":"W McLaren","year":"2016","unstructured":"McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, et al. The ensembl variant effect predictor. Genome Biol. 2016;17(1):1\u201314.","journal-title":"Genome Biol"},{"issue":"14","key":"5470_CR23","doi-asserted-by":"crossref","first-page":"2076","DOI":"10.1093\/bioinformatics\/btu168","volume":"30","author":"V Obenchain","year":"2014","unstructured":"Obenchain V, Lawrence M, Carey V, Gogarten S, Shannon P, Morgan M. VariantAnnotation: a bioconductor package for exploration and annotation of genetic variants. Bioinformatics. 
2014;30(14):2076\u20138.","journal-title":"Bioinformatics"},{"issue":"12","key":"5470_CR24","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.3390\/genes10121017","volume":"10","author":"L Shi","year":"2019","unstructured":"Shi L, Wang Z. Computational strategies for scalable genomics analysis. Genes. 2019;10(12):1017.","journal-title":"Genes"},{"issue":"D1","key":"5470_CR25","doi-asserted-by":"crossref","first-page":"D325","DOI":"10.1093\/nar\/gkaa1113","volume":"49","author":"The Gene Ontology Consortium","year":"2021","unstructured":"The Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 2021;49(D1):D325\u201334.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"5470_CR26","doi-asserted-by":"crossref","first-page":"D587","DOI":"10.1093\/nar\/gkac963","volume":"51","author":"M Kanehisa","year":"2023","unstructured":"Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2023;51(D1):D587\u201392.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"5470_CR27","doi-asserted-by":"crossref","first-page":"D1062","DOI":"10.1093\/nar\/gkx1153","volume":"46","author":"MJ Landrum","year":"2018","unstructured":"Landrum MJ, Lee JM, Benson M, Brown GR, Chao C, Chitipiralla S, et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res. 2018;46(D1):D1062\u20137.","journal-title":"Nucleic Acids Res"},{"key":"5470_CR28","volume-title":"The international statistical classification of diseases and health related problems ICD-10: tenth revision. volume 1: tabular list","author":"World Health Organization","year":"2004","unstructured":"World Health Organization. The international statistical classification of diseases and health related problems ICD-10: tenth revision. volume 1: tabular list, vol. 1. 
Geneva: World Health Organization; 2004."},{"issue":"1","key":"5470_CR29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-018-2205-3","volume":"19","author":"M Oudah","year":"2018","unstructured":"Oudah M, Henschel A. Taxonomy-aware feature engineering for microbiome classification. BMC Bioinform. 2018;19(1):1\u201313.","journal-title":"BMC Bioinform"},{"issue":"1","key":"5470_CR30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-019-49114-z","volume":"9","author":"GA Tollefson","year":"2019","unstructured":"Tollefson GA, Schuster J, Gelin F, Agudelo A, Ragavendran A, Restrepo I, et al. VIVA (VIsualization of VAriants): a VCF file visualization tool. Sci Rep. 2019;9(1):1\u20137.","journal-title":"Sci Rep"},{"key":"5470_CR31","doi-asserted-by":"crossref","first-page":"358","DOI":"10.3389\/fphar.2019.00358","volume":"10","author":"Y Liang","year":"2019","unstructured":"Liang Y, He L, Zhao Y, Hao Y, Zhou Y, Li M, et al. Comparative analysis for the performance of variant calling pipelines on detecting the de novo mutations in humans. Front Pharmacol. 2019;10:358.","journal-title":"Front Pharmacol"},{"issue":"8","key":"5470_CR32","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1101\/gr.9.8.677","volume":"9","author":"ST Sherry","year":"1999","unstructured":"Sherry ST, Ward M, Sirotkin K. dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res. 1999;9(8):677\u20139.","journal-title":"Genome Res"},{"key":"5470_CR33","doi-asserted-by":"crossref","unstructured":"Chen S, Francioli LC, Goodrich JK, Collins RL, Kanai M, Wang Q, et\u00a0al. A genome-wide mutational constraint map quantified from variation in 76,156 human genomes. bioRxiv. 
2022;2022\u201303.","DOI":"10.1101\/2022.03.20.485034"},{"issue":"7571","key":"5470_CR34","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","volume":"526","author":"Genomes Project Consortium","year":"2015","unstructured":"Genomes Project Consortium. A global reference for human genetic variation. Nature. 2015;526(7571):68.","journal-title":"Nature"},{"issue":"5","key":"5470_CR35","doi-asserted-by":"crossref","first-page":"718","DOI":"10.1093\/bioinformatics\/btq671","volume":"27","author":"H Li","year":"2011","unstructured":"Li H. Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics. 2011;27(5):718\u20139.","journal-title":"Bioinformatics"},{"key":"5470_CR36","doi-asserted-by":"crossref","first-page":"527","DOI":"10.3389\/fgene.2021.660428","volume":"12","author":"G Daw Elbait","year":"2021","unstructured":"Daw Elbait G, Henschel A, Tay GK, Al Safar HS. A population-specific major allele reference genome from the United Arab Emirates population. Front Genet. 2021;12:527.","journal-title":"Front Genet"},{"issue":"7726","key":"5470_CR37","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/s41586-018-0579-z","volume":"562","author":"C Bycroft","year":"2018","unstructured":"Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203\u20139.","journal-title":"Nature"},{"key":"5470_CR38","doi-asserted-by":"publisher","unstructured":"Bear C, Lamb A, Tran N. The Vertica database: SQL RDBMS for managing big data. In: Proceedings of the 2012 workshop on management of big data systems. MBDS \u201912. New York, NY, USA: Association for Computing Machinery; 2012. p. 37\u201338. 
https:\/\/doi.org\/10.1145\/2378356.2378367.","DOI":"10.1145\/2378356.2378367"},{"key":"5470_CR39","volume-title":"MongoDB: the definitive guide: powerful and scalable data storage","author":"S Bradshaw","year":"2019","unstructured":"Bradshaw S, Brazil E, Chodorow K. MongoDB: the definitive guide: powerful and scalable data storage. Sebastopol: O\u2019Reilly Media; 2019."},{"key":"5470_CR40","volume-title":"HBase: the definitive guide: random access to your planet-size data","author":"L George","year":"2011","unstructured":"George L. HBase: the definitive guide: random access to your planet-size data. Sebastopol: O\u2019Reilly Media, Inc.; 2011."},{"issue":"1","key":"5470_CR41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13073-019-0693-z","volume":"12","author":"X Liu","year":"2020","unstructured":"Liu X, Li C, Mou C, Dong Y, Tu Y. dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs. Genome Med. 2020;12(1):1\u20138.","journal-title":"Genome Med"},{"issue":"7414","key":"5470_CR42","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","volume":"489","author":"ENCODE Project Consortium","year":"2012","unstructured":"ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57.","journal-title":"Nature"},{"issue":"7","key":"5470_CR43","doi-asserted-by":"crossref","first-page":"1003","DOI":"10.1093\/bioinformatics\/btt637","volume":"30","author":"BJ Raney","year":"2014","unstructured":"Raney BJ, Dreszer TR, Barber GP, Clawson H, Fujita PA, Wang T, et al. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics. 2014;30(7):1003\u20135.","journal-title":"Bioinformatics"},{"key":"5470_CR44","unstructured":"McKusick V, Hamosh A, Scott A, Amberger J, Valle D. Online Mendelian inheritance in man (OMIM). 
McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University. National Center for Biotechnology Information, National Library of Medicine, Bethesda; 2004. http:\/\/www.ncbi.nlm.nih.gov\/omim\/."},{"issue":"suppl\u20131","key":"5470_CR45","doi-asserted-by":"crossref","first-page":"D767","DOI":"10.1093\/nar\/gkn892","volume":"37","author":"T KeshavaPrasad","year":"2009","unstructured":"KeshavaPrasad T, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, et al. Human protein reference database\u20142009 update. Nucleic Acids Res. 2009;37(suppl\u20131):D767\u201372.","journal-title":"Nucleic Acids Res"},{"issue":"W1","key":"5470_CR46","doi-asserted-by":"crossref","first-page":"W609","DOI":"10.1093\/nar\/gks575","volume":"40","author":"M Bleda","year":"2012","unstructured":"Bleda M, Tarraga J, De Mar\u00eda A, Salavert F, Garcia-Alonso L, Celma M, et al. Cell Base, a comprehensive collection of RESTful web services for retrieving relevant biological information from heterogeneous sources. Nucleic Acids Res. 2012;40(W1):W609\u201314.","journal-title":"Nucleic Acids Res"},{"issue":"19","key":"5470_CR47","doi-asserted-by":"crossref","first-page":"3387","DOI":"10.1093\/bioinformatics\/bty358","volume":"34","author":"BS Pedersen","year":"2018","unstructured":"Pedersen BS, Quinlan AR. Hts-nim: scripting high-performance genomic analyses. Bioinformatics. 2018;34(19):3387\u20139.","journal-title":"Bioinformatics"},{"key":"5470_CR48","doi-asserted-by":"crossref","first-page":"e910","DOI":"10.14806\/ej.24.0.910","volume":"24","author":"L Papageorgiou","year":"2018","unstructured":"Papageorgiou L, Eleni P, Raftopoulou S, Mantaiou M, Megalooikonomou V, Vlachakis D. Genomic big data hitting the storage bottleneck. EMBnet J. 2018;24:e910.","journal-title":"EMBnet J"},{"key":"5470_CR49","unstructured":"Caulfield M, Davies J, Dennys M, Elbahy L, Fowler T, Hill S, et\u00a0al. The National Genomics Research and Healthcare Knowledgebase. 
figshare; 2017."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05470-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05470-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05470-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T22:06:06Z","timestamp":1700517966000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05470-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,21]]},"references-count":49,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5470"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05470-2","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,21]]},"assertion":[{"value":"27 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 September 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not 
applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"354"}}