{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T00:07:53Z","timestamp":1773274073876,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,12,5]],"date-time":"2017-12-05T00:00:00Z","timestamp":1512432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,12,5]]},"DOI":"10.1145\/3148055.3148060","type":"proceedings-article","created":{"date-parts":[[2017,12,1]],"date-time":"2017-12-01T20:03:47Z","timestamp":1512158627000},"page":"219-226","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Managing Variant Calling Files the Big Data Way"],"prefix":"10.1145","author":[{"given":"Aikaterini","family":"Boufea","sequence":"first","affiliation":[{"name":"University of Edinburgh, Edinburgh, United Kingdom"}]},{"given":"Richard","family":"Finkers","sequence":"additional","affiliation":[{"name":"Wageningen University &amp; Research, Wageningen, Netherlands"}]},{"given":"Martijn","family":"van Kaauwen","sequence":"additional","affiliation":[{"name":"Wageningen University &amp; Research, Wageningen, Netherlands"}]},{"given":"Mark","family":"Kramer","sequence":"additional","affiliation":[{"name":"Wageningen University &amp; Research, Wageningen, Netherlands"}]},{"given":"Ioannis N.","family":"Athanasiadis","sequence":"additional","affiliation":[{"name":"Wageningen University &amp; Research, Wageningen, Netherlands"}]}],"member":"320","published-online":{"date-parts":[[2017,12,5]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1111\/tpj.12616"},{"key":"e_1_3_2_1_2_1","first-page":"1061","article-title":"Analysis of Metagenomics Next Generation Sequence Data for Fungal ITS Barcoding: Do You Need Advance Bioinformatics Experience","author":"Ahmed Abdalla","year":"2016","unstructured":"Abdalla Ahmed . 2016 . Analysis of Metagenomics Next Generation Sequence Data for Fungal ITS Barcoding: Do You Need Advance Bioinformatics Experience ? Frontiers in Microbiology 7 , July (2016), 1061 . Abdalla Ahmed. 2016. Analysis of Metagenomics Next Generation Sequence Data for Fungal ITS Barcoding: Do You Need Advance Bioinformatics Experience? Frontiers in Microbiology 7, July (2016), 1061.","journal-title":"Frontiers in Microbiology 7"},{"key":"e_1_3_2_1_3_1","volume-title":"http:\/\/hadoop.apache. org\/. (2011--2017). {Online","author":"Foundation Apache Software","year":"2017","unstructured":"Apache Software Foundation . 2011--2017. Apache Hadoop . http:\/\/hadoop.apache. org\/. (2011--2017). {Online ; last accessed 23- Feb- 2017 }. Apache Software Foundation. 2011--2017. Apache Hadoop. http:\/\/hadoop.apache. org\/. (2011--2017). {Online; last accessed 23-Feb-2017}."},{"key":"e_1_3_2_1_4_1","volume-title":"http:\/\/parquet. apache.org\/ {Online","author":"Foundation Apache Software","year":"2017","unstructured":"Apache Software Foundation . 2017. Apache Parquet . (Jan. 2017). http:\/\/parquet. apache.org\/ {Online ; last accessed 23- Feb- 2017 }. Apache Software Foundation. 2017. Apache Parquet. (Jan. 2017). http:\/\/parquet. apache.org\/ {Online; last accessed 23-Feb-2017}."},{"key":"e_1_3_2_1_5_1","volume-title":"http:\/\/spark. apache.org\/ {Online","author":"Foundation Apache Software","year":"2017","unstructured":"Apache Software Foundation . 2017. Apache Spark . (Jan. 2017). http:\/\/spark. apache.org\/ {Online ; last accessed 23- Feb- 2017 }. Apache Software Foundation. 2017. Apache Spark. (Jan. 2017). http:\/\/spark. apache.org\/ {Online; last accessed 23-Feb-2017}."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1038\/nature15393"},{"key":"e_1_3_2_1_7_1","volume-title":"Experimental results of \"Managing variant calling datasets the big data way\". (May","author":"Boufea Aikaterini","year":"2017","unstructured":"Aikaterini Boufea and Ioannis N Athanasiadis . 2017. Experimental results of \"Managing variant calling datasets the big data way\". (May 2017 ). Aikaterini Boufea and Ioannis N Athanasiadis. 2017. Experimental results of \"Managing variant calling datasets the big data way\". (May 2017)."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr330"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btv179"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","first-page":"1575","DOI":"10.1093\/bioinformatics\/btx010","article-title":"FASTdoop","volume":"33","author":"Petrillo Umberto Ferraro","year":"2017","unstructured":"Umberto Ferraro Petrillo , Gianluca Roscigno , Giuseppe Cattaneo , and Raffaele Giancarlo . 2017 . FASTdoop : A Versatile and Efficient Library for the Input of FASTA and FASTQ Files for MapReduce Hadoop Bioinformatics Applications. Bioinformatics 33 , 10 (2017), 1575 -- 1577 . Umberto Ferraro Petrillo, Gianluca Roscigno, Giuseppe Cattaneo, and Raffaele Giancarlo. 2017. FASTdoop: A Versatile and Efficient Library for the Input of FASTA and FASTQ Files for MapReduce Hadoop Bioinformatics Applications. Bioinformatics 33, 10 (2017), 1575--1577.","journal-title":"Bioinformatics"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbv051"},{"key":"e_1_3_2_1_12_1","volume-title":"Cloud-scale RNAsequencing differential expression analysis with Myrna. Genome biology 11, 8","author":"Langmead Ben","year":"2010","unstructured":"Ben Langmead , Kasper D Hansen , and Jeffrey T Leek . 2010. Cloud-scale RNAsequencing differential expression analysis with Myrna. Genome biology 11, 8 ( 2010 ), R83. Ben Langmead, Kasper D Hansen, and Jeffrey T Leek. 2010. Cloud-scale RNAsequencing differential expression analysis with Myrna. Genome biology 11, 8 (2010), R83."},{"key":"e_1_3_2_1_13_1","volume-title":"Searching for SNPs with cloud computing. Genome biology 10, 11","author":"Langmead Ben","year":"2009","unstructured":"Ben Langmead , Michael C Schatz , Jimmy Lin , Mihai Pop , and Steven L Salzberg . 2009. Searching for SNPs with cloud computing. Genome biology 10, 11 ( 2009 ), R134. Ben Langmead, Michael C Schatz, Jimmy Lin, Mihai Pop, and Steven L Salzberg. 2009. Searching for SNPs with cloud computing. Genome biology 10, 11 (2009), R134."},{"key":"e_1_3_2_1_14_1","first-page":"530","article-title":"A review of bioinformatic pipeline frameworks","volume":"18","author":"Leipzig Jeremy","year":"2016","unstructured":"Jeremy Leipzig . 2016 . A review of bioinformatic pipeline frameworks . Briefings in Bioinformatics 18 , 3 (2016), 530 -- 536 . Jeremy Leipzig. 2016. A review of bioinformatic pipeline frameworks. Briefings in Bioinformatics 18, 3 (2016), 530--536.","journal-title":"Briefings in Bioinformatics"},{"key":"e_1_3_2_1_15_1","volume-title":"Bioinformatics","author":"Lubitz Timo","year":"2016","unstructured":"Timo Lubitz , Jens Hahn , Frank T. Bergmann , Elad Noor , Edda Klipp , and Wolfram Liebermeister . 2016 . SBtab: A flexible table format for data exchange in Systems Biology . Bioinformatics 32, April (2016), btw179--. Timo Lubitz, Jens Hahn, Frank T. Bergmann, Elad Noor, Edda Klipp, and Wolfram Liebermeister. 2016. SBtab: A flexible table format for data exchange in Systems Biology. Bioinformatics 32, April (2016), btw179--."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btv048"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt528"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1186\/s12864-015-2269-7"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btp236"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt601"},{"key":"e_1_3_2_1_22_1","volume-title":"https:\/\/www.surf.nl\/en\/about-surf\/subsidiaries\/surfsara\/. (2016). {Online","author":"SURF - Collaborative organization for ICT in Dutch education and research. 2016. SURFsara.","year":"2017","unstructured":"SURF - Collaborative organization for ICT in Dutch education and research. 2016. SURFsara. https:\/\/www.surf.nl\/en\/about-surf\/subsidiaries\/surfsara\/. (2016). {Online ; last accessed 23- Feb- 2017 }. SURF - Collaborative organization for ICT in Dutch education and research. 2016. SURFsara. https:\/\/www.surf.nl\/en\/about-surf\/subsidiaries\/surfsara\/. (2016). {Online; last accessed 23-Feb-2017}."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btu343"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2934664"}],"event":{"name":"UCC '17: 10th International Conference on Utility and Cloud Computing","location":"Austin Texas USA","acronym":"UCC '17","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IEEE TCSC IEEE Technical Committee on Scalable Computing"]},"container-title":["Proceedings of the Fourth IEEE\/ACM International Conference on Big Data Computing, Applications and Technologies"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3148055.3148060","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3148055.3148060","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:26:33Z","timestamp":1750213593000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3148055.3148060"}},"subtitle":["Using HDFS and Apache Parquet"],"short-title":[],"issued":{"date-parts":[[2017,12,5]]},"references-count":23,"alternative-id":["10.1145\/3148055.3148060","10.1145\/3148055"],"URL":"https:\/\/doi.org\/10.1145\/3148055.3148060","relation":{},"subject":[],"published":{"date-parts":[[2017,12,5]]},"assertion":[{"value":"2017-12-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}