{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T05:33:20Z","timestamp":1770701600964,"version":"3.49.0"},"reference-count":112,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,4,5]],"date-time":"2021-04-05T00:00:00Z","timestamp":1617580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100010083","name":"Hunan Provincial Innovation Foundation for Postgraduate","doi-asserted-by":"publisher","award":["CX2018B313"],"award-info":[{"award-number":["CX2018B313"]}],"id":[{"id":"10.13039\/501100010083","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004969","name":"Applied Scientific Research Fund","doi-asserted-by":"publisher","award":["19A316"],"award-info":[{"award-number":["19A316"]}],"id":[{"id":"10.13039\/501100004969","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004969","name":"Applied Scientific Research Fund","doi-asserted-by":"publisher","award":["15CY007"],"award-info":[{"award-number":["15CY007"]}],"id":[{"id":"10.13039\/501100004969","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002767","name":"Hunan Provincial Science and Technology Department","doi-asserted-by":"publisher","award":["2018WK4001"],"award-info":[{"award-number":["2018WK4001"]}],"id":[{"id":"10.13039\/501100002767","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100007847","name":"Natural Science Foundation of Jilin Province of China","doi-asserted-by":"publisher","award":["2018JJ2262"],"award-info":[{"award-number":["2018JJ2262"]}],"id":[{"id":"10.13039\/100007847","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61972185"],"award-info":[{"award-number":["61972185"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772197"],"award-info":[{"award-number":["61772197"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61832019"],"award-info":[{"award-number":["61832019"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,2]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The rapid increase of genome data brought by gene sequencing technologies poses a massive challenge to data processing. To solve the problems caused by enormous data and complex computing requirements, researchers have proposed many methods and tools which can be divided into three types: big data storage, efficient algorithm design and parallel computing. The purpose of this review is to investigate popular parallel programming technologies for genome sequence processing. Three common parallel computing models are introduced according to their hardware architectures, and each of which is classified into two or three types and is further analyzed with their features. Then, the parallel computing for genome sequence processing is discussed with four common applications: genome sequence alignment, single nucleotide polymorphism calling, genome sequence preprocessing, and pattern detection and searching. For each kind of application, its background is firstly introduced, and then a list of tools or algorithms are summarized in the aspects of principle, hardware platform and computing efficiency. The programming model of each hardware and application provides a reference for researchers to choose high-performance computing tools. Finally, we discuss the limitations and future trends of parallel computing technologies.<\/jats:p>","DOI":"10.1093\/bib\/bbab070","type":"journal-article","created":{"date-parts":[[2021,2,11]],"date-time":"2021-02-11T16:19:29Z","timestamp":1613060369000},"source":"Crossref","is-referenced-by-count":17,"title":["Parallel computing for genome sequence processing"],"prefix":"10.1093","volume":"22","author":[{"given":"You","family":"Zou","sequence":"first","affiliation":[{"name":"Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Changsha, China"}]},{"given":"Yuejie","family":"Zhu","sequence":"additional","affiliation":[{"name":"Hunan Provincial Key Lab of Bioinformatics, School of Computer Science and Engineering at Central South University, Changsha, China"}]},{"given":"Yaohang","family":"Li","sequence":"additional","affiliation":[{"name":"computer science at Old Dominion University, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4593-9332","authenticated-orcid":false,"given":"Fang-Xiang","family":"Wu","sequence":"additional","affiliation":[{"name":"College of Engineering and the Department of Computer Science at the University of Saskatchewan, Saskatoon, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1516-0480","authenticated-orcid":false,"given":"Jianxin","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering at Central South University, Changsha, Hunan, China"}]}],"member":"286","published-online":{"date-parts":[[2021,4,5]]},"reference":[{"issue":"4","key":"2021090815325840100_ref1","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1038\/nbt.2181","article-title":"DNA sequencing with nanopores","volume":"30","author":"Schneider","year":"2012","journal-title":"Nat Biotechnol"},{"issue":"10","key":"2021090815325840100_ref2","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1038\/nbt1486","article-title":"Next-generation DNA sequencing","volume":"26","author":"Shendure","year":"2008","journal-title":"Nat Biotechnol"},{"issue":"1","key":"2021090815325840100_ref3","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1146\/annurev.genom.9.081307.164359","article-title":"Next-generation DNA sequencing methods","volume":"9","author":"Mardis","year":"2008","journal-title":"Annu Rev Genomics Hum Genet"},{"issue":"5507","key":"2021090815325840100_ref4","doi-asserted-by":"crossref","first-page":"1304","DOI":"10.1126\/science.1058040","article-title":"The sequence of the human genome","volume":"291","author":"Venter","year":"2001","journal-title":"Science"},{"key":"2021090815325840100_ref5","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/j.csbj.2017.07.004","article-title":"Computing platforms for big biological data analytics: perspectives and challenges","volume":"15","author":"Yin","year":"2017","journal-title":"Comput Struct Biotechnol J"},{"key":"2021090815325840100_ref6","doi-asserted-by":"crossref","DOI":"10.1098\/rsta.2019.0394","article-title":"The parallelism motifs of genomic data analysis","volume":"378","author":"Yelick","year":"2020","journal-title":"Phil Trans R Soc A"},{"issue":"5","key":"2021090815325840100_ref7","first-page":"870","article-title":"Graphics processing units in bioinformatics, computational biology and systems biology","author":"Nobile","year":"2017","journal-title":"Brief Bioinform"},{"issue":"4","key":"2021090815325840100_ref8","article-title":"Survey of MapReduce frame operation in bioinformatics","author":"Quan","year":"2014","journal-title":"Brief Bioinform"},{"key":"2021090815325840100_ref9","first-page":"368","article-title":"Review: high-performance computing to detect epistasis in genome scale data sets","volume":"3","author":"Alex","year":"2016","journal-title":"Brief Bioinform"},{"issue":"4","key":"2021090815325840100_ref10","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1049\/iet-spr.2012.0262","article-title":"Parallel computing for efficient time-frequency feature extraction of power quality disturbances","volume":"7","author":"Krishna","year":"2013","journal-title":"IET Signal Processing"},{"issue":"9","key":"2021090815325840100_ref11","doi-asserted-by":"crossref","first-page":"948","DOI":"10.1109\/TC.1972.5009071","article-title":"Some computer organizations and their effectiveness","volume":"100","author":"Flynn","year":"1972","journal-title":"IEEE Trans Comput"},{"key":"2021090815325840100_ref12","first-page":"1","volume-title":"International Parallel and Distributed Processing Symposium, 2010","author":"Bhandarkar"},{"issue":"11","key":"2021090815325840100_ref13","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1145\/2934664","article-title":"Apache spark: a unified engine for big data processing","volume":"59","author":"Zaharia","year":"2016","journal-title":"Commun ACM"},{"key":"2021090815325840100_ref14","first-page":"260","volume-title":"Proceedings of the 17th international conference on Parallel architectures and compilation techniques (PACT '08)","author":"He"},{"key":"2021090815325840100_ref15","first-page":"1","volume-title":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","author":"Shvachko"},{"key":"2021090815325840100_ref16","first-page":"1","volume-title":"Proceedings of the 4th Annual Symposium on Cloud Computing, 2013","author":"Vavilapalli"},{"issue":"1","key":"2021090815325840100_ref17","first-page":"28","article-title":"Big data analytics in bioinformatics: a machine learning perspective","volume":"5","author":"Kashyap","year":"2015","journal-title":"Comput Sci"},{"key":"2021090815325840100_ref18","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkh435","article-title":"BLAST: at the core of a powerful and diverse set of sequence analysis tools","author":"McGinnis","year":"2004","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"2021090815325840100_ref19","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1093\/bioinformatics\/btn025","article-title":"SOAP: short oligonucleotide alignment program","volume":"24","author":"Li","year":"2008","journal-title":"Bioinformatics"},{"issue":"11","key":"2021090815325840100_ref20","doi-asserted-by":"crossref","first-page":"1851","DOI":"10.1101\/gr.078212.108","article-title":"Mapping short DNA sequencing reads and calling variants using mapping quality scores","volume":"18","author":"Li","year":"2008","journal-title":"Genome Res"},{"issue":"21","key":"2021090815325840100_ref21","doi-asserted-by":"crossref","first-page":"2431","DOI":"10.1093\/bioinformatics\/btn416","article-title":"ZOOM! Zillions of oligos mapped","volume":"24","author":"Lin","year":"2008","journal-title":"Bioinformatics"},{"issue":"8","key":"2021090815325840100_ref22","article-title":"Bioinformatics applications on apache spark","volume":"7","author":"Guo","year":"2018","journal-title":"GigaScience"},{"issue":"1","key":"2021090815325840100_ref23","article-title":"Big data analysis: apache spark perspective","volume":"15","author":"Shoro","year":"2015","journal-title":"Global J Comp Sci Technol"},{"key":"2021090815325840100_ref24","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.comcom.2020.01.020","article-title":"Distributed and scalable computing framework for improving request processing of wearable IoT assisted medical sensors on pervasive computing system","volume":"151","author":"Fouad","year":"2020","journal-title":"Comput Commun"},{"key":"2021090815325840100_ref25","volume-title":"Proceedings of the 2006 ACM\/IEEE conference on Supercomputing (SC '06)","author":"Sur"},{"key":"2021090815325840100_ref26","volume-title":"Proceedings of the 2006 ACM\/IEEE conference on Supercomputing (SC '06)","author":"Luebke"},{"key":"2021090815325840100_ref27","volume-title":"ACM SIGGRAPH 2007 courses (SIGGRAPH '07)","author":"Buck"},{"key":"2021090815325840100_ref28","first-page":"329","volume-title":"A parallel, distributed and network-based processing, 2019","author":"Giansanti"},{"key":"2021090815325840100_ref29","first-page":"265","volume-title":"Operating Systems Design and Implementation, 2016","author":"Abadi"},{"key":"2021090815325840100_ref30","article-title":"Theano: a Python framework for fast computation of mathematical expressions","author":"Alrfou","year":"2016","journal-title":"arXiv preprint arXiv:1605.02688"},{"key":"2021090815325840100_ref31","volume-title":"Torch: A Modular Machine Learning Software Library","author":"Collobert","year":"2002"},{"issue":"3","key":"2021090815325840100_ref32","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1109\/MCSE.2010.69","article-title":"OpenCL: a parallel programming standard for heterogeneous computing systems","volume":"12","author":"Stone","year":"2010","journal-title":"Comput Sci Eng"},{"key":"2021090815325840100_ref33","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4302-5927-5","volume-title":"Intel\u00ae Xeon Phi\u2122 Coprocessor Architecture and Tools: The Guide for Application Developers","author":"Rahman","year":"2013"},{"key":"2021090815325840100_ref34","first-page":"388","volume-title":"Algorithms and Architectures for Parallel Processing. ICA3PP 2015. Lecture Notes in Computer Science","author":"Ni"},{"issue":"14","key":"2021090815325840100_ref35","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows--Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"issue":"5","key":"2021090815325840100_ref36","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long-read alignment with Burrows--Wheeler transform","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2021090815325840100_ref37","article-title":"Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM","author":"Li","year":"2013","journal-title":"arXiv preprint arXiv:1303.3997,"},{"issue":"4","key":"2021090815325840100_ref38","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat Methods"},{"issue":"s1","key":"2021090815325840100_ref39","first-page":"S1791","article-title":"Faster single-end alignment generation utilizing multi-thread for BWA","volume":"26","author":"Jo","year":"2015","journal-title":"Biomed Mater Eng"},{"issue":"1","key":"2021090815325840100_ref40","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1186\/s12859-019-2597-8","article-title":"Pblat: a multithread blat algorithm speeding up aligning sequences to genomes","volume":"20","author":"Wang","year":"2019","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"2021090815325840100_ref41","doi-asserted-by":"crossref","first-page":"656","DOI":"10.1101\/gr.229202","article-title":"BLAT\u2014the BLAST-like alignment tool","volume":"12","author":"Kent","year":"2002","journal-title":"Genome Res"},{"issue":"5","key":"2021090815325840100_ref42","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1089\/cmb.2014.0156","article-title":"PASTA: ultra-large multiple sequence alignment for nucleotide and amino-acid sequences","volume":"22","author":"Mirarab","year":"2015","journal-title":"J Comput Biol"},{"issue":"11","key":"2021090815325840100_ref43","doi-asserted-by":"crossref","first-page":"1363","DOI":"10.1093\/bioinformatics\/btp236","article-title":"CloudBurst: highly sensitive read mapping with MapReduce","volume":"25","author":"Schatz","year":"2009","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref44","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1186\/1471-2105-9-128","article-title":"Using quality scores and longer reads improves accuracy of Solexa read mapping","volume":"9","author":"Smith","year":"2008","journal-title":"BMC Bioinformatics"},{"issue":"15","key":"2021090815325840100_ref45","doi-asserted-by":"crossref","first-page":"2159","DOI":"10.1093\/bioinformatics\/btr325","article-title":"SEAL: a distributed short read mapping and duplicate removal tool","volume":"27","author":"Pireddu","year":"2011","journal-title":"Bioinformatics"},{"issue":"24","key":"2021090815325840100_ref46","doi-asserted-by":"crossref","first-page":"4003","DOI":"10.1093\/bioinformatics\/btv506","article-title":"BigBWA: approaching the burrows--wheeler aligner to big data technologies","volume":"31","author":"Abu\u00edn","year":"2015","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref47","first-page":"10","article-title":"Speeding up large-scale next generation sequencing data analysis with pBWA","volume":"1","author":"Peters","year":"2012","journal-title":"J Biocomput"},{"issue":"5","key":"2021090815325840100_ref48","first-page":"e155461","article-title":"SparkBWA: speeding up the alignment of high-throughput DNA sequencing data","volume":"11","author":"Abu\u00edn Jos\u00e9","year":"2016","journal-title":"PloS One"},{"issue":"5","key":"2021090815325840100_ref49","doi-asserted-by":"crossref","first-page":"760","DOI":"10.1093\/bioinformatics\/bty733","article-title":"SpaRC: scalable sequence clustering using apache spark","volume":"35","author":"Shi","year":"2019","journal-title":"Bioinformatics"},{"issue":"18","key":"2021090815325840100_ref50","doi-asserted-by":"crossref","first-page":"2948","DOI":"10.1093\/bioinformatics\/btx354","article-title":"PASTASpark: multiple sequence alignment meets big data","volume":"33","author":"Abuin","year":"2017","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref51","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/1756-0500-5-27","article-title":"BarraCUDA - a fast short read sequence aligner using graphics processing units","volume":"5","author":"Klus","year":"2012","journal-title":"BMC Res Notes"},{"issue":"14","key":"2021090815325840100_ref52","doi-asserted-by":"crossref","first-page":"1830","DOI":"10.1093\/bioinformatics\/bts276","article-title":"CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows--Wheeler transform","volume":"28","author":"Liu","year":"2012","journal-title":"Bioinformatics"},{"issue":"15","key":"2021090815325840100_ref53","doi-asserted-by":"crossref","first-page":"1966","DOI":"10.1093\/bioinformatics\/btp336","article-title":"SOAP2: an improved ultrafast tool for short read alignment","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref54","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-017-1725-6","article-title":"CMSA: a heterogeneous CPU\/GPU computing system for multiple similar RNA\/DNA sequence alignment","volume":"18","author":"Chen","year":"2017","journal-title":"BMC Bioinformatics"},{"issue":"15","key":"2021090815325840100_ref55","doi-asserted-by":"crossref","first-page":"2475","DOI":"10.1093\/bioinformatics\/btv177","article-title":"HAlign: fast multiple similar DNA\/RNA sequence alignment based on the Centre star strategy","volume":"31","author":"Zou","year":"2015","journal-title":"Bioinformatics"},{"issue":"13","key":"2021090815325840100_ref56","doi-asserted-by":"crossref","first-page":"2306","DOI":"10.1093\/bioinformatics\/bty930","article-title":"BGSA: a bit-parallel global sequence alignment toolkit for multi-core and many-core architectures","volume":"35","author":"Zhang","year":"2019","journal-title":"Bioinformatics"},{"key":"2021090815325840100_ref57","article-title":"NVBIO: a library of reusable components designed by NVIDIA corporation to accelerate bioinformatics applications using CUDA","author":"NVIDIA Corporation"},{"issue":"20","key":"2021090815325840100_ref58","doi-asserted-by":"crossref","first-page":"3437","DOI":"10.1093\/bioinformatics\/bty380","article-title":"Generic accelerated sequence alignment in SeqAn using vectorization and multi-threading","volume":"34","author":"Rahn","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref59","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1007\/s13353-015-0292-7","article-title":"Review of alignment and SNP calling algorithms for next-generation sequencing data","volume":"57","author":"Mielczarek","year":"2016","journal-title":"J Appl Genet"},{"issue":"22","key":"2021090815325840100_ref60","doi-asserted-by":"crossref","first-page":"3694","DOI":"10.1093\/bioinformatics\/btv440","article-title":"FermiKit: assembly-based variant calling for Illumina resequencing data","volume":"31","author":"Li","year":"2015","journal-title":"Bioinformatics"},{"issue":"24","key":"2021090815325840100_ref61","doi-asserted-by":"crossref","first-page":"4006","DOI":"10.1093\/bioinformatics\/btv507","article-title":"BS-SNPer: SNP calling in bisulfite-seq data","volume":"31","author":"Gao","year":"2015","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref62","doi-asserted-by":"crossref","DOI":"10.1186\/s12864-020-6569-1","article-title":"GSAlign: an efficient sequence alignment tool for intra-species genomes","volume":"21","author":"Lin","year":"2020","journal-title":"BMC Genomics"},{"issue":"18","key":"2021090815325840100_ref63","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref64","doi-asserted-by":"crossref","first-page":"e1005944","DOI":"10.1371\/journal.pcbi.1005944","article-title":"MUMmer4: A fast and versatile genome alignment system","volume":"14","author":"Mar\u00e7ais","year":"2018","journal-title":"PLoS Comput Biol"},{"issue":"3","key":"2021090815325840100_ref65","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1101\/gr.113985.110","article-title":"Adaptive seeds tame genomic sequence comparison","volume":"21","author":"Kie\u0142basa","year":"2011","journal-title":"Genome Res"},{"issue":"10","key":"2021090815325840100_ref66","doi-asserted-by":"crossref","first-page":"e2530","DOI":"10.1016\/j.heliyon.2019.e02530","article-title":"PVCTools: parallel variation calling tools","volume":"5","author":"Jin","year":"2019","journal-title":"Heliyon"},{"issue":"9","key":"2021090815325840100_ref67","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"Mckenna","year":"2010","journal-title":"Genome Res"},{"issue":"15","key":"2021090815325840100_ref68","doi-asserted-by":"crossref","first-page":"2482","DOI":"10.1093\/bioinformatics\/btv179","article-title":"Halvade: scalable sequence analysis with MapReduce","volume":"31","author":"Decap","year":"2015","journal-title":"Bioinformatics"},{"issue":"10","key":"2021090815325840100_ref69","first-page":"1421","article-title":"Basin development and tectonic history of the Llanos Basin, eastern cordillera, and middle Magdalena Valley, Colombia","volume":"79","author":"Cooper","year":"1995","journal-title":"AAPG Bulletin"},{"issue":"3","key":"2021090815325840100_ref70","doi-asserted-by":"crossref","first-page":"e174575","DOI":"10.1371\/journal.pone.0174575","article-title":"Halvade-RNA: parallel variant calling from transcriptomic data using MapReduce","volume":"12","author":"Decap","year":"2017","journal-title":"PloS One"},{"issue":"1","key":"2021090815325840100_ref71","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1093\/bioinformatics\/bts635","article-title":"STAR: ultrafast universal RNA-seq aligner","volume":"29","author":"Dobin","year":"2013","journal-title":"Bioinformatics"},{"key":"2021090815325840100_ref72","first-page":"148","volume-title":"Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics (ACM-BCB '17), 2017","author":"Mushtaq"},{"issue":"1","key":"2021090815325840100_ref73","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-019-2665-0","article-title":"ADS-HCSpark: a scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on spark","volume":"20","author":"Xiao","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2021090815325840100_ref74","article-title":"A multi-task convolutional deep neural network for variant calling in single molecule sequencing","author":"Luo","year":"2019","journal-title":"Nat Commun"},{"issue":"10","key":"2021090815325840100_ref75","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1038\/nbt.4235","article-title":"A universal SNP and small-indel variant caller using deep neural networks","volume":"36","author":"Poplin","year":"2018","journal-title":"Nat Biotechnol"},{"issue":"10","key":"2021090815325840100_ref76","doi-asserted-by":"crossref","first-page":"e1003880","DOI":"10.1371\/journal.pcbi.1003880","article-title":"FamSeq: a variant calling program for family-based sequencing data using graphics processing units","volume":"10","author":"Peng","year":"2014","journal-title":"PLoS Comput Biol"},{"issue":"11","key":"2021090815325840100_ref77","doi-asserted-by":"crossref","first-page":"2557","DOI":"10.1109\/TPDS.2018.2839578","article-title":"mSNP: a massively parallel algorithm for large-scale SNP detection","volume":"29","author":"Cui","year":"2018","journal-title":"IEEE Trans Parallel Distrib Syst"},{"issue":"6","key":"2021090815325840100_ref78","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1101\/gr.088013.108","article-title":"SNP detection for massively parallel whole-genome resequencing","volume":"19","author":"Li","year":"2009","journal-title":"Genome Res"},{"issue":"17","key":"2021090815325840100_ref79","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/bty560","article-title":"Fastp: an ultra-fast all-in-one FASTQ preprocessor","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2021090815325840100_ref80","first-page":"e27551v","article-title":"PRINSEQ++, a multi-threading tool for fast and efficient quality control and preprocessing of sequencing datasets","volume":"7","author":"Cantu","year":"2019","journal-title":"PeerJ Preprints"},{"issue":"6","key":"2021090815325840100_ref81","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1093\/bioinformatics\/btr026","article-title":"Quality control and preprocessing of metagenomic datasets","volume":"27","author":"Schmieder","year":"2011","journal-title":"Bioinformatics"},{"issue":"1","key":"2021090815325840100_ref82","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-2936-9","article-title":"FastProNGS: fast preprocessing of next-generation sequencing reads","volume":"20","author":"Liu","year":"2019","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"2021090815325840100_ref83","doi-asserted-by":"crossref","first-page":"e30619","DOI":"10.1371\/journal.pone.0030619","article-title":"NGS QC toolkit: a toolkit for quality control of next generation sequencing data","volume":"7","author":"Patel","year":"2012","journal-title":"PLoS One"},{"issue":"21","key":"2021090815325840100_ref84","doi-asserted-by":"crossref","first-page":"3355","DOI":"10.1093\/bioinformatics\/btx342","article-title":"GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping","volume":"33","author":"Alser","year":"2017","journal-title":"Bioinformatics"},{"issue":"21","key":"2021090815325840100_ref85","doi-asserted-by":"crossref","first-page":"4255","DOI":"10.1093\/bioinformatics\/btz234","article-title":"Shouji: a fast and efficient pre-alignment filter for sequence alignment","volume":"35","author":"Alser","year":"2019","journal-title":"Bioinformatics"},{"issue":"10","key":"2021090815325840100_ref86","doi-asserted-by":"crossref","first-page":"1553","DOI":"10.1093\/bioinformatics\/btu856","article-title":"Shifted hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping","volume":"31","author":"Xin","year":"2015","journal-title":"Bioinformatics"},{"issue":"10","key":"2021090815325840100_ref87","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/ng.437","article-title":"Personalized copy number and segmental duplication maps using next-generation sequencing","volume":"41","author":"Alkan","year":"2009","journal-title":"Nat Genet"},{"key":"2021090815325840100_ref88","article-title":"MAGNET: understanding and improving the accuracy of genome pre-alignment filtering","author":"Alser","year":"2017","journal-title":"arXiv preprint arXiv:1707.01631"},{"issue":"2","key":"2021090815325840100_ref89","first-page":"130","article-title":"Review of different sequence motif finding algorithms","volume":"11","author":"Hashim","year":"2019","journal-title":"Avicenna J Med Biotechnol"},{"key":"2021090815325840100_ref90","doi-asserted-by":"crossref","first-page":"162","DOI":"10.1016\/j.eswa.2017.03.026","article-title":"EPMA: efficient pattern matching algorithm for DNA sequences","author":"Tahir","year":"2017","journal-title":"Expert Syst Appl"},{"issue":"06","key":"2021090815325840100_ref91","doi-asserted-by":"crossref","first-page":"967","DOI":"10.1142\/S0129054109006991","article-title":"Efficient variants of the backward-oracle-matching algorithm","volume":"20","author":"Faro","year":"2009","journal-title":"Int J Found Comput Sci"},{"issue":"1","key":"2021090815325840100_ref92","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1109\/TCBB.2018.2861399","article-title":"MultiMotifMaker: a multi-thread tool for identifying DNA methylation motifs from Pacbio reads","volume":"17","author":"Li","year":"2020","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021090815325840100_ref93","author":"Pacific Biosciences"},{"issue":"4","key":"2021090815325840100_ref94","doi-asserted-by":"crossref","first-page":"805","DOI":"10.3390\/a6040805","article-title":"PMS6MC: a multicore algorithm for motif discovery","volume":"6","author":"Bandyopadhyay","year":"2013","journal-title":"Algorithms"},{"key":"2021090815325840100_ref95","first-page":"1","volume-title":"2012 IEEE 2nd International Conference on Computational Advances in Bio and medical Sciences (ICCABS)","author":"Bandyopadhyay"},{"key":"2021090815325840100_ref96","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1016\/j.engappai.2018.06.009","article-title":"An efficient approach for mining sequential patterns using multiple threads on very large databases","author":"Huynh","year":"2018","journal-title":"Eng Appl Artif Intel"},{"key":"2021090815325840100_ref97","first-page":"40","volume-title":"Advances in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science","author":"Fournier-Viger"},{"key":"2021090815325840100_ref98","first-page":"549","article-title":"Parallel motif extraction from very long sequences","volume-title":"Proceedings of the 22nd ACM international conference on Information & Knowledge Management (CIKM '13).","author":"Sahli"},{"issue":"4","key":"2021090815325840100_ref99","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1089\/cmb.2010.0177","article-title":"MADMX: a strategy for maximal dense motif extraction","volume":"18","author":"Grossi","year":"2011","journal-title":"J Comput Biol"},{"issue":"5","key":"2021090815325840100_ref100","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1093\/dnares\/dsp014","article-title":"Exhaustive search for over-represented DNA sequence motifs with CisFinder","volume":"16","author":"Sharov","year":"2009","journal-title":"DNA Res"},{"issue":"1","key":"2021090815325840100_ref101","doi-asserted-by":"crossref","first-page":"472","DOI":"10.1186\/s12859-018-2498-2","article-title":"BiSpark: a spark-based highly scalable aligner for bisulfite sequencing data","volume":"19","author":"Soe","year":"2018","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"2021090815325840100_ref102","doi-asserted-by":"crossref","first-page":"1571","DOI":"10.1093\/bioinformatics\/btr167","article-title":"Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications","volume":"27","author":"Krueger","year":"2011","journal-title":"Bioinformatics"},{"issue":"14","key":"2021090815325840100_ref103","doi-asserted-by":"crossref","first-page":"2170","DOI":"10.1016\/j.patrec.2009.10.009","article-title":"CUDA--MEME: accelerating motif discovery in biological sequences using CUDA-enabled graphics processing units","volume":"31","author":"Liu","year":"2010","journal-title":"Patt Recogn Lett"},{"issue":"9","key":"2021090815325840100_ref104","first-page":"101","article-title":"Efficient computation of motif discovery on Intel many integrated Core (MIC) architecture","volume":"19","author":"Peng","year":"2018","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"2021090815325840100_ref105","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1093\/bib\/bbs088","article-title":"Survey of MapReduce frame operation in bioinformatics","volume":"15","author":"Zou","year":"2014","journal-title":"Brief Bioinform"},{"key":"2021090815325840100_ref106","first-page":"36","volume-title":"2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), IEEE, 2013","author":"APD"},{"key":"2021090815325840100_ref107"},{"key":"2021090815325840100_ref108","first-page":"307","volume-title":"Proceedings of the 7th Symposium on Operating Systems Design and Implementation, 2006","author":"Weil"},{"key":"2021090815325840100_ref109","first-page":"23","volume-title":"31st Euromicro Conference on Real-Time Systems (ECRTS 2019), 2019","author":"Pujol"},{"issue":"4","key":"2021090815325840100_ref110","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1038\/nrg.2017.113","article-title":"Cloud computing for genomic data analysis and collaboration","volume":"19","author":"Langmead","year":"2018","journal-title":"Nat Rev Genet"},{"issue":"6","key":"2021090815325840100_ref111","doi-asserted-by":"crossref","first-page":"998","DOI":"10.1109\/TPDS.2010.178","article-title":"Cloud technologies for bioinformatics applications","volume":"22","author":"Ekanayake","year":"2010","journal-title":"IEEE Trans Parallel Distrib Syst"},{"key":"2021090815325840100_ref112","volume-title":"Proceedings of the 6th International Conference on Bioinformatics and Biomedical Science (ICBBS '17)","author":"Li"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab070\/40260858\/bbab070.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab070\/40260858\/bbab070.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,21]],"date-time":"2023-10-21T01:25:31Z","timestamp":1697851531000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab070\/6210355"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,5]]},"references-count":112,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,9,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab070","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9]]},"published":{"date-parts":[[2021,4,5]]},"article-number":"bbab070"}}