{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T23:53:27Z","timestamp":1775260407281,"version":"3.50.1"},"reference-count":29,"publisher":"World Scientific Pub Co Pte Ltd","issue":"06","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2018,12]]},"abstract":"<jats:p>In recent years, there have been many studies utilizing DNA methylome data to answer fundamental biological questions. Bisulfite sequencing (BS-seq) has enabled measurement of a genome-wide absolute level of DNA methylation at single-nucleotide resolution. However, due to the ambiguity introduced by bisulfite-treatment, the aligning process especially in large-scale epigenetic research is still considered a huge burden. We present Cloud-BS, an efficient BS-seq aligner designed for parallel execution on a distributed environment. Utilizing Apache Hadoop framework, Cloud-BS splits sequencing reads into multiple blocks and transfers them to distributed nodes. By designing each aligning procedure into separate map and reducing tasks while an internal key-value structure is optimized based on the MapReduce programming model, the algorithm significantly improves alignment performance without sacrificing mapping accuracy. In addition, Cloud-BS minimizes the innate burden of configuring a distributed environment by providing a pre-configured cloud image. Cloud-BS shows significantly improved bisulfite alignment performance compared to other existing BS-seq aligners. We believe our algorithm facilitates large-scale methylome data analysis. 
The algorithm is freely available at https:\/\/paryoja.github.io\/Cloud-BS\/ .<\/jats:p>","DOI":"10.1142\/s0219720018400280","type":"journal-article","created":{"date-parts":[[2018,10,31]],"date-time":"2018-10-31T02:11:09Z","timestamp":1540951869000},"page":"1840028","source":"Crossref","is-referenced-by-count":1,"title":["Cloud-BS: A MapReduce-based bisulfite sequencing aligner on cloud"],"prefix":"10.1142","volume":"16","author":[{"given":"Joungmin","family":"Choi","sequence":"first","affiliation":[{"name":"Division of Computer Science, Sookmyung Women\u2019s University, 100 Cheongpa-ro 47-gil, 04310 Seoul, Republic of Korea"}]},{"given":"Yoonjae","family":"Park","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, 08826 Seoul, Republic of Korea"}]},{"given":"Sun","family":"Kim","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, 08826 Seoul, Republic of Korea"}]},{"given":"Heejoon","family":"Chae","sequence":"additional","affiliation":[{"name":"Division of Computer Science, Sookmyung Women\u2019s University, 100 Cheongpa-ro 47-gil, 04310 Seoul, Republic of 
Korea"}]}],"member":"219","published-online":{"date-parts":[[2019,1,7]]},"reference":[{"key":"S0219720018400280BIB004","doi-asserted-by":"publisher","DOI":"10.1016\/S0022-2836(05)80360-2"},{"key":"S0219720018400280BIB005","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMoa0706550"},{"key":"S0219720018400280BIB006","doi-asserted-by":"publisher","DOI":"10.1186\/1745-6150-7-43"},{"key":"S0219720018400280BIB008","doi-asserted-by":"publisher","DOI":"10.1002\/path.1024"},{"key":"S0219720018400280BIB009","doi-asserted-by":"publisher","DOI":"10.1016\/0167-8191(96)00024-5"},{"key":"S0219720018400280BIB010","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts264"},{"issue":"5","key":"S0219720018400280BIB011","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1387\/ijdb.8645558","volume":"39","author":"Heby O","year":"1995","journal-title":"Int J Dev Biol"},{"key":"S0219720018400280BIB012","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-018-2120-7"},{"key":"S0219720018400280BIB013","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-017-1191-5"},{"key":"S0219720018400280BIB014","doi-asserted-by":"publisher","DOI":"10.1016\/j.yexcr.2006.03.006"},{"key":"S0219720018400280BIB016","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr167"},{"key":"S0219720018400280BIB017","doi-asserted-by":"publisher","DOI":"10.1038\/nmeth.1828"},{"issue":"3","key":"S0219720018400280BIB018","first-page":"R25","volume":"10","author":"Langmead B","year":"2009","journal-title":"Genome Biol"},{"key":"S0219720018400280BIB019","doi-asserted-by":"publisher","DOI":"10.1038\/366362a0"},{"key":"S0219720018400280BIB020","volume":"2009","author":"Li Y","year":"2009","journal-title":"Critical Assessment of Massive Data Analysis (CAMDA)"},{"issue":"3","key":"S0219720018400280BIB021","first-page":"428","volume":"33","author":"Luu 
PL","year":"2016","journal-title":"Bioinformatics"},{"key":"S0219720018400280BIB022","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gki901"},{"key":"S0219720018400280BIB023","doi-asserted-by":"publisher","DOI":"10.1126\/science.6164095"},{"key":"S0219720018400280BIB025","doi-asserted-by":"publisher","DOI":"10.1186\/1756-0500-4-171"},{"key":"S0219720018400280BIB026","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr394"},{"key":"S0219720018400280BIB027","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2015.07.286"},{"key":"S0219720018400280BIB028","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-15-337"},{"key":"S0219720018400280BIB030","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btp236"},{"key":"S0219720018400280BIB031","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkj461"},{"key":"S0219720018400280BIB033","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-9-128"},{"key":"S0219720018400280BIB034","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-S12-S1"},{"key":"S0219720018400280BIB035","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq057"},{"key":"S0219720018400280BIB036","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-10-232"},{"key":"S0219720018400280BIB037","doi-asserted-by":"publisher","DOI":"10.1007\/BF02825064"}],"container-title":["Journal of Bioinformatics and Computational 
Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720018400280","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T22:35:41Z","timestamp":1775255741000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720018400280"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12]]},"references-count":29,"journal-issue":{"issue":"06","published-online":{"date-parts":[[2019,1,7]]},"published-print":{"date-parts":[[2018,12]]}},"alternative-id":["10.1142\/S0219720018400280"],"URL":"https:\/\/doi.org\/10.1142\/s0219720018400280","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12]]}}}