{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T19:20:10Z","timestamp":1761765610422},"reference-count":12,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>With the advent of next-generation sequencing (NGS) technologies, full cDNA shotgun sequencing has become a major approach in the study of transcriptomes, and several different protocols in 454 sequencing have been invented. As each protocol uses its own short DNA tags or adapters attached to the ends of cDNA fragments for labeling or sequencing, different contaminants may lead to mis-assembly and inaccurate sequence products.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We have designed and implemented a new program for raw sequence cleaning in a graphical user interface and a batch script. The cleaning process consists of several modules including barcode trimming, sequencing adapter trimming, amplification primer trimming, poly-A tail trimming, vector screening and low quality region trimming. These modules can be combined based on various sequencing applications.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>ESTclean is a software package not only for cleaning cDNA sequences, but also for helping to develop sequencing protocols by providing summary tables and figures for sequencing quality control in a graphical user interface. It outperforms in cleaning read sequences from complicated sequencing protocols which use barcodes and multiple amplification primers.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-13-247","type":"journal-article","created":{"date-parts":[[2012,9,26]],"date-time":"2012-09-26T08:56:57Z","timestamp":1348649817000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing"],"prefix":"10.1186","volume":"13","author":[{"given":"Hongseok","family":"Tae","sequence":"first","affiliation":[]},{"given":"Dongsung","family":"Ryu","sequence":"additional","affiliation":[]},{"given":"Suhas","family":"Sureshchandra","sequence":"additional","affiliation":[]},{"given":"Jeong-Hyeon","family":"Choi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,9,26]]},"reference":[{"key":"5797_CR1","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1038\/nmeth1156","volume":"5","author":"SC Schuster","year":"2008","unstructured":"Schuster SC: Next-generation sequencing transforms today\u2019s biology. Nat Meth 2008, 5: 16\u201318. 10.1038\/nmeth1156","journal-title":"Nat Meth"},{"key":"5797_CR2","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1186\/1471-2164-10-219","volume":"10","author":"E Meyer","year":"2009","unstructured":"Meyer E, Aglyamova G, Wang S, Buchanan-Carter J, Abrego D, Colbourne J, Willis B, Matz M: Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx. BMC Genomics 2009, 10: 219. 10.1186\/1471-2164-10-219","journal-title":"BMC Genomics"},{"key":"5797_CR3","unstructured":"VecScreen http:\/\/www.ncbi.nlm.nih.gov\/VecScreen\/VecScreen.html"},{"issue":"12","key":"5797_CR4","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1093\/bioinformatics\/17.12.1093","volume":"17","author":"HH Chou","year":"2001","unstructured":"Chou HH, Holmes MH: DNA sequence quality trimming and vector removal. Bioinformatics 2001, 17(12):1093\u20131104. 10.1093\/bioinformatics\/17.12.1093","journal-title":"Bioinformatics"},{"key":"5797_CR5","unstructured":"Cross_match http:\/\/www.phrap.org\/phredphrapconsed.html"},{"key":"5797_CR6","unstructured":"SeqClean https:\/\/sourceforge.net\/projects\/seqclean\/"},{"issue":"4","key":"5797_CR7","doi-asserted-by":"publisher","first-page":"462","DOI":"10.1093\/bioinformatics\/btm632","volume":"24","author":"JR White","year":"2008","unstructured":"White JR, Roberts M, Yorke JA, Pop M: Figaro: a novel statistical method for vector sequence removal. Bioinformatics 2008, 24(4):462\u2013467. 10.1093\/bioinformatics\/btm632","journal-title":"Bioinformatics"},{"key":"5797_CR8","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1186\/1471-2105-11-38","volume":"11","author":"J Falgueras","year":"2010","unstructured":"Falgueras J, Lara A, Fernandez-Pozo N, Canton F, Perez-Trabado G, Claros MG: SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinformatics 2010, 11: 38. http:\/\/www.biomedcentral.com\/1471\u20132105\/11\/38 10.1186\/1471-2105-11-38","journal-title":"BMC Bioinformatics"},{"key":"5797_CR9","unstructured":"Parallel Tagged Sequencing https:\/\/bioinf.eva.mpg.de\/pts\/"},{"key":"5797_CR10","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"S Altschul","year":"1990","unstructured":"Altschul S, Gish W, Miller W, Myers E, Lipman D: Basic local alignment search tool. J Mol Biol 1990, 215: 403\u2013410.","journal-title":"J Mol Biol"},{"key":"5797_CR11","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","volume":"48","author":"SB Needleman","year":"1970","unstructured":"Needleman SB, Wunsch CD: A general method applicable to the search for similarities in the amino-acid sequence of two proteins. J Mol Biol 1970, 48: 443\u2013453. 10.1016\/0022-2836(70)90057-4","journal-title":"J Mol Biol"},{"issue":"9","key":"5797_CR12","doi-asserted-by":"publisher","first-page":"1859","DOI":"10.1093\/bioinformatics\/bti310","volume":"21","author":"TD Wu","year":"2005","unstructured":"Wu TD, Watanabe CK: GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 2005, 21(9):1859\u20131875. 10.1093\/bioinformatics\/bti310","journal-title":"Bioinformatics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-13-247.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T23:33:22Z","timestamp":1630539202000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-13-247"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,26]]},"references-count":12,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["5797"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-13-247","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,9,26]]},"assertion":[{"value":"9 July 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 September 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"247"}}