{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,23]],"date-time":"2026-04-23T19:47:31Z","timestamp":1776973651360,"version":"3.51.4"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>The sequencing platform BGISEQ-500 is based on DNBSEQ technology and provides high throughput with low costs. This sequencer has been widely used in various areas of scientific and clinical research. A better understanding of the sequencing process and performance of this system is essential for stabilizing the sequencing process, accurately interpreting sequencing results and efficiently solving sequencing problems. To address these concerns, a comprehensive database, SEQdata-BEACON, was constructed to accumulate the run performance data in BGISEQ-500.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      A total of 60 BGISEQ-500 instruments in the BGI-Wuhan lab were used to collect sequencing performance data. Lanes in paired-end 100 (PE100) sequencing using 10\u2009bp barcode were chosen, and each lane was assigned a unique entry number as its identification number (ID). From November 2018 to April 2019, 2236 entries were recorded in the database containing 65 metrics about sample, yield, quality, machine state and supplies information. Using a correlation matrix, 52 numerical metrics were clustered into three groups signifying yield-quality, machine state and sequencing calibration. The distributions of the metrics also delivered information about patterns and rendered clues for further explanation or analysis of the sequencing process. Using the data of a total of 200\u2009cycles, a linear regression model well simulated the final outputs. Moreover, the predicted final yield could be provided in the 15th cycle of the early stage of sequencing, and the corresponding R\n                      <jats:sup>2<\/jats:sup>\n                      of the 200th and 15th cycle models were 0.97 and 0.81, respectively. The model was run with the test sets obtained from May 2019 to predict the yield, which resulted in an R\n                      <jats:sup>2<\/jats:sup>\n                      of 0.96. These results indicate that our simulation model was reliable and effective.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>\n                      Data sources, statistical findings and application tools provide a constantly updated reference for BGISEQ-500 users to comprehensively understand DNBSEQ technology, solve sequencing problems and optimize run performance. These resources are available on our website\n                      <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/seqbeacon.genomics.cn:443\/home.html\">http:\/\/seqBEACON.genomics.cn:443\/home.html<\/jats:ext-link>\n                      .\n                    <\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s13040-019-0209-9","type":"journal-article","created":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T07:02:46Z","timestamp":1573801366000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["SEQdata-BEACON: a comprehensive database of sequencing performance and statistical tools for performance evaluation and yield simulation in BGISEQ-500"],"prefix":"10.1186","volume":"12","author":[{"given":"Yanqiu","family":"Zhou","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chen","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rongfang","family":"Zhou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anzhi","family":"Lu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Biao","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Liling","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ling","family":"Chen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bei","family":"Luo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jin","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhijian","family":"Tian","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2019,11,15]]},"reference":[{"issue":"6","key":"209_CR1","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1038\/nrg.2016.49","volume":"17","author":"S Goodwin","year":"2016","unstructured":"Goodwin S, McPherson JD, McCombie WR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet. 2016;17(6):333\u201351. https:\/\/doi.org\/10.1038\/nrg.2016.49.","journal-title":"Nat Rev Genet"},{"key":"209_CR2","doi-asserted-by":"publisher","DOI":"10.4172\/2469-9853.S1-005","volume-title":"Next generation DNA sequencing (II): techniques, applications. Top 10 contributions on bioinformatics & systems biology","author":"WJ Ansorge","year":"2018","unstructured":"Ansorge WJ. Next generation DNA sequencing (II): techniques, applications. Top 10 contributions on bioinformatics & systems biology; 2018. https:\/\/doi.org\/10.4172\/2469-9853.S1-005."},{"issue":"5961","key":"209_CR3","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1126\/science.1181498","volume":"327","author":"R Drmanac","year":"2010","unstructured":"Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG, et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327(5961):78\u201381. https:\/\/doi.org\/10.1126\/science.1181498.","journal-title":"Science."},{"issue":"5","key":"209_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/gigascience\/gix024","volume":"6","author":"J Huang","year":"2017","unstructured":"Huang J, Liang X, Xuan Y, Geng C, Li Y, Lu H, et al. A reference human genome dataset of the BGISEQ-500 sequencer. Gigascience. 2017;6(5):1\u20139. https:\/\/doi.org\/10.1093\/gigascience\/gix024.","journal-title":"Gigascience."},{"issue":"1","key":"209_CR5","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1186\/s12859-019-2751-3","volume":"20","author":"Y Xu","year":"2019","unstructured":"Xu Y, Lin Z, Tang C, Tang Y, Cai Y, Zhong H, et al. A new massively parallel nanoball sequencing platform for whole exome research. BMC Bioinformatics. 2019;20(1):153. https:\/\/doi.org\/10.1186\/s12859-019-2751-3.","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"209_CR6","doi-asserted-by":"publisher","first-page":"492","DOI":"10.1016\/j.cell.2017.06.042","volume":"170","author":"K Chen","year":"2017","unstructured":"Chen K, Liu J, Liu S, Xia M, Zhang X, Han D, et al. Methyltransferase SETD2-mediated methylation of STAT1 is critical for interferon antiviral activity. Cell. 2017;170(3):492\u2013506 e14. https:\/\/doi.org\/10.1016\/j.cell.2017.06.042.","journal-title":"Cell."},{"key":"209_CR7","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1186\/s13148-016-0287-1","volume":"8","author":"T Fehlmann","year":"2016","unstructured":"Fehlmann T, Reinheimer S, Geng C, Su X, Drmanac S, Alexeev A, et al. cPAS-based sequencing on the BGISEQ-500 to explore small non-coding RNAs. Clin Epigenetics. 2016;8:123. https:\/\/doi.org\/10.1186\/s13148-016-0287-1.","journal-title":"Clin Epigenetics"},{"issue":"1","key":"209_CR8","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/s40168-018-0429-0","volume":"6","author":"M Han","year":"2018","unstructured":"Han M, Hao L, Lin Y, Li F, Wang J, Yang H, et al. A novel affordable reagent for room temperature storage and transport of fecal samples for metagenomic analyses. Microbiome. 2018;6(1):43. https:\/\/doi.org\/10.1186\/s40168-018-0429-0.","journal-title":"Microbiome."},{"issue":"7720","key":"209_CR9","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1038\/s41586-018-0415-5","volume":"560","author":"S Li","year":"2018","unstructured":"Li S, Tian Y, Wu K, Ye Y, Yu J, Zhang J, et al. Modulating plant growth-metabolism coordination for sustainable agriculture. Nature. 2018;560(7720):595\u2013600. https:\/\/doi.org\/10.1038\/s41586-018-0415-5.","journal-title":"Nature."},{"issue":"1","key":"209_CR10","doi-asserted-by":"publisher","first-page":"e0190264","DOI":"10.1371\/journal.pone.0190264","volume":"13","author":"AM Patch","year":"2018","unstructured":"Patch AM, Nones K, Kazakoff SH, Newell F, Wood S, Leonard C, et al. Germline and somatic variant identification using BGISEQ-500 and HiSeq X ten whole genome sequencing. PLoS One. 2018;13(1):e0190264. https:\/\/doi.org\/10.1371\/journal.pone.0190264.","journal-title":"PLoS One"},{"issue":"1","key":"209_CR11","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1038\/s41467-018-03590-5","volume":"9","author":"D Liu","year":"2018","unstructured":"Liu D, Zhang XX, Li MC, Cao CH, Wan DY, Xi BX, et al. C\/EBPbeta enhances platinum resistance of ovarian cancer cells by reprogramming H3K79 methylation. Nat Commun. 2018;9(1):1739. https:\/\/doi.org\/10.1038\/s41467-018-03590-5.","journal-title":"Nat Commun"},{"issue":"1","key":"209_CR12","doi-asserted-by":"publisher","first-page":"470","DOI":"10.1038\/s41467-018-08205-7","volume":"10","author":"L Liu","year":"2019","unstructured":"Liu L, Liu C, Quintero A, Wu L, Yuan Y, Wang M, et al. Deconvolution of single-cell multi-omics layers reveals regulatory heterogeneity. Nat Commun. 2019;10(1):470. https:\/\/doi.org\/10.1038\/s41467-018-08205-7.","journal-title":"Nat Commun"},{"key":"209_CR13","doi-asserted-by":"publisher","unstructured":"Natarajan KN, Miao Z, Jiang M, Huang X, Zhou H, Xie J, et al. Comparative analysis of sequencing technologies for single-cell transcriptomics. Genome Biol. 2019;20(1). https:\/\/doi.org\/10.1186\/s13059-019-1676-5.","DOI":"10.1186\/s13059-019-1676-5"},{"key":"209_CR14","doi-asserted-by":"publisher","unstructured":"Zhao Y, Li X, Zhao W, Wang J, Yu J, Wan Z, et al. Single-cell transcriptomic landscape of nucleated cells in umbilical cord blood. Gigascience. 2019;8(5). https:\/\/doi.org\/10.1093\/gigascience\/giz047.","DOI":"10.1093\/gigascience\/giz047"},{"key":"209_CR15","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1007\/978-981-13-1562-6_2","volume-title":"Bioinformatics: sequences, structures, phylogeny","author":"G Bansal","year":"2018","unstructured":"Bansal G, Narta K, Teltumbade MR. Next-Generation sequencing: technology, advancements, and applications. In: Shanker A, editor. Bioinformatics: sequences, structures, phylogeny. Singapore: Springer; 2018. p. 15\u201346."},{"key":"209_CR16","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1186\/s13007-018-0337-0","volume":"14","author":"FY Zhu","year":"2018","unstructured":"Zhu FY, Chen MX, Ye NH, Qiao WM, Gao B, Law WK, et al. Comparative performance of the BGISEQ-500 and Illumina HiSeq4000 sequencing platforms for transcriptome analysis in plants. Plant Methods. 2018;14:69. https:\/\/doi.org\/10.1186\/s13007-018-0337-0.","journal-title":"Plant Methods"},{"issue":"3","key":"209_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/gigascience\/gix133","volume":"7","author":"C Fang","year":"2018","unstructured":"Fang C, Zhong H, Lin Y, Chen B, Han M, Ren H, et al. Assessment of the cPAS-based BGISEQ-500 platform for metagenomic sequencing. Gigascience. 2018;7(3):1\u20138. https:\/\/doi.org\/10.1093\/gigascience\/gix133.","journal-title":"Gigascience."},{"issue":"5","key":"209_CR18","doi-asserted-by":"publisher","first-page":"798","DOI":"10.1101\/gr.245126.118","volume":"29","author":"Ou Wang","year":"2019","unstructured":"Wang O, Chin R, Cheng X, Wu KYM, Mao Q, Tang J, et al. Efficient and unique co-barcoding of second-generation sequencing reads from long DNA molecules enabling cost effective and accurate sequencing, haplotyping, and de novo assembly. Genome Res. 2019. https:\/\/doi.org\/10.1101\/gr.245126.118.","journal-title":"Genome Research"},{"key":"209_CR19","doi-asserted-by":"publisher","unstructured":"Gorbachev A, Kulemin N, Naumov V, Belova V, Kwon D, Rebrikov D, et al. Comparative analysis of novel MGISEQ-2000 sequencing platform vs Illumina HiSeq 2500 for whole-genome sequencing. BioRxiv. 2019. https:\/\/doi.org\/10.1101\/577080.","DOI":"10.1101\/577080"},{"key":"209_CR20","doi-asserted-by":"publisher","unstructured":"Senabouth A, Anderson S, Shi Q, Shi L, Jiang F, Zhang W, et al. Comparative performance of the BGI and Illumina sequencing technology for single-cell RNAsequencing. BioRxiv. 2019. https:\/\/doi.org\/10.1101\/552588.","DOI":"10.1101\/552588"},{"key":"209_CR21","unstructured":"Andrews S. FastQC: a quality control tool for high throughput sequence data. http:\/\/www.bioinformatics.babraham.ac.uk\/projects\/fastqc\/. Accessed 18 Nov 2018."},{"key":"209_CR22","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1186\/s13040-016-0099-z","volume":"9","author":"K Icay","year":"2016","unstructured":"Icay K, Chen P, Cervera A, Rantanen V, Lehtonen R, Hautaniemi S. SePIA: RNA and small RNA sequence processing, integration, and analysis. BioData Min. 2016;9:20. https:\/\/doi.org\/10.1186\/s13040-016-0099-z.","journal-title":"BioData Min"},{"key":"209_CR23","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1016\/j.atg.2016.06.001","volume":"10","author":"C Endrullat","year":"2016","unstructured":"Endrullat C, Glokler J, Franke P, Frohme M. Standardization and quality management in next-generation sequencing. Appl Transl Genom. 2016;10:2\u20139. https:\/\/doi.org\/10.1016\/j.atg.2016.06.001.","journal-title":"Appl Transl Genom"},{"issue":"1","key":"209_CR24","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1186\/s12864-019-5569-5","volume":"20","author":"Q Li","year":"2019","unstructured":"Li Q, Zhao X, Zhang W, Wang L, Wang J, Xu D, et al. Reliable multiplex sequencing with rare index mis-assignment on DNB-based NGS platform. BMC Genomics. 2019;20(1):215. https:\/\/doi.org\/10.1186\/s12864-019-5569-5.","journal-title":"BMC Genomics"},{"issue":"3","key":"209_CR25","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/gigascience\/giy013","volume":"7","author":"S Cheng","year":"2018","unstructured":"Cheng S, Melkonian M, Smith SA, Brockington S, Archibald JM, Delaux PM, et al. 10KP: a phylodiverse genome sequencing plan. Gigascience. 2018;7(3):1\u20139. https:\/\/doi.org\/10.1093\/gigascience\/giy013.","journal-title":"Gigascience."},{"key":"209_CR26","unstructured":"Illumina Proactive Instrument Monitoring. https:\/\/www.illumina.com\/services\/instrument-services-training\/product-support-services\/instrument-monitoring.html. Accessed 20 May 2019."}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-019-0209-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13040-019-0209-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-019-0209-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,13]],"date-time":"2020-11-13T19:12:02Z","timestamp":1605294722000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/s13040-019-0209-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,15]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["209"],"URL":"https:\/\/doi.org\/10.1186\/s13040-019-0209-9","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/652347","asserted-by":"object"}]},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,15]]},"assertion":[{"value":"16 July 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 October 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2019","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"21"}}