{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,12,15]],"date-time":"2023-12-15T07:58:32Z","timestamp":1702627112208},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T00:00:00Z","timestamp":1626739200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T00:00:00Z","timestamp":1626739200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>As exome sequencing (ES) integrates into clinical practice, we should make every effort to utilize all information generated. Copy-number variation can lead to Mendelian disorders, but small copy-number variants (CNVs) often get overlooked or obscured by under-powered data collection. Many groups have developed methodology for detecting CNVs from ES, but existing methods often perform poorly for small CNVs and rely on large numbers of samples not always available to clinical laboratories. Furthermore, methods often rely on Bayesian approaches requiring user-defined priors in the setting of insufficient prior knowledge. This report first demonstrates the benefit of multiplexed exome capture (pooling samples prior to capture), then presents a novel detection algorithm, mcCNV (\u201cmultiplexed capture CNV\u201d), built around multiplexed capture.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We demonstrate: (1) multiplexed capture reduces inter-sample variance; (2) our mcCNV method, a novel depth-based algorithm for detecting CNVs from multiplexed capture ES data, improves the detection of small CNVs. We contrast our novel approach, agnostic to prior information, with the the commonly-used ExomeDepth. In a simulation study mcCNV demonstrated a favorable false discovery rate (FDR). When compared to calls made from matched genome sequencing, we find the mcCNV algorithm performs comparably to ExomeDepth.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>Implementing multiplexed capture increases power to detect single-exon CNVs. The novel mcCNV algorithm may provide a more favorable FDR than ExomeDepth. The greatest benefits of our approach derive from (1) not requiring a database of reference samples and (2) not requiring prior information about the prevalance or size of variants.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-021-04246-w","type":"journal-article","created":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T15:29:38Z","timestamp":1626794978000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Pre-capture multiplexing provides additional power to detect copy number variation in exome sequencing"],"prefix":"10.1186","volume":"22","author":[{"given":"Dayne L.","family":"Filer","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fengshen","family":"Kuo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alicia T.","family":"Brandt","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christian R.","family":"Tilley","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Piotr A.","family":"Mieczkowski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathan S.","family":"Berg","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kimberly","family":"Robasky","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yun","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chris","family":"Bizon","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jeffery L.","family":"Tilson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bradford C.","family":"Powell","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Darius M.","family":"Bost","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Clark D.","family":"Jeffries","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kirk C.","family":"Wilhelmsen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,7,20]]},"reference":[{"issue":"12","key":"4246_CR1","doi-asserted-by":"publisher","first-page":"0209185","DOI":"10.1371\/journal.pone.0209185","volume":"13","author":"DS Marchuk","year":"2018","unstructured":"Marchuk DS, Crooks K, Strande N, Kaiser-Rogers K, Milko LV, Brandt A, Arreola A, Tilley CR, Bizon C, Vora NL, Wilhelmsen KC, Evans JP, Berg JS. Increasing the diagnostic yield of exome sequencing by copy number variant analysis. PLoS One. 2018;13(12):0209185. https:\/\/doi.org\/10.1371\/journal.pone.0209185.","journal-title":"PLoS One"},{"issue":"8","key":"4246_CR2","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1038\/gim.2014.160","volume":"17","author":"K Retterer","year":"2015","unstructured":"Retterer K, Scuffins J, Schmidt D, Lewis R, Pineda-Alvarez D, Stafford A, Schmidt L, Warren S, Gibellini F, Kondakova A, Blair A, Bale S, Matyakhina L, Meck J, Aradhya S, Haverfield E. Assessing copy number from exome sequencing and exome array cgh based on cnv spectrum in a large clinical cohort. Genet Med. 2015;17(8):623\u20139. https:\/\/doi.org\/10.1038\/gim.2014.160.","journal-title":"Genet Med"},{"key":"4246_CR3","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1186\/s13039-017-0333-5","volume":"10","author":"R Yao","year":"2017","unstructured":"Yao R, Zhang C, Yu T, Li N, Hu X, Wang X, Wang J, Shen Y. Evaluation of three read-depth based cnv detection tools using whole-exome sequencing data. Mol Cytogenet. 2017;10:30. https:\/\/doi.org\/10.1186\/s13039-017-0333-5.","journal-title":"Mol Cytogenet"},{"issue":"21","key":"4246_CR4","doi-asserted-by":"publisher","first-page":"2747","DOI":"10.1093\/bioinformatics\/bts526","volume":"28","author":"V Plagnol","year":"2012","unstructured":"Plagnol V, Curtis J, Epstein M, Mok KY, Stebbings E, Grigoriadou S, Wood NW, Hambleton S, Burns SO, Thrasher AJ, Kumararatne D, Doffinger R, Nejentsev S. A robust model for read count data in exome sequencing experiments and implications for copy number variant calling. Bioinformatics. 2012;28(21):2747\u201354. https:\/\/doi.org\/10.1093\/bioinformatics\/bts526.","journal-title":"Bioinformatics"},{"issue":"8","key":"4246_CR5","doi-asserted-by":"publisher","first-page":"1525","DOI":"10.1101\/gr.138115.112","volume":"22","author":"N Krumm","year":"2012","unstructured":"Krumm N, Sudmant PH, Ko A, O\u2019Roak BJ, Malig M, Coe BP, Quinlan AR, Nickerson DA, Eichler EE. Copy number variation detection and genotyping from exome sequence data. Genome Res. 2012;22(8):1525\u201332. https:\/\/doi.org\/10.1101\/gr.138115.112.","journal-title":"Genome Res"},{"issue":"4","key":"4246_CR6","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1016\/j.ajhg.2012.08.005","volume":"91","author":"M Fromer","year":"2012","unstructured":"Fromer M, Moran JL, Chambert K, Banks E, Bergen SE, Ruderfer DM, Handsaker RE, McCarroll SA, O\u2019Donovan MC, Owen MJ, Kirov G, Sullivan PF, Hultman CM, Sklar P, Purcell SM. Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. Am J Hum Genet. 2012;91(4):597\u2013607. https:\/\/doi.org\/10.1016\/j.ajhg.2012.08.005.","journal-title":"Am J Hum Genet"},{"issue":"6","key":"4246_CR7","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1093\/nar\/gku1363","volume":"43","author":"Y Jiang","year":"2015","unstructured":"Jiang Y, Oldridge DA, Diskin SJ, Zhang NR. Codex: a normalization and copy number variation detection method for whole exome sequencing. Nucleic Acids Res. 2015;43(6):39. https:\/\/doi.org\/10.1093\/nar\/gku1363.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"4246_CR8","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1038\/s41436-018-0033-5","volume":"21","author":"R Truty","year":"2019","unstructured":"Truty R, Paul J, Kennemer M, Lincoln SE, Olivares E, Nussbaum RL, Aradhya S. Prevalence and properties of intragenic copy-number variation in mendelian disease genes. Genet Med. 2019;21(1):114\u201323. https:\/\/doi.org\/10.1038\/s41436-018-0033-5.","journal-title":"Genet Med"},{"issue":"10","key":"4246_CR9","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1093\/nar\/gks001","volume":"40","author":"Y Benjamini","year":"2012","unstructured":"Benjamini Y, Speed TP. Summarizing and correcting the gc content bias in high-throughput sequencing. Nucleic Acids Res. 2012;40(10):72. https:\/\/doi.org\/10.1093\/nar\/gks001.","journal-title":"Nucleic Acids Res"},{"issue":"3","key":"4246_CR10","doi-asserted-by":"publisher","first-page":"380","DOI":"10.1093\/bib\/bbu027","volume":"16","author":"L Kadalayil","year":"2015","unstructured":"Kadalayil L, Rafiq S, Rose-Zerilli MJJ, Pengelly RJ, Parker H, Oscier D, Strefford JC, Tapper WJ, Gibson J, Ennis S, Collins A. Exome sequence read depth methods for identifying copy number changes. Brief Bioinform. 2015;16(3):380\u201392. https:\/\/doi.org\/10.1093\/bib\/bbu027.","journal-title":"Brief Bioinform"},{"issue":"1","key":"4246_CR11","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1038\/nmeth.1276","volume":"6","author":"DY Chiang","year":"2009","unstructured":"Chiang DY, Getz G, Jaffe DB, O\u2019Kelly MJT, Zhao X, Carter SL, Russ C, Nusbaum C, Meyerson M, Lander ES. High-resolution mapping of copy-number alterations with massively parallel sequencing. Nat Methods. 2009;6(1):99\u2013103. https:\/\/doi.org\/10.1038\/nmeth.1276.","journal-title":"Nat Methods"},{"issue":"3","key":"4246_CR12","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1093\/bioinformatics\/btr670","volume":"28","author":"V Boeva","year":"2012","unstructured":"Boeva V, Popova T, Bleakley K, Chiche P, Cappo J, Schleiermacher G, Janoueix-Lerosey I, Delattre O, Barillot E. Control-freec: a tool for assessing copy number and allelic content using next-generation sequencing data. Bioinformatics. 2012;28(3):423\u20135. https:\/\/doi.org\/10.1093\/bioinformatics\/btr670.","journal-title":"Bioinformatics"},{"issue":"4","key":"4246_CR13","doi-asserted-by":"publisher","first-page":"1141","DOI":"10.1109\/TCBB.2018.2883333","volume":"17","author":"X Yuan","year":"2020","unstructured":"Yuan X, Bai J, Zhang J, Yang L, Duan J, Li Y, Gao M. Condel: Detecting copy number variation and genotyping deletion zygosity from single tumor samples using sequence data. IEEE\/ACM Trans Comput Biol Bioinform. 2020;17(4):1141\u201353. https:\/\/doi.org\/10.1109\/TCBB.2018.2883333.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"4246_CR14","doi-asserted-by":"publisher","unstructured":"Yuan X, Yu J, Xi J, Yang L, Shang J, Li Z, Duan J. Cnv\\_iftv: an isolation forest and total variation-based detection of cnvs from short-read sequencing data. IEEE\/ACM Trans Comput Biol Bioinform. 2019. https:\/\/doi.org\/10.1109\/TCBB.2019.2920889.","DOI":"10.1109\/TCBB.2019.2920889"},{"issue":"6","key":"4246_CR15","doi-asserted-by":"publisher","first-page":"974","DOI":"10.1101\/gr.114876.110","volume":"21","author":"A Abyzov","year":"2011","unstructured":"Abyzov A, Urban AE, Snyder M, Gerstein M. Cnvnator: an approach to discover, genotype, and characterize typical and atypical cnvs from family and population genome sequencing. Genome Res. 2011;21(6):974\u201384. https:\/\/doi.org\/10.1101\/gr.114876.110.","journal-title":"Genome Res"},{"issue":"3","key":"4246_CR16","doi-asserted-by":"publisher","first-page":"408","DOI":"10.1016\/j.ajhg.2012.07.004","volume":"91","author":"M Zhu","year":"2012","unstructured":"Zhu M, Need AC, Han Y, Ge D, Maia JM, Zhu Q, Heinzen EL, Cirulli ET, Pelak K, He M, Ruzzo EK, Gumbs C, Singh A, Feng S, Shianna KV, Goldstein DB. Using erds to infer copy-number variants in high-coverage genomes. Am J Hum Genet. 2012;91(3):408\u201321. https:\/\/doi.org\/10.1016\/j.ajhg.2012.07.004.","journal-title":"Am J Hum Genet"},{"key":"4246_CR17","doi-asserted-by":"publisher","first-page":"618","DOI":"10.1186\/1471-2164-13-618","volume":"13","author":"AE Shearer","year":"2012","unstructured":"Shearer AE, Hildebrand MS, Ravi H, Joshi S, Guiffre AC, Novak B, Happe S, LeProust EM, Smith RJH. Pre-capture multiplexing improves efficiency and cost-effectiveness of targeted genomic enrichment. BMC Genomics. 2012;13:618. https:\/\/doi.org\/10.1186\/1471-2164-13-618.","journal-title":"BMC Genomics"},{"key":"4246_CR18","unstructured":"Minka TP. Estimating a dirichlet distribution. Technical report. 2000."},{"issue":"2","key":"4246_CR19","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1016\/0005-2795(75)90109-9","volume":"405","author":"BW Matthews","year":"1975","unstructured":"Matthews BW. Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochim Biophys Acta. 1975;405(2):442\u201351.","journal-title":"Biochim Biophys Acta"},{"issue":"1","key":"4246_CR20","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1016\/j.ajhg.2017.12.007","volume":"102","author":"B Trost","year":"2018","unstructured":"Trost B, Walker S, Wang Z, Thiruvahindrapuram B, MacDonald JR, Sung WWL, Pereira SL, Whitney J, Chan AJS, Pellecchia G, Reuter MS, Lok S, Yuen RKC, Marshall CR, Merico D, Scherer SW. A comprehensive workflow for read depth-based identification of copy-number variation from whole-genome sequence data. Am J Hum Genet. 2018;102(1):142\u201355. https:\/\/doi.org\/10.1016\/j.ajhg.2017.12.007.","journal-title":"Am J Hum Genet"},{"key":"4246_CR21","doi-asserted-by":"publisher","first-page":"683","DOI":"10.1186\/1471-2164-13-683","volume":"13","author":"E Ramos","year":"2012","unstructured":"Ramos E, Levinson BT, Chasnoff S, Hughes A, Young AL, Thornton K, Li A, Vallania FLM, Province M, Druley TE. Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing. BMC Genomics. 2012;13:683. https:\/\/doi.org\/10.1186\/1471-2164-13-683.","journal-title":"BMC Genomics"},{"issue":"6","key":"4246_CR22","doi-asserted-by":"publisher","first-page":"1001","DOI":"10.1038\/leu.2011.32","volume":"25","author":"A Wesolowska","year":"2011","unstructured":"Wesolowska A, Dalgaard MD, Borst L, Gautier L, Bak M, Weinhold N, Nielsen BF, Helt LR, Audouze K, Nersting J, Tommerup N, Brunak S, Sicheritz-Ponten T, Leffers H, Schmiegelow K, Gupta R. Cost-effective multiplexing before capture allows screening of 25 000 clinically relevant snps in childhood acute lymphoblastic leukemia. Leukemia. 2011;25(6):1001\u20136. https:\/\/doi.org\/10.1038\/leu.2011.32.","journal-title":"Leukemia"},{"issue":"11","key":"4246_CR23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0048616","volume":"7","author":"M Neiman","year":"2012","unstructured":"Neiman M, Sundling S, Gr\u00f6nberg H, Hall P, Czene K, Lindberg J, Klevebring D. Library preparation and multiplex capture for massive parallel sequencing applications made efficient and easy. PLOS ONE. 2012;7(11):1\u20136. https:\/\/doi.org\/10.1371\/journal.pone.0048616.","journal-title":"PLOS ONE"},{"issue":"5","key":"4246_CR24","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1101\/gr.128124.111","volume":"22","author":"N Rohland","year":"2012","unstructured":"Rohland N, Reich D. Cost-effective, high-throughput dna sequencing libraries for multiplexed target capture. Genome Res. 2012;22(5):939\u201346. https:\/\/doi.org\/10.1101\/gr.128124.111.","journal-title":"Genome Res"},{"issue":"6","key":"4246_CR25","doi-asserted-by":"publisher","first-page":"1051","DOI":"10.1016\/j.ajhg.2016.04.011","volume":"98","author":"RC Green","year":"2016","unstructured":"Green RC, Goddard KAB, Jarvik GP, Amendola LM, Appelbaum PS, Berg JS, Bernhardt BA, Biesecker LG, Biswas S, Blout CL, Bowling KM, Brothers KB, Burke W, Caga-Anan CF, Chinnaiyan AM, Chung WK, Clayton EW, Cooper GM, East K, Evans JP, Fullerton SM, Garraway LA, Garrett JR, Gray SW, Henderson GE, Hindorff LA, Holm IA, Lewis MH, Hutter CM, Janne PA, Joffe S, Kaufman D, Knoppers BM, Koenig BA, Krantz ID, Manolio TA, McCullough L, McEwen J, McGuire A, Muzny D, Myers RM, Nickerson DA, Ou J, Parsons DW, Petersen GM, Plon SE, Rehm HL, Roberts JS, Robinson D, Salama JS, Scollon S, Sharp RR, Shirts B, Spinner NB, Tabor HK, Tarczy-Hornoch P, Veenstra DL, Wagle N, Weck K, Wilfond BS, Wilhelmsen K, Wolf SM, Wynn J, Yu J-H. Clinical sequencing exploratory research consortium: Accelerating evidence-based practice of genomic medicine. Am J Hum Genet. 2016;98(6):1051\u201366. https:\/\/doi.org\/10.1016\/j.ajhg.2016.04.011.","journal-title":"Am J Hum Genet"},{"issue":"2","key":"4246_CR26","doi-asserted-by":"publisher","first-page":"31","DOI":"10.5808\/GI.2015.13.2.31","volume":"13","author":"K Kim","year":"2015","unstructured":"Kim K, Seong M-W, Chung W-H, Park SS, Leem S, Park W, Kim J, Lee K, Park RW, Kim N. Effect of next-generation exome sequencing depth for discovery of diagnostic variants. Genomics Inform. 2015;13(2):31\u20139. https:\/\/doi.org\/10.5808\/GI.2015.13.2.31.","journal-title":"Genomics Inform"},{"issue":"6","key":"4246_CR27","first-page":"500","volume":"74","author":"AKM Foreman","year":"2013","unstructured":"Foreman AKM, Lee K, Evans JP. The NCGENES project: exploring the new world of genome sequencing. N C Med J. 2013;74(6):500\u20134.","journal-title":"N C Med J"},{"key":"4246_CR28","unstructured":"Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013. arXiv:1303.3997"},{"issue":"19","key":"4246_CR29","doi-asserted-by":"publisher","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","volume":"28","author":"J Koster","year":"2012","unstructured":"Koster J, Rahmann S. Snakemake-a scalable bioinformatics workflow engine. Bioinformatics. 2012;28(19):2520\u20132. https:\/\/doi.org\/10.1093\/bioinformatics\/bts480.","journal-title":"Bioinformatics"},{"issue":"16","key":"4246_CR30","doi-asserted-by":"publisher","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","volume":"25","author":"H Li","year":"2009","unstructured":"Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The sequence alignment\/map format and samtools. Bioinformatics. 2009;25(16):2078\u20139. https:\/\/doi.org\/10.1093\/bioinformatics\/btp352.","journal-title":"Bioinformatics"},{"issue":"1110","key":"4246_CR31","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1002\/0471250953.bi1110s43","volume":"43","author":"GA Van der Auwera","year":"2013","unstructured":"Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, Jordan T, Shakir K, Roazen D, Thibault J, Banks E, Garimella KV, Altshuler D, Gabriel S, DePristo MA. From fastq data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43(1110):11\u2013101111033. https:\/\/doi.org\/10.1002\/0471250953.bi1110s43.","journal-title":"Curr Protoc Bioinformatics"},{"issue":"10","key":"4246_CR32","doi-asserted-by":"publisher","first-page":"1275","DOI":"10.1093\/bioinformatics\/btt143","volume":"29","author":"D Yu","year":"2013","unstructured":"Yu D, Huber W, Vitek O. Shrinkage estimation of dispersion in negative binomial models for rna-seq experiments with small sample size. Bioinformatics. 2013;29(10):1275\u201382. https:\/\/doi.org\/10.1093\/bioinformatics\/btt143.","journal-title":"Bioinformatics"},{"issue":"1","key":"4246_CR33","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","volume":"57","author":"Y Benjamini","year":"1995","unstructured":"Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodol). 1995;57(1):289\u2013300. https:\/\/doi.org\/10.1111\/j.2517-6161.1995.tb02031.x.","journal-title":"J R Stat Soc Ser B (Methodol)"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-04246-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-021-04246-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-021-04246-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,20]],"date-time":"2021-07-20T15:30:15Z","timestamp":1626795015000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-021-04246-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,20]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["4246"],"URL":"https:\/\/doi.org\/10.1186\/s12859-021-04246-w","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,20]]},"assertion":[{"value":"18 January 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 May 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 July 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All human data were collected following all guidelines and regulations with the approval and under the supervision of the UNC Institutional Review Board. All research participants, or participants\u2019 guardians when applicable, received appropriate counseling and provided informed consent to participate in this research. No identifying information or sequence level data are included in this manuscript or accompanying data.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"374"}}