{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:25:33Z","timestamp":1772166333820,"version":"3.50.1"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,3,9]],"date-time":"2021-03-09T00:00:00Z","timestamp":1615248000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2021,3,9]],"date-time":"2021-03-09T00:00:00Z","timestamp":1615248000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Introduction<\/jats:title>\n                    <jats:p>Twin studies indicate that a substantial fraction of ovarian cancers should be predictable from genetic testing. Genetic risk scores can stratify women into different classes of risk. Higher risk women can be treated or screened for ovarian cancer, which should reduce ovarian cancer death rates. However, current ovarian cancer genetic risk scores do not work that well. We developed a genetic risk score based on variations in the length of chromosomes.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>We evaluated this genetic risk score using data collected by The Cancer Genome Atlas. We synthesized a dataset of 414 women who had ovarian serous carcinoma and 4225 women who had no form of ovarian cancer. We characterized each woman by 22 numbers, representing the length of each chromosome in their germ line DNA. We used a gradient boosting machine to build a classifier that can predict whether a woman had been diagnosed with ovarian cancer.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>The genetic risk score based on chromosomal-scale length variation could stratify women such that the highest 20% had a 160x risk (95% confidence interval 50x-450x) compared to the lowest 20%. The genetic risk score we developed had an area under the curve of the receiver operating characteristic curve of 0.88 (95% confidence interval 0.86\u20130.91).<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>A genetic risk score based on chromosomal-scale length variation of germ line DNA provides an effective means of predicting whether or not a woman will develop ovarian cancer.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s13040-021-00253-y","type":"journal-article","created":{"date-parts":[[2021,3,9]],"date-time":"2021-03-09T06:03:46Z","timestamp":1615269826000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Genetic risk score for ovarian cancer based on chromosomal-scale length variation"],"prefix":"10.1186","volume":"14","author":[{"given":"Christopher","family":"Toh","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7995-5197","authenticated-orcid":false,"given":"James P.","family":"Brody","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,3,9]]},"reference":[{"key":"253_CR1","doi-asserted-by":"publisher","first-page":"394","DOI":"10.3322\/caac.21492","volume":"68","author":"F Bray","year":"2018","unstructured":"Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394\u2013424. https:\/\/doi.org\/10.3322\/caac.21492.","journal-title":"CA Cancer J Clin"},{"key":"253_CR2","doi-asserted-by":"publisher","first-page":"284","DOI":"10.3322\/caac.21456","volume":"68","author":"LA Torre","year":"2018","unstructured":"Torre LA, Trabert B, DeSantis CE, Miller KD, Samimi G, Runowicz CD, et al. Ovarian cancer statistics, 2018. CA Cancer J Clin. 2018;68:284\u201396. https:\/\/doi.org\/10.3322\/caac.21456.","journal-title":"CA Cancer J Clin"},{"key":"253_CR3","doi-asserted-by":"publisher","unstructured":"Bast RC. Status of Tumor Markers in Ovarian Cancer Screening. Journal of Clinical Oncology. 2003;21: 200s\u20132205. doi:https:\/\/doi.org\/10.1200\/JCO.2003.01.068","DOI":"10.1200\/JCO.2003.01.068"},{"key":"253_CR4","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1016\/J.BPOBGYN.2016.10.017","volume":"41","author":"L Andrews","year":"2017","unstructured":"Andrews L, Mutch DG. Hereditary ovarian Cancer and risk reduction. Best Pract Res Clin Obstet Gynaecol. 2017;41:31\u201348. https:\/\/doi.org\/10.1016\/J.BPOBGYN.2016.10.017.","journal-title":"Best Pract Res Clin Obstet Gynaecol"},{"key":"253_CR5","doi-asserted-by":"publisher","unstructured":"Grossman DC, Curry SJ, Owens DK, Barry MJ, Davidson KW, Doubeni CA, et al. Screening for ovarian cancer US preventive services task force recommendation statement. JAMA. 2018. https:\/\/doi.org\/10.1001\/jama.2017.21926.","DOI":"10.1001\/jama.2017.21926"},{"key":"253_CR6","doi-asserted-by":"publisher","unstructured":"Trimbos JB. Surgical treatment of early-stage ovarian cancer. Best Pract Res. 2017. https:\/\/doi.org\/10.1016\/j.bpobgyn.2016.10.001.","DOI":"10.1016\/j.bpobgyn.2016.10.001"},{"key":"253_CR7","doi-asserted-by":"publisher","unstructured":"Bast RC, Lu Z, Han CY, Lu KH, Anderson KS, Drescher CW, et al. Biomarkers and Strategies for Early Detection of Ovarian Cancer. Cancer Epidemiology Biomarkers &amp; Prevention. 2020; cebp.1057.2020. doi:https:\/\/doi.org\/10.1158\/1055-9965.EPI-20-1057","DOI":"10.1158\/1055-9965.EPI-20-1057"},{"key":"253_CR8","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1001\/jama.2015.17703","volume":"315","author":"LA Mucci","year":"2016","unstructured":"Mucci LA, Hjelmborg JB, Harris JR, Czene K, Havelick DJ, Scheike T, et al. Familial risk and heritability of Cancer among twins in Nordic countries. JAMA. 2016;315:68\u201376. https:\/\/doi.org\/10.1001\/jama.2015.17703.","journal-title":"JAMA."},{"key":"253_CR9","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1097\/01.gim.0000229689.18263.f4","volume":"8","author":"ACJW Janssens","year":"2006","unstructured":"Janssens ACJW, Aulchenko YS, Elefante S, Borsboom GJJM, Steyerberg EW, van Duijn CM. Predictive testing for complex diseases using multiple genes: fact or fiction? Genet Med. 2006;8:395\u2013400. https:\/\/doi.org\/10.1097\/01.gim.0000229689.18263.f4.","journal-title":"Genet Med"},{"key":"253_CR10","doi-asserted-by":"publisher","first-page":"R166","DOI":"10.1093\/hmg\/ddn250","volume":"17","author":"ACJW Janssens","year":"2008","unstructured":"Janssens ACJW, van Duijn CM. Genome-based prediction of common diseases: advances and prospects. Hum Mol Genet. 2008;17:R166\u201373. https:\/\/doi.org\/10.1093\/hmg\/ddn250.","journal-title":"Hum Mol Genet"},{"key":"253_CR11","doi-asserted-by":"publisher","unstructured":"Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat Rev Genet. 2018. https:\/\/doi.org\/10.1038\/s41576-018-0018-x.","DOI":"10.1038\/s41576-018-0018-x"},{"key":"253_CR12","doi-asserted-by":"publisher","unstructured":"Lambert SA, Abraham G, Inouye M. Towards clinical utility of polygenic risk scores. Hum Mol Genet. 2019. https:\/\/doi.org\/10.1093\/hmg\/ddz187.","DOI":"10.1093\/hmg\/ddz187"},{"key":"253_CR13","doi-asserted-by":"publisher","first-page":"1219","DOI":"10.1038\/s41588-018-0183-z","volume":"50","author":"AV Khera","year":"2018","unstructured":"Khera AV, Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50:1219\u201324. https:\/\/doi.org\/10.1038\/s41588-018-0183-z.","journal-title":"Nat Genet"},{"key":"253_CR14","doi-asserted-by":"publisher","first-page":"362","DOI":"10.1038\/ng.2564","volume":"45","author":"PDP Pharoah","year":"2013","unstructured":"Pharoah PDP, Tsai Y-Y, Ramus SJ, Phelan CM, Goode EL, Lawrenson K, et al. GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer. Nat Genet. 2013;45:362\u201370. https:\/\/doi.org\/10.1038\/ng.2564.","journal-title":"Nat Genet"},{"key":"253_CR15","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1038\/ng.3185","volume":"47","author":"KB Kuchenbaecker","year":"2015","unstructured":"Kuchenbaecker KB, Ramus SJ, Tyrer J, Lee A, Shen HC, Beesley J, et al. Identification of six new susceptibility loci for invasive epithelial ovarian cancer. Nat Genet. 2015;47:164\u201371. https:\/\/doi.org\/10.1038\/ng.3185.","journal-title":"Nat Genet"},{"key":"253_CR16","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1186\/s13073-020-00742-5","volume":"12","author":"CM Lewis","year":"2020","unstructured":"Lewis CM, Vassos E. Polygenic risk scores: from research tools to clinical instruments. Genome Med. 2020;12:44. https:\/\/doi.org\/10.1186\/s13073-020-00742-5.","journal-title":"Genome Med"},{"key":"253_CR17","doi-asserted-by":"publisher","unstructured":"Goode EL, Chenevix-Trench G, Song H, Ramus SJ, Notaridou M, Lawrenson K, et al. A genome-wide association study identifies susceptibility loci for ovarian cancer at 2q31 and 8q24. Nat Genet. 2010. https:\/\/doi.org\/10.1038\/ng.668.","DOI":"10.1038\/ng.668"},{"key":"253_CR18","doi-asserted-by":"publisher","first-page":"1113","DOI":"10.1038\/ng.2764","volume":"45","author":"JN Weinstein","year":"2013","unstructured":"Weinstein JN, Collisson EA, Mills GB, Shaw KRM, Ozenberger BA, Ellrott K, et al. The Cancer genome atlas pan-Cancer analysis project. Nat Genet. 2013;45:1113\u201320. https:\/\/doi.org\/10.1038\/ng.2764.","journal-title":"Nat Genet"},{"key":"253_CR19","doi-asserted-by":"publisher","first-page":"609","DOI":"10.1038\/nature10166","volume":"474","author":"D Bell","year":"2011","unstructured":"Bell D, Berchuck A, Birrer M, Chien J, Cramer DW, Dao F, et al. Integrated genomic analyses of ovarian carcinoma. Nature. 2011;474:609\u201315. https:\/\/doi.org\/10.1038\/nature10166.","journal-title":"Nature."},{"key":"253_CR20","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1016\/j.cell.2018.03.042","volume":"173","author":"C Hutter","year":"2018","unstructured":"Hutter C, Zenklusen JC. The Cancer genome atlas: creating lasting value beyond its data. Cell. 2018;173:283\u20135. https:\/\/doi.org\/10.1016\/j.cell.2018.03.042.","journal-title":"Cell."},{"key":"253_CR21","unstructured":"Copy Number Variation Analysis Pipeline. [cited 18 Jan 2018]. Available: https:\/\/docs.gdc.cancer.gov\/Data\/Bioinformatics_Pipelines\/CNV_Pipeline\/"},{"key":"253_CR22","doi-asserted-by":"publisher","first-page":"1253","DOI":"10.1038\/ng.237","volume":"40","author":"JM Korn","year":"2008","unstructured":"Korn JM, Kuruvilla FG, McCarroll SA, Wysoker A, Nemesh J, Cawley S, et al. Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nat Genet. 2008;40:1253\u201360. https:\/\/doi.org\/10.1038\/ng.237.","journal-title":"Nat Genet"},{"key":"253_CR23","doi-asserted-by":"publisher","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","volume":"5","author":"AB Olshen","year":"2004","unstructured":"Olshen AB, Venkatraman ES, Lucito R, Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004;5:557\u201372. https:\/\/doi.org\/10.1093\/biostatistics\/kxh008.","journal-title":"Biostatistics."},{"key":"253_CR24","unstructured":"National Cancer Institute Genomic Data Commons. [cited 18 Jan 2018]. Available: https:\/\/gdc.cancer.gov\/"},{"key":"253_CR25","doi-asserted-by":"publisher","first-page":"e7","DOI":"10.1158\/0008-5472.CAN-17-0617","volume":"77","author":"SM Reynolds","year":"2017","unstructured":"Reynolds SM, Miller M, Lee P, Leinonen K, Paquette SM, Rodebaugh Z, et al. The ISB Cancer genomics cloud: a flexible cloud-based platform for Cancer genomics research. Cancer Res. 2017;77:e7\u2013e10. https:\/\/doi.org\/10.1158\/0008-5472.CAN-17-0617.","journal-title":"Cancer Res"},{"key":"253_CR26","unstructured":"Gijsbers P, LeDell E, Thomas J, Poirier S, Bischl B, Vanschoren J. An Open Source AutoML Benchmark. 6th ICML Workshop on Automated Machine Learning. 2019. Available: https:\/\/arxiv.org\/pdf\/1907.00909.pdf"},{"key":"253_CR27","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","volume":"38","author":"JH Friedman","year":"2002","unstructured":"Friedman JH. Stochastic gradient boosting. Comput Stat Data Anal. 2002;38:367\u201378. https:\/\/doi.org\/10.1016\/S0167-9473(01)00065-2.","journal-title":"Comput Stat Data Anal"},{"key":"253_CR28","unstructured":"Tenny S, Hoffman MR. Odds ratio (OR). StatPearls. StatPearls publishing; 2020. Available: http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/28613750."},{"key":"253_CR29","unstructured":"Lundberg SM, Lee SI. A unified approach to interpreting model predictions. Adv Neural Inf Proces Syst. 2017."},{"key":"253_CR30","unstructured":"Zhang B. Colorectal cancer predictive test using germ-line DNA data and multiple machine learning methods. 2019. Available: https:\/\/escholarship.org\/uc\/item\/44f3f487"},{"key":"253_CR31","unstructured":"Olson RS, Cava W, Mustahsan Z, Varik A, Moore JH. Data-driven advice for applying machine learning to bioinformatics problems. Pacific symposium on Biocomputing Pacific symposium on Biocomputing. 2018;23: 192\u2013203. Available: http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/29218881."},{"key":"253_CR32","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1038\/nature11412","volume":"490","author":"DC Koboldt","year":"2012","unstructured":"Koboldt DC, Fulton RS, McLellan MD, Schmidt H, Kalicki-Veizer J, McMichael JF, et al. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490:61\u201370. https:\/\/doi.org\/10.1038\/nature11412.","journal-title":"Nature."},{"key":"253_CR33","unstructured":"Abeliovich D, Kaduri L, Lerer I, Weinberg N, Amir G, Sagi M, et al. The founder mutations 185delAG and 5382insC in BRCA1 and 6174delT in BRCA2 appear in 60% of ovarian cancer and 30% of early-onset breast cancer patients among Ashkenazi women. Am J Hum Genet. 1997."},{"key":"253_CR34","doi-asserted-by":"publisher","unstructured":"Efron B. Prediction, estimation, and attribution. J Am Stat Assoc. 2020. https:\/\/doi.org\/10.1080\/01621459.2020.1762613.","DOI":"10.1080\/01621459.2020.1762613"},{"key":"253_CR35","doi-asserted-by":"publisher","unstructured":"Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006. https:\/\/doi.org\/10.1038\/ng1847.","DOI":"10.1038\/ng1847"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00253-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13040-021-00253-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13040-021-00253-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,9]],"date-time":"2021-03-09T06:06:35Z","timestamp":1615269995000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/s13040-021-00253-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,9]]},"references-count":35,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["253"],"URL":"https:\/\/doi.org\/10.1186\/s13040-021-00253-y","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.07.18.20156976","asserted-by":"object"}]},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,9]]},"assertion":[{"value":"3 November 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 February 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 March 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Ethics approval for the initial data collection was obtained by the TCGA project. The work presented here is based on anonymized data and is thus not considered human subjects research.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not Applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors\u2019 do not have any competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"18"}}