{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:08Z","timestamp":1772138048051,"version":"3.50.1"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2021,7,16]],"date-time":"2021-07-16T00:00:00Z","timestamp":1626393600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>Codes (https:\/\/github.com\/KyoungYeulLee\/3Cnet\/) and data (https:\/\/zenodo.org\/record\/4716879#.YIO-xqkzZH1) are freely available to non-commercial users.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab529","type":"journal-article","created":{"date-parts":[[2021,7,15]],"date-time":"2021-07-15T15:11:49Z","timestamp":1626361909000},"page":"4626-4634","source":"Crossref","is-referenced-by-count":35,"title":["3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints"],"prefix":"10.1093","volume":"37","author":[{"given":"Dhong-Gun","family":"Won","sequence":"first","affiliation":[{"name":"Research and Development Center , 3billion, Seoul 06193, Republic of Korea"}]},{"given":"Dong-Wook","family":"Kim","sequence":"additional","affiliation":[{"name":"Research and Development Center , 3billion, Seoul 06193, Republic of Korea"}]},{"given":"Junwoo","family":"Woo","sequence":"additional","affiliation":[{"name":"Research and Development Center , 3billion, Seoul 06193, Republic of Korea"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0572-4289","authenticated-orcid":false,"given":"Kyoungyeul","family":"Lee","sequence":"additional","affiliation":[{"name":"Research and Development Center , 3billion, Seoul 06193, Republic of Korea"}]}],"member":"286","published-online":{"date-parts":[[2021,7,16]]},"reference":[{"key":"2023051607130473900_btab529-B1","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1038\/nmeth0410-248","article-title":"A method and server for predicting damaging missense mutations","volume":"7","author":"Adzhubei","year":"2010","journal-title":"Nat. Methods"},{"key":"2023051607130473900_btab529-B2","doi-asserted-by":"crossref","first-page":"1.2.1","DOI":"10.1002\/cpbi.27","article-title":"Searching Online Mendelian Inheritance in Man (OMIM): a knowledgebase of human genes and genetic phenotypes","volume":"58","author":"Amberger","year":"2017","journal-title":"Curr. Protoc. Bioinf"},{"key":"2023051607130473900_btab529-B3","doi-asserted-by":"crossref","first-page":"1067","DOI":"10.1016\/j.ajhg.2016.03.024","article-title":"Performance of ACMG-AMP variant-interpretation guidelines among nine laboratories in the clinical sequencing exploratory research consortium","volume":"98","author":"Amendola","year":"2016","journal-title":"Am. J. Hum. Genet"},{"key":"2023051607130473900_btab529-B4","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"Auton","year":"2015","journal-title":"Nature"},{"key":"2023051607130473900_btab529-B5","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1016\/S0895-4356(03)00207-5","article-title":"External validation is necessary in prediction research: a clinical example","volume":"56","author":"Bleeker","year":"2003","journal-title":"J. Clin. Epidemiol"},{"key":"2023051607130473900_btab529-B6","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1186\/1471-2164-14-S3-S3","article-title":"Identifying Mendelian disease genes with the variant effect scoring tool","volume":"14","author":"Carter","year":"2013","journal-title":"BMC Genomics"},{"key":"2023051607130473900_btab529-B7","doi-asserted-by":"crossref","first-page":"e1001091","DOI":"10.1371\/journal.pbio.1001091","article-title":"Modernizing reference genome assemblies","volume":"9","author":"Church","year":"2011","journal-title":"PLoS Biol"},{"key":"2023051607130473900_btab529-B8","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1002\/humu.22981","article-title":"HGVS recommendations for the description of sequence variants: 2016 Update","volume":"37","author":"den Dunnen","year":"2016","journal-title":"Hum. Mutat"},{"key":"2023051607130473900_btab529-B9","doi-asserted-by":"crossref","first-page":"562","DOI":"10.3389\/fneur.2019.00562","article-title":"Identification of cellular pathogenicity markers for SIL1 mutations linked to marinesco-sj\u00f6gren syndrome","volume":"10","author":"Gatz","year":"2019","journal-title":"Front. Neurol"},{"key":"2023051607130473900_btab529-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-017-1353-5","article-title":"Evaluation of in silico algorithms for use with ACMG\/AMP clinical variant interpretation guidelines","volume":"18","author":"Ghosh","year":"2017","journal-title":"Genome Biol"},{"key":"2023051607130473900_btab529-B11","doi-asserted-by":"crossref","first-page":"490","DOI":"10.1038\/ejhg.2011.258","article-title":"Disease gene identification strategies for exome sequencing","volume":"20","author":"Gilissen","year":"2012","journal-title":"Eur. J. Hum. Genet"},{"key":"2023051607130473900_btab529-B12","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1002\/humu.22768","article-title":"The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity","volume":"36","author":"Grimm","year":"2015","journal-title":"Hum. Mutat"},{"key":"2023051607130473900_btab529-B13","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"key":"2023051607130473900_btab529-B14","doi-asserted-by":"crossref","first-page":"1262","DOI":"10.1126\/science.aac9396","article-title":"De novo mutations in congenital heart disease with neurodevelopmental and other congenital anomalies","volume":"350","author":"Homsy","year":"2015","journal-title":"Science"},{"key":"2023051607130473900_btab529-B15","doi-asserted-by":"crossref","first-page":"877","DOI":"10.1016\/j.ajhg.2016.08.016","article-title":"REVEL: an ensemble method for predicting the pathogenicity of rare missense variants","volume":"99","author":"Ioannidis","year":"2016","journal-title":"Am. J. Hum. Genet"},{"key":"2023051607130473900_btab529-B16","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1038\/nature13908","article-title":"The contribution of de novo coding mutations to autism spectrum disorder","volume":"515","author":"Iossifov","year":"2014","journal-title":"Nature"},{"key":"2023051607130473900_btab529-B17","doi-asserted-by":"crossref","first-page":"1593","DOI":"10.1038\/ng.3970","article-title":"Contribution of rare inherited and de novo variants in 2,871 congenital heart disease probands","volume":"49","author":"Jin","year":"2017","journal-title":"Nat. Genet"},{"key":"2023051607130473900_btab529-B18","doi-asserted-by":"crossref","first-page":"D840","DOI":"10.1093\/nar\/gkw971","article-title":"The ExAC browser: displaying reference data information from over 60 000 exomes","volume":"45","author":"Karczewski","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051607130473900_btab529-B19","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1038\/nprot.2009.86","article-title":"Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm","volume":"4","author":"Kumar","year":"2009","journal-title":"Nat. Protoc"},{"key":"2023051607130473900_btab529-B20","doi-asserted-by":"crossref","first-page":"D862","DOI":"10.1093\/nar\/gkv1222","article-title":"ClinVar: public archive of interpretations of clinically relevant variants","volume":"44","author":"Landrum","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023051607130473900_btab529-B21","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1038\/nature19057","article-title":"Analysis of protein-coding genetic variation in 60,706 humans","volume":"536","author":"Lek","year":"2016","journal-title":"Nature"},{"key":"2023051607130473900_btab529-B22","doi-asserted-by":"crossref","first-page":"894","DOI":"10.1002\/humu.21517","article-title":"dbNSFP: A lightweight database of human nonsynonymous SNPs and their functional predictions","volume":"32","author":"Liu","year":"2011","journal-title":"Hum. Mutat"},{"key":"2023051607130473900_btab529-B23","doi-asserted-by":"crossref","first-page":"W99","DOI":"10.1093\/nar\/gkz411","article-title":"Simple ClinVar: an interactive web server to explore and retrieve gene and disease variants aggregated in ClinVar database","volume":"47","author":"P\u00e9rez-Palma","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023051607130473900_btab529-B24","doi-asserted-by":"crossref","first-page":"D501","DOI":"10.1093\/nar\/gki025","article-title":"NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins","volume":"33","author":"Pruitt","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023051607130473900_btab529-B25","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1093\/bioinformatics\/btu703","article-title":"DANN: a deep learning approach for annotating the pathogenicity of genetic variants","volume":"31","author":"Quang","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051607130473900_btab529-B26","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1038\/nmeth.1818","article-title":"HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment","volume":"9","author":"Remmert","year":"2011","journal-title":"Nat. Methods"},{"key":"2023051607130473900_btab529-B27","doi-asserted-by":"crossref","first-page":"D886","DOI":"10.1093\/nar\/gky1016","article-title":"CADD: predicting the deleteriousness of variants throughout the human genome","volume":"47","author":"Rentzsch","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023051607130473900_btab529-B28","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1038\/gim.2015.30","article-title":"Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology","volume":"17","author":"Richards","year":"2015","journal-title":"Genet. Med"},{"key":"2023051607130473900_btab529-B29","article-title":"An overview of multi-task learning in deep neural networks","author":"Ruder","year":"2017"},{"key":"2023051607130473900_btab529-B30","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1111\/cge.13848","article-title":"Diagnostic yield and clinical utility of whole exome sequencing using an automated variant prioritization system, EVIDENCE","volume":"98","author":"Seo","year":"2020","journal-title":"Clin. Genet"},{"key":"2023051607130473900_btab529-B31","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1002\/humu.22225","article-title":"Predicting the functional, molecular, and phenotypic consequences of amino acid substitutions using hidden Markov models","volume":"34","author":"Shihab","year":"2013","journal-title":"Hum. Mutat"},{"key":"2023051607130473900_btab529-B32","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1038\/gim.2015.180","article-title":"Exploring the landscape of pathogenic genetic variation in the ExAC population database: insights of relevance to variant classification","volume":"18","author":"Song","year":"2016","journal-title":"Genet. Med"},{"key":"2023051607130473900_btab529-B33","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1002\/humu.10212","article-title":"Human Gene Mutation Database (HGMD): 2003 update","volume":"21","author":"Stenson","year":"2003","journal-title":"Hum. Mutat"},{"key":"2023051607130473900_btab529-B34","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1038\/s41588-018-0167-z","article-title":"Predicting the clinical impact of human mutation with deep neural networks","volume":"50","author":"Sundaram","year":"2018","journal-title":"Nat. Genet"},{"key":"2023051607130473900_btab529-B35","doi-asserted-by":"crossref","first-page":"1282","DOI":"10.1093\/bioinformatics\/btm098","article-title":"UniRef: comprehensive and non-redundant UniProt reference clusters","volume":"23","author":"Suzek","year":"2007","journal-title":"Bioinformatics"},{"key":"2023051607130473900_btab529-B36","article-title":"Improving deep learning using generic data augmentation","author":"Taylor","year":"2017"},{"key":"2023051607130473900_btab529-B37","first-page":"5999","article-title":"Attention is all you need","volume":"2017","author":"Vaswani","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst"},{"key":"2023051607130473900_btab529-B38","doi-asserted-by":"crossref","first-page":"2147","DOI":"10.1093\/bioinformatics\/btr357","article-title":"CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer","volume":"27","author":"Wong","year":"2011","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab529\/39766412\/btab529.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4626\/50334689\/btab529.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4626\/50334689\/btab529.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T03:40:15Z","timestamp":1684208415000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/24\/4626\/6322986"}},"subtitle":[],"editor":[{"given":"Tobias","family":"Marschall","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,7,16]]},"references-count":38,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2021,12,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab529","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.09.27.302927","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,12,15]]},"published":{"date-parts":[[2021,7,16]]}}}