{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T02:50:14Z","timestamp":1777431014464,"version":"3.51.4"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,4,13]],"date-time":"2021-04-13T00:00:00Z","timestamp":1618272000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Hubei Province Funds for Natural Science","award":["2019CFB552"],"award-info":[{"award-number":["2019CFB552"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Bridging heterogeneous mutation data fills in the gap between various data categories and propels discovery of disease-related genes. It is known that genome-wide association study (GWAS) infers significant mutation associations that link genotype and phenotype. However, due to the differences of size and quality between GWAS studies, not all de facto vital variations are able to pass the multiple testing. In the meantime, mutation events widely reported in literature unveil typical functional biological process, including mutation types like gain of function and loss of function. To bring together the heterogeneous mutation data, we propose a \u2018Gene\u2013Disease Association prediction by Mutation Data Bridging (GDAMDB)\u2019 pipeline with a statistic generative model. The model learns the distribution parameters of mutation associations and mutation types and recovers false-negative GWAS mutations that fail to pass significant test but represent supportive evidences of functional biological process in literature. Eventually, we applied GDAMDB in Alzheimer\u2019s disease (AD) and predicted 79 AD-associated genes. Besides, 12 of them from the original GWAS, 60 of them are supported to be AD-related by other GWAS or literature report, and rest of them are newly predicted genes. Our model is capable of enhancing the GWAS-based gene association discovery by well combining text mining results. The positive result indicates that bridging the heterogeneous mutation data is contributory for the novel disease-related gene discovery.<\/jats:p>","DOI":"10.1093\/bib\/bbab079","type":"journal-article","created":{"date-parts":[[2021,2,23]],"date-time":"2021-02-23T15:16:49Z","timestamp":1614093409000},"source":"Crossref","is-referenced-by-count":5,"title":["Bridging heterogeneous mutation data to enhance disease gene discovery"],"prefix":"10.1093","volume":"22","author":[{"given":"Kaiyin","family":"Zhou","sequence":"first","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuxing","family":"Wang","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kevin","family":"Bretonnel Cohen","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jin-Dong","family":"Kim","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaohang","family":"Ma","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhixue","family":"Shen","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiangyu","family":"Meng","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingbo","family":"Xia","sequence":"additional","affiliation":[{"name":"Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Hubei Province, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,4,13]]},"reference":[{"key":"2021090817330307200_ref1","volume-title":"Pattern Recognition and Machine Learning","author":"Bishop","year":"2006"},{"key":"2021090817330307200_ref2","first-page":"993","article-title":"Latent dirichlet allocation","volume-title":"J Mach Learn Res","author":"Blei","year":"2003"},{"issue":"2","key":"2021090817330307200_ref3","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1007\/s00401-018-1928-6","article-title":"Dissecting the genetic relationship between cardiovascular risk factors and Alzheimer\u2019s disease","volume":"137","author":"Broce","year":"2019","journal-title":"Acta Neuropathol"},{"issue":"D1","key":"2021090817330307200_ref4","doi-asserted-by":"crossref","first-page":"D1005","DOI":"10.1093\/nar\/gky1120","article-title":"The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019","volume":"47","author":"Annalisa Buniello","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2021090817330307200_ref5","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1038\/ng.2355","article-title":"Leveraging models of cell regulation and GWAS data in integrative network-based association studies","volume":"44","author":"Califano","year":"2012","journal-title":"Nat Genet"},{"issue":"4","key":"2021090817330307200_ref6","doi-asserted-by":"crossref","first-page":"642","DOI":"10.1093\/bib\/bbv068","article-title":"Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes","volume":"17","author":"Cheng","year":"2016","journal-title":"Brief Bioinform"},{"issue":"733","key":"2021090817330307200_ref7","first-page":"e56","article-title":"The role of tau phosphorylation and cleavage in neuronal cell death","volume":"12","author":"Chun","year":"2007","journal-title":"Front Biosci"},{"issue":"1","key":"2021090817330307200_ref8","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1038\/nm0197-67","article-title":"Mutant presenilins of Alzheimer\u2019s disease increase production of 42-residue amyloid $\\beta $-protein in both transfected cells and transgenic mice","volume":"3","author":"Citron","year":"1997","journal-title":"Nat Med"},{"key":"2021090817330307200_ref9","doi-asserted-by":"crossref","first-page":"231","DOI":"10.3389\/fgene.2012.00231","article-title":"Non-coding RNA and pseudogenes in neurodegenerative diseases:\u201dthe (un) usual suspects\u201d","volume":"3","author":"Costa","year":"2012","journal-title":"Front Genet"},{"issue":"18","key":"2021090817330307200_ref10","doi-asserted-by":"crossref","first-page":"2882","DOI":"10.1093\/bioinformatics\/btx314","article-title":"IGESS: a statistical approach to integrating individual-level genotype data and summary statistics in genome-wide association studies","volume":"33","author":"Dai","year":"2017","journal-title":"Bioinformatics"},{"issue":"2","key":"2021090817330307200_ref11","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1016\/j.nbd.2009.05.015","article-title":"Oxidative damage to the promoter region of sqstm1\/p62 is common to neurodegenerative disease","volume":"35","author":"Du","year":"2009","journal-title":"Neurobiol Dis"},{"issue":"1","key":"2021090817330307200_ref12","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1186\/s13024-017-0184-x","article-title":"Late onset Alzheimer\u2019s disease genetics implicates microglial pathways in disease risk","volume":"12","author":"Efthymiou","year":"2017","journal-title":"Mol Neurodegener"},{"issue":"6","key":"2021090817330307200_ref13","doi-asserted-by":"crossref","first-page":"e94661","DOI":"10.1371\/journal.pone.0094661","article-title":"Gene-wide analysis detects two new susceptibility genes for Alzheimer\u2019s disease","volume":"9","author":"Escott-Price","year":"2014","journal-title":"PLoS One"},{"issue":"2","key":"2021090817330307200_ref14","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1056\/NEJMoa1211851","article-title":"Trem2 variants in Alzheimer\u2019s disease","volume":"368","author":"Guerreiro","year":"2013","journal-title":"N Engl J Med"},{"key":"2021090817330307200_ref15","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1016\/0165-6147(91)90609-V","article-title":"Amyloid deposition as the central event in the aetiology of Alzheimer\u2019s disease","volume":"12","author":"Hardy","year":"1991","journal-title":"Trends Pharmacol Sci"},{"issue":"5","key":"2021090817330307200_ref16","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1038\/ng.803","article-title":"Common variants at abca7, ms4a6a\/ms4a4e, epha1, cd33 and cd2ap are associated with Alzheimer\u2019s disease","volume":"43","author":"Hollingworth","year":"2011","journal-title":"Nat Genet"},{"issue":"7","key":"2021090817330307200_ref17","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1186\/s12918-018-0633-3","article-title":"The emerging role of microrna-4487\/6845-3p in Alzheimer\u2019s disease pathologies is induced by a$\\beta $25\u201335 triggered in sh-sy5y cell","volume":"12","author":"Hu","year":"2018","journal-title":"BMC Syst Biol"},{"issue":"1","key":"2021090817330307200_ref18","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1093\/bioinformatics\/btq615","article-title":"dmgwas: dense module searching for genome-wide association studies in protein\u2013protein interaction networks","volume":"27","author":"Jia","year":"2011","journal-title":"Bioinformatics"},{"issue":"7","key":"2021090817330307200_ref19","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1016\/j.jalz.2016.12.012","article-title":"Transethnic genome-wide scan identifies novel Alzheimer\u2019s disease loci","volume":"13","author":"Jun","year":"2017","journal-title":"Alzheimers Dement"},{"key":"2021090817330307200_ref20","doi-asserted-by":"crossref","first-page":"134176","DOI":"10.1016\/j.neulet.2019.03.048","article-title":"Microrna expression profiles of neuron-derived extracellular vesicles in plasma from patients with amyotrophic lateral sclerosis","volume":"708","author":"Katsu","year":"2019","journal-title":"Neurosci Lett"},{"issue":"2","key":"2021090817330307200_ref21","doi-asserted-by":"crossref","first-page":"476","DOI":"10.1016\/j.expneurol.2011.11.010","article-title":"Mir-106b impairs cholesterol efflux and increases a$\\beta $ levels by repressing abca1 expression","volume":"235","author":"Kim","year":"2012","journal-title":"Exp Neurol"},{"issue":"D1","key":"2021090817330307200_ref22","doi-asserted-by":"crossref","first-page":"D985","DOI":"10.1093\/nar\/gkw1055","article-title":"Open targets: a platform for therapeutic target identification and validation","volume":"45","author":"Koscielny","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2021090817330307200_ref23","first-page":"294629","article-title":"Meta-analysis of genetic association with diagnosed Alzheimer\u2019s disease identifies novel risk loci and implicates abeta, tau, immunity and lipid processing","author":"Kunkle","year":"2018","journal-title":"bioRxiv"},{"issue":"11","key":"2021090817330307200_ref24","doi-asserted-by":"crossref","first-page":"862","DOI":"10.1038\/nrn960","article-title":"Calcium dyshomeostasis and intracellular signalling in Alzheimer\u2019s disease","volume":"3","author":"LaFerla","year":"2002","journal-title":"Nat Rev Neurosci"},{"issue":"12","key":"2021090817330307200_ref25","doi-asserted-by":"crossref","first-page":"1452","DOI":"10.1038\/ng.2802","article-title":"Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer\u2019s disease","volume":"45","author":"Lambert","year":"2013","journal-title":"Nat Genet"},{"issue":"8","key":"2021090817330307200_ref26","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1016\/j.tibs.2019.03.009","article-title":"Gain-of-function mutations: an emerging advantage for cancer biology","volume":"44","author":"Li","year":"2019","journal-title":"Trends Biochem Sci"},{"issue":"11","key":"2021090817330307200_ref27","doi-asserted-by":"crossref","first-page":"3981","DOI":"10.1523\/JNEUROSCI.3617-10.2011","article-title":"Complement receptor 2 is expressed in neural progenitor cells and regulates adult hippocampal neurogenesis","volume":"31","author":"Moriyama","year":"2011","journal-title":"J Neurosci"},{"issue":"10","key":"2021090817330307200_ref28","doi-asserted-by":"crossref","first-page":"1133","DOI":"10.1016\/j.jalz.2017.01.016","article-title":"Systems biology approach to late-onset Alzheimer\u2019s disease genome-wide association study identifies novel candidate genes validated using brain expression data and caenorhabditis elegans experiments","volume":"13","author":"Shubhabrata Mukherjee","year":"2017","journal-title":"Alzheimers Dement"},{"issue":"4","key":"2021090817330307200_ref29","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1016\/j.biopsych.2017.05.014","article-title":"Untangling genetic risk for Alzheimer\u2019s disease","volume":"83","author":"Pimenova","year":"2018","journal-title":"Biol Psychiatry"},{"key":"2021090817330307200_ref30","first-page":"D833","article-title":"DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants","volume-title":"Nucleic Acids Res","author":"Pi\u00f1ero","year":"2017"},{"issue":"1","key":"2021090817330307200_ref31","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1002\/0471250953.bi1112s47","article-title":"Bedtools: the swiss-army tool for genome feature analysis","volume":"47","author":"Quinlan","year":"2014","journal-title":"Curr Protoc Bioinformatics"},{"issue":"2","key":"2021090817330307200_ref32","doi-asserted-by":"crossref","first-page":"e358","DOI":"10.1038\/tp.2014.2","article-title":"Follow-up of loci from the international genomics of Alzheimer\u2019s disease project identifies trip4 as a novel susceptibility gene","volume":"4","author":"Ruiz","year":"2014","journal-title":"Transl Psychiatry"},{"issue":"20","key":"2021090817330307200_ref33","doi-asserted-by":"crossref","first-page":"6691","DOI":"10.1093\/nar\/gkp724","article-title":"Gsk3$\\beta $ is a negative regulator of the transcriptional coactivator maml1","volume":"37","author":"Ribeiro","year":"2009","journal-title":"Nucleic Acids Res"},{"issue":"6738","key":"2021090817330307200_ref34","doi-asserted-by":"crossref","first-page":"A23","DOI":"10.1038\/399a023","article-title":"Translating cell biology into therapeutic advances in Alzheimer\u2019s disease","volume":"399","author":"Selkoe","year":"1999","journal-title":"Nature"},{"key":"2021090817330307200_ref35","doi-asserted-by":"crossref","first-page":"227","DOI":"10.3389\/fnmol.2017.00227","article-title":"Identification of the role of mir-142-5p in Alzheimer\u2019s disease by comparative bioinformatics and cellular analysis","volume":"10","author":"Song","year":"2017","journal-title":"Front Mol Neurosci"},{"issue":"5","key":"2021090817330307200_ref36","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1038\/ng.3246","article-title":"Loss-of-function variants in abca7 confer risk of Alzheimer\u2019s disease","volume":"47","author":"Steinberg","year":"2015","journal-title":"Nat Genet"},{"issue":"5","key":"2021090817330307200_ref37","doi-asserted-by":"crossref","first-page":"1776","DOI":"10.1093\/bib\/bbz115","article-title":"Optimization of in silico tools for predicting genetic variants: individualizing for genes with molecular sub-regional stratification","volume":"21","author":"Tang","year":"2020","journal-title":"Brief Bioinform"},{"issue":"2","key":"2021090817330307200_ref38","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1016\/j.cell.2011.09.029","article-title":"Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs","volume":"147","author":"Tay","year":"2011","journal-title":"Cell"},{"issue":"18","key":"2021090817330307200_ref39","doi-asserted-by":"crossref","first-page":"2883","DOI":"10.1093\/bioinformatics\/btw234","article-title":"Seth detects and normalizes genetic variants in text","volume":"32","author":"Thomas","year":"2016","journal-title":"Bioinformatics"},{"issue":"12","key":"2021090817330307200_ref40","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1038\/nrg2884","article-title":"Analysing biological pathways in genome-wide association studies","volume":"11","author":"Wang","year":"2010","journal-title":"Nat Rev Genet"},{"key":"2021090817330307200_ref41","first-page":"8030369","article-title":"Profiling microrna from brain by microarray in a transgenic mouse model of Alzheimer\u2019s disease","volume":"2017","author":"Wang","year":"2017","journal-title":"BioMed Res Int"},{"key":"2021090817330307200_ref42","doi-asserted-by":"crossref","first-page":"62","DOI":"10.18653\/v1\/D19-5710","article-title":"An overview of the active gene annotation corpus and the BioNLP ost 2019 AGAC track tasks","volume-title":"Proceedings of the 5th Workshop on BioNLP Open Shared Tasks","author":"Wang","year":"2019"},{"key":"2021090817330307200_ref43","doi-asserted-by":"crossref","DOI":"10.1109\/BIBM47256.2019.8983031","article-title":"An active gene annotation corpus and its application on anti-epilepsy drug discovery","volume-title":"BIBM 2019: International Conference on Bioinformatics & Biomedicine","author":"Wang","year":"2019"},{"issue":"12","key":"2021090817330307200_ref44","doi-asserted-by":"crossref","first-page":"1080","DOI":"10.1038\/nbt.2758","article-title":"Rational drug repositioning by medical genetics","volume":"31","author":"Wang","year":"2013","journal-title":"Nat Biotechnol"},{"issue":"2","key":"2021090817330307200_ref45","doi-asserted-by":"crossref","first-page":"249","DOI":"10.3233\/JAD-2010-101515","article-title":"Heritability of different forms of memory in the late onset Alzheimer\u2019s disease family study","volume":"23","author":"Wilson","year":"2011","journal-title":"J Alzheimers Dis"},{"issue":"2","key":"2021090817330307200_ref46","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1016\/j.nbd.2005.07.004","article-title":"Differences in apolipoprotein e3\/3 and e4\/4 allele-specific gene expression in hippocampus in Alzheimer disease","volume":"21","author":"Xu","year":"2006","journal-title":"Neurobiol Dis"},{"issue":"30","key":"2021090817330307200_ref47","doi-asserted-by":"crossref","first-page":"15753","DOI":"10.1074\/jbc.M116.718023","article-title":"The endosome-associated deubiquitinating enzyme usp8 regulates bace1 enzyme ubiquitination and degradation","volume":"291","author":"Yeates","year":"2016","journal-title":"J Biol Chem"},{"issue":"25","key":"2021090817330307200_ref48","doi-asserted-by":"crossref","first-page":"10601","DOI":"10.1073\/pnas.0701096104","article-title":"Attenuation of neuroinflammation and Alzheimer\u2019s disease pathology by liver x receptors","volume":"104","author":"Zelcer","year":"2007","journal-title":"Proc Natl Acad Sci"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab079\/40260782\/bbab079.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/5\/bbab079\/40260782\/bbab079.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T15:19:56Z","timestamp":1631114396000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab079\/6224263"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,13]]},"references-count":48,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,9,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab079","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-44127\/v1","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9]]},"published":{"date-parts":[[2021,4,13]]},"article-number":"bbab079"}}