{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T20:39:43Z","timestamp":1774903183080,"version":"3.50.1"},"reference-count":15,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T00:00:00Z","timestamp":1593561600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T00:00:00Z","timestamp":1593561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n<jats:title>Background<\/jats:title>\n<jats:p>Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Results<\/jats:title>\n<jats:p>Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Conclusions<\/jats:title>\n<jats:p>We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs\/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting.<\/jats:p>\n<\/jats:sec>","DOI":"10.1186\/s12859-020-03609-z","type":"journal-article","created":{"date-parts":[[2020,7,1]],"date-time":"2020-07-01T11:45:41Z","timestamp":1593603941000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Loqusdb: added value of an observations database of local genomic variation"],"prefix":"10.1186","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0001-1047","authenticated-orcid":false,"given":"M\u00e5ns","family":"Magnusson","sequence":"first","affiliation":[]},{"given":"Jesper","family":"Eisfeldt","sequence":"additional","affiliation":[]},{"given":"Daniel","family":"Nilsson","sequence":"additional","affiliation":[]},{"given":"Adam","family":"Rosenbaum","sequence":"additional","affiliation":[]},{"given":"Valtteri","family":"Wirta","sequence":"additional","affiliation":[]},{"given":"Anna","family":"Lindstrand","sequence":"additional","affiliation":[]},{"given":"Anna","family":"Wedell","sequence":"additional","affiliation":[]},{"given":"Henrik","family":"Stranneheim","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,7,1]]},"reference":[{"issue":"7616","key":"3609_CR1","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1038\/nature19057","volume":"536","author":"M Lek","year":"2016","unstructured":"Exome Aggregation Consortium, Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, O\u2019Donnell-Luria AH, Ware JS, Hill AJ, Cummings BB, Tukiainen T, Birnbaum DP, Kosmicki JA, Duncan LE, Estrada K, Zhao F, Zou J, Pierce-Hoffman E, Berghout J, Cooper DN, Deflaux N, DePristo M, Do R, Flannick J, Fromer M, Gauthier L, Goldstein J, Gupta N, Howrigan D, Kiezun A, Kurki MI, Moonshine AL, Natarajan P, Orozco L, Peloso GM, Poplin R, Rivas MA, Ruano-Rubio V, Rose SA, Ruderfer DM, Shakir K, Stenson PD, Stevens C, Thomas BP, Tiao G, Tusie-Luna MT, Weisburd B, Won H-H, Yu D, Altshuler DM, Ardissino D, Boehnke M, Danesh J, Donnelly S, Elosua R, Florez JC, Gabriel SB, Getz G, Glatt SJ, Hultman CM, Kathiresan S, Laakso M, McCarroll S, McCarthy MI, McGovern D, McPherson R, Neale BM, Palotie A, Purcell SM, Saleheen D, Scharf JM, Sklar P, Sullivan PF, Tuomilehto J, Tsuang MT, Watkins HC, Wilson JG, Daly MJ, MacArthur DG. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016; 536(7616):285\u201391. https:\/\/doi.org\/10.1038\/nature19057.","journal-title":"Nature"},{"key":"3609_CR2","doi-asserted-by":"publisher","first-page":"434","DOI":"10.1038\/s41586-020-2308-7","volume":"581","author":"KJ Karczewski","year":"2020","unstructured":"Karczewski KJ, Francioli LC, Tiao G, et al.The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020; 581:434\u201343. https:\/\/doi.org\/10.1038\/s41586-020-2308-7.","journal-title":"Nature"},{"key":"3609_CR3","doi-asserted-by":"publisher","first-page":"986","DOI":"10.1093\/nar\/gkt958","volume":"42(Database iss","author":"JR MacDonald","year":"2014","unstructured":"MacDonald JR, Ziman R, Yuen RKC, Feuk L, Scherer SW. The Database of Genomic Variants: A curated collection of structural variation in the human genome. Nucleic Acids Res. 2014; 42(Database issue):986\u201392. https:\/\/doi.org\/10.1093\/nar\/gkt958.","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"3609_CR4","doi-asserted-by":"publisher","first-page":"524","DOI":"10.1016\/j.ajhg.2009.03.010","volume":"84","author":"HV Firth","year":"2009","unstructured":"Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, Rajan D, Vooren SV, Moreau Y, Pettett RM, Carter NP. DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am J Hum Genet. 2009; 84(4):524\u201333. https:\/\/doi.org\/10.1016\/j.ajhg.2009.03.010.","journal-title":"Am J Hum Genet"},{"key":"3609_CR5","doi-asserted-by":"publisher","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","volume":"581","author":"RL Collins","year":"2020","unstructured":"Collins RL, Brand H, Karczewski KJ, et al.A structural variation reference for medical and population genetics. Nature. 2020; 581:444\u201351. https:\/\/doi.org\/10.1038\/s41586-020-2287-8.","journal-title":"Nature"},{"issue":"20","key":"3609_CR6","doi-asserted-by":"publisher","first-page":"2843","DOI":"10.1093\/bioinformatics\/btu356","volume":"30","author":"H Li","year":"2014","unstructured":"Li H. Toward better understanding of artifacts in variant calling from high-coverage samples. Bioinformatics. 2014; 30(20):2843\u201351. https:\/\/doi.org\/10.1093\/bioinformatics\/btu356.","journal-title":"Bioinformatics"},{"issue":"7","key":"3609_CR7","doi-asserted-by":"publisher","first-page":"1003153","DOI":"10.1371\/journal.pcbi.1003153","volume":"9","author":"U Paila","year":"2013","unstructured":"Paila U, Chapman BA, Kirchner R, Quinlan AR. GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations. PLoS Comput Biol. 2013; 9(7):1003153. https:\/\/doi.org\/10.1371\/journal.pcbi.1003153.","journal-title":"PLoS Comput Biol"},{"key":"3609_CR8","doi-asserted-by":"crossref","unstructured":"Fokkema IFAC, Taschner PEM, Schaafsma GCP, Celli J, Laros JFJ, den Dunnen JT. LOVD v.2.0: The next generation in gene variant databases. Hum Mutat. 2011; 32(5):557\u2013563. https:\/\/doi.org\/10.1002\/humu.21438\nhttps:\/\/doi.org\/{_eprint:}\nhttps:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/humu.21438 https:\/\/doi.org\/{_eprint:} https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/humu.21438.","DOI":"10.1002\/humu.21438"},{"issue":"11","key":"3609_CR9","doi-asserted-by":"publisher","first-page":"1253","DOI":"10.1038\/ejhg.2017.130","volume":"25","author":"A Ameur","year":"2017","unstructured":"Ameur A, Dahlberg J, Olason P, Vezzi F, Karlsson R, Martin M, Viklund J, K\u00e4h\u00e4ri AK, Lundin P, Che H, Thutkawkorapin J, Eisfeldt J, Lampa S, Dahlberg M, Hagberg J, Jareborg N, Liljedahl U, Jonasson I, Johansson \u00c5, Feuk L, Lundeberg J, Syv\u00e4nen AC, Lundin S, Nilsson D, Nystedt B, Magnusson PK, Gyllensten U. SweGen: A whole-genome data resource of genetic variability in a cross-section of the Swedish population. Eur J Hum Genet. 2017; 25(11):1253\u201360. https:\/\/doi.org\/10.1038\/ejhg.2017.130.","journal-title":"Eur J Hum Genet"},{"issue":"15","key":"3609_CR10","doi-asserted-by":"publisher","first-page":"2156","DOI":"10.1093\/bioinformatics\/btr330","volume":"27","author":"P Danecek","year":"2011","unstructured":"Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R. The variant call format and VCFtools. Bioinformatics. 2011; 27(15):2156\u20138. https:\/\/doi.org\/10.1093\/bioinformatics\/btr330.","journal-title":"Bioinformatics"},{"issue":"13","key":"3609_CR11","doi-asserted-by":"publisher","first-page":"2202","DOI":"10.1093\/bioinformatics\/btv112","volume":"31","author":"A Tan","year":"2015","unstructured":"Tan A, Abecasis GR, Kang HM. Unified representation of genetic variants. Bioinformatics. 2015; 31(13):2202\u201304. https:\/\/doi.org\/10.1093\/bioinformatics\/btv112.","journal-title":"Bioinformatics"},{"issue":"1","key":"3609_CR12","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1186\/s13059-016-0973-5","volume":"17","author":"BS Pedersen","year":"2016","unstructured":"Pedersen BS, Layer RM, Quinlan AR. Vcfanno: Fast, flexible annotation of genetic variants. Genome Biol. 2016; 17(1):118. https:\/\/doi.org\/10.1186\/s13059-016-0973-5.","journal-title":"Genome Biol"},{"issue":"6","key":"3609_CR13","doi-asserted-by":"publisher","first-page":"974","DOI":"10.1101\/gr.114876.110","volume":"21","author":"A Abyzov","year":"2011","unstructured":"Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011; 21(6):974\u201384. https:\/\/doi.org\/10.1101\/gr.114876.110.","journal-title":"Genome Res"},{"issue":"4","key":"3609_CR14","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1016\/j.cub.2009.11.054","volume":"20","author":"P Soares","year":"2010","unstructured":"Soares P, Achilli A, Semino O, Davies W, Macaulay V, Bandelt H-J, Torroni A, Richards MB. The Archaeogenetics of Europe. Curr Biol. 2010; 20(4):174\u201383. https:\/\/doi.org\/10.1016\/j.cub.2009.11.054.","journal-title":"Curr Biol"},{"issue":"3","key":"3609_CR15","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1111\/j.1469-1809.2007.00429.x","volume":"72","author":"T Lappalainen","year":"2008","unstructured":"Lappalainen T, Laitinen V, Salmela E, Andersen P, Huoponen K, Savontaus M-L, Lahermo P. Migration Waves to the Baltic Sea Region. Ann Hum Genet. 2008; 72(3):337\u201348. https:\/\/doi.org\/10.1111\/j.1469-1809.2007.00429.x.","journal-title":"Ann Hum Genet"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03609-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-03609-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03609-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,1]],"date-time":"2021-07-01T00:07:51Z","timestamp":1625098071000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-03609-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,1]]},"references-count":15,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3609"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-03609-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,1]]},"assertion":[{"value":"17 January 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 July 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"273"}}