{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T19:58:47Z","timestamp":1776369527395,"version":"3.51.2"},"reference-count":22,"publisher":"Oxford University Press (OUP)","license":[{"start":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T00:00:00Z","timestamp":1770854400000},"content-version":"vor","delay-in-days":42,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100008562","name":"University of Texas at Austin","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100008562","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100011597","name":"Penn State Cancer Institute","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100011597","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013209","name":"Hellenic Foundation for Research and Innovation","doi-asserted-by":"publisher","award":["23592\u2013EMISSION"],"award-info":[{"award-number":["23592\u2013EMISSION"]}],"id":[{"id":"10.13039\/501100013209","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100013209","name":"Hellenic Foundation for Research and Innovation","doi-asserted-by":"publisher","award":["28787-VIROMINE"],"award-info":[{"award-number":["28787-VIROMINE"]}],"id":[{"id":"10.13039\/501100013209","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Basic Research Financing Action","award":["16718-PRPFOR"],"award-info":[{"award-number":["16718-PRPFOR"]}]},{"name":"National Recovery and Resilience Plan","award":["TAEDR-0539180"],"award-info":[{"award-number":["TAEDR-0539180"]}]},{"name":"Cancer Research Institute Immuno-Informatics Postdoctoral Fellowship","award":["CR14925"],"award-info":[{"award-number":["CR14925"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The development of biomarkers for population screening, early cancer detection, monitoring, and recurrence surveillance offers substantial potential to improve patient outcomes and save lives. Nullomers are short k-mers that are absent from a human genome, and neomers are the subset of nullomers that emerge recurrently due to somatic mutations during cancer development. Here, we have developed neomerDB, a database that encompasses a catalogue of neomers across cancer types and organs. We examined 10\u2009000 whole exome sequencing and 2658 whole genome sequencing tumour-matched samples and identified the set of neomers associated with each cancer type and organ. We also analysed 76\u2009215 whole genomes and 730\u2009947 whole exomes of individuals from diverse ancestries, from which we removed nullomers and neomers that can arise due to germline variants in the population. Finally, we conducted a case study demonstrating that neomers can be utilized to detect glioblastoma from liquid biopsy samples (n\u00a0=\u00a038), utilizing cell-free DNA and cell-free RNA, achieving a Receiver Operating Characteristic - Area Under the Curve score of 0.98 and a precision-recall score of 0.99. neomerDB is a user-friendly database that enables advanced searches, provides interactive visualizations, and download options for neomer biomarkers. neomerDB is publicly available at https:\/\/neomerDB.com\/.<\/jats:p>","DOI":"10.1093\/database\/baag006","type":"journal-article","created":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T12:40:54Z","timestamp":1769172054000},"source":"Crossref","is-referenced-by-count":0,"title":["neomerDB: a comprehensive database of neomer biomarkers in cancer"],"prefix":"10.1093","volume":"2026","author":[{"given":"Kimonas","family":"Provatas","sequence":"first","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Candace S Y","family":"Chan","sequence":"additional","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ioannis","family":"Kerasiotis","sequence":"additional","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eleftherios","family":"Bochalis","sequence":"additional","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Akshatha","family":"Nayak","sequence":"additional","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Brad E","family":"Zacharia","sequence":"additional","affiliation":[{"name":"Penn State Milton S. Hershey Medical Center Department of Neurosurgery, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Georgios A","family":"Pavlopoulos","sequence":"additional","affiliation":[{"name":"Institute for Fundamental Biomedical Research, BSRC \u201cAlexander Fleming\u201d , 34 Fleming Street, 16672, Vari, Athens ,","place":["Greece"]},{"name":"Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) Department of Computational Biology, , Masdar City, Abu Dhabi ,","place":["United Arab Emirates"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Li","sequence":"additional","affiliation":[{"name":"Penn State College of Medicine Division of Hematology and Oncology, Department of Pediatrics, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3641-1488","authenticated-orcid":false,"given":"Ilias","family":"Georgakopoulos-Soares","sequence":"additional","affiliation":[{"name":"Division of Pharmacology and Toxicology, College of Pharmacy, The University of Texas at Austin, Dell Pediatric Research Institute , 1400 Barbara Jordan Boulevard, Austin, TX 78723 ,","place":["United States"]},{"name":"The Pennsylvania State University College of Medicine Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, , 500 University Drive, Hershey, PA 17033 ,","place":["United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2026,2,12]]},"reference":[{"key":"2026041615025934200_bib1","first-page":"12","article-title":"Cancer statistics, 2024","volume":"74","author":"Siegel","year":"2024","journal-title":"CA Cancer J Clin"},{"key":"2026041615025934200_bib2","doi-asserted-by":"publisher","first-page":"eaay9040","DOI":"10.1126\/science.aay9040","article-title":"Early detection of cancer","volume":"375","author":"Crosby","year":"2022","journal-title":"Science"},{"key":"2026041615025934200_bib3","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1186\/s12967-023-03960-8","article-title":"Liquid biopsies: the future of cancer early detection","volume":"21","author":"Connal","year":"2023","journal-title":"J Transl Med"},{"key":"2026041615025934200_bib4","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1146\/annurev-bioeng-110222-111259","article-title":"Liquid biopsy based on cell-free DNA and RNA","volume":"26","author":"Loy","year":"2024","journal-title":"Annu Rev Biomed Eng"},{"key":"2026041615025934200_bib5","doi-asserted-by":"publisher","first-page":"eaaw3616","DOI":"10.1126\/science.aaw3616","article-title":"Epigenetics, fragmentomics, and topology of cell-free DNA in liquid biopsies","volume":"372","author":"Lo","year":"2021","journal-title":"Science"},{"key":"2026041615025934200_bib6","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1038\/s41568-025-00795-x","article-title":"Genomic and fragmentomic landscapes of cell-free DNA for early cancer detection","volume":"25","author":"Bruhm","year":"2025","journal-title":"Nat Rev Cancer"},{"key":"2026041615025934200_bib7","first-page":"355","article-title":"Absent sequences: nullomers and primes","volume":"2007","author":"Hampikian","year":"2007","journal-title":"Pac Symp Biocomput"},{"key":"2026041615025934200_bib8","doi-asserted-by":"publisher","DOI":"10.1101\/2021.08.15.21261805","article-title":"Leveraging sequences missing from the human genome to diagnose cancer","author":"Georgakopoulos-Soares","year":"2023","journal-title":"medRxiv"},{"key":"2026041615025934200_bib9","doi-asserted-by":"publisher","first-page":"861","DOI":"10.1038\/s41417-024-00741-3","article-title":"Utilizing nullomers in cell-free RNA for early cancer detection","volume":"31","author":"Montgomery","year":"2024","journal-title":"Cancer Gene Ther"},{"key":"2026041615025934200_bib10","doi-asserted-by":"publisher","first-page":"103595","DOI":"10.1016\/j.esmoop.2024.103595","article-title":"Detecting pulmonary malignancy against benign nodules using noninvasive cell-free DNA fragmentomics assay","volume":"9","author":"Xu","year":"2024","journal-title":"ESMO Open"},{"key":"2026041615025934200_bib11","doi-asserted-by":"publisher","first-page":"btaf138","DOI":"10.1093\/bioinformatics\/btaf138","article-title":"Detecting known neoepitopes, gene fusions, transposable elements, and circular RNAs in cell-free RNA","volume":"41","author":"Mahajan","year":"2025","journal-title":"Bioinformatics"},{"key":"2026041615025934200_bib12","doi-asserted-by":"publisher","first-page":"qzaf028","DOI":"10.1093\/gpbjnl\/qzaf028","article-title":"Cell-free DNA fragmentomics assay to discriminate the malignancy of breast nodules and evaluate treatment response","volume":"23","author":"Liu","year":"2025","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2026041615025934200_bib13","doi-asserted-by":"publisher","first-page":"e70316","DOI":"10.1002\/cam4.70316","article-title":"Predicting disease progression in inoperable localized NSCLC patients using ctDNA machine learning model","volume":"13","author":"Wu","year":"2024","journal-title":"Cancer Med"},{"key":"2026041615025934200_bib14","doi-asserted-by":"publisher","first-page":"e140","DOI":"10.1016\/j.jtcvs.2024.04.026","article-title":"Cell-free DNA assay for malignancy classification of high-risk lung nodules","volume":"168","author":"Wang","year":"2024","journal-title":"J Thorac Cardiovasc Surg"},{"key":"2026041615025934200_bib15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-021-02459-z","article-title":"Absent from DNA and protein: genomic characterization of nullomers and nullpeptides across functional categories and evolution","volume":"22","author":"Georgakopoulos-Soares","year":"2021","journal-title":"Genome Biol"},{"key":"2026041615025934200_bib16","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1038\/s41586-020-1969-6","article-title":"Pan-cancer analysis of whole genomes","volume":"578","author":"ICGC\/TCGA Pan-Cancer Analysis of Whole Genomes Consortium","year":"2020","journal-title":"Nature"},{"key":"2026041615025934200_bib17","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1016\/j.cels.2018.03.002","article-title":"Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines","volume":"6","author":"Ellrott","year":"2018","journal-title":"Cell Syst"},{"key":"2026041615025934200_bib18","doi-asserted-by":"publisher","first-page":"2759","DOI":"10.1093\/bioinformatics\/btx304","article-title":"KMC 3: counting and manipulating k-mer statistics","volume":"33","author":"Kokot","year":"2017","journal-title":"Bioinformatics"},{"key":"2026041615025934200_bib19","first-page":"e970v1","article-title":"Efficient \u201cpythonic\u201d access to FASTA files using pyfaidx","volume":"3","author":"Shirley","year":"2015","journal-title":"PeerJ PrePrints"},{"key":"2026041615025934200_bib20","doi-asserted-by":"publisher","first-page":"434","DOI":"10.1038\/s41586-020-2308-7","article-title":"The mutational constraint spectrum quantified from variation in 141,456 humans","volume":"581","author":"Karczewski","year":"2020","journal-title":"Nature"},{"key":"2026041615025934200_bib21","doi-asserted-by":"crossref","first-page":"764","DOI":"10.1093\/bioinformatics\/btr011","article-title":"A fast, lock-free approach for efficient parallel counting of occurrences of k-mers","volume":"27","author":"Mar\u00e7ais","year":"2011","journal-title":"Bioinformatics"},{"key":"2026041615025934200_bib22","doi-asserted-by":"crossref","unstructured":"Chen Y, Chen L, Lun ATL \u00a0et al. \u00a0edgeR v4: powerful differential analysis of sequencing data with expanded functionality and improved support for small counts and larger datasets. Nucleic Acids Res. 2025;53:gkaf018. 10.1093\/nar\/gkaf018","DOI":"10.1093\/nar\/gkaf018"}],"container-title":["Database"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baag006\/8474793","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baag006\/8474793","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T19:03:03Z","timestamp":1776366183000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baag006\/8474793"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026]]},"references-count":22,"URL":"https:\/\/doi.org\/10.1093\/database\/baag006","relation":{},"ISSN":["1758-0463"],"issn-type":[{"value":"1758-0463","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026]]},"published":{"date-parts":[[2026]]},"article-number":"baag006"}}