{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T03:03:07Z","timestamp":1771470187188,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2024,3,15]],"date-time":"2024-03-15T00:00:00Z","timestamp":1710460800000},"content-version":"vor","delay-in-days":14,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004189","name":"Max Planck Society","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004189","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Local alignments of query sequences in large databases represent a core part of metagenomic studies and facilitate homology search. Following the development of NCBI Blast, many applications aimed to provide faster and equally sensitive local alignment frameworks. Most applications focus on protein alignments, while only few also facilitate DNA-based searches. None of the established programs allow searching DNA sequences from bisulfite sequencing experiments commonly used for DNA methylation profiling, for which specific alignment strategies need to be implemented.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here, we introduce Lambda3, a new version of the local alignment application Lambda. Lambda3 is the first solution that enables the search of protein, nucleotide as well as bisulfite-converted nucleotide query sequences. Its protein mode achieves comparable performance to that of the highly optimized protein alignment application Diamond, while the nucleotide mode consistently outperforms established local nucleotide aligners. Combined, Lambda3 presents a universal local alignment framework that enables fast and sensitive homology searches for a wide range of use-cases.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Lambda3 is free and open-source software publicly available at https:\/\/github.com\/seqan\/lambda\/.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae097","type":"journal-article","created":{"date-parts":[[2024,3,13]],"date-time":"2024-03-13T19:53:42Z","timestamp":1710359622000},"source":"Crossref","is-referenced-by-count":9,"title":["Lambda3: homology search for protein, nucleotide, and bisulfite-converted sequences"],"prefix":"10.1093","volume":"40","author":[{"given":"Hannes","family":"Hauswedell","sequence":"first","affiliation":[{"name":"deCODE genetics\/Amgen Inc. , Reykjavik, Iceland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4783-3814","authenticated-orcid":false,"given":"Sara","family":"Hetzel","sequence":"additional","affiliation":[{"name":"Department of Genome Regulation, Max Planck Institute for Molecular Genetics , Berlin 14195, Germany"}]},{"given":"Simon G","family":"Gottlieb","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin , Berlin 14195, Germany"},{"name":"Institute for Bio- and Geosciences, Forschungszentrum J\u00fclich GmbH , J\u00fclich 52428, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0723-4980","authenticated-orcid":false,"given":"Helene","family":"Kretzmer","sequence":"additional","affiliation":[{"name":"Department of Genome Regulation, Max Planck Institute for Molecular Genetics , Berlin 14195, Germany"}]},{"given":"Alexander","family":"Meissner","sequence":"additional","affiliation":[{"name":"Department of Genome Regulation, Max Planck Institute for Molecular Genetics , Berlin 14195, Germany"},{"name":"Department of Biology, Chemistry and Pharmacy, Freie Universit\u00e4t Berlin , Berlin 14195, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3078-8129","authenticated-orcid":false,"given":"Knut","family":"Reinert","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin , Berlin 14195, Germany"},{"name":"Efficient Algorithms for Omics Data Group, Max Planck Institute for Molecular Genetics , Berlin 14195, Germany"}]}],"member":"286","published-online":{"date-parts":[[2024,3,14]]},"reference":[{"key":"2024032107584484100_btae097-B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol"},{"key":"2024032107584484100_btae097-B2","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1038\/s41586-018-0386-6","article-title":"Structure and function of the global topsoil microbiome","volume":"560","author":"Bahram","year":"2018","journal-title":"Nature"},{"key":"2024032107584484100_btae097-B3","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1038\/s41559-019-0810-9","article-title":"Diversity of cytosine methylation across the fungal tree of life","volume":"3","author":"Bewick","year":"2019","journal-title":"Nat Ecol Evol"},{"key":"2024032107584484100_btae097-B4","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1038\/s41592-021-01101-x","article-title":"Sensitive protein alignments at tree-of-life scale using DIAMOND","volume":"18","author":"Buchfink","year":"2021","journal-title":"Nat Methods"},{"key":"2024032107584484100_btae097-B5","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nmeth.3176","article-title":"Fast and sensitive protein alignment using DIAMOND","volume":"12","author":"Buchfink","year":"2015","journal-title":"Nat Methods"},{"key":"2024032107584484100_btae097-B6","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2024032107584484100_btae097-B7","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1038\/nature06745","article-title":"Shotgun bisulphite sequencing of the arabidopsis genome reveals DNA methylation patterning","volume":"452","author":"Cokus","year":"2008","journal-title":"Nature"},{"key":"2024032107584484100_btae097-B8","doi-asserted-by":"crossref","first-page":"i766","DOI":"10.1093\/bioinformatics\/bty567","article-title":"DREAM-Yara: an exact read mapper for very large databases with short update time","volume":"34","author":"Dadi","year":"2018","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B9","first-page":"390","author":"Ferragina","year":"2000"},{"key":"2024032107584484100_btae097-B10","doi-asserted-by":"crossref","first-page":"1827","DOI":"10.1073\/pnas.89.5.1827","article-title":"A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands","volume":"89","author":"Frommer","year":"1992","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024032107584484100_btae097-B11","author":"Gottlieb","year":"2023"},{"key":"2024032107584484100_btae097-B12","author":"Grant","year":"2023"},{"key":"2024032107584484100_btae097-B13","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-030-90990-1","volume-title":"Sequence analysis and modern C++, volume 33 of computational biology","author":"Hauswedell","year":"2022"},{"key":"2024032107584484100_btae097-B14","author":"Hauswedell","year":"2023"},{"key":"2024032107584484100_btae097-B15","doi-asserted-by":"crossref","first-page":"i349","DOI":"10.1093\/bioinformatics\/btu439","article-title":"Lambda: the local aligner for massive biological data","volume":"30","author":"Hauswedell","year":"2014","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B16","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1038\/nature11209","article-title":"A framework for human microbiome research","volume":"486","author":"Human Microbiome Project Consortium","year":"2012","journal-title":"Nature"},{"key":"2024032107584484100_btae097-B17","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1093\/bioinformatics\/btt254","article-title":"A poor man\u2019s BLASTX\u2013high-throughput metagenomic protein database search using PAUDA","volume":"30","author":"Huson","year":"2014","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B18","doi-asserted-by":"crossref","first-page":"2264","DOI":"10.1073\/pnas.87.6.2264","article-title":"Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes","volume":"87","author":"Karlin","year":"1990","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024032107584484100_btae097-B19","doi-asserted-by":"crossref","first-page":"9623","DOI":"10.1073\/pnas.1707009114","article-title":"Numerous uncharacterized and highly divergent microbes which colonize humans are revealed by circulating cell-free DNA","volume":"114","author":"Kowarsky","year":"2017","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024032107584484100_btae097-B20","doi-asserted-by":"crossref","first-page":"1571","DOI":"10.1093\/bioinformatics\/btr167","article-title":"Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications","volume":"27","author":"Krueger","year":"2011","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B21","doi-asserted-by":"crossref","first-page":"e43","DOI":"10.1093\/nar\/gkt1325","article-title":"Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing","volume":"42","author":"Kunde-Ramamoorthy","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2024032107584484100_btae097-B22","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1186\/s13148-015-0135-8","article-title":"Whole-genome bisulfite sequencing of cell-free DNA identifies signature associated with metastatic breast cancer","volume":"7","author":"Legendre","year":"2015","journal-title":"Clin Epigenetics"},{"key":"2024032107584484100_btae097-B23","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1093\/protein\/gzg044","article-title":"Reduction of protein sequence complexity by residue grouping","volume":"16","author":"Li","year":"2003","journal-title":"Protein Eng"},{"key":"2024032107584484100_btae097-B24","doi-asserted-by":"crossref","first-page":"3503","DOI":"10.1016\/j.csbj.2022.07.001","article-title":"Research progress of reduced amino acid alphabets in protein analysis and prediction","volume":"20","author":"Liang","year":"2022","journal-title":"Comput Struct Biotechnol J"},{"key":"2024032107584484100_btae097-B25","doi-asserted-by":"crossref","first-page":"615821","DOI":"10.3389\/fonc.2021.615821","article-title":"Characterization of cell free plasma methyl-DNA from xenografted tumors to guide the selection of diagnostic markers for early-stage cancers","volume":"11","author":"Liu","year":"2021","journal-title":"Front Oncol"},{"key":"2024032107584484100_btae097-B26","author":"Mehringer","year":"2023"},{"key":"2024032107584484100_btae097-B27","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1038\/s41592-022-01431-4","article-title":"Critical assessment of metagenome interpretation: the second round of challenges","volume":"19","author":"Meyer","year":"2022","journal-title":"Nat Methods"},{"key":"2024032107584484100_btae097-B28","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1093\/protein\/13.3.149","article-title":"Simplified amino acid alphabets for protein fold recognition and implications for folding","volume":"13","author":"Murphy","year":"2000","journal-title":"Protein Eng"},{"key":"2024032107584484100_btae097-B29","doi-asserted-by":"crossref","first-page":"bbab021","DOI":"10.1093\/bib\/bbab021","article-title":"Comprehensive benchmarking of software for mapping whole genome bisulfite data: from read alignment to DNA methylation analysis","volume":"22","author":"Nunn","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024032107584484100_btae097-B30","doi-asserted-by":"crossref","first-page":"1698","DOI":"10.1093\/bioinformatics\/bts254","article-title":"Fast and sensitive mapping of bisulfite-treated sequencing data","volume":"28","author":"Otto","year":"2012","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B31","first-page":"3.1.1","article-title":"An introduction to sequence similarity (\u201chomology\u201d) searching","author":"Pearson","year":"2013","journal-title":"Curr Protoc Bioinformatics"},{"key":"2024032107584484100_btae097-B32","doi-asserted-by":"crossref","first-page":"3437","DOI":"10.1093\/bioinformatics\/bty380","article-title":"Generic accelerated sequence alignment in SeqAn using vectorization and multi-threading","volume":"34","author":"Rahn","year":"2018","journal-title":"Bioinformatics"},{"key":"2024032107584484100_btae097-B33","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1016\/j.jbiotec.2017.07.017","article-title":"The SeqAn C++ template library for efficient sequence analysis: a resource for programmers","volume":"261","author":"Reinert","year":"2017","journal-title":"J Biotechnol"},{"key":"2024032107584484100_btae097-B34","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1146\/annurev-genom-090413-025358","article-title":"Alignment of next-generation sequencing reads","volume":"16","author":"Reinert","year":"2015","journal-title":"Annu Rev Genomics Hum Genet"},{"key":"2024032107584484100_btae097-B35","doi-asserted-by":"crossref","first-page":"2994","DOI":"10.1093\/nar\/29.14.2994","article-title":"Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements","volume":"29","author":"Sch\u00e4ffer","year":"2001","journal-title":"Nucleic Acids Res"},{"key":"2024032107584484100_btae097-B36","doi-asserted-by":"crossref","first-page":"102782","DOI":"10.1016\/j.isci.2021.102782","article-title":"Raptor: a fast and space-efficient pre-filter for querying very large collections of nucleotide sequences","volume":"24","author":"Seiler","year":"2021","journal-title":"iScience"},{"key":"2024032107584484100_btae097-B37","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J Mol Biol"},{"key":"2024032107584484100_btae097-B38","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1038\/nrg1709","article-title":"Metagenomics: DNA sequencing of environmental samples","volume":"6","author":"Tringe","year":"2005","journal-title":"Nat Rev Genet"},{"key":"2024032107584484100_btae097-B39","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/s41559-017-0446-6","article-title":"Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico","volume":"2","author":"V\u00e5gene","year":"2018","journal-title":"Nat Ecol Evol"},{"key":"2024032107584484100_btae097-B40","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1038\/s41597-019-0117-3","article-title":"Metagenomics and transcriptomics data from human colorectal cancer","volume":"6","author":"Visnovska","year":"2019","journal-title":"Sci Data"},{"key":"2024032107584484100_btae097-B41","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1186\/1471-2105-12-159","article-title":"RAPSearch: a fast protein similarity search tool for short reads","volume":"12","author":"Ye","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2024032107584484100_btae097-B42","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1093\/bioinformatics\/bti070","article-title":"The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions","volume":"21","author":"Yu","year":"2005","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae097\/56975838\/btae097.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/3\/btae097\/57038893\/btae097.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/3\/btae097\/57038893\/btae097.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,21]],"date-time":"2024-03-21T08:54:26Z","timestamp":1711011266000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae097\/7629128"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,3,1]]},"references-count":42,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae097","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,3,1]]},"published":{"date-parts":[[2024,3,1]]},"article-number":"btae097"}}