{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T00:58:41Z","timestamp":1771030721328,"version":"3.50.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2427,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements.<\/jats:p>\n               <jats:p>Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions.<\/jats:p>\n               <jats:p>Availability: SoDA2 is available online at https:\/\/hippocrates.duhs.duke.edu\/soda. Simulated sequences are available upon request.<\/jats:p>\n               <jats:p>Contact: \u00a0kepler@duke.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq056","type":"journal-article","created":{"date-parts":[[2010,2,11]],"date-time":"2010-02-11T03:43:13Z","timestamp":1265859793000},"page":"867-872","source":"Crossref","is-referenced-by-count":67,"title":["SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements"],"prefix":"10.1093","volume":"26","author":[{"given":"Supriya","family":"Munshaw","sequence":"first","affiliation":[{"name":"1 Center for Computational Immunology, 2 Computational Biology and Bioinformatics Program, Duke University, P.O. Box 90090 and 3 Department of Biostatistics and Bioinformatics, 2424 Erwin Road, Suite 1103, Durham, NC 27705, USA"},{"name":"1 Center for Computational Immunology, 2 Computational Biology and Bioinformatics Program, Duke University, P.O. Box 90090 and 3 Department of Biostatistics and Bioinformatics, 2424 Erwin Road, Suite 1103, Durham, NC 27705, USA"}]},{"given":"Thomas B.","family":"Kepler","sequence":"additional","affiliation":[{"name":"1 Center for Computational Immunology, 2 Computational Biology and Bioinformatics Program, Duke University, P.O. Box 90090 and 3 Department of Biostatistics and Bioinformatics, 2424 Erwin Road, Suite 1103, Durham, NC 27705, USA"},{"name":"1 Center for Computational Immunology, 2 Computational Biology and Bioinformatics Program, Duke University, P.O. Box 90090 and 3 Department of Biostatistics and Bioinformatics, 2424 Erwin Road, Suite 1103, Durham, NC 27705, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,2,9]]},"reference":[{"key":"2023012508032484200_B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023012508032484200_B2","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1016\/0006-291X(83)91413-4","article-title":"Synthesis of compositionally unique DNA by terminal deoxynucleotidyl transferase","volume":"111","author":"Basu","year":"1983","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023012508032484200_B3","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1007\/BF03401757","article-title":"Frequent N addition and clonal relatedness among immunoglobulin lambda light chains expressed in rheumatoid arthritis synovia and PBL, and the influence of V lambda gene segment utilization on CDR3 length","volume":"4","author":"Bridges","year":"1998","journal-title":"Mol. Med."},{"key":"2023012508032484200_B4","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/PL00006530","article-title":"Enhanced evolvability in immunoglobulin V genes under somatic hypermutation","volume":"49","author":"Cowell","year":"1999","journal-title":"J. Mol. Evol."},{"key":"2023012508032484200_B5","doi-asserted-by":"crossref","first-page":"752","DOI":"10.1038\/311752a0","article-title":"Insertion of N regions into heavy-chain genes is correlated with the expression of terminal deoxytransferase in B-cells","volume":"311","author":"Desiderio","year":"1984","journal-title":"Nature"},{"key":"2023012508032484200_B6","first-page":"80","article-title":"Pairwise alignment using HMMs","volume-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids.","author":"Durbin","year":"1998"},{"key":"2023012508032484200_B7","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1101\/gr.8.3.186","article-title":"Basecalling of automated sequencer traces using phred. II. Error probabilities","volume":"8","author":"Ewing","year":"1998","journal-title":"Genome Res."},{"key":"2023012508032484200_B8","doi-asserted-by":"crossref","first-page":"S12","DOI":"10.1186\/1471-2105-6-S4-S12","article-title":"A new decoding algorithm for hidden Markov models improves the prediction of the topology of all-beta membrane proteins","volume":"6","author":"Fariselli","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012508032484200_B9","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1093\/hmg\/4.6.983","article-title":"Organization of the human immunoglobulin lambda light-chain locus on chromosome 22q11.2","volume":"4","author":"Frippiat","year":"1995","journal-title":"Human Mol. Genet."},{"key":"2023012508032484200_B10","doi-asserted-by":"crossref","first-page":"1580","DOI":"10.1093\/bioinformatics\/btm147","article-title":"iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences","volume":"23","author":"Gaeta","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508032484200_B11","doi-asserted-by":"crossref","first-page":"W435","DOI":"10.1093\/nar\/gkh412","article-title":"IMGT\/V-QUEST, an integrated software program for immunoglobulin and T cell receptor V\u2013J and V\u2013D\u2013J rearrangement analysis","volume":"32","author":"Guidicelli","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012508032484200_B12","doi-asserted-by":"crossref","first-page":"657","DOI":"10.1006\/jmbi.2001.4662","article-title":"Yet another numbering scheme for immunoglobulin variable domains: an automatic modeling and analysis tool","volume":"309","author":"Honegger","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012508032484200_B13","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1186\/1471-2172-5-19","article-title":"Exonuclease activity and P nucleotide addition in the generation of the expressed immunoglobulin repertoire","volume":"5","author":"Jackson","year":"2004","journal-title":"BMC Immunol."},{"key":"2023012508032484200_B14","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/29.1.207","article-title":"IMGT, the international ImMunoGeneTics database","volume":"29","author":"LeFranc","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012508032484200_B15","doi-asserted-by":"crossref","first-page":"9667","DOI":"10.1093\/nar\/15.23.9667","article-title":"Physical map of the immunoglobulin K locus and its implications for the mechanisms of VK-JK rearrangement","volume":"15","author":"Lorenz","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012508032484200_B16","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1017\/CBO9780511811135.008","article-title":"Hidden Markov Models","volume-title":"Methods for Computational Gene Prediction.","author":"Majoros","year":"2007"},{"key":"2023012508032484200_B17","doi-asserted-by":"crossref","first-page":"i379","DOI":"10.1093\/bioinformatics\/bth945","article-title":"IMGT\/JunctionAnalysis: the first tool for the analysis of the immunoglobulin and T cell receptor complex V-J and V-D-J JUNCTIONs","volume":"20","author":"Monod","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508032484200_B18","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1016\/S0092-8674(00)00078-7","article-title":"Class switch recombination and hypermutation require activation-induced cytidine deaminase (AID), a potential RNA editing enzyme","volume":"102","author":"Muramatsu","year":"2000","journal-title":"Cell"},{"key":"2023012508032484200_B19","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden markov-models and selected applications in speech recognition","volume":"77","author":"Rabiner","year":"1989","journal-title":"Proc. IEEE"},{"key":"2023012508032484200_B20","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1038\/286676a0","article-title":"Two types of somatic recombination are necessary for the generation of complete immunoglobulin heavy-chain genes","volume":"286","author":"Sakano","year":"1980","journal-title":"Nature"},{"key":"2023012508032484200_B21","doi-asserted-by":"crossref","first-page":"2642","DOI":"10.4049\/jimmunol.156.7.2642","article-title":"Di- and trinucleotide target preferences of somatic mutagenesis in normal and autoreactive b cells","volume":"156","author":"Smith","year":"1996","journal-title":"J. Immunol."},{"key":"2023012508032484200_B22","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"2023012508032484200_B23","doi-asserted-by":"crossref","first-page":"6790","DOI":"10.4049\/jimmunol.172.11.6790","article-title":"Characterization of the human Ig heavy chain antigen binding complementarity determining region 3 using a newly developed software algorithm, JOINSOLVER","volume":"172","author":"Souto-Carneiro","year":"2004","journal-title":"J. Immunol."},{"key":"2023012508032484200_B24","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/302575a0","article-title":"Somatic generation of antibody diversity","volume":"302","author":"Tonegawa","year":"1983","journal-title":"Nature"},{"key":"2023012508032484200_B25","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1093\/bioinformatics\/btk004","article-title":"SoDA: implementation of a 3D alignment algorithm for inference of antigen receptor recombinations","volume":"22","author":"Volpe","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508032484200_B26","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1172\/JCI20255","article-title":"Human immunoglobulin selection associated with class switch and possible tolerogenic origins for C delta class-switched B cells","volume":"113","author":"Zheng","year":"2004","journal-title":"J. Clin. Invest."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/7\/867\/48854973\/bioinformatics_26_7_867.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/7\/867\/48854973\/bioinformatics_26_7_867.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:06:21Z","timestamp":1674633981000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/7\/867\/212530"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,2,9]]},"references-count":26,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2010,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq056","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,4,1]]},"published":{"date-parts":[[2010,2,9]]}}}