{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,15]],"date-time":"2026-03-15T05:06:45Z","timestamp":1773551205964,"version":"3.50.1"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1201,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Protein domains are subunits that can fold and evolve independently. Identification of domain boundary locations is often the first step in protein folding and function annotations. Most of the current methods deduce domain boundaries by sequence-based analysis, which has low accuracy. There is no efficient method for predicting discontinuous domains that consist of segments from separated sequence regions. As template-based methods are most efficient for protein 3D structure modeling, combining multiple threading alignment information should increase the accuracy and reliability of computational domain predictions.<\/jats:p>\n               <jats:p>Result: We developed a new protein domain predictor, ThreaDom, which deduces domain boundary locations based on multiple threading alignments. The core of the method development is the derivation of a domain conservation score that combines information from template domain structures and terminal and internal alignment gaps. Tested on 630 non-redundant sequences, without using homologous templates, ThreaDom generates correct single- and multi-domain classifications in 81% of cases, where 78% have the domain linker assigned within \u00b120 residues. In a second test on 486 proteins with discontinuous domains, ThreaDom achieves an average precision 84% and recall 65% in domain boundary prediction. Finally, ThreaDom was examined on 56 targets from CASP8 and had a domain overlap rate 73, 87 and 85% with the target for Free Modeling, Hard multiple-domain and discontinuous domain proteins, respectively, which are significantly higher than most domain predictors in the CASP8. Similar results were achieved on the targets from the most recently CASP9 and CASP10 experiments.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/zhanglab.ccmb.med.umich.edu\/ThreaDom\/.<\/jats:p>\n               <jats:p>Contact: \u00a0zhng@umich.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt209","type":"journal-article","created":{"date-parts":[[2013,6,27]],"date-time":"2013-06-27T05:33:26Z","timestamp":1372311206000},"page":"i247-i256","source":"Crossref","is-referenced-by-count":74,"title":["ThreaDom: extracting protein domain boundary information from multiple threading alignments"],"prefix":"10.1093","volume":"29","author":[{"given":"Zhidong","family":"Xue","sequence":"first","affiliation":[]},{"given":"Dong","family":"Xu","sequence":"additional","affiliation":[]},{"given":"Yan","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Yang","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2013,6,19]]},"reference":[{"key":"2023062614315755100_btt209-B1","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1093\/nar\/gkn944","article-title":"FIEFDom: a transparent domain boundary recognition system using a fuzzy mean operator","volume":"37","author":"Bondugula","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023062614315755100_btt209-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10618-005-0023-5","article-title":"DOMpro: protein domain prediction using profiles, secondary structure, relative solvent accessibility, and recursive neural networks","volume":"13","author":"Cheng","year":"2006","journal-title":"Data Min. Knowl. Discov."},{"key":"2023062614315755100_btt209-B3","doi-asserted-by":"crossref","first-page":"1522","DOI":"10.1016\/j.str.2010.08.017","article-title":"Detailed analysis of function divergence in a large and diverse domain superfamily: toward a refined protocol of function classification","volume":"18","author":"Dessailly","year":"2010","journal-title":"Structure"},{"key":"2023062614315755100_btt209-B4","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1016\/j.jmb.2005.05.037","article-title":"Armadillo: domain boundary prediction by amino acid composition","volume":"350","author":"Dumontier","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B5","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1093\/bioinformatics\/btq700","article-title":"DROP: an SVM domain linker predictor trained with optimal features selected by random forest","volume":"27","author":"Ebina","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B6","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1186\/1471-2105-12-43","article-title":"DoBo: protein domain boundary prediction by integrating evolutionary signals and machine learning","volume":"12","author":"Eickholt","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023062614315755100_btt209-B7","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1002\/prot.22554","article-title":"Assessment of domain boundary predictions and the prediction of intramolecular contacts in CASP8","volume":"77","author":"Ezkurdia","year":"2009","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B8","doi-asserted-by":"crossref","first-page":"D211","DOI":"10.1093\/nar\/gkp985","article-title":"The Pfam protein families database","volume":"38","author":"Finn","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023062614315755100_btt209-B9","doi-asserted-by":"crossref","first-page":"839","DOI":"10.1006\/jmbi.2001.5387","article-title":"SnapDRAGON: a method to delineate protein structural domains from sequence data","volume":"316","author":"George","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B10","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1016\/S0022-2836(03)00269-9","article-title":"Exhaustive enumeration of protein domain families","volume":"328","author":"Heger","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B11","doi-asserted-by":"crossref","first-page":"D188","DOI":"10.1093\/nar\/gki096","article-title":"ADDA: a domain database with global coverage of the protein universe","volume":"33","author":"Heger","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023062614315755100_btt209-B12","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1093\/bioinformatics\/14.10.846","article-title":"Hidden Markov models for detecting remote protein homologies","volume":"14","author":"Karplus","year":"1998","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B13","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1002\/prot.20737","article-title":"Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM","volume":"61","author":"Kim","year":"2005","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B14","doi-asserted-by":"crossref","first-page":"678","DOI":"10.1002\/prot.20095","article-title":"CHOP proteins into structural domain-like fragments","volume":"55","author":"Liu","year":"2004","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B15","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1016\/S0022-2836(05)80134-2","article-title":"SCOP: a structural classification of proteins database for the investigation of sequences and structures","volume":"247","author":"Murzin","year":"1995","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B16","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","article-title":"CATH\u2014a hierarchic classification of protein domain structures","volume":"5","author":"Orengo","year":"1997","journal-title":"Structure"},{"key":"2023062614315755100_btt209-B17","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1186\/1471-2105-7-277","article-title":"EVEREST: automatic identification and classification of protein domains in all protein sequences","volume":"7","author":"Portugaly","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023062614315755100_btt209-B18","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1016\/j.jmb.2006.05.035","article-title":"Structural diversity of domain superfamilies in the CATH database","volume":"360","author":"Reeves","year":"2006","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B19","doi-asserted-by":"crossref","first-page":"951","DOI":"10.1093\/bioinformatics\/bti125","article-title":"Protein homology detection by HMM-HMM comparison","volume":"21","author":"S\u00f6ding","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B20","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1093\/bib\/3.3.246","article-title":"ProDom: automated clustering of homologous domains","volume":"3","author":"Servant","year":"2002","journal-title":"Brief. Bioinform."},{"key":"2023062614315755100_btt209-B21","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1002\/prot.20442","article-title":"PPRODO: prediction of protein domain boundaries using neural networks","volume":"59","author":"Sim","year":"2005","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B22","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1093\/bioinformatics\/btg031","article-title":"DomCut: prediction of inter-domain linker regions in amino acid sequences","volume":"19","author":"Suyama","year":"2003","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B23","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1016\/j.jmb.2011.10.045","article-title":"Intra-chain 3D segment swapping spawns the evolution of new multidomain protein architectures","volume":"415","author":"Szilagyi","year":"2012","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B24","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1002\/prot.20736","article-title":"Evaluation of domain prediction in CASP6","volume":"61","author":"Tai","year":"2005","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B25","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1002\/prot.21675","article-title":"Assessment of predictions submitted for the CASP7 domain prediction category","volume":"69","author":"Tress","year":"2007","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B26","first-page":"1589","article-title":"PISCES: a protein sequence culling server","volume":"19","author":"Wang","year":"2003","journal-title":"Biopolymers"},{"key":"2023062614315755100_btt209-B27","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1093\/bioinformatics\/16.7.613","article-title":"Domain size distributions can predict domain boundaries","volume":"16","author":"Wheelan","year":"2000","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B28","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1186\/1741-7007-5-17","article-title":"Ab initio modeling of small proteins by iterative TASSER simulations","volume":"5","author":"Wu","year":"2007","journal-title":"BMC Biol."},{"key":"2023062614315755100_btt209-B29","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1002\/prot.21945","article-title":"MUSTER: improving protein sequence profile-profile alignments by using multiple sources of structure information","volume":"72","author":"Wu","year":"2008","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B30","doi-asserted-by":"crossref","first-page":"3375","DOI":"10.1093\/nar\/gkm251","article-title":"LOMETS: a local meta-threading-server for protein structure prediction","volume":"35","author":"Wu","year":"2007","journal-title":"Nucleic Acids. Res."},{"key":"2023062614315755100_btt209-B31","doi-asserted-by":"crossref","first-page":"1314","DOI":"10.1016\/j.jmb.2008.10.093","article-title":"OPUS-Dom: applying the folding-based method VECFOLD to determine protein domain boundaries","volume":"385","author":"Wu","year":"2009","journal-title":"J. Mol. Biol."},{"key":"2023062614315755100_btt209-B32","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1002\/1097-0134(20000815)40:3<343::AID-PROT10>3.0.CO;2-S","article-title":"Protein threading using PROSPECT: design and evaluation","volume":"40","author":"Xu","year":"2000","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B33","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1093\/bioinformatics\/16.12.1091","article-title":"Protein domain decomposition using a graph-theoretic approach","volume":"16","author":"Xu","year":"2000","journal-title":"Bioinformatics"},{"key":"2023062614315755100_btt209-B34","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1109\/TNB.2008.2000747","article-title":"DomNet: protein domain boundary prediction using enhanced general regression network and new profiles","volume":"7","author":"Yoo","year":"2008","journal-title":"IEEE Trans. Nanobiosci."},{"key":"2023062614315755100_btt209-B35","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1002\/prot.21702","article-title":"Template-based modeling and free modeling by I-TASSER in CASP7","volume":"69","author":"Zhang","year":"2007","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B36","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1016\/j.sbi.2008.02.004","article-title":"Progress and challenges in protein structure prediction","volume":"18","author":"Zhang","year":"2008","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023062614315755100_btt209-B37","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1002\/prot.22588","article-title":"I-TASSER: Fully automated protein structure prediction in CASP8","volume":"77","author":"Zhang","year":"2009","journal-title":"Proteins"},{"key":"2023062614315755100_btt209-B38","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1002\/prot.20308","article-title":"Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments","volume":"58","author":"Zhou","year":"2005","journal-title":"Proteins"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/13\/i247\/50703613\/bioinformatics_29_13_i247.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/13\/i247\/50703613\/bioinformatics_29_13_i247.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,26]],"date-time":"2023-06-26T15:32:09Z","timestamp":1687793529000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/13\/i247\/187571"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,6,19]]},"references-count":38,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2013,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt209","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2013,7]]},"published":{"date-parts":[[2013,6,19]]}}}