{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T18:12:37Z","timestamp":1769883157692,"version":"3.49.0"},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Most proteins consist of multiple domains, independent structural and evolutionary units that are often reshuffled in genomic rearrangements to form new protein architectures. Template-based modeling methods can often detect homologous templates for individual domains, but templates that could be used to model the entire query protein are often not available.<\/jats:p>\n               <jats:p>Results: We have developed a fast docking algorithm ab initio domain assembly (AIDA) for assembling multi-domain protein structures, guided by the ab initio folding potential. This approach can be extended to discontinuous domains (i.e. domains with \u2018inserted\u2019 domains). When tested on experimentally solved structures of multi-domain proteins, the relative domain positions were accurately found among top 5000 models in 86% of cases. AIDA server can use domain assignments provided by the user or predict them from the provided sequence. The latter approach is particularly useful for automated protein structure prediction servers. The blind test consisting of 95 CASP10 targets shows that domain boundaries could be successfully determined for 97% of targets.<\/jats:p>\n               <jats:p>Availability and implementation: The AIDA package as well as the benchmark sets used here are available for download at http:\/\/ffas.burnham.org\/AIDA\/.<\/jats:p>\n               <jats:p>Contact: \u00a0adam@sanfordburnham.org<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv092","type":"journal-article","created":{"date-parts":[[2015,2,21]],"date-time":"2015-02-21T15:10:09Z","timestamp":1424531409000},"page":"2098-2105","source":"Crossref","is-referenced-by-count":61,"title":["AIDA: <i>ab initio<\/i> domain assembly for automated multi-domain protein structure prediction and domain\u2013domain interaction prediction"],"prefix":"10.1093","volume":"31","author":[{"given":"Dong","family":"Xu","sequence":"first","affiliation":[{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"}]},{"given":"Lukasz","family":"Jaroszewski","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"},{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"}]},{"given":"Zhanwen","family":"Li","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"}]},{"given":"Adam","family":"Godzik","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"},{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"},{"name":"1 Bioinformatics and Systems Biology Program, Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA, 2Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Dr. La Jolla, CA 92093-0446, USA and 3Center of Excellence in Genomic Medicine Research (CEGMR), King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia"}]}],"member":"286","published-online":{"date-parts":[[2015,2,19]]},"reference":[{"key":"2023020202134831100_btv092-B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023020202134831100_btv092-B2","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1023\/A:1026113408773","article-title":"Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination","volume":"4","author":"Apic","year":"2003","journal-title":"J. Struct. Funct. Genomics"},{"key":"2023020202134831100_btv092-B3","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1002\/prot.20557","article-title":"Docking to single-domain and multiple-domain proteins: old and new challenges","volume":"60","author":"Ben-Zeev","year":"2005","journal-title":"Proteins"},{"key":"2023020202134831100_btv092-B4","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023020202134831100_btv092-B5","doi-asserted-by":"crossref","first-page":"e114","DOI":"10.1371\/journal.pcbi.0020114","article-title":"Expansion of protein domain repeats","volume":"2","author":"Bjorklund","year":"2006","journal-title":"PLoS Comput. Biol."},{"key":"2023020202134831100_btv092-B6","doi-asserted-by":"crossref","first-page":"3390","DOI":"10.1093\/nar\/gki615","article-title":"Protein length in eukaryotic and prokaryotic proteomes","volume":"33","author":"Brocchieri","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023020202134831100_btv092-B7","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1186\/1471-2105-9-441","article-title":"Structural assembly of two-domain proteins by rigid-body docking","volume":"9","author":"Cheng","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020202134831100_btv092-B8","doi-asserted-by":"crossref","first-page":"1701","DOI":"10.1126\/science.1085371","article-title":"Evolution of the protein repertoire","volume":"300","author":"Chothia","year":"2003","journal-title":"Science"},{"key":"2023020202134831100_btv092-B9","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.jmb.2005.02.007","article-title":"Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions","volume":"348","author":"Ekman","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023020202134831100_btv092-B10","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1038\/nrm2144","article-title":"The folding and evolution of multidomain proteins","volume":"8","author":"Han","year":"2007","journal-title":"Nat. Rev. Mol. Cell Biol."},{"key":"2023020202134831100_btv092-B11","doi-asserted-by":"crossref","first-page":"S156","DOI":"10.1088\/1478-3975\/2\/4\/S10","article-title":"Combinatorial docking approach for structure prediction of large proteins and multi-molecular assemblies","volume":"2","author":"Inbar","year":"2005","journal-title":"Phys. Biol."},{"key":"2023020202134831100_btv092-B12","doi-asserted-by":"crossref","first-page":"W284","DOI":"10.1093\/nar\/gki418","article-title":"FFAS03: a server for profile\u2013profile sequence alignments","volume":"33","author":"Jaroszewski","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023020202134831100_btv092-B13","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1006\/jmbi.1999.3091","article-title":"Protein secondary structure prediction based on position-specific scoring matrices","volume":"292","author":"Jones","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023020202134831100_btv092-B14","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1007\/978-1-59745-243-4_3","article-title":"Inferring protein-protein interactions from multiple protein domain combinations","volume":"541","author":"Kanaan","year":"2009","journal-title":"Methods Mol. Biol."},{"key":"2023020202134831100_btv092-B15","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nprot.2009.2","article-title":"Protein structure prediction on the web: a case study using the Phyre server","volume":"4","author":"Kelley","year":"2009","journal-title":"Nat. Protoc."},{"key":"2023020202134831100_btv092-B16","doi-asserted-by":"crossref","first-page":"778","DOI":"10.1002\/prot.22488","article-title":"Improved prediction of protein side-chain conformations with SCWRL4","volume":"77","author":"Krivov","year":"2009","journal-title":"Proteins"},{"key":"2023020202134831100_btv092-B17","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1186\/1471-2105-7-310","article-title":"Docking protein domains in contact space","volume":"7","author":"Lise","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020202134831100_btv092-B18","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1016\/S0065-3233(08)60402-7","article-title":"Conformation of polypeptides and proteins","volume":"23","author":"Ramachandran","year":"1968","journal-title":"Adv. Protein Chem."},{"key":"2023020202134831100_btv092-B19","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1006\/jmbi.1993.1626","article-title":"Comparative protein modelling by satisfaction of spatial restraints","volume":"234","author":"Sali","year":"1993","journal-title":"J. Mol. Biol."},{"key":"2023020202134831100_btv092-B20","doi-asserted-by":"crossref","first-page":"1589","DOI":"10.1093\/bioinformatics\/btg224","article-title":"PISCES: a protein sequence culling server","volume":"19","author":"Wang","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020202134831100_btv092-B21","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1110\/ps.062270707","article-title":"Prediction of structures of multidomain proteins from structures of the individual domains","volume":"16","author":"Wollacott","year":"2007","journal-title":"Protein Sci."},{"key":"2023020202134831100_btv092-B22","doi-asserted-by":"crossref","first-page":"W308","DOI":"10.1093\/nar\/gku369","article-title":"AIDA: ab initio domain assembly server","volume":"42","author":"Xu","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"2023020202134831100_btv092-B23","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/btt578","article-title":"FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking","volume":"30","author":"Xu","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020202134831100_btv092-B24","doi-asserted-by":"crossref","first-page":"1715","DOI":"10.1002\/prot.24065","article-title":"Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field","volume":"80","author":"Xu","year":"2012","journal-title":"Proteins"},{"key":"2023020202134831100_btv092-B25","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1093\/bioinformatics\/btq066","article-title":"How significant is a protein structure similarity with TM-score = 0.5?","volume":"26","author":"Xu","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020202134831100_btv092-B26","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1093\/bioinformatics\/16.12.1091","article-title":"Protein domain decomposition using a graph-theoretic approach","volume":"16","author":"Xu","year":"2000","journal-title":"Bioinformatics"},{"key":"2023020202134831100_btv092-B27","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1002\/prot.20264","article-title":"Scoring function for automated assessment of protein structure template quality","volume":"57","author":"Zhang","year":"2004","journal-title":"Proteins"},{"key":"2023020202134831100_btv092-B28","doi-asserted-by":"crossref","first-page":"2714","DOI":"10.1110\/ps.0217002","article-title":"Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction","volume":"11","author":"Zhou","year":"2002","journal-title":"Protein Sci."},{"key":"2023020202134831100_btv092-B29","doi-asserted-by":"crossref","first-page":"e1002701","DOI":"10.1371\/journal.pcbi.1002701","article-title":"This D\u00e9j\u00e0 Vu feeling\u2014analysis of multidomain protein evolution in eukaryotic genomes","volume":"8","author":"Zmasek","year":"2012","journal-title":"PLoS Comput. Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/13\/2098\/49034617\/bioinformatics_31_13_2098.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/13\/2098\/49034617\/bioinformatics_31_13_2098.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T03:40:34Z","timestamp":1675309234000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/13\/2098\/195845"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,2,19]]},"references-count":29,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2015,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv092","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,7,1]]},"published":{"date-parts":[[2015,2,19]]}}}