{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,29]],"date-time":"2025-11-29T07:49:52Z","timestamp":1764402592130},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Describing domain architecture is a critical step in the functional characterization of proteins. However, some orphan domains do not match any profile stored in dedicated domain databases and are thereby difficult to analyze.<\/jats:p>\n               <jats:p>Results: We present here an original novel approach, called TREMOLO-HCA, for the analysis of orphan domain sequences and inspired from our experience in the use of Hydrophobic Cluster Analysis (HCA). Hidden relationships between protein sequences can be more easily identified from the PSI-BLAST results, using information on domain architecture, HCA plots and the conservation degree of amino acids that may participate in the protein core. This can lead to reveal remote relationships with known families of domains, as illustrated here with the identification of a hidden Tudor tandem in the human BAHCC1 protein and a hidden ET domain in the Saccharomyces cerevisiae Taf14p and human AF9 proteins. The results obtained in such a way are consistent with those provided by HHPRED, based on pairwise comparisons of HHMs. Our approach can, however, be applied even in absence of domain profiles or known 3D structures for the identification of novel families of domains. It can also be used in a reverse way for refining domain profiles, by starting from known protein domain families and identifying highly divergent members, hitherto considered as orphan.<\/jats:p>\n               <jats:p>Availability: We provide a possible integration of this approach in an open TREMOLO-HCA package, which is fully implemented in python v2.7 and is available on request. Instructions are available at http:\/\/www.impmc.upmc.fr\/\u223ccallebau\/tremolohca.html.<\/jats:p>\n               <jats:p>Contact: \u00a0isabelle.callebaut@impmc.upmc.fr<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary Data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt271","type":"journal-article","created":{"date-parts":[[2013,5,16]],"date-time":"2013-05-16T01:00:44Z","timestamp":1368666044000},"page":"1726-1733","source":"Crossref","is-referenced-by-count":31,"title":["Identification of hidden relationships from the coupling of Hydrophobic Cluster Analysis and Domain Architecture information"],"prefix":"10.1093","volume":"29","author":[{"given":"Guilhem","family":"Faure","sequence":"first","affiliation":[{"name":"IMPMC, UMR7590, CNRS, Universit\u00e9 Pierre et Marie Curie-Paris6, Paris Cedex 05, France"}]},{"given":"Isabelle","family":"Callebaut","sequence":"additional","affiliation":[{"name":"IMPMC, UMR7590, CNRS, Universit\u00e9 Pierre et Marie Curie-Paris6, Paris Cedex 05, France"}]}],"member":"286","published-online":{"date-parts":[[2013,5,14]]},"reference":[{"key":"2023012810454453500_btt271-B1","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1016\/j.febslet.2012.02.012","article-title":"Crystal structures of the Tudor domains of human PHF20 reveal novel structural variations on the Royal Family of proteins","volume":"586","author":"Adams-Cioaba","year":"2012","journal-title":"FEBS Lett."},{"key":"2023012810454453500_btt271-B2","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1139\/O08-129","article-title":"Structure and function of histone methylation binding proteins","volume":"87","author":"Adams-Cioaba","year":"2009","journal-title":"Biochem. Cell Biol."},{"key":"2023012810454453500_btt271-B3","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1006\/jmbi.1999.2653","article-title":"Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches","volume":"287","author":"Aravind","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023012810454453500_btt271-B4","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1126\/science.1210915","article-title":"Structural basis of silencing: Sir3 BAH domain in complex with a nucleosome at 3.0 \u00c5 resolution","volume":"334","author":"Armache","year":"2011","journal-title":"Science"},{"key":"2023012810454453500_btt271-B5","doi-asserted-by":"crossref","first-page":"1453","DOI":"10.1126\/science.1210557","article-title":"Mutations in CIC and FUBP1 contribute to human oligodendroglioma","volume":"333","author":"Bettegowda","year":"2011","journal-title":"Science"},{"key":"2023012810454453500_btt271-B9","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/S0014-5793(96)01312-9","article-title":"From BRCA1 to RAP1: a widespread BRCT module closely associated with DNA repair","volume":"400","author":"Callebaut","year":"1997","journal-title":"FEBS Lett."},{"key":"2023012810454453500_btt271-B10","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1042\/bj3210125","article-title":"The human EBNA-2 coactivator p100: multidomain organization and relationship to the staphylococcal nuclease fold and to the tudor protein involved in Drosophila melanogaster development","volume":"321","author":"Callebaut","year":"1997","journal-title":"Biochem. J."},{"key":"2023012810454453500_btt271-B14","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1007\/s000180050082","article-title":"Deciphering protein sequence information through hydrophobic cluster analysis (HCA): current status and perspectives","volume":"53","author":"Callebaut","year":"1997","journal-title":"Cell Mol. Life Sci."},{"key":"2023012810454453500_btt271-B15","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/S0014-5793(99)00132-5","article-title":"The BAH (bromo-adjacent homology) domain: a link between DNA methylation, replication and transcriptional regulation","volume":"446","author":"Callebaut","year":"1999","journal-title":"FEBS Lett."},{"key":"2023012810454453500_btt271-B17","doi-asserted-by":"crossref","first-page":"3592","DOI":"10.1093\/nar\/gkf470","article-title":"Metallo-beta-lactamase fold within nucleic acids processing enzymes: the beta-CASP family","volume":"30","author":"Callebaut","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"2023012810454453500_btt271-B18","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1186\/1471-2164-6-100","article-title":"Prediction of the general transcription factors associated with RNA polymerase II in Plasmodium falciparum: conserved features and differences relative to other eukaryotes","volume":"6","author":"Callebaut","year":"2005","journal-title":"BMC Genomics"},{"key":"2023012810454453500_btt271-B19","doi-asserted-by":"crossref","first-page":"13857","DOI":"10.1074\/jbc.C500473200","article-title":"Cernunnos interacts with the XRCC4 x DNA-ligase IV complex and is homologous to the yeast nonhomologous end-joining factor Nej1","volume":"281","author":"Callebaut","year":"2006","journal-title":"J. Biol. Chem."},{"key":"2023012810454453500_btt271-B21","doi-asserted-by":"crossref","first-page":"916","DOI":"10.1038\/nsmb.2353","article-title":"PHF20 is an effector protein of p53 double lysine methylation that stabilizes and activates p53","volume":"19","author":"Cui","year":"2012","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012810454453500_btt271-B22","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1038\/nature10509","article-title":"Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia","volume":"478","author":"Dawson","year":"2011","journal-title":"Nature"},{"key":"2023012810454453500_btt271-B23","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/j.jmb.2005.02.007","article-title":"Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions","volume":"348","author":"Ekman","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012810454453500_btt271-B24","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/1472-6807-7-2","article-title":"A generalized analysis of hydrophobic and loop clusters within globular protein sequences","volume":"7","author":"Eudes","year":"2007","journal-title":"BMC Struct. Biol."},{"key":"2023012810454453500_btt271-B25","doi-asserted-by":"crossref","first-page":"W29","DOI":"10.1093\/nar\/gkr367","article-title":"HMMER web server: interactive sequence similarity searching","volume":"39","author":"Finn","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012810454453500_btt271-B26","first-page":"D1008","article-title":"You bet-cha: a novel family of transcriptional regulators","volume":"6","author":"Florence","year":"2001","journal-title":"Front Biosci."},{"key":"2023012810454453500_btt271-B27","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/0014-5793(87)80439-8","article-title":"Hydrophobic cluster analysis: an efficient new way to compare and analyse amino acid sequences","volume":"224","author":"Gaboriaud","year":"1987","journal-title":"FEBS Lett."},{"key":"2023012810454453500_btt271-B28","doi-asserted-by":"crossref","first-page":"1744","DOI":"10.1038\/sj.onc.1203506","article-title":"ENL, the MLL fusion partner in t(11;19), binds to the c-Abl interactor protein 1 (ABI1) that is fused to MLL in t(10;11)+","volume":"19","author":"Garc\u00eda-Cu\u00e9llar","year":"2000","journal-title":"Oncogene"},{"key":"2023012810454453500_btt271-B29","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/sj.onc.1204108","article-title":"The ENL moiety of the childhood leukemia-associated MLL-ENL oncoprotein recruits human Polycomb 3","volume":"20","author":"Garc\u00eda-Cu\u00e9llar","year":"2001","journal-title":"Oncogene"},{"key":"2023012810454453500_btt271-B30","doi-asserted-by":"crossref","first-page":"E636","DOI":"10.1073\/pnas.1107107108","article-title":"Human Polymerase-Associated Factor complex (PAFc) connects the Super Elongation Complex (SEC) to RNA polymerase II on chromatin","volume":"108","author":"He","year":"2011","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012810454453500_btt271-B31","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1002\/prot.10355","article-title":"Non-intertwined binary patterns of hydrophobic\/nonhydrophobic amino acids are considerably better markers of regular secondary structures than nonconstrained patterns","volume":"51","author":"Hennetin","year":"2003","journal-title":"Proteins"},{"key":"2023012810454453500_btt271-B33","doi-asserted-by":"crossref","first-page":"19384","DOI":"10.1073\/pnas.1006436107","article-title":"Transcriptional silencing functions of the yeast protein Orc1\/Sir3 subfunctionalized after gene duplication","volume":"107","author":"Hickman","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012810454453500_btt271-B34","doi-asserted-by":"crossref","first-page":"8489","DOI":"10.1073\/pnas.0503525102","article-title":"Structural basis of the Sir1-origin recognition complex interaction in transcriptional silencing","volume":"102","author":"Hou","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012810454453500_btt271-B35","doi-asserted-by":"crossref","first-page":"1383","DOI":"10.1242\/jcs.092965","article-title":"The Capicua repressor\u2014a general sensor of RTK signaling in development and disease","volume":"125","author":"Jim\u00e9nez","year":"2012","journal-title":"J. Cell Sci."},{"key":"2023012810454453500_btt271-B36","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1038\/nature10956","article-title":"The BAH domain of ORC1 links H4K20me2 to DNA replication licensing and Meier-Gorlin syndrome","volume":"484","author":"Kuo","year":"2012","journal-title":"Nature"},{"key":"2023012810454453500_btt271-B38","doi-asserted-by":"crossref","first-page":"6086","DOI":"10.1128\/MCB.23.17.6086-6102.2003","article-title":"Yaf9, a novel NuA4 histone acetyltransferase subunit, is required for the cellular response to spindle stress in yeast","volume":"23","author":"Le Masson","year":"2003","journal-title":"Mol. Cell. Biol."},{"key":"2023012810454453500_btt271-B39","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/j.str.2012.11.011","article-title":"Leukemia fusion target AF9 is an intrinsically disordered transcriptional regulator that recruits multiple partners via coupled folding and binding","volume":"21","author":"Leach","year":"2013","journal-title":"Structure"},{"key":"2023012810454453500_btt271-B41","doi-asserted-by":"crossref","first-page":"2174","DOI":"10.1110\/ps.037580.108","article-title":"Solution structure of the extraterminal domain of the bromodomain-containing protein BRD4","volume":"17","author":"Lin","year":"2008","journal-title":"Protein Sci."},{"key":"2023012810454453500_btt271-B42","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1038\/nrm3417","article-title":"The super elongation complex (SEC) family in transcriptional control","volume":"13","author":"Luo","year":"2012","journal-title":"Nat. Rev. Mol. Cell. Biol."},{"key":"2023012810454453500_btt271-B43","doi-asserted-by":"crossref","first-page":"D348","DOI":"10.1093\/nar\/gks1243","article-title":"CDD: conserved domains and protein three-dimensional structure","volume":"41","author":"Marchler-Bauer","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023012810454453500_btt271-B44","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1146\/annurev-pathol-011811-132434","article-title":"The pathogenesis of mixed-lineage leukemia","volume":"7","author":"Muntean","year":"2012","journal-title":"Annu. Rev. Pathol."},{"key":"2023012810454453500_btt271-B45","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1002\/(SICI)1097-0134(19981115)33:3<329::AID-PROT3>3.0.CO;2-E","article-title":"Populations of hydrophobic amino acids within protein globular domains: identification of conserved \u201ctopohydrophobic\u201d positions","volume":"33","author":"Poupon","year":"1998","journal-title":"Proteins"},{"key":"2023012810454453500_btt271-B46","first-page":"173","article-title":"HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment","volume":"25","author":"Remmert","year":"2011","journal-title":"Nat. Methods"},{"key":"2023012810454453500_btt271-B47","doi-asserted-by":"crossref","first-page":"e77","DOI":"10.1371\/journal.pbio.0050077","article-title":"The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific","volume":"5","author":"Rusch","year":"2007","journal-title":"PLoS Biol."},{"key":"2023012810454453500_btt271-B48","doi-asserted-by":"crossref","first-page":"573","DOI":"10.4161\/epi.5.7.12856","article-title":"Reading chromatin. Insights from yeast into YEATS domain structure and function","volume":"5","author":"Schulze","year":"2010","journal-title":"Epigenetics"},{"key":"2023012810454453500_btt271-B49","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1016\/j.jmb.2004.06.073","article-title":"Structural biology sheds light on the puzzle of genomic ORFans","volume":"342","author":"Siew","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012810454453500_btt271-B50","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1128\/MCB.18.1.122","article-title":"The oncogenic capacity of HRX-ENL requires the transcriptional transactivation activity of ENL and the DNA binding motifs of HRX","volume":"18","author":"Slany","year":"1998","journal-title":"Mol. Cell. Biol.."},{"key":"2023012810454453500_btt271-B51","doi-asserted-by":"crossref","first-page":"W244","DOI":"10.1093\/nar\/gki408","article-title":"The HHpred interactive server for protein homology detection and structure prediction","volume":"33","author":"S\u00f6ding","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012810454453500_btt271-B52","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1016\/j.ccr.2011.09.008","article-title":"CBX8, a polycomb group protein, is essential for MLL-AF9-induced leukemogenesis","volume":"20","author":"Tan","year":"2011","journal-title":"Cancer Cell"},{"key":"2023012810454453500_btt271-B53","doi-asserted-by":"crossref","first-page":"1025","DOI":"10.1038\/nsmb1338","article-title":"How chromatin-binding modules interpret histone modifications: lessons from professional pocket pickers","volume":"14","author":"Taverna","year":"2007","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012810454453500_btt271-B55","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1006\/jmbi.2000.3786","article-title":"Estimating the number of protein folds and families from complete genome data","volume":"299","author":"Wolf","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012810454453500_btt271-B56","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1093\/protein\/5.7.629","article-title":"Detection of secondary structure elements in proteins by hydrophobic cluster analysis","volume":"5","author":"Woodcock","year":"1992","journal-title":"Protein Eng."},{"key":"2023012810454453500_btt271-B57","doi-asserted-by":"crossref","first-page":"488","DOI":"10.3109\/10409238.2010.512001","article-title":"Keeping it in the family: diverse histone recognition by conserved structural folds","volume":"45","author":"Yap","year":"2010","journal-title":"Crit. Rev. Biochem. Mol. Biol."},{"key":"2023012810454453500_btt271-B59","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1038\/sj.emboj.7600622","article-title":"BLAP75, an essential component of Bloom's syndrome protein complexes that maintain genome integrity","volume":"24","author":"Yin","year":"2005","journal-title":"EMBO J."},{"key":"2023012810454453500_btt271-B60","doi-asserted-by":"crossref","first-page":"5525","DOI":"10.1038\/sj.onc.1208699","article-title":"The eleven-nineteen-leukemia protein ENL connects nuclear MLL fusion partners with chromatin","volume":"24","author":"Zeisig","year":"2005","journal-title":"Oncogene"},{"key":"2023012810454453500_btt271-B61","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1042\/BJ20110004","article-title":"Solution structure of the Taf14 YEATS domain and its roles in cell growth of Saccharomyces cerevisiae","volume":"436","author":"Zhang","year":"2011","journal-title":"Biochem. J."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/14\/1726\/48889675\/bioinformatics_29_14_1726.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/14\/1726\/48889675\/bioinformatics_29_14_1726.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,28]],"date-time":"2023-01-28T12:30:24Z","timestamp":1674909024000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/14\/1726\/229686"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,5,14]]},"references-count":48,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2013,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt271","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,7,15]]},"published":{"date-parts":[[2013,5,14]]}}}