{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T01:39:33Z","timestamp":1775612373069,"version":"3.50.1"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2023,12,1]],"date-time":"2023-12-01T00:00:00Z","timestamp":1701388800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000928","name":"Welch Foundation","doi-asserted-by":"publisher","award":["I-2074\u201320210327"],"award-info":[{"award-number":["I-2074\u201320210327"]}],"id":[{"id":"10.13039\/100000928","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Protein sequences can be broadly categorized into two classes: those which adopt stable secondary structure and fold into a domain (i.e. globular proteins), and those that do not. The sequences belonging to this latter class are conformationally heterogeneous and are described as being intrinsically disordered. Decades of investigation into the structure and function of globular proteins has resulted in a suite of computational tools that enable their sub-classification by domain type, an approach that has revolutionized how we understand and predict protein functionality. Conversely, it is unknown if sequences of disordered protein regions are subject to broadly generalizable organizational principles that would enable their sub-classification.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here, we report the development of a statistical approach that quantifies linear variance in amino acid composition across a sequence. With multiple examples, we provide evidence that intrinsically disordered regions are organized into statistically non-random modules of unique compositional bias. Modularity is observed for both low and high-complexity sequences and, in some cases, we find that modules are organized in repetitive patterns. These data demonstrate that disordered sequences are non-randomly organized into modular architectures and motivate future experiments to comprehensively classify module types and to determine the degree to which modules constitute functionally separable units analogous to the domains of globular proteins.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The source code, documentation, and data to reproduce all figures are freely available at https:\/\/github.com\/MWPlabUTSW\/Chi-Score-Analysis.git. The analysis is also available as a Google Colab Notebook (https:\/\/colab.research.google.com\/github\/MWPlabUTSW\/Chi-Score-Analysis\/blob\/main\/ChiScore_Analysis.ipynb).<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad732","type":"journal-article","created":{"date-parts":[[2023,12,1]],"date-time":"2023-12-01T15:32:03Z","timestamp":1701444723000},"source":"Crossref","is-referenced-by-count":14,"title":["Protein intrinsically disordered regions have a non-random, modular architecture"],"prefix":"10.1093","volume":"39","author":[{"given":"Brendan S","family":"McConnell","sequence":"first","affiliation":[{"name":"Department of Biophysics, , University of Texas Southwestern Medical Center, Dallas, TX 75235, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7571-0010","authenticated-orcid":false,"given":"Matthew W","family":"Parker","sequence":"additional","affiliation":[{"name":"Department of Biophysics, , University of Texas Southwestern Medical Center, Dallas, TX 75235, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,12,1]]},"reference":[{"key":"2023121404114968000_btad732-B1","doi-asserted-by":"crossref","first-page":"1185","DOI":"10.1042\/BST20160172","article-title":"The contribution of intrinsically disordered regions to protein function, cellular complexity, and human disease","volume":"44","author":"Babu","year":"2016","journal-title":"Biochem Soc Trans"},{"key":"2023121404114968000_btad732-B2","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/j.sbi.2011.02.005","article-title":"Evolution and disorder","volume":"21","author":"Brown","year":"2011","journal-title":"Curr Opin Struct Biol"},{"key":"2023121404114968000_btad732-B3","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1007\/s00239-001-2309-6","article-title":"Evolutionary rate heterogeneity in proteins with long disordered regions","volume":"55","author":"Brown","year":"2002","journal-title":"J Mol Evol"},{"key":"2023121404114968000_btad732-B4","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1038\/s41594-019-0248-4","article-title":"Cryo-EM structures of four polymorphic TDP-43 amyloid cores","volume":"26","author":"Cao","year":"2019","journal-title":"Nat Struct Mol Biol"},{"key":"2023121404114968000_btad732-B5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nargab\/lqab048","article-title":"LCD-Composer: an intuitive, composition-centric method enabling the identification and detailed functional mapping of low-complexity domains","volume":"3","author":"Cascarina","year":"2021","journal-title":"NAR Genomics Bioinforma"},{"key":"2023121404114968000_btad732-B6","doi-asserted-by":"crossref","first-page":"167373","DOI":"10.1016\/j.jmb.2021.167373","article-title":"Uncovering non-random binary patterns within sequences of intrinsically disordered proteins","volume":"434","author":"Cohan","year":"2022","journal-title":"J Mol Biol"},{"key":"2023121404114968000_btad732-B7","doi-asserted-by":"crossref","first-page":"1537","DOI":"10.1016\/j.str.2016.07.007","article-title":"ALS mutations disrupt phase separation mediated by \u03b1-helical structure in the TDP-43 low-complexity C-terminal domain","volume":"24","author":"Conicella","year":"2016","journal-title":"Structure"},{"key":"2023121404114968000_btad732-B8","doi-asserted-by":"crossref","first-page":"13392","DOI":"10.1073\/pnas.1304749110","article-title":"Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues","volume":"110","author":"Das","year":"2013","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023121404114968000_btad732-B9","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1039\/C1MB05231D","article-title":"Attributes of short linear motifs","volume":"8","author":"Davey","year":"2012","journal-title":"Mol Biosyst"},{"key":"2023121404114968000_btad732-B10","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1016\/S1093-3263(00)00138-8","article-title":"Intrinsically disordered protein","volume":"19","author":"Dunker","year":"2001","journal-title":"J Mol Graph Model"},{"key":"2023121404114968000_btad732-B11","doi-asserted-by":"crossref","first-page":"4312","DOI":"10.1016\/j.bpj.2021.08.039","article-title":"Metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure","volume":"120","author":"Emenecker","year":"2021","journal-title":"Biophys J"},{"key":"2023121404114968000_btad732-B12","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.bpj.2016.11.3200","article-title":"CIDER: resources to analyze sequence-ensemble relationships of intrinsically disordered proteins","volume":"112","author":"Holehouse","year":"2017","journal-title":"Biophys J"},{"key":"2023121404114968000_btad732-B13","doi-asserted-by":"crossref","first-page":"19614","DOI":"10.1074\/jbc.M113.463828","article-title":"Structural transformation of the amyloidogenic core region of TDP-43 protein initiates its aggregation and cytoplasmic inclusion","volume":"288","author":"Jiang","year":"2013","journal-title":"J Biol Chem"},{"key":"2023121404114968000_btad732-B14","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2023121404114968000_btad732-B15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3390\/biom10121636","article-title":"Comparative assessment of intrinsic disorder predictions with a focus on protein and nucleic acid-binding proteins","volume":"10","author":"Katuwawala","year":"2020","journal-title":"Biomolecules"},{"key":"2023121404114968000_btad732-B16","doi-asserted-by":"crossref","first-page":"e2109668118","DOI":"10.1073\/pnas.2109668118","article-title":"Distinct roles of hnRNPH1 low-complexity domains in splicing and transcription","volume":"118","author":"Kim","year":"2021","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023121404114968000_btad732-B17","author":"King","year":"2022"},{"key":"2023121404114968000_btad732-B18","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.ymeth.2014.10.031","article-title":"The Hitchhiker\u2019s guide to Hi-C analysis: practical guidelines","volume":"72","author":"Lajoie","year":"2015","journal-title":"Methods"},{"key":"2023121404114968000_btad732-B19","doi-asserted-by":"crossref","first-page":"e77058","DOI":"10.7554\/eLife.77058","article-title":"A unified view of low complexity region (LCRs) across species","volume":"11","author":"Lee","year":"2022","journal-title":"eLife"},{"key":"2023121404114968000_btad732-B20","doi-asserted-by":"crossref","first-page":"28727","DOI":"10.1073\/pnas.2012216117","article-title":"Redox-mediated regulation of an evolutionarily conserved cross-\u03b2 structure formed by the TDP43 low complexity domain","volume":"117","author":"Lin","year":"2020","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023121404114968000_btad732-B21","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1016\/j.cell.2022.12.013","article-title":"Functional partitioning of transcriptional regulators by patterned charge blocks","volume":"186","author":"Lyons","year":"2023","journal-title":"Cell"},{"key":"2023121404114968000_btad732-B22","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1002\/pro.3754","article-title":"IDDomainSpotter: compositional bias reveals domains in long disordered protein regions\u2014insights from transcription factors","volume":"29","author":"Millard","year":"2020","journal-title":"Protein Sci"},{"key":"2023121404114968000_btad732-B23","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/BF02703118","article-title":"Protein sequences as random fractals","volume":"18","author":"Mitra","year":"1993","journal-title":"J Biosci"},{"key":"2023121404114968000_btad732-B24","doi-asserted-by":"crossref","first-page":"3262","DOI":"10.1039\/c2mb25202c","article-title":"Chemical composition is maintained in poorly conserved intrinsically disordered regions and suggests a means for their classification","volume":"8","author":"Moesa","year":"2012","journal-title":"Mol Biosyst"},{"key":"2023121404114968000_btad732-B25","doi-asserted-by":"crossref","first-page":"923","DOI":"10.1038\/s41594-021-00677-4","article-title":"Molecular interactions contributing to FUS SYGQ LC-RGG phase separation and co-partitioning with RNA polymerase II heptads","volume":"28","author":"Murthy","year":"2021","journal-title":"Nat Struct Mol Biol"},{"key":"2023121404114968000_btad732-B26","doi-asserted-by":"crossref","first-page":"5533","DOI":"10.1093\/bioinformatics\/btaa1045","article-title":"MobiDB-lite 3.0: fast consensus annotation of intrinsic disorder flavors in proteins","volume":"36","author":"Necci","year":"2020","journal-title":"Bioinformatics"},{"key":"2023121404114968000_btad732-B27","doi-asserted-by":"crossref","first-page":"2164","DOI":"10.1002\/pro.3041","article-title":"Large-scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe","volume":"25","author":"Necci","year":"2016","journal-title":"Protein Sci"},{"key":"2023121404114968000_btad732-B28","doi-asserted-by":"crossref","first-page":"e48562","DOI":"10.7554\/eLife.48562","article-title":"A new class of disordered elements controls DNA replication through initiator self-assembly","volume":"8","author":"Parker","year":"2019","journal-title":"eLife"},{"key":"2023121404114968000_btad732-B29","first-page":"164","volume-title":"Pac Symp Biocomput","author":"Patil","year":"2012"},{"key":"2023121404114968000_btad732-B30","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1016\/S0959-440X(94)90108-2","article-title":"Introns and exons","volume":"4","author":"Patthy","year":"1994","journal-title":"Curr Opin Struct Biol"},{"key":"2023121404114968000_btad732-B31","doi-asserted-by":"crossref","first-page":"7206","DOI":"10.1128\/MCB.24.16.7206-7213.2004","article-title":"Scrambled prion domains form prions and amyloid","volume":"24","author":"Ross","year":"2004","journal-title":"Mol Cell Biol"},{"key":"2023121404114968000_btad732-B32","doi-asserted-by":"crossref","first-page":"12825","DOI":"10.1073\/pnas.0506136102","article-title":"Primary sequence independence for prion formation","volume":"102","author":"Ross","year":"2005","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023121404114968000_btad732-B33","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1038\/333210a0","article-title":"Acid blobs and negative noodles","volume":"333","author":"Sigler","year":"1988","journal-title":"Nature"},{"key":"2023121404114968000_btad732-B34","doi-asserted-by":"crossref","first-page":"D266","DOI":"10.1093\/nar\/gkaa1079","article-title":"CATH: increased structural coverage of functional space","volume":"49","author":"Sillitoe","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023121404114968000_btad732-B35","doi-asserted-by":"crossref","first-page":"5188","DOI":"10.2741\/3594","article-title":"Intrinsic disorder in proteins associated with neurodegenerative diseases","volume":"14","author":"Uversky","year":"2009","journal-title":"Front Biosci (Landmark Ed)"},{"key":"2023121404114968000_btad732-B36","doi-asserted-by":"crossref","first-page":"10","DOI":"10.3389\/fphy.2019.00010","article-title":"Intrinsically disordered proteins and their \u2018mysterious\u2019 (meta)physics","volume":"7","author":"Uversky","year":"2019","journal-title":"Front Phys"},{"key":"2023121404114968000_btad732-B37","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1002\/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7","article-title":"Why are\u202f\u2018natively unfolded\u2019 proteins unstructured under physiologic conditions?","volume":"41","author":"Uversky","year":"2000","journal-title":"Proteins"},{"key":"2023121404114968000_btad732-B38","doi-asserted-by":"crossref","first-page":"688","DOI":"10.1016\/j.cell.2018.06.006","article-title":"A molecular grammar governing the driving forces for phase separation of prion-like RNA binding proteins","volume":"174","author":"Wang","year":"2018","journal-title":"Cell"},{"key":"2023121404114968000_btad732-B39","doi-asserted-by":"crossref","first-page":"348","DOI":"10.1016\/j.febslet.2004.09.036","article-title":"Reduced amino acid alphabet is sufficient to accurately recognize intrinsically disordered protein","volume":"576","author":"Weathers","year":"2004","journal-title":"FEBS Lett"},{"key":"2023121404114968000_btad732-B40","first-page":"697","volume-title":"Proc Natl Acad Sci USA","author":"Wetlaufer","year":"1973"},{"key":"2023121404114968000_btad732-B41","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1007\/BF00163155","article-title":"The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences","volume":"38","author":"White","year":"1994","journal-title":"J Mol Evol"},{"key":"2023121404114968000_btad732-B42","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1007\/BF02407307","article-title":"The evolution of proteins from random amino acid sequences. I. Evidence from the lengthwise distribution of amino acids in modern protein sequences","volume":"36","author":"White","year":"1993","journal-title":"J Mol Evol"},{"key":"2023121404114968000_btad732-B43","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/0097-8485(93)85006-X","article-title":"Statistics of local complexity in amino acid sequences and sequence databases","volume":"17","author":"Wootton","year":"1993","journal-title":"Comput Chem"},{"key":"2023121404114968000_btad732-B44","doi-asserted-by":"crossref","first-page":"e46883","DOI":"10.7554\/eLife.46883","article-title":"Proteome-wide signatures of function in highly diverged intrinsically disordered regions","volume":"8","author":"Zarin","year":"2019","journal-title":"eLife"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad732\/53970815\/btad732.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/12\/btad732\/54429299\/btad732.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/12\/btad732\/54429299\/btad732.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,13]],"date-time":"2023-12-13T23:14:43Z","timestamp":1702509283000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad732\/7457484"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,12,1]]},"references-count":44,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2023,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad732","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.05.10.539862","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,12,1]]},"published":{"date-parts":[[2023,12,1]]},"article-number":"btad732"}}