{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T12:13:25Z","timestamp":1771071205644,"version":"3.50.1"},"reference-count":57,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2022,1,30]],"date-time":"2022-01-30T00:00:00Z","timestamp":1643500800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM127390"],"award-info":[{"award-number":["GM127390"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000928","name":"Welch Foundation","doi-asserted-by":"publisher","award":["I-1505"],"award-info":[{"award-number":["I-1505"]}],"id":[{"id":"10.13039\/100000928","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,28]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Intrinsically disordered proteins (IDPs) are involved in numerous processes crucial for living organisms. Bias in amino acid composition of these proteins determines their unique biophysical and functional features. Distinct intrinsically disordered regions (IDRs) with compositional bias play different important roles in various biological processes. IDRs enriched in particular amino acids in human proteome have not been described consistently.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We developed DisEnrich\u2014the database of human proteome IDRs that are significantly enriched in particular amino acids. Each human protein is described using Gene Ontology (GO) function terms, disorder prediction for the full-length sequence using three methods, enriched IDR composition and ranks of human proteins with similar enriched IDRs. Distribution analysis of enriched IDRs among broad functional categories revealed significant overrepresentation of R- and Y-enriched IDRs in metabolic and enzymatic activities and F-enriched IDRs in transport. About 75% of functional categories contain IDPs with IDRs significantly enriched in hydrophobic residues that are important for protein\u2013protein interactions.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The database is available at http:\/\/prodata.swmed.edu\/DisEnrichDB\/.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics Advances online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac051","type":"journal-article","created":{"date-parts":[[2022,1,25]],"date-time":"2022-01-25T20:14:51Z","timestamp":1643141691000},"page":"1870-1876","source":"Crossref","is-referenced-by-count":4,"title":["DisEnrich: database of enriched regions in human dark proteome"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7982-4242","authenticated-orcid":false,"given":"Kirill E","family":"Medvedev","sequence":"first","affiliation":[{"name":"Department of Biophysics, University of Texas Southwestern Medical Center , Dallas, TX 75390, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3505-9665","authenticated-orcid":false,"given":"Jimin","family":"Pei","sequence":"additional","affiliation":[{"name":"McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center , Dallas, TX 75390, USA"}]},{"given":"Nick V","family":"Grishin","sequence":"additional","affiliation":[{"name":"Department of Biophysics, University of Texas Southwestern Medical Center , Dallas, TX 75390, USA"},{"name":"Department of Biochemistry, University of Texas Southwestern Medical Center , Dallas, TX 75390, USA"},{"name":"Howard Hughes Medical Institute, University of Texas Southwestern Medical Center , Dallas, TX 75390, USA"}]}],"member":"286","published-online":{"date-parts":[[2022,1,30]]},"reference":[{"key":"2023020109004468200_btac051-B1","doi-asserted-by":"crossref","first-page":"a014159","DOI":"10.1101\/cshperspect.a014159","article-title":"Perspectives on mucus properties and formation\u2014lessons from the biochemical world","volume":"2","author":"Ambort","year":"2012","journal-title":"Cold Spring Harb. Perspect. Med"},{"key":"2023020109004468200_btac051-B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023020109004468200_btac051-B3","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1042\/BJ20100131","article-title":"Dual roles for MEF2A and MEF2D during human macrophage terminal differentiation and c-Jun expression","volume":"430","author":"Aude-Garcia","year":"2010","journal-title":"Biochem. J"},{"key":"2023020109004468200_btac051-B4","doi-asserted-by":"crossref","first-page":"956","DOI":"10.2174\/092986608785849164","article-title":"TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder","volume":"15","author":"Campen","year":"2008","journal-title":"Protein Pept. Lett"},{"key":"2023020109004468200_btac051-B5","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1126\/science.aac7420","article-title":"Crystal structure of the metazoan Nup62\u2022Nup58\u2022Nup54 nucleoporin complex","volume":"350","author":"Chug","year":"2015","journal-title":"Science"},{"key":"2023020109004468200_btac051-B6","doi-asserted-by":"crossref","first-page":"16764","DOI":"10.1073\/pnas.0608175103","article-title":"Fluorescence correlation spectroscopy shows that monomeric polyglutamine molecules form collapsed structures in aqueous solutions","volume":"103","author":"Crick","year":"2006","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020109004468200_btac051-B7","doi-asserted-by":"crossref","first-page":"13392","DOI":"10.1073\/pnas.1304749110","article-title":"Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues","volume":"110","author":"Das","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020109004468200_btac051-B8","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1007\/s12015-007-0004-8","article-title":"Signaling pathways in cancer and embryonic stem cells","volume":"3","author":"Dreesen","year":"2007","journal-title":"Stem Cell Rev"},{"key":"2023020109004468200_btac051-B9","doi-asserted-by":"crossref","first-page":"756","DOI":"10.1016\/j.sbi.2008.10.002","article-title":"Function and structure of inherently disordered proteins","volume":"18","author":"Dunker","year":"2008","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023020109004468200_btac051-B10","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1016\/j.semcdb.2014.09.025","article-title":"Intrinsically disordered proteins and multicellular organisms","volume":"37","author":"Dunker","year":"2015","journal-title":"Semin. Cell Dev. Biol"},{"key":"2023020109004468200_btac051-B11","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1002\/iub.1044","article-title":"Role of disorder in I\u03baB-NF\u03baB interaction","volume":"64","author":"Dyson","year":"2012","journal-title":"IUBMB Life"},{"key":"2023020109004468200_btac051-B12","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1038\/nrm1589","article-title":"Intrinsically unstructured proteins and their functions","volume":"6","author":"Dyson","year":"2005","journal-title":"Nat. Rev. Mol. Cell Biol"},{"key":"2023020109004468200_btac051-B13","first-page":"33","article-title":"Intrinsically disordered proteins in the nucleus of human cells","volume":"1","author":"Frege","year":"2015","journal-title":"Biochem. Biophys. Rep"},{"key":"2023020109004468200_btac051-B14","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.molcel.2011.05.013","article-title":"Opposing effects of glutamine and asparagine govern prion formation by intrinsically disordered proteins","volume":"43","author":"Halfmann","year":"2011","journal-title":"Mol. Cell"},{"key":"2023020109004468200_btac051-B15","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1016\/j.ijbiomac.2019.06.143","article-title":"Functions of intrinsic disorder in proteins involved in DNA demethylation during pre-implantation embryonic development","volume":"136","author":"Han","year":"2019","journal-title":"Int. J. Biol. Macromol"},{"key":"2023020109004468200_btac051-B16","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1093\/bioinformatics\/btw678","article-title":"Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks","volume":"33","author":"Hanson","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020109004468200_btac051-B17","doi-asserted-by":"crossref","first-page":"476","DOI":"10.1186\/s12859-017-1906-3","article-title":"fLPS: fast discovery of compositional biases for the protein universe","volume":"18","author":"Harrison","year":"2017","journal-title":"BMC Bioinform"},{"key":"2023020109004468200_btac051-B18","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1016\/S0022-2836(02)00969-5","article-title":"Intrinsic disorder in cell-signaling and cancer-associated proteins","volume":"323","author":"Iakoucheva","year":"2002","journal-title":"J. Mol. Biol"},{"key":"2023020109004468200_btac051-B19","doi-asserted-by":"crossref","first-page":"1273","DOI":"10.1016\/j.yexcr.2008.12.008","article-title":"PRMT1 mediated methylation of TAF15 is required for its positive gene regulatory function","volume":"315","author":"Jobert","year":"2009","journal-title":"Exp. Cell Res"},{"key":"2023020109004468200_btac051-B20","doi-asserted-by":"crossref","first-page":"2864","DOI":"10.1038\/s41598-018-21142-1","article-title":"Plastic roles of phenylalanine and tyrosine residues of TLS\/FUS in complex formation with the G-quadruplexes of telomeric DNA and TERRA","volume":"8","author":"Kondo","year":"2018","journal-title":"Sci. Rep"},{"key":"2023020109004468200_btac051-B21","doi-asserted-by":"crossref","first-page":"e1002641","DOI":"10.1371\/journal.pcbi.1002641","article-title":"Intrinsic disorder in the human spliceosomal proteome","volume":"8","author":"Korneta","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023020109004468200_btac051-B22","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1073\/pnas.44.2.98","article-title":"Application of a theory of enzyme specificity to protein synthesis","volume":"44","author":"Koshland","year":"1958","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020109004468200_btac051-B23","doi-asserted-by":"crossref","first-page":"2011","DOI":"10.1016\/j.jmb.2016.01.002","article-title":"The multiple faces of disordered nucleoporins","volume":"428","author":"Lemke","year":"2016","journal-title":"J. Mol. Biol"},{"key":"2023020109004468200_btac051-B24","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1002\/prot.20279","article-title":"Sequence patterns associated with disordered regions in proteins","volume":"58","author":"Lise","year":"2004","journal-title":"Proteins"},{"key":"2023020109004468200_btac051-B25","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1042\/BJ20121346","article-title":"Describing sequence-ensemble relationships for intrinsically disordered proteins","volume":"449","author":"Mao","year":"2013","journal-title":"Biochem. J"},{"key":"2023020109004468200_btac051-B26","doi-asserted-by":"crossref","first-page":"44153","DOI":"10.1074\/jbc.M306856200","article-title":"An evolutionarily conserved role for SRm160 in 3\u02b9-end processing that functions independently of exon junction complex formation","volume":"278","author":"McCracken","year":"2003","journal-title":"J. Biol. Chem"},{"key":"2023020109004468200_btac051-B27","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1002\/prot.24424","article-title":"Intrinsically disordered regions in autophagy proteins","volume":"82","author":"Mei","year":"2014","journal-title":"Proteins"},{"key":"2023020109004468200_btac051-B28","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1016\/j.jmb.2007.07.004","article-title":"Molecular principles of the interactions of disordered proteins","volume":"372","author":"M\u00e9sz\u00e1ros","year":"2007","journal-title":"J. Mol. Biol"},{"key":"2023020109004468200_btac051-B29","doi-asserted-by":"crossref","first-page":"W329","DOI":"10.1093\/nar\/gky384","article-title":"IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding","volume":"46","author":"M\u00e9sz\u00e1ros","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023020109004468200_btac051-B30","doi-asserted-by":"crossref","first-page":"1402","DOI":"10.1093\/bioinformatics\/btx015","article-title":"MobiDB-lite: fast and highly specific consensus prediction of intrinsic disorder in proteins","volume":"33","author":"Necci","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020109004468200_btac051-B31","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1146\/annurev-biochem-072711-164947","article-title":"Intrinsically disordered proteins and intrinsically disordered protein regions","volume":"83","author":"Oldfield","year":"2014","journal-title":"Annu. Rev. Biochem"},{"key":"2023020109004468200_btac051-B32","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1007\/s00018-014-1661-9","article-title":"Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life","volume":"72","author":"Peng","year":"2015","journal-title":"Cell. Mol. Life Sci"},{"key":"2023020109004468200_btac051-B33","doi-asserted-by":"crossref","first-page":"D471","DOI":"10.1093\/nar\/gkx1071","article-title":"MobiDB 3.0: more annotations for intrinsic disorder, conformational diversity and interactions in proteins","volume":"46","author":"Piovesan","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023020109004468200_btac051-B34","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1016\/j.pep.2005.11.027","article-title":"Gene synthesis, expression, purification, and characterization of human Jagged-1 intracellular region","volume":"47","author":"Popovic","year":"2006","journal-title":"Protein Expr. Purif"},{"key":"2023020109004468200_btac051-B35","volume-title":"R: A Language and Environment for Statistical Computing","year":"2013"},{"key":"2023020109004468200_btac051-B36","doi-asserted-by":"crossref","first-page":"780","DOI":"10.1016\/j.bbagrm.2016.03.006","article-title":"Sox2\/Oct4: a delicately balanced partnership in pluripotent stem cells and embryogenesis","volume":"1859","author":"Rizzino","year":"2016","journal-title":"Biochim. Biophys. Acta"},{"key":"2023020109004468200_btac051-B37","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1128\/MCB.24.3.1174-1187.2004","article-title":"Human RNPS1 and its associated factors: a versatile alternative pre-mRNA splicing regulator in vivo","volume":"24","author":"Sakashita","year":"2004","journal-title":"Mol. Cell. Biol"},{"key":"2023020109004468200_btac051-B38","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1158\/0008-5472.CAN-03-2636","article-title":"Inhibition of MUC4 expression suppresses pancreatic tumor cell growth and metastasis","volume":"64","author":"Singh","year":"2004","journal-title":"Cancer Res"},{"key":"2023020109004468200_btac051-B39","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1016\/j.molcel.2013.05.021","article-title":"Defining the RGG\/RG motif","volume":"50","author":"Thandapani","year":"2013","journal-title":"Mol. Cell"},{"key":"2023020109004468200_btac051-B40","doi-asserted-by":"crossref","first-page":"e24360","DOI":"10.4161\/idp.24360","article-title":"The alphabet of intrinsic disorder: I. Act like a Pro: on the abundance and roles of proline residues in intrinsically disordered proteins","volume":"1","author":"Theillet","year":"2013","journal-title":"Intrinsically Disord. Proteins"},{"key":"2023020109004468200_btac051-B41","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1016\/j.tibs.2012.08.004","article-title":"Intrinsically disordered proteins: a 10-year recap","volume":"37","author":"Tompa","year":"2012","journal-title":"Trends Biochem. Sci"},{"key":"2023020109004468200_btac051-B42","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023020109004468200_btac051-B43","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1146\/annurev-biophys-062920-063704","article-title":"Recent developments in the field of intrinsically disordered proteins: intrinsic disorder-based emergence in cellular biology in light of the physiological and pathological liquid-liquid phase transitions","volume":"50","author":"Uversky","year":"2021","journal-title":"Annu. Rev. Biophys"},{"key":"2023020109004468200_btac051-B44","doi-asserted-by":"crossref","first-page":"1231","DOI":"10.1016\/j.bbapap.2010.01.017","article-title":"Understanding protein non-folding","volume":"1804","author":"Uversky","year":"2010","journal-title":"Biochim. Biophys. Acta"},{"key":"2023020109004468200_btac051-B45","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1146\/annurev.biophys.37.032807.125924","article-title":"Intrinsically disordered proteins in human diseases: introducing the D2 concept","volume":"37","author":"Uversky","year":"2008","journal-title":"Annu. Rev. Biophys"},{"key":"2023020109004468200_btac051-B46","first-page":"27","article-title":"Disease mutations in disordered regions\u2014exception to the rule?","volume":"8","author":"Vacic","year":"2012","journal-title":"Mol"},{"key":"2023020109004468200_btac051-B47","doi-asserted-by":"crossref","first-page":"6589","DOI":"10.1021\/cr400525m","article-title":"Classification of intrinsically disordered regions and proteins","volume":"114","author":"van der Lee","year":"2014","journal-title":"Chem. Rev"},{"key":"2023020109004468200_btac051-B48","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1089\/dna.2016.3222","article-title":"Discovery, characterization, and functional study of a novel MEF2D CAG repeat in duck (Anas platyrhynchos)","volume":"35","author":"Wang","year":"2016","journal-title":"DNA Cell Biol"},{"key":"2023020109004468200_btac051-B49","doi-asserted-by":"crossref","first-page":"2138","DOI":"10.1093\/bioinformatics\/bth195","article-title":"The DISOPRED server for the prediction of protein disorder","volume":"20","author":"Ward","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020109004468200_btac051-B50","doi-asserted-by":"crossref","first-page":"1188","DOI":"10.1016\/j.cell.2012.05.022","article-title":"Getting RNA and protein in phase","volume":"149","author":"Weber","year":"2012","journal-title":"Cell"},{"key":"2023020109004468200_btac051-B51","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1083\/jcb.129.1.255","article-title":"Episialin (MUC1) overexpression inhibits integrin-mediated cell adhesion to extracellular matrix components","volume":"129","author":"Wesseling","year":"1995","journal-title":"J. Cell Biol"},{"key":"2023020109004468200_btac051-B52","first-page":"89","article-title":"The protein non-folding problem: amino acid determinants of intrinsic order and disorder","author":"Williams","year":"2001","journal-title":"Pac. Symp. Biocomput"},{"key":"2023020109004468200_btac051-B53","doi-asserted-by":"crossref","first-page":"e1003192","DOI":"10.1371\/journal.pcbi.1003192","article-title":"On the importance of polar interactions for complexes containing intrinsically disordered proteins","volume":"9","author":"Wong","year":"2013","journal-title":"PLoS Comput. Biol"},{"key":"2023020109004468200_btac051-B54","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1038\/nrm3920","article-title":"Intrinsically disordered proteins in cellular signalling and regulation","volume":"16","author":"Wright","year":"2015","journal-title":"Nat. Rev. Mol. Cell Biol"},{"key":"2023020109004468200_btac051-B55","doi-asserted-by":"crossref","first-page":"1882","DOI":"10.1021\/pr060392u","article-title":"Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions","volume":"6","author":"Xie","year":"2007","journal-title":"J. Proteome Res"},{"key":"2023020109004468200_btac051-B56","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1080\/07391102.2012.675145","article-title":"Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life","volume":"30","author":"Xue","year":"2012","journal-title":"J. Biomol. Struct. Dyn"},{"key":"2023020109004468200_btac051-B57","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1080\/073911012010525024","article-title":"The roles of intrinsic disorder in orchestrating the Wnt-pathway","volume":"29","author":"Xue","year":"2012","journal-title":"J. Biomol. Struct. Dyn"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac051\/42490627\/btac051.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/7\/1870\/49009144\/btac051.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/7\/1870\/49009144\/btac051.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T08:03:22Z","timestamp":1700121802000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/7\/1870\/6517502"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,1,30]]},"references-count":57,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2022,3,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac051","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,4,1]]},"published":{"date-parts":[[2022,1,30]]}}}