{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T15:17:54Z","timestamp":1773155874844,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"23","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: G protein-coupled receptors (GPCRs) play an important role in many physiological systems by transducing an extracellular signal into an intracellular response. Over 50% of all marketed drugs are targeted towards a GPCR. There is considerable interest in developing an algorithm that could effectively predict the function of a GPCR from its primary sequence. Such an algorithm is useful not only in identifying novel GPCR sequences but in characterizing the interrelationships between known GPCRs.<\/jats:p><jats:p>Results: An alignment-free approach to GPCR classification has been developed using techniques drawn from data mining and proteochemometrics. A dataset of over 8000 sequences was constructed to train the algorithm. This represents one of the largest GPCR datasets currently available. A predictive algorithm was developed based upon the simplest reasonable numerical representation of the protein's physicochemical properties. A selective top-down approach was developed, which used a hierarchical classifier to assign sequences to subdivisions within the GPCR hierarchy. The predictive performance of the algorithm was assessed against several standard data mining classifiers and further validated against Support Vector Machine-based GPCR prediction servers. The selective top-down approach achieves significantly higher accuracy than standard data mining methods in almost all cases.<\/jats:p><jats:p>Contact: \u00a0m.davies@mail.cryst.bbk.ac.uk<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm506","type":"journal-article","created":{"date-parts":[[2007,10,24]],"date-time":"2007-10-24T00:20:21Z","timestamp":1193185221000},"page":"3113-3118","source":"Crossref","is-referenced-by-count":76,"title":["On the hierarchical classification of G protein-coupled receptors"],"prefix":"10.1093","volume":"23","author":[{"given":"Matthew N.","family":"Davies","sequence":"first","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrew","family":"Secker","sequence":"additional","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alex A.","family":"Freitas","sequence":"additional","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Miguel","family":"Mendao","sequence":"additional","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jon","family":"Timmis","sequence":"additional","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Darren R.","family":"Flower","sequence":"additional","affiliation":[{"name":"1 Edward Jenner Institute, Compton, Newbury, Berkshire, RG20 7NN, 2Department of Computing and Centre for BioMedical Informatics, University of Kent, Canterbury, Kent CT2 7NF and 3Departments of Computer Science and Electronics, University of York, Heslington, York YO10 5DD, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2007,9,22]]},"reference":[{"key":"2023041107510320500_","first-page":"162","article-title":"A compendium of specific motifs for diagnosing GPCR subtypes","volume":"22","author":"Attwood","year":"2001","journal-title":"Pharmacol. Sci"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1093\/nar\/30.1.239","article-title":"PRINTS and PRINTS-S shed light on protein ancestry","volume":"30","author":"Attwood","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"W383","DOI":"10.1093\/nar\/gkh416","article-title":"GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors","volume":"32","author":"Bhasin","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"W143","DOI":"10.1093\/nar\/gki351","article-title":"GPCRsclass: a web tool for the classification of amine type of G protein-coupled receptors","volume":"33","author":"Bhasin","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1081\/RRS-120025192","article-title":"Conformational changes of G protein-coupled receptors during their activation by agonist binding","volume":"23","author":"Bissantz","year":"2003","journal-title":"J. Recept. Signal Transduct. Res"},{"key":"2023041107510320500_","article-title":"WEKA Classification Algorithms, Version 1.6","author":"Brownlee","year":"2007"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1186\/1471-2148-6-108","article-title":"Evolution of secretin family GPCR members in the metazoa","volume":"6","author":"Cardoso","year":"2006","journal-title":"BMC Evol. Biol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1124\/pr.54.2.323","article-title":"G protein-coupled receptor allosterism and complexing","volume":"54","author":"Christopoulos","year":"2002","journal-title":"Pharmacol. Rev"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1093\/bioinformatics\/btg1058","article-title":"Predicting gene function in Saccharomyces cerevisiae","volume":"19","author":"Claire","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041107510320500_","first-page":"126","article-title":"Advances in bioinformatics and computational biology","volume-title":"Volume 4643 of Lecture Notes in Bioinformatics","author":"Costa","year":"2007"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"8115","DOI":"10.1523\/JNEUROSCI.1015-06.2006","article-title":"The role of protein interaction motifs in regulating the polarity and clustering of the metabotropic glutamate receptor mGluR1a","volume":"26","author":"Das","year":"2006","journal-title":"J. Neurosci"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"2800","DOI":"10.1002\/pmic.200700093","article-title":"Proteomic applications of automated GPCR classification","volume":"7","author":"Davies","year":"2007","journal-title":"Proteomics"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1016\/S0304-4157(99)00006-4","article-title":"Modelling G-protein-coupled receptors for drug design","volume":"1422","author":"Flower","year":"1999","journal-title":"Biochim. Biophys. Acta"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1016\/S1084-9521(04)00094-1","article-title":"Integrative bioinformatics for functional genome annotation: trawling for G protein-coupled receptors","volume":"15","author":"Flower","year":"2004","journal-title":"Semin. Cell Dev. Biol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1042\/bst0300473","article-title":"Bioinformatics and type II G-protein-coupled receptors","volume":"30","author":"Foord","year":"2002","journal-title":"Biochem. Soc. Trans"},{"key":"2023041107510320500_","article-title":"Generating accurate rule sets without global optimization","author":"Frank","year":"1998"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"175","DOI":"10.4018\/978-1-59904-271-8.ch007","article-title":"A tutorial on hierarchical classification with applications in bioinformatics","volume-title":"Research and Trends in Data Mining Technologies and Applications","author":"Freitas","year":"2007"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"864","DOI":"10.1016\/j.ympev.2006.11.007","article-title":"Formation of new genes explains lower intron density in mammalian Rhodopsin G protein-coupled receptors","volume":"43","author":"Fridmanis","year":"2006","journal-title":"Mol. Phylogenet. Evol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1034\/j.1600-0773.2002.910607.x","article-title":"Structural basis for activation of G-protein-coupled receptors","volume":"91","author":"Gether","year":"2002","journal-title":"Pharmacol. Toxicol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/j.bbagen.2004.12.001","article-title":"Nine new human Rhodopsin family G-protein coupled receptors: identification, sequence characterisation and evolutionary relationship","volume":"1722","author":"Gloriam","year":"2005","journal-title":"Biochim. Biophys. Acta"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1111\/j.1745-7270.2005.00110.x","article-title":"Fast fourier transform-based support vector machine for prediction of G-protein coupled receptor subfamilies","volume":"37","author":"Guo","year":"2005","journal-title":"Acta Biochim. Biophys. Sin. (Shanghai)"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/s00726-006-0332-z","article-title":"Classifying G protein-coupled receptors and nuclear receptors on the basis of protein power spectrum from fast Fourier transform","volume":"30","author":"Guo","year":"2006","journal-title":"Amino Acids"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1139\/o98-012","article-title":"Structural and functional aspects of G protein-coupled receptor oligomerization","volume":"76","author":"Hebert","year":"1998","journal-title":"Biochem. Cell Biol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1093\/nar\/gkg103","article-title":"GPCRDB information system for G protein-coupled receptors","volume":"31","author":"Horn","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1093\/bioinformatics\/18.1.147","article-title":"Classifying G-protein coupled receptors with support vector machines","volume":"18","author":"Karchin","year":"2002","journal-title":"Bioinformatics"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1162\/089976601300014493","article-title":"Improvements to Platt's SMO algorithm for SVM classifier design","volume":"13","author":"Keerthi","year":"2001","journal-title":"Neural Comput"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1002\/1439-7633(20021004)3:10<928::AID-CBIC928>3.0.CO;2-5","article-title":"Drug design strategies for targeting G-protein-coupled receptors","volume":"3","author":"Klabunde","year":"2002","journal-title":"Chembiochem"},{"key":"2023041107510320500_","first-page":"1","article-title":"GCRDb: a G-protein-coupled receptor database","volume":"2","author":"Kolakowski","year":"1994","journal-title":"Recept. Channels"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1124\/mol.61.6.1465","article-title":"Proteochemometrics modelling of the interaction of amine G-protein coupled receptors with a diverse set of ligands","volume":"61","author":"Lapinsh","year":"2002","journal-title":"Mol. Pharmacol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1016\/j.drudis.2006.04.007","article-title":"G-protein-coupled receptor heterodimers: pharmacology, function and relevance to drug discovery","volume":"11","author":"Milligan","year":"2006","journal-title":"Drug Discov. Today"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"1638","DOI":"10.1126\/science.1106267","article-title":"Insect sex-pheromone signals mediated by specific combinations of olfactory receptors","volume":"307","author":"Nakagawa","year":"2005","journal-title":"Science"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"W380","DOI":"10.1093\/nar\/gkh431","article-title":"PRED-GPCR: GPCR recognition and family classification server","volume":"32","author":"Papasaikas","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1016\/j.ejcb.2006.04.003","article-title":"The Dictyostelium repertoire of seven transmembrane domain receptors","volume":"85","author":"Prabhu","year":"2006","journal-title":"Eur. J. Cell Biol"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1021\/jm9700575","article-title":"New chemical descriptors relevant for the design of biologically active peptides. A multivariate characterization of 87 amino acids","volume":"41","author":"Sandberg","year":"1998","journal-title":"J. Med. Chem"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.ygcen.2004.12.018","article-title":"The GRAFS classification system of G-protein coupled receptors in comparative perspective","volume":"142","author":"Schiath","year":"2005","journal-title":"Gen. Comp. Endocrinol"},{"key":"2023041107510320500_","first-page":"13","article-title":"An experimental comparison of classification algorithms for the hierarchical prediction of protein function","author":"Secker","year":"2007"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1023\/B:GENP.0000030197.83685.94","article-title":"Artificial immune recognition system (AIRS): an immune-inspired supervised learning algorithm","volume":"5","author":"Watkins","year":"2004","journal-title":"Genet. Program. Evolv. Mach"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"D5","DOI":"10.1093\/nar\/gkl1031","article-title":"Database resources of the national center for biotechnology information","volume":"35","author":"Wheeler","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1110\/ps.051745906","article-title":"A general model of G protein-coupled receptor sequences and its application to detect remote homologs","volume":"15","author":"Wistrand","year":"2006","journal-title":"Protein Sci"},{"key":"2023041107510320500_","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005"},{"key":"2023041107510320500_","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1016\/0003-2670(93)80437-P","article-title":"DNA and peptide sequences and chemical processes mutlivariately modelled by principal component analysis and partial least-squares projections to latent structures","volume":"277","author":"Wold","year":"1993","journal-title":"Anal. Chim. Acta"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3113\/49821361\/bioinformatics_23_23_3113.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3113\/49821361\/bioinformatics_23_23_3113.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,14]],"date-time":"2023-05-14T13:43:00Z","timestamp":1684071780000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/23\/3113\/291144"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,9,22]]},"references-count":41,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2007,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm506","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,12,1]]},"published":{"date-parts":[[2007,9,22]]}}}