{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T21:51:11Z","timestamp":1740174671655,"version":"3.37.3"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2023,11,28]],"date-time":"2023-11-28T00:00:00Z","timestamp":1701129600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP160101527"],"award-info":[{"award-number":["DP160101527"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,4,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Principal components analysis (PCA) has been one of the staple methods used in stylometry. In a 2021 article, Pervez Rizvi casts doubt on this method and argues that some widely cited results based on it should be set aside. In the current article, I show that none of Rizvi\u2019s theoretical claims or experimental results stand up to examination. Rizvi argues that discarding the principal components beyond the first two makes the method unreliable, but permutation testing of PCAs shows that the top components in these trials are significant and robust, and the results across many experiments show the combination of the first and second component to be effective in classification. Rizvi argues that PCA components must be treated separately, and much of his critique of the PCA method is based on this standpoint, but this is not the practice in the work presented in the publications he cites or in the wider literature. 
Rizvi is unable to replicate a chart in an article by Craig, but his replication, unlike the original, does not account for the widely varying sizes of samples in his data. The current article shows that Rizvi\u2019s claims are misguided and that using PCA in the Burrows tradition to find and formalize authorial discriminations in text samples from plays of the Shakespearean era is efficacious and robust.<\/jats:p>","DOI":"10.1093\/llc\/fqad083","type":"journal-article","created":{"date-parts":[[2023,11,29]],"date-time":"2023-11-29T02:07:52Z","timestamp":1701223672000},"page":"97-108","source":"Crossref","is-referenced-by-count":0,"title":["Principal components analysis in stylometry"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9336-1678","authenticated-orcid":false,"given":"Hugh","family":"Craig","sequence":"first","affiliation":[{"name":"School of Humanities, Creative Industries and Social Sciences, University of Newcastle , Newcastle, NSW 2308, Australia"}]}],"member":"286","published-online":{"date-parts":[[2023,11,28]]},"reference":[{"volume-title":"Practical Multivariate Analysis.","year":"2020","author":"Afifi","key":"2024040210375465600_fqad083-B1"},{"key":"2024040210375465600_fqad083-B2","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1198\/016214505000000628","article-title":"Prediction by Supervised Principal Components\u2019,","volume":"101","author":"Bair","year":"2006","journal-title":"Journal of the American Statistical Association"},{"key":"2024040210375465600_fqad083-B3","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1080\/09332480.2003.10554843","article-title":"Who Wrote the 15th book of Oz? 
An Application of Multivariate Analysis to Authorship Attribution\u2019,","volume":"16","author":"Binongo","year":"2003","journal-title":"Chance: A Magazine of the American Statistical Association"},{"key":"2024040210375465600_fqad083-B4","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1093\/llc\/7.2.91","article-title":"Not Unless You Ask Nicely: The Interpretive Nexus Between Analysis and Information\u2019,","volume":"7","author":"Burrows","year":"1992","journal-title":"Literary and Linguistic Computing"},{"key":"2024040210375465600_fqad083-B5","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/llc\/fqi067","article-title":"All the Way Through: Testing for Authorship in Different Frequency Strata\u2019,","volume":"22","author":"Burrows","year":"2007","journal-title":"Literary and Linguistic Computing"},{"key":"2024040210375465600_fqad083-B6","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1080\/0013838X.2012.668786","article-title":"Authors and Characters\u2019,","volume":"23","author":"Burrows","year":"2012","journal-title":"English Studies"},{"year":"2022","author":"Camargo","key":"2024040210375465600_fqad083-B7"},{"key":"2024040210375465600_fqad083-B8","doi-asserted-by":"crossref","first-page":"e12967","DOI":"10.7717\/peerj.12967","article-title":"PCAtest: Testing the Statistical Significance of Principal Component Analysis in R\u2019","volume":"10","author":"Camargo","year":"2022","journal-title":"PeerJ"},{"key":"2024040210375465600_fqad083-B9","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1207\/s15327906mbr0102_10","article-title":"The Scree Test for the Number of Factors\u2019,","volume":"1","author":"Cattell","year":"1966","journal-title":"Multivariate Behavioral Research"},{"key":"2024040210375465600_fqad083-B10","article-title":"Style, Statistics and New Models of Authorship\u2019,","volume":"15","author":"Craig","year":"2009","journal-title":"Early Modern Literary 
Studies"},{"key":"2024040210375465600_fqad083-B11","doi-asserted-by":"crossref","DOI":"10.1017\/9781108120456","volume-title":"Style, Computers, and Early Modern Drama: Beyond Authorship.","author":"Craig","year":"2017"},{"key":"2024040210375465600_fqad083-B151","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511605437","volume-title":"Shakespeare, Computers, and the Mystery of Authorship.","author":"Craig","year":"2009"},{"key":"2024040210375465600_fqad083-B12","first-page":"1","article-title":"Bernard of Clairvaux and Nicholas of Monti\u00e9ramey: Tracing the Secretarial Trail with Computational Stylistics\u2019,","volume":"92(Suppl 1)","author":"De Gussem","year":"2017","journal-title":"Speculum"},{"key":"2024040210375465600_fqad083-B13","doi-asserted-by":"crossref","first-page":"3143","DOI":"10.1016\/j.measurement.2013.06.038","article-title":"Bearing Degradation Process Prediction Based on the PCA and Optimized LS-SVM Model\u2019,","volume":"46","author":"Dong","year":"2013","journal-title":"Measurement"},{"key":"2024040210375465600_fqad083-B14","doi-asserted-by":"crossref","DOI":"10.4135\/9781412985475","volume-title":"Principal Components Analysis","author":"Dunteman","year":"1989"},{"key":"2024040210375465600_fqad083-B15","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1093\/llc\/fqz085","article-title":"Stylistic Palimpsests: Computational Stylistic Perspectives on Precursory Authorship in Aphra Behn\u2019s Drama\u2019,","volume":"36","author":"Evans","year":"2021","journal-title":"Digital Scholarship in the Humanities"},{"year":"2007","author":"Farmer","key":"2024040210375465600_fqad083-B16"},{"key":"2024040210375465600_fqad083-B17","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1093\/llc\/14.3.375","article-title":"Cicero, Sigonio, and Burrows: Investigating the Authenticity of the Consolatio\u2019,","volume":"14","author":"Forsyth","year":"1999","journal-title":"Literary and Linguistic 
Computing"},{"key":"2024040210375465600_fqad083-B18","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1075\/ijcl.18.2.04gra","article-title":"Interfacing Corpus Linguistics and Computational Stylistics: Translation Universals in Translational Literary Polish\u2019,","volume":"18","author":"Grabowski","year":"2013","journal-title":"International Journal of Corpus Linguistics"},{"year":"2018","author":"Hartmann","key":"2024040210375465600_fqad083-B19"},{"key":"2024040210375465600_fqad083-B20","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1093\/llc\/13.3.111","article-title":"The Evolution of Stylometry in Humanities Scholarship\u2019,","volume":"13","author":"Holmes","year":"1998","journal-title":"Literary and Linguistic Computing"},{"key":"2024040210375465600_fqad083-B21","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1093\/llc\/16.4.403","article-title":"A Widow and her Soldier: Stylometry and the American Civil War\u2019,","volume":"16","author":"Holmes","year":"2001","journal-title":"Literary and Linguistic Computing"},{"key":"2024040210375465600_fqad083-B22","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1037\/h0071325","article-title":"Analysis of a Complex of Statistical Variables into Principal Components\u2019,","volume":"24","author":"Hotelling","year":"1933","journal-title":"Journal of Educational Psychology"},{"volume-title":"IBM SPSS Statistics for Windows, Version 27.0.","year":"2020","author":"IBM Corp","key":"2024040210375465600_fqad083-B23"},{"key":"2024040210375465600_fqad083-B24","doi-asserted-by":"crossref","first-page":"20150202","DOI":"10.1098\/rsta.2015.0202","article-title":"Principal Component Analysis: A Review and Recent Developments\u2019,","volume":"374","author":"Jolliffe","year":"2016","journal-title":"Philosophical Transactions of the Royal Society A"},{"volume-title":"Principal Component 
Analysis","year":"2013","author":"Jolliffe","key":"2024040210375465600_fqad083-B25"},{"key":"2024040210375465600_fqad083-B26","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1037\/h0034747","article-title":"On the Psychology of Prediction\u2019,","volume":"80","author":"Kahneman","year":"1973","journal-title":"Psychological Review"},{"key":"2024040210375465600_fqad083-B27","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1177\/001316446002000116","article-title":"The Application of Electronic Computers to Factor Analysis\u2019,","volume":"20","author":"Kaiser","year":"1960","journal-title":"Educational and Psychological Measurement"},{"key":"2024040210375465600_fqad083-B28","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"LIII. On Lines and Planes of Closest Fit to Systems of Points in Space\u2019,","volume":"2","author":"Pearson","year":"1901","journal-title":"The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science"},{"first-page":"1","year":"","author":"Rehman","key":"2024040210375465600_fqad083-B29"},{"key":"2024040210375465600_fqad083-B30","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1093\/llc\/fqy038","article-title":"The Interpretation of Zeta Test Results\u2019,","volume":"34","author":"Rizvi","year":"2019","journal-title":"Digital Scholarship in the Humanities"},{"key":"2024040210375465600_fqad083-B31","first-page":"1030","article-title":"Shakespeare and Principal Components Analysis\u2019,","volume":"36","author":"Rizvi","year":"2021","journal-title":"Digital Scholarship in the Humanities"},{"key":"2024040210375465600_fqad083-B32","first-page":"130","article-title":"Corneille, Moli\u00e8re et les Autres. Stilometrische Analysen zu Autorschaft und Gattungszugeh\u00f6rigkeit im franz\u00f6sischen Theater der Klassik. Literaturwissenschaft im Digitalen Medienwandel\u2019. C. Sch\u00f6ch and L. 
Schneider,","volume":"7","author":"Sch\u00f6ch","year":"2014","journal-title":"Online PhiN"},{"key":"2024040210375465600_fqad083-B33","first-page":"538","volume-title":"Multivariate Analysis VI","author":"Takemura","year":"1985"},{"key":"2024040210375465600_fqad083-B34","doi-asserted-by":"crossref","DOI":"10.1093\/actrade\/9780199591169.001.0001","volume-title":"The New Oxford Shakespeare: Authorship Companion","author":"Taylor","year":"2017"},{"key":"2024040210375465600_fqad083-B35","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1037\/h0031322","article-title":"Belief in the Law of Small Numbers\u2019,","volume":"76","author":"Tversky","year":"1971","journal-title":"Psychological Bulletin"},{"key":"2024040210375465600_fqad083-B36","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1126\/science.185.4157.1124","article-title":"Judgment under Uncertainty: Heuristics and Biases\u2019,","volume":"185","author":"Tversky","year":"1974","journal-title":"Science"},{"key":"2024040210375465600_fqad083-B37","first-page":"103","article-title":"Permutation Tests to Estimate Significances on Principal Components Analysis","volume":"2","author":"Vieira","year":"2012","journal-title":"Computational Ecology and Software"},{"key":"2024040210375465600_fqad083-B38","doi-asserted-by":"crossref","first-page":"e2937","DOI":"10.1002\/cem.2937","article-title":"Selecting the Number of Factors in Principal Component Analysis by Permutation Testing\u2014Numerical and Practical Aspects\u2019,","volume":"31","author":"Vitale","year":"","journal-title":"Journal of Chemometrics"}],"container-title":["Digital Scholarship in the 
Humanities"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/39\/1\/97\/57134448\/fqad083.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/39\/1\/97\/57134448\/fqad083.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,2]],"date-time":"2024-04-02T13:56:12Z","timestamp":1712066172000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article\/39\/1\/97\/7453619"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,28]]},"references-count":39,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,11,28]]},"published-print":{"date-parts":[[2024,4,2]]}},"URL":"https:\/\/doi.org\/10.1093\/llc\/fqad083","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"type":"print","value":"2055-7671"},{"type":"electronic","value":"2055-768X"}],"subject":[],"published-other":{"date-parts":[[2024,4,1]]},"published":{"date-parts":[[2023,11,28]]}}}