{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:26:42Z","timestamp":1760059602394,"version":"build-2065373602"},"reference-count":15,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T00:00:00Z","timestamp":1750723200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Spanish Ministry of Science and Innovation","award":["PID2021-126884NB-I00 (MCIN\/AEI\/10.13039\/501100011033)"],"award-info":[{"award-number":["PID2021-126884NB-I00 (MCIN\/AEI\/10.13039\/501100011033)"]}]},{"name":"Fundaci\u00f3n Ram\u00f3n Areces","award":["PID2021-126884NB-I00 (MCIN\/AEI\/10.13039\/501100011033)"],"award-info":[{"award-number":["PID2021-126884NB-I00 (MCIN\/AEI\/10.13039\/501100011033)"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Orthographic knowledge is a critical component of skilled language use, yet its large-scale behavioral signatures remain understudied in Spanish. To address this gap, we developed OrthoKnow-SP, a megastudy that captures spelling decisions from 27,185 native Spanish-speaking adults who completed an 80-item forced-choice task. Each trial required selecting the correctly spelled word from a pair comprising a real word and a pseudohomophone foil that preserved pronunciation while violating the correct graphemic representation. The stimuli targeted six high-confusability contrasts in Spanish orthography. We recorded response accuracy and reaction times for over 2.17 million trials, alongside demographic and device metadata. Results show robust variability across items and individuals, with item-level metrics closely aligned with independent norms of word prevalence. A composite difficulty index integrating speed and accuracy further allowed fine-grained item ranking. The dataset provides the first population-scale norms of Spanish spelling difficulty, capturing regional and generational diversity absent from traditional lab-based studies. Public release of OrthoKnow-SP enables new research on the cognitive and demographic factors shaping orthographic decisions, and provides educators, clinicians, and developers with a valuable benchmark for assessing spelling competence and modeling written language processing.<\/jats:p>","DOI":"10.3390\/data10070101","type":"journal-article","created":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T08:50:57Z","timestamp":1750755057000},"page":"101","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["OrthoKnow-SP: A Large-Scale Dataset on Orthographic Knowledge and Spelling Decisions in Spanish Adults"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3312-8559","authenticated-orcid":false,"given":"Jon Andoni","family":"Du\u00f1abeitia","sequence":"first","affiliation":[{"name":"Centro de Investigaci\u00f3n Nebrija en Cognici\u00f3n (CINC), Universidad Nebrija, 28043 Madrid, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,6,24]]},"reference":[{"key":"ref_1","unstructured":"Snowling, M.J., Hulme, C., and Nation, K. (2022). Word recognition I: Visual and orthographic processing. The Science of Reading: A Handbook, Wiley Blackwell. [2nd ed.]."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1016\/j.neuroimage.2016.05.072","article-title":"Uncovering phonological and orthographic selectivity across the reading network using fMRI-RA","volume":"138","author":"Glezer","year":"2016","journal-title":"NeuroImage"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1080\/10888430701530730","article-title":"Reading Ability: Lexical Quality to Comprehension","volume":"11","author":"Perfetti","year":"2007","journal-title":"Sci. Stud. Read."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Chrabaszcz, A., Gebremedhen, N.I., Alvarez, T.A., Durisko, C., and Fiez, J.A. (2023). Orthographic learning in adults through overt and covert reading. Acta Psychol., 241.","DOI":"10.1016\/j.actpsy.2023.104061"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1867","DOI":"10.3758\/s13428-020-01357-9","article-title":"How do Spanish speakers read words? Insights from a crowdsourced lexical decision megastudy","volume":"52","author":"Aguasvivas","year":"2020","journal-title":"Behav. Res."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Ziegler, J.C., and Ferrand, L. (2011). Orthographic consistency and word-frequency effects in auditory word recognition. Front. Psychol., 2.","DOI":"10.3389\/fpsyg.2011.00263"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.cognition.2017.06.015","article-title":"Phonological and orthographic coding in deaf skilled readers","volume":"168","author":"Carreiras","year":"2017","journal-title":"Cognition"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Afonso, O., Su\u00e1rez-Coalla, P., and Cuetos, F. (2015). Spelling impairments in Spanish dyslexic adults. Front. Psychol., 6.","DOI":"10.3389\/fpsyg.2015.00466"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Costello, B., Caffarra, S., Fari\u00f1a, N., Du\u00f1abeitia, J.A., and Carreiras, M. (2021). Reading without phonology: ERP evidence from skilled deaf readers of Spanish. Sci. Rep., 11.","DOI":"10.1038\/s41598-021-84490-5"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Dean, C.A., and Kroff, J.R.V. (2017). Cross-Linguistic Orthographic Effects in Late Spanish\/English Bilinguals. Languages, 2.","DOI":"10.3390\/languages2040024"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Furgoni, A., Martin, C.D., and Stoehr, A. (2025). A cross linguistic study on orthographic influence during auditory word recognition. Sci. Rep., 15.","DOI":"10.1038\/s41598-025-92885-x"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1457","DOI":"10.1080\/17470218.2015.1051065","article-title":"Megastudies, crowdsourcing, and large datasets in psycholinguistics: An overview of recent developments","volume":"68","author":"Keuleers","year":"2015","journal-title":"Q. J. Exp. Psychol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1246","DOI":"10.3758\/s13428-013-0326-1","article-title":"EsPal: One-stop shopping for Spanish word properties","volume":"45","author":"Duchon","year":"2013","journal-title":"Behav. Res."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Aguasvivas, J.A., Carreiras, M., Brysbaert, M., Mandera, P., Keuleers, E., and Du\u00f1abeitia, J.A. (2018). SPALEX: A Spanish lexical decision database from a massive online data collection. Front. Psychol., 9.","DOI":"10.3389\/fpsyg.2018.02156"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"898","DOI":"10.3758\/s13428-021-01669-4","article-title":"The predictors of general knowledge: Data from a Spanish megastudy","volume":"54","author":"Boada","year":"2022","journal-title":"Behav. Res."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/101\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:57:49Z","timestamp":1760032669000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/101"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,24]]},"references-count":15,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7]]}},"alternative-id":["data10070101"],"URL":"https:\/\/doi.org\/10.3390\/data10070101","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2025,6,24]]}}}