{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T14:07:01Z","timestamp":1780409221559,"version":"3.54.1"},"reference-count":45,"publisher":"MIT Press - Journals","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["TACL"],"published-print":{"date-parts":[[2017,12]]},"abstract":"<jats:p> We introduce a method for measuring the correspondence between low-level speech features and human perception, using a cognitive model of speech perception implemented directly on speech recordings. We evaluate two speaker normalization techniques using this method and find that in both cases, speech features that are normalized across speakers predict human data better than unnormalized speech features, consistent with previous research. Results further reveal differences across normalization methods in how well each predicts human data. This work provides a new framework for evaluating low-level representations of speech on their match to human perception, and lays the groundwork for creating more ecologically valid models of speech perception. <\/jats:p>","DOI":"10.1162\/tacl_a_00071","type":"journal-article","created":{"date-parts":[[2018,12,28]],"date-time":"2018-12-28T15:42:50Z","timestamp":1546011770000},"page":"425-440","source":"Crossref","is-referenced-by-count":8,"title":["Evaluating Low-Level Speech Features Against Human Perceptual                     Data"],"prefix":"10.1162","volume":"5","author":[{"given":"Caitlin","family":"Richter","sequence":"first","affiliation":[{"name":"Dept. of Linguistics, University of Pennsylvania,"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Naomi H.","family":"Feldman","sequence":"additional","affiliation":[{"name":"Dept. of Linguistics and UMIACS, University of Maryland,"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Harini","family":"Salgado","sequence":"additional","affiliation":[{"name":"Dept. of Computer Science, Pomona College,"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Aren","family":"Jansen","sequence":"additional","affiliation":[{"name":"HLTCOE, Johns Hopkins University,"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"281","reference":[{"issue":"5","key":"p_1","doi-asserted-by":"crossref","first-page":"3099","DOI":"10.1121\/1.1795335","volume":"116","author":"Adank Patti","year":"2004","journal-title":"Journal of the Acoustical Society of America"},{"key":"p_2","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1016\/0010-0277(94)90042-6","volume":"52","author":"Andruski Jean E.","year":"1994","journal-title":"Cognition"},{"issue":"4","key":"p_3","doi-asserted-by":"crossref","first-page":"916","DOI":"10.3758\/s13423-014-0783-2","volume":"22","author":"Apfelbaum Keith S.","year":"2015","journal-title":"Psychonomic Bulletin and Review"},{"issue":"5","key":"p_4","doi-asserted-by":"crossref","first-page":"3453","DOI":"10.1121\/1.4747011","volume":"132","author":"Barreda Santiago","year":"2012","journal-title":"Journal of the Acoustical Society of America"},{"key":"p_5","doi-asserted-by":"publisher","DOI":"10.1162\/0898929054985473"},{"issue":"3","key":"p_8","doi-asserted-by":"crossref","first-page":"804","DOI":"10.1016\/j.cognition.2008.04.004","volume":"108","author":"Clayards Meghan","year":"2008","journal-title":"Cognition"},{"issue":"6","key":"p_9","doi-asserted-by":"crossref","first-page":"633","DOI":"10.1016\/j.specom.2005.09.010","volume":"48","author":"Clopper Cynthia G.","year":"2006","journal-title":"Speech Communication"},{"issue":"5","key":"p_10","doi-asserted-by":"crossref","first-page":"3246","DOI":"10.1121\/1.411700","volume":"97","author":"Cohen Jordan","year":"1995","journal-title":"Journal of the Acoustical Society of America"},{"key":"p_11","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1016\/j.wocn.2009.08.004","volume":"38","author":"Cole Jennifer","year":"2010","journal-title":"Journal of Phonetics"},{"key":"p_12","doi-asserted-by":"crossref","first-page":"710","DOI":"10.1016\/j.cognition.2008.06.003","volume":"108","author":"Dahan Delphine","year":"2008","journal-title":"Cognition"},{"key":"p_14","first-page":"1","volume":"39","author":"Dempster Arthur P.","year":"1977","journal-title":"Journal of the Royal Statistical Society, B"},{"issue":"4","key":"p_15","doi-asserted-by":"crossref","first-page":"752","DOI":"10.1037\/a0017196","volume":"116","author":"Feldman Naomi H.","year":"2009","journal-title":"Psychological Review"},{"issue":"3","key":"p_16","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1561\/2000000004","volume":"1","author":"Gales Mark","year":"2007","journal-title":"Foundations and Trends in Signal Processing"},{"issue":"1","key":"p_18","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.csl.2005.05.002","volume":"20","author":"Giuliani Diego","year":"2006","journal-title":"Computer Speech & Language"},{"key":"p_21","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1016\/j.wocn.2004.03.001","volume":"32","author":"Halberstam Benjamin","year":"2004","journal-title":"Journal of Phonetics"},{"issue":"1","key":"p_22","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1093\/biomet\/57.1.97","volume":"57","author":"Hastings W.","year":"1970","journal-title":"Biometrika"},{"issue":"4","key":"p_24","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1109\/89.326616","volume":"2","author":"Hermansky Hynek","year":"1994","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"issue":"4","key":"p_25","doi-asserted-by":"crossref","first-page":"1738","DOI":"10.1121\/1.399423","volume":"87","author":"Hermansky Hynek","year":"1990","journal-title":"Journal of the Acoustical Society of America"},{"issue":"5","key":"p_26","doi-asserted-by":"crossref","first-page":"3099","DOI":"10.1121\/1.411872","volume":"97","author":"Hillenbrand James","year":"1995","journal-title":"Journal of the Acoustical Society of America"},{"issue":"5","key":"p_27","doi-asserted-by":"crossref","first-page":"3059","DOI":"10.1121\/1.2188377","volume":"119","author":"Holt Lori L.","year":"2006","journal-title":"Journal of the Acoustical Society of America"},{"issue":"6","key":"p_28","first-page":"1939","volume":"37","author":"Idemaru Kaori","year":"2011","journal-title":"Journal of Experimental Psychology: Human Perception and Performance"},{"issue":"3","key":"p_29","first-page":"1009","volume":"40","author":"Idemaru Kaori","year":"2014","journal-title":"Journal of Experimental Psychology: Human Perception and Performance"},{"issue":"6","key":"p_30","doi-asserted-by":"crossref","first-page":"3950","DOI":"10.1121\/1.4765076","volume":"132","author":"Idemaru Kaori","year":"2012","journal-title":"Journal of the Acoustical Society of America"},{"issue":"9","key":"p_31","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1097\/WNR.0b013e3281053c4e","volume":"18","author":"Joanisse Marc F.","year":"2007","journal-title":"NeuroReport"},{"issue":"4","key":"p_33","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1016\/j.wocn.2005.08.004","volume":"34","author":"Johnson Keith","year":"2006","journal-title":"Journal of Phonetics"},{"issue":"1","key":"p_34","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1109\/89.221366","volume":"1","author":"Junqua Jean-Claude","year":"1993","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"issue":"2","key":"p_36","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1037\/a0038695","volume":"122","author":"Kleinschmidt Dave F.","year":"2015","journal-title":"Psychological Review"},{"issue":"6","key":"p_37","doi-asserted-by":"crossref","first-page":"1681","DOI":"10.3758\/s13423-016-1049-y","volume":"23","author":"Kronrod Yakov","year":"2016","journal-title":"Psychonomic Bulletin and Review"},{"issue":"6","key":"p_38","first-page":"1783","volume":"41","author":"Liu Ran","year":"2015","journal-title":"Journal of Experimental Psychology: Human Perception and Performance"},{"issue":"5","key":"p_39","doi-asserted-by":"crossref","first-page":"2836","DOI":"10.1121\/1.2897047","volume":"123","author":"Liu Chuping","year":"2008","journal-title":"Journal of the Acoustical Society of America"},{"issue":"2","key":"p_40","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1121\/1.1912396","volume":"49","author":"Lobanov B. M.","year":"1971","journal-title":"Journal of the Acoustical Society of America"},{"key":"p_41","doi-asserted-by":"crossref","first-page":"B101","DOI":"10.1016\/S0010-0277(01)00157-3","volume":"82","author":"Maye Jessica","year":"2002","journal-title":"Cognition"},{"issue":"2","key":"p_42","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1037\/a0022325","volume":"118","author":"McMurray Bob","year":"2011","journal-title":"Psychological Review"},{"issue":"5","key":"p_43","doi-asserted-by":"crossref","first-page":"2114","DOI":"10.1121\/1.397862","volume":"85","author":"Miller James D.","year":"1989","journal-title":"Journal of the Acoustical Society of America"},{"issue":"6","key":"p_45","doi-asserted-by":"crossref","first-page":"808","DOI":"10.1080\/01690965.2010.490047","volume":"25","author":"Monahan Philip J.","year":"2010","journal-title":"Language and Cognitive Processes"},{"issue":"5","key":"p_47","doi-asserted-by":"crossref","first-page":"2088","DOI":"10.1121\/1.397861","volume":"85","author":"Nearey Terrance M.","year":"1989","journal-title":"Journal of the Acoustical Society of America"},{"issue":"4","key":"p_48","doi-asserted-by":"crossref","first-page":"2253","DOI":"10.1121\/1.418207","volume":"101","author":"Nittrouer Susan","year":"1997","journal-title":"Journal of the Acoustical Society of America"},{"key":"p_49","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1016\/S0095-4470(19)30639-4","volume":"20","author":"Nittrouer Susan","year":"1992","journal-title":"Journal of Phonetics"},{"issue":"2","key":"p_50","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1121\/1.1906875","volume":"24","author":"Peterson Gordon E.","year":"1952","journal-title":"Journal of the Acoustical Society of America"},{"issue":"1","key":"p_51","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1044\/jshr.0401.10","volume":"4","author":"Peterson Gordon E.","year":"1961","journal-title":"Journal of Speech and Hearing Research"},{"issue":"2","key":"p_52","doi-asserted-by":"crossref","first-page":"285","DOI":"10.3758\/BF03213946","volume":"15","author":"Pisoni David B.","year":"1974","journal-title":"Perception and Psychophysics"},{"issue":"1","key":"p_54","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1037\/0033-2909.92.1.81","volume":"92","author":"Repp Bruno H.","year":"1982","journal-title":"Psychological Bulletin"},{"issue":"4","key":"p_58","doi-asserted-by":"crossref","first-page":"443","DOI":"10.3758\/PBR.17.4.443","volume":"17","author":"Shi Lei","year":"2010","journal-title":"Psychonomic Bulletin and Review"},{"issue":"10","key":"p_60","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1177\/0956797610384142","volume":"21","author":"Toscano Joseph C.","year":"2010","journal-title":"Psychological Science"},{"issue":"1","key":"p_62","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/S0167-6393(98)00033-8","volume":"25","author":"Viikki Olli","year":"1998","journal-title":"Speech Communication"}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/tacl_a_00071","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:38:15Z","timestamp":1615585095000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/43411"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,12]]},"references-count":45,"alternative-id":["10.1162\/tacl_a_00071"],"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00071","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,12]]}}}