{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:21:38Z","timestamp":1760149298192,"version":"build-2065373602"},"reference-count":36,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2023,7,31]],"date-time":"2023-07-31T00:00:00Z","timestamp":1690761600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>As formant frequencies of vowel sounds are critical acoustic cues for vowel perception, human listeners need to be sensitive to formant frequency change. Numerous studies have found that formant frequency discrimination is affected by many factors like formant frequency, speech level, and fundamental frequency. Theoretically, to perceive a formant frequency change, human listeners with normal hearing may need a relatively constant change in the excitation and loudness pattern, and this internal change in auditory processing is independent of vowel category. Thus, the present study examined whether such metrics could explain the effects of formant frequency and speech level on formant frequency discrimination thresholds. Moreover, a simulation model based on the auditory excitation-pattern and loudness-pattern models was developed to simulate the auditory processing of vowel signals and predict thresholds of vowel formant discrimination. The results showed that predicted thresholds based on auditory metrics incorporating auditory excitation or loudness patterns near the target formant showed high correlations and low root-mean-square errors with human behavioral thresholds in terms of the effects of formant frequency and speech level). In addition, the simulation model, which particularly simulates the spectral processing of acoustic signals in the human auditory system, may be used to evaluate the auditory perception of speech signals for listeners with hearing impairments and\/or different language backgrounds.<\/jats:p>","DOI":"10.3390\/info14080429","type":"journal-article","created":{"date-parts":[[2023,7,31]],"date-time":"2023-07-31T02:13:32Z","timestamp":1690769612000},"page":"429","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Auditory Models for Formant Frequency Discrimination of Vowel Sounds"],"prefix":"10.3390","volume":"14","author":[{"given":"Can","family":"Xu","sequence":"first","affiliation":[{"name":"Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin, Austin, TX 78712, USA"}]},{"given":"Chang","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin, Austin, TX 78712, USA"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,31]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Liu, C. (2009, January 11\u201313). Auditory model of intensity discrimination and vowel formant discrimination: Effect of signal frequency. Proceedings of the 2009 3rd International Conference on Bioinformatics and Biomedical Engineering, Beijing, China.","DOI":"10.1109\/ICBBE.2009.5162425"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1654","DOI":"10.1121\/1.421264","article-title":"Auditory models of formant frequency discrimination for isolated vowels","volume":"103","author":"Zheng","year":"1998","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2114","DOI":"10.1121\/1.397862","article-title":"Auditory-perceptual interpretation of the vowel","volume":"85","author":"Miller","year":"1989","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2088","DOI":"10.1121\/1.397861","article-title":"Static, dynamic, and relational properties in vowel perception","volume":"85","author":"Nearey","year":"1989","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"3099","DOI":"10.1121\/1.411872","article-title":"Acoustic characteristics of American English vowels","volume":"97","author":"Hillenbrand","year":"1995","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1121\/1.410024","article-title":"Formant frequency discrimination for isolated English vowels","volume":"95","author":"Watson","year":"1994","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1755","DOI":"10.1121\/1.420085","article-title":"Frequency discrimination of stylized synthetic vowels with a single formant","volume":"102","author":"Lyzenga","year":"1997","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2956","DOI":"10.1121\/1.423878","article-title":"Frequency discrimination of stylized synthetic vowels with two formants","volume":"104","author":"Lyzenga","year":"1998","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2945","DOI":"10.1121\/1.428134","article-title":"Vowel formant discrimination: Towards more ordinary listening conditions","volume":"106","author":"Zheng","year":"1999","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"2141","DOI":"10.1121\/1.1400737","article-title":"Vowel formant discrimination II: Effects of stimulus uncertainty, consonantal context, and training","volume":"110","year":"2001","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2923","DOI":"10.1121\/1.1612490","article-title":"Discrimination and identification of vowels by young, hearing-impaired adults","volume":"114","author":"Richie","year":"2003","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2855","DOI":"10.1121\/1.2781580","article-title":"Factors affecting vowel formant discrimination by hearing-impaired listeners","volume":"122","author":"Liu","year":"2007","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"EL52","DOI":"10.1121\/1.2884085","article-title":"Rollover effect of signal level on vowel formant discrimination","volume":"123","author":"Liu","year":"2008","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2462","DOI":"10.1121\/1.417954","article-title":"Fundamental frequency effects on thresholds of vowel formant discrimination","volume":"100","author":"Li","year":"1996","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"3139","DOI":"10.1121\/1.413106","article-title":"Thresholds of formant-frequency discrimination of vowels in consonantal context","volume":"97","year":"1995","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1055\/s-0040-1715947","article-title":"Temporally jittered speech produces performance intensity, phonetically balanced rollover in young normal-hearing listeners","volume":"13","author":"Miranda","year":"2002","journal-title":"J. Am. Acad. Audiol."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1121\/1.1605151","article-title":"Effects of high presentation levels on recognitions of low- and high frequency speech","volume":"4","author":"Molis","year":"2003","journal-title":"Acoust. Res. Lett. Online"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2431","DOI":"10.1121\/1.426848","article-title":"Monosyllabic word recognition at higher-than-normal speech and noise levels","volume":"105","author":"Studebaker","year":"1999","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/0378-5955(90)90170-T","article-title":"Derivation of auditory filter shapes from notched-noise data","volume":"47","author":"Glasberg","year":"1990","journal-title":"Hear. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"3770","DOI":"10.1121\/1.414972","article-title":"Modeling formant frequency discrimination of female vowels","volume":"99","author":"Sommers","year":"1996","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1016\/0378-5955(87)90050-5","article-title":"Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns","volume":"28","author":"Moore","year":"1987","journal-title":"Hear. Res."},{"key":"ref_22","first-page":"335","article-title":"A revision of Zwicker\u2019s loudness model","volume":"82","author":"Moore","year":"1996","journal-title":"Acta Acust. United Acust."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1044\/1059-0889(2012\/12-0044)","article-title":"Effects of signal level and spectral contrast on vowel formant discrimination","volume":"22","author":"Woodall","year":"2013","journal-title":"Am. J. Audiol."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/S0167-6393(98)00085-5","article-title":"Restructuring speech representations using a pitch-adaptive time\u2013frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds","volume":"27","author":"Kawahara","year":"1999","journal-title":"Speech Commun."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"750","DOI":"10.1121\/1.389861","article-title":"Suggested formulae for calculating auditory-filter bandwidths and excitation patterns","volume":"74","author":"Moore","year":"1983","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/S0378-5955(03)00347-2","article-title":"A revised model of loudness perception applied to cochlear hearing loss","volume":"188","author":"Moore","year":"2004","journal-title":"Hear. Res."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1037\/h0046162","article-title":"On the psychophysical law","volume":"64","author":"Stevens","year":"1957","journal-title":"Psychol. Rev."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1037\/h0021703","article-title":"A model of loudness summation","volume":"72","author":"Zwicker","year":"1965","journal-title":"Psychol. Rev."},{"key":"ref_29","unstructured":"Deng, L., and O\u2019Shaughnessy, D. (2003). Speech Processing: A Dynamic and Optimization-Oriented Approach, Routledge."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"3615","DOI":"10.1121\/1.414959","article-title":"A quantitative model of the \u201ceffective\u201d signal processing in the auditory system. I. Model structure","volume":"99","author":"Dau","year":"1996","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"3623","DOI":"10.1121\/1.414960","article-title":"A quantitative model of the \u201ceffective\u201d signal processing in the auditory system. II. Simulations and measurements","volume":"99","author":"Dau","year":"1996","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"2892","DOI":"10.1121\/1.420344","article-title":"Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers","volume":"102","author":"Dau","year":"1997","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2906","DOI":"10.1121\/1.420345","article-title":"Modeling auditory processing of amplitude modulation. II. Spectral and temporal integration","volume":"102","author":"Dau","year":"1997","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1633","DOI":"10.1121\/1.394518","article-title":"Distribution of auditory-filter bandwidths at 2 kHz in young normal listeners","volume":"81","author":"Moore","year":"1987","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_35","first-page":"906","article-title":"Development and evaluation of a model for predicting the audibility of time-varying sounds in the presence of background sounds","volume":"53","author":"Glasberg","year":"2005","journal-title":"J. Audio Eng. Soc."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"EL189","DOI":"10.1121\/1.4742318","article-title":"Formant discrimination of speech and non-speech sounds for English and Chinese listeners","volume":"132","author":"Liu","year":"2012","journal-title":"J. Acoust. Soc. Am."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/8\/429\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:22:51Z","timestamp":1760127771000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/8\/429"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,31]]},"references-count":36,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2023,8]]}},"alternative-id":["info14080429"],"URL":"https:\/\/doi.org\/10.3390\/info14080429","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2023,7,31]]}}}