{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T09:35:25Z","timestamp":1773826525805,"version":"3.50.1"},"reference-count":128,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,5,21]],"date-time":"2024-05-21T00:00:00Z","timestamp":1716249600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Comput. Sci."],"abstract":"<jats:p>This article reviews recent literature investigating speech variation in production and comprehension during spoken language communication between humans and devices. Human speech patterns toward voice-AI presents a test to our scientific understanding about speech communication and language use. First, work exploring how human-AI interactions are similar to, or different from, human-human interactions in the realm of speech variation is reviewed. In particular, we focus on studies examining how users adapt their speech when resolving linguistic misunderstandings by computers and when accommodating their speech toward devices. Next, we consider work that investigates how top-down factors in the interaction can influence users\u2019 linguistic interpretations of speech produced by technological agents and how the ways in which speech is generated (via text-to-speech synthesis, TTS) and recognized (using automatic speech recognition technology, ASR) has an effect on communication. Throughout this review, we aim to bridge both HCI frameworks and theoretical linguistic models accounting for variation in human speech. We also highlight findings in this growing area that can provide insight to the cognitive and social representations underlying linguistic communication more broadly. Additionally, we touch on the implications of this line of work for addressing major societal issues in speech technology.<\/jats:p>","DOI":"10.3389\/fcomp.2024.1384252","type":"journal-article","created":{"date-parts":[[2024,5,21]],"date-time":"2024-05-21T05:00:54Z","timestamp":1716267654000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Linguistic analysis of human-computer interaction"],"prefix":"10.3389","volume":"6","author":[{"given":"Georgia","family":"Zellou","sequence":"first","affiliation":[]},{"given":"Nicole","family":"Holliday","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,5,21]]},"reference":[{"key":"ref1","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1515\/nor-2017-0198","article-title":"Gender stereotyping of political candidates","volume":"28","author":"Aalberg","year":"2007","journal-title":"Nordicom Rev."},{"key":"ref4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3311956","article-title":"Music, search, and IoT: how people (really) use voice assistants","volume":"26","author":"Ammari","year":"2019","journal-title":"ACM Trans. Comput. Hum. Interact."},{"key":"ref5","author":"Ang","year":"2020"},{"key":"ref6","doi-asserted-by":"publisher","first-page":"045204","DOI":"10.1121\/10.0010274","article-title":"The clear speech intelligibility benefit for text-to-speech voices: effects of speaking style and visual guise","volume":"2","author":"Aoki","year":"2022","journal-title":"JASA Express Lett."},{"key":"ref7","doi-asserted-by":"publisher","first-page":"101328","DOI":"10.1016\/j.wocn.2024.101328","article-title":"Being clear about clear speech: intelligibility of hard-of-hearing-directed, non-native-directed, and casual speech for L1- and L2-English listeners","volume":"104","author":"Aoki","year":"2024","journal-title":"J. Phon."},{"key":"ref8","author":"Axon","year":"2022"},{"key":"ref9","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1016\/j.wocn.2011.09.001","article-title":"Evidence for phonetic and social selectivity in spontaneous phonetic imitation","volume":"40","author":"Babel","year":"2012","journal-title":"J. Phon."},{"key":"ref10","doi-asserted-by":"publisher","first-page":"527","DOI":"10.1080\/01690960802299378","article-title":"Mechanisms of interaction in speech production","volume":"24","author":"Baese-Berk","year":"2009","journal-title":"Lang. Cogn. Proc."},{"key":"ref11","doi-asserted-by":"publisher","first-page":"456","DOI":"10.1162\/105474603322761270","article-title":"Toward a more robust theory and measure of social presence: review and suggested criteria","volume":"12","author":"Biocca","year":"2003","journal-title":"Presence"},{"key":"ref12","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1017\/S0954394522000151","article-title":"Medium-shifting and intraspeaker variation in conversational interviews","volume":"34","author":"Bleaman","year":"2022","journal-title":"Lang. Var. Chang."},{"key":"ref13","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1016\/j.cognition.2011.05.011","article-title":"The role of beliefs in lexical alignment: evidence from dialogs with humans and computers","volume":"121","author":"Branigan","year":"2011","journal-title":"Cognition"},{"key":"ref14","first-page":"13","article-title":"Computer-and human-directed speech before and after correction","volume":"6","author":"Burnham","year":"2010","journal-title":"Spaceflight"},{"key":"ref15","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1016\/j.jml.2015.12.009","article-title":"Dynamically adapted context-specific hyper-articulation: feedback from interlocutors affects speakers\u2019 subsequent pronunciations","volume":"89","author":"Buz","year":"2016","journal-title":"J. Mem. Lang."},{"key":"ref16","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1007\/978-3-319-91244-8_39","article-title":"Are people polite to smartphones? How evaluations of smartphones depend on who is asking","volume-title":"Human-computer interaction. Interaction in context: 20th international conference, HCI international 2018, Las Vegas, NV, USA, July 15\u201320, 2018, proceedings, part II 20","author":"Carolus","year":"2018"},{"key":"ref17","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/B978-0-444-70536-5.50007-5","article-title":"Mental models in human-computer interaction","volume-title":"Handbook of Human-Computer Interaction","author":"Carroll","year":"1988"},{"key":"ref18","author":"Choe","year":"2022"},{"key":"ref19","author":"Cihan","year":"2022"},{"key":"ref20","author":"Clark","year":"1999"},{"key":"ref21","author":"Cohn","year":"2019"},{"key":"ref22","author":"Cohn","year":"2019"},{"key":"ref23","doi-asserted-by":"publisher","first-page":"101123","DOI":"10.1016\/j.wocn.2021.101123","article-title":"Acoustic-phonetic properties of Siri-and human-directed speech","volume":"90","author":"Cohn","year":"2022","journal-title":"J. Phon."},{"key":"ref24","doi-asserted-by":"publisher","first-page":"101567","DOI":"10.1016\/j.langsci.2023.101567","article-title":"Vocal accommodation to technology: the role of physical form","volume":"99","author":"Cohn","year":"2023","journal-title":"Lang. Sci."},{"key":"ref25","author":"Cohn","year":"2020"},{"key":"ref26","doi-asserted-by":"publisher","first-page":"675704","DOI":"10.3389\/fcomm.2021.675704","article-title":"Prosodic differences in human-and Alexa-directed speech, but similar local intelligibility adjustments","volume":"6","author":"Cohn","year":"2021","journal-title":"Front. Commun."},{"key":"ref27","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1016\/j.ijhcs.2015.05.008","article-title":"Voice anthropomorphism, interlocutor modelling and alignment effects on syntactic choices in human\u2212 computer dialogue","volume":"83","author":"Cowan","year":"2015","journal-title":"Int. J. Hum. Comput. Stud."},{"key":"ref28","doi-asserted-by":"publisher","first-page":"e12524","DOI":"10.1111\/desc.12524","article-title":"Accent detection and social cognition: evidence of protracted learning","volume":"21","author":"Creel","year":"2018","journal-title":"Dev. Sci."},{"key":"ref29","author":"De Renesse","year":"2017"},{"key":"ref30","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-662-46590-5","volume-title":"The new digital natives: Cutting the chord","author":"Dingli","year":"2015"},{"key":"ref31","doi-asserted-by":"publisher","first-page":"1204211","DOI":"10.3389\/fcomp.2023.1204211","article-title":"Comparing alignment toward American, British, and Indian English text-to-speech (TTS) voices: influence of social attitudes and talker guise","volume":"5","author":"Dodd","year":"2023","journal-title":"Front. Comput. Sci."},{"key":"ref32","doi-asserted-by":"publisher","first-page":"330","DOI":"10.1080\/15475441.2020.1784736","article-title":"The development of sociolinguistic competence across the lifespan: three domains of regional dialect perception","volume":"16","author":"Dossey","year":"2020","journal-title":"Lang. Learn. Dev."},{"key":"ref34","author":"Dubois","year":"2024"},{"key":"ref35","volume-title":"Jocks and burnouts: Social categories and identity in the high school","author":"Eckert","year":"1989"},{"key":"ref36","first-page":"184","article-title":"Human-machine communication in the classroom","volume-title":"Handbook of instructional communication","author":"Edwards","year":"2017"},{"key":"ref37","volume-title":"In case of emergency: How technologies mediate crisis and normalize inequality","author":"Ellcessor","year":"2022"},{"key":"ref38","author":"Ernst","year":"2020"},{"key":"ref41","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1007\/s12124-021-09668-y","article-title":"Anthropomorphizing technology: a conceptual review of anthropomorphism research and how it relates to children\u2019s engagements with digital voice assistants","volume":"56","author":"Festerling","year":"2022","journal-title":"Integr. Psychol. Behav. Sci."},{"key":"ref42","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1086\/269264","article-title":"Race-of-interviewer effects in a preelection poll Virginia 1989","volume":"55","author":"Finkel","year":"1991","journal-title":"Public Opin. Q."},{"key":"ref43","doi-asserted-by":"publisher","first-page":"71","DOI":"10.30658\/hmc","article-title":"Building a stronger CASA: extending the computers are social actors paradigm","volume":"1","author":"Gambino","year":"2020","journal-title":"Hum. Mach. Commun."},{"key":"ref44","doi-asserted-by":"publisher","first-page":"111","DOI":"10.30658\/hmc.4.6","article-title":"Considering the context to build theory in HCI, HRI, and HMC: explicating differences in processes of communication and socialization with social technologies","volume":"4","author":"Gambino","year":"2022","journal-title":"Hum. Mach. Commun."},{"key":"ref45","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/j.specom.2020.12.004","article-title":"Phonetic accommodation to natural and synthetic voices: behavior of groups and individuals in speech shadowing","volume":"127","author":"Gessinger","year":"2021","journal-title":"Speech Comm."},{"key":"ref46","first-page":"87","article-title":"Accent mobility: a model and some data","volume":"152","author":"Giles","year":"1973","journal-title":"Anthropol. Linguist."},{"key":"ref47","doi-asserted-by":"publisher","first-page":"271","DOI":"10.2190\/TCMU-0U65-XTEH-B950","article-title":"Intergenerational talk and communication with older people","volume":"34","author":"Giles","year":"1992","journal-title":"Int. J. Aging Hum. Dev."},{"key":"ref48","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1017\/S0047404500000701","article-title":"Towards a theory of interpersonal accommodation through language: some Canadian data 1","volume":"2","author":"Giles","year":"1973","journal-title":"Lang. Soc."},{"key":"ref49","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1037\/0033-295X.105.2.251","article-title":"Echoes of echoes? An episodic theory of lexical access","volume":"105","author":"Goldinger","year":"1998","journal-title":"Psychol. Rev."},{"key":"ref50","doi-asserted-by":"publisher","first-page":"716","DOI":"10.3758\/BF03196625","article-title":"Episodic memory reflected in printed word naming","volume":"11","author":"Goldinger","year":"2004","journal-title":"Psychon. Bull. Rev."},{"key":"ref51","doi-asserted-by":"publisher","first-page":"113515","DOI":"10.1016\/j.dss.2021.113515","article-title":"Mental models and expectation violations in conversational AI interactions","volume":"144","author":"Grimes","year":"2021","journal-title":"Decis. Support. Syst."},{"key":"ref01","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-02139-8","author":"Habash","year":"2010","journal-title":"Introduction to Arabic natural language processing"},{"key":"ref52","author":"Harrington","year":"2022"},{"key":"ref53","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1515\/ling.2010.027","article-title":"Stuffed toys and speech perception","volume":"48","author":"Hay","year":"2010","journal-title":"Linguistics"},{"key":"ref54","doi-asserted-by":"publisher","first-page":"458","DOI":"10.1016\/j.wocn.2005.10.001","article-title":"Factors influencing speech perception in the context of a merger-in-progress","volume":"34","author":"Hay","year":"2006","journal-title":"J. Phon."},{"key":"ref55","doi-asserted-by":"publisher","first-page":"503","DOI":"10.1080\/01411920902989227","article-title":"Digital natives: where is the evidence?","volume":"36","author":"Helsper","year":"2010","journal-title":"Br. Educ. Res. J."},{"key":"ref56","doi-asserted-by":"publisher","first-page":"642783","DOI":"10.3389\/frai.2021.642783","article-title":"Perception in black and white: effects of intonational variables and filtering conditions on sociolinguistic judgments with implications for ASR","volume":"4","author":"Holliday","year":"2021","journal-title":"Front. Artif. Intell."},{"key":"ref57","doi-asserted-by":"publisher","first-page":"1116955","DOI":"10.3389\/fcomm.2023.1116955","article-title":"Siri, you've changed! Acoustic properties and racialized judgments of voice assistants","volume":"8","author":"Holliday","year":"2023","journal-title":"Front. Commun."},{"key":"ref58","author":"Holliday","year":"2022"},{"key":"ref59","author":"Hu","year":"2019"},{"key":"ref60","first-page":"91","article-title":"The role of age stereotypes in interpersonal communication","volume-title":"Handbook of Communication and Aging Research","author":"Hummert","year":"2004"},{"key":"ref61","doi-asserted-by":"publisher","first-page":"5837","DOI":"10.1007\/s10462-022-10315-0","article-title":"Conventional and contemporary approaches used in text to speech synthesis: a review","volume":"56","author":"Kaur","year":"2023","journal-title":"Artif. Intell. Rev."},{"key":"ref62","doi-asserted-by":"publisher","first-page":"103170","DOI":"10.1016\/j.im.2019.103170","article-title":"Do (how) digital natives adopt a new technology differently than digital immigrants? A longitudinal study","volume":"57","author":"Kesharwani","year":"2020","journal-title":"Inf. Manag."},{"key":"ref63","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1515\/labphon.2011.004","article-title":"Phonetic convergence in spontaneous conversations as a function of interlocutor language distance","volume":"2","author":"Kim","year":"2011","journal-title":"Lab. Phonol."},{"key":"ref64","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1080\/21639159.2020.1808811","article-title":"Born digital: is there going to be a new culture of digital natives?","volume":"31","author":"Kincl","year":"2021","journal-title":"J. Glob. Scholars Market. Sci."},{"key":"ref65","doi-asserted-by":"publisher","first-page":"7684","DOI":"10.1073\/pnas.1915768117","article-title":"Racial disparities in automated speech recognition","volume":"117","author":"Koenecke","year":"2020","journal-title":"Proc. Natl. Acad. Sci."},{"key":"ref66","doi-asserted-by":"publisher","first-page":"362","DOI":"10.1121\/1.1635842","article-title":"Acoustic properties of naturally produced clear speech at normal speaking rates","volume":"115","author":"Krause","year":"2004","journal-title":"J. Acoust. Soc. Am."},{"key":"ref67","doi-asserted-by":"publisher","first-page":"785283","DOI":"10.3389\/fpsyg.2021.785283","article-title":"\u201cSounding Black\u201d: speech Stereotypicality activates racial stereotypes and expectations about appearance","volume":"12","author":"Kurinec","year":"2021","journal-title":"Front. Psychol."},{"key":"ref68","first-page":"221","article-title":"Linguistic change as a form of communication","volume-title":"Human communication","author":"Labov","year":"2015"},{"key":"ref69","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1111\/j.1468-2885.2004.tb00302.x","article-title":"Presence, explicated","volume":"14","author":"Lee","year":"2004","journal-title":"Commun. Theory"},{"key":"ref70","author":"Lee","year":"2008"},{"key":"ref71","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1007\/978-94-009-2037-8_16","article-title":"Explaining phonetic variation: a sketch of the H&H theory","volume-title":"Speech production and speech modelling","author":"Lindblom","year":"1990"},{"key":"ref72","volume-title":"English with an accent: Language, ideology and discrimination in the United States","author":"Lippi-Green","year":"2011"},{"key":"ref73","author":"Liu","year":"2020"},{"key":"ref74","author":"Lopatovska","year":"2018"},{"key":"ref75","author":"Lovato","year":"2015"},{"key":"ref76","author":"Lovato","year":"2019"},{"key":"ref77","doi-asserted-by":"publisher","first-page":"e1973","DOI":"10.7717\/peerj-cs.1973","article-title":"Real-time multilingual speech recognition and speaker diarization system based on whisper segmentation","volume":"10","author":"Lyu","year":"2024","journal-title":"PeerJ Comput. Sci."},{"key":"ref78","author":"Markl","year":"2021"},{"key":"ref79","author":"Mayo","year":"2012"},{"key":"ref80","doi-asserted-by":"publisher","first-page":"502","DOI":"10.1177\/0023830914565191","article-title":"Social expectation improves speech perception in noise","volume":"58","author":"McGowan","year":"2015","journal-title":"Lang. Speech"},{"key":"ref81","author":"Mendoza-Denton","year":"1997"},{"key":"ref82","doi-asserted-by":"publisher","first-page":"169","DOI":"10.3389\/frai.2021.725911","article-title":"I don\u2019t think these devices are very culturally sensitive. Impact of automated speech recognition errors on African Americans","volume":"4","author":"Mengesha","year":"2021","journal-title":"Front. Artif. Intell."},{"key":"ref85","author":"Nakamura","year":"2009"},{"key":"ref86","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1111\/0022-4537.00153","article-title":"Machines and mindlessness: social responses to computers","volume":"56","author":"Nass","year":"2000","journal-title":"J. Soc. Issues"},{"key":"ref87","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1111\/j.1559-1816.1999.tb00142.x","article-title":"Are people polite to computers? Responses to computer-based interviewing systems","volume":"29","author":"Nass","year":"1999","journal-title":"J. Appl. Soc. Psychol."},{"key":"ref89","doi-asserted-by":"publisher","first-page":"864","DOI":"10.1111\/j.1559-1816.1997.tb00275.x","article-title":"Are machines gender neutral? Gender-stereotypic responses to computers with voices","volume":"27","author":"Nass","year":"1997","journal-title":"J. Appl. Soc. Psychol."},{"key":"ref90","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1111\/j.1468-2958.1993.tb00311.x","article-title":"Voices, boxes, and sources of messages: computers and social actors","volume":"19","author":"Nass","year":"1993","journal-title":"Hum. Commun. Res."},{"key":"ref91","author":"Nass","year":"1994"},{"key":"ref92","author":"N\u00e9meth","year":"2007"},{"key":"ref93","first-page":"421","article-title":"Hey ASR system! Why aren\u2019t you more inclusive? Automatic speech recognition systems\u2019 bias and proposed bias mitigation techniques. A literature review","volume-title":"International conference on human-computer interaction","author":"Ngueajio","year":"2022"},{"key":"ref94","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1177\/0261927X99018001005","article-title":"The effect of social information on the perception of sociolinguistic variables","volume":"18","author":"Niedzielski","year":"1999","journal-title":"J. Lang. Soc. Psychol."},{"key":"ref95","author":"O\u2019Mahony","year":"2021"},{"key":"ref96","doi-asserted-by":"publisher","first-page":"101538","DOI":"10.1016\/j.csl.2023.101538","article-title":"Understanding automatic speech recognition","volume":"83","author":"O\u2019Shaughnessy","year":"2023","journal-title":"Comput. Speech Lang."},{"key":"ref97","doi-asserted-by":"publisher","first-page":"107788","DOI":"10.1016\/j.chb.2023.107788","article-title":"What affects the usage of artificial conversational agents? An agent personality and love theory perspective","volume":"145","author":"Pal","year":"2023","journal-title":"Comput. Hum. Behav."},{"key":"ref98","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1016\/j.wocn.2011.10.001","article-title":"Phonetic convergence in college roommates","volume":"40","author":"Pardo","year":"2012","journal-title":"J. Phon."},{"key":"ref99","doi-asserted-by":"publisher","first-page":"421","DOI":"10.1518\/001872000779698132","article-title":"Linguistic cues and memory for synthetic and natural speech","volume":"42","author":"Paris","year":"2000","journal-title":"Hum. Factors"},{"key":"ref100","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1201\/9781410615862.ch3","article-title":"Mental models in human-computer interaction","volume":"17","author":"Payne","year":"2007","journal-title":"Hum. Comput. Interact. Hand."},{"key":"ref101","doi-asserted-by":"publisher","first-page":"738","DOI":"10.3389\/fcomm.2024.1346738","article-title":"Linguistic patterning of laughter in human-Socialbot interactions","volume":"9","author":"Perkins Booker","year":"2024","journal-title":"Front. Commun."},{"key":"ref102","doi-asserted-by":"publisher","first-page":"434","DOI":"10.1044\/jshr.2904.434","article-title":"Speaking clearly for the hard of hearing II: acoustic characteristics of clear and conversational speech","volume":"29","author":"Picheny","year":"1986","journal-title":"J. Speech Lang. Hear. Res."},{"key":"ref103","author":"Porter","year":"2022"},{"key":"ref104","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1108\/10748120110424843","article-title":"Digital natives, digital immigrants part 2: do they really think differently?","volume":"9","author":"Prensky","year":"2001","journal-title":"Horizon"},{"key":"ref105","author":"Ram","year":"2018"},{"key":"ref106","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1007\/s10462-023-10540-1","article-title":"The role of politeness in human\u2013machine interactions: a systematic literature review and future perspectives","volume":"56","author":"Ribino","year":"2023","journal-title":"Artif. Intell. Rev."},{"key":"ref108","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1007\/BF00973770","article-title":"Nonlanguage factors affecting undergraduates' judgments of nonnative English-speaking teaching assistants","volume":"33","author":"Rubin","year":"1992","journal-title":"Res. High. Educ."},{"key":"ref109","author":"Russell","year":"2007"},{"key":"ref110","doi-asserted-by":"publisher","first-page":"3044","DOI":"10.1121\/1.4781735","article-title":"An acoustic study of real and imagined foreigner-directed speech","volume":"121","author":"Scarborough","year":"2007","journal-title":"J. Acoust. Soc. Am."},{"key":"ref111","doi-asserted-by":"publisher","first-page":"3793","DOI":"10.1121\/1.4824120","article-title":"Clarity in communication:\u201cclear\u201d speech authenticity and lexical neighborhood density effects in speech production and perception","volume":"134","author":"Scarborough","year":"2013","journal-title":"J. Acoust. Soc. Am."},{"key":"ref112","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1016\/j.wocn.2013.03.007","article-title":"Exaggeration of featural contrasts in clarifications of misheard speech in English","volume":"41","author":"Schertz","year":"2013","journal-title":"J. Phon."},{"key":"ref113","doi-asserted-by":"publisher","first-page":"422","DOI":"10.3758\/BF03194890","article-title":"Imitation in shadowing words","volume":"66","author":"Shockley","year":"2004","journal-title":"Percept. Psychophys."},{"key":"ref114","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/978-3-030-51870-7_4","article-title":"\u201cSpeech melody and speech content Didn\u2019t fit together\u201d\u2013differences in speech behavior for device directed and human directed interactions","volume":"1","author":"Siegert","year":"2021","journal-title":"Adv. Data Sci."},{"key":"ref115","doi-asserted-by":"publisher","first-page":"1677","DOI":"10.1121\/1.2000788","article-title":"Production and perception of clear speech in Croatian and English","volume":"118","author":"Smiljani\u0107","year":"2005","journal-title":"J. Acoust. Soc. Am."},{"key":"ref116","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1016\/j.chb.2018.09.014","article-title":"Searching for questions, original thoughts, or advancing theory: human-machine communication","volume":"90","author":"Spence","year":"2019","journal-title":"Comput. Hum. Behav."},{"key":"ref117","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1016\/j.csl.2017.10.004","article-title":"Predicting speech intelligibility with deep neural networks","volume":"48","author":"Spille","year":"2018","journal-title":"Comput. Speech Lang."},{"key":"ref118","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1006\/imms.1993.1028","article-title":"Mental models: concepts for human-computer interaction research","volume":"38","author":"Staggers","year":"1993","journal-title":"Int. J. Man Mach. Stud."},{"key":"ref121","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1002\/9781118426456.ch3","article-title":"Toward a theory of interactive media effects (TIME) four models for explaining how interface features affect user psychology","volume-title":"The Handbook of the Psychology of Communication Technology","author":"Sundar","year":"2015"},{"key":"ref122","author":"Sutton","year":"2019"},{"key":"ref123","author":"Uchanski","year":"2005"},{"key":"ref124","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1016\/j.specom.2006.10.003","article-title":"Do you speak E-NG-LI-SH? A comparison of foreigner-and infant-directed speech","volume":"49","author":"Uther","year":"2007","journal-title":"Speech Comm."},{"key":"ref125","author":"Van den Oord","year":"2016"},{"key":"ref126","author":"Waddell","year":"2021"},{"key":"ref127","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1016\/j.specom.2022.03.009","article-title":"Uneven success: automatic speech recognition and ethnicity-related dialects","volume":"140","author":"Wassink","year":"2022","journal-title":"Speech Comm."},{"key":"ref128","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1177\/1745691610369336","article-title":"Who sees human? The stability and importance of individual differences in anthropomorphism","volume":"5","author":"Waytz","year":"2010","journal-title":"Perspect. Psychol. Sci."},{"key":"ref129","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.3758\/s13423-022-02218-6","article-title":"Automatic imitation of human and computer-generated vocal stimuli","volume":"30","author":"Wilt","year":"2022","journal-title":"Psychon. Bull. Rev."},{"key":"ref130","doi-asserted-by":"crossref","DOI":"10.1002\/9780470714089","volume-title":"Distant speech recognition","author":"W\u00f6lfel","year":"2009"},{"key":"ref131","author":"Wood","year":"2018"},{"key":"ref132","author":"Wu","year":"2020"},{"key":"ref133","author":"Yamagishi","year":"2004"},{"key":"ref134","doi-asserted-by":"publisher","first-page":"3424","DOI":"10.1121\/10.0004989","article-title":"Partial compensation for coarticulatory vowel nasalization across concatenative and neural text-to-speech","volume":"149","author":"Zellou","year":"2021","journal-title":"J. Acoust. Soc. Am."},{"key":"ref135","doi-asserted-by":"publisher","first-page":"600361","DOI":"10.3389\/fcomm.2020.600361","article-title":"Age-and gender-related differences in speech alignment toward humans and voice-AI","volume":"5","author":"Zellou","year":"2021","journal-title":"Front. Commun."},{"key":"ref137","doi-asserted-by":"publisher","first-page":"692","DOI":"10.1353\/lan.2023.a914191","article-title":"Listener beliefs and perceptual learning: differences between device and human guises","volume":"99","author":"Zellou","year":"2023","journal-title":"Language"},{"key":"ref138","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1038\/s41598-023-50516-3","article-title":"Linguistic disparities in cross-language automatic speech recognition transfer from Arabic to Tashlhiyt","volume":"14","author":"Zellou","year":"2024","journal-title":"Sci. Rep."},{"key":"ref139","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1016\/j.specom.2009.04.004","article-title":"Statistical parametric speech synthesis","volume":"51","author":"Zen","year":"2009","journal-title":"Speech Comm."}],"container-title":["Frontiers in Computer Science"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fcomp.2024.1384252\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,21]],"date-time":"2024-05-21T05:01:06Z","timestamp":1716267666000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fcomp.2024.1384252\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,21]]},"references-count":128,"alternative-id":["10.3389\/fcomp.2024.1384252"],"URL":"https:\/\/doi.org\/10.3389\/fcomp.2024.1384252","relation":{},"ISSN":["2624-9898"],"issn-type":[{"value":"2624-9898","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,21]]},"article-number":"1384252"}}