{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,21]],"date-time":"2026-07-21T23:17:16Z","timestamp":1784675836694,"version":"3.55.0"},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"CSCW2","license":[{"start":{"date-parts":[[2021,10,13]],"date-time":"2021-10-13T00:00:00Z","timestamp":1634083200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"TEC, New Zealand","award":["3716751"],"award-info":[{"award-number":["3716751"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Hum.-Comput. Interact."],"published-print":{"date-parts":[[2021,10,13]]},"abstract":"<jats:p>With voice user interfaces (VUIs) becoming ubiquitous and speech synthesis technology maturing, it is possible to synthesise voices to resemble our friends and relatives (which we will collectively call 'kin') and use them on VUIs. However, designing such interfaces and investigating how the familiarity of kin voices affect user perceptions remain under-explored. Our surveys and interviews with 25 users revealed that VUIs using kin voices were perceived as more engaging, persuasive and safer yet eerier than VUIs using common virtual assistant voices. We then developed a technology probe, KinVoice, an Alexa-based VUI that was deployed in three households over two weeks. Users set reminders using KinVoice, which in turn, gave the reminders in synthesised kin voices. This was to explore users' needs, uncover challenges involved and inspire new applications. We discuss design guidelines for integrating familiar kin voices into VUIs, applications that benefit from its usage, and implications for balancing voice realism and usability with security and diversification.<\/jats:p>","DOI":"10.1145\/3479590","type":"journal-article","created":{"date-parts":[[2021,10,19]],"date-time":"2021-10-19T02:32:07Z","timestamp":1634610727000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":30,"title":["KinVoices: Using Voices of Friends and Family in Voice Interfaces"],"prefix":"10.1145","volume":"5","author":[{"given":"Sam W. T.","family":"Chan","sequence":"first","affiliation":[{"name":"Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tamil Selvan","family":"Gunasekaran","sequence":"additional","affiliation":[{"name":"The University of Auckland, Auckland, New Zealand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yun Suen","family":"Pai","sequence":"additional","affiliation":[{"name":"Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haimo","family":"Zhang","sequence":"additional","affiliation":[{"name":"Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Suranga","family":"Nanayakkara","sequence":"additional","affiliation":[{"name":"Auckland Bioengineering Institute, The University of Auckland, Auckland, New Zealand"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2021,10,18]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359996.3364754"},{"key":"e_1_2_2_2_1","volume-title":"Proceedings of the 17th International Conference on INFORMATICS in ECONOMY","author":"Airehrour David","year":"2018","unstructured":"David Airehrour, Samaneh Madanian, and Alwin Mathew Abraham. 2018. Designing a memory-aid and reminder system for dementia patients and older adults. Proceedings of the 17th International Conference on INFORMATICS in ECONOMY (2018), 75--81."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3342775.3342806"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3379336.3381478"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-008-0001--3"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3264901"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1124772.1124947"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978--981--10--2779--6_103--1"},{"key":"e_1_2_2_9_1","unstructured":"Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et al. 2020. Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","unstructured":"Joao Paulo Cabral Benjamin R. Cowan Katja Zibrek and Rachel McDonnell. 2017. The Influence of Synthetic Voice on the Evaluation of a Virtual Character.. In INTERSPEECH. 229--233. https:\/\/doi.org\/10.21437\/Interspeech.2017--325","DOI":"10.21437\/Interspeech.2017--325"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376789"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359325"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357236.3395479"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3432190"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3311823.3311870"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3405755.3406145"},{"key":"e_1_2_2_17_1","volume-title":"Marketing at the Confluence between Entertainment and Analytics","author":"Ch\u00e9rif Emna","unstructured":"Emna Ch\u00e9rif and Jean-Francc ois Lemoine. 2017. Human vs. synthetic recommendation agents' voice: The effects on consumer reactions. In Marketing at the Confluence between Entertainment and Analytics. Springer, 301--310."},{"key":"e_1_2_2_18_1","volume-title":"Anthropomorphic virtual assistants and the reactions of Internet users: An experiment on the assistant's voice. Recherche et Applications en Marketing (English Edition)","author":"Ch\u00e9rif Emna","year":"2019","unstructured":"Emna Ch\u00e9rif and Jean-Francc ois Lemoine. 2019. Anthropomorphic virtual assistants and the reactions of Internet users: An experiment on the assistant's voice. Recherche et Applications en Marketing (English Edition), Vol. 34, 1 (2019), 28--47."},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300705"},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2015.05.008"},{"key":"e_1_2_2_21_1","unstructured":"Paul Dourish. 1996. Book Review - The Media Equation: How People Treat Computers Television and New Media Like Real People and Places. https:\/\/www.dourish.com\/publications\/media-review.html Retrieved 2021-04--15 from"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3338286.3340116"},{"key":"e_1_2_2_23_1","volume-title":"A Survey Investigating Usage of Virtual Personal Assistants. arXiv preprint arXiv:1807.04606","author":"Dubiel Mateusz","year":"2018","unstructured":"Mateusz Dubiel, Martin Halvey, and Leif Azzopardi. 2018. A Survey Investigating Usage of Virtual Personal Assistants. arXiv preprint arXiv:1807.04606 (2018)."},{"key":"e_1_2_2_24_1","unstructured":"e-pill LLC. 2020. 25 Alarm Clock Reminder Rosie Reminder. https:\/\/www.epill.com\/italkclock.html Retrieved 2020-09-01 from"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3098279.3098556"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1037\/0022-006X.62.1.130"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2157689.2157717"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3344274"},{"key":"e_1_2_2_29_1","first-page":"707","article-title":"Personal sensory reminder with customizable voice message","volume":"6","author":"Flaherty Loretta M.","year":"2004","unstructured":"Loretta M. Flaherty. 2004. Personal sensory reminder with customizable voice message. US Patent 6,707,383.","journal-title":"US Patent"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.30658\/hmc.1.5"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022382413579"},{"key":"e_1_2_2_32_1","volume-title":"Who's got talent? Comparing TTS systems for comprehensibility, naturalness, and intelligibility. Future-proof CALL: language learning as exploration and encounters--short papers from EUROCALL","author":"Grimshaw Jennica","year":"2018","unstructured":"Jennica Grimshaw, Tiago Bione, and Walcir Cardoso. 2018. Who's got talent? Comparing TTS systems for comprehensibility, naturalness, and intelligibility. Future-proof CALL: language learning as exploration and encounters--short papers from EUROCALL (2018), 83--88."},{"key":"e_1_2_2_33_1","volume-title":"Voice interaction design: crafting the new conversational speech systems","author":"Harris Randy Allen","unstructured":"Randy Allen Harris. 2004. Voice interaction design: crafting the new conversational speech systems. Elsevier."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.chb.2010.05.015"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/642611.642616"},{"key":"e_1_2_2_36_1","unstructured":"Apple Inc. 2011. Siri. https:\/\/www.apple.com\/siri\/ Retrieved 2020-09-01 from"},{"key":"e_1_2_2_37_1","unstructured":"Amazon.com Inc. 2014. Alexa. https:\/\/www.amazon.com\/b?node=17934671011 Retrieved 2020-09-01 from"},{"key":"e_1_2_2_38_1","volume-title":"Computerised help information and interaction project for people with memory loss and mild dementia. Journal of Pain Manage","author":"Jawaid S.","year":"2016","unstructured":"S. Jawaid and Rachel Mccrindle. 2016. Computerised help information and interaction project for people with memory loss and mild dementia. Journal of Pain Manage (2016), 269--272."},{"key":"e_1_2_2_39_1","unstructured":"Corentin Jemine. 2019 a. Master thesis: Automatic Multispeaker Voice Cloning. (2019). https:\/\/matheo.uliege.be\/handle\/2268.2\/6801"},{"key":"e_1_2_2_40_1","unstructured":"Corentin Jemine. 2019 b. Real-Time-Voice-Cloning. https:\/\/github.com\/CorentinJ\/Real-Time-Voice-Cloning."},{"key":"e_1_2_2_41_1","volume-title":"Yonghui Wu, et al.","author":"Jia Ye","year":"2018","unstructured":"Ye Jia, Yu Zhang, Ron Weiss, Quan Wang, Jonathan Shen, Fei Ren, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu, et al. 2018. Transfer learning from speaker verification to multispeaker text-to-speech synthesis. In Advances in neural information processing systems. 4480--4490."},{"key":"e_1_2_2_42_1","volume-title":"Francis","author":"K\u00f6nig Alexandra","year":"2016","unstructured":"Alexandra K\u00f6nig, Aarti Malhotra, Jesse Hoey, and Linda E. Francis. 2016. Designing personalized prompts for a virtual assistant to support elderly care home residents.. In PervasiveHealth. 278--282."},{"key":"e_1_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.obhdp.2014.12.002"},{"key":"e_1_2_2_44_1","unstructured":"Google LLC. 2016. Google Assistant. https:\/\/assistant.google.com\/ Retrieved 2020-09-01 from"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.chb.2014.04.043"},{"key":"e_1_2_2_46_1","doi-asserted-by":"crossref","unstructured":"Miriam Meyerhoff. 2006. Introducing Sociolinguistics. Routledge.","DOI":"10.4324\/9780203966709"},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24177-7_30"},{"key":"e_1_2_2_48_1","volume-title":"3rd ACM Conference on Computer-Supported Cooperative Work and Social Computing, CSCW 2020 ; Conference date: 17--10--2020 Through 21--10--2020","author":"Muppirishetty P.","year":"2020","unstructured":"P. Muppirishetty and Minha Lee. 2020. Voice User Interfaces for mental healthcare: Leveraging technology to help our inner voice. 3rd ACM Conference on Computer-Supported Cooperative Work and Social Computing, CSCW 2020 ; Conference date: 17--10--2020 Through 21--10--2020."},{"key":"e_1_2_2_49_1","volume-title":"Wired for speech: How voice activates and advances the human-computer relationship","author":"Nass Clifford","unstructured":"Clifford Nass and Scott Brave. 2005. Wired for speech: How voice activates and advances the human-computer relationship. MIT press Cambridge, MA."},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1037\/1076-898X.7.3.171"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/223355.223538"},{"key":"e_1_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/191666.191703"},{"key":"e_1_2_2_53_1","volume-title":"Bratt","author":"Neupane Ajaya","year":"2019","unstructured":"Ajaya Neupane, Nitesh Saxena, Leanne M. Hirshfield, and Sarah E. Bratt. 2019. The Crux of Voice (In) Security: A Brain Study of Speaker Legitimacy Detection.. In NDSS."},{"key":"e_1_2_2_54_1","volume-title":"Proceedings of the 5th Nordic conference on Human-computer interaction: building bridges. 523--526","author":"Lan Swee","year":"2008","unstructured":"Andreea Niculescu, George M. White, See Swee Lan, Ratna Utari Waloejo, and Yoko Kawaguchi. 2008. Impact of English regional accents on user acceptance of voice user interfaces. In Proceedings of the 5th Nordic conference on Human-computer interaction: building bridges. 523--526."},{"key":"e_1_2_2_55_1","volume-title":"Presence 2001 Conference","author":"Nowak Kristine","year":"2001","unstructured":"Kristine Nowak. 2001. Defining and differentiating copresence, social presence and presence as transportation. In Presence 2001 Conference, Philadelphia, PA. Citeseer, 1--23."},{"key":"e_1_2_2_56_1","doi-asserted-by":"publisher","DOI":"10.1162\/105474603322761289"},{"key":"e_1_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10209-005-0120--7"},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376450"},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1177\/0265407596131005"},{"key":"e_1_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3334480.3383092"},{"key":"e_1_2_2_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300326"},{"key":"e_1_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3174214"},{"key":"e_1_2_2_63_1","volume-title":"The media equation: How people treat computers, television, and new media like real people","author":"Reeves Byron","unstructured":"Byron Reeves and Clifford Nass. 1996. The media equation: How people treat computers, television, and new media like real people. Cambridge university press Cambridge, UK."},{"key":"e_1_2_2_64_1","volume-title":"A Look Back on Baidu's AI Innovations","author":"Research Baidu","year":"2019","unstructured":"Baidu Research. 2020. A Look Back on Baidu's AI Innovations in 2019. http:\/\/research.baidu.com\/Blog\/index-view?id=130 Retrieved 2020-09-01 from"},{"key":"e_1_2_2_65_1","unstructured":"Resemble. 2019. Resemble AI. https:\/\/www.resemble.ai\/ Retrieved 2021-04--15 from"},{"key":"e_1_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2005.07.002"},{"key":"e_1_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300833"},{"key":"e_1_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-011-0100-4"},{"key":"e_1_2_2_69_1","volume-title":"A Survey on Neural Speech Synthesis. arXiv preprint arXiv:2106.15561","author":"Tan Xu","year":"2021","unstructured":"Xu Tan, Tao Qin, Frank Soong, and Tie-Yan Liu. 2021. A Survey on Neural Speech Synthesis. arXiv preprint arXiv:2106.15561 (2021)."},{"key":"e_1_2_2_70_1","volume-title":"Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies. 107--111","author":"Veaux Christophe","year":"2013","unstructured":"Christophe Veaux, Junichi Yamagishi, and Simon King. 2013. Towards personalised synthesised voices for individuals with vocal disabilities: Voice banking and reconstruction. In Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies. 107--111."},{"key":"e_1_2_2_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300772"}],"container-title":["Proceedings of the ACM on Human-Computer Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3479590","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3479590","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,14]],"date-time":"2025-07-14T04:57:13Z","timestamp":1752469033000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3479590"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,13]]},"references-count":71,"journal-issue":{"issue":"CSCW2","published-print":{"date-parts":[[2021,10,13]]}},"alternative-id":["10.1145\/3479590"],"URL":"https:\/\/doi.org\/10.1145\/3479590","relation":{},"ISSN":["2573-0142"],"issn-type":[{"value":"2573-0142","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,13]]},"assertion":[{"value":"2021-10-18","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}