{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T05:19:12Z","timestamp":1751606352623,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,6,12]],"date-time":"2023-06-12T00:00:00Z","timestamp":1686528000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"name":"\u0152uvre f\u00e9d\u00e9rale Les Amis des Aveugles et Malvoyants ASBL"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,6,12]]},"DOI":"10.1145\/3573381.3596471","type":"proceedings-article","created":{"date-parts":[[2023,8,29]],"date-time":"2023-08-29T17:51:36Z","timestamp":1693331496000},"page":"248-253","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Developing an Interactive Agent for Blind and Visually Impaired People"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5850-9817","authenticated-orcid":false,"given":"Vincent","family":"Stragier","sequence":"first","affiliation":[{"name":"Numediart Institute, ISIA Lab, Faculty of Engineering, University of Mons, Belgium"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0573-8480","authenticated-orcid":false,"given":"Omar","family":"Seddati","sequence":"additional","affiliation":[{"name":"Numediart Institute, ISIA Lab, Faculty of Engineering, University of Mons, 
Belgium"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7024-2150","authenticated-orcid":false,"given":"Thierry","family":"Dutoit","sequence":"additional","affiliation":[{"name":"Numediart Institute, ISIA Lab, Faculty of Engineering, University of Mons, Belgium"}]}],"member":"320","published-online":{"date-parts":[[2023,8,29]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n. d.]. JAWS - Logiciel de lecture d\u2019\u00e9cran avec retour vocale et braille. https:\/\/sensotec.be\/fr\/produit\/jaws\/.  [n. d.]. JAWS - Logiciel de lecture d\u2019\u00e9cran avec retour vocale et braille. https:\/\/sensotec.be\/fr\/produit\/jaws\/."},{"key":"e_1_3_2_1_2_1","unstructured":"2012. Various Documents Related to Tesseract OCR. https:\/\/tesseract-ocr.github.io\/docs\/.  2012. Various Documents Related to Tesseract OCR. https:\/\/tesseract-ocr.github.io\/docs\/."},{"key":"e_1_3_2_1_3_1","unstructured":"2017. NV Access.  2017. NV Access."},{"key":"e_1_3_2_1_4_1","unstructured":"2022. Envision App. https:\/\/www.letsenvision.com\/app.  2022. Envision App. https:\/\/www.letsenvision.com\/app."},{"key":"e_1_3_2_1_5_1","unstructured":"2022. Lookout - Vision Assist\u00e9e \u2013 Applications Sur Google Play.  2022. Lookout - Vision Assist\u00e9e \u2013 Applications Sur Google Play."},{"key":"e_1_3_2_1_6_1","unstructured":"2022. Seeing AI App from Microsoft. https:\/\/www.microsoft.com\/en-us\/ai\/seeing-ai.  2022. Seeing AI App from Microsoft. https:\/\/www.microsoft.com\/en-us\/ai\/seeing-ai."},{"key":"e_1_3_2_1_7_1","unstructured":"2023. Activer VoiceOver et s\u2019entra\u00eener \u00e0 utiliser les gestes sur l\u2019iPhone. https:\/\/support.apple.com\/fr-fr\/guide\/iphone\/iph3e2e415f\/ios.  2023. Activer VoiceOver et s\u2019entra\u00eener \u00e0 utiliser les gestes sur l\u2019iPhone. https:\/\/support.apple.com\/fr-fr\/guide\/iphone\/iph3e2e415f\/ios."},{"key":"e_1_3_2_1_8_1","unstructured":"2023. C\u00e9cit\u00e9 et d\u00e9ficience visuelle. 
https:\/\/www.who.int\/fr\/news-room\/fact-sheets\/detail\/blindness-and-visual-impairment.  2023. C\u00e9cit\u00e9 et d\u00e9ficience visuelle. https:\/\/www.who.int\/fr\/news-room\/fact-sheets\/detail\/blindness-and-visual-impairment."},{"key":"e_1_3_2_1_9_1","unstructured":"2023. Get Started on Android with TalkBack - Android Accessibility Help. https:\/\/support.google.com\/accessibility\/android\/answer\/6283677?hl=en-GB.  2023. Get Started on Android with TalkBack - Android Accessibility Help. https:\/\/support.google.com\/accessibility\/android\/answer\/6283677?hl=en-GB."},{"key":"e_1_3_2_1_10_1","unstructured":"2023. GPT-4. https:\/\/openai.com\/product\/gpt-4.  2023. GPT-4. https:\/\/openai.com\/product\/gpt-4."},{"key":"e_1_3_2_1_11_1","unstructured":"2023. Introducing Our Virtual Volunteer Tool for People Who Are Blind or Have Low Vision Powered by OpenAI\u2019s GPT-4. https:\/\/www.bemyeyes.com\/blog\/introducing-be-my-eyes-virtual-volunteer.  2023. Introducing Our Virtual Volunteer Tool for People Who Are Blind or Have Low Vision Powered by OpenAI\u2019s GPT-4. https:\/\/www.bemyeyes.com\/blog\/introducing-be-my-eyes-virtual-volunteer."},{"key":"e_1_3_2_1_12_1","unstructured":"2023. OOrion. https:\/\/apps.apple.com\/fr\/app\/oorion\/id1567957213.  2023. OOrion. https:\/\/apps.apple.com\/fr\/app\/oorion\/id1567957213."},{"key":"#cr-split#-e_1_3_2_1_13_1.1","unstructured":"Jiankang Deng Jia Guo Yuxiang Zhou Jinke Yu Irene Kotsia and Stefanos Zafeiriou. 2019. RetinaFace: Single-stage Dense Face Localisation in the Wild. https:\/\/doi.org\/10.48550\/arXiv.1905.00641 arxiv:1905.00641\u00a0[cs] 10.48550\/arXiv.1905.00641"},{"key":"#cr-split#-e_1_3_2_1_13_1.2","doi-asserted-by":"crossref","unstructured":"Jiankang Deng Jia Guo Yuxiang Zhou Jinke Yu Irene Kotsia and Stefanos Zafeiriou. 2019. RetinaFace: Single-stage Dense Face Localisation in the Wild. 
https:\/\/doi.org\/10.48550\/arXiv.1905.00641 arxiv:1905.00641\u00a0[cs]","DOI":"10.1109\/CVPR42600.2020.00525"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TTS.2020.2992344"},{"key":"e_1_3_2_1_15_1","unstructured":"Alexander Dunn John Dagdelen Nicholas Walker Sanghoon Lee Andrew\u00a0S. Rosen Gerbrand Ceder Kristin Persson and Anubhav Jain. 2022. Structured Information Extraction from Complex Scientific Text with Fine-Tuned Large Language Models. arxiv:2212.05238\u00a0[cond-mat]  Alexander Dunn John Dagdelen Nicholas Walker Sanghoon Lee Andrew\u00a0S. Rosen Gerbrand Ceder Kristin Persson and Anubhav Jain. 2022. Structured Information Extraction from Complex Scientific Text with Fine-Tuned Large Language Models. arxiv:2212.05238\u00a0[cond-mat]"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0231968"},{"key":"#cr-split#-e_1_3_2_1_17_1.1","unstructured":"Charles\u00a0F. Jekel and Raphael\u00a0T. Haftka. 2018. Classifying Online Dating Profiles on Tinder Using FaceNet Facial Embeddings. https:\/\/doi.org\/10.48550\/arXiv.1803.04347 arxiv:1803.04347\u00a0[cs eess stat] Comment: 6 pages 7 figures. 10.48550\/arXiv.1803.04347"},{"key":"#cr-split#-e_1_3_2_1_17_1.2","unstructured":"Charles\u00a0F. Jekel and Raphael\u00a0T. Haftka. 2018. Classifying Online Dating Profiles on Tinder Using FaceNet Facial Embeddings. https:\/\/doi.org\/10.48550\/arXiv.1803.04347 arxiv:1803.04347\u00a0[cs eess stat] Comment: 6 pages 7 figures."},{"volume-title":"Computer Recognition of Human Faces","author":"Kanade Takeo","key":"e_1_3_2_1_18_1","unstructured":"Takeo Kanade . 1977. Computer Recognition of Human Faces . Birkh\u00e4user Basel , Basel . https:\/\/doi.org\/10.1007\/978-3-0348-5737-6 10.1007\/978-3-0348-5737-6 Takeo Kanade. 1977. Computer Recognition of Human Faces. Birkh\u00e4user Basel, Basel. 
https:\/\/doi.org\/10.1007\/978-3-0348-5737-6"},{"key":"#cr-split#-e_1_3_2_1_19_1.1","unstructured":"Parminder Kaur Mayuri Ganore Rucha Doiphode Ashwini Garud and Tejaswini Ghuge. 2017. Be My Eyes : Android App for Visually Impaired People. https:\/\/doi.org\/10.13140\/RG.2.2.12307.48164 10.13140\/RG.2.2.12307.48164"},{"key":"#cr-split#-e_1_3_2_1_19_1.2","unstructured":"Parminder Kaur Mayuri Ganore Rucha Doiphode Ashwini Garud and Tejaswini Ghuge. 2017. Be My Eyes : Android App for Visually Impaired People. https:\/\/doi.org\/10.13140\/RG.2.2.12307.48164"},{"key":"#cr-split#-e_1_3_2_1_20_1.1","doi-asserted-by":"crossref","unstructured":"Edgar Kaziakhmedov Klim Kireev Grigorii Melnikov Mikhail Pautov and Aleksandr Petiushko. 2019. Real-World Attack on MTCNN Face Detection System. In 2019 International Multi-Conference on Engineering Computer and Information Sciences (SIBIRCON). 0422-0427. https:\/\/doi.org\/10.1109\/SIBIRCON48586.2019.8958122 10.1109\/SIBIRCON48586.2019.8958122","DOI":"10.1109\/SIBIRCON48586.2019.8958122"},{"key":"#cr-split#-e_1_3_2_1_20_1.2","doi-asserted-by":"crossref","unstructured":"Edgar Kaziakhmedov Klim Kireev Grigorii Melnikov Mikhail Pautov and Aleksandr Petiushko. 2019. Real-World Attack on MTCNN Face Detection System. In 2019 International Multi-Conference on Engineering Computer and Information Sciences (SIBIRCON). 0422-0427. https:\/\/doi.org\/10.1109\/SIBIRCON48586.2019.8958122","DOI":"10.1109\/SIBIRCON48586.2019.8958122"},{"key":"e_1_3_2_1_21_1","volume-title":"SSD: Single Shot MultiBox Detector. Vol.\u00a09905. 21\u201337. https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2 arxiv:1512.02325\u00a0[cs] Comment: ECCV","author":"Liu Wei","year":"2016","unstructured":"Wei Liu , Dragomir Anguelov , Dumitru Erhan , Christian Szegedy , Scott Reed , Cheng-Yang Fu , and Alexander\u00a0 C. Berg . 2016 . SSD: Single Shot MultiBox Detector. Vol.\u00a09905. 21\u201337. 
https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2 arxiv:1512.02325\u00a0[cs] Comment: ECCV 2016. 10.1007\/978-3-319-46448-0_2 Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander\u00a0C. Berg. 2016. SSD: Single Shot MultiBox Detector. Vol.\u00a09905. 21\u201337. https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2 arxiv:1512.02325\u00a0[cs] Comment: ECCV 2016."},{"key":"#cr-split#-e_1_3_2_1_22_1.1","doi-asserted-by":"crossref","unstructured":"Birgit Lugrin Catherine Pelachaud and David Traum (Eds.). 2021. The Handbook on Socially Interactive Agents: 20 Years of Research on Embodied Conversational Agents Intelligent Virtual Agents and Social Robotics Volume 1: Methods Behavior Cognition (first ed.). ACM New York NY USA. https:\/\/doi.org\/10.1145\/3477322 10.1145\/3477322","DOI":"10.1145\/3477322.3477324"},{"key":"#cr-split#-e_1_3_2_1_22_1.2","doi-asserted-by":"crossref","unstructured":"Birgit Lugrin Catherine Pelachaud and David Traum (Eds.). 2021. The Handbook on Socially Interactive Agents: 20 Years of Research on Embodied Conversational Agents Intelligent Virtual Agents and Social Robotics Volume 1: Methods Behavior Cognition (first ed.). ACM New York NY USA. https:\/\/doi.org\/10.1145\/3477322","DOI":"10.1145\/3477322.3477324"},{"key":"e_1_3_2_1_23_1","unstructured":"MIPsoft. 2023. BlindSquare. https:\/\/apps.apple.com\/fr\/app\/blindsquare\/id500557255.  MIPsoft. 2023. BlindSquare. https:\/\/apps.apple.com\/fr\/app\/blindsquare\/id500557255."},{"key":"e_1_3_2_1_24_1","volume-title":"CHI 2019 Workshop on Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions","author":"Moore K.","year":"2019","unstructured":"Roger\u00a0 K. Moore . 2019 . A \u2019Canny\u2019 Approach to Spoken Language Interfaces. 
arxiv:1908.08131\u00a0[cs] Comment : Presented at the CHI 2019 Workshop on Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions , 4-9 May 2019, Glasgow, UK. Roger\u00a0K. Moore. 2019. A \u2019Canny\u2019 Approach to Spoken Language Interfaces. arxiv:1908.08131\u00a0[cs] Comment: Presented at the CHI 2019 Workshop on Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions, 4-9 May 2019, Glasgow, UK."},{"key":"e_1_3_2_1_25_1","unstructured":"Rafal Naczyk. 2020. Eqla. https:\/\/eqla.be.  Rafal Naczyk. 2020. Eqla. https:\/\/eqla.be."},{"key":"#cr-split#-e_1_3_2_1_27_1.1","unstructured":"Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll\u00a0L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul Christiano Jan Leike and Ryan Lowe. 2022. Training Language Models to Follow Instructions with Human Feedback. https:\/\/doi.org\/10.48550\/arXiv.2203.02155 arxiv:2203.02155\u00a0[cs] 10.48550\/arXiv.2203.02155"},{"key":"#cr-split#-e_1_3_2_1_27_1.2","unstructured":"Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll\u00a0L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul Christiano Jan Leike and Ryan Lowe. 2022. Training Language Models to Follow Instructions with Human Feedback. https:\/\/doi.org\/10.48550\/arXiv.2203.02155 arxiv:2203.02155\u00a0[cs]"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.29.41"},{"key":"e_1_3_2_1_29_1","unstructured":"Alex\u00a0P Pentland. [n. d.]. Face Recognition Using Eigenfaces. ([n. d.]).  Alex\u00a0P Pentland. [n. d.]. Face Recognition Using Eigenfaces. ([n. 
d.])."},{"key":"e_1_3_2_1_30_1","volume-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition","author":"Schroff Florian","year":"2015","unstructured":"Florian Schroff , Dmitry Kalenichenko , and James Philbin . 2015. FaceNet: A Unified Embedding for Face Recognition and Clustering. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 815\u2013823. https:\/\/doi.org\/10.1109\/CVPR.2015.7298682 arxiv:1503.03832\u00a0[cs] Comment: Also published , in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2015 . 10.1109\/CVPR.2015.7298682 Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A Unified Embedding for Face Recognition and Clustering. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 815\u2013823. https:\/\/doi.org\/10.1109\/CVPR.2015.7298682 arxiv:1503.03832\u00a0[cs] Comment: Also published, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2015."},{"key":"e_1_3_2_1_31_1","unstructured":"Sefik Serengil. 2018. Facial Expression Recognition with Keras.  Sefik Serengil. 2018. Facial Expression Recognition with Keras."},{"key":"e_1_3_2_1_32_1","unstructured":"Sefik Serengil. 2022. Deep Face Detection with Mediapipe.  Sefik Serengil. 2022. Deep Face Detection with Mediapipe."},{"key":"e_1_3_2_1_33_1","unstructured":"Sefik\u00a0Ilkin Serengil. 2023. Deepface.  Sefik\u00a0Ilkin Serengil. 2023. Deepface."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICACCCT.2016.7831628"},{"key":"e_1_3_2_1_35_1","unstructured":"SITNFlash. 2020. Racial Discrimination in Face Recognition Technology.  SITNFlash. 2020. 
Racial Discrimination in Face Recognition Technology."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000013087.49260.fb"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3844\/jmrsp.2019.1.32"},{"volume-title":"Workshop, Teven\u00a0Le Scao, Angela Fan, et. al.","year":"2023","key":"e_1_3_2_1_38_1","unstructured":"BigScience Workshop, Teven\u00a0Le Scao, Angela Fan, et. al. , 2023 . BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. https:\/\/doi.org\/10.48550\/arXiv.2211.05100 arxiv:2211.05100\u00a0[cs] 10.48550\/arXiv.2211.05100 BigScience Workshop, Teven\u00a0Le Scao, Angela Fan, et. al., 2023. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. https:\/\/doi.org\/10.48550\/arXiv.2211.05100 arxiv:2211.05100\u00a0[cs]"},{"key":"e_1_3_2_1_39_1","volume-title":"Real-Time Face Detection Based on YOLO. In 2018 1st IEEE International Conference on Knowledge Innovation and Invention (ICKII). 221\u2013224","author":"Yang Wang","year":"2018","unstructured":"Wang Yang and Zheng Jiachun . 2018 . Real-Time Face Detection Based on YOLO. In 2018 1st IEEE International Conference on Knowledge Innovation and Invention (ICKII). 221\u2013224 . https:\/\/doi.org\/10.1109\/ICKII.2018.8569109 10.1109\/ICKII.2018.8569109 Wang Yang and Zheng Jiachun. 2018. Real-Time Face Detection Based on YOLO. In 2018 1st IEEE International Conference on Knowledge Innovation and Invention (ICKII). 221\u2013224. 
https:\/\/doi.org\/10.1109\/ICKII.2018.8569109"}],"event":{"name":"IMX '23: ACM International Conference on Interactive Media Experiences","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGMM ACM Special Interest Group on Multimedia","SIGCHI ACM Special Interest Group on Computer-Human Interaction"],"location":"Nantes France","acronym":"IMX '23"},"container-title":["Proceedings of the 2023 ACM International Conference on Interactive Media Experiences"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3573381.3596471","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3573381.3596471","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:25Z","timestamp":1750178245000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3573381.3596471"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,12]]},"references-count":44,"alternative-id":["10.1145\/3573381.3596471","10.1145\/3573381"],"URL":"https:\/\/doi.org\/10.1145\/3573381.3596471","relation":{},"subject":[],"published":{"date-parts":[[2023,6,12]]},"assertion":[{"value":"2023-08-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}