{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T15:55:25Z","timestamp":1776095725352,"version":"3.50.1"},"reference-count":188,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2024,10,23]],"date-time":"2024-10-23T00:00:00Z","timestamp":1729641600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Hum.-Robot Interact."],"published-print":{"date-parts":[[2024,12,31]]},"abstract":"<jats:p>\n            Social robots in our daily surroundings, like personal guides, waiter robots, home helpers, assistive robots, and telepresence\/teleoperation robots, are increasing day by day. Their usability and acceptability largely depend on their explicit and implicit interaction capability with fellow human beings. As a result, social behavior is one of the most sought-after qualities that a robot can possess. However, there is no specific aspect and\/or feature that defines socially acceptable behavior, and it largely depends on the situation, application, and society. In this article, we investigate one such social behavior for collocated robots. Imagine a group of people is interacting with each other, and we want to join the group. We as human beings do it in a socially acceptable manner, i.e., within the group, we do position ourselves in such a way that we can participate in the group activity without disturbing\/obstructing anybody. To possess such a quality, first, a robot needs to determine the formation of the group and then determine a position for itself, which we humans do implicitly. There are many theories which study group formations and proxemics; one such theory is f-formation which could be utilized for this purpose. As the types of formations can be very diverse, detecting the social groups is not a trivial task. In this article, we provide a comprehensive survey of the existing work on social interaction and group detection using f-formation for robotics and other applications. We also put forward a novel holistic survey framework combining some of the possibly more important concerns and modules relevant to this problem. We define taxonomies based on methods, camera views, datasets, detection capabilities and scale, evaluation approaches, and application areas. We discuss certain open challenges and limitations in the current literature along with possible future research directions based on this framework. In particular, we discuss the existing methods\/techniques and their relative merits and demerits, applications, and provide a set of unsolved but relevant problems in this domain. The official website for this work is available at:\n            <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/HrishavBakulBarua\/Social-Robots-F-formation\">https:\/\/github.com\/HrishavBakulBarua\/Social-Robots-F-formation<\/jats:ext-link>\n          <\/jats:p>","DOI":"10.1145\/3682072","type":"journal-article","created":{"date-parts":[[2024,7,29]],"date-time":"2024-07-29T14:17:16Z","timestamp":1722262636000},"page":"1-48","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Enabling Social Robots to Perceive and Join Socially Interacting Groups Using F-formation: A Comprehensive Overview"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5568-3804","authenticated-orcid":false,"given":"Hrishav Bakul","family":"Barua","sequence":"first","affiliation":[{"name":"TCS Research, Kolkata, India and Faculty of Information Technology, Monash University, Melbourne, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-3039-6120","authenticated-orcid":false,"given":"Theint Haythi","family":"Mg","sequence":"additional","affiliation":[{"name":"University of G\u00f6ttingen, G\u00f6ttingen, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0897-1835","authenticated-orcid":false,"given":"Pradip","family":"Pramanick","sequence":"additional","affiliation":[{"name":"TCS Research, Kolkata, India and Interdepartmental Center for Advances in Robotic Surgery\u2014ICAROS, University of Naples Federico II, Naples, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4777-2086","authenticated-orcid":false,"given":"Chayan","family":"Sarkar","sequence":"additional","affiliation":[{"name":"TCS Research, New Delhi, India and Computer Science, Yale University, New Haven, CT, USA"}]}],"member":"320","published-online":{"date-parts":[[2024,10,23]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-021-00419-9"},{"key":"e_1_3_3_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/1922649.1922653"},{"key":"e_1_3_3_4_2","volume-title":"8th International Conference on Machine Vision (ICMV 2015)","volume":"9875","author":"Aghaei Maedeh","year":"2015","unstructured":"Maedeh Aghaei, Mariella Dimiccoli, and Petia Radeva. 2015. Towards social interaction detection in egocentric photo-streams. In 8th International Conference on Machine Vision (ICMV 2015), Vol. 9875. International Society for Optics and Photonics, 987514."},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2016.7900087"},{"key":"e_1_3_3_6_2","unstructured":"AImageLab. 2021a. AImageLab datasets. Retrieved from http:\/\/imagelab.ing.unimore.it\/files\/EGO-GROUP.zip"},{"key":"e_1_3_3_7_2","unstructured":"AImageLab. 2021b. AImageLab datasets. Retrieved from http:\/\/imagelab.ing.unimore.it\/files\/EGO-HPE.zip"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3122865.3122869"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2496269"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806238"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2015.06.006"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2014.91"},{"key":"e_1_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2004.1308100"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5540156"},{"key":"e_1_3_3_15_2","unstructured":"University of Michigan (Ann Arbor). 2021. Collective Activity Dataset. Retrieved from http:\/\/www.eecs.umich.edu\/vision\/activity-dataset.html"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-25554-5_5"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.5898\/JHRI.6.3.Ball"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2013.10.003"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN47096.2020.9223469"},{"key":"e_1_3_3_20_2","unstructured":"Hrishav Bakul Barua Ganesh Krishnasamy KokSheik Wong Abhinav Dhall and Kalin Stefanov. 2024a. HistoHDR-Net: Histogram equalization for single LDR to HDR image translation. arXiv:2402.06692."},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/APSIPAASC58517.2023.10317568"},{"key":"e_1_3_3_22_2","unstructured":"Hrishav Bakul Barua Pradip Pramanick and Chayan Sarkar. 2021. System and method for enabling robot to perceive and detect socially interacting groups. US Patent App. 17\/138 224."},{"key":"e_1_3_3_23_2","unstructured":"Hrishav Bakul Barua Kalin Stefanov KokSheik Wong Abhinav Dhall and Ganesh Krishnasamy. 2024b. GTA-HDR: A large-scale synthetic dataset for HDR image reconstruction. arXiv:2403.17837."},{"key":"e_1_3_3_24_2","unstructured":"Loris Bazzani. 2021a. Retrieved fom http:\/\/www.lorisbazzani.info\/code-datasets\/multi-camera-dataset\/"},{"key":"e_1_3_3_25_2","unstructured":"Loris Bazzani. 2021b. Friends Meet Dataset. Retrieved fom https:\/\/pavis.iit.it\/datasets\/fmdataset & https:\/\/pavisdata.iit.it\/data\/bazzani\/fm_dataset\/fm_dataset.zip"},{"key":"e_1_3_3_26_2","unstructured":"Loris Bazzani. 2021c. Private. Retrieved from http:\/\/www.lorisbazzani.info\/code-datasets\/multi-camera-dataset\/"},{"key":"e_1_3_3_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247888"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1468-0394.2012.00622.x"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.5244\/C.23.14"},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126516"},{"key":"e_1_3_3_31_2","unstructured":"Abhijan Bhattacharyya Ashis Sau Ruddra Dev Roychoudhury Hrishav Bakul Barua Chayan Sarkar Sayan Paul Brojeshwar Bhowmick Arpan Pal and Balamuralidhar Purushothaman. 2021. Edge centric communication protocol for remotely maneuvering a tele-presence robot in a geographically distributed environment. US Patent App. 16\/988 453."},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247817"},{"key":"e_1_3_3_33_2","unstructured":"Laura Cabrera-Quiros. 2021. The MatchNMingle dataset. Retrieved from http:\/\/matchmakers.ewi.tudelft.nl\/matchnmingle\/pmwiki\/index.php?n=Main.TheDataset"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2017.03.002"},{"key":"e_1_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247845"},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001415550071"},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10593-2_28"},{"key":"e_1_3_3_38_2","article-title":"Discovering groups of people in images","author":"Choi Wongun","year":"2014","unstructured":"Wongun Choi, Yu-Wei Chao, Caroline Pantofaru, and Silvio Savarese. 2014b. Discovering groups of people in images. In ECCV.","journal-title":"ECCV"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511620539"},{"key":"e_1_3_3_40_2","unstructured":"CMU. 2021. CMU Panoptic Dataset. Retrieved from https:\/\/domedb.perception.cs.cmu.edu"},{"key":"e_1_3_3_41_2","unstructured":"Claudio Coppola. 2021. UoL 3D Social Interaction Dataset. Retrieved from https:\/\/lcas.lincoln.ac.uk\/wp\/research\/data-sets-software\/uol-3d-social-interaction-dataset"},{"key":"e_1_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2017.8172405"},{"key":"e_1_3_3_43_2","unstructured":"Marco Cristani. 2021. CoffeeeBreak dataset. Retrieved from http:\/\/profs.sci.univr.it\/cristanm\/datasets.html"},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.5244\/C.25.23"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/PASSAT\/SocialCom.2011.32"},{"key":"e_1_3_3_46_2","unstructured":"Cyberbotics. 2021. Webots-Open Source Robot Simulator. Retrieved from https:\/\/cyberbotics.com\/"},{"key":"e_1_3_3_47_2","first-page":"35","volume-title":"LREC 2018 Workshop: Language and Body in Real Life (LB-IRL2018) and Multimodal Corpora (MMC2018) Joint Workshop","author":"Den Yasuharu","year":"2018","unstructured":"Yasuharu Den. 2018. F-formation and social context: How spatial orientation of participants\u2019 bodies is organized in the vast field. In LREC 2018 Workshop: Language and Body in Real Life (LB-IRL2018) and Multimodal Corpora (MMC2018) Joint Workshop, 35\u201339."},{"key":"e_1_3_3_48_2","first-page":"25","volume-title":"International Conference on Intelligent User Interfaces","author":"Dim Eyal","year":"2013","unstructured":"Eyal Dim and Tsvi Kuflik. 2013. Social F-formation in blended reality. In International Conference on Intelligent User Interfaces, 25\u201328."},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/2662869"},{"key":"e_1_3_3_50_2","first-page":"1","article-title":"How ethical issues raised by human\u2013robot interaction can impact the intention to use the robot?","author":"Etemad-Sajadi Reza","year":"2022","unstructured":"Reza Etemad-Sajadi, Antonin Soussan, and Th\u00e9o Sch\u00f6pfer. 2022. How ethical issues raised by human\u2013robot interaction can impact the intention to use the robot? International Journal of Social Robotics (2022), 1\u201313.","journal-title":"International Journal of Social Robotics"},{"key":"e_1_3_3_51_2","unstructured":"Alessandro Farinelli. 2021. Retrieved from http:\/\/profs.sci.univr.it\/*cristanm\/datasets.html"},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502212"},{"key":"e_1_3_3_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/2661126.2661134"},{"key":"e_1_3_3_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502096"},{"key":"e_1_3_3_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN46459.2019.8956444"},{"key":"e_1_3_3_56_2","first-page":"396","volume-title":"2014 International Conference on Computer Vision Theory and Applications (VISAPP)","volume":"2","author":"G\u00e1rate Carolina","year":"2014","unstructured":"Carolina G\u00e1rate, Sofia Zaidenberg, Julien Badie, and Fran\u00e7ois Br\u00e9mond. 2014. Group tracking and behavior recognition in long video surveillance sequences. In 2014 International Conference on Computer Vision Theory and Applications (VISAPP), Vol. 2. IEEE, 396\u2013402."},{"key":"e_1_3_3_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6385460"},{"key":"e_1_3_3_58_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287041"},{"key":"e_1_3_3_59_2","doi-asserted-by":"publisher","DOI":"10.5555\/3109688.3109690"},{"key":"e_1_3_3_60_2","doi-asserted-by":"publisher","DOI":"10.4324\/9780203788387"},{"key":"e_1_3_3_61_2","volume-title":"2nd International Workshop on Cognitive Robotics Systems: Replicating Human Actions and Activities","author":"G\u00f3mez Javier V","year":"2013","unstructured":"Javier V G\u00f3mez, Nikolaos Mavridis, and Santiago Garrido. 2013. Social path planning: Generic human-robot interaction framework for robotic navigation tasks. In 2nd International Workshop on Cognitive Robotics Systems: Replicating Human Actions and Activities."},{"key":"e_1_3_3_62_2","volume-title":"FG Net Workshop on Visual Observation of Deictic Gestures","author":"Gourier Nicolas","year":"2004","unstructured":"Nicolas Gourier and James Crowley. 2004. Estimating face orientation from robust detection of salient facial structures. FG Net Workshop on Visual Observation of Deictic Gestures."},{"key":"e_1_3_3_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/SocialCom.2010.11"},{"key":"e_1_3_3_64_2","doi-asserted-by":"publisher","DOI":"10.1525\/aa.1963.65.5.02a00020"},{"key":"e_1_3_3_65_2","volume-title":"The Hidden Dimension","author":"Hall Edward T.","year":"1966","unstructured":"Edward T. Hall. 1966. The Hidden Dimension. Anchor Books. Garden City, NY."},{"key":"e_1_3_3_66_2","unstructured":"Hooman Hedayati. 2021. Retrieved from https:\/\/github.com\/cu-ironlab\/Babble"},{"key":"e_1_3_3_67_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS45743.2020.9340708"},{"key":"e_1_3_3_68_2","doi-asserted-by":"publisher","DOI":"10.1109\/HRI.2019.8673233"},{"key":"e_1_3_3_69_2","doi-asserted-by":"publisher","DOI":"10.1145\/3334480.3383015"},{"key":"e_1_3_3_70_2","first-page":"6840","article-title":"Denoising diffusion probabilistic models","volume":"33","author":"Ho Jonathan","year":"2020","unstructured":"Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems 33 (2020), 6840\u20136851.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_71_2","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3351027"},{"key":"e_1_3_3_72_2","doi-asserted-by":"publisher","DOI":"10.1145\/2663204.2663228"},{"key":"e_1_3_3_73_2","doi-asserted-by":"publisher","DOI":"10.1145\/2070481.2070525"},{"key":"e_1_3_3_74_2","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3351320"},{"key":"e_1_3_3_75_2","doi-asserted-by":"publisher","unstructured":"Helge H\u00fcttenrauch Kerstin Eklundh Anders Green and Elin Anna Topp. 2006. Investigating spatial relationships in human-robot interaction. 5052\u20135059. DOI: 10.1109\/IROS.2006.282535","DOI":"10.1109\/IROS.2006.282535"},{"key":"e_1_3_3_76_2","doi-asserted-by":"publisher","DOI":"10.1145\/2818048.2819938"},{"key":"e_1_3_3_77_2","unstructured":"Idiap Research Institute. 2021. Idiap Poster Data. Retrieved from https:\/\/www.idiap.ch\/en\/dataset\/idiap-poster-data\/idiap-poster-data"},{"key":"e_1_3_3_78_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2782743"},{"key":"e_1_3_3_79_2","doi-asserted-by":"publisher","DOI":"10.1145\/2631488.2631499"},{"key":"e_1_3_3_80_2","doi-asserted-by":"publisher","DOI":"10.5555\/2667657.2667664"},{"key":"e_1_3_3_81_2","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2009.2032969"},{"key":"e_1_3_3_82_2","volume-title":"4th International Symposium on New Frontiers in Human-Robot Interaction","author":"Karreman Daphne","year":"2015","unstructured":"Daphne Karreman, Geke Ludden, Betsy van Dijk, and Vanessa Evers. 2015. How can a tour guide robot's orientation influence visitors\u2019 orientation and formations? In 4th International Symposium on New Frontiers in Human-Robot Interaction."},{"key":"e_1_3_3_83_2","doi-asserted-by":"publisher","DOI":"10.1145\/2559636.2559839"},{"key":"e_1_3_3_84_2","doi-asserted-by":"publisher","DOI":"10.1145\/2968219.2968281"},{"key":"e_1_3_3_85_2","volume-title":"Conducting interaction: Patterns of behavior in focused encounters","author":"Kendon Adam","year":"1990","unstructured":"Adam Kendon. 1990. Conducting interaction: Patterns of behavior in focused encounters. Vol. 7. CUP Archive."},{"key":"e_1_3_3_86_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12397-9_1"},{"key":"e_1_3_3_87_2","unstructured":"Chung Min Kim Mingxuan Wu Justin Kerr Ken Goldberg Matthew Tancik and Angjoo Kanazawa. 2024. GARField: Group anything with radiance fields. arXiv:2401.09419."},{"key":"e_1_3_3_88_2","doi-asserted-by":"publisher","DOI":"10.1109\/FGCT.2012.6476562"},{"key":"e_1_3_3_89_2","volume-title":"HRI 2017","author":"Krishna Sai","year":"2017","unstructured":"Sai Krishna, Andrey Kiselev, and Amy Loutfi. 2017. Towards a method to detect F-formations in real-time to enable social robots to join groups. In HRI 2017."},{"key":"e_1_3_3_90_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2013.05.007"},{"key":"e_1_3_3_91_2","first-page":"285","volume-title":"2010 5th ACM\/IEEE International Conference on Human-Robot Interaction (HRI)","author":"Kuzuoka Hideaki","year":"2010","unstructured":"Hideaki Kuzuoka, Yuya Suzuki, Jun Yamashita, and Keiichi Yamazaki. 2010. Reconfiguring spatial formation arrangement by robot body orientation. In 2010 5th ACM\/IEEE International Conference on Human-Robot Interaction (HRI). IEEE, 285\u2013292."},{"key":"e_1_3_3_92_2","doi-asserted-by":"publisher","DOI":"10.3390\/app12052370"},{"key":"e_1_3_3_93_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2007.01089.x"},{"key":"e_1_3_3_94_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.351"},{"key":"e_1_3_3_95_2","article-title":"Multi-hypothesis social grouping and tracking for mobile robots.","author":"Luber Matthias","year":"2013","unstructured":"Matthias Luber and Kai Oliver Arras. 2013. Multi-hypothesis social grouping and tracking for mobile robots. In Robotics: Science and Systems.","journal-title":"Robotics: Science and Systems"},{"key":"e_1_3_3_96_2","doi-asserted-by":"publisher","DOI":"10.1145\/2380116.2380121"},{"key":"e_1_3_3_97_2","first-page":"445","article-title":"Using F-formations to analyse spatial patterns of interaction in physical environments","author":"Marshall Paul","year":"2011","unstructured":"Paul Marshall, Yvonne Rogers, and Nadia Pantidi. 2011. Using F-formations to analyse spatial patterns of interaction in physical environments. ACM, 445\u2013454.","journal-title":"ACM"},{"key":"e_1_3_3_98_2","first-page":"270","volume-title":"2018 ACM\/IEEE International Conference on Human-Robot Interaction","author":"Matsumoto Takahiro","year":"2018","unstructured":"Takahiro Matsumoto, Mitsuhiro Goto, Ryo Ishii, Tomoki Watanabe, Tomohiro Yamada, and Michita Imai. 2018. Where should robots talk? Spatial arrangement study from a participant workload perspective. In 2018 ACM\/IEEE International Conference on Human-Robot Interaction, 270\u2013278."},{"key":"e_1_3_3_99_2","first-page":"143","article-title":"Mind merging","author":"McNeill David","year":"2009","unstructured":"David McNeill, Susan Duncan, Amy Franklin, James Goss, Irene Kimbara, Fey Parrill, Haleema Welji, Lei Chen, Mary Harper, Francis Quek, Travis Rose, and Ronald Tuttle. 2009. Mind merging. Expressing Oneself\/Expressing One's Self: Communication, Language, Cognition, and Identity: Essays in Honor of Robert Krauss, 143\u2013164.","journal-title":"Expressing Oneself\/Expressing One's Self: Communication, Language, Cognition, and Identity: Essays in Honor of Robert Krauss"},{"key":"e_1_3_3_100_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-013-0189-8"},{"key":"e_1_3_3_101_2","volume-title":"An experimental design for studying proxemic behavior in human-robot interaction","author":"Mead Ross","year":"2011","unstructured":"Ross Mead and Maja J. Matari\u0107. 2011. An experimental design for studying proxemic behavior in human-robot interaction. Technical Report. Citeseer."},{"key":"e_1_3_3_102_2","unstructured":"MMLAB. 2021. Multimedia Signal Processing and Understanding Lab. Retrieved from http:\/\/mmlab.science.unitn.it\/USID\/"},{"key":"e_1_3_3_103_2","first-page":"301","volume-title":"7th annual ACM\/IEEE International Conference on Human-Robot Interaction","author":"Saiki Luis Yoichi Morales","year":"2012","unstructured":"Luis Yoichi Morales Saiki, Satoru Satake, Rajibul Huq, Dylan Glas, Takayuki Kanda, and Norihiro Hagita. 2012. How do people walk side-by-side? Using a computational model of human behavior for a social robot. In 7th annual ACM\/IEEE International Conference on Human-Robot Interaction, 301\u2013308."},{"key":"e_1_3_3_104_2","doi-asserted-by":"publisher","DOI":"10.1145\/1957656.1957786"},{"key":"e_1_3_3_105_2","doi-asserted-by":"publisher","DOI":"10.1145\/2074712.2074766"},{"key":"e_1_3_3_106_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7353952"},{"key":"e_1_3_3_107_2","unstructured":"Narrative. 2021. Narrative Clip 2. Retrieved from http:\/\/getnarrative.com\/"},{"key":"e_1_3_3_108_2","first-page":"1047","volume-title":"The 10th International Conference on Autonomous Agents and Multiagent Systems-Volume 3","author":"Nguyen Nhung","year":"2011","unstructured":"Nhung Nguyen and Ipke Wachsmuth. 2011. From body space to interaction space: Modeling spatial cooperation for virtual humans. In The 10th International Conference on Autonomous Agents and Multiagent Systems-Volume 3. International Foundation for Autonomous Agents and Multiagent Systems, 1047\u20131054."},{"key":"e_1_3_3_109_2","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2820759"},{"key":"e_1_3_3_110_2","unstructured":"Robert Fisher School of Informatics (Univ. of Edinburgh). 2021. BEHAVE: Computer-assisted prescreening of video streams for unusual activities. Retrieved from http:\/\/homepages.inf.ed.ac.uk\/rbf\/BEHAVE\/"},{"key":"e_1_3_3_111_2","doi-asserted-by":"publisher","DOI":"10.1145\/2675133.2675194"},{"key":"e_1_3_3_112_2","doi-asserted-by":"publisher","DOI":"10.1145\/2212776.2223723"},{"key":"e_1_3_3_113_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40498-6_3"},{"key":"e_1_3_3_114_2","doi-asserted-by":"publisher","DOI":"10.5555\/3237383.3237968"},{"key":"e_1_3_3_115_2","doi-asserted-by":"publisher","DOI":"10.3390\/s19143142"},{"key":"e_1_3_3_116_2","doi-asserted-by":"publisher","DOI":"10.1145\/3029798.3038304"},{"key":"e_1_3_3_117_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN46459.2019.8956318"},{"key":"e_1_3_3_118_2","doi-asserted-by":"publisher","DOI":"10.3390\/mti3040069"},{"key":"e_1_3_3_119_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-67401-8_45"},{"key":"e_1_3_3_120_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459260"},{"key":"e_1_3_3_121_2","doi-asserted-by":"publisher","DOI":"10.1145\/3150226"},{"key":"e_1_3_3_122_2","unstructured":"Pradip Pramanick Hrishav Bakul Barua and Chayan Sarkar. 2021. Robotic task planning for complex task instructions in natural language. US Patent App. 17\/007 391."},{"key":"e_1_3_3_123_2","unstructured":"Profs. 2014 modified. F-formation discovery in static images. Retrieved from http:\/\/profs.scienze.univr.it\/cristanm\/ssp\/"},{"key":"e_1_3_3_124_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01018"},{"key":"e_1_3_3_125_2","doi-asserted-by":"publisher","DOI":"10.1145\/3490035.3490275"},{"key":"e_1_3_3_126_2","doi-asserted-by":"publisher","unstructured":"Anoop Rajagopal Ramanathan Subramanian Radu Vieriu Elisa Ricci Oswald Lanz Kalpathi Ramakrishnan and Nicu Sebe. 2012. An adaptation framework for head-pose classification in dynamic multi-view scenarios. In Vol. 7725. 652\u2013666. DOI: 10.1007\/978-3-642-37444-9_51","DOI":"10.1007\/978-3-642-37444-9_51"},{"key":"e_1_3_3_127_2","first-page":"1","article-title":"Exploring transfer learning approaches for head pose classification from multi-view surveillance images","volume":"109","author":"Rajagopal Anoop Kolar","year":"2014","unstructured":"Anoop Kolar Rajagopal, Ramanathan Subramanian, Elisa Ricci, Radu L. Vieriu, Oswald Lanz, Kalpathi R. Ramakrishnan, and Nicu Sebe. 2014. Exploring transfer learning approaches for head pose classification from multi-view surveillance images. International Journal of Computer Vision 109, 1\u20132 (2014), 146\u2013167.","journal-title":"International Journal of Computer Vision"},{"key":"e_1_3_3_128_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACIIW.2019.8925065"},{"key":"e_1_3_3_129_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413697"},{"key":"e_1_3_3_130_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2016.7745246"},{"key":"e_1_3_3_131_2","volume-title":"Proceedings of We Robot","author":"Riek Laurel","year":"2014","unstructured":"Laurel Riek and Don Howard. 2014. A code of ethics for the human-robot interaction profession. Proceedings of We Robot."},{"key":"e_1_3_3_132_2","unstructured":"Laurel D. Riek. 2018. Robot-Centric Human Group Detection Angelique."},{"key":"e_1_3_3_133_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2011.6094496"},{"key":"e_1_3_3_134_2","unstructured":"RoboDK. 2021. Simulate Robot Applications. Retrieved from https:\/\/robodk.com\/index"},{"key":"e_1_3_3_135_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACIIW.2019.8925179"},{"key":"e_1_3_3_136_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33885-4_12"},{"key":"e_1_3_3_137_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2017.8172306"},{"key":"e_1_3_3_138_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459361"},{"key":"e_1_3_3_139_2","doi-asserted-by":"crossref","unstructured":"S. M. Bhagya P. Samarakoon M. A. Viraj J. Muthugala and A. G. Buddhika P. Jayasekara. 2018. Replicating natural approaching behavior of humans for improving robot's approach toward two persons during a conversation. In 2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN). IEEE 552\u2013558.","DOI":"10.1109\/ROMAN.2018.8525834"},{"key":"e_1_3_3_140_2","unstructured":"Chayan Sarkar Snehasis Banerjee Pradip Pramanick Hrishav Bakul Barua Soumyadip Maity Dipanjan Das Brojeshwar Bhowmick Ashis Sau Abhijan Bhattacharyya Arpan Pal Balamuralidhar Purushothaman and Ruddra Roy Chowdhury. 2021a. Knowledge partitioning for task execution by conversational tele-presence robots in a geographically separated environment. US Patent App. 17\/015 238."},{"key":"e_1_3_3_141_2","unstructured":"Chayan Sarkar Hrishav Bakul Barua Arpan Pal Balamuralidhar Purushothaman and Achanna Anil Kumar. 2021b. Attention shifting of a robot in a group conversation using audio-visual perception based speaker localization. US Patent App. 11 127 401."},{"key":"e_1_3_3_142_2","doi-asserted-by":"publisher","DOI":"10.1145\/1514095.1514117"},{"key":"e_1_3_3_143_2","unstructured":"Ashis Sau Ruddra Dev Roychoudhury Hrishav Bakul Barua Chayan Sarkar Sayan Paul Brojeshwar Bhowmick Arpan Pal and Balamuralidhar Purushothaman. 2020. Edge-centric telepresence avatar robot for geographically distributed environment. arXiv:2007.12990."},{"key":"e_1_3_3_144_2","doi-asserted-by":"publisher","DOI":"10.1145\/2957265.2962656"},{"key":"e_1_3_3_145_2","doi-asserted-by":"publisher","DOI":"10.1109\/WIAMIS.2013.6616147"},{"key":"e_1_3_3_146_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2013.6738732"},{"key":"e_1_3_3_147_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0123783"},{"key":"e_1_3_3_148_2","doi-asserted-by":"publisher","DOI":"10.1145\/3503161.3551605"},{"key":"e_1_3_3_149_2","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2011.VII.039"},{"key":"e_1_3_3_150_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-015-0285-z"},{"key":"e_1_3_3_151_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2470658"},{"key":"e_1_3_3_152_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-68585-2"},{"key":"e_1_3_3_153_2","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806343"},{"key":"e_1_3_3_154_2","doi-asserted-by":"publisher","DOI":"10.1145\/3392824"},{"key":"e_1_3_3_155_2","doi-asserted-by":"publisher","DOI":"10.1145\/3536221.3556611"},{"key":"e_1_3_3_156_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00880"},{"key":"e_1_3_3_157_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2019.00125"},{"key":"e_1_3_3_158_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2018.03.006"},{"key":"e_1_3_3_159_2","volume-title":"2016 AAAI Fall Symposium Series","author":"Taylor Angelique","year":"2016","unstructured":"Angelique Taylor and Laurel D. Riek. 2016. Robot perception of human groups in the real world: State of the art. In 2016 AAAI Fall Symposium Series."},{"key":"e_1_3_3_160_2","volume-title":"ACM Transactions on Human-Robot Interaction (THRI)","author":"Taylor Angelique Marie","year":"2019","unstructured":"Angelique Marie Taylor, Darren Chan, and Laurel Riek. 2019. Robot-centric perception of human groups. ACM Transactions on Human-Robot Interaction (THRI) (2019)."},{"key":"e_1_3_3_161_2","doi-asserted-by":"publisher","DOI":"10.1145\/3462244.3479963"},{"key":"e_1_3_3_162_2","doi-asserted-by":"publisher","DOI":"10.1145\/2935334.2935343"},{"key":"e_1_3_3_163_2","first-page":"539","volume-title":"International Conference on Computer Vision Theory and Applications (VISAPP)","author":"Tran Khai N.","year":"2013","unstructured":"Khai N. Tran, Apurva Bedagkar-Gala, Ioannis A. Kakadiaris, and Shishir K. Shah. 2013. Social cues in group formation and local interactions for collective activity analysis. In International Conference on Computer Vision Theory and Applications (VISAPP), 539\u2013548."},{"key":"e_1_3_3_164_2","doi-asserted-by":"publisher","DOI":"10.1109\/RCAR.2016.7784002"},{"key":"e_1_3_3_165_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2016.05.004"},{"key":"e_1_3_3_166_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-017-1026-6"},{"key":"e_1_3_3_167_2","first-page":"658","volume-title":"Asian Conference on Computer Vision","author":"Vascon Sebastiano","year":"2014","unstructured":"Sebastiano Vascon, Eyasu Zemene Mequanint, Marco Cristani, Hayley Hung, Marcello Pelillo, and Vittorio Murino. 2014. A game-theoretic probabilistic approach for detecting conversational groups. In Asian Conference on Computer Vision. Springer, 658\u2013675."},{"key":"e_1_3_3_168_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2015.09.012"},{"key":"e_1_3_3_169_2","doi-asserted-by":"publisher","DOI":"10.1145\/2909824.3020207"},{"key":"e_1_3_3_170_2","doi-asserted-by":"publisher","DOI":"10.3389\/frobt.2021.703807"},{"key":"e_1_3_3_171_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7353792"},{"key":"e_1_3_3_172_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2017.8172454"},{"key":"e_1_3_3_173_2","unstructured":"Computational Vision and Geometry Lab (CVGL) at Stanford. 2021. Discovering groups of people in images. Retrieved from http:\/\/cvgl.stanford.edu\/projects\/groupdiscovery\/"},{"key":"e_1_3_3_174_2","unstructured":"TeV TECHNOLOGIES OF VISION. 2021a. Resources. Retrieved from http:\/\/tev.fbk.eu\/resources"},{"key":"e_1_3_3_175_2","unstructured":"TeV TECHNOLOGIES OF VISION. 2021b. SALSA dataset. Retrieved from http:\/\/tev.fbk.eu\/salsa"},{"key":"e_1_3_3_176_2","article-title":"Deep learning for hdr imaging: State-of-the-art and future trends","author":"Wang Lin","year":"2021","unstructured":"Lin Wang and Kuk-Jin Yoon. 2021. Deep learning for hdr imaging: State-of-the-art and future trends. IEEE Transactions on Pattern Analysis and Machine Intelligence.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_3_177_2","unstructured":"Wikipedia. 2021a. Retrieved from https:\/\/en.wikipedia.org\/wiki\/Line_(formation)"},{"key":"e_1_3_3_178_2","unstructured":"Wikipedia. 2021b. Retrieved from https:\/\/en.wikipedia.org\/wiki\/Column_(formation)"},{"key":"e_1_3_3_179_2","unstructured":"Wikipedia. 2021c. Retrieved from https:\/\/en.wikipedia.org\/wiki\/Echelon_formation"},{"key":"e_1_3_3_180_2","unstructured":"Wikipedia. 2021d. Retrieved from https:\/\/en.wikipedia.org\/wiki\/V_formation"},{"key":"e_1_3_3_181_2","unstructured":"Wikipedia. 2021e. Retrieved from https:\/\/en.wikipedia.org\/wiki\/Infantry_square"},{"key":"e_1_3_3_182_2","unstructured":"Alexander Wisowaty. 2019. Group Human-Robot Interaction: A Review."},{"key":"e_1_3_3_183_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8206105"},{"key":"e_1_3_3_184_2","doi-asserted-by":"publisher","DOI":"10.1109\/SMC.2014.6973964"},{"key":"e_1_3_3_185_2","doi-asserted-by":"publisher","DOI":"10.1145\/2157689.2157794"},{"key":"e_1_3_3_186_2","doi-asserted-by":"publisher","DOI":"10.1145\/1878039.1878048"},{"key":"e_1_3_3_187_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.123"},{"key":"e_1_3_3_188_2","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2018.2855750"},{"key":"e_1_3_3_189_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.3013531"}],"container-title":["ACM Transactions on Human-Robot Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3682072","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3682072","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:10:03Z","timestamp":1750295403000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3682072"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,23]]},"references-count":188,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,12,31]]}},"alternative-id":["10.1145\/3682072"],"URL":"https:\/\/doi.org\/10.1145\/3682072","relation":{},"ISSN":["2573-9522"],"issn-type":[{"value":"2573-9522","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,23]]},"assertion":[{"value":"2021-10-13","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-07-14","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}