{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T07:52:30Z","timestamp":1775289150970,"version":"3.50.1"},"reference-count":59,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2018,12,4]],"date-time":"2018-12-04T00:00:00Z","timestamp":1543881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,12,31]]},"abstract":"<jats:p>\n                    We propose a real-time method for the infrastructure-free estimation of articulated human motion. The approach leverages a swarm of camera-equipped flying robots and\n                    <jats:italic>jointly<\/jats:italic>\n                    optimizes the swarm's and skeletal states, which include the 3D joint positions and a set of bones. Our method allows to track the motion of human subjects, for example an athlete, over long time horizons and long distances, in challenging settings and at large scale, where fixed infrastructure approaches are not applicable. The proposed algorithm uses active infra-red markers, runs in real-time and accurately estimates robot and human pose parameters online without the need for accurately calibrated or stationary mounted cameras. Our method i) estimates a global coordinate frame for the MAV swarm, ii) jointly optimizes the human pose and relative camera positions, and iii) estimates the length of the human bones. The entire swarm is then controlled via a model predictive controller to maximize visibility of the subject from multiple viewpoints even under fast motion such as jumping or jogging. We demonstrate our method in a number of difficult scenarios including capture of long locomotion sequences at the scale of a triplex gym, in non-planar terrain, while climbing and in outdoor scenarios.\n                  <\/jats:p>","DOI":"10.1145\/3272127.3275022","type":"journal-article","created":{"date-parts":[[2018,11,28]],"date-time":"2018-11-28T14:16:10Z","timestamp":1543414570000},"page":"1-14","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["Flycon"],"prefix":"10.1145","volume":"37","author":[{"given":"Tobias","family":"N\u00e4geli","sequence":"first","affiliation":[{"name":"AIT Lab, ETH Zurich"}]},{"given":"Samuel","family":"Oberholzer","sequence":"additional","affiliation":[{"name":"AIT Lab, ETH Zurich"}]},{"given":"Silvan","family":"Pl\u00fcss","sequence":"additional","affiliation":[{"name":"AIT Lab, ETH Zurich"}]},{"given":"Javier","family":"Alonso-Mora","sequence":"additional","affiliation":[{"name":"Delft University of Technology"}]},{"given":"Otmar","family":"Hilliges","sequence":"additional","affiliation":[{"name":"AIT Lab, ETH Zurich"}]}],"member":"320","published-online":{"date-parts":[[2018,12,4]]},"reference":[{"key":"e_1_2_2_1_1","unstructured":"2015. Parrot SDK. (2015). http:\/\/developer.parrot.com\/. 2015. Parrot SDK. (2015). http:\/\/developer.parrot.com\/."},{"key":"e_1_2_2_2_1","volume-title":"Distributed multi-robot formation control in dynamic environments. Autonomous Robots (July","author":"Alonso-Mora Javier","year":"2018","unstructured":"Javier Alonso-Mora , Eduardo Montijano , Tobias N\u00e4geli , Otmar Hilliges , Mac Schwager , and Daniela Rus . 2018. Distributed multi-robot formation control in dynamic environments. Autonomous Robots (July 2018 ). Javier Alonso-Mora, Eduardo Montijano, Tobias N\u00e4geli, Otmar Hilliges, Mac Schwager, and Daniela Rus. 2018. Distributed multi-robot formation control in dynamic environments. Autonomous Robots (July 2018)."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33783-3_46"},{"key":"e_1_2_2_4_1","volume-title":"Robotics: Science and Systems RSS2013","author":"Basiri Meysam","year":"2013","unstructured":"Meysam Basiri , Felix Schill , Dario Floreano , and Pedro Lima . 2013 . Audio-based relative positioning system for multiple micro air vehicle systems . In Robotics: Science and Systems RSS2013 . Meysam Basiri, Felix Schill, Dario Floreano, and Pedro Lima. 2013. Audio-based relative positioning system for multiple micro air vehicle systems. In Robotics: Science and Systems RSS2013."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46454-1_34"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/794191.794776"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.3182\/20110828-6-IT-1002.02327"},{"key":"e_1_2_2_8_1","doi-asserted-by":"crossref","unstructured":"J A Castellanos Jose Neira and Juan Domingo Tardos. 2004. Limits to the consistency of EKF-based SLAM. (2004). J A Castellanos Jose Neira and Juan Domingo Tardos. 2004. Limits to the consistency of EKF-based SLAM. (2004).","DOI":"10.1016\/S1474-6670(17)32063-3"},{"key":"e_1_2_2_9_1","unstructured":"Xianjie Chen and Alan L Yuille. 2014. Articulated pose estimation by a graphical model with image dependent pairwise relations. In NIPS. 1736--1744. Xianjie Chen and Alan L Yuille. 2014. Articulated pose estimation by a graphical model with image dependent pairwise relations. In NIPS. 1736--1744."},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1399504.1360697"},{"key":"e_1_2_2_11_1","volume-title":"2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 2237--2242","author":"de Pal\u00e9zieux N.","unstructured":"N. de Pal\u00e9zieux , T. N\u00e4geli , and O. Hilliges . 2016. Duo-VIO: Fast, light-weight, stereo inertial odometry . In 2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 2237--2242 . N. de Pal\u00e9zieux, T. N\u00e4geli, and O. Hilliges. 2016. Duo-VIO: Fast, light-weight, stereo inertial odometry. In 2016 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). 2237--2242."},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925969"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2557779"},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAC.1969.1099223"},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33783-3_53"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2858036.2858353"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3197517.3201390"},{"key":"e_1_2_2_18_1","volume-title":"Advanced Kalman filtering, least-squares and modeling","author":"Gibbs Bruce P.","unstructured":"Bruce P. Gibbs . 2011. Advanced Kalman filtering, least-squares and modeling . John Wiley & Sons . Bruce P. Gibbs. 2011. Advanced Kalman filtering, least-squares and modeling. John Wiley & Sons."},{"key":"e_1_2_2_19_1","volume-title":"Multiple View Geometry in Computer Vision (2 ed.)","author":"Hartley Richard","unstructured":"Richard Hartley and Andrew Zisserman . 2003. Multiple View Geometry in Computer Vision (2 ed.) . Cambridge University Press , New York, NY, USA . Richard Hartley and Andrew Zisserman. 2003. Multiple View Geometry in Computer Vision (2 ed.). Cambridge University Press, New York, NY, USA."},{"key":"e_1_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/9.895577"},{"key":"e_1_2_2_21_1","doi-asserted-by":"crossref","unstructured":"Chong Huang Zhenyu Yang Yan Kong Peng Chen Xin Yang and Kwang-Ting Tim Cheng. 2018. Through-the-Lens Drone Filming. (2018). Chong Huang Zhenyu Yang Yan Kong Peng Chen Xin Yang and Kwang-Ting Tim Cheng. 2018. Through-the-Lens Drone Filming. (2018).","DOI":"10.1109\/IROS.2018.8594333"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2816795.2818106"},{"key":"e_1_2_2_23_1","volume-title":"Kalman filtering for spacecraft attitude estimation. Journal of Guidance, Control, and Dynamics","author":"Lefferts Ern J","year":"1982","unstructured":"Ern J Lefferts , F Landis Markley , and Malcolm D Shuster . 1982. Kalman filtering for spacecraft attitude estimation. Journal of Guidance, Control, and Dynamics ( 1982 ). Ern J Lefferts, F Landis Markley, and Malcolm D Shuster. 1982. Kalman filtering for spacecraft attitude estimation. Journal of Guidance, Control, and Dynamics (1982)."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2016.11"},{"key":"e_1_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Hyon Lim and Sudipta Sinha. 2015. Monocular Localization of a moving person onboard a Quadrotor MAV. https:\/\/www.microsoft.com\/en-us\/research\/publication\/trajrecon\/ Hyon Lim and Sudipta Sinha. 2015. Monocular Localization of a moving person onboard a Quadrotor MAV. https:\/\/www.microsoft.com\/en-us\/research\/publication\/trajrecon\/","DOI":"10.1109\/ICRA.2015.7139487"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1944745.1944768"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2011.5980308"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073596"},{"key":"e_1_2_2_29_1","first-page":"56","article-title":"The GRASP Multiple Micro-UAV Testbed. Robotics Automation Magazine","volume":"17","author":"Michael Nathan","year":"2010","unstructured":"Nathan Michael , D. Mellinger , Q. Lindsey , and V. Kumar . 2010 . The GRASP Multiple Micro-UAV Testbed. Robotics Automation Magazine , IEEE 17 , 3 (2010), 56 -- 65 . Nathan Michael, D. Mellinger, Q. Lindsey, and V. Kumar. 2010. The GRASP Multiple Micro-UAV Testbed. Robotics Automation Magazine, IEEE 17, 3 (2010), 56--65.","journal-title":"IEEE"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2006.08.002"},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2017.2665693"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2014.6942701"},{"key":"e_1_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073712"},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298631"},{"key":"e_1_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In ECCV. 483--499. Alejandro Newell Kaiyu Yang and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In ECCV. 483--499.","DOI":"10.1007\/978-3-319-46484-8_29"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354910"},{"key":"e_1_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766993"},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2006.1641182"},{"key":"e_1_2_2_39_1","volume-title":"IEEE ICRA Workshop on Open Source Software.","author":"Quigley Morgan","unstructured":"Morgan Quigley , Ken Conley , Brian P. Gerkey , Josh Faust , Tully Foote , Jeremy Leibs , Rob Wheeler , and Andrew Y. Ng . 2009. ROS: an open-source Robot Operating System . In IEEE ICRA Workshop on Open Source Software. Morgan Quigley, Ken Conley, Brian P. Gerkey, Josh Faust, Tully Foote, Jeremy Leibs, Rob Wheeler, and Andrew Y. Ng. 2009. ROS: an open-source Robot Operating System. In IEEE ICRA Workshop on Open Source Software."},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.94"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2016.25"},{"key":"e_1_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925980"},{"key":"e_1_2_2_43_1","first-page":"8","article-title":"Moven: Full 6dof human motion tracking using miniature inertial sensors","volume":"2","author":"Roetenberg Daniel","year":"2007","unstructured":"Daniel Roetenberg , Henk Luinge , and Per Slycke . 2007 . Moven: Full 6dof human motion tracking using miniature inertial sensors . Xsen Technologies , December 2 , 3 (2007), 8 . Daniel Roetenberg, Henk Luinge, and Per Slycke. 2007. Moven: Full 6dof human motion tracking using miniature inertial sensors. Xsen Technologies, December 2, 3 (2007), 8.","journal-title":"Xsen Technologies"},{"key":"e_1_2_2_44_1","volume-title":"Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C)","volume":"2","author":"Roumeliotis S. I.","unstructured":"S. I. Roumeliotis , G. S. Sukhatme , and G. A. Bekey . 1999. Circumventing dynamic modeling: evaluation of the error-state Kalman filter applied to mobile robot localization . In Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C) , Vol. 2 . 1656--1663 vol.2. S. I. Roumeliotis, G. S. Sukhatme, and G. A. Bekey. 1999. Circumventing dynamic modeling: evaluation of the error-state Kalman filter applied to mobile robot localization. In Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C), Vol. 2. 1656--1663 vol.2."},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-10470-1_14"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2398356.2398381"},{"key":"e_1_2_2_47_1","volume-title":"Luc Van Gool, and Otmar Hilliges","author":"Song Jie","year":"2017","unstructured":"Jie Song , Limin Wang , Luc Van Gool, and Otmar Hilliges . 2017 . Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos . arXiv preprint arXiv:1703.10898 (2017). Jie Song, Limin Wang, Luc Van Gool, and Otmar Hilliges. 2017. Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos. arXiv preprint arXiv:1703.10898 (2017)."},{"key":"e_1_2_2_48_1","volume-title":"Model-based multiple view reconstruction of people. In null","author":"Starck Jonathan","unstructured":"Jonathan Starck and Adrian Hilton . 2003. Model-based multiple view reconstruction of people. In null . IEEE , 915. Jonathan Starck and Adrian Hilton. 2003. Model-based multiple view reconstruction of people. In null. IEEE, 915."},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126338"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966394.1966397"},{"key":"e_1_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.5555\/2354409.2354668"},{"key":"e_1_2_2_52_1","volume-title":"Fusing 2D Uncertainty and 3D Cues for Monocular Body Pose Estimation. arXiv preprint arXiv:1611.05708","author":"Tekin Bugra","year":"2016","unstructured":"Bugra Tekin , Pablo M\u00e1rquez-Neila , Mathieu Salzmann , and Pascal Fua . 2016. Fusing 2D Uncertainty and 3D Cues for Monocular Body Pose Estimation. arXiv preprint arXiv:1611.05708 ( 2016 ). Bugra Tekin, Pablo M\u00e1rquez-Neila, Mathieu Salzmann, and Pascal Fua. 2016. Fusing 2D Uncertainty and 3D Cues for Monocular Body Pose Estimation. arXiv preprint arXiv:1611.05708 (2016)."},{"key":"e_1_2_2_53_1","unstructured":"Jonathan J Tompson Arjun Jain Yann LeCun and Christoph Bregler. 2014. Joint training of a convolutional network and a graphical model for human pose estimation. In NIPS. 1799--1807. Jonathan J Tompson Arjun Jain Yann LeCun and Christoph Bregler. 2014. Joint training of a convolutional network and a graphical model for human pose estimation. In NIPS. 1799--1807."},{"key":"e_1_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.214"},{"key":"e_1_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13131"},{"key":"e_1_2_2_56_1","doi-asserted-by":"crossref","unstructured":"Shih-En Wei Varun Ramakrishna Takeo Kanade and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR. 4724--4732. Shih-En Wei Varun Ramakrishna Takeo Kanade and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR. 4724--4732.","DOI":"10.1109\/CVPR.2016.511"},{"key":"e_1_2_2_57_1","volume-title":"FlyCap: Markerless motion capture using multiple autonomous flying cameras","author":"Xu Lan","year":"2017","unstructured":"Lan Xu , Yebin Liu , Wei Cheng , Kaiwen Guo , Guyue Zhou , Qionghai Dai , and Lu Fang . 2017. FlyCap: Markerless motion capture using multiple autonomous flying cameras . IEEE transactions on visualization and computer graphics ( 2017 ). Lan Xu, Yebin Liu, Wei Cheng, Kaiwen Guo, Guyue Zhou, Qionghai Dai, and Lu Fang. 2017. FlyCap: Markerless motion capture using multiple autonomous flying cameras. IEEE transactions on visualization and computer graphics (2017)."},{"key":"e_1_2_2_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.537"},{"key":"e_1_2_2_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/2601097.2601165"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275022","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3272127.3275022","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T06:58:29Z","timestamp":1775285909000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275022"}},"subtitle":["real-time environment-independent multi-view human pose estimation with aerial vehicles"],"short-title":[],"issued":{"date-parts":[[2018,12,4]]},"references-count":59,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,31]]}},"alternative-id":["10.1145\/3272127.3275022"],"URL":"https:\/\/doi.org\/10.1145\/3272127.3275022","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12,4]]},"assertion":[{"value":"2018-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}