{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T10:55:55Z","timestamp":1761562555529,"version":"3.41.0"},"reference-count":65,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2019,9,9]],"date-time":"2019-09-09T00:00:00Z","timestamp":1567987200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."],"published-print":{"date-parts":[[2019,9,9]]},"abstract":"<jats:p>Smartphone localization is essential to a wide range of applications in shopping malls, museums, office buildings, and other public places. Existing solutions relying on radio fingerprints and\/or inertial sensors suffer from large location errors and considerable deployment efforts. We observe an opportunity in the recent trend of increasing numbers of security surveillance cameras installed in indoor spaces to overcome these limitations and revisit the problem of smartphone localization with a fresh perspective. However, fusing vision-based and radio-based systems is non-trivial due to the absence of absolute location, incorrespondence of identification and looseness of sensor fusion. This study proposes iVR, an integrated vision and radio localization system that achieves sub-meter accuracy with indoor semantic maps automatically generated from only two surveillance cameras, superior to precedent systems that require manual map construction or plentiful captured images. iVR employs a particle filter to fuse raw estimates from multiple systems, including vision, radio, and inertial sensor systems. By doing so, iVR outputs enhanced accuracy with zero start-up costs, while overcoming the respective drawbacks of each individual sub-system. We implement iVR on commodity smartphones and validate its performance in five different scenarios. The results show that iVR achieves a remarkable localization accuracy of 0.7m, outperforming the state-of-the-art systems by &gt;70%.<\/jats:p>","DOI":"10.1145\/3351272","type":"journal-article","created":{"date-parts":[[2019,9,10]],"date-time":"2019-09-10T15:58:26Z","timestamp":1568131106000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":32,"title":["iVR"],"prefix":"10.1145","volume":"3","author":[{"given":"Jingao","family":"Xu","sequence":"first","affiliation":[{"name":"School of Software and BNRist, Tsinghua University, Beijing, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hengjie","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Software and BNRist, Tsinghua University, Beijing, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kun","family":"Qian","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of California San Diego, San Diego, CA, US"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erqun","family":"Dong","sequence":"additional","affiliation":[{"name":"School of Software and BNRist, Tsinghua University, Beijing, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Software and BNRist, Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chenshu","family":"Wu","sequence":"additional","affiliation":[{"name":"Department of Electrical &amp; Computer Engineering, University of Maryland, College Park, Washington DC, MD, US"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Li","family":"Zhang","sequence":"additional","affiliation":[{"name":"HeFei University of Technology, HeFei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zheng","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Software and BNRist, Tsinghua University, Beijing, P.R. China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,9,9]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2015.2478451"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the USENIX NSDI.","author":"Adib Fadel","year":"2014","unstructured":"Fadel Adib , Zach Kabelac , Dina Katabi , and Robert C Miller . 2014 . 3D tracking via body radio reflections . In Proceedings of the USENIX NSDI. Fadel Adib, Zach Kabelac, Dina Katabi, and Robert C Miller. 2014. 3D tracking via body radio reflections. In Proceedings of the USENIX NSDI."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995311"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1614320.1614350"},{"volume-title":"Fully-convolutional siamese networks for object tracking","author":"Bertinetto Luca","key":"e_1_2_1_5_1","unstructured":"Luca Bertinetto , Jack Valmadre , Joao F Henriques , Andrea Vedaldi , and Philip HS Torr . 2016. Fully-convolutional siamese networks for object tracking . In ECCV. Springer . Luca Bertinetto, Jack Valmadre, Joao F Henriques, Andrea Vedaldi, and Philip HS Torr. 2016. Fully-convolutional siamese networks for object tracking. In ECCV. Springer."},{"key":"e_1_2_1_6_1","volume-title":"Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 5793","author":"Betzig Eric","year":"2006","unstructured":"Eric Betzig , George H Patterson , Rachid Sougrat , O Wolf Lindwasser , Scott Olenych , Juan S Bonifacino , Michael W Davidson , Jennifer Lippincott-Schwartz , and Harald F Hess . 2006. Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 5793 ( 2006 ), 1642--1645. Eric Betzig, George H Patterson, Rachid Sougrat, O Wolf Lindwasser, Scott Olenych, Juan S Bonifacino, Michael W Davidson, Jennifer Lippincott-Schwartz, and Harald F Hess. 2006. Imaging intracellular fluorescent proteins at nanometer resolution. Science 313, 5793 (2006), 1642--1645."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3214266"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.159902"},{"key":"e_1_2_1_9_1","volume-title":"Computer Vision-ECCV","author":"Doll\u00e1r Piotr","year":"2012","unstructured":"Piotr Doll\u00e1r , Ron Appel , and Wolf Kienzle . 2012. Crosstalk cascades for frame-rate pedestrian detection . In Computer Vision-ECCV 2012 . Springer , 645--659. Piotr Doll\u00e1r, Ron Appel, and Wolf Kienzle. 2012. Crosstalk cascades for frame-rate pedestrian detection. In Computer Vision-ECCV 2012. Springer, 645--659."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2019.8737640"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the IJCAI.","author":"Ferris Brian","year":"2007","unstructured":"Brian Ferris , Dieter Fox , and Neil Lawrence . 2007 . WiFi-SLAM using Gaussian process latent variable models . In Proceedings of the IJCAI. Brian Ferris, Dieter Fox, and Neil Lawrence. 2007. WiFi-SLAM using Gaussian process latent variable models. In Proceedings of the IJCAI."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2639108.2639134"},{"key":"e_1_2_1_13_1","first-page":"40","article-title":"Handbook of propagation effects for vehicular and personal mobile satellite systems","volume":"1274","author":"Goldhirsh Julius","year":"1998","unstructured":"Julius Goldhirsh and Wolfhard J Vogel . 1998 . Handbook of propagation effects for vehicular and personal mobile satellite systems . NASA Reference Publication 1274 (1998), 40 -- 67 . Julius Goldhirsh and Wolfhard J Vogel. 1998. Handbook of propagation effects for vehicular and personal mobile satellite systems. NASA Reference Publication 1274 (1998), 40--67.","journal-title":"NASA Reference Publication"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539819"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/647988.741478"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.322"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2016.2558191"},{"key":"e_1_2_1_18_1","volume-title":"High-speed tracking with kernelized correlation filters","author":"Henriques Jo\u00e3o F","year":"2015","unstructured":"Jo\u00e3o F Henriques , Rui Caseiro , Pedro Martins , and Jorge Batista . 2015. High-speed tracking with kernelized correlation filters . IEEE transactions on pattern analysis and machine intelligence 37, 3 ( 2015 ), 583--596. Jo\u00e3o F Henriques, Rui Caseiro, Pedro Martins, and Jorge Batista. 2015. High-speed tracking with kernelized correlation filters. IEEE transactions on pattern analysis and machine intelligence 37, 3 (2015), 583--596."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2632048.2636079"},{"key":"e_1_2_1_20_1","volume-title":"New Jersey","author":"Hogg Robert V","year":"1995","unstructured":"Robert V Hogg and Allen T Craig . 1995. Introduction to mathematical statistics.(5\"\" edition). Englewood Hills , New Jersey ( 1995 ). Robert V Hogg and Allen T Craig. 1995. Introduction to mathematical statistics.(5\"\" edition). Englewood Hills, New Jersey (1995)."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2011.5979643"},{"volume-title":"Machine vision","author":"Jain Ramesh","key":"e_1_2_1_22_1","unstructured":"Ramesh Jain , Rangachar Kasturi , and Brian G Schunck . 1995. Machine vision . Vol. 5 . McGraw-Hill New York . Ramesh Jain, Rangachar Kasturi, and Brian G Schunck. 1995. Machine vision. Vol. 5. McGraw-Hill New York."},{"key":"e_1_2_1_23_1","volume-title":"245 million video surveillance cameras installed globally","author":"Jenkins Niall","year":"2014","unstructured":"Niall Jenkins . 2015. 245 million video surveillance cameras installed globally in 2014 . IHS Technology ( 2015). Niall Jenkins. 2015. 245 million video surveillance cameras installed globally in 2014. IHS Technology (2015)."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2017.03.015"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517351.2517352"},{"key":"e_1_2_1_26_1","volume-title":"Snakes: Active contour models. International journal of computer vision 1, 4","author":"Kass Michael","year":"1988","unstructured":"Michael Kass , Andrew Witkin , and Demetri Terzopoulos . 1988 . Snakes: Active contour models. International journal of computer vision 1, 4 (1988), 321--331. Michael Kass, Andrew Witkin, and Demetri Terzopoulos. 1988. Snakes: Active contour models. International journal of computer vision 1, 4 (1988), 321--331."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1364\/JOSAA.8.000377"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2785956.2787487"},{"key":"e_1_2_1_29_1","volume-title":"The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2","author":"Kuhn Harold W","year":"1955","unstructured":"Harold W Kuhn . 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2 ( 1955 ), 83--97. Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1-2 (1955), 83--97."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-008-0152-6"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the USENIX NSDI.","author":"Li Liqun","year":"2014","unstructured":"Liqun Li , Pan Hu , Chunyi Peng , Guobin Shen , and Feng Zhao . 2014 . Epsilon: A visible light based positioning system . In Proceedings of the USENIX NSDI. Liqun Li, Pan Hu, Chunyi Peng, Guobin Shen, and Feng Zhao. 2014. Epsilon: A visible light based positioning system. In Proceedings of the USENIX NSDI."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995730"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2348543.2348581"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3210240.3210342"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2307636.2307656"},{"volume-title":"Binocular vision: development, depth perception and disorders","author":"McCoun Jacques","key":"e_1_2_1_38_1","unstructured":"Jacques McCoun and Lucien Reeves . 2010. Binocular vision: development, depth perception and disorders . Nova Science Publishers, Inc. Jacques McCoun and Lucien Reeves. 2010. Binocular vision: development, depth perception and disorders. Nova Science Publishers, Inc."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASS.2014.52"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1322263.1322265"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995604"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2742647.2742666"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2348543.2348580"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126302"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2016.2590996"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIE.2015.2509917"},{"volume-title":"Sequential Monte Carlo methods in practice","author":"Smith Adrian","key":"e_1_2_1_47_1","unstructured":"Adrian Smith . 2013. Sequential Monte Carlo methods in practice . Springer Science & Business Media . Adrian Smith. 2013. Sequential Monte Carlo methods in practice. Springer Science & Business Media."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/1141911.1141964"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2013.2274283"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2307636.2307655"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2973750.2973776"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2013.25"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3090094"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2014.2320254"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2017.2737004"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2015.7218639"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the USENIX NSDI.","author":"Xiong Jie","year":"2013","unstructured":"Jie Xiong and Kyle Jamieson . 2013 . ArrayTrack: a fine-grained indoor location system . In Proceedings of the USENIX NSDI. Jie Xiong and Kyle Jamieson. 2013. ArrayTrack: a fine-grained indoor location system. In Proceedings of the USENIX NSDI."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/2750858.2807516"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/2971648.2971668"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASS.2018.00050"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/2348543.2348578"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/2676430"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11276-006-0725-7"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/TETC.2016.2614383"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2639108.2639110"}],"container-title":["Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3351272","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3351272","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:25:51Z","timestamp":1750206351000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3351272"}},"subtitle":["Integrated Vision and Radio Localization with Zero Human Effort"],"short-title":[],"issued":{"date-parts":[[2019,9,9]]},"references-count":65,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2019,9,9]]}},"alternative-id":["10.1145\/3351272"],"URL":"https:\/\/doi.org\/10.1145\/3351272","relation":{},"ISSN":["2474-9567"],"issn-type":[{"type":"electronic","value":"2474-9567"}],"subject":[],"published":{"date-parts":[[2019,9,9]]},"assertion":[{"value":"2019-09-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}