{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T19:56:49Z","timestamp":1764705409707,"version":"3.46.0"},"reference-count":55,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["No. 62332016"],"award-info":[{"award-number":["No. 62332016"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018527","name":"Key Research Program of Frontier Science, Chinese Academy of Sciences","doi-asserted-by":"publisher","award":["No. ZDBS-LY-JSC001"],"award-info":[{"award-number":["No. ZDBS-LY-JSC001"]}],"id":[{"id":"10.13039\/501100018527","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."],"published-print":{"date-parts":[[2025,12,2]]},"abstract":"<jats:p>Recently, the rapid advancements of spatially intelligent applications have greatly heightened the demand for context-rich 3D maps on mobile devices. However, existing 3D maps are typically static and interaction-agnostic, making them ill-suited for dynamic user needs, leading to excessive transmission overhead. To address this, we present PocketMap, an interaction-centric 3D map serving system that delivers only the necessary content tailored to each request, considering user intent, environmental context, and available on-device resources. PocketMap adopts a multi-layered, incrementally encoded 3D map representation with varying fidelity levels, enabling adaptive map selection. Leveraging this layered map, we propose a prompt- and context-aware task performance estimator to evaluate the contribution of each map layer. A packet planner then formulates a constrained optimization problem to jointly select appropriate map layers and compression strategies. Together, these components enable on-demand, interaction-aware 3D map serving. We evaluate PocketMap across four environments and four downstream applications: object localization, 3D shape analysis, surface analysis, and image retrieval. User studies show that PocketMap improves user satisfaction by 35.5% compared to the baseline. Extensive evaluations further demonstrate a 45.1% reduction in transmission volume and a 13.7% improvement in task performance compared to the baseline.<\/jats:p>","DOI":"10.1145\/3770680","type":"journal-article","created":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T19:42:32Z","timestamp":1764704552000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["PocketMap: An Interaction-centric 3D Map Serving System for Spatial Understanding on Mobile Devices"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-1589-4222","authenticated-orcid":false,"given":"Xinran","family":"Zhang","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1882-2467","authenticated-orcid":false,"given":"Wuyang","family":"Zhang","sequence":"additional","affiliation":[{"name":"Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, China and University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5315-1839","authenticated-orcid":false,"given":"Hanqi","family":"Zhu","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-0754-3953","authenticated-orcid":false,"given":"Yifan","family":"Duan","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1515-0402","authenticated-orcid":false,"given":"Jianmin","family":"Ji","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China and Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9046-798X","authenticated-orcid":false,"given":"Yanyong","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,12,2]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"17th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 1063\u20131081","author":"Ahmad Fawad","year":"2020","unstructured":"Fawad Ahmad, Hang Qiu, Ray Eells, Fan Bai, and Ramesh Govindan. 2020. CarMap: Fast 3d feature map updates for automobiles. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 1063\u20131081."},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3610911","article-title":"AeroTraj: Trajectory Planning for Fast, and Accurate 3D Reconstruction Using a Drone-based LiDAR","volume":"7","author":"Ahmad Fawad","year":"2023","unstructured":"Fawad Ahmad, Christina Suyong Shin, Rajrup Ghosh, John D'Ambrosio, Eugene Chai, Karthikeyan Sundaresan, and Ramesh Govindan. 2023. AeroTraj: Trajectory Planning for Fast, and Accurate 3D Reconstruction Using a Drone-based LiDAR. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 7, 3 (2023), 1\u201328.","journal-title":"Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)"},{"key":"e_1_2_1_3_1","first-page":"120","article-title":"The opencv library","volume":"25","author":"Bradski Gary","year":"2000","unstructured":"Gary Bradski. 2000. The opencv library. Dr. Dobb's Journal: Software Tools for the Professional Programmer 25, 11 (2000), 120\u2013123.","journal-title":"Dr. Dobb's Journal: Software Tools for the Professional Programmer"},{"key":"e_1_2_1_4_1","unstructured":"John Brooke et al. 1996. SUS-A quick and dirty usability scale. Usability evaluation in industry 189 194 (1996) 4\u20137."},{"key":"e_1_2_1_5_1","volume-title":"Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark. arXiv preprint arXiv:2503.20786","author":"Bsharat Sondos Mahmoud","year":"2025","unstructured":"Sondos Mahmoud Bsharat, Mukul Ranjan, Aidar Myrzakhan, Jiacheng Liu, Bowei Guo, Shengkun Tang, Zhuang Liu, Yuanzhi Li, and Zhiqiang Shen. 2025. Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark. arXiv preprint arXiv:2503.20786 (2025)."},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the AAAI conference on artificial intelligence (AAAI)","volume":"35","author":"Cai Yuxuan","year":"2021","unstructured":"Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, and Yanzhi Wang. 2021. Yolobile: Real-time object detection on mobile devices via compression-compilation co-design. In Proceedings of the AAAI conference on artificial intelligence (AAAI), Vol. 35. 955\u2013963."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3557999"},{"key":"e_1_2_1_8_1","volume-title":"European Conference on Computer Vision (ECCV). Springer, 422\u2013438","author":"Chen Yihang","year":"2024","unstructured":"Yihang Chen, Qianyi Wu, Weiyao Lin, Mehrtash Harandi, and Jianfei Cai. 2024. Hac: Hash-grid assisted context for 3d gaussian splatting compression. In European Conference on Computer Vision (ECCV). Springer, 422\u2013438."},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Yann Collet and Murray Kucherawy. 2018. Zstandard Compression and the application\/zstd Media Type. Technical Report.","DOI":"10.17487\/RFC8478"},{"key":"e_1_2_1_10_1","volume-title":"IEEE INFOCOM 2019-IEEE conference on computer communications (INFOCOM). IEEE, 1189\u20131197","author":"Dong Erqun","year":"2019","unstructured":"Erqun Dong, Jingao Xu, Chenshu Wu, Yunhao Liu, and Zheng Yang. 2019. Pair-navi: Peer-to-peer indoor navigation with mobile visual slam. In IEEE INFOCOM 2019-IEEE conference on computer communications (INFOCOM). IEEE, 1189\u20131197."},{"key":"e_1_2_1_11_1","volume-title":"Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps. Advances in neural information processing systems (NeurIPS) 37","author":"Fan Zhiwen","year":"2024","unstructured":"Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang, et al. 2024. Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps. Advances in neural information processing systems (NeurIPS) 37 (2024), 140138\u2013140158."},{"key":"e_1_2_1_12_1","volume-title":"European Conference on Computer Vision (ECCV). Springer, 165\u2013181","author":"Fang Guangchi","year":"2024","unstructured":"Guangchi Fang and Bing Wang. 2024. Mini-splatting: Representing scenes with a constrained number of gaussians. In European Conference on Computer Vision (ECCV). Springer, 165\u2013181."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448076"},{"key":"e_1_2_1_14_1","unstructured":"GadgetVersus. 2024. Nvidia Jetson AGX Xavier GPU vs ARM Mali-G710 MC10. https:\/\/gadgetversus.com\/graphics-card\/nvidia-jetson-agx-xavier-gpu-vs-arm-mali-g710-mc10\/."},{"key":"e_1_2_1_15_1","volume-title":"European Conference on Computer Vision (ECCV). Springer, 54\u201371","author":"Girish Sharath","year":"2024","unstructured":"Sharath Girish, Kamal Gupta, and Abhinav Shrivastava. 2024. Eagles: Efficient accelerated 3d gaussians with lightweight encodings. In European Conference on Computer Vision (ECCV). Springer, 54\u201371."},{"key":"e_1_2_1_16_1","unstructured":"Google. 2025. Google Maps. https:\/\/maps.google.com. Accessed: 2025-04-28."},{"key":"e_1_2_1_17_1","volume-title":"Mobile-based artificial intelligence chatbot for self-regulated learning in a hybrid flipped classroom. Journal of Computing in Higher Education","author":"Han Insook","year":"2025","unstructured":"Insook Han, Hyangeun Ji, Seoyeon Jin, and Koun Choi. 2025. Mobile-based artificial intelligence chatbot for self-regulated learning in a hybrid flipped classroom. Journal of Computing in Higher Education (2025), 1\u201325."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3659596"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3210424.3210429"},{"key":"e_1_2_1_20_1","first-page":"1","article-title":"Structuresense: Inferring constructive assembly structures from user behaviors","volume":"6","author":"Huang Xincheng","year":"2023","unstructured":"Xincheng Huang, Keylonnie L Miller, Alanson P Sample, and Nikola Banovic. 2023. Structuresense: Inferring constructive assembly structures from user behaviors. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 6, 4 (2023), 1\u201325.","journal-title":"Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592433"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3658160"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00371"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.02052"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3699734"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3636534.3649364"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.1999.790410"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.01952"},{"key":"e_1_2_1_29_1","volume-title":"The Eleventh International Conference on Learning Representations (ICLR).","author":"Ma Xiaojian","year":"2023","unstructured":"Xiaojian Ma, Silong Yong, Zilong Zheng, Qing Li, Yitao Liang, Song-Chun Zhu, and Siyuan Huang. 2023. SQA3D: Situated Question Answering in 3D Scenes. In The Eleventh International Conference on Learning Representations (ICLR)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","first-page":"3531","DOI":"10.1109\/LRA.2022.3146502","article-title":"Volumetric instance-level semantic mapping via multi-view 2D-to-3D label diffusion","volume":"7","author":"Mascaro Ruben","year":"2022","unstructured":"Ruben Mascaro, Lucas Teixeira, and Margarita Chli. 2022. Volumetric instance-level semantic mapping via multi-view 2D-to-3D label diffusion. IEEE Robotics and Automation Letters (RA-L) 7, 2 (2022), 3531\u20133538.","journal-title":"IEEE Robotics and Automation Letters (RA-L)"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3642462"},{"key":"e_1_2_1_32_1","volume-title":"Soroush Abbasi Koohpayegani, and Hamed Pirsiavash.","author":"Navaneet KL","year":"2023","unstructured":"KL Navaneet, Kossar Pourahmadi Meibodi, Soroush Abbasi Koohpayegani, and Hamed Pirsiavash. 2023. Compact3d: Compressing gaussian splat radiance field models with vector quantization. arXiv preprint arXiv:2311.18159 (2023)."},{"key":"e_1_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Jakob Nielsen. 1994. Usability engineering. Morgan Kaufmann.","DOI":"10.1016\/B978-0-08-052029-2.50009-7"},{"key":"e_1_2_1_34_1","volume-title":"Radsplat: Radiance field-informed gaussian splatting for robust real-time rendering with 900+ fps. arXiv preprint arXiv:2403.13806","author":"Niemeyer Michael","year":"2024","unstructured":"Michael Niemeyer, Fabian Manhardt, Marie-Julie Rakotosaona, Michael Oechsle, Daniel Duckworth, Rama Gosula, Keisuke Tateno, John Bates, Dominik Kaeser, and Federico Tombari. 2024. Radsplat: Radiance field-informed gaussian splatting for robust real-time rendering with 900+ fps. arXiv preprint arXiv:2403.13806 (2024)."},{"key":"e_1_2_1_35_1","unstructured":"OpenAI. 2024. GPT-4o. https:\/\/openai.com."},{"key":"e_1_2_1_36_1","volume-title":"Mobile edge intelligence for large language models: A contemporary survey","author":"Qu Guanqiao","year":"2025","unstructured":"Guanqiao Qu, Qiyuan Chen, Wei Wei, Zheng Lin, Xianhao Chen, and Kaibin Huang. 2025. Mobile edge intelligence for large language models: A contemporary survey. IEEE Communications Surveys & Tutorials (2025)."},{"key":"e_1_2_1_37_1","volume-title":"Octree-gs: Towards consistent real-time rendering with lod-structured 3d gaussians. arXiv preprint arXiv:2403.17898","author":"Ren Kerui","year":"2024","unstructured":"Kerui Ren, Lihan Jiang, Tao Lu, Mulin Yu, Linning Xu, Zhangkai Ni, and Bo Dai. 2024. Octree-gs: Towards consistent real-time rendering with lod-structured 3d gaussians. arXiv preprint arXiv:2403.17898 (2024)."},{"key":"e_1_2_1_38_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). 4104\u20134113","author":"Schonberger Johannes L","year":"2016","unstructured":"Johannes L Schonberger and Jan-Michael Frahm. 2016. Structure-from-motion revisited. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). 4104\u20134113."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3587819.3590981"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the 32nd ACM International Conference on Multimedia (MM). 2156\u20132165","author":"Sun Penglei","year":"2024","unstructured":"Penglei Sun, Yaoxian Song, Xiang Liu, Xiaofei Yang, Qiang Wang, Tiefeng Li, Yang Yang, and Xiaowen Chu. 2024. 3d question answering for city scene understanding. In Proceedings of the 32nd ACM International Conference on Multimedia (MM). 2156\u20132165."},{"key":"e_1_2_1_41_1","volume-title":"European conference on computer vision (ECCV). Springer, 1\u201321","author":"Wang Chien-Yao","year":"2024","unstructured":"Chien-Yao Wang, I-Hau Yeh, and Hong-Yuan Mark Liao. 2024. Yolov9: Learning what you want to learn using programmable gradient information. In European conference on computer vision (ECCV). Springer, 1\u201321."},{"key":"e_1_2_1_42_1","volume-title":"European Conference on Computer Vision (ECCV). Springer, 76\u201392","author":"Wang Henan","year":"2024","unstructured":"Henan Wang, Hanxin Zhu, Tianyu He, Runsen Feng, Jiajun Deng, Jiang Bian, and Zhibo Chen. 2024. End-to-end rate-distortion optimized 3d gaussian representation. In European Conference on Computer Vision (ECCV). Springer, 76\u201392."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3659601"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3712268"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/214762.214771"},{"key":"e_1_2_1_46_1","volume-title":"19th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 977\u2013993","author":"Xu Jingao","year":"2022","unstructured":"Jingao Xu, Hao Cao, Zheng Yang, Longfei Shangguan, Jialin Zhang, Xiaowu He, and Yunhao Liu. 2022. SwarmMap: Scaling up real-time collaborative visual SLAM at the edge. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 977\u2013993."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448417"},{"key":"e_1_2_1_48_1","first-page":"21875","article-title":"Depth anything v2","volume":"37","author":"Yang Lihe","year":"2024","unstructured":"Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao, Xiaogang Xu, Jiashi Feng, and Hengshuang Zhao. 2024. Depth anything v2. Advances in Neural Information Processing Systems (NeurIPS) 37 (2024), 21875\u201321911.","journal-title":"Advances in Neural Information Processing Systems (NeurIPS)"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3687971"},{"key":"e_1_2_1_50_1","volume-title":"Proceedings of the 32nd ACM International Conference on Multimedia (MM). 11089\u201311098","author":"Yin Daheng","year":"2024","unstructured":"Daheng Yin, Jianxin Shi, Miao Zhang, Zhaowu Huang, Jiangchuan Liu, and Fang Dong. 2024. FSVFG: Towards Immersive Full-Scene Volumetric Video Streaming with Adaptive Feature Grid. In Proceedings of the 32nd ACM International Conference on Multimedia (MM). 11089\u201311098."},{"key":"e_1_2_1_51_1","volume-title":"19th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 137\u2013154","author":"Zhang Anlan","year":"2022","unstructured":"Anlan Zhang, Chendong Wang, Bo Han, and Feng Qian. 2022. YuZu:Neural-Enhanced volumetric video streaming. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI). 137\u2013154."},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of the 28th Annual International Conference on Mobile Computing And Networking (MobiCom). 528\u2013541","author":"Zhang Jinrui","year":"2022","unstructured":"Jinrui Zhang, Huan Yang, Ju Ren, Deyu Zhang, Bangwen He, Ting Cao, Yuanchun Li, Yaoxue Zhang, and Yunxin Liu. 2022. MobiDepth: Real-time depth estimation using on-device dual cameras. In Proceedings of the 28th Annual International Conference on Mobile Computing And Networking (MobiCom). 528\u2013541."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3636534.3649386"},{"key":"e_1_2_1_54_1","volume-title":"LP-3DGS: Learning to Prune 3D Gaussian Splatting. arXiv preprint arXiv:2405.18784","author":"Zhang Zhaoliang","year":"2024","unstructured":"Zhaoliang Zhang, Tianchen Song, Yongjae Lee, Li Yang, Cheng Peng, Rama Chellappa, and Deliang Fan. 2024. LP-3DGS: Learning to Prune 3D Gaussian Splatting. arXiv preprint arXiv:2405.18784 (2024)."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3678572"}],"container-title":["Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3770680","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,2]],"date-time":"2025-12-02T19:51:53Z","timestamp":1764705113000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3770680"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,2]]},"references-count":55,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,12,2]]}},"alternative-id":["10.1145\/3770680"],"URL":"https:\/\/doi.org\/10.1145\/3770680","relation":{},"ISSN":["2474-9567"],"issn-type":[{"value":"2474-9567","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,2]]},"assertion":[{"value":"2025-12-02","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}