{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:18:27Z","timestamp":1750306707654,"version":"3.41.0"},"reference-count":26,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2015,1,7]],"date-time":"2015-01-07T00:00:00Z","timestamp":1420588800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Fundation","doi-asserted-by":"publisher","award":["CNS 0917082"],"award-info":[{"award-number":["CNS 0917082"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2015,1,7]]},"abstract":"<jats:p>Live video computing (LVC) on distributed smart cameras has many important applications; and a database approach based on a Live Video DataBase Management System (LVDBMS) has shown to be effective for general LVC application development. The performance of such a database system relies on accurate interpretation of spatial relationships among objects in the live video. With the popularity of affordable depth cameras, 3D spatial computation techniques have been applied. However, the 3D object models currently used are expensive to compute, and offer limited scalability. We address this drawback in this article by proposing an octree-based 3D spatial logic and presenting algorithms for computing 3D spatial relationships using depth cameras. To support continuous query processing on live video streams, we also develop a GPU-based implementation of the proposed technique to further enhance scalability for real-time applications. Extensive performance studies based on a public RGB-D dataset as well as the LVDBMS prototype demonstrates the correctness and efficiency of our techniques.<\/jats:p>","DOI":"10.1145\/2645864","type":"journal-article","created":{"date-parts":[[2015,1,12]],"date-time":"2015-01-12T20:02:10Z","timestamp":1421092930000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing"],"prefix":"10.1145","volume":"11","author":[{"given":"Jun","family":"Ye","sequence":"first","affiliation":[{"name":"University of Central Florida"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kien A.","family":"Hua","sequence":"additional","affiliation":[{"name":"University of Central Florida"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2015,1,7]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1631024.1631037"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-006-0063-8"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00530-011-0245-x"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.777378"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2005.857095"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the 24th CIB-W78 Conference on IT in Construction.","author":"Borrmann Andr\u00e9","year":"2007","unstructured":"Andr\u00e9 Borrmann , Stefanie Schraufstetter , and Ernst Rank . 2007 . An octree-based implementation of directional operators in a 3D spatial query language for building information models . In Proceedings of the 24th CIB-W78 Conference on IT in Construction. Andr\u00e9 Borrmann, Stefanie Schraufstetter, and Ernst Rank. 2007. An octree-based implementation of directional operators in a 3D spatial query language for building information models. In Proceedings of the 24th CIB-W78 Conference on IT in Construction."},{"volume-title":"Proceedings of the 7th IEEE International Conference on Computer Vision. 941--947","author":"Coughlan J. M.","key":"e_1_2_1_7_1","unstructured":"J. M. Coughlan and A. L. Yuille . 1999. ManhattanWorld: compass direction from a single image by Bayesian inference . In Proceedings of the 7th IEEE International Conference on Computer Vision. 941--947 . J. M. Coughlan and A. L. Yuille. 1999. ManhattanWorld: compass direction from a single image by Bayesian inference. In Proceedings of the 7th IEEE International Conference on Computer Vision. 941--947."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 17th Annual Conference of the Cognitive Science Society.","author":"Gapp Klaus-Peter","year":"1994","unstructured":"Klaus-Peter Gapp . 1994 . From vision to language: A cognitive approach to the computation of spatial relations in 3D space . In Proceedings of the 17th Annual Conference of the Cognitive Science Society. Klaus-Peter Gapp. 1994. From vision to language: A cognitive approach to the computation of spatial relations in 3D space. In Proceedings of the 17th Annual Conference of the Cognitive Science Society."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.8"},{"volume-title":"Proceedings of the 15th International Conference on Advanced Robotics. 421--426","author":"Kasper A.","key":"e_1_2_1_11_1","unstructured":"A. Kasper , R. Jakel , and R. Dillmann . 2011. Using spatial relations of objects in real world scenes for scene structuring and scene understanding . In Proceedings of the 15th International Conference on Advanced Robotics. 421--426 . A. Kasper, R. Jakel, and R. Dillmann. 2011. Using spatial relations of objects in real world scenes for scene structuring and scene understanding. In Proceedings of the 15th International Conference on Advanced Robotics. 421--426."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/526125.845626"},{"volume-title":"Proceedings of the 5th IEEE International Conference on Fuzzy Systems. 118--124","author":"Keller J. M.","key":"e_1_2_1_13_1","unstructured":"J. M. Keller and X. Wang . 1996. Learning spatial relationships in computer vision . In Proceedings of the 5th IEEE International Conference on Fuzzy Systems. 118--124 . J. M. Keller and X. Wang. 1996. Learning spatial relationships in computer vision. In Proceedings of the 5th IEEE International Conference on Fuzzy Systems. 118--124."},{"key":"e_1_2_1_14_1","unstructured":"Khronos. 2013. OpenCL. http:\/\/www.khronos.org\/opencl\/. (2013).  Khronos. 2013. OpenCL. http:\/\/www.khronos.org\/opencl\/. (2013)."},{"key":"e_1_2_1_15_1","first-page":"2","article-title":"Navigation and mapping in large-scale space","volume":"9","author":"Kuipers B. J.","year":"1988","unstructured":"B. J. Kuipers and T. S. Levitt . 1988 . Navigation and mapping in large-scale space . AI Mag. 9 , 2 . B. J. Kuipers and T. S. Levitt. 1988. Navigation and mapping in large-scale space. AI Mag. 9, 2.","journal-title":"AI Mag."},{"volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation.","author":"Lai K.","key":"e_1_2_1_16_1","unstructured":"K. Lai , L. Bo , X. Ren , and D. Fox . 2011. A Large-Scale Hierarchical Multi-View RGBD Object Dataset . In Proceedings of the IEEE International Conference on Robotics and Automation. K. Lai, L. Bo, X. Ren, and D. Fox. 2011. A Large-Scale Hierarchical Multi-View RGBD Object Dataset. In Proceedings of the IEEE International Conference on Robotics and Automation."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/0146-664X(82)90104-6"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/0165-0114(94)90021-3"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.4018\/jitn.2010010103"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1865106.1865112"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364911408155"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2011.09.005"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2011.06.016"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02532791"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2483977.2483998"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2012.6239234"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2645864","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2645864","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:19:11Z","timestamp":1750231151000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2645864"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,1,7]]},"references-count":26,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,1,7]]}},"alternative-id":["10.1145\/2645864"],"URL":"https:\/\/doi.org\/10.1145\/2645864","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"type":"print","value":"1551-6857"},{"type":"electronic","value":"1551-6865"}],"subject":[],"published":{"date-parts":[[2015,1,7]]},"assertion":[{"value":"2013-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-01-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}