{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,16]],"date-time":"2025-11-16T21:51:16Z","timestamp":1763329876569,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":49,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,3,7]],"date-time":"2021-03-07T00:00:00Z","timestamp":1615075200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,7]]},"DOI":"10.1145\/3444685.3446257","type":"proceedings-article","created":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T04:48:41Z","timestamp":1620103721000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Incremental multi-view object detection from a moving camera"],"prefix":"10.1145","author":[{"given":"Takashi","family":"Konno","sequence":"first","affiliation":[{"name":"AIST, Japan"}]},{"given":"Ayako","family":"Amma","sequence":"additional","affiliation":[{"name":"Toyota Motor Corporation, Japan"}]},{"given":"Asako","family":"Kanezaki","sequence":"additional","affiliation":[{"name":"AIST, Japan"}]}],"member":"320","published-online":{"date-parts":[[2021,5,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n.d.]. The Princeton ModelNet. http:\/\/modelnet.cs.princeton.edu\/.  [n.d.]. The Princeton ModelNet. http:\/\/modelnet.cs.princeton.edu\/."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10593-2_29"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247992"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995462"},{"key":"e_1_3_2_1_5_1","volume-title":"Object Co-detection. In Proceedings of European Conference on Computer Vision (ECCV).","author":"Bao Sid Yingze","year":"2012","unstructured":"Sid Yingze Bao , Yu Xiang , and Silvio Savarese . 2012 . Object Co-detection. In Proceedings of European Conference on Computer Vision (ECCV). Sid Yingze Bao, Yu Xiang, and Silvio Savarese. 2012. Object Co-detection. In Proceedings of European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01258-8_21"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.236"},{"key":"e_1_3_2_1_8_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems (NIPS).","author":"Chen Xiaozhi","year":"2015","unstructured":"Xiaozhi Chen , Kaustav Kundu , Yukun Zhu , Andrew G Berneshawi , Huimin Ma , Sanja Fidler , and Raquel Urtasun . 2015 . 3d object proposals for accurate object class detection . In Proceedings of Advances in Neural Information Processing Systems (NIPS). Xiaozhi Chen, Kaustav Kundu, Yukun Zhu, Andrew G Berneshawi, Huimin Ma, Sanja Fidler, and Raquel Urtasun. 2015. 3d object proposals for accurate object class detection. In Proceedings of Advances in Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.691"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01249-6_28"},{"key":"e_1_3_2_1_11_1","volume-title":"Antonino Furnari, Jian Ma, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, and Michael Wray.","author":"Damen Dima","year":"2020","unstructured":"Dima Damen , Hazel Doughty , Giovanni Maria Farinella , , Antonino Furnari, Jian Ma, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, and Michael Wray. 2020 . Rescaling Egocentric Vision. CoRR abs\/2006.13256 (2020). Dima Damen, Hazel Doughty, Giovanni Maria Farinella, , Antonino Furnari, Jian Ma, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, and Michael Wray. 2020. Rescaling Egocentric Vision. CoRR abs\/2006.13256 (2020)."},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of European Conference on Computer Vision (ECCV).","author":"Damen Dima","year":"2018","unstructured":"Dima Damen , Hazel Doughty , Giovanni Maria Farinella , Sanja Fidler , Antonino Furnari , Evangelos Kazakos , Davide Moltisanti , Jonathan Munro , Toby Perrett , Will Price , and Michael Wray . 2018 . Scaling Egocentric Vision: The EPIC-KITCHENS Dataset . In Proceedings of European Conference on Computer Vision (ECCV). Dima Damen, Hazel Doughty, Giovanni Maria Farinella, Sanja Fidler, Antonino Furnari, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, and Michael Wray. 2018. Scaling Egocentric Vision: The EPIC-KITCHENS Dataset. In Proceedings of European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_16"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.146"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of International Conference on Machine Learning (ICML).","author":"Elhoseiny Mohamed","year":"2016","unstructured":"Mohamed Elhoseiny , Tarek El-Gaaly , Amr Bakry , and Ahmed Elgammal . 2016 . A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation . In Proceedings of International Conference on Machine Learning (ICML). Mohamed Elhoseiny, Tarek El-Gaaly, Amr Bakry, and Ahmed Elgammal. 2016. A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation. In Proceedings of International Conference on Machine Learning (ICML)."},{"volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Fioraio N.","key":"e_1_3_2_1_16_1","unstructured":"N. Fioraio and L. Di Stefano . 2013. Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment . In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). N. Fioraio and L. Di Stefano. 2013. Joint Detection, Tracking and Mapping by Semantic Bundle Adjustment. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"volume-title":"Proceedings of IEEE International Conference on Robotics and Automation (ICRA).","author":"Frost D. P.","key":"e_1_3_2_1_17_1","unstructured":"D. P. Frost , O. K\u00e4hler , and D. W. Murray . 2016. Object-aware bundle adjustment for correcting monocular scale drift . In Proceedings of IEEE International Conference on Robotics and Automation (ICRA). D. P. Frost, O. K\u00e4hler, and D. W. Murray. 2016. Object-aware bundle adjustment for correcting monocular scale drift. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2015.08.009"},{"volume-title":"Proceedings of International Conference on Computer Vision (ICCV).","author":"Gay P.","key":"e_1_3_2_1_19_1","unstructured":"P. Gay , V. Bansal , C. Rubino , and A. D. Bue . 2017. Probabilistic Structure from Motion with Objects (PSfMO) . In Proceedings of International Conference on Computer Vision (ICCV). P. Gay, V. Bansal, C. Rubino, and A. D. Bue. 2017. Probabilistic Structure from Motion with Objects (PSfMO). In Proceedings of International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of Asian Conference on Computer Vision (ACCV).","author":"Jafari Omid Hosseini","year":"2018","unstructured":"Omid Hosseini Jafari , Siva Karthik Mustikovela , Karl Pertsch , Eric Brachmann , and Carsten Rother . 2018 . iPose: instance-aware 6D pose estimation of partly occluded objects . In Proceedings of Asian Conference on Computer Vision (ACCV). Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, and Carsten Rother. 2018. iPose: instance-aware 6D pose estimation of partly occluded objects. In Proceedings of Asian Conference on Computer Vision (ACCV)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00526"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.169"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2018.8594049"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6248066"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2014.6907298"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2011.5980382"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of AAAI Conference on Artificial Intelligence.","author":"Lai Kevin","year":"2011","unstructured":"Kevin Lai , Liefeng Bo , Xiaofeng Ren , and Dieter Fox . 2011 . A Scalable Tree-based Approach for Joint Object and Pose Recognition . In Proceedings of AAAI Conference on Artificial Intelligence. Kevin Lai, Liefeng Bo, Xiaofeng Ren, and Dieter Fox. 2011. A Scalable Tree-based Approach for Joint Object and Pose Recognition. In Proceedings of AAAI Conference on Artificial Intelligence."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2012.6225316"},{"volume-title":"Proceedings of European Conference on Computer Vision (ECCV).","author":"Li Chi","key":"e_1_3_2_1_30_1","unstructured":"Chi Li , Jin Bai , and Gregory D. Hager . 2018. A Unified Framework for Multi-view Multi-class Object Pose Estimation . In Proceedings of European Conference on Computer Vision (ECCV). Chi Li, Jin Bai, and Gregory D. Hager. 2018. A Unified Framework for Multi-view Multi-class Object Pose Estimation. In Proceedings of European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2018.00015"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2017.2705103"},{"key":"e_1_3_2_1_33_1","volume-title":"Poisson image editing. ACM Transactions on graphics (TOG) 22, 3","author":"P\u00e9rez Patrick","year":"2003","unstructured":"Patrick P\u00e9rez , Michel Gangnet , and Andrew Blake . 2003. Poisson image editing. ACM Transactions on graphics (TOG) 22, 3 ( 2003 ), 313--318. Patrick P\u00e9rez, Michel Gangnet, and Andrew Blake. 2003. Poisson image editing. ACM Transactions on graphics (TOG) 22, 3 (2003), 313--318."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2015.XI.034"},{"key":"e_1_3_2_1_35_1","volume-title":"YOLOv3: An Incremental Improvement. CoRR abs\/1804.02767","author":"Redmon Joseph","year":"2018","unstructured":"Joseph Redmon and Ali Farhadi . 2018. YOLOv3: An Incremental Improvement. CoRR abs\/1804.02767 ( 2018 ). arXiv:1804.02767 Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An Incremental Improvement. CoRR abs\/1804.02767 (2018). arXiv:1804.02767"},{"key":"e_1_3_2_1_36_1","volume-title":"Proceedings of Advances in Neural Information Processing Systems (NIPS).","author":"Ren Shaoqing","year":"2015","unstructured":"Shaoqing Ren , Kaiming He , Ross Girshick , and Jian Sun . 2015 . Faster r-cnn: Towards real-time object detection with region proposal networks . In Proceedings of Advances in Neural Information Processing Systems (NIPS). Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Proceedings of Advances in Neural Information Processing Systems (NIPS)."},{"volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Salas-Moreno R. F.","key":"e_1_3_2_1_37_1","unstructured":"R. F. Salas-Moreno , R. A. Newcombe , H. Strasdat , P. H. J. Kelly , and A. J. Davison . 2013. SLAM++ : Simultaneous Localisation and Mapping at the Level of Objects . In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). R. F. Salas-Moreno, R. A. Newcombe, H. Strasdat, P. H. J. Kelly, and A. J. Davison. 2013. SLAM++ : Simultaneous Localisation and Mapping at the Level of Objects. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_38_1","volume-title":"Proceedings of European Conference on Computer Vision (ECCV).","author":"Simony Martin","year":"2018","unstructured":"Martin Simony , Stefan Milzy , Karl Amendey , and Horst-Michael Gross . 2018 . Complex-YOLO: an Euler-region-proposal for real-time 3D object detection on point clouds . In Proceedings of European Conference on Computer Vision (ECCV). Martin Simony, Stefan Milzy, Karl Amendey, and Horst-Michael Gross. 2018. Complex-YOLO: an Euler-region-proposal for real-time 3D object detection on point clouds. In Proceedings of European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.94"},{"volume-title":"Proceedings of International Conference on Computer Vision (ICCV).","author":"Su Hang","key":"e_1_3_2_1_40_1","unstructured":"Hang Su , Subhransu Maji , Evangelos Kalogerakis , and Erik G . Learned-Miller. 2015. Multi-view convolutional neural networks for 3D shape recognition . In Proceedings of International Conference on Computer Vision (ICCV). Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik G. Learned-Miller. 2015. Multi-view convolutional neural networks for 3D shape recognition. In Proceedings of International Conference on Computer Vision (ICCV)."},{"volume-title":"Proceedings of International Conference on Computer Vision (ICCV).","author":"Su Hao","key":"e_1_3_2_1_41_1","unstructured":"Hao Su , Charles R. Qi , Yangyan Li , and Leonidas J. Guibas . 2015. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views . In Proceedings of International Conference on Computer Vision (ICCV). Hao Su, Charles R. Qi, Yangyan Li, and Leonidas J. Guibas. 2015. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views. In Proceedings of International Conference on Computer Vision (ICCV)."},{"volume-title":"Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS).","author":"S\u00fcnderhauf N.","key":"e_1_3_2_1_42_1","unstructured":"N. S\u00fcnderhauf , T. T. Pham , Y. Latif , M. Milford , and I. Reid . 2017. Meaningful maps with object-oriented semantic mapping . In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS). N. S\u00fcnderhauf, T. T. Pham, Y. Latif, M. Milford, and I. Reid. 2017. Meaningful maps with object-oriented semantic mapping. In Proceedings of IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS)."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_43"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.311"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298800"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.15607\/RSS.2018.XIV.019"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40648-019-0132-3"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cosrev.2018.03.001"},{"key":"e_1_3_2_1_49_1","volume-title":"Proceedings of AAAI Conference on Artificial Intelligence.","author":"Zhang Haopeng","year":"2013","unstructured":"Haopeng Zhang , Tarek El-Gaaly , Ahmed M Elgammal , and Zhiguo Jiang . 2013 . Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis . In Proceedings of AAAI Conference on Artificial Intelligence. Haopeng Zhang, Tarek El-Gaaly, Ahmed M Elgammal, and Zhiguo Jiang. 2013. Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis. In Proceedings of AAAI Conference on Artificial Intelligence."}],"event":{"name":"MMAsia '20: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Virtual Event Singapore","acronym":"MMAsia '20"},"container-title":["Proceedings of the 2nd ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446257","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3444685.3446257","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:47:46Z","timestamp":1750193266000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446257"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,7]]},"references-count":49,"alternative-id":["10.1145\/3444685.3446257","10.1145\/3444685"],"URL":"https:\/\/doi.org\/10.1145\/3444685.3446257","relation":{},"subject":[],"published":{"date-parts":[[2021,3,7]]},"assertion":[{"value":"2021-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}