{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:31:38Z","timestamp":1750221098526,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,12,18]],"date-time":"2018-12-18T00:00:00Z","timestamp":1545091200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,12,18]]},"DOI":"10.1145\/3293353.3293376","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T22:07:32Z","timestamp":1588630052000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Spatio-Temporal Grids for Daily Living Action Recognition"],"prefix":"10.1145","author":[{"given":"Srijan","family":"Das","sequence":"first","affiliation":[{"name":"INRIA, Sophia Antipolis, Valbonne, Nice"}]},{"given":"Kaustubh","family":"Sakhalkar","sequence":"additional","affiliation":[{"name":"INRIA, Sophia Antipolis, Valbonne, Nice"}]},{"given":"Michal","family":"Koperski","sequence":"additional","affiliation":[{"name":"INRIA, Sophia Antipolis, Valbonne, Nice"}]},{"given":"Francois","family":"Bremond","sequence":"additional","affiliation":[{"name":"INRIA, Sophia Antipolis, Valbonne, Nice"}]}],"member":"320","published-online":{"date-parts":[[2020,5,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Mart\u00edn Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org.  Mart\u00edn Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2017.77"},{"volume-title":"Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Baradel Fabien","key":"e_1_3_2_1_3_1","unstructured":"Fabien Baradel , Christian Wolf , Julien Mille , and Graham W. Taylor . 2018 . Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Fabien Baradel, Christian Wolf, Julien Mille, and Graham W. Taylor. 2018. Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.502"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.28.6"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Guilhem Cheron Ivan Laptev and Cordelia Schmid. 2015. P-CNN: Pose-based CNN Features for Action Recognition. In ICCV.  Guilhem Cheron Ivan Laptev and Cordelia Schmid. 2015. P-CNN: Pose-based CNN Features for Action Recognition. In ICCV.","DOI":"10.1109\/ICCV.2015.368"},{"key":"e_1_3_2_1_7_1","unstructured":"Fran\u00e7ois Chollet et al. 2015. Keras. (2015).  Fran\u00e7ois Chollet et al. 2015. Keras. (2015)."},{"key":"e_1_3_2_1_8_1","unstructured":"Srijan Das Michal Koperski Francois Bremond and Gianpiero Francesca. 2017. Action Recognition based on a mixture of RGB and Depth based skeleton. In AVSS.  Srijan Das Michal Koperski Francois Bremond and Gianpiero Francesca. 2017. Action Recognition based on a mixture of RGB and Depth based skeleton. In AVSS."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"S. Das M. Koperski F. Bremond and G. Francesca. 2018. A Fusion of Appearance based CNNs and Temporal evolution of Skeleton with LSTM for Daily Living Action Recognition. ArXiv e-prints (Feb. 2018). arXiv:cs.CV\/1802.00421  S. Das M. Koperski F. Bremond and G. Francesca. 2018. A Fusion of Appearance based CNNs and Temporal evolution of Skeleton with LSTM for Daily Living Action Recognition. ArXiv e-prints (Feb. 2018). arXiv:cs.CV\/1802.00421","DOI":"10.1109\/AVSS.2018.8639122"},{"key":"e_1_3_2_1_10_1","volume-title":"Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Donahue Jeffrey","year":"2015","unstructured":"Jeffrey Donahue , Lisa Anne Hendricks , Sergio Guadarrama , Marcus Rohrbach , Subhashini Venugopalan , Kate Saenko , and Trevor Darrell . 2015 . Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Jeffrey Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell. 2015. Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.213"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"#cr-split#-e_1_3_2_1_13_1.1","doi-asserted-by":"crossref","unstructured":"Jian-Fang Hu Wei-Shi Zheng Jianhuang Lai and JianGuo Zhang. 2015. Jointly learning heterogeneous features for RGB-D activity recognition. In CVPR. https:\/\/doi.org\/10.1109\/CVPR.2015.7299172 10.1109\/CVPR.2015.7299172","DOI":"10.1109\/CVPR.2015.7299172"},{"key":"#cr-split#-e_1_3_2_1_13_1.2","doi-asserted-by":"crossref","unstructured":"Jian-Fang Hu Wei-Shi Zheng Jianhuang Lai and JianGuo Zhang. 2015. Jointly learning heterogeneous features for RGB-D activity recognition. In CVPR. https:\/\/doi.org\/10.1109\/CVPR.2015.7299172","DOI":"10.1109\/CVPR.2015.7299172"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2640292"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Andrej Karpathy George Toderici Sanketh Shetty Thomas Leung Rahul Sukthankar and Li Fei-Fei. 2014. Large-Scale Video Classification with Convolutional Neural Networks. In CVPR.  Andrej Karpathy George Toderici Sanketh Shetty Thomas Leung Rahul Sukthankar and Li Fei-Fei. 2014. Large-Scale Video Classification with Convolutional Neural Networks. In CVPR.","DOI":"10.1109\/CVPR.2014.223"},{"key":"e_1_3_2_1_16_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2014","unstructured":"Diederik P. Kingma and Jimmy Ba . 2014 . Adam : A Method for Stochastic Optimization. CoRR abs\/1412.6980 (2014). arXiv:1412.6980 http:\/\/arxiv.org\/abs\/1412.6980 Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. CoRR abs\/1412.6980 (2014). arXiv:1412.6980 http:\/\/arxiv.org\/abs\/1412.6980"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Yu Kong and Yun Fu. 2015. Bilinear heterogeneous information machine for RGB-D action recognition. In CVPR.  Yu Kong and Yun Fu. 2015. Bilinear heterogeneous information machine for RGB-D action recognition. In CVPR.","DOI":"10.1109\/CVPR.2015.7298708"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Michal Koperski Piotr Bilinski and Fran\u00e7ois Bremond. 2014. 3D Trajectories for Action Recognition. In ICIP. https:\/\/hal.inria.fr\/hal-01054949  Michal Koperski Piotr Bilinski and Fran\u00e7ois Bremond. 2014. 3D Trajectories for Action Recognition. In ICIP. https:\/\/hal.inria.fr\/hal-01054949","DOI":"10.1109\/ICIP.2014.7025848"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/AVSS.2016.7738023"},{"key":"e_1_3_2_1_20_1","first-page":"8","article-title":"Learning Human Activities and Object Affordances from RGB-D","volume":"32","author":"Koppula Hema Swetha","year":"2013","unstructured":"Hema Swetha Koppula , Rudhir Gupta , and Ashutosh Saxena . 2013 . Learning Human Activities and Object Affordances from RGB-D Videos. Int. J. Rob. Res. 32 , 8 (July 2013), 951--970. https:\/\/doi.org\/10.1177\/0278364913478446 10.1177\/0278364913478446 Hema Swetha Koppula, Rudhir Gupta, and Ashutosh Saxena. 2013. Learning Human Activities and Object Affordances from RGB-D Videos. Int. J. Rob. Res. 32, 8 (July 2013), 951--970. https:\/\/doi.org\/10.1177\/0278364913478446","journal-title":"Videos. Int. J. Rob. Res."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the 30th International Conference on International Conference on Machine Learning -","volume":"28","author":"Hema","unstructured":"Hema S. Koppula and Ashutosh Saxena. 2013. Learning Spatio-temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation . In Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28 (ICML'13). JMLR.org, III-792-III-800. http:\/\/dl.acm.org\/citation.cfm?id=3042817.3043025 Hema S. Koppula and Ashutosh Saxena. 2013. Learning Spatio-temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation. In Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28 (ICML'13). JMLR.org, III-792-III-800. http:\/\/dl.acm.org\/citation.cfm?id=3042817.3043025"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0876-z"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46487-9_50"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV.2016.7477694"},{"key":"e_1_3_2_1_25_1","volume-title":"Range-Sample Depth Feature for Action Recognition. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. 772--779","author":"Lu C.","year":"2014","unstructured":"C. Lu , J. Jia , and C. K. Tang . 2014 . Range-Sample Depth Feature for Action Recognition. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. 772--779 . https:\/\/doi.org\/10.1109\/CVPR. 2014 .104 10.1109\/CVPR.2014.104 C. Lu, J. Jia, and C. K. Tang. 2014. Range-Sample Depth Feature for Action Recognition. In 2014 IEEE Conference on Computer Vision and Pattern Recognition. 772--779. https:\/\/doi.org\/10.1109\/CVPR.2014.104"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.333"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Omar Oreifej and Zicheng Liu. 2013. HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences. In CVPR.  Omar Oreifej and Zicheng Liu. 2013. HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences. In CVPR.","DOI":"10.1109\/CVPR.2013.98"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888101"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.115"},{"key":"e_1_3_2_1_30_1","first-page":"1","article-title":"Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos","volume":"99","author":"Shahroudy A.","year":"2017","unstructured":"A. Shahroudy , T. T. Ng , Y. Gong , and G. Wang . 2017 . Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos . IEEE Transactions on Pattern Analysis and Machine Intelligence PP , 99 (2017), 1 -- 1 . https:\/\/doi.org\/10.1109\/TPAMI.2017.2691321 10.1109\/TPAMI.2017.2691321 A. Shahroudy, T. T. Ng, Y. Gong, and G. Wang. 2017. Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence PP, 99 (2017), 1--1. https:\/\/doi.org\/10.1109\/TPAMI.2017.2691321","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence PP"},{"key":"e_1_3_2_1_31_1","volume-title":"Action Recognition using Visual Attention. arXiv preprint arXiv:1511.04119","author":"Sharma Shikhar","year":"2015","unstructured":"Shikhar Sharma , Ryan Kiros , and Ruslan Salakhutdinov . 2015. Action Recognition using Visual Attention. arXiv preprint arXiv:1511.04119 ( 2015 ). Shikhar Sharma, Ryan Kiros, and Ruslan Salakhutdinov. 2015. Action Recognition using Visual Attention. arXiv preprint arXiv:1511.04119 (2015)."},{"key":"e_1_3_2_1_32_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Two-stream convolutional networks for action recognition in videos. In Advances in neural information processing systems. 568--576.  Karen Simonyan and Andrew Zisserman. 2014. Two-stream convolutional networks for action recognition in videos. In Advances in neural information processing systems. 568--576."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Jaeyong Sung Colin Ponce Bart Selman and Ashutosh Saxena. 2012. Unstructured Human Activity Detection from RGBD Images. In ICRA.  Jaeyong Sung Colin Ponce Bart Selman and Ashutosh Saxena. 2012. Unstructured Human Activity Detection from RGBD Images. In ICRA.","DOI":"10.1109\/ICRA.2012.6224591"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.510"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995407"},{"key":"e_1_3_2_1_36_1","volume-title":"Action Recognition with Improved Trajectories. In IEEE International Conference on Computer Vision","author":"Wang Heng","year":"2013","unstructured":"Heng Wang and Cordelia Schmid . 2013 . Action Recognition with Improved Trajectories. In IEEE International Conference on Computer Vision . Sydney, Australia. http:\/\/hal.inria.fr\/hal-00873267 Heng Wang and Cordelia Schmid. 2013. Action Recognition with Improved Trajectories. In IEEE International Conference on Computer Vision. Sydney, Australia. http:\/\/hal.inria.fr\/hal-00873267"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299059"},{"key":"e_1_3_2_1_38_1","unstructured":"Ying Wu. 2012. Mining Actionlet Ensemble for Action Recognition with Depth Cameras. In CVPR.  Ying Wu. 2012. Mining Actionlet Ensemble for Action Recognition with Depth Cameras. In CVPR."},{"key":"e_1_3_2_1_39_1","volume-title":"On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV). 148--157","author":"Zhang S.","year":"2017","unstructured":"S. Zhang , X. Liu , and J. Xiao . 2017 . On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV). 148--157 . https:\/\/doi.org\/10.1109\/WACV. 2017 .24 10.1109\/WACV.2017.24 S. Zhang, X. Liu, and J. Xiao. 2017. On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks. In 2017 IEEE Winter Conference on Applications of Computer Vision (WACV). 148--157. https:\/\/doi.org\/10.1109\/WACV.2017.24"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.316"}],"event":{"name":"ICVGIP 2018: 11th Indian Conference on Computer Vision, Graphics and Image Processing","acronym":"ICVGIP 2018","location":"Hyderabad India"},"container-title":["Proceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3293353.3293376","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3293353.3293376","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:58:08Z","timestamp":1750208288000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3293353.3293376"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,12,18]]},"references-count":41,"alternative-id":["10.1145\/3293353.3293376","10.1145\/3293353"],"URL":"https:\/\/doi.org\/10.1145\/3293353.3293376","relation":{},"subject":[],"published":{"date-parts":[[2018,12,18]]},"assertion":[{"value":"2020-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}