{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:24:40Z","timestamp":1750220680224,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,3,7]],"date-time":"2021-03-07T00:00:00Z","timestamp":1615075200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Sichuan Science and Technology Program","award":["2019YFG0535"],"award-info":[{"award-number":["2019YFG0535"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61832001, 61672133"],"award-info":[{"award-number":["61832001, 61672133"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,3,7]]},"DOI":"10.1145\/3444685.3446296","type":"proceedings-article","created":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T04:48:41Z","timestamp":1620103721000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["RICAPS"],"prefix":"10.1145","author":[{"given":"Abdullah Aman","family":"Khan","sequence":"first","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China"}]},{"given":"Saifullah","family":"Tumrani","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China, Chengdu, China"}]},{"given":"Chunlin","family":"Jiang","sequence":"additional","affiliation":[{"name":"Sichuan Artificial Intelligence Research Institute, Yibin, China"}]},{"given":"Jie","family":"Shao","sequence":"additional","affiliation":[{"name":"Sichuan Artificial Intelligence Research Institute, Yibin, China and University of Electronic Science and Technology of China, Chengdu, China"}]}],"member":"320","published-online":{"date-parts":[[2021,5,3]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683720"},{"key":"e_1_3_2_1_2_1","volume-title":"Dynamic Image Networks for Action Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"Bilen Hakan","year":"2016","unstructured":"Hakan Bilen , Basura Fernando , Efstratios Gavves , Andrea Vedaldi , and Stephen Gould . 2016 . Dynamic Image Networks for Action Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 3034--3042. Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, and Stephen Gould. 2016. Dynamic Image Networks for Action Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 3034--3042."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.502"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2017.2786999"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2599174"},{"key":"e_1_3_2_1_6_1","volume-title":"VideoCapsuleNet: A Simplified Network for Action Detection. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018","author":"Duarte Kevin","year":"2018","unstructured":"Kevin Duarte , Yogesh Singh Rawat , and Mubarak Shah . 2018 . VideoCapsuleNet: A Simplified Network for Action Detection. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018 , NeurIPS 2018, 3--8 December 2018, Montr\u00e9al, Canada. 7621--7630. Kevin Duarte, Yogesh Singh Rawat, and Mubarak Shah. 2018. VideoCapsuleNet: A Simplified Network for Action Detection. In Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3--8 December 2018, Montr\u00e9al, Canada. 7621--7630."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00685"},{"key":"e_1_3_2_1_8_1","volume-title":"Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"He Kaiming","year":"2016","unstructured":"Kaiming He , Xiangyu Zhang , Shaoqing Ren , and Jian Sun . 2016 . Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 770--778. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 770--778."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings, Part IV. 630--645","author":"He Kaiming","year":"2016","unstructured":"Kaiming He , Xiangyu Zhang , Shaoqing Ren , and Jian Sun . 2016 . Identity Mappings in Deep Residual Networks. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016 , Proceedings, Part IV. 630--645 . Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Identity Mappings in Deep Residual Networks. In Computer Vision - ECCV 2016 - 14th European Conference, Amsterdam, The Netherlands, October 11--14, 2016, Proceedings, Part IV. 630--645."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2017.01.010"},{"volume-title":"Proceedings, Part I. 44--51","author":"Hinton Geoffrey E.","key":"e_1_3_2_1_11_1","unstructured":"Geoffrey E. Hinton , Alex Krizhevsky , and Sida D. Wang . 2011. Transforming Auto-Encoders. In Artificial Neural Networks and Machine Learning - ICANN 2011 - 21st International Conference on Artificial Neural Networks, Espoo, Finland, June 14--17, 2011 , Proceedings, Part I. 44--51 . Geoffrey E. Hinton, Alex Krizhevsky, and Sida D. Wang. 2011. Transforming Auto-Encoders. In Artificial Neural Networks and Machine Learning - ICANN 2011 - 21st International Conference on Artificial Neural Networks, Espoo, Finland, June 14--17, 2011, Proceedings, Part I. 44--51."},{"key":"e_1_3_2_1_12_1","volume-title":"6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings.","author":"Hinton Geoffrey E.","year":"2018","unstructured":"Geoffrey E. Hinton , Sara Sabour , and Nicholas Frosst . 2018 . Matrix capsules with EM routing . In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. Geoffrey E. Hinton, Sara Sabour, and Nicholas Frosst. 2018. Matrix capsules with EM routing. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings."},{"key":"e_1_3_2_1_13_1","volume-title":"Content-Aware Summarization of Broadcast Sports Videos: An Audio\u00e2\u0102\u015eVisual Feature Extraction Approach. Neural Processing Letters","author":"Khan Abdullah Aman","year":"2020","unstructured":"Abdullah Aman Khan , Jie Shao , Waqar Ali , and Saifullah Tumrani . 2020. Content-Aware Summarization of Broadcast Sports Videos: An Audio\u00e2\u0102\u015eVisual Feature Extraction Approach. Neural Processing Letters ( 2020 ). Abdullah Aman Khan, Jie Shao, Waqar Ali, and Saifullah Tumrani. 2020. Content-Aware Summarization of Broadcast Sports Videos: An Audio\u00e2\u0102\u015eVisual Feature Extraction Approach. Neural Processing Letters (2020)."},{"volume-title":"Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings.","author":"Diederik","key":"e_1_3_2_1_14_1","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015 . Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126543"},{"key":"e_1_3_2_1_16_1","volume-title":"Computer Vision - ECCV 2018 Workshops - Munich, Germany, September 8--14, 2018, Proceedings, Part IV. 193--205","author":"Lee Joonseok","year":"2018","unstructured":"Joonseok Lee , Apostol Natsev , Walter Reade , Rahul Sukthankar , and George Toderici . 2018 . The 2nd YouTube-8M Large-Scale Video Understanding Challenge . In Computer Vision - ECCV 2018 Workshops - Munich, Germany, September 8--14, 2018, Proceedings, Part IV. 193--205 . Joonseok Lee, Apostol Natsev, Walter Reade, Rahul Sukthankar, and George Toderici. 2018. The 2nd YouTube-8M Large-Scale Video Understanding Challenge. In Computer Vision - ECCV 2018 Workshops - Munich, Germany, September 8--14, 2018, Proceedings, Part IV. 193--205."},{"key":"e_1_3_2_1_17_1","first-page":"21","article-title":"Multilayer Perceptron New Method for Selecting the Architecture Based on the Choice of Different Activation Functions","volume":"11","author":"Ramchoun Hassan","year":"2019","unstructured":"Hassan Ramchoun , Mohammed Amine Janati Idrissi , Youssef Ghanou , and Mohamed Ettaouil . 2019 . Multilayer Perceptron New Method for Selecting the Architecture Based on the Choice of Different Activation Functions . IJISSS 11 , 4 (2019), 21 -- 34 . Hassan Ramchoun, Mohammed Amine Janati Idrissi, Youssef Ghanou, and Mohamed Ettaouil. 2019. Multilayer Perceptron New Method for Selecting the Architecture Based on the Choice of Different Activation Functions. IJISSS 11, 4 (2019), 21--34.","journal-title":"IJISSS"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2008.4587727"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_3_2_1_20_1","volume-title":"Dynamic Routing Between Capsules. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Sabour Sara","year":"2017","unstructured":"Sara Sabour , Nicholas Frosst , and Geoffrey E. Hinton . 2017 . Dynamic Routing Between Capsules. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017 , 4--9 December 2017 , Long Beach, CA, USA. 3856--3866. Sara Sabour, Nicholas Frosst, and Geoffrey E. Hinton. 2017. Dynamic Routing Between Capsules. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4--9 December 2017, Long Beach, CA, USA. 3856--3866."},{"key":"e_1_3_2_1_21_1","volume-title":"Two-Stream Convolutional Networks for Action Recognition in Videos. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014 . Two-Stream Convolutional Networks for Action Recognition in Videos. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014 , December 8 --13 2014, Montreal, Quebec, Canada. 568--576. Karen Simonyan and Andrew Zisserman. 2014. Two-Stream Convolutional Networks for Action Recognition in Videos. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8--13 2014, Montreal, Quebec, Canada. 568--576."},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4--9","author":"Szegedy Christian","year":"2017","unstructured":"Christian Szegedy , Sergey Ioffe , Vincent Vanhoucke , and Alexander A. Alemi . 2017. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning . In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4--9 , 2017 , San Francisco, California, USA. 4278--4284. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander A. Alemi. 2017. Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4--9, 2017, San Francisco, California, USA. 4278--4284."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_1_25_1","volume-title":"Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016","author":"Szegedy Christian","year":"2016","unstructured":"Christian Szegedy , Vincent Vanhoucke , Sergey Ioffe , Jonathon Shlens , and Zbigniew Wojna . 2016 . Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 , Las Vegas, NV, USA, June 27--30 , 2016. 2818--2826. Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, and Zbigniew Wojna. 2016. Rethinking the Inception Architecture for Computer Vision. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016. 2818--2826."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.510"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00675"},{"key":"e_1_3_2_1_28_1","unstructured":"Xinshuo Weng and Kris Kitani. 2019. Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading. (2019) 269.  Xinshuo Weng and Kris Kitani. 2019. Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading. (2019) 269."}],"event":{"name":"MMAsia '20: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Virtual Event Singapore","acronym":"MMAsia '20"},"container-title":["Proceedings of the 2nd ACM International Conference on Multimedia in Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446296","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3444685.3446296","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:03:19Z","timestamp":1750197799000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3444685.3446296"}},"subtitle":["residual inception and cascaded capsule network for broadcast sports video classification"],"short-title":[],"issued":{"date-parts":[[2021,3,7]]},"references-count":27,"alternative-id":["10.1145\/3444685.3446296","10.1145\/3444685"],"URL":"https:\/\/doi.org\/10.1145\/3444685.3446296","relation":{},"subject":[],"published":{"date-parts":[[2021,3,7]]},"assertion":[{"value":"2021-05-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}