{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:28:22Z","timestamp":1750220902067,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,12,15]],"date-time":"2019-12-15T00:00:00Z","timestamp":1576368000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Natural Science Foundation of China","award":["61801428, 61976192"],"award-info":[{"award-number":["61801428, 61976192"]}]},{"name":"Zhejiang Provincial Natural Science Foundation of China","award":["LY18F020034, LY18F020032"],"award-info":[{"award-number":["LY18F020034, LY18F020032"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,12,15]]},"DOI":"10.1145\/3338533.3366593","type":"proceedings-article","created":{"date-parts":[[2020,1,11]],"date-time":"2020-01-11T04:30:32Z","timestamp":1578717032000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Video Summarization based on Sparse Subspace Clustering with Automatically Estimated Number of Clusters"],"prefix":"10.1145","author":[{"given":"Pengyi","family":"Hao","sequence":"first","affiliation":[{"name":"Zhejiang University of Technology, Hangzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Edwin","family":"Manhando","sequence":"additional","affiliation":[{"name":"Zhejiang University of Science & Technology, Hangzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Taotao","family":"Ye","sequence":"additional","affiliation":[{"name":"Zhejiang University of Technology, Hangzhou, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cong","family":"Bai","sequence":"additional","affiliation":[{"name":"Zhejiang University of Technology, Hangzhou, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,1,10]]},"reference":[{"volume-title":"Video Summarization: Techniques and Classification. In International Conference on Computer Vision and Graphics. 1--13","author":"Ajmal M.","key":"e_1_3_2_1_1_1","unstructured":"M. Ajmal , M. H. Ashraf , M. Shakir , Y. Abbas , and F.A. Shah . 2012 . Video Summarization: Techniques and Classification. In International Conference on Computer Vision and Graphics. 1--13 . M. Ajmal, M. H. Ashraf, M. Shakir, Y. Abbas, and F.A. Shah. 2012. Video Summarization: Techniques and Classification. In International Conference on Computer Vision and Graphics. 1--13."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2010.08.004"},{"volume-title":"British Machine Vision Conference.","author":"Chatfield K.","key":"e_1_3_2_1_3_1","unstructured":"K. Chatfield , K. Simonyan , A. Vedaldi , and A. Zisserman . 2014. Return of the Devil in the Details: Delving Deep into Convolutional Nets . In British Machine Vision Conference. K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman. 2014. Return of the Devil in the Details: Delving Deep into Convolutional Nets. In British Machine Vision Conference."},{"volume-title":"Video Summarization Preserving Dynamic Content. In International Workshop on TRECVID Video Summarization. 40--44","author":"Chen F.","key":"e_1_3_2_1_4_1","unstructured":"F. Chen , M. Cooper , and J. Adcock . 2007 . Video Summarization Preserving Dynamic Content. In International Workshop on TRECVID Video Summarization. 40--44 . F. Chen, M. Cooper, and J. Adcock. 2007. Video Summarization Preserving Dynamic Content. In International Workshop on TRECVID Video Summarization. 
40--44."},{"volume-title":"IEEE Conference on Computer Vision and Recognition. 3584--3592","author":"Chu W.","key":"e_1_3_2_1_5_1","unstructured":"W. Chu , Y. Song , and A. Jaimes . 2015. Video Co-summarization: Video Summarization by Visual Co-occurrence . In IEEE Conference on Computer Vision and Recognition. 3584--3592 . W. Chu, Y. Song, and A. Jaimes. 2015. Video Co-summarization: Video Summarization by Visual Co-occurrence. In IEEE Conference on Computer Vision and Recognition. 3584--3592."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"M.V.M. Cirne and H. Pedrini. 2013. A Video Summarization Method Based on Spectral Clustering. In Progress in Pattern Recognition Image Analysis Computer Vision and Applications. 479--486.  M.V.M. Cirne and H. Pedrini. 2013. A Video Summarization Method Based on Spectral Clustering. In Progress in Pattern Recognition Image Analysis Computer Vision and Applications. 479--486.","DOI":"10.1007\/978-3-642-41827-3_60"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.57"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"N. Fachada M.A.T. Figueiredo V.V. Lopes R.C. Martins and A.C. Rosa. 2014. Spectrometric differentiation of yeast strains using minimum volume increase and minimum direction change clustering criteria. 45 (2014) 55--61.  N. Fachada M.A.T. Figueiredo V.V. Lopes R.C. Martins and A.C. Rosa. 2014. Spectrometric differentiation of yeast strains using minimum volume increase and minimum direction change clustering criteria. 45 (2014) 55--61.","DOI":"10.1016\/j.patrec.2014.03.008"},{"volume-title":"Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In IEEE Conference on Computer Vision and Recognition. 580--587","author":"Girshick R. B.","key":"e_1_3_2_1_9_1","unstructured":"R. B. Girshick , J. Donahue , T. Darrell , and J. Malik . 2014 . Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. 
In IEEE Conference on Computer Vision and Recognition. 580--587 . R. B. Girshick, J. Donahue, T. Darrell, and J. Malik. 2014. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In IEEE Conference on Computer Vision and Recognition. 580--587."},{"volume-title":"European Conference on Computer Vision. 505--520","author":"Gygli M.","key":"e_1_3_2_1_10_1","unstructured":"M. Gygli , H. Grabner , H. Riemenschneider , and L. V. Gool . 2014. Creating Summaries from User Videos . In European Conference on Computer Vision. 505--520 . M. Gygli, H. Grabner, H. Riemenschneider, and L. V. Gool. 2014. Creating Summaries from User Videos. In European Conference on Computer Vision. 505--520."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"R. Hari C. P. Roopesh and M. Wilscy. 2013. Human face based approach for video summarization. In IEEE Recent Advances in Intelligent Computational Systems. 245--250.  R. Hari C. P. Roopesh and M. Wilscy. 2013. Human face based approach for video summarization. In IEEE Recent Advances in Intelligent Computational Systems. 245--250.","DOI":"10.1109\/RAICS.2013.6745481"},{"volume-title":"Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Recognition. 770--778","author":"He K.","key":"e_1_3_2_1_12_1","unstructured":"K. He , X. Zhang , S. Ren , and J. Sun . 2016 . Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Recognition. 770--778 . K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Recognition. 770--778."},{"key":"e_1_3_2_1_13_1","volume-title":"FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report UM-CS-2010-009. Technical Report","author":"Jain V.","year":"2010","unstructured":"V. Jain and E. Learned-Miller . 2010 . FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report UM-CS-2010-009. 
Technical Report , University of Massachusetts , Amherst. V. Jain and E. Learned-Miller. 2010. FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report UM-CS-2010-009. Technical Report, University of Massachusetts, Amherst."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.59"},{"key":"e_1_3_2_1_15_1","volume-title":"ImageNet Classification with Deep Convolutional Neural Networks. In International Conference on Neural Information Processing Systems","volume":"60","author":"Krizhevsky A.","unstructured":"A. Krizhevsky , I. Sutskever , and G.E. Hinton . 2012 . ImageNet Classification with Deep Convolutional Neural Networks. In International Conference on Neural Information Processing Systems , Vol. 60 . 1097--1105. A. Krizhevsky, I. Sutskever, and G.E. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In International Conference on Neural Information Processing Systems, Vol. 60. 1097--1105."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1989.1.4.541"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.88"},{"volume-title":"SSD: Single Shot Multi Box Detector. In European Conference on Computer Vision. 21--37","author":"Liu W.","key":"e_1_3_2_1_18_1","unstructured":"W. Liu , D. Anguelov , D. Erhan , C. Szegedy , S. Reed , C. Fu , and A.C. Berg . 2016 . SSD: Single Shot Multi Box Detector. In European Conference on Computer Vision. 21--37 . W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Fu, and A.C. Berg. 2016. SSD: Single Shot Multi Box Detector. In European Conference on Computer Vision. 21--37."},{"volume-title":"Unsupervised Video Summarization with Adversarial LSTM Networks. In IEEE Conference on Computer Vision and Recognition. 2982--2991","author":"Mahasseni B.","key":"e_1_3_2_1_19_1","unstructured":"B. Mahasseni , M. Lam , and S. Todorovic . 2017 . 
Unsupervised Video Summarization with Adversarial LSTM Networks. In IEEE Conference on Computer Vision and Recognition. 2982--2991 . B. Mahasseni, M. Lam, and S. Todorovic. 2017. Unsupervised Video Summarization with Adversarial LSTM Networks. In IEEE Conference on Computer Vision and Recognition. 2982--2991."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00799-005-0129-9"},{"key":"e_1_3_2_1_21_1","volume-title":"International Conference on Neural Information Processing Systems","volume":"14","author":"Ng A. Y.","unstructured":"A. Y. Ng , M. I. Jordan , and Y. Weiss . 2001. On Spectral Clustering: Analysis and an algorithm . In International Conference on Neural Information Processing Systems , Vol. 14 . 849--856. A. Y. Ng, M. I. Jordan, and Y. Weiss. 2001. On Spectral Clustering: Analysis and an algorithm. In International Conference on Neural Information Processing Systems, Vol. 14. 849--856."},{"volume-title":"Weakly Supervised Summarization of Web Videos. In IEEE International Conference on Computer Vision. 3677--3686","author":"Panda R.","key":"e_1_3_2_1_22_1","unstructured":"R. Panda , A. Das , Z. Wu , J. Ernst , and A. K. Roy-Chowdhury . 2017 . Weakly Supervised Summarization of Web Videos. In IEEE International Conference on Computer Vision. 3677--3686 . R. Panda, A. Das, Z. Wu, J. Ernst, and A. K. Roy-Chowdhury. 2017. Weakly Supervised Summarization of Web Videos. In IEEE International Conference on Computer Vision. 3677--3686."},{"volume-title":"Real-Time Object Detection. In IEEE Conference on Computer Vision and Recognition. 779--788","author":"Redmon J.","key":"e_1_3_2_1_23_1","unstructured":"J. Redmon , S. K. Divvala , R. B. Girshick , and A. Farhadi . 2016. You Only Look Once: Unified , Real-Time Object Detection. In IEEE Conference on Computer Vision and Recognition. 779--788 . J. Redmon, S. K. Divvala, R. B. Girshick, and A. Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. 
In IEEE Conference on Computer Vision and Recognition. 779--788."},{"volume-title":"Stronger. In IEEE Conference on Computer Vision and Recognition. 6517--6525","author":"Redmon J.","key":"e_1_3_2_1_24_1","unstructured":"J. Redmon and A. Farhadi . 2017. YOLO9000: Better, Faster , Stronger. In IEEE Conference on Computer Vision and Recognition. 6517--6525 . J. Redmon and A. Farhadi. 2017. YOLO9000: Better, Faster, Stronger. In IEEE Conference on Computer Vision and Recognition. 6517--6525."},{"volume-title":"Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations.","author":"Simonyan K.","key":"e_1_3_2_1_25_1","unstructured":"K. Simonyan and A. Zisserman . 2015 . Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations. K. Simonyan and A. Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations."},{"volume-title":"IEEE Conference on Computer Vision and Recognition. 5179--5187","author":"Song Y.","key":"e_1_3_2_1_26_1","unstructured":"Y. Song , J. Vallmitjana , A. Stent , and A. Jaimes . 2015. TVSum: Summarizing web videos using titles . In IEEE Conference on Computer Vision and Recognition. 5179--5187 . Y. Song, J. Vallmitjana, A. Stent, and A. Jaimes. 2015. TVSum: Summarizing web videos using titles. In IEEE Conference on Computer Vision and Recognition. 5179--5187."},{"volume-title":"Face Recognition Data","author":"Spacek Libor","key":"e_1_3_2_1_27_1","unstructured":"Libor Spacek . 2007. Face Recognition Data , University of Essex , UK, Faces 96. http:\/\/cswww.essex.ac.uk\/mv\/allfaces\/faces96.html Libor Spacek. 2007. Face Recognition Data, University of Essex, UK, Faces 96. http:\/\/cswww.essex.ac.uk\/mv\/allfaces\/faces96.html"},{"volume-title":"Going Deeper with Convolutions. In IEEE Conference on Computer Vision and Pattern Recognition. 
1--9.","author":"Szegedy C.","key":"e_1_3_2_1_28_1","unstructured":"C. Szegedy , W. Liu , Y. Jia , P. Sermanet , S. Reed , D. Anguelov , D. Erhan , V. Vanhoucke , and A. Rabinovich . 2015 . Going Deeper with Convolutions. In IEEE Conference on Computer Vision and Pattern Recognition. 1--9. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. 2015. Going Deeper with Convolutions. In IEEE Conference on Computer Vision and Pattern Recognition. 1--9."},{"key":"e_1_3_2_1_29_1","volume-title":"Degenerate and Non-degenerate. In European Conference on Computer Vision. 94--106","author":"Yan J.","year":"2006","unstructured":"J. Yan and M. Pollefeys . 2006 . A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid , Degenerate and Non-degenerate. In European Conference on Computer Vision. 94--106 . J. Yan and M.Pollefeys. 2006. A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate. In European Conference on Computer Vision. 94--106."},{"volume-title":"WIDER FACE: A Face Detection Benchmark. In IEEE Conference on Computer Vision and Recognition. 5525--5533","author":"Yang S.","key":"e_1_3_2_1_30_1","unstructured":"S. Yang , P. Luo , C. L. Chen , and X. Tang . 2016 . WIDER FACE: A Face Detection Benchmark. In IEEE Conference on Computer Vision and Recognition. 5525--5533 . S. Yang, P. Luo, C. L. Chen, and X. Tang. 2016. WIDER FACE: A Face Detection Benchmark. In IEEE Conference on Computer Vision and Recognition. 5525--5533."},{"key":"e_1_3_2_1_31_1","volume-title":"Self-Tuning Spectral Clustering. Advances in Neural Information Processing Systems","author":"Zelnikmanor L.","year":"2004","unstructured":"L. Zelnikmanor . 2004. Self-Tuning Spectral Clustering. Advances in Neural Information Processing Systems ( 2004 ), 1601--1608. L. Zelnikmanor. 2004. Self-Tuning Spectral Clustering. 
Advances in Neural Information Processing Systems (2004), 1601--1608."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/LSP.2016.2603342"},{"volume-title":"Object Detectors Emerge in Deep Scene CNNs. In International Conference on Learning Representations.","author":"Zhou B.","key":"e_1_3_2_1_33_1","unstructured":"B. Zhou , A. Khosla , A. Lapedriza , A. Oliva , and A. Torralba . 2015 . Object Detectors Emerge in Deep Scene CNNs. In International Conference on Learning Representations. B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba. 2015. Object Detectors Emerge in Deep Scene CNNs. In International Conference on Learning Representations."},{"volume-title":"International Conference on Neural Information Processing Systems. 487--495","author":"Zhou B.","key":"e_1_3_2_1_34_1","unstructured":"B. Zhou , A. Lapedriza , J. Xiao , A. Torralba , and A. Oliva . 2014. Learning Deep Features for Scene Recognition using Places Database . In International Conference on Neural Information Processing Systems. 487--495 . B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. 2014. Learning Deep Features for Scene Recognition using Places Database. In International Conference on Neural Information Processing Systems. 
487--495."}],"event":{"name":"MMAsia '19: ACM Multimedia Asia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Beijing China","acronym":"MMAsia '19"},"container-title":["Proceedings of the ACM Multimedia Asia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3338533.3366593","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3338533.3366593","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:44:47Z","timestamp":1750203887000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3338533.3366593"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,15]]},"references-count":34,"alternative-id":["10.1145\/3338533.3366593","10.1145\/3338533"],"URL":"https:\/\/doi.org\/10.1145\/3338533.3366593","relation":{},"subject":[],"published":{"date-parts":[[2019,12,15]]},"assertion":[{"value":"2020-01-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}