{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T09:59:28Z","timestamp":1773482368452,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"13","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2023,9]]},"abstract":"<jats:p>We introduce VOCALExplore, a system designed to support users in building domain-specific models over video datasets. VOCALExplore supports interactive labeling sessions and trains models using user-supplied labels. VOCALExplore maximizes model quality by automatically deciding how to select samples based on observed skew in the collected labels. It also selects the optimal video representations to use when training models by casting feature selection as a rising bandit problem. Finally, VOCALExplore implements optimizations to achieve low latency without sacrificing model performance. We demonstrate that VOCALExplore achieves close to the best possible model quality given candidate acquisition functions and feature extractors, and it does so with low visible latency (~1 second per iteration) and no expensive preprocessing.<\/jats:p>","DOI":"10.14778\/3625054.3625057","type":"journal-article","created":{"date-parts":[[2023,12,4]],"date-time":"2023-12-04T17:09:42Z","timestamp":1701709782000},"page":"4188-4201","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building"],"prefix":"10.14778","volume":"16","author":[{"given":"Maureen","family":"Daum","sequence":"first","affiliation":[{"name":"University of Washington"}]},{"given":"Enhao","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Washington"}]},{"given":"Dong","family":"He","sequence":"additional","affiliation":[{"name":"University of Washington"}]},{"given":"Stephen","family":"Mussmann","sequence":"additional","affiliation":[{"name":"University of Washington"}]},{"given":"Brandon","family":"Haynes","sequence":"additional","affiliation":[{"name":"Microsoft Gray Systems Lab"}]},{"given":"Ranjay","family":"Krishna","sequence":"additional","affiliation":[{"name":"University of Washington"}]},{"given":"Magdalena","family":"Balazinska","sequence":"additional","affiliation":[{"name":"University of Washington"}]}],"member":"320","published-online":{"date-parts":[[2023,12,4]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"2022. Amazon Rekognition. https:\/\/aws.amazon.com\/rekognition\/. 2022. Amazon Rekognition. https:\/\/aws.amazon.com\/rekognition\/."},{"key":"e_1_2_1_2_1","unstructured":"2022. Azure Video Indexer. https:\/\/learn.microsoft.com\/en-us\/azure\/azure-video-indexer\/video-indexer-overview. 2022. Azure Video Indexer. https:\/\/learn.microsoft.com\/en-us\/azure\/azure-video-indexer\/video-indexer-overview."},{"key":"e_1_2_1_3_1","unstructured":"2022. Forager: Rapid Data Exploration and Model Development. https:\/\/cs.stanford.edu\/~fpoms\/. 2022. Forager: Rapid Data Exploration and Model Development. https:\/\/cs.stanford.edu\/~fpoms\/."},{"key":"e_1_2_1_4_1","unstructured":"2022. Google Cloud Video Intelligence API. https:\/\/cloud.google.com\/video-intelligence. 2022. Google Cloud Video Intelligence API. https:\/\/cloud.google.com\/video-intelligence."},{"key":"e_1_2_1_5_1","unstructured":"2023. NVIDIA DALI. https:\/\/developer.nvidia.com\/dali. 2023. NVIDIA DALI. https:\/\/developer.nvidia.com\/dali."},{"key":"e_1_2_1_6_1","volume-title":"Wenisch","author":"Anderson Michael R.","year":"2019","unstructured":"Michael R. Anderson , Michael J. Cafarella , Germ\u00e1n Ros , and Thomas F . Wenisch . 2019 . Physical Representation-Based Predicate Optimization for a Visual Analytics Database. In IEEE. IEEE , 1466--1477. Michael R. Anderson, Michael J. Cafarella, Germ\u00e1n Ros, and Thomas F. Wenisch. 2019. Physical Representation-Based Predicate Optimization for a Visual Analytics Database. In IEEE. IEEE, 1466--1477."},{"key":"e_1_2_1_7_1","volume-title":"MIRIS: Fast Object Track Queries in Video. In SIGMOD. 1907--1921.","author":"Bastani Favyen","year":"2020","unstructured":"Favyen Bastani , Songtao He , Arjun Balasingam , Karthik Gopalakrishnan , Mohammad Alizadeh , Hari Balakrishnan , Michael J. Cafarella , Tim Kraska , and Sam Madden . 2020 . MIRIS: Fast Object Track Queries in Video. In SIGMOD. 1907--1921. Favyen Bastani, Songtao He, Arjun Balasingam, Karthik Gopalakrishnan, Mohammad Alizadeh, Hari Balakrishnan, Michael J. Cafarella, Tim Kraska, and Sam Madden. 2020. MIRIS: Fast Object Track Queries in Video. In SIGMOD. 1907--1921."},{"key":"e_1_2_1_8_1","volume-title":"Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection","author":"Beery Sara","unstructured":"Sara Beery , Guanhang Wu , Vivek Rathod , Ronny Votel , and Jonathan Huang . 2020. Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection . In CVPR. Computer Vision Foundation \/ IEEE , 13072--13082. Sara Beery, Guanhang Wu, Vivek Rathod, Ronny Votel, and Jonathan Huang. 2020. Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection. In CVPR. Computer Vision Foundation \/ IEEE, 13072--13082."},{"key":"e_1_2_1_9_1","volume-title":"The OpenCV Library. Dr. Dobb's Journal of Software Tools","author":"Bradski G.","year":"2000","unstructured":"G. Bradski . 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools ( 2000 ). G. Bradski. 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools (2000)."},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Jiashen Cao Karan Sarkar Ramyad Hadidi Joy Arulraj and Hyesoon Kim. 2022. FiGO: Fine-Grained Query Optimization in Video Analytics. In SIGMOD Zachary Ives Angela Bonifati and Amr El Abbadi (Eds.). ACM 559--572. Jiashen Cao Karan Sarkar Ramyad Hadidi Joy Arulraj and Hyesoon Kim. 2022. FiGO: Fine-Grained Query Optimization in Video Analytics. In SIGMOD Zachary Ives Angela Bonifati and Amr El Abbadi (Eds.). ACM 559--572.","DOI":"10.1145\/3514221.3517857"},{"key":"e_1_2_1_11_1","volume-title":"Ranked Window Query Retrieval over Video Repositories","author":"Chen Yueting","unstructured":"Yueting Chen , Xiaohui Yu , and Nick Koudas . 2022. Ranked Window Query Retrieval over Video Repositories . In ICDE. IEEE , 2776--2791. Yueting Chen, Xiaohui Yu, and Nick Koudas. 2022. Ranked Window Query Retrieval over Video Repositories. In ICDE. IEEE, 2776--2791."},{"key":"e_1_2_1_12_1","unstructured":"Gui Citovsky Giulia DeSalvo Claudio Gentile Lazaros Karydas Anand Rajagopalan Afshin Rostamizadeh and Sanjiv Kumar. 2021. Batch Active Learning at Scale. In NeurIPS. 11933--11944. Gui Citovsky Giulia DeSalvo Claudio Gentile Lazaros Karydas Anand Rajagopalan Afshin Rostamizadeh and Sanjiv Kumar. 2021. Batch Active Learning at Scale. In NeurIPS. 11933--11944."},{"key":"e_1_2_1_13_1","volume-title":"VOCAL: Video Organization and Interactive Compositional AnaLytics. In CIDR.","author":"Daum Maureen","year":"2022","unstructured":"Maureen Daum , Enhao Zhang , Dong He , Magdalena Balazinska , Brandon Haynes , Ranjay Krishna , Apryle Craig , and Aaron Wirsing . 2022 . VOCAL: Video Organization and Interactive Compositional AnaLytics. In CIDR. Maureen Daum, Enhao Zhang, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna, Apryle Craig, and Aaron Wirsing. 2022. VOCAL: Video Organization and Interactive Compositional AnaLytics. In CIDR."},{"key":"e_1_2_1_14_1","volume-title":"VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. CoRR abs\/2303.04068","author":"Daum Maureen","year":"2023","unstructured":"Maureen Daum , Enhao Zhang , Dong He , Stephen Mussmann , Brandon Haynes , Ranjay Krishna , and Magdalena Balazinska . 2023. VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. CoRR abs\/2303.04068 ( 2023 ). Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, and Magdalena Balazinska. 2023. VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. CoRR abs\/2303.04068 (2023)."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1111\/oik.08628"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the 29th ACM International Conference on Multimedia. https:\/\/pytorchvideo.org\/.","author":"Fan Haoqi","year":"2021","unstructured":"Haoqi Fan , Tullie Murrell , Heng Wang , Kalyan Vasudev Alwala , Yanghao Li , Yilei Li , Bo Xiong , Nikhila Ravi , Meng Li , Haichuan Yang , Jitendra Malik , Ross Girshick , Matt Feiszli , Aaron Adcock , Wan-Yen Lo , and Christoph Feichtenhofer . 2021 . PyTorchVideo: A Deep Learning Library for Video Understanding . In Proceedings of the 29th ACM International Conference on Multimedia. https:\/\/pytorchvideo.org\/. Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, and Christoph Feichtenhofer. 2021. PyTorchVideo: A Deep Learning Library for Video Understanding. In Proceedings of the 29th ACM International Conference on Multimedia. https:\/\/pytorchvideo.org\/."},{"key":"e_1_2_1_17_1","volume-title":"Multiscale Vision Transformers","author":"Fan Haoqi","unstructured":"Haoqi Fan , Bo Xiong , Karttikeya Mangalam , Yanghao Li , Zhicheng Yan , Jitendra Malik , and Christoph Feichtenhofer . 2021. Multiscale Vision Transformers . In ICCV. IEEE , 6804--6815. Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, and Christoph Feichtenhofer. 2021. Multiscale Vision Transformers. In ICCV. IEEE, 6804--6815."},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1162\/daed_a_01902","article-title":"Searching for computer vision north stars","volume":"151","author":"Fei-Fei Li","year":"2022","unstructured":"Li Fei-Fei and Ranjay Krishna . 2022 . Searching for computer vision north stars . Daedalus 151 , 2 (2022), 85 -- 99 . Li Fei-Fei and Ranjay Krishna. 2022. Searching for computer vision north stars. Daedalus 151, 2 (2022), 85--99.","journal-title":"Daedalus"},{"key":"e_1_2_1_19_1","volume-title":"A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning","author":"Feichtenhofer Christoph","unstructured":"Christoph Feichtenhofer , Haoqi Fan , Bo Xiong , Ross B. Girshick , and Kaiming He. 2021. A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning . In CVPR. Computer Vision Foundation \/ IEEE , 3299--3309. Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross B. Girshick, and Kaiming He. 2021. A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning. In CVPR. Computer Vision Foundation \/ IEEE, 3299--3309."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.106622"},{"key":"e_1_2_1_21_1","volume-title":"SISAP (Lecture Notes in Computer Science)","author":"Heller Silvan","unstructured":"Silvan Heller , Mahnaz Parian , Maurizio Pasquinelli , and Heiko Schuldt . 2020. Vitrivr-Explore: Guided Multimedia Collection Exploration for Ad-hoc Video Search . In SISAP (Lecture Notes in Computer Science) , Vol. 12440 . Springer , 379--386. Silvan Heller, Mahnaz Parian, Maurizio Pasquinelli, and Heiko Schuldt. 2020. Vitrivr-Explore: Guided Multimedia Collection Exploration for Ad-hoc Video Search. In SISAP (Lecture Notes in Computer Science), Vol. 12440. Springer, 379--386."},{"key":"e_1_2_1_22_1","volume-title":"Multi-Stage Queries and Temporal Scoring in Vitrivr","author":"Heller Silvan","unstructured":"Silvan Heller , Loris Sauter , Heiko Schuldt , and Luca Rossetto . 2020. Multi-Stage Queries and Temporal Scoring in Vitrivr . In ICME. IEEE , 1--5. Silvan Heller, Loris Sauter, Heiko Schuldt, and Luca Rossetto. 2020. Multi-Stage Queries and Temporal Scoring in Vitrivr. In ICME. IEEE, 1--5."},{"key":"e_1_2_1_23_1","volume-title":"Jamieson and Ameet Talwalkar","author":"Kevin","year":"2016","unstructured":"Kevin G. Jamieson and Ameet Talwalkar . 2016 . Non-stochastic Best Arm Identification and Hyperparameter Optimization. In AISTATS (JMLR), Arthur Gretton and Christian C. Robert (Eds .), Vol. 51 . 240--248. Kevin G. Jamieson and Ameet Talwalkar. 2016. Non-stochastic Best Arm Identification and Hyperparameter Optimization. In AISTATS (JMLR), Arthur Gretton and Christian C. Robert (Eds.), Vol. 51. 240--248."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.14778\/3372716.3372725"},{"key":"e_1_2_1_25_1","volume-title":"Annual Meeting of the Association for Computational Linguistics.","author":"Karamcheti Siddharth","unstructured":"Siddharth Karamcheti , Ranjay Krishna , Li Fei-Fei , and Christopher D. Manning . 2021. Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering . In Annual Meeting of the Association for Computational Linguistics. Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei, and Christopher D. Manning. 2021. Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. In Annual Meeting of the Association for Computational Linguistics."},{"key":"e_1_2_1_26_1","volume-title":"AutoLearn - Automated Feature Generation and Selection","author":"Kaul Ambika","unstructured":"Ambika Kaul , Saket Maheshwary , and Vikram Pudi . 2017. AutoLearn - Automated Feature Generation and Selection . In ICDM. IEEE Computer Society , 217--226. Ambika Kaul, Saket Maheshwary, and Vikram Pudi. 2017. AutoLearn - Automated Feature Generation and Selection. In ICDM. IEEE Computer Society, 217--226."},{"key":"e_1_2_1_27_1","volume-title":"Automatic model selection and hyperparameter optimization in WEKA. JMLR 18","author":"Kotthoff Lars","year":"2017","unstructured":"Lars Kotthoff , Chris Thornton , Holger H. Hoos , Frank Hutter , and Kevin Leyton-Brown . 2017. Auto- WEKA 2.0 : Automatic model selection and hyperparameter optimization in WEKA. JMLR 18 ( 2017 ), 25:1--25:5. Lars Kotthoff, Chris Thornton, Holger H. Hoos, Frank Hutter, and Kevin Leyton-Brown. 2017. Auto-WEKA 2.0: Automatic model selection and hyperparameter optimization in WEKA. JMLR 18 (2017), 25:1--25:5."},{"key":"e_1_2_1_28_1","volume-title":"Efficient Automatic CASH via Rising Bandits","author":"Li Yang","unstructured":"Yang Li , Jiawei Jiang , Jinyang Gao , Yingxia Shao , Ce Zhang , and Bin Cui . 2020. Efficient Automatic CASH via Rising Bandits . In AAAI. AAAI Press , 4763--4771. Yang Li, Jiawei Jiang, Jinyang Gao, Yingxia Shao, Ce Zhang, and Bin Cui. 2020. Efficient Automatic CASH via Rising Bandits. In AAAI. AAAI Press, 4763--4771."},{"key":"e_1_2_1_29_1","volume-title":"Girshick","author":"Li Yanghao","year":"2021","unstructured":"Yanghao Li , Saining Xie , Xinlei Chen , Piotr Doll\u00e1r , Kaiming He , and Ross B . Girshick . 2021 . Benchmarking Detection Transfer Learning with Vision Transformers. CoRR abs\/2111.11429 (2021). Yanghao Li, Saining Xie, Xinlei Chen, Piotr Doll\u00e1r, Kaiming He, and Ross B. Girshick. 2021. Benchmarking Detection Transfer Learning with Vision Transformers. CoRR abs\/2111.11429 (2021)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346452"},{"key":"e_1_2_1_31_1","volume-title":"Accelerating Machine Learning Inference with Probabilistic Predicates","author":"Lu Yao","unstructured":"Yao Lu , Aakanksha Chowdhery , Srikanth Kandula , and Surajit Chaudhuri . 2018. Accelerating Machine Learning Inference with Probabilistic Predicates . In SIGMOD, Gautam Das, Christopher M. Jermaine, and Philip A. Bernstein (Eds.). ACM , 1493--1508. Yao Lu, Aakanksha Chowdhery, Srikanth Kandula, and Surajit Chaudhuri. 2018. Accelerating Machine Learning Inference with Probabilistic Predicates. In SIGMOD, Gautam Das, Christopher M. Jermaine, and Philip A. Bernstein (Eds.). ACM, 1493--1508."},{"key":"e_1_2_1_32_1","unstructured":"TorchVision maintainers and contributors. 2016. TorchVision: PyTorch's Computer Vision library. https:\/\/github.com\/pytorch\/vision TorchVision maintainers and contributors. 2016. TorchVision: PyTorch's Computer Vision library. https:\/\/github.com\/pytorch\/vision"},{"key":"e_1_2_1_33_1","volume-title":"ExSample: Efficient Searches on Video Repositories through Adaptive Sampling","author":"Moll Oscar R.","unstructured":"Oscar R. Moll , Favyen Bastani , Sam Madden , Mike Stonebraker , Vijay Gadepally , and Tim Kraska . 2022. ExSample: Efficient Searches on Video Repositories through Adaptive Sampling . In ICDE. IEEE , 2956--2968. Oscar R. Moll, Favyen Bastani, Sam Madden, Mike Stonebraker, Vijay Gadepally, and Tim Kraska. 2022. ExSample: Efficient Searches on Video Repositories through Adaptive Sampling. In ICDE. IEEE, 2956--2968."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-008-0319-7"},{"key":"e_1_2_1_35_1","volume-title":"PyTorch: An Imperative Style","author":"Paszke Adam","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , Alban Desmaison , Andreas Kopf , Edward Yang , Zachary DeVito , Martin Raison , Alykhan Tejani , Sasank Chilamkurthy , Benoit Steiner , Lu Fang , Junjie Bai , and Soumith Chintala . 2019. PyTorch: An Imperative Style , High-Performance Deep Learning Library . In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch\u00e9 Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024--8035. http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch\u00e9 Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024--8035. http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf"},{"key":"e_1_2_1_36_1","unstructured":"Mark Raasveldt and Hannes Muehleisen. [n.d.]. DuckDB. https:\/\/github.com\/duckdb\/duckdb Mark Raasveldt and Hannes Muehleisen. [n.d.]. DuckDB. https:\/\/github.com\/duckdb\/duckdb"},{"key":"e_1_2_1_37_1","volume-title":"ICML","volume":"139","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , Gretchen Krueger , and Ilya Sutskever . 2021 . Learning Transferable Visual Models From Natural Language Supervision . In ICML , Vol. 139 . PMLR, 8748--8763. Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. In ICML, Vol. 139. PMLR, 8748--8763."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/3570690.3570695"},{"key":"e_1_2_1_39_1","volume-title":"Cineast: A Multi-feature Sketch-Based Video Retrieval Engine","author":"Rossetto Luca","year":"2014","unstructured":"Luca Rossetto , Ivan Giangreco , and Heiko Schuldt . 2014 . Cineast: A Multi-feature Sketch-Based Video Retrieval Engine . In ISM. IEEE , 18--23. Luca Rossetto, Ivan Giangreco, and Heiko Schuldt. 2014. Cineast: A Multi-feature Sketch-Based Video Retrieval Engine. In ISM. IEEE, 18--23."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1987.10478517"},{"key":"e_1_2_1_41_1","unstructured":"Ozan Sener and Silvio Savarese. 2018. Active Learning for Convolutional Neural Networks: A Core-Set Approach. In ICLR. Ozan Sener and Silvio Savarese. 2018. Active Learning for Convolutional Neural Networks: A Core-Set Approach. In ICLR."},{"key":"e_1_2_1_42_1","unstructured":"Burr Settles. 2009. Active Learning Literature Survey. Burr Settles. 2009. Active Learning Literature Survey."},{"key":"e_1_2_1_43_1","volume-title":"ECCV (Lecture Notes in Computer Science)","author":"Sigurdsson Gunnar A.","unstructured":"Gunnar A. Sigurdsson , G\u00fcl Varol , Xiaolong Wang , Ali Farhadi , Ivan Laptev , and Abhinav Gupta . 2016. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding . In ECCV (Lecture Notes in Computer Science) , Vol. 9905 . Springer , 510--526. Gunnar A. Sigurdsson, G\u00fcl Varol, Xiaolong Wang, Ali Farhadi, Ivan Laptev, and Abhinav Gupta. 2016. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding. In ECCV (Lecture Notes in Computer Science), Vol. 9905. Springer, 510--526."},{"key":"e_1_2_1_44_1","volume-title":"A Short Note on the Kinetics-700-2020 Human Action Dataset. CoRR abs\/2010.10864","author":"Smaira Lucas","year":"2020","unstructured":"Lucas Smaira , Jo\u00e3o Carreira , Eric Noland , Ellen Clancy , Amy Wu , and Andrew Zisserman . 2020. A Short Note on the Kinetics-700-2020 Human Action Dataset. CoRR abs\/2010.10864 ( 2020 ). Lucas Smaira, Jo\u00e3o Carreira, Eric Noland, Ellen Clancy, Amy Wu, and Andrew Zisserman. 2020. A Short Note on the Kinetics-700-2020 Human Action Dataset. CoRR abs\/2010.10864 (2020)."},{"key":"e_1_2_1_45_1","volume-title":"A Closer Look at Spatiotemporal Convolutions for Action Recognition","author":"Tran Du","unstructured":"Du Tran , Heng Wang , Lorenzo Torresani , Jamie Ray , Yann LeCun , and Manohar Paluri . 2018. A Closer Look at Spatiotemporal Convolutions for Action Recognition . In CVPR. IEEE , 6450--6459. Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, and Manohar Paluri. 2018. A Closer Look at Spatiotemporal Convolutions for Action Recognition. In CVPR. IEEE, 6450--6459."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2020.101822"},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA@SIGMOD, Carsten Binnig, Juliana Freire, and Eugene Wu (Eds.). ACM, 6:1--6:6.","author":"Wu Yifan","year":"2018","unstructured":"Yifan Wu , Steven Mark Drucker , Matthai Philipose , and Lenin Ravindranath . 2018 . Querying Videos Using DNN Generated Labels . In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA@SIGMOD, Carsten Binnig, Juliana Freire, and Eugene Wu (Eds.). ACM, 6:1--6:6. Yifan Wu, Steven Mark Drucker, Matthai Philipose, and Lenin Ravindranath. 2018. Querying Videos Using DNN Generated Labels. In Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA@SIGMOD, Carsten Binnig, Juliana Freire, and Eugene Wu (Eds.). ACM, 6:1--6:6."},{"key":"e_1_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Hu Xu Gargi Ghosh Po-Yao Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer and Christoph Feichtenhofer. 2021. VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. In EMNLP. 6787--6800. Hu Xu Gargi Ghosh Po-Yao Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer and Christoph Feichtenhofer. 2021. VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding. In EMNLP. 6787--6800.","DOI":"10.18653\/v1\/2021.emnlp-main.544"},{"key":"e_1_2_1_49_1","volume-title":"Joy Arulraj, and Umakishore Ramachandran.","author":"Xu Zhuangdi","year":"2022","unstructured":"Zhuangdi Xu , Gaurav Tarlok Kakkar , Joy Arulraj, and Umakishore Ramachandran. 2022 . EVA : A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views. In SIGMOD. ACM , 602--616. Zhuangdi Xu, Gaurav Tarlok Kakkar, Joy Arulraj, and Umakishore Ramachandran. 2022. EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views. In SIGMOD. ACM, 602--616."},{"key":"e_1_2_1_50_1","volume-title":"Taking human out of learning applications: A survey on automated machine learning. arXiv:1810.13306","author":"Yao Quanming","year":"2018","unstructured":"Quanming Yao , Mengshuo Wang , Yuqiang Chen , Wenyuan Dai , Yu-Feng Li , Wei-Wei Tu , Qiang Yang , and Yang Yu. 2018. Taking human out of learning applications: A survey on automated machine learning. arXiv:1810.13306 ( 2018 ). Quanming Yao, Mengshuo Wang, Yuqiang Chen, Wenyuan Dai, Yu-Feng Li, Wei-Wei Tu, Qiang Yang, and Yang Yu. 2018. Taking human out of learning applications: A survey on automated machine learning. arXiv:1810.13306 (2018)."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00271"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476249.3476269"},{"key":"e_1_2_1_53_1","volume-title":"Image-Enhancement-Based Data Augmentation for Improving Deep Learning in Image Classification Problem","author":"Zhang Weihang","unstructured":"Weihang Zhang , Yuma Kinoshita , and Hitoshi Kiya . 2020. Image-Enhancement-Based Data Augmentation for Improving Deep Learning in Image Classification Problem . In ICCE-TW. IEEE , 1--2. Weihang Zhang, Yuma Kinoshita, and Hitoshi Kiya. 2020. Image-Enhancement-Based Data Augmentation for Improving Deep Learning in Image Classification Problem. In ICCE-TW. IEEE, 1--2."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.14778\/3372716.3372721"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3625054.3625057","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,4]],"date-time":"2024-11-04T22:59:06Z","timestamp":1730761146000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3625054.3625057"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9]]},"references-count":54,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["10.14778\/3625054.3625057"],"URL":"https:\/\/doi.org\/10.14778\/3625054.3625057","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2023,9]]},"assertion":[{"value":"2023-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}