{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,10,30]],"date-time":"2022-10-30T11:31:11Z","timestamp":1667129471363},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","funder":[{"name":"Science and Technology Major Project of Commission of Science andTechnology of Shanghai","award":["No.2021SHZDZX0103"]},{"DOI":"10.13039\/501100003399","name":"Science and Technology Commission of Shanghai Municipality","doi-asserted-by":"publisher","award":["No.20511100800 and No.20511101704 and No.20511101502"]},{"name":"research of High-tech industry and technological innovation special project in Lingang New Area on ecological environment monitoring system based on 5G Internet of Things and edge computing","award":["No.SH-LG-GK-2020-02-11"]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,22]]},"DOI":"10.1145\/3508546.3508548","type":"proceedings-article","created":{"date-parts":[[2022,2,25]],"date-time":"2022-02-25T11:53:44Z","timestamp":1645790024000},"source":"Crossref","is-referenced-by-count":1,"title":["Multi-level Convolutional Transformer with Adaptive Ranking for Semi-supervised Crowd Counting"],"prefix":"10.1145","author":[{"given":"Xin","family":"Deng","sequence":"first","affiliation":[{"name":"Fudan University, China"}]},{"given":"Songjian","family":"Chen","sequence":"additional","affiliation":[{"name":"Fudan University, China"}]},{"given":"Yifan","family":"Chen","sequence":"additional","affiliation":[{"name":"Shanghai Radio Equipment Research Institute, China"}]},{"given":"Jie-Fang","family":"Xu","sequence":"additional","affiliation":[{"name":"Shanghai Radio Equipment Research Institute, China"}]}],"member":"320","published-online":{"date-parts":[[2022,2,25]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107616"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58598-3_1"},{"key":"e_1_3_2_1_3_1","unstructured":"Lokesh Boominathan Srinivas SS Kruthiventi and R Venkatesh Babu. Crowdnet: A deep convolutional network for dense crowd counting. In Lokesh Boominathan Srinivas SS Kruthiventi and R Venkatesh Babu. Crowdnet: A deep convolutional network for dense crowd counting. In"},{"key":"e_1_3_2_1_4_1","first-page":"644","volume-title":"24th ACM international conference on Multimedia","author":"Proceedings","year":"2016","unstructured":"Proceedings of the 24th ACM international conference on Multimedia , pages 640\u2013 644 , 2016 . Proceedings of the 24th ACM international conference on Multimedia, pages 640\u2013644, 2016."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.70"},{"key":"e_1_3_2_1_6_1","first-page":"4039","volume-title":"2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Sam Deepak Babu","unstructured":"Deepak Babu Sam , Shiv Surya , and R Venkatesh Babu . Switching convolutional neural network for crowd counting . In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pages 4031\u2013 4039 . IEEE, 2017. Deepak Babu Sam, Shiv Surya, and R Venkatesh Babu. Switching convolutional neural network for crowd counting. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 4031\u20134039. IEEE, 2017."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00120"},{"key":"e_1_3_2_1_8_1","first-page":"6","volume-title":"2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)","author":"Sindagi Vishwanath A","unstructured":"Vishwanath A Sindagi and Vishal M Patel . Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting . In 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) , pages 1\u2013 6 . IEEE, 2017. Vishwanath A Sindagi and Vishal M Patel. Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting. In 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pages 1\u20136. IEEE, 2017."},{"key":"e_1_3_2_1_9_1","first-page":"1288","volume-title":"2019 IEEE Winter Conference on Applications of Computer Vision (WACV)","author":"Hossain Mohammad","unstructured":"Mohammad Hossain , Mehrdad Hosseinzadeh , Omit Chanda , and Yang Wang . Crowd counting using scale-aware attention networks . In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV) , pages 1280\u2013 1288 . IEEE, 2019. Mohammad Hossain, Mehrdad Hosseinzadeh, Omit Chanda, and Yang Wang. Crowd counting using scale-aware attention networks. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1280\u20131288. IEEE, 2019."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00476"},{"key":"e_1_3_2_1_11_1","first-page":"380","volume-title":"European Conference on Computer Vision","author":"von Borstel Matthias","unstructured":"Matthias von Borstel , Melih Kandemir , Philip Schmidt , Madhavi K Rao , Kumar Rajamani , and Fred A Hamprecht . Gaussian process density counting from weak supervision . In European Conference on Computer Vision , pages 365\u2013 380 . Springer, 2016. Matthias von Borstel, Melih Kandemir, Philip Schmidt, Madhavi K Rao, Kumar Rajamani, and Fred A Hamprecht. Gaussian process density counting from weak supervision. In European Conference on Computer Vision, pages 365\u2013380. Springer, 2016."},{"key":"e_1_3_2_1_12_1","volume-title":"Adaptive mixture regression network with local counting map for crowd counting. arXiv preprint arXiv:2005.05776","author":"Liu Xiyang","year":"2020","unstructured":"Xiyang Liu , Jie Yang , and Wenrui Ding . Adaptive mixture regression network with local counting map for crowd counting. arXiv preprint arXiv:2005.05776 , 2020 . Xiyang Liu, Jie Yang, and Wenrui Ding. Adaptive mixture regression network with local counting map for crowd counting. arXiv preprint arXiv:2005.05776, 2020."},{"key":"e_1_3_2_1_13_1","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition: Learning with Imperfect Data Workshop","author":"Olmschenk Greg","year":"2019","unstructured":"Greg Olmschenk , Jin Chen , Hao Tang , and Zhigang Zhu . Dense crowd counting convolutional neural networks with minimal data using semi- supervised dual-goal generative adversarial networks . In IEEE Conference on Computer Vision and Pattern Recognition: Learning with Imperfect Data Workshop , 2019 . Greg Olmschenk, Jin Chen, Hao Tang, and Zhigang Zhu. Dense crowd counting convolutional neural networks with minimal data using semi- supervised dual-goal generative adversarial networks. In IEEE Conference on Computer Vision and Pattern Recognition: Learning with Imperfect Data Workshop, 2019."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107616"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00799"},{"key":"e_1_3_2_1_16_1","volume-title":"Attention is all you need. arXiv preprint arXiv:1706.03762","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , Lukasz Kaiser , and Illia Polosukhin . Attention is all you need. arXiv preprint arXiv:1706.03762 , 2017 . Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. arXiv preprint arXiv:1706.03762, 2017."},{"key":"e_1_3_2_1_17_1","volume-title":"An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929","author":"Dosovitskiy Alexey","year":"2020","unstructured":"Alexey Dosovitskiy , Lucas Beyer , Alexander Kolesnikov , Dirk Weissenborn , Xiaohua Zhai , Thomas Unterthiner , Mostafa Dehghani , Matthias Minderer , Georg Heigold , Sylvain Gelly , An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 , 2020 . Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020."},{"key":"e_1_3_2_1_18_1","first-page":"229","volume-title":"European Conference on Computer Vision","author":"Carion Nicolas","unstructured":"Nicolas Carion , Francisco Massa , Gabriel Synnaeve , Nicolas Usunier , Alexander Kirillov , and Sergey Zagoruyko . End-to-end object detection with transformers . In European Conference on Computer Vision , pages 213\u2013 229 . Springer, 2020. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. End-to-end object detection with transformers. In European Conference on Computer Vision, pages 213\u2013229. Springer, 2020."},{"key":"e_1_3_2_1_19_1","volume-title":"Visual transformers: Token-based image representation and processing for computer vision. arXiv preprint arXiv:2006.03677","author":"Wu Bichen","year":"2020","unstructured":"Bichen Wu , Chenfeng Xu , Xiaoliang Dai , Alvin Wan , Peizhao Zhang , Masayoshi Tomizuka , Kurt Keutzer , and Peter Vajda . Visual transformers: Token-based image representation and processing for computer vision. arXiv preprint arXiv:2006.03677 , 2020 . Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Masayoshi Tomizuka, Kurt Keutzer, and Peter Vajda. Visual transformers: Token-based image representation and processing for computer vision. arXiv preprint arXiv:2006.03677, 2020."},{"key":"e_1_3_2_1_20_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 , 2014 . Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.206"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01228-1_45"},{"key":"e_1_3_2_1_23_1","first-page":"1950","volume-title":"2019 IEEE Winter Conference on Applications of Computer Vision (WACV)","author":"Chen Xinya","unstructured":"Xinya Chen , Yanrui Bin , Nong Sang , and Changxin Gao . Scale pyramid network for crowd counting . In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV) , pages 1941\u2013 1950 . IEEE, 2019. Xinya Chen, Yanrui Bin, Nong Sang, and Changxin Gao. Scale pyramid network for crowd counting. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1941\u20131950. IEEE, 2019."},{"key":"e_1_3_2_1_24_1","volume-title":"consistency targets improve semi-supervised deep learning results. CoRR","author":"Tarvainen Antti","year":"2017","unstructured":"Antti Tarvainen and Harri Valpola . Weight-averaged , consistency targets improve semi-supervised deep learning results. CoRR \u201e vol. abs\/ 1703 , 2017 , 1780. Antti Tarvainen and Harri Valpola. Weight-averaged, consistency targets improve semi-supervised deep learning results. CoRR\u201e vol. abs\/1703, 2017, 1780."},{"key":"e_1_3_2_1_25_1","volume-title":"Interpolation consistency training for semi-supervised learning. arXiv preprint arXiv:1903.03825","author":"Verma Vikas","year":"2019","unstructured":"Vikas Verma , Alex Lamb , Juho Kannala , Yoshua Bengio , and David Lopez-Paz . Interpolation consistency training for semi-supervised learning. arXiv preprint arXiv:1903.03825 , 2019 . Vikas Verma, Alex Lamb, Juho Kannala, Yoshua Bengio, and David Lopez-Paz. Interpolation consistency training for semi-supervised learning. arXiv preprint arXiv:1903.03825, 2019."},{"key":"e_1_3_2_1_26_1","first-page":"259","volume-title":"European Conference on Computer Vision","author":"Liu Yan","unstructured":"Yan Liu , Lingqiao Liu , Peng Wang , Pingping Zhang , and Yinjie Lei . Semi-supervised crowd counting via self-training on surrogate tasks . In European Conference on Computer Vision , pages 242\u2013 259 . Springer, 2020. Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, and Yinjie Lei. Semi-supervised crowd counting via self-training on surrogate tasks. In European Conference on Computer Vision, pages 242\u2013259. Springer, 2020."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01216-8_33"}],"event":{"name":"ACAI'21: 2021 4th International Conference on Algorithms, Computing and Artificial Intelligence","location":"Sanya China","acronym":"ACAI'21"},"container-title":["2021 4th International Conference on Algorithms, Computing and Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3508546.3508548","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,4,14]],"date-time":"2022-04-14T04:59:08Z","timestamp":1649912348000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3508546.3508548"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,22]]},"references-count":27,"alternative-id":["10.1145\/3508546.3508548","10.1145\/3508546"],"URL":"http:\/\/dx.doi.org\/10.1145\/3508546.3508548","relation":{},"published":{"date-parts":[[2021,12,22]]}}}