{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T10:04:28Z","timestamp":1756893868200,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":52,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,10,19]],"date-time":"2017-10-19T00:00:00Z","timestamp":1508371200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,10,19]]},"DOI":"10.1145\/3123266.3123270","type":"proceedings-article","created":{"date-parts":[[2017,10,20]],"date-time":"2017-10-20T13:04:26Z","timestamp":1508504666000},"page":"10-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":31,"title":["SketchParse"],"prefix":"10.1145","author":[{"given":"Ravi Kiran","family":"Sarvadevabhatla","sequence":"first","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}]},{"given":"Isht","family":"Dwivedi","sequence":"additional","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}]},{"given":"Abhijat","family":"Biswas","sequence":"additional","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}]},{"given":"Sahil","family":"Manocha","sequence":"additional","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}]},{"given":"Venkatesh Babu","family":"R.","sequence":"additional","affiliation":[{"name":"Indian Institute of Science, Bangalore, India"}]}],"member":"320","published-online":{"date-parts":[[2017,10,19]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2015.2477680"},{"key":"e_1_3_2_1_2_1","volume-title":"Network of Experts for Large-Scale Image Categorization. 
In 14th European Conference on Computer Vision (Part-VII). Springer International Publishing, 516--532","author":"Ahmed Karim","year":"2016","unstructured":"Karim Ahmed , Mohammad Haris Baig , and Lorenzo Torresani . 2016 . Network of Experts for Large-Scale Image Categorization. In 14th European Conference on Computer Vision (Part-VII). Springer International Publishing, 516--532 . 3 Karim Ahmed, Mohammad Haris Baig, and Lorenzo Torresani. 2016. Network of Experts for Large-Scale Image Categorization. In 14th European Conference on Computer Vision (Part-VII). Springer International Publishing, 516--532. 3"},{"key":"e_1_3_2_1_3_1","unstructured":"Alessandro Bergamo and Lorenzo Torresani. 2010. Exploiting weakly-labeled web images to improve object classification: a domain adaptation approach. In NIPS. 181--189. 3   Alessandro Bergamo and Lorenzo Torresani. 2010. Exploiting weakly-labeled web images to improve object classification: a domain adaptation approach. In NIPS. 181--189. 3"},{"key":"e_1_3_2_1_4_1","unstructured":"Liang-Chieh Chen George Papandreou Iasonas Kokkinos Kevin Murphy and Alan L Yuille. 2015. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. In ICLR. 1  Liang-Chieh Chen George Papandreou Iasonas Kokkinos Kevin Murphy and Alan L Yuille. 2015. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. In ICLR. 1"},{"key":"e_1_3_2_1_5_1","volume-title":"Atrous Convolution, and Fully Connected CRFs. arXiv preprint arXiv:1606.00915","author":"Chen Liang-Chieh","year":"2016","unstructured":"Liang-Chieh Chen , George Papandreou , Iasonas Kokkinos , Kevin Murphy , and Alan L Yuille . 2016. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets , Atrous Convolution, and Fully Connected CRFs. arXiv preprint arXiv:1606.00915 ( 2016 ). 2, 4, 6 Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2016. 
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. arXiv preprint arXiv:1606.00915 (2016). 2, 4, 6"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.254"},{"volume-title":"Reweighted Random Walks for Graph Matching","author":"Cho Minsu","key":"e_1_3_2_1_7_1","unstructured":"Minsu Cho , Jungmin Lee , and Kyoung Mu Lee . 2010. Reweighted Random Walks for Graph Matching . In ECCV. Springer-Verlag , 492--505. 7 Minsu Cho, Jungmin Lee, and Kyoung Mu Lee. 2010. Reweighted Random Walks for Graph Matching. In ECCV. Springer-Verlag, 492--505. 7"},{"key":"e_1_3_2_1_8_1","volume-title":"The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1, 2, 3","author":"Dai Jifeng","year":"2016","unstructured":"Jifeng Dai , Kaiming He , and Jian Sun . 2016 . Instance-Aware Semantic Segmentation via Multi-Task Network Cascades . In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1, 2, 3 Jifeng Dai, Kaiming He, and Jian Sun. 2016. Instance-Aware Semantic Segmentation via Multi-Task Network Cascades. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1, 2, 3"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.304"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2185520.2185540"},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of ICML","volume":"48","author":"Elhoseiny Mohamed","year":"2016","unstructured":"Mohamed Elhoseiny , Tarek El-Gaaly , Amr Bakry , and Ahmed Elgammal . 2016 . A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation . In Proceedings of ICML , Vol. 48 . JMLR.org, 888--897. 3 Mohamed Elhoseiny, Tarek El-Gaaly, Amr Bakry, and Ahmed Elgammal. 2016. A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation. In Proceedings of ICML, Vol. 48. JMLR.org, 888--897. 
3"},{"volume-title":"Attribute-centric recognition for cross-category generalization","author":"Farhadi Ali","key":"e_1_3_2_1_12_1","unstructured":"Ali Farhadi , Ian Endres , and Derek Hoiem . 2010. Attribute-centric recognition for cross-category generalization . In IEEE CVPR. IEEE , 2352--2359. 3 Ali Farhadi, Ian Endres, and Derek Hoiem. 2010. Attribute-centric recognition for cross-category generalization. In IEEE CVPR. IEEE, 2352--2359. 3"},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the IEEE CVPR. 447--456","author":"Hariharan Bharath","year":"2015","unstructured":"Bharath Hariharan , Pablo Arbel\u00e1ez , Ross Girshick , and Jitendra Malik . 2015 . Hypercolumns for object segmentation and fine-grained localization . In Proceedings of the IEEE CVPR. 447--456 . 1, 2 Bharath Hariharan, Pablo Arbel\u00e1ez, Ross Girshick, and Jitendra Malik. 2015. Hypercolumns for object segmentation and fine-grained localization. In Proceedings of the IEEE CVPR. 447--456. 1, 2"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.349"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661280"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2556288.2556987"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.186"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2016.2579306"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Yi Li Timothy M. Hospedales Yi-Zhe Song and Shaogang Gong. 2014. Fine-Grained Sketch-Based Image Retrieval by Matching Deformable Part Models. In BMVC. 1  Yi Li Timothy M. Hospedales Yi-Zhe Song and Shaogang Gong. 2014. Fine-Grained Sketch-Based Image Retrieval by Matching Deformable Part Models. In BMVC. 
1","DOI":"10.5244\/C.28.115"},{"key":"e_1_3_2_1_20_1","first-page":"2","article-title":"Semantic Object Parsing With Local-Global Long Short-Term Memory","volume":"1","author":"Liang Xiaodan","year":"2016","unstructured":"Xiaodan Liang , Xiaohui Shen , Donglai Xiang , Jiashi Feng , Liang Lin , and Shuicheng Yan . 2016 . Semantic Object Parsing With Local-Global Long Short-Term Memory . In The IEEE CVPR. 1 , 2 Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, and Shuicheng Yan. 2016. Semantic Object Parsing With Local-Global Long Short-Term Memory. In The IEEE CVPR. 1, 2","journal-title":"The IEEE CVPR."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.406"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_3_2_1_23_1","volume-title":"NIPS Workshop on Adversarial Training. 3","author":"Luc Pauline","year":"2016","unstructured":"Pauline Luc , Camille Couprie , Soumith Chintala , and Jakob Verbeek . 2016 . Semantic Segmentation using Adversarial Networks . In NIPS Workshop on Adversarial Training. 3 Pauline Luc, Camille Couprie, Soumith Chintala, and Jakob Verbeek. 2016. Semantic Segmentation using Adversarial Networks. In NIPS Workshop on Adversarial Training. 3"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.388"},{"key":"e_1_3_2_1_25_1","unstructured":"Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In ICML. 807--814. 5   Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In ICML. 807--814. 5"},{"key":"e_1_3_2_1_26_1","volume-title":"Global Deconvolutional Networks for Semantic Segmentation. CoRR abs\/1602.03930","author":"Nekrasov Vladimir","year":"2016","unstructured":"Vladimir Nekrasov , Janghoon Ju , and Jaesik Choi . 2016. Global Deconvolutional Networks for Semantic Segmentation. CoRR abs\/1602.03930 ( 2016 ). 
3 Vladimir Nekrasov, Janghoon Ju, and Jaesik Choi. 2016. Global Deconvolutional Networks for Semantic Segmentation. CoRR abs\/1602.03930 (2016). 3"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.178"},{"key":"e_1_3_2_1_28_1","volume-title":"Visual domain adaptation: A survey of recent advances","author":"Patel Vishal M","year":"2015","unstructured":"Vishal M Patel , Raghuraman Gopalan , Ruonan Li , and Rama Chellappa . 2015. Visual domain adaptation: A survey of recent advances . IEEE signal processing magazine 32, 3 ( 2015 ), 53--69. 3 Vishal M Patel, Raghuraman Gopalan, Ruonan Li, and Rama Chellappa. 2015. Visual domain adaptation: A survey of recent advances. IEEE signal processing magazine 32, 3 (2015), 53--69. 3"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.128"},{"key":"e_1_3_2_1_30_1","volume-title":"Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. arXiv preprint arXiv:1603.01249","author":"Ranjan Rajeev","year":"2016","unstructured":"Rajeev Ranjan , Vishal M Patel , and Rama Chellappa . 2016 . Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. arXiv preprint arXiv:1603.01249 (2016). 2, 3 Rajeev Ranjan, Vishal M Patel, and Rama Chellappa. 2016. Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. arXiv preprint arXiv:1603.01249 (2016). 2, 3"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.352"},{"volume-title":"Adapting visual category models to new domains","author":"Saenko Kate","key":"e_1_3_2_1_32_1","unstructured":"Kate Saenko , Brian Kulis , Mario Fritz , and Trevor Darrell . 2010. Adapting visual category models to new domains . In ECCV. Springer , 213--226. 
3 Kate Saenko, Brian Kulis, Mario Fritz, and Trevor Darrell. 2010. Adapting visual category models to new domains. In ECCV. Springer, 213--226. 3"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925954"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2967220"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661231"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2898351"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CBMI.2015.7153606"},{"key":"e_1_3_2_1_38_1","volume-title":"An Ontology for Domain-oriented Semantic Similarity Search on XML Data. In BTW 2003","author":"Theobald Anja","year":"2003","unstructured":"Anja Theobald . 2003 . An Ontology for Domain-oriented Semantic Similarity Search on XML Data. In BTW 2003 , Datenbanksysteme f\u00fcr Business, Technologie und Web, Tagungsband der 10. BTW-Konferenz, 26.-28. Februar 2003, Leipzig. 217--226. 4 Anja Theobald. 2003. An Ontology for Domain-oriented Semantic Similarity Search on XML Data. In BTW 2003, Datenbanksysteme f\u00fcr Business, Technologie und Web, Tagungsband der 10. BTW-Konferenz, 26.-28. Februar 2003, Leipzig. 217--226. 4"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMI.2014.2366792"},{"volume-title":"Towards weakly supervised semantic segmentation by means of multiple instance and multitask learning","author":"Vezhnevets Alexander","key":"e_1_3_2_1_40_1","unstructured":"Alexander Vezhnevets and Joachim M Buhmann . 2010. Towards weakly supervised semantic segmentation by means of multiple instance and multitask learning . In IEEE CVPR. IEEE , 3249--3256. 3 Alexander Vezhnevets and Joachim M Buhmann. 2010. Towards weakly supervised semantic segmentation by means of multiple instance and multitask learning. In IEEE CVPR. IEEE, 3249--3256. 
3"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.184"},{"volume-title":"The Free Encyclopedia. https:\/\/en.wikipedia.org\/wiki\/Cardinal_direction.","year":"2017","key":"e_1_3_2_1_42_1","unstructured":"Wikipedia. 2017. Cardinal direction -- Wikipedia , The Free Encyclopedia. https:\/\/en.wikipedia.org\/wiki\/Cardinal_direction. ( 2017 ). 3 Wikipedia. 2017. Cardinal direction -- Wikipedia, The Free Encyclopedia. https:\/\/en.wikipedia.org\/wiki\/Cardinal_direction. (2017). 3"},{"volume-title":"Proceedings of 14th European Conference in Computer Vision: Part V. 648--663","author":"Xia Fangting","key":"e_1_3_2_1_43_1","unstructured":"Fangting Xia , Peng Wang , Liang-Chieh Chen , and Alan L. Yuille . 2016. Zoom Better to See Clearer: Human and Object Parsing with Hierarchical Auto-Zoom Net . In Proceedings of 14th European Conference in Computer Vision: Part V. 648--663 . 1, 2 Fangting Xia, Peng Wang, Liang-Chieh Chen, and Alan L. Yuille. 2016. Zoom Better to See Clearer: Human and Object Parsing with Hierarchical Auto-Zoom Net. In Proceedings of 14th European Conference in Computer Vision: Part V. 648--663. 1, 2"},{"key":"e_1_3_2_1_44_1","unstructured":"Ren Xiaofeng and Liefeng Bo. 2012. Discriminatively trained sparse code gradients for contour detection. In Advances in neural information processing systems. 584--592. 3   Ren Xiaofeng and Liefeng Bo. 2012. Discriminatively trained sparse code gradients for contour detection. In Advances in neural information processing systems. 584--592. 3"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.314"},{"key":"e_1_3_2_1_46_1","volume-title":"Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122","author":"Yu Fisher","year":"2015","unstructured":"Fisher Yu and Vladlen Koltun . 2015. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 ( 2015 ). 5 Fisher Yu and Vladlen Koltun. 2015. 
Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 (2015). 5"},{"key":"e_1_3_2_1_47_1","volume-title":"Sketch Me That Shoe. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1","author":"Yu Qian","year":"2016","unstructured":"Qian Yu , Feng Liu , Yi-Zhe Song , Tao Xiang , Timothy M. Hospedales , and Chen-Change Loy . 2016 . Sketch Me That Shoe. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1 Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, and Chen-Change Loy. 2016. Sketch Me That Shoe. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1"},{"key":"e_1_3_2_1_48_1","volume-title":"Sketch-a-Net that Beats Humans. BMVC","author":"Yu Qian","year":"2015","unstructured":"Qian Yu , Yongxin Yang , Yi-Zhe Song , Tao Xiang , and Timothy Hospedales . 2015. Sketch-a-Net that Beats Humans. BMVC ( 2015 ). 1, 2, 3, 4 Qian Yu, Yongxin Yang, Yi-Zhe Song, Tao Xiang, and Timothy Hospedales. 2015. Sketch-a-Net that Beats Humans. BMVC (2015). 1, 2, 3, 4"},{"key":"e_1_3_2_1_49_1","volume-title":"Xiaogang Wang, and Dimitris Metaxas","author":"Zhang Han","year":"2016","unstructured":"Han Zhang , Tao Xu , Hongsheng Li , Shaoting Zhang , Xiaolei Huang , Xiaogang Wang, and Dimitris Metaxas . 2016 . StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks . arXiv preprint arXiv:1612.03242 (2016). 7 Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaolei Huang, Xiaogang Wang, and Dimitris Metaxas. 2016. StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. arXiv preprint arXiv:1612.03242 (2016). 7"},{"key":"e_1_3_2_1_50_1","volume-title":"Deep Neural Networks for Free-Hand Sketch Recognition. In 17th Pacific-Rim Conference on Multimedia, Xi'an, China, September 15--16","author":"Zhang Yuqi","year":"2016","unstructured":"Yuqi Zhang , Yuting Zhang , and Xueming Qian . 2016 . 
Deep Neural Networks for Free-Hand Sketch Recognition. In 17th Pacific-Rim Conference on Multimedia, Xi'an, China, September 15--16 , 2016. 3 Yuqi Zhang, Yuting Zhang, and Xueming Qian. 2016. Deep Neural Networks for Free-Hand Sketch Recognition. In 17th Pacific-Rim Conference on Multimedia, Xi'an, China, September 15--16, 2016. 3"},{"key":"e_1_3_2_1_51_1","unstructured":"Bin Zhao Fei Li and Eric P Xing. 2011. Large-scale category structure aware image categorization. In NIPS. 1251--1259. 2   Bin Zhao Fei Li and Eric P Xing. 2011. Large-scale category structure aware image categorization. In NIPS. 1251--1259. 2"},{"key":"e_1_3_2_1_52_1","volume-title":"Improved Deep Learning of Object Category using Pose Information. CoRR abs\/1607.05836","author":"Zhao Jiaping","year":"2016","unstructured":"Jiaping Zhao and Laurent Itti . 2016. Improved Deep Learning of Object Category using Pose Information. CoRR abs\/1607.05836 ( 2016 ). http:\/\/arxiv.org\/abs\/1607. 05836 3 Jiaping Zhao and Laurent Itti. 2016. Improved Deep Learning of Object Category using Pose Information. CoRR abs\/1607.05836 (2016). http:\/\/arxiv.org\/abs\/1607. 
05836 3"}],"event":{"name":"MM '17: ACM Multimedia Conference","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Mountain View California USA","acronym":"MM '17"},"container-title":["Proceedings of the 25th ACM international conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3123266.3123270","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3123266.3123270","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:39:28Z","timestamp":1750217968000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3123266.3123270"}},"subtitle":["Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks"],"short-title":[],"issued":{"date-parts":[[2017,10,19]]},"references-count":52,"alternative-id":["10.1145\/3123266.3123270","10.1145\/3123266"],"URL":"https:\/\/doi.org\/10.1145\/3123266.3123270","relation":{},"subject":[],"published":{"date-parts":[[2017,10,19]]},"assertion":[{"value":"2017-10-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}