{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T07:40:24Z","timestamp":1765438824777,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":67,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548299","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:42:46Z","timestamp":1665416566000},"page":"4406-4415","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Combining Vision and Language Representations for Patch-based Identification of Lexico-Semantic Relations"],"prefix":"10.1145","author":[{"given":"Prince","family":"Jha","sequence":"first","affiliation":[{"name":"Indian Intitute of Technology Patna, Patna, India"}]},{"given":"Ga\u00ebl","family":"Dias","sequence":"additional","affiliation":[{"name":"Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC, Caen, France"}]},{"given":"Alexis","family":"Lechervy","sequence":"additional","affiliation":[{"name":"Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC, CAEN, France"}]},{"given":"Jose G.","family":"Moreno","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Toulouse, IRIT UMR 5505 CNRS, Toulouse, France"}]},{"given":"Anubhav","family":"Jangra","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology Patna, Bangalore, India"}]},{"given":"Sebasti\u00e3o","family":"Pais","sequence":"additional","affiliation":[{"name":"University of Beira Interior, Covilh\u00e3, Portugal"}]},{"given":"Sriparna","family":"Saha","sequence":"additional","affiliation":[{"name":"Indian Intitute of Technology Patna, Patna, India"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Moreno","author":"Akhmouch Houssam","year":"2021","unstructured":"Houssam Akhmouch , Ga\u00ebl Dias , and Jose G . Moreno . 2021 . Understanding Feature Focus in Multitask Settings for Lexico-semantic Relation Identification. In Findings of the Association for Computational Linguistics (ACL\/IJCNLP) . ACL, Thailand, 2762--2772. Houssam Akhmouch, Ga\u00ebl Dias, and Jose G. Moreno. 2021. Understanding Feature Focus in Multitask Settings for Lexico-semantic Relation Identification. In Findings of the Association for Computational Linguistics (ACL\/IJCNLP). ACL, Thailand, 2762--2772."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.279"},{"key":"e_1_3_2_1_3_1","volume-title":"Workshop on Cognitive Aspects of the Lexicon. 86--91","author":"Attia Mohammed","year":"2016","unstructured":"Mohammed Attia , Suraj Maharjan , Younes Samih , Laura Kallmeyer , and Thamar Solorio . 2016 . CogALex-V Shared Task: GHHH - Detecting Semantic Relations via Word Embeddings . In Workshop on Cognitive Aspects of the Lexicon. 86--91 . Mohammed Attia, Suraj Maharjan, Younes Samih, Laura Kallmeyer, and Thamar Solorio. 2016. CogALex-V Shared Task: GHHH - Detecting Semantic Relations via Word Embeddings. In Workshop on Cognitive Aspects of the Lexicon. 86--91."},{"key":"e_1_3_2_1_4_1","volume-title":"Learning Lexical-Semantic Relations Using Intuitive Cognitive Links. In 41st European Conference on Information Retrieval (ECIR). 3--18","author":"Balikas Georgios","year":"2019","unstructured":"Georgios Balikas , Ga\u00ebl Dias , Rumen Moraliyski , Houssam Akhmouch , and Massih-Reza Amini . 2019 . Learning Lexical-Semantic Relations Using Intuitive Cognitive Links. In 41st European Conference on Information Retrieval (ECIR). 3--18 . Georgios Balikas, Ga\u00ebl Dias, Rumen Moraliyski, Houssam Akhmouch, and Massih-Reza Amini. 2019. Learning Lexical-Semantic Relations Using Intuitive Cognitive Links. In 41st European Conference on Information Retrieval (ECIR). 3--18."},{"key":"e_1_3_2_1_5_1","volume-title":"Multimodal machine learning: A survey and taxonomy","author":"Tadas","year":"2018","unstructured":"Tadas Baltru?aitis, Chaitanya Ahuja , and Louis-Philippe Morency . 2018. Multimodal machine learning: A survey and taxonomy . IEEE transactions on pattern analysis and machine intelligence 41, 2 ( 2018 ), 423--443. Tadas Baltru?aitis, Chaitanya Ahuja, and Louis-Philippe Morency. 2018. Multimodal machine learning: A survey and taxonomy. IEEE transactions on pattern analysis and machine intelligence 41, 2 (2018), 423--443."},{"key":"e_1_3_2_1_6_1","volume-title":"Patch-Based Identification of Lexical Semantic Relations. In 42nd European Conference on Information Retrieval (ECIR). 126--140","author":"Bannour Nesrine","year":"2020","unstructured":"Nesrine Bannour , Ga\u00ebl Dias , Youssef Chahir , and Houssam Akhmouch . 2020 . Patch-Based Identification of Lexical Semantic Relations. In 42nd European Conference on Information Retrieval (ECIR). 126--140 . Nesrine Bannour, Ga\u00ebl Dias, Youssef Chahir, and Houssam Akhmouch. 2020. Patch-Based Identification of Lexical Semantic Relations. In 42nd European Conference on Information Retrieval (ECIR). 126--140."},{"key":"e_1_3_2_1_7_1","volume-title":"Entailment Above theWord Level in Distributional Semantics. In 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 23--32","author":"Baroni Marco","year":"2012","unstructured":"Marco Baroni , Raffaella Bernardi , Ngoc-Quynh Do , and Chung chieh Shan . 2012 . Entailment Above theWord Level in Distributional Semantics. In 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 23--32 . Marco Baroni, Raffaella Bernardi, Ngoc-Quynh Do, and Chung chieh Shan. 2012. Entailment Above theWord Level in Distributional Semantics. In 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 23--32."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6242"},{"key":"e_1_3_2_1_9_1","volume-title":"UNITER: UNiversal Image-TExt Representation Learning. In European Conference on Computer Vision (ECCV). 104--120","author":"Chen Yen-Chun","year":"2020","unstructured":"Yen-Chun Chen , Linjie Li , Licheng Yu , Ahmed El Kholy , Faisal Ahmed , Zhe Gan , Yu Cheng , and Jingjing Liu . 2020 . UNITER: UNiversal Image-TExt Representation Learning. In European Conference on Computer Vision (ECCV). 104--120 . Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, and Jingjing Liu. 2020. UNITER: UNiversal Image-TExt Representation Learning. In European Conference on Computer Vision (ECCV). 104--120."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-28577-7"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11063-020-10314-8"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_13_1","volume-title":"Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 4171--4186","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . Bert: Pre-training of deep bidirectional transformers for language understanding . In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 4171--4186 . Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 4171--4186."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1091"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/2817174.2876374"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-016-9475-9"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1515\/jls-2020-2017"},{"key":"e_1_3_2_1_18_1","volume-title":"Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment. In 57th Conference of the Association for Computational Linguistics (ACL). 4824--4830","author":"Glavas Goran","year":"2019","unstructured":"Goran Glavas and Ivan Vulic . 2019 . Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment. In 57th Conference of the Association for Computational Linguistics (ACL). 4824--4830 . Goran Glavas and Ivan Vulic. 2019. Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment. In 57th Conference of the Association for Computational Linguistics (ACL). 4824--4830."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.3115\/992133.992154"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.tics.2019.05.006"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3295748"},{"key":"e_1_3_2_1_22_1","volume-title":"Attentional control and the self: the Self-Attention Network (SAN). Cognitive neuroscience 7, 1--4","author":"Humphreys Glyn W","year":"2016","unstructured":"Glyn W Humphreys and Jie Sui . 2016. Attentional control and the self: the Self-Attention Network (SAN). Cognitive neuroscience 7, 1--4 ( 2016 ), 5--17. Glyn W Humphreys and Jie Sui. 2016. Attentional control and the self: the Self-Attention Network (SAN). Cognitive neuroscience 7, 1--4 (2016), 5--17."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCI.2019.2901085"},{"key":"e_1_3_2_1_24_1","volume-title":"Specializing Distributional Vectors of All Words for Lexical Entailment. In 4th Workshop on Representation Learning for NLP (RepL4NLP). 72--83","author":"Kamath Aishwarya","year":"2019","unstructured":"Aishwarya Kamath , Jonas Pfeiffer , Edoardo Maria Ponti , Goran Glava , and Ivan Vulic . 2019 . Specializing Distributional Vectors of All Words for Lexical Entailment. In 4th Workshop on Representation Learning for NLP (RepL4NLP). 72--83 . Aishwarya Kamath, Jonas Pfeiffer, Edoardo Maria Ponti, Goran Glava, and Ivan Vulic. 2019. Specializing Distributional Vectors of All Words for Lexical Entailment. In 4th Workshop on Representation Learning for NLP (RepL4NLP). 72--83."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5120\/ijca2017914424"},{"volume-title":"Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations (ICLR), Yoshua Bengio and Yann LeCun (Eds.).","author":"Diederik","key":"e_1_3_2_1_26_1","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015 . Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations (ICLR), Yoshua Bengio and Yann LeCun (Eds.). Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations (ICLR), Yoshua Bengio and Yann LeCun (Eds.)."},{"key":"e_1_3_2_1_27_1","volume-title":"Conference on Empirical Methods in Natural Language Processing (EMNLP). 1110--1118","author":"Kozareva Zornitsa","year":"2010","unstructured":"Zornitsa Kozareva and Eduard Hovy . 2010 . A Semi-supervised Method to Learn and Construct Taxonomies Using the Web . In Conference on Empirical Methods in Natural Language Processing (EMNLP). 1110--1118 . Zornitsa Kozareva and Eduard Hovy. 2010. A Semi-supervised Method to Learn and Construct Taxonomies Using the Web. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 1110--1118."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1016"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1098"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1098"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240605"},{"key":"e_1_3_2_1_32_1","volume-title":"Microsoft COCO: Common Objects in Context. CoRR abs\/1405.0312","author":"Lin Tsung-Yi","year":"2014","unstructured":"Tsung-Yi Lin , Michael Maire , Serge J. Belongie , Lubomir D. Bourdev , Ross B. Girshick , James Hays , Pietro Perona , Deva Ramanan , Piotr Doll\u00e1r , and C. Lawrence Zitnick . 2014. Microsoft COCO: Common Objects in Context. CoRR abs\/1405.0312 ( 2014 ). Tsung-Yi Lin, Michael Maire, Serge J. Belongie, Lubomir D. Bourdev, Ross B. Girshick, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll\u00e1r, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. CoRR abs\/1405.0312 (2014)."},{"key":"e_1_3_2_1_33_1","volume-title":"DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention. ACM Transactions on Knowledge Discovery from Data (TKDD) 16, 1","author":"Liu Fenglin","year":"2021","unstructured":"Fenglin Liu , Xian Wu , Shen Ge , Xuancheng Ren , Wei Fan , Xu Sun , and Yuexian Zou . 2021. DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention. ACM Transactions on Knowledge Discovery from Data (TKDD) 16, 1 ( 2021 ), 1--19. Fenglin Liu, Xian Wu, Shen Ge, Xuancheng Ren, Wei Fan, Xu Sun, and Yuexian Zou. 2021. DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention. ACM Transactions on Knowledge Discovery from Data (TKDD) 16, 1 (2021), 1--19."},{"key":"e_1_3_2_1_34_1","volume-title":"Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. Advances in neural information processing systems (NeurIPS) 32","author":"Lu Jiasen","year":"2019","unstructured":"Jiasen Lu , Dhruv Batra , Devi Parikh , and Stefan Lee . 2019 . Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. Advances in neural information processing systems (NeurIPS) 32 (2019). Jiasen Lu, Dhruv Batra, Devi Parikh, and Stefan Lee. 2019. Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. Advances in neural information processing systems (NeurIPS) 32 (2019)."},{"key":"e_1_3_2_1_35_1","volume-title":"Survey on AI-Based Multimodal Methods for Emotion Detection. Highperformance modelling and simulation for big data applications 11400","author":"Marechal Catherine","year":"2019","unstructured":"Catherine Marechal , Dariusz Mikolajewski , Krzysztof Tyburek , Piotr Prokopowicz , Lamine Bougueroua , Corinne Ancourt , and Katarzyna Wegrzyn-Wolska . 2019. Survey on AI-Based Multimodal Methods for Emotion Detection. Highperformance modelling and simulation for big data applications 11400 ( 2019 ), 307--324. Catherine Marechal, Dariusz Mikolajewski, Krzysztof Tyburek, Piotr Prokopowicz, Lamine Bougueroua, Corinne Ancourt, and Katarzyna Wegrzyn-Wolska. 2019. Survey on AI-Based Multimodal Methods for Emotion Detection. Highperformance modelling and simulation for big data applications 11400 (2019), 307--324."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1093\/ijl\/3.4.235"},{"key":"e_1_3_2_1_37_1","volume-title":"Hierarchical Embeddings for Hypernymy Detection and Directionality. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 233--243","author":"Nguyen Kim Anh","year":"2017","unstructured":"Kim Anh Nguyen , Maximilian K\u00f6per , Sabine Schulte im Walde , and Ngoc Thang Vu . 2017 . Hierarchical Embeddings for Hypernymy Detection and Directionality. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 233--243 . Kim Anh Nguyen, Maximilian K\u00f6per, Sabine Schulte im Walde, and Ngoc Thang Vu. 2017. Hierarchical Embeddings for Hypernymy Detection and Directionality. In Conference on Empirical Methods in Natural Language Processing (EMNLP). 233--243."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1008"},{"volume-title":"GloVe: Global Vectors for Word Representation. In Conference on Empirical Methods on Natural Language Processing (EMNLP). 1532--1543","author":"Pennington Jeffrey","key":"e_1_3_2_1_40_1","unstructured":"Jeffrey Pennington , Richard Socher , and Christopher D. Manning . 2014 . GloVe: Global Vectors for Word Representation. In Conference on Empirical Methods on Natural Language Processing (EMNLP). 1532--1543 . Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Conference on Empirical Methods on Natural Language Processing (EMNLP). 1532--1543."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2017.134"},{"key":"e_1_3_2_1_42_1","volume-title":"Designing Multimodal Datasets for NLP Challenges. CoRR abs\/2105.05999","author":"Pustejovsky James","year":"2021","unstructured":"James Pustejovsky , Eben Holderness , Jingxuan Tu , Parker Glenn , Kyeongmin Rim , Kelley Lynch , and Richard Brutti . 2021. Designing Multimodal Datasets for NLP Challenges. CoRR abs\/2105.05999 ( 2021 ). arXiv:2105.05999 James Pustejovsky, Eben Holderness, Jingxuan Tu, Parker Glenn, Kyeongmin Rim, Kelley Lynch, and Richard Brutti. 2021. Designing Multimodal Datasets for NLP Challenges. CoRR abs\/2105.05999 (2021). arXiv:2105.05999"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2019.2925204"},{"key":"e_1_3_2_1_44_1","volume-title":"International Conference on Machine Learning. PMLR, 8748--8763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , 2021 . Learning transferable visual models from natural language supervision . In International Conference on Machine Learning. PMLR, 8748--8763 . Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning. PMLR, 8748--8763."},{"key":"e_1_3_2_1_45_1","volume-title":"Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever.","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong Wook Kim , Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021 . Learning Transferable Visual Models From Natural Language Supervision. CoRR abs\/2103.00020 (2021). Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. CoRR abs\/2103.00020 (2021)."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-2101"},{"key":"e_1_3_2_1_47_1","volume-title":"25th International Conference on Computational Linguistics (COLING). 1025--1036","author":"Roller Stephen","year":"2014","unstructured":"Stephen Roller , Katrin Erk , and Gemma Boleda . 2014 . Inclusive yet Selective: Supervised Distributional Hypernymy Detection . In 25th International Conference on Computational Linguistics (COLING). 1025--1036 . Stephen Roller, Katrin Erk, and Gemma Boleda. 2014. Inclusive yet Selective: Supervised Distributional Hypernymy Detection. In 25th International Conference on Computational Linguistics (COLING). 1025--1036."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-2057"},{"key":"e_1_3_2_1_49_1","volume-title":"10th International Conference on Language Resources and Evaluation (LREC). 4557--4564","author":"Santus Enrico","year":"2016","unstructured":"Enrico Santus , Alessandro Lenci , Tin-Shing Chiu , Qin Lu , and Chu-Ren Huang . 2016 . Nine Features in a Random Forest to Learn Taxonomical Semantic Relations . In 10th International Conference on Language Resources and Evaluation (LREC). 4557--4564 . Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu, and Chu-Ren Huang. 2016. Nine Features in a Random Forest to Learn Taxonomical Semantic Relations. In 10th International Conference on Language Resources and Evaluation (LREC). 4557--4564."},{"key":"e_1_3_2_1_50_1","volume-title":"15th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 65--75","author":"Santus Enrico","year":"2017","unstructured":"Enrico Santus , Vered Shwartz , and Dominik Schlechtweg . 2017 . Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection . In 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 65--75 . Enrico Santus, Vered Shwartz, and Dominik Schlechtweg. 2017. Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection. In 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL). 65--75."},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474085.3475321"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1226"},{"key":"e_1_3_2_1_53_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and AndrewZisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and AndrewZisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_1_54_1","volume-title":"Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations (ICLR).","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman . 2015 . Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations (ICLR). Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations (ICLR)."},{"volume-title":"Learning Syntactic Patterns for Automatic Hypernym Discovery. In 17th International Conference on Neural Information Processing Systems (NeurIPS). 1297--1304","author":"Snow Rion","key":"e_1_3_2_1_55_1","unstructured":"Rion Snow , Daniel Jurafsky , and Andrew Y. Ng . 2004 . Learning Syntactic Patterns for Automatic Hypernym Discovery. In 17th International Conference on Neural Information Processing Systems (NeurIPS). 1297--1304 . Rion Snow, Daniel Jurafsky, and Andrew Y. Ng. 2004. Learning Syntactic Patterns for Automatic Hypernym Discovery. In 17th International Conference on Neural Information Processing Systems (NeurIPS). 1297--1304."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2017.08.003"},{"key":"e_1_3_2_1_57_1","volume-title":"CentralNet: A Multilayer Approach for Multimodal Fusion. In European Conference on Computer Vision (ECCV). 575--589","author":"Vielzeuf Valentin","year":"2018","unstructured":"Valentin Vielzeuf , Alexis Lechervy , St\u00e9phane Pateux , and Fr\u00e9d\u00e9ric Jurie . 2018 . CentralNet: A Multilayer Approach for Multimodal Fusion. In European Conference on Computer Vision (ECCV). 575--589 . Valentin Vielzeuf, Alexis Lechervy, St\u00e9phane Pateux, and Fr\u00e9d\u00e9ric Jurie. 2018. CentralNet: A Multilayer Approach for Multimodal Fusion. In European Conference on Computer Vision (ECCV). 575--589."},{"key":"e_1_3_2_1_58_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0.","author":"Vielzeuf Valentin","year":"2018","unstructured":"Valentin Vielzeuf , Alexis Lechervy , St\u00e9phane Pateux , and Fr\u00e9d\u00e9ric Jurie . 2018 . Centralnet: a multilayer approach for multimodal fusion . In Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0. Valentin Vielzeuf, Alexis Lechervy, St\u00e9phane Pateux, and Fr\u00e9d\u00e9ric Jurie. 2018. Centralnet: a multilayer approach for multimodal fusion. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 0--0."},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S18-2020"},{"key":"e_1_3_2_1_60_1","volume-title":"Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145","author":"Vulic Ivan","year":"2018","unstructured":"Ivan Vulic and Nikola Mrk?ic. 2018 . Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145 . Ivan Vulic and Nikola Mrk?ic. 2018. Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145."},{"key":"e_1_3_2_1_61_1","volume-title":"Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145","author":"Vulic Ivan","year":"2018","unstructured":"Ivan Vulic and Nikola Mrksic . 2018 . Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145 . Ivan Vulic and Nikola Mrksic. 2018. Specialising Word Vectors for Lexical Entailment. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 1134--1145."},{"key":"e_1_3_2_1_62_1","volume-title":"Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules. In 55th Annual Meeting of the Association for Computational Linguistics (ACL). 56--68","author":"Vulic Ivan","year":"2017","unstructured":"Ivan Vulic , Nikola Mrksic , Roi Reichart , Diarmuid \u00d3 S\u00e9aghdha , Steve J. Young , and Anna Korhonen . 2017 . Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules. In 55th Annual Meeting of the Association for Computational Linguistics (ACL). 56--68 . Ivan Vulic, Nikola Mrksic, Roi Reichart, Diarmuid \u00d3 S\u00e9aghdha, Steve J. Young, and Anna Korhonen. 2017. Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules. In 55th Annual Meeting of the Association for Computational Linguistics (ACL). 56--68."},{"key":"e_1_3_2_1_63_1","volume-title":"Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning. In 54th Annual Meeting of the Association for Computational Linguistics (ACL). 1671--1682","author":"Vylomova Ekaterina","year":"2016","unstructured":"Ekaterina Vylomova , Laura Rimell , Trevor Cohn , and Timothy Baldwin . 2016 . Take and Took, Gaggle and Goose , Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning. In 54th Annual Meeting of the Association for Computational Linguistics (ACL). 1671--1682 . Ekaterina Vylomova, Laura Rimell, Trevor Cohn, and Timothy Baldwin. 2016. Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning. In 54th Annual Meeting of the Association for Computational Linguistics (ACL). 1671--1682."},{"key":"e_1_3_2_1_64_1","volume-title":"BiRRE: Learning Bidirectional Residual Relation Embeddings for Supervised Hypernymy Detection. In 58th Annual Meeting of the Association for Computational Linguistics (ACL). 3630--3640","author":"Wang Chengyu","year":"2020","unstructured":"Chengyu Wang and Xiaofeng He . 2020 . BiRRE: Learning Bidirectional Residual Relation Embeddings for Supervised Hypernymy Detection. In 58th Annual Meeting of the Association for Computational Linguistics (ACL). 3630--3640 . Chengyu Wang and Xiaofeng He. 2020. BiRRE: Learning Bidirectional Residual Relation Embeddings for Supervised Hypernymy Detection. In 58th Annual Meeting of the Association for Computational Linguistics (ACL). 3630--3640."},{"key":"e_1_3_2_1_65_1","volume-title":"5th International Conference on Computational Linguistics (COLING). 2249--2259","author":"Weeds Julie","year":"2014","unstructured":"Julie Weeds , Daoud Clarke , Jeremy Reffin , David J. Weir , and Bill Keller . 2014 . Learning to Distinguish Hypernyms and Co-Hyponyms . In 5th International Conference on Computational Linguistics (COLING). 2249--2259 . Julie Weeds, Daoud Clarke, Jeremy Reffin, David J. Weir, and Bill Keller. 2014. Learning to Distinguish Hypernyms and Co-Hyponyms. In 5th International Conference on Computational Linguistics (COLING). 2249--2259."},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00048"},{"key":"e_1_3_2_1_67_1","volume-title":"Langlotz","author":"Zhang Yuhao","year":"2020","unstructured":"Yuhao Zhang , Hang Jiang , Yasuhide Miura , Christopher D. Manning , and Curtis P . Langlotz . 2020 . Contrastive Learning of Medical Visual Representations from Paired Images and Text. CoRR abs\/2010.00747 (2020). arXiv:2010.00747 Yuhao Zhang, Hang Jiang, Yasuhide Miura, Christopher D. Manning, and Curtis P. Langlotz. 2020. Contrastive Learning of Medical Visual Representations from Paired Images and Text. CoRR abs\/2010.00747 (2020). arXiv:2010.00747"},{"key":"e_1_3_2_1_68_1","volume-title":"Multimodal Relation Extraction with Efficient Graph Alignment. In 29th ACM International Conference on Multimedia (MM). 5298--5306","author":"Zheng Changmeng","year":"2021","unstructured":"Changmeng Zheng , Junhao Feng , Ze Fu , Yi Cai , Qing Li , and Tao Wang . 2021 . Multimodal Relation Extraction with Efficient Graph Alignment. In 29th ACM International Conference on Multimedia (MM). 5298--5306 . Changmeng Zheng, Junhao Feng, Ze Fu, Yi Cai, Qing Li, and Tao Wang. 2021. Multimodal Relation Extraction with Efficient Graph Alignment. In 29th ACM International Conference on Multimedia (MM). 5298--5306."}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Lisboa Portugal","acronym":"MM '22"},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548299","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548299","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:43Z","timestamp":1750186843000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548299"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":67,"alternative-id":["10.1145\/3503161.3548299","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548299","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}