{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T21:50:36Z","timestamp":1776289836997,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,12,6]],"date-time":"2023-12-06T00:00:00Z","timestamp":1701820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"the Science and Technology Research Program of Chongqing Municipal Education Commission","award":["KJZD-K202200513"],"award-info":[{"award-number":["KJZD-K202200513"]}]},{"name":"Chongqing Natural Science Foundation","award":["CSTB2022NSCQ-MSX1417"],"award-info":[{"award-number":["CSTB2022NSCQ-MSX1417"]}]},{"name":"Humanities and social science research project of Chongqing Education Commission","award":["22SKGH100"],"award-info":[{"award-number":["22SKGH100"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,12,6]]},"DOI":"10.1145\/3595916.3626424","type":"proceedings-article","created":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T16:34:41Z","timestamp":1704126881000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Multi-view\u2013enhanced modal fusion hashing for Unsupervised cross-modal retrieval"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5224-9196","authenticated-orcid":false,"given":"Longfei","family":"Ma","sequence":"first","affiliation":[{"name":"School of Computer and Information Science, Chongqing Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-1447-2593","authenticated-orcid":false,"given":"Honggang","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Computer and Information Science, Chongqing Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-2273-6851","authenticated-orcid":false,"given":"Zheng","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Computer and Information Science, Chongqing Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5517-3633","authenticated-orcid":false,"given":"Mingyong","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer and Information Science, Chongqing Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1646396.1646452"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2821921"},{"key":"e_1_3_2_1_3_1","volume-title":"An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929","author":"Dosovitskiy Alexey","year":"2020","unstructured":"Alexey Dosovitskiy , Lucas Beyer , Alexander Kolesnikov , Dirk Weissenborn , Xiaohua Zhai , Thomas Unterthiner , Mostafa Dehghani , Matthias Minderer , Georg Heigold , Sylvain Gelly , 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 ( 2020 ). Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1460096.1460104"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2019.2897944"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00446"},{"key":"e_1_3_2_1_7_1","volume-title":"Proceedings of the 30th ACM International Conference on Multimedia. 3712\u20133721","author":"Li Liang","year":"2022","unstructured":"Liang Li , Baihua Zheng , and Weiwei Sun . 2022 . Adaptive structural similarity preserving for unsupervised cross modal hashing . In Proceedings of the 30th ACM International Conference on Multimedia. 3712\u20133721 . Liang Li, Baihua Zheng, and Weiwei Sun. 2022. Adaptive structural similarity preserving for unsupervised cross modal hashing. In Proceedings of the 30th ACM International Conference on Multimedia. 3712\u20133721."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition. 3864\u20133872","author":"Lin Zijia","year":"2015","unstructured":"Zijia Lin , Guiguang Ding , Mingqing Hu , and Jianmin Wang . 2015 . Semantics-preserving hashing for cross-view retrieval . In Proceedings of the IEEE conference on computer vision and pattern recognition. 3864\u20133872 . Zijia Lin, Guiguang Ding, Mingqing Hu, and Jianmin Wang. 2015. Semantics-preserving hashing for cross-view retrieval. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3864\u20133872."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397271.3401086"},{"key":"e_1_3_2_1_11_1","volume-title":"Deep Rank Cross-Modal Hashing with Semantic Consistent for Image-Text Retrieval. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4828\u20134832","author":"Liu Xiaoqing","year":"2022","unstructured":"Xiaoqing Liu , Huanqiang Zeng , Yifan Shi , Jianqing Zhu , and Kai-Kuang Ma . 2022 . Deep Rank Cross-Modal Hashing with Semantic Consistent for Image-Text Retrieval. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4828\u20134832 . Xiaoqing Liu, Huanqiang Zeng, Yifan Shi, Jianqing Zhu, and Kai-Kuang Ma. 2022. Deep Rank Cross-Modal Hashing with Semantic Consistent for Image-Text Retrieval. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 4828\u20134832."},{"key":"e_1_3_2_1_12_1","volume-title":"Deep unsupervised contrastive hashing for large-scale cross-modal text-image retrieval in remote sensing. arXiv preprint arXiv:2201.08125","author":"Mikriukov Georgii","year":"2022","unstructured":"Georgii Mikriukov , Mahdyar Ravanbakhsh , and Beg\u00fcm Demir . 2022. Deep unsupervised contrastive hashing for large-scale cross-modal text-image retrieval in remote sensing. arXiv preprint arXiv:2201.08125 ( 2022 ). Georgii Mikriukov, Mahdyar Ravanbakhsh, and Beg\u00fcm Demir. 2022. Deep unsupervised contrastive hashing for large-scale cross-modal text-image retrieval in remote sensing. arXiv preprint arXiv:2201.08125 (2022)."},{"key":"e_1_3_2_1_13_1","volume-title":"International conference on machine learning. PMLR, 8748\u20138763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong\u00a0Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , 2021 . Learning transferable visual models from natural language supervision . In International conference on machine learning. PMLR, 8748\u20138763 . Alec Radford, Jong\u00a0Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748\u20138763."},{"key":"e_1_3_2_1_14_1","volume-title":"Multi-view Contrastive Learning for Image-Text Pre-training. arXiv preprint arXiv:2209.15270","author":"Shan Bin","year":"2022","unstructured":"Bin Shan , Weichong Yin , Yu Sun , Hao Tian , Hua Wu , and Haifeng Wang . 2022. ERNIE-Vi L 2.0 : Multi-view Contrastive Learning for Image-Text Pre-training. arXiv preprint arXiv:2209.15270 ( 2022 ). Bin Shan, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, and Haifeng Wang. 2022. ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training. arXiv preprint arXiv:2209.15270 (2022)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Yufeng Shi Xinge You Feng Zheng Shuo Wang and Qinmu Peng. 2019. Equally-Guided Discriminative Hashing for Cross-modal Retrieval.. In IJCAI. 4767\u20134773.  Yufeng Shi Xinge You Feng Zheng Shuo Wang and Qinmu Peng. 2019. Equally-Guided Discriminative Hashing for Cross-modal Retrieval.. In IJCAI. 4767\u20134773.","DOI":"10.24963\/ijcai.2019\/662"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2022.3172716"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00312"},{"key":"e_1_3_2_1_18_1","volume-title":"Teacher-student learning: Efficient hierarchical message aggregation hashing for cross-modal retrieval","author":"Tan Wentao","year":"2022","unstructured":"Wentao Tan , Lei Zhu , Jingjing Li , Huaxiang Zhang , and Junwei Han . 2022. Teacher-student learning: Efficient hierarchical message aggregation hashing for cross-modal retrieval . IEEE Transactions on Multimedia ( 2022 ). Wentao Tan, Lei Zhu, Jingjing Li, Huaxiang Zhang, and Junwei Han. 2022. Teacher-student learning: Efficient hierarchical message aggregation hashing for cross-modal retrieval. IEEE Transactions on Multimedia (2022)."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2023.3251395"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2023.3243608"},{"key":"e_1_3_2_1_21_1","volume-title":"Deep cross-modal proxy hashing","author":"Tu Rong-Cheng","year":"2022","unstructured":"Rong-Cheng Tu , Xian-Ling Mao , Rong-Xin Tu , Binbin Bian , Chengfei Cai , Wei Wei , Heyan Huang , 2022. Deep cross-modal proxy hashing . IEEE Transactions on Knowledge and Data Engineering ( 2022 ). Rong-Cheng Tu, Xian-Ling Mao, Rong-Xin Tu, Binbin Bian, Chengfei Cai, Wei Wei, Heyan Huang, 2022. Deep cross-modal proxy hashing. IEEE Transactions on Knowledge and Data Engineering (2022)."},{"key":"e_1_3_2_1_22_1","unstructured":"Botong Wu Qiang Yang Wei-Shi Zheng Yizhou Wang and Jingdong Wang. 2015. Quantized Correlation Hashing for Fast Cross-Modal Search.. In IJCAI Vol.\u00a01. 2.  Botong Wu Qiang Yang Wei-Shi Zheng Yizhou Wang and Jingdong Wang. 2015. Quantized Correlation Hashing for Fast Cross-Modal Search.. In IJCAI Vol.\u00a01. 2."},{"key":"e_1_3_2_1_23_1","unstructured":"Gengshen Wu Zijia Lin Jungong Han Li Liu Guiguang Ding Baochang Zhang and Jialie Shen. 2018. Unsupervised Deep Hashing via Binary Latent Factor Models for Large-scale Cross-modal Retrieval.. In IJCAI Vol.\u00a01. 5.  Gengshen Wu Zijia Lin Jungong Han Li Liu Guiguang Ding Baochang Zhang and Jialie Shen. 2018. Unsupervised Deep Hashing via Binary Latent Factor Models for Large-scale Cross-modal Retrieval.. In IJCAI Vol.\u00a01. 5."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3372278.3390673"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240560"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i5.16592"},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of the AAAI conference on artificial intelligence, Vol.\u00a028","author":"Zhang Dongqing","year":"2014","unstructured":"Dongqing Zhang and Wu-Jun Li . 2014 . Large-scale supervised multimodal hashing with semantic correlation maximization . In Proceedings of the AAAI conference on artificial intelligence, Vol.\u00a028 . Dongqing Zhang and Wu-Jun Li. 2014. Large-scale supervised multimodal hashing with semantic correlation maximization. In Proceedings of the AAAI conference on artificial intelligence, Vol.\u00a028."},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the 29th ACM International Conference on Multimedia. 1517\u20131525","author":"Zhang Peng-Fei","year":"2021","unstructured":"Peng-Fei Zhang , Jiasheng Duan , Zi Huang , and Hongzhi Yin . 2021 . Joint-teaching: Learning to refine knowledge for resource-constrained unsupervised cross-modal retrieval . In Proceedings of the 29th ACM International Conference on Multimedia. 1517\u20131525 . Peng-Fei Zhang, Jiasheng Duan, Zi Huang, and Hongzhi Yin. 2021. Joint-teaching: Learning to refine knowledge for resource-constrained unsupervised cross-modal retrieval. In Proceedings of the 29th ACM International Conference on Multimedia. 1517\u20131525."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2021.3053766"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-020-00859-y"},{"key":"e_1_3_2_1_31_1","volume-title":"Relation-Guided Dual Hash Network for Unsupervised Cross-Modal Retrieval. In International Conference on Neural Information Processing. Springer, 497\u2013508","author":"Zheng Yuanchao","year":"2022","unstructured":"Yuanchao Zheng , Yan Dong , and Xiaowei Zhang . 2022 . Relation-Guided Dual Hash Network for Unsupervised Cross-Modal Retrieval. In International Conference on Neural Information Processing. Springer, 497\u2013508 . Yuanchao Zheng, Yan Dong, and Xiaowei Zhang. 2022. Relation-Guided Dual Hash Network for Unsupervised Cross-Modal Retrieval. In International Conference on Neural Information Processing. Springer, 497\u2013508."},{"key":"e_1_3_2_1_32_1","volume-title":"Work together: correlation-identity reconstruction hashing for unsupervised cross-modal retrieval","author":"Zhu Lei","year":"2022","unstructured":"Lei Zhu , Xize Wu , Jingjing Li , Zheng Zhang , Weili Guan , and Heng\u00a0Tao Shen . 2022. Work together: correlation-identity reconstruction hashing for unsupervised cross-modal retrieval . IEEE Transactions on Knowledge and Data Engineering ( 2022 ). Lei Zhu, Xize Wu, Jingjing Li, Zheng Zhang, Weili Guan, and Heng\u00a0Tao Shen. 2022. Work together: correlation-identity reconstruction hashing for unsupervised cross-modal retrieval. IEEE Transactions on Knowledge and Data Engineering (2022)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2020.116131"}],"event":{"name":"MMAsia '23: ACM Multimedia Asia","location":"Tainan Taiwan","acronym":"MMAsia '23","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["ACM Multimedia Asia 2023"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626424","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3595916.3626424","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:35:56Z","timestamp":1750178156000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3595916.3626424"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,6]]},"references-count":33,"alternative-id":["10.1145\/3595916.3626424","10.1145\/3595916"],"URL":"https:\/\/doi.org\/10.1145\/3595916.3626424","relation":{},"subject":[],"published":{"date-parts":[[2023,12,6]]},"assertion":[{"value":"2024-01-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}