{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:18:06Z","timestamp":1750220286559,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,3,4]],"date-time":"2022-03-04T00:00:00Z","timestamp":1646352000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,3,4]]},"DOI":"10.1145\/3529466.3529486","type":"proceedings-article","created":{"date-parts":[[2022,6,4]],"date-time":"2022-06-04T16:12:24Z","timestamp":1654359144000},"page":"57-63","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Heterogeneous Collaborative Refining for Real-Time End-to-End Image-Text Retrieval System"],"prefix":"10.1145","author":[{"given":"Nan","family":"Guo","sequence":"first","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Min","family":"Yang","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Xiaoping","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Xiao","family":"Xiao","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Chenhao","family":"Wang","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Xiaochun","family":"Ye","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]},{"given":"Dongrui","family":"Fan","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, China"}]}],"member":"320","published-online":{"date-parts":[[2022,6,4]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT","volume":"1","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . BERT : pre-training of deep bidirectional transformers for language understanding . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT , Minneapolis, MN, USA, June 2-7 , Volume 1 , 2019 . Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, Minneapolis, MN, USA, June 2-7, Volume 1, 2019."},{"key":"e_1_3_2_1_4_1","volume-title":"3rd International Conference on Learning Representations, ICLR","author":"Simonyan Karen","year":"2015","unstructured":"Karen Simonyan and Andrew Zisserman . Very deep convolutional networks for large-scale image recognition . In 3rd International Conference on Learning Representations, ICLR , San Diego, CA, USA , May 7-9, 2015 . Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. In 3rd International Conference on Learning Representations, ICLR, San Diego, CA, USA, May 7-9, 2015."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_6_1","volume-title":"British Machine Vision Conference, BMVC","author":"Faghri Fartash","year":"2018","unstructured":"Fartash Faghri , David J. Fleet , Jamie Ryan Kiros , and Sanja Fidler . VSE++ : improving visual-semantic embeddings with hard negatives . In British Machine Vision Conference, BMVC , Newcastle, UK , September 3-6, 2018 . Fartash Faghri, David J. Fleet, Jamie Ryan Kiros, and Sanja Fidler. VSE++: improving visual-semantic embeddings with hard negatives. In British Machine Vision Conference, BMVC, Newcastle, UK, September 3-6, 2018."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2019.2916167"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3343031.3350875"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/tpami.2016.2577031"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45482-9"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00475"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093614"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2018.11.089"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2969808"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2008.2005605"},{"key":"e_1_3_2_1_16_1","first-page":"1","article-title":"Learning dual semantic relations with graph attention for image-text matching","author":"Wen Keyu","year":"2020","unstructured":"Keyu Wen , Xiaodong Gu , and Qingrong Cheng . Learning dual semantic relations with graph attention for image-text matching . IEEE Transactions on Circuits and Systems for Video Technology, pages 1 \u2013 1 , 2020 . Keyu Wen, Xiaodong Gu, and Qingrong Cheng. Learning dual semantic relations with graph attention for image-text matching. IEEE Transactions on Circuits and Systems for Video Technology, pages 1\u20131, 2020.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology, pages"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58577-8_8"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.77"},{"key":"e_1_3_2_1_19_1","volume-title":"Visualsparta: Sparse transformer fragment-level matching for large-scale text-to-image search. CoRR, abs\/2101.00265","author":"Lu Xiaopeng","year":"2021","unstructured":"Xiaopeng Lu , Tiancheng Zhao , and Kyusong Lee . Visualsparta: Sparse transformer fragment-level matching for large-scale text-to-image search. CoRR, abs\/2101.00265 , 2021 , unpublished. Xiaopeng Lu, Tiancheng Zhao, and Kyusong Lee. Visualsparta: Sparse transformer fragment-level matching for large-scale text-to-image search. CoRR, abs\/2101.00265, 2021, unpublished."},{"key":"e_1_3_2_1_20_1","volume-title":"25th International Conference on Pattern Recognition, ICPR, Virtual Event \/ Milan","author":"Messina Nicola","year":"2020","unstructured":"Nicola Messina , Fabrizio Falchi , Andrea Esuli , and Giuseppe Amato . Transformer reasoning network for image- text matching and retrieval . In 25th International Conference on Pattern Recognition, ICPR, Virtual Event \/ Milan , Italy , January 10-15, 2020 . Nicola Messina, Fabrizio Falchi, Andrea Esuli, and Giuseppe Amato. Transformer reasoning network for image- text matching and retrieval. In 25th International Conference on Pattern Recognition, ICPR, Virtual Event \/ Milan, Italy, January 10-15, 2020."},{"key":"e_1_3_2_1_21_1","volume-title":"Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N. Gomez , Lukasz Kaiser , and Illia Polosukhin . Attention is all you need . In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017 , December 4-9, Long Beach, CA, USA , 2017 . Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, Long Beach, CA, USA, 2017."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3371154"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/SBAC-PAD49847.2020.00036"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00293"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00447"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00286"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2017.64"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00636"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00166"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00586"}],"event":{"name":"ICIAI 2022: 2022 the 6th International Conference on Innovation in Artificial Intelligence","acronym":"ICIAI 2022","location":"Guangzhou China"},"container-title":["2022 the 6th International Conference on Innovation in Artificial Intelligence (ICIAI)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3529466.3529486","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3529466.3529486","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:31:25Z","timestamp":1750188685000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3529466.3529486"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,4]]},"references-count":31,"alternative-id":["10.1145\/3529466.3529486","10.1145\/3529466"],"URL":"https:\/\/doi.org\/10.1145\/3529466.3529486","relation":{},"subject":[],"published":{"date-parts":[[2022,3,4]]},"assertion":[{"value":"2022-06-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}