{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T01:10:26Z","timestamp":1755825026348,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":57,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,6,30]]},"DOI":"10.1145\/3731715.3733433","type":"proceedings-article","created":{"date-parts":[[2025,6,25]],"date-time":"2025-06-25T18:31:39Z","timestamp":1750876299000},"page":"1912-1921","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Single-Source Dual-Stream Representation Learning for DNA Sequence Classification"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-4142-2907","authenticated-orcid":false,"given":"Jiarui","family":"Zhou","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3880-8913","authenticated-orcid":false,"given":"Zongmeng","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4641-1799","authenticated-orcid":false,"given":"Min","family":"Wang","sequence":"additional","affiliation":[{"name":"Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1690-9836","authenticated-orcid":false,"given":"Wengang","family":"Zhou","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, Anhui, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2188-3028","authenticated-orcid":false,"given":"Houqiang","family":"Li","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, Hefei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,6,30]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Abd El-Samie","author":"Alhalem Samia M.","year":"2020","unstructured":"Samia M. Abd --Alhalem, Naglaa F. Soliman, Salah Eldin, S. E. Abd Elrahman, Nabil A. Ismail, El-Sayed M. El-Rabaie, and Fathi E. Abd El-Samie. 2020. Bacterial classification with convolutional neural networks based on different data reduction layers. Nucleosides, Nucleotides & Nucleic Acids, Vol. 39, 4 (2020), 493--503."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Emmanuel Adetiba Joke A Badejo Surendra Thakur Victor O Matthews Marion O Adebiyi and Ezekiel F Adebiyi. 2017. Experimental investigation of frequency chaos game representation for in silico and accurate classification of viral pathogens from genomic sequences. In Bioinformatics and Biomedical Engineering: International Work-Conference. 155--164.","DOI":"10.1007\/978-3-319-56148-6_13"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASYU52992.2021.9599084"},{"key":"e_1_3_2_1_4_1","volume-title":"Lipman","author":"Altschul Stephen F.","year":"1990","unstructured":"Stephen F. Altschul, Warren Gish, Webb Miller, Eugene W. Myers, and David J. Lipman. 1990. Basic local alignment search tool. Journal of Molecular Biology (1990), 403--410."},{"key":"e_1_3_2_1_5_1","volume-title":"Multi-aspect candidates for repositioning: data fusion methods using heterogeneous information sources. 
Current medicinal chemistry","author":"Arany A","year":"2013","unstructured":"A Arany, Bence Bolg\u00e1r, Bal\u00e1zs Balogh, Peter Antal, and P\u00e9ter M\u00e1tyus. 2013. Multi-aspect candidates for repositioning: data fusion methods using heterogeneous information sources. Current medicinal chemistry , Vol. 20, 1 (2013), 95--107."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Pablo Mill\u00e1n Arias Fatemeh Alipour Kathleen A. Hill and Lila Kari. 2021. DeLUCS: Deep learning for unsupervised clustering of DNA sequences. (2021).","DOI":"10.1371\/journal.pone.0261531"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2644615"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2798607"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2798607"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov and Sergey Zagoruyko. 2020. End-to-End Object Detection with Transformers. In ECCV. 213--229.","DOI":"10.1007\/978-3-030-58452-8_13"},{"key":"e_1_3_2_1_11_1","volume-title":"Hgmf: Heterogeneous graph-based fusion for multimodal data with incompleteness. In SIGKDD. 1295--1305.","author":"Chen Jiayi","year":"2020","unstructured":"Jiayi Chen and Aidong Zhang. 2020. Hgmf: Heterogeneous graph-based fusion for multimodal data with incompleteness. In SIGKDD. 1295--1305."},{"key":"e_1_3_2_1_12_1","first-page":"1","article-title":"TransVG: End-to-End Visual Grounding with Language Conditioned Vision Transformer","volume":"01","author":"Deng Jiajun","year":"2023","unstructured":"Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, and Wanli Ouyang. 2023. TransVG: End-to-End Visual Grounding with Language Conditioned Vision Transformer. 
TPAMI 01 (2023), 1--17.","journal-title":"TPAMI"},{"key":"e_1_3_2_1_13_1","unstructured":"Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly Jakob Uszkoreit and Neil Houlsby. 2021. An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2016.2605068"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.2174\/1574893615666200224095531"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioadv\/vbad092"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco_a_01273"},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the Brazilian Symposium on Multimedia and the Web. 128--136","author":"Silva G\u00f4lo Marcos Paulo","year":"2023","unstructured":"Marcos Paulo Silva G\u00f4lo, Marcelo Isaias De Moraes, Rudinei Goularte, and Ricardo Marcondes Marcacini. 2023. On the use of early fusion operators on heterogeneous graph neural networks for one-class learning. In Proceedings of the Brazilian Symposium on Multimedia and the Web. 128--136."},{"key":"e_1_3_2_1_19_1","volume-title":"S. Deepa Kanmani, Chandran Venkatesan, and C. Suresh Gnana Dhas.","author":"Gunasekaran Hemalatha","year":"2021","unstructured":"Hemalatha Gunasekaran, K. Ramalakshmi, A. Rex Macedo Arokiaraj, S. Deepa Kanmani, Chandran Venkatesan, and C. Suresh Gnana Dhas. 2021. Analysis of DNA sequence classification using CNN and hybrid models. Computational and Mathematical Methods in Medicine (2021), 1--12."},{"key":"e_1_3_2_1_20_1","volume-title":"Md Motaleb Hossen Manik, and Bangladesh Khulna","author":"Habib Md Ahsan","year":"2022","unstructured":"Md Ahsan Habib, Md Motaleb Hossen Manik, and Bangladesh Khulna. 2022. 
Classification of DNA sequence using machine learning techniques."},{"key":"e_1_3_2_1_21_1","volume-title":"Improved python package for DNA sequence encoding using frequency chaos game representation. bioRxiv - Bioinformatics","author":"Halder Abhishek","year":"2024","unstructured":"Abhishek Halder, Piyush, Bernadette Mathew, and Debarka Sengupta. 2024. Improved python package for DNA sequence encoding using frequency chaos game representation. bioRxiv - Bioinformatics , Vol. 29 1 (2024)."},{"key":"e_1_3_2_1_22_1","unstructured":"Kaiming He Xinlei Chen Saining Xie Yanghao Li Piotr Doll\u00e1r and Ross Girshick. 2022. Masked autoencoders are scalable vision learners. In CVPR. 16000--16009."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btab083"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.4018\/IJRQEH.299963"},{"key":"e_1_3_2_1_25_1","volume-title":"BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL. 4171--4186.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL. 4171--4186."},{"key":"e_1_3_2_1_26_1","volume-title":"P Niharika, and P Sai Rohan.","author":"Kiranmayee BV","year":"2023","unstructured":"BV Kiranmayee, Chalumuru Suresh, K Sneha, LK Srinivas Karthik, P Niharika, and P Sai Rohan. 2023. A survey on gene classification based on dna sequence. In ICISS. 573--585."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Zhirui Kuai Yulu Zhou Qi Xie and Li Kuang. 2024. Multi-source augmentation and composite prompts for visual recognition with missing modality. In ICMR. 
543--551.","DOI":"10.1145\/3652583.3658105"},{"key":"e_1_3_2_1_28_1","volume-title":"Briefings in Bioinformatics","volume":"22","author":"Khanh Le Nguyen Quoc","year":"2021","unstructured":"Nguyen Quoc Khanh Le, Quang-Thai Ho, Trinh-Trung-Duong Nguyen, and Yu-Yen Ou. 2021. A transformer architecture based on BERT and 2D convolutional neural network to identify DNA enhancers from sequence information. Briefings in Bioinformatics , Vol. 22, 5 (2021), bbab005."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bty191"},{"key":"e_1_3_2_1_30_1","unstructured":"Junnan Li Dongxu Li Silvio Savarese and Steven Hoi. 2023. Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. In ICML. 19730--19742."},{"key":"e_1_3_2_1_31_1","volume-title":"International conference on machine learning. PMLR, 12888--12900","author":"Li Junnan","year":"2022","unstructured":"Junnan Li, Dongxu Li, Caiming Xiong, and Steven Hoi. 2022b. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. In International conference on machine learning. PMLR, 12888--12900."},{"key":"e_1_3_2_1_32_1","first-page":"9694","article-title":"Align before fuse: Vision and language representation learning with momentum distillation","volume":"34","author":"Li Junnan","year":"2021","unstructured":"Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare, Shafiq Joty, Caiming Xiong, and Steven Chu Hong Hoi. 2021. Align before fuse: Vision and language representation learning with momentum distillation. NeurIPS, Vol. 34 (2021), 9694--9705.","journal-title":"NeurIPS"},{"key":"e_1_3_2_1_33_1","volume-title":"Multimodal alignment and fusion: A Survey. arXiv preprint arXiv:2411.17040","author":"Li Songtao","year":"2024","unstructured":"Songtao Li and Hao Tang. 2024. Multimodal alignment and fusion: A Survey. 
arXiv preprint arXiv:2411.17040 (2024)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Yikang Li Jenhao Hsiao and Chiuman Ho. 2022a. VideoCLIP: A cross-attention model for fast video-text retrieval task with image clip. In ICMR. 29--33.","DOI":"10.1145\/3512527.3531429"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1093\/nargab\/lqaa009"},{"key":"e_1_3_2_1_36_1","first-page":"34892","article-title":"Visual Instruction Tuning","volume":"36","author":"Liu Haotian","year":"2023","unstructured":"Haotian Liu, Chunyuan Li, Qingyang Wu, and Yong Jae Lee. 2023. Visual Instruction Tuning. In NeurIPS, Vol. 36. 34892--34916.","journal-title":"NeurIPS"},{"key":"e_1_3_2_1_37_1","volume-title":"RoBERTa: A Robustly Optimized BERT Pretraining Approach. arxiv","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arxiv: 1907.11692 [cs.CL]"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Iis Setiawan Mangkunegara and Purwono Purwono. 2022. Analysis of DNA sequence classification using SVM model with hyperparameter tuning grid search CV. In CyberneticsCom. 427--432.","DOI":"10.1109\/CyberneticsCom55287.2022.9865624"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btab184"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.2122636119"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmgm.2021.107942"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Xiaoyu Qiu Hao Feng Yuechen Wang Wengang Zhou and Houqiang Li. 2024. Progressive multi-modal conditional prompt tuning. In ICMR. 
46--54.","DOI":"10.1145\/3652583.3658049"},{"key":"e_1_3_2_1_43_1","volume-title":"Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al.","author":"Radford Alec","year":"2021","unstructured":"Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In ICML. 8748--8763."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2017.2738401"},{"key":"e_1_3_2_1_45_1","volume-title":"Massimo La Rosa, and Alfonso Urso","author":"Rizzo Riccardo","year":"2016","unstructured":"Riccardo Rizzo, Antonino Fiannaca, Massimo La Rosa, and Alfonso Urso. 2016. Classification experiments of DNA sequences by using a deep neural network and chaos game representation. In CompSysTech. 222--228."},{"key":"e_1_3_2_1_46_1","first-page":"405","article-title":"GenCoder: A novel convolutional Neural network based autoencoder for genomic sequence data compression","volume":"21","author":"Sheena KS","year":"2024","unstructured":"KS Sheena and Madhu S Nair. 2024. GenCoder: A novel convolutional Neural network based autoencoder for genomic sequence data compression. TCBB, Vol. 21, 3 (2024), 405--415.","journal-title":"TCBB"},{"key":"e_1_3_2_1_47_1","volume-title":"Charformer: A glyph fusion based attentive framework for high-precision character image denoising. In ACMMM. 1147--1155.","author":"Shi Daqian","year":"2022","unstructured":"Daqian Shi, Xiaolei Diao, Lida Shi, Hao Tang, Yang Chi, Chuntao Li, and Hao Xu. 2022. Charformer: A glyph fusion based attentive framework for high-precision character image denoising. In ACMMM. 1147--1155."},{"key":"e_1_3_2_1_48_1","volume-title":"Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. 
Nature microbiology","author":"Sieber Christian MK","year":"2018","unstructured":"Christian MK Sieber, Alexander J Probst, Allison Sharrar, Brian C Thomas, Matthias Hess, Susannah G Tringe, and Jillian F Banfield. 2018. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nature microbiology, Vol. 3, 7 (2018), 836--843."},{"key":"e_1_3_2_1_49_1","volume-title":"MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology","author":"Steinegger Martin","year":"2017","unstructured":"Martin Steinegger and Johannes S\u00f6ding. 2017. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology (2017), 1026--1028."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jon Shlens and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In CVPR. 2818--2826.","DOI":"10.1109\/CVPR.2016.308"},{"key":"e_1_3_2_1_51_1","volume-title":"Df-gan: A simple and effective baseline for text-to-image synthesis. In CVPR. 16515--16525.","author":"Tao Ming","year":"2022","unstructured":"Ming Tao, Hao Tang, Fei Wu, Xiao-Yuan Jing, Bing-Kun Bao, and Changsheng Xu. 2022. Df-gan: A simple and effective baseline for text-to-image synthesis. In CVPR. 16515--16525."},{"key":"e_1_3_2_1_52_1","first-page":"6000","article-title":"Attention is all you need","volume":"30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NeurIPS, Vol. 30. 
6000--6010.","journal-title":"NeurIPS"},{"key":"e_1_3_2_1_53_1","volume-title":"Saksham Singhal, Subhojit Som, et al.","author":"Wang Wenhui","year":"2023","unstructured":"Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, et al. 2023. Image as a foreign language: Beit pretraining for vision and vision-language tasks. In CVPR. 19175--19186."},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2012.2189550"},{"key":"e_1_3_2_1_55_1","volume-title":"Coca: Contrastive captioners are image-text foundation models. arXiv preprint arXiv:2205.01917","author":"Yu Jiahui","year":"2022","unstructured":"Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, and Yonghui Wu. 2022. Coca: Contrastive captioners are image-text foundation models. arXiv preprint arXiv:2205.01917 (2022)."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-023-05469-9"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-025-06136-x"}],"event":{"name":"ICMR '25: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Chicago IL USA","acronym":"ICMR '25"},"container-title":["Proceedings of the 2025 International Conference on Multimedia 
Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3731715.3733433","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T04:14:03Z","timestamp":1755749643000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3731715.3733433"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,30]]},"references-count":57,"alternative-id":["10.1145\/3731715.3733433","10.1145\/3731715"],"URL":"https:\/\/doi.org\/10.1145\/3731715.3733433","relation":{},"subject":[],"published":{"date-parts":[[2025,6,30]]},"assertion":[{"value":"2025-06-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}