{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T02:12:38Z","timestamp":1771467158755,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2021,11,30]],"date-time":"2021-11-30T00:00:00Z","timestamp":1638230400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2017YFE0130600"],"award-info":[{"award-number":["2017YFE0130600"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61772441"],"award-info":[{"award-number":["61772441"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61872309"],"award-info":[{"award-number":["61872309"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62072384"],"award-info":[{"award-number":["62072384"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62072385"],"award-info":[{"award-number":["62072385"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,1,17]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The interaction between microribonucleic acid and long non-coding ribonucleic acid plays a very important role in biological processes, and the prediction of the one is of great significance to the study of its mechanism of action. Due to the limitations of traditional biological experiment methods, more and more computational methods are applied to this field. However, the existing methods often have problems, such as inadequate acquisition of potential features of the sequence due to simple coding and the need to manually extract features as input. We propose a deep learning model, preMLI, based on rna2vec pre-training and deep feature mining mechanism. We use rna2vec to train the ribonucleic acid (RNA) dataset and to obtain the RNA word vector representation and then mine the RNA sequence features separately and finally concatenate the two feature vectors as the input of the prediction task. The preMLI performs better than existing methods on benchmark datasets and has cross-species prediction capabilities. Experiments show that both pre-training and deep feature mining mechanisms have a positive impact on the prediction performance of the model. To be more specific, pre-training can provide more accurate word vector representations. The deep feature mining mechanism also improves the prediction performance of the model. Meanwhile, The preMLI only needs RNA sequence as the input of the model and has better cross-species prediction performance than the most advanced prediction models, which have reference value for related research.<\/jats:p>","DOI":"10.1093\/bib\/bbab470","type":"journal-article","created":{"date-parts":[[2021,10,15]],"date-time":"2021-10-15T22:27:51Z","timestamp":1634336871000},"source":"Crossref","is-referenced-by-count":31,"title":["preMLI: a pre-trained method to uncover microRNA\u2013lncRNA potential interactions"],"prefix":"10.1093","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9846-1937","authenticated-orcid":false,"given":"Xinyu","family":"Yu","sequence":"first","affiliation":[{"name":"Department of Computer Science and Technology, Xiamen University, Xiamen 361005, China"}]},{"given":"Likun","family":"Jiang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, Xiamen University, Xiamen 361005, China"}]},{"given":"Shuting","family":"Jin","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, Xiamen University, Xiamen 361005, China"},{"name":"National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China"}]},{"given":"Xiangxiang","family":"Zeng","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Hunan University, Changsha 410082, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9885-1978","authenticated-orcid":false,"given":"Xiangrong","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, Xiamen University, Xiamen 361005, China"},{"name":"National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China"}]}],"member":"286","published-online":{"date-parts":[[2021,11,30]]},"reference":[{"issue":"1","key":"2022012000295979300_ref1","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/j.molcel.2017.09.015","article-title":"A peptide encoded by a putative lncRNA HOXB-AS3 suppresses colon cancer growth[J]","volume":"68","author":"Huang","year":"2017","journal-title":"Mol Cell"},{"issue":"1","key":"2022012000295979300_ref2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/ncomms6383","article-title":"The oestrogen receptor alpha-regulated lncRNA NEAT1 is a critical modulator of prostate cancer[J]","volume":"5","author":"Chakravarty","year":"2014","journal-title":"Nat Commun"},{"issue":"21","key":"2022012000295979300_ref3","doi-asserted-by":"crossref","first-page":"6299","DOI":"10.1158\/0008-5472.CAN-16-0356","article-title":"LncRNA HOXA11-AS promotes proliferation and invasion of gastric cancer by scaffolding the chromatin modification factors PRC2, LSD1, and DNMT1[J]","volume":"76","author":"Sun","year":"2016","journal-title":"Cancer Res"},{"issue":"10","key":"2022012000295979300_ref4","first-page":"6776","article-title":"Decreased expression of lncRNA GAS5 predicts a poor prognosis in cervical cancer[J]","volume":"7","author":"Cao","year":"2014","journal-title":"Int J Clin Exp Pathol"},{"issue":"21","key":"2022012000295979300_ref5","doi-asserted-by":"crossref","first-page":"2746","DOI":"10.1038\/onc.2015.340","article-title":"LncRNA HOTAIR enhances ER signaling and confers tamoxifen resistance in breast cancer[J]","volume":"35","author":"Xue","year":"2016","journal-title":"Oncogene"},{"issue":"1","key":"2022012000295979300_ref6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41438-018-0096-0","article-title":"Tomato lncRNA23468 functions as a competing endogenous RNA to modulate NBS-LRR genes by decoying miR482b in the tomato-Phytophthora infestans interaction[J]","volume":"6","author":"Jiang","year":"2019","journal-title":"Horticulture Res"},{"issue":"12","key":"2022012000295979300_ref7","doi-asserted-by":"crossref","first-page":"3012","DOI":"10.1105\/tpc.17.00363","article-title":"Arabidopsis pollen fertility requires the transcription factors CITF1 and SPL7 that regulate copper delivery to anthers and jasmonic acid synthesis[J]","volume":"29","author":"Yan","year":"2017","journal-title":"Plant Cell"},{"issue":"24","key":"2022012000295979300_ref8","doi-asserted-by":"crossref","first-page":"4172","DOI":"10.1093\/bioinformatics\/bty519","article-title":"BMC3C: binning metagenomic contigs using codon usage, sequence composition and read coverage[J]","volume":"34","author":"Yu","year":"2018","journal-title":"Bioinformatics"},{"key":"2022012000295979300_ref9","doi-asserted-by":"crossref","first-page":"3235","DOI":"10.1007\/s00122-020-03690-1","article-title":"Interactions and links among the noncoding RNAs in plants under stresses[J]","volume":"133","author":"Zhou","year":"2020","journal-title":"Theor Appl Genet"},{"issue":"7","key":"2022012000295979300_ref10","doi-asserted-by":"crossref","first-page":"1665","DOI":"10.1016\/j.cell.2014.11.021","article-title":"A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping[J]","volume":"159","author":"Rao","year":"2014","journal-title":"Cell"},{"issue":"12","key":"2022012000295979300_ref11","doi-asserted-by":"crossref","first-page":"1905","DOI":"10.1101\/gr.176586.114","article-title":"Genome-wide map of regulatory interactions in the human genome[J]","volume":"24","author":"Heidari","year":"2014","journal-title":"Genome Res"},{"issue":"D1","key":"2022012000295979300_ref12","doi-asserted-by":"crossref","first-page":"D125","DOI":"10.1093\/nar\/gkaa1017","article-title":"LnCeCell: a comprehensive database of predicted lncRNA-associated ceRNA networks at single-cell resolution[J]","volume":"49","author":"Wang","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022012000295979300_ref13","first-page":"D111","article-title":"LnCeVar: a comprehensive database of genomic variations that disturb ceRNA network regulation[J]","volume":"48","author":"Wang","year":"2020","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022012000295979300_ref14","doi-asserted-by":"crossref","first-page":"D231","DOI":"10.1093\/nar\/gkv1270","article-title":"DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts[J]","volume":"44","author":"Paraskevopoulou","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"15","key":"2022012000295979300_ref15","doi-asserted-by":"crossref","first-page":"2062","DOI":"10.1093\/bioinformatics\/bts344","article-title":"miRcode: a map of putative microRNA target sites in the long non-coding transcriptome[J]","volume":"28","author":"Jeggari","year":"2012","journal-title":"Bioinformatics"},{"issue":"D1","key":"2022012000295979300_ref16","doi-asserted-by":"crossref","first-page":"D121","DOI":"10.1093\/nar\/gky1144","article-title":"LncACTdb 2.0: an updated database of experimentally supported ceRNA interactions curated from low-and high-throughput experiments[J]","volume":"47","author":"Wang","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"2022012000295979300_ref17","doi-asserted-by":"crossref","first-page":"812","DOI":"10.1093\/bioinformatics\/btx672","article-title":"Constructing prediction models from expression profiles for large scale lncRNA\u2013miRNA interaction profiling[J]","volume":"34","author":"Huang","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"2022012000295979300_ref18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-018-5227-3","article-title":"Prediction of plant-derived xenomiRs from plant miRNA sequences using random forest and one-dimensional convolutional neural network models[J]","volume":"19","author":"Zhao","year":"2018","journal-title":"BMC Genomics"},{"issue":"10","key":"2022012000295979300_ref19","doi-asserted-by":"crossref","first-page":"2986","DOI":"10.1093\/bioinformatics\/btaa074","article-title":"PmliPred: a method based on hybrid model and fuzzy decision for plant miRNA\u2013lncRNA interaction prediction[J]","volume":"36","author":"Kang","year":"2020","journal-title":"Bioinformatics"},{"issue":"19","key":"2022012000295979300_ref20","doi-asserted-by":"crossref","first-page":"4372","DOI":"10.3390\/molecules25194372","article-title":"LncMirNet: predicting LncRNA\u2013miRNA interaction based on deep learning of ribonucleic acid sequences[J]","volume":"25","author":"Yang","year":"2020","journal-title":"Molecules"},{"key":"2022012000295979300_ref21","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1007\/s12539-021-00434-7","article-title":"Ensemble deep learning based on multi-level information enhancement and greedy fuzzy decision for plant miRNA\u2013lncRNA interaction prediction[J]","volume":"13","author":"Kang","year":"2021","journal-title":"Interdiscip Sci"},{"key":"2022012000295979300_ref22","article-title":"dna2vec: consistent vector representations of variable-length k-mers","author":"Ng","year":"2017"},{"key":"2022012000295979300_ref23","first-page":"146","article-title":"Distributional structure[J]","author":"Harris","year":"1954"},{"key":"2022012000295979300_ref24","article-title":"Efficient estimation of word representations in vector space[J]","author":"Mikolov","year":"2013"},{"key":"2022012000295979300_ref25","article-title":"Bert: pre-training of deep bidirectional transformers for language understanding[J]","author":"Devlin","year":"2018"},{"issue":"4","key":"2022012000295979300_ref26","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: a pre-trained biomedical language representation model for biomedical text mining[J]","volume":"36","author":"Lee","year":"2020","journal-title":"Bioinformatics"},{"key":"2022012000295979300_ref27","first-page":"2112","article-title":"DNABERT: pre-trained bidirectional encoder representations from transformers model for DNA-language in genome[J]","author":"Ji","year":"2021"},{"key":"2022012000295979300_ref28","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1007\/978-1-4939-9045-0_26","volume-title":"CANTATAdb 2.0: Expanding the Collection of Plant Long Noncoding RNAs[M]\/\/Plant Long Non-Coding RNAs","author":"Szcze\u015bniak","year":"2019"},{"issue":"suppl_1","key":"2022012000295979300_ref29","doi-asserted-by":"crossref","first-page":"D806","DOI":"10.1093\/nar\/gkp818","article-title":"PMRD: plant microRNA database[J]","volume":"38","author":"Zhang","year":"2010","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2022012000295979300_ref30","doi-asserted-by":"crossref","first-page":"D155","DOI":"10.1093\/nar\/gky1141","article-title":"miRBase: from microRNA sequences to function[J]","volume":"47","author":"Kozomara","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"2022012000295979300_ref31","first-page":"D1161","article-title":"GREENC: a Wiki-based database of plant lncRNAs[J]","volume":"44","author":"Gallart","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2022012000295979300_ref32","doi-asserted-by":"crossref","first-page":"1033","DOI":"10.1038\/ng2079","article-title":"Target mimicry provides a new mechanism for regulation of microRNA activity[J]","volume":"39","author":"Franco-Zorrilla","year":"2007","journal-title":"Nat Genet"},{"key":"2022012000295979300_ref33","doi-asserted-by":"crossref","DOI":"10.3115\/v1\/D14-1181","article-title":"Convolutional neural networks for sentence classification","author":"Kim","year":"2014"},{"issue":"6","key":"2022012000295979300_ref34","doi-asserted-by":"crossref","first-page":"2133","DOI":"10.1093\/bib\/bbz133","article-title":"MotifCNN-fold: protein fold recognition based on fold-specific features extracted by motif-based convolutional neural networks[J]","volume":"21","author":"Li","year":"2020","journal-title":"Brief Bioinform"},{"issue":"1","key":"2022012000295979300_ref35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-12-489","article-title":"Predicting RNA-protein interactions using only sequence information[J]","volume":"12","author":"Muppirala","year":"2011","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2022012000295979300_ref36","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-016-2931-8","article-title":"IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction[J]","volume":"17","author":"Pan","year":"2016","journal-title":"BMC Genomics"},{"issue":"14","key":"2022012000295979300_ref37","doi-asserted-by":"crossref","first-page":"i252","DOI":"10.1093\/bioinformatics\/btx257","article-title":"Exploiting sequence-based features for predicting enhancer\u2013promoter interactions[J]","volume":"33","author":"Yang","year":"2017","journal-title":"Bioinformatics"},{"key":"2022012000295979300_ref38","article-title":"Recurrent neural network regularization[J]","author":"Zaremba","year":"2014"},{"key":"2022012000295979300_ref39","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks[J]","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv Neural Inf Process Syst"},{"key":"2022012000295979300_ref40","doi-asserted-by":"crossref","DOI":"10.3115\/v1\/D14-1179","article-title":"Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]","author":"Cho","year":"2014"},{"key":"2022012000295979300_ref41","first-page":"5998","article-title":"Attention is all you need[C]","author":"Vaswani","year":"2017","journal-title":"Adv Neural Inf Process Syst"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/1\/bbab470\/42231626\/bbab470.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/1\/bbab470\/42231626\/bbab470.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,11]],"date-time":"2023-11-11T07:33:42Z","timestamp":1699688022000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab470\/6446267"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,30]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1,17]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab470","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,1]]},"published":{"date-parts":[[2021,11,30]]},"article-number":"bbab470"}}