{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,26]],"date-time":"2025-06-26T07:46:06Z","timestamp":1750923966048,"version":"3.37.3"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,6,29]],"date-time":"2022-06-29T00:00:00Z","timestamp":1656460800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,6,29]],"date-time":"2022-06-29T00:00:00Z","timestamp":1656460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004377","name":"Hong Kong Polytechnic University","doi-asserted-by":"publisher","award":["G-YW4H","#RTVU","G-YW4H"],"award-info":[{"award-number":["G-YW4H","#RTVU","G-YW4H"]}],"id":[{"id":"10.13039\/501100004377","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014717","name":"National Outstanding Youth Science Fund Project of National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62006203"],"award-info":[{"award-number":["62006203"]}],"id":[{"id":"10.13039\/100014717","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61976147","61976147"],"award-info":[{"award-number":["61976147","61976147"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>The COVID-19 pandemic has increasingly accelerated the publication pace of scientific literature. How to efficiently curate and index this large amount of biomedical literature under the current crisis is of great importance. Previous literature indexing is mainly performed by human experts using Medical Subject Headings (MeSH), which is labor-intensive and time-consuming. Therefore, to alleviate the expensive time consumption and monetary cost, there is an urgent need for automatic semantic indexing technologies for the emerging COVID-19 domain.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this research, to investigate the semantic indexing problem for COVID-19, we first construct the new COVID-19 Semantic Indexing dataset, which consists of more than 80 thousand biomedical articles. We then propose a novel semantic indexing framework based on the multi-probe attention neural network (MPANN) to address the COVID-19 semantic indexing problem. Specifically, we employ a k-nearest neighbour based MeSH masking approach to generate candidate topic terms for each input article. We encode and feed the selected candidate terms as well as other contextual information as probes into the downstream attention-based neural network. Each semantic probe carries specific aspects of biomedical knowledge and provides informatively discriminative features for the input article. After extracting the semantic features at both term-level and document-level through the attention-based neural network, MPANN adopts a linear multi-view classifier to conduct the final topic prediction for COVID-19 semantic indexing.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>The experimental results suggest that MPANN promises to represent the semantic features of biomedical texts and is effective in predicting semantic topics for COVID-19 related biomedical articles.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-022-04803-x","type":"journal-article","created":{"date-parts":[[2022,6,29]],"date-time":"2022-06-29T09:07:58Z","timestamp":1656493678000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Multi-probe attention neural network for COVID-19 semantic indexing"],"prefix":"10.1186","volume":"23","author":[{"given":"Jinghang","family":"Gu","sequence":"first","affiliation":[]},{"given":"Rong","family":"Xiang","sequence":"additional","affiliation":[]},{"given":"Xing","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Jing","family":"Li","sequence":"additional","affiliation":[]},{"given":"Wenjie","family":"Li","sequence":"additional","affiliation":[]},{"given":"Longhua","family":"Qian","sequence":"additional","affiliation":[]},{"given":"Guodong","family":"Zhou","sequence":"additional","affiliation":[]},{"given":"Chu-Ren","family":"Huang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,6,29]]},"reference":[{"key":"4803_CR1","unstructured":"Wang LL, Lo K, Chandrasekhar Y, et al. CORD-19: The Covid-19 Open Research Dataset. ArXiv preprint. 2020; http:\/\/arxiv.org\/abs\/2004.10706v2."},{"key":"4803_CR2","doi-asserted-by":"crossref","unstructured":"Esteva A, Anuprit K, Romain P, et al. Co-search: Covid-19 information retrieval with semantic search, question answering, and abstractive summarization. ArXiv preprint. 2020; http:\/\/arxiv.org\/abs\/2006.09595.","DOI":"10.1038\/s41746-021-00437-0"},{"issue":"D1","key":"4803_CR3","doi-asserted-by":"publisher","first-page":"D1534","DOI":"10.1093\/nar\/gkaa952","volume":"49","author":"Q Chen","year":"2021","unstructured":"Chen Q, Allot A, Lu Z. LitCovid: an open database of COVID-19 literature. Nucleic Acids Res. 2021;49(D1):D1534\u201340.","journal-title":"Nucleic Acids Res."},{"key":"4803_CR4","doi-asserted-by":"publisher","DOI":"10.1016\/j.clim.2020.108427","author":"K Yuki","year":"2020","unstructured":"Yuki K, Fujiogi M, Koutsogiannaki S. COVID-19 pathophysiology: A review. Clin Immunol. 2020. https:\/\/doi.org\/10.1016\/j.clim.2020.108427.","journal-title":"Clin Immunol"},{"issue":"5","key":"4803_CR5","doi-asserted-by":"publisher","first-page":"438","DOI":"10.1038\/s41562-020-0866-1","volume":"4","author":"C Betsch","year":"2020","unstructured":"Betsch C. How behavioural science data helps mitigate the COVID-19 crisis. Nat Hum Behav. 2020;4(5):438.","journal-title":"Nat Hum Behav."},{"key":"4803_CR6","doi-asserted-by":"publisher","DOI":"10.4081\/monaldi.2020.1298","author":"I Madabhavi","year":"2020","unstructured":"Madabhavi I, Sarkar M, Kadakol N. COVID-19: a review. Monaldi Arch Chest Dis. 2020. https:\/\/doi.org\/10.4081\/monaldi.2020.1298.","journal-title":"Monaldi Arch Chest Dis"},{"key":"4803_CR7","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1186\/s12575-020-00128-2","volume":"22","author":"H Esakandari","year":"2020","unstructured":"Esakandari H, Mohsen NA, Javad FA, et al. A comprehensive review of COVID-19 characteristics. Biol Proced Online. 2020;22:19.","journal-title":"Biol Proced Online."},{"issue":"3","key":"4803_CR8","first-page":"265","volume":"88","author":"CE Lipscomb","year":"2000","unstructured":"Lipscomb CE. Medical subject headings (MeSH). Bull Med Libr Assoc. 2000;88(3):265.","journal-title":"Bull Med Libr Assoc"},{"key":"4803_CR9","unstructured":"Anastasios N, Georgios K, Eirini V, et al. Overview of BioASQ 2021: The ninth BioASQ challenge on large-scale biomedical semantic indexing and question answering. In International Conference of the Cross-Language Evaluation Forum for European Languages. 2021;239\u201363."},{"issue":"1","key":"4803_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13326-017-0113-5","volume":"8","author":"J Mork","year":"2017","unstructured":"Mork J, Aronson A, Demner-Fushman D. 12 years on-Is the NLM medical text indexer still useful and relevant? J Biomed Semant. 2017;8(1):1\u201310.","journal-title":"J Biomed Semant"},{"issue":"5","key":"4803_CR11","doi-asserted-by":"publisher","first-page":"660","DOI":"10.1136\/amiajnl-2010-000055","volume":"18","author":"M Huang","year":"2011","unstructured":"Huang M, Aur\u00e9lie N, Lu Z. Recommending mesh terms for annotating biomedical articles. J Am Med Inform Assoc. 2011;18(5):660\u20137.","journal-title":"J Am Med Inform Assoc."},{"issue":"2","key":"4803_CR12","first-page":"176","volume":"71","author":"ME Funk","year":"1983","unstructured":"Funk ME, Reid CA. Indexing consistency in MEDLINE. Bull Med Libr Assoc. 1983;71(2):176.","journal-title":"Bull Med Libr Assoc"},{"key":"4803_CR13","unstructured":"Mork JG, Jimeno-Yepes A, Aronson AR. The NLM Medical Text Indexer System for Indexing Biomedical Literature. BioASQ@CLEF. 2013;1."},{"issue":"1","key":"4803_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-015-0564-6","volume":"16","author":"G Tsatsaronis","year":"2015","unstructured":"Tsatsaronis G, Balikas G, Malakasiotis P, et al. An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition. BMC Bioinformatics. 2015;16(1):1\u201328.","journal-title":"BMC Bioinformatics"},{"key":"4803_CR15","doi-asserted-by":"crossref","unstructured":"Nentidis A, Bougiatiotis K, Krithara A, et al. Results of the fifth edition of the bioasq challenge. In BioNLP. 2017;48\u201357.","DOI":"10.18653\/v1\/W17-2306"},{"key":"4803_CR16","doi-asserted-by":"crossref","unstructured":"Nentidis A, Bougiatiotis K, Krithara A, et al. Results of the seventh edition of the bioasq challenge. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 2019;553\u2013568.","DOI":"10.1007\/978-3-030-43887-6_51"},{"key":"4803_CR17","doi-asserted-by":"crossref","unstructured":"Nentidis A, Krithara A, Bougiatiotis K, et al. Overview of BioASQ 2020: The Eighth BioASQ Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering. In International Conference of the Cross-Language Evaluation Forum for European Languages. 2020;194-214.","DOI":"10.1007\/978-3-030-58219-7_16"},{"key":"4803_CR18","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1016\/j.jclinepi.2020.04.014","volume":"123","author":"F Shokraneh","year":"2020","unstructured":"Shokraneh F, Tony R. Lessons from covid-19 to future evidence synthesis efforts: first living search strategy and out of date scientific publishing and indexing industry. J Clin Epidemiol. 2020;123:171\u20133.","journal-title":"J Clin Epidemiol"},{"issue":"9","key":"4803_CR19","doi-asserted-by":"publisher","first-page":"1431","DOI":"10.1093\/jamia\/ocaa091","volume":"27","author":"K Roberts","year":"2020","unstructured":"Roberts K, Tasmeer A, Steven B, et al. Trec-covid: rationale and structure of an information retrieval shared task for covid-19. J Am Med Inform Assoc. 2020;27(9):1431\u20136.","journal-title":"J Am Med Inform Assoc"},{"key":"4803_CR20","doi-asserted-by":"publisher","first-page":"102187","DOI":"10.1016\/j.ijinfomgt.2020.102187","volume":"55","author":"H Rao","year":"2020","unstructured":"Rao H, Naga V, Patricia A, et al. Retweets of officials\u2019 alarming vs reassuring messages during the covid-19 pandemic: Implications for crisis management. Int J Inf Manag. 2020;55:102187.","journal-title":"Int J Inf Manag"},{"issue":"2","key":"4803_CR21","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1073\/pnas.98.2.381","volume":"98","author":"R Roberts","year":"2001","unstructured":"Roberts R. PubMed Central: the GenBank of the published literature. Proc Natl Acad Sci. 2001;98(2):381\u20132.","journal-title":"Proc Natl Acad Sci"},{"key":"4803_CR22","unstructured":"Aronson AR, Mork JG, Gay CW, et al. The NLM indexing initiative's medical text indexer. Medinfo. 2004;89."},{"issue":"12","key":"4803_CR23","doi-asserted-by":"publisher","first-page":"i339","DOI":"10.1093\/bioinformatics\/btv237","volume":"31","author":"K Liu","year":"2015","unstructured":"Liu K, Peng S, Wu J, Zhai C, et al. MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence. Bioinformatics. 2015;31(12):i339\u201347.","journal-title":"Bioinformatics"},{"issue":"1","key":"4803_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13326-017-0123-3","volume":"8","author":"Y Mao","year":"2017","unstructured":"Mao Y, Lu Z. MeSH Now: automatic MeSH indexing at PubMed scale via learning to rank. J Biomed Semant. 2017;8(1):1\u20139.","journal-title":"J Biomed Semant"},{"issue":"19","key":"4803_CR25","doi-asserted-by":"publisher","first-page":"3794","DOI":"10.1093\/bioinformatics\/btz142","volume":"35","author":"G Xun","year":"2019","unstructured":"Xun G, Jha K, Yuan Y, et al. MeSHProbeNet: a self-attentive probe net for MeSH indexing. Bioinformatics. 2019;35(19):3794\u2013802.","journal-title":"Bioinformatics."},{"key":"4803_CR26","first-page":"1","volume":"15","author":"G Xun","year":"2020","unstructured":"Xun G, Jha K, Aidong Z. MeSHProbeNet-P: improving Large-scale MeSH indexing with personalizable MeSH probes. ACM Trans Knowl Dis Data. 2020;15:1\u201314.","journal-title":"ACM Trans Knowl Dis Data"},{"issue":"12","key":"4803_CR27","doi-asserted-by":"publisher","first-page":"i70","DOI":"10.1093\/bioinformatics\/btw294","volume":"32","author":"SW Peng","year":"2016","unstructured":"Peng SW, You R, Wang HN, et al. Deepmesh: deep semantic representation for improving large-scale mesh indexing. Bioinformatics. 2016;32(12):i70\u20139.","journal-title":"Bioinformatics."},{"issue":"5","key":"4803_CR28","doi-asserted-by":"publisher","first-page":"1533","DOI":"10.1093\/bioinformatics\/btz756","volume":"36","author":"S Dai","year":"2020","unstructured":"Dai S, You R, Lu Z, et al. FullMeSH: improving large-scale MeSH indexing with full text. Bioinformatics. 2020;36(5):1533\u201341.","journal-title":"Bioinformatics."},{"key":"4803_CR29","doi-asserted-by":"crossref","unstructured":"Jin Q, Dhingra B, Cohen W, et al. Attentionmesh: Simple, effective and interpretable automatic mesh indexer. In Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering. 2018;47\u201356.","DOI":"10.18653\/v1\/W18-5306"},{"key":"4803_CR30","unstructured":"Ebadi N and Najafirad P. A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic. ArXiv preprint. 2020; http:\/\/arxiv.org\/abs\/2010.03544."},{"key":"4803_CR31","unstructured":"Fang L and Wang K. Team Bioformer at BioCreative VII LitCovid Track: Multic-label topic classification for COVID-19 literature with a compact BERT model. In Proceedings of the seventh BioCreative challenge evaluation workshop. 2021;272\u2013274."},{"key":"4803_CR32","doi-asserted-by":"crossref","unstructured":"Gu J, Wang X, Chersoni E, et al. Team PolyU-CBSNLP at BioCreative-VII LitCovid Track: Ensemble Learning for COVID-19 Multilabel Classification.\u00a0In Proceedings of the seventh BioCreative challenge evaluation workshop.\u00a02021;326\u2013331.","DOI":"10.1093\/database\/baac103"},{"issue":"1","key":"4803_CR33","first-page":"1","volume":"16","author":"G Tsatsaronis","year":"2005","unstructured":"Tsatsaronis G, Balikas G, Malakasiotis P, et al. An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition. BMC Bioinformatics. 2005;16(1):1\u201328.","journal-title":"BMC Bioinformatics"},{"key":"4803_CR34","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2204.09781","author":"Q Chen","year":"2022","unstructured":"Chen Q, Allot A, Leaman R, et al. Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations. ArXiv preprint. 2022. https:\/\/doi.org\/10.48550\/arXiv.2204.09781.","journal-title":"ArXiv preprint."},{"issue":"3","key":"4803_CR35","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1561\/1500000016","volume":"3","author":"TY Liu","year":"2009","unstructured":"Liu TY. Learning to rank for information retrieval. Found Trends Inf Retr. 2009;3(3):225\u2013331.","journal-title":"Found Trends Inf Retr."},{"key":"4803_CR36","first-page":"2493","volume":"12","author":"R Collobert","year":"2011","unstructured":"Collobert R, Weston J, Bottou L, et al. Natural language processing (almost) from scratch. J Mach Learn Res. 2011;12:2493\u2013537.","journal-title":"J Mach Learn Res."},{"key":"4803_CR37","unstructured":"Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017;6000\u201310."},{"key":"4803_CR38","unstructured":"Devlin J, Chang MW, Lee K, et al. Bert: Pre-training of deep bidirectional transformers for language understanding. ArXiv preprint. 2018. http:\/\/arxiv.org\/abs\/1810.04805."},{"key":"4803_CR39","unstructured":"Liu Y, Ott M, Goyal N, et al. Roberta: A robustly optimized bert pretraining approach. ArXiv preprint. 2019; http:\/\/arxiv.org\/abs\/1907.11692."},{"issue":"4","key":"4803_CR40","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","volume":"36","author":"J Lee","year":"2020","unstructured":"Lee J, Yoon W, Kim S, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020;36(4):1234\u201340.","journal-title":"Bioinformatics."},{"key":"4803_CR41","unstructured":"Loshchilov I and Hutter F. Decoupled weight decay regularization. ArXiv preprint. 2017; http:\/\/arxiv.org\/abs\/1711.05101"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04803-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04803-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04803-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,10]],"date-time":"2023-02-10T00:02:08Z","timestamp":1675987328000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04803-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,29]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4803"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04803-x","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2022,6,29]]},"assertion":[{"value":"21 June 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 June 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 June 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"259"}}