{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T17:12:16Z","timestamp":1774631536476,"version":"3.50.1"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2021,5,31]],"date-time":"2021-05-31T00:00:00Z","timestamp":1622419200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,31]],"date-time":"2021-05-31T00:00:00Z","timestamp":1622419200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100011710","name":"Shaanxi Provincial Science and Technology Department","doi-asserted-by":"crossref","award":["2019ZDLGY03-10"],"award-info":[{"award-number":["2019ZDLGY03-10"]}],"id":[{"id":"10.13039\/501100011710","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2022,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Sentence matching is widely used in various natural language tasks, such as natural language inference, paraphrase identification and question answering. For these tasks, we need to understand the logical and semantic relationship between two sentences. Most current methods use all information within a sentence to build a model and hence determine its relationship to another sentence. However, the information contained in some sentences may cause redundancy or introduce noise, impeding the performance of the model. Therefore, we propose a sentence matching method based on multi keyword-pair matching (MKPM), which uses keyword pairs in two sentences to represent the semantic relationship between them, avoiding the interference of redundancy and noise. Specifically, we first propose a sentence-pair-based attention mechanism<jats:italic>sp-attention<\/jats:italic>to select the most important word pair from the two sentences as a keyword pair, and then propose a Bi-task architecture to model the semantic information of these keyword pairs. The Bi-task architecture is as follows: 1. In order to understand the semantic relationship at the word level between two sentences, we design a word-pair task (WP-Task), which uses these keyword pairs to complete sentence matching independently. 2. We design a sentence-pair task (SP-Task) to understand the sentence level semantic relationship between the two sentences by sentence denoising. Through the integration of the two tasks, our model can understand sentences more accurately from the two granularities of word and sentence. Experimental results show that our model can achieve state-of-the-art performance in several tasks. Our source code is publicly available<jats:sup>1<\/jats:sup>.<\/jats:p>","DOI":"10.1007\/s10489-021-02306-5","type":"journal-article","created":{"date-parts":[[2021,5,31]],"date-time":"2021-05-31T16:31:33Z","timestamp":1622478693000},"page":"1878-1892","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["MKPM: Multi keyword-pair matching for natural language sentences"],"prefix":"10.1007","volume":"52","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5139-196X","authenticated-orcid":false,"given":"Xin","family":"Lu","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0065-734X","authenticated-orcid":false,"given":"Yao","family":"Deng","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8823-7480","authenticated-orcid":false,"given":"Ting","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Yi","family":"Gao","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0706-2103","authenticated-orcid":false,"given":"Jun","family":"Feng","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0572-641X","authenticated-orcid":false,"given":"Xia","family":"Sun","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5549-5691","authenticated-orcid":false,"given":"Richard","family":"Sutcliffe","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,5,31]]},"reference":[{"key":"2306_CR1","unstructured":"Wang Z., Hamza W., Florian R. Bilateral multi-perspective matching for natural language sentences, arXiv:1702.03814"},{"key":"2306_CR2","unstructured":"Madnani N., Tetreault J., Chodorow M. (2012) Re-examining machine translation metrics for paraphrase identification. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics. Human Language Technologies, Association for Computational Linguistics, pp. 182\u2013190"},{"key":"2306_CR3","unstructured":"Chen Q., Zhu X., Ling Z., Wei S., Jiang H., Inkpen D. Enhanced lstm for natural language inference, arXiv:1609.06038"},{"key":"2306_CR4","unstructured":"Wang Z., Mi H., Ittycheriah A. Sentence similarity learning by lexical decomposition and composition, arXiv:1602.07019"},{"key":"2306_CR5","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1016\/j.ins.2019.12.002","volume":"514","author":"M Esposito","year":"2020","unstructured":"Esposito M., Damiano E., Minutolo A., De Pietro G., Fujita H. (2020) Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering. Inf Sci 514:88\u2013105","journal-title":"Inf Sci"},{"key":"2306_CR6","doi-asserted-by":"crossref","unstructured":"Liu P., Qiu X., Chen J., Huang X.-J. (2016) Deep fusion lstms for text semantic matching. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1034\u20131043","DOI":"10.18653\/v1\/P16-1098"},{"key":"2306_CR7","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1016\/j.knosys.2018.11.003","volume":"164","author":"M Pota","year":"2019","unstructured":"Pota M., Marulli F., Esposito M., De Pietro G., Fujita H. (2019) Multilingual pos tagging by a composite deep architecture based on character-level features and on-the-fly enriched word embeddings. Knowl-Based Syst 164:309\u2013323","journal-title":"Knowl-Based Syst"},{"issue":"14","key":"2306_CR8","doi-asserted-by":"publisher","first-page":"4710","DOI":"10.3390\/app10144710","volume":"10","author":"M Pota","year":"2020","unstructured":"Pota M., Esposito M., Pietro G.D., Fujita H. (2020) Best practices of convolutional neural networks for question classification. Appl Sci 10(14):4710","journal-title":"Appl Sci"},{"key":"2306_CR9","doi-asserted-by":"crossref","unstructured":"Catelli R., Casola V., De Pietro G., Fujita H., Esposito M. (2021) Combining contextualized word representation and sub-document level analysis through bi-lstm+ crf architecture for clinical de-identification. Knowl-Based Syst 213:106649","DOI":"10.1016\/j.knosys.2020.106649"},{"key":"2306_CR10","unstructured":"Catelli R., Gargiulo F., Casola V., De Pietro G., Fujita H., Esposito M. A novel covid-19 data set and an effective deep learning approach for the de-identification of italian medical records. IEEE Access"},{"key":"2306_CR11","unstructured":"Gong Y., Luo H., Zhang J. Natural language inference over interaction space, arXiv:1709.04348"},{"key":"2306_CR12","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1162\/tacl_a_00097","volume":"4","author":"W Yin","year":"2016","unstructured":"Yin W., Sch\u00fctze H, Xiang B., Zhou B. (2016) ABCNN: Attention-Based convolutional neural network for modeling sentence pairs. Trans Assoc Comput Linguist 4:259\u2013272","journal-title":"Trans Assoc Comput Linguist"},{"key":"2306_CR13","unstructured":"Liu X., Chen Q., Deng C., Zeng H., Chen J., Li D., Tang B. (2018) LCQMC: A large-scale Chinese question matching corpus. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 1952\u20131962"},{"key":"2306_CR14","unstructured":"MacCartney B., Manning C.D. (2009) Natural language inference. Citeseer"},{"key":"2306_CR15","doi-asserted-by":"crossref","unstructured":"Bilotti M.W., Ogilvie P., Callan J., Nyberg E. (2007) Structured retrieval for question answering. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 351\u2013358","DOI":"10.1145\/1277741.1277802"},{"key":"2306_CR16","unstructured":"Wang M., Smith N.A., Mitamura T. (2007) What is the Jeopardy model? a quasi-synchronous grammar for QA. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 22\u201332"},{"issue":"1","key":"2306_CR17","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.jal.2005.12.005","volume":"5","author":"D Moldovan","year":"2007","unstructured":"Moldovan D., Clark C., Harabagiu S., Hodges D. (2007) Cogex: a semantically and contextually enriched logic prover for question answering. J Appl Log 5(1):49\u201369","journal-title":"J Appl Log"},{"key":"2306_CR18","doi-asserted-by":"crossref","unstructured":"Chen J., Chen Q., Liu X., Yang H., Lu D., Tang B. (2018) The BQ corpus: A large-scale domain-specific Chinese corpus for sentence semantic equivalence identification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4946\u20134951","DOI":"10.18653\/v1\/D18-1536"},{"key":"2306_CR19","doi-asserted-by":"crossref","unstructured":"Bowman S.R., Angeli G., Potts C., Manning C.D. A large annotated corpus for learning natural language inference. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics","DOI":"10.18653\/v1\/D15-1075"},{"key":"2306_CR20","unstructured":"Kim Y. Convolutional neural networks for sentence classification, arXiv:1408.5882"},{"key":"2306_CR21","doi-asserted-by":"crossref","unstructured":"Dey R., Salemt F.M. (2017) Gate-variants of gated recurrent unit (GRU) neural networks. In: 2017 IEEE 60th international midwest symposium on circuits and systems (MWSCAS). IEEE, pp. 1597\u20131600","DOI":"10.1109\/MWSCAS.2017.8053243"},{"issue":"10","key":"2306_CR22","doi-asserted-by":"publisher","first-page":"2222","DOI":"10.1109\/TNNLS.2016.2582924","volume":"28","author":"K Greff","year":"2016","unstructured":"Greff K., Srivastava R.K., Koutn\u00edk J., Steunebrink B.R., Schmidhuber J. (2016) Lstm: A search space odyssey. IEEE Trans Neural Netw Learn Syst 28(10):2222\u20132232","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"2306_CR23","doi-asserted-by":"crossref","unstructured":"Mueller J., Thyagarajan A. (2016) Siamese recurrent architectures for learning sentence similarity. In: Thirtieth AAAI conference on artificial intelligence","DOI":"10.1609\/aaai.v30i1.10350"},{"key":"2306_CR24","doi-asserted-by":"crossref","unstructured":"Varior R.R., Shuai B., Lu J., Xu D., Wang G. (2016) A siamese long short-term memory architecture for human re-identification. In: European Conference on Computer Vision. Springer, pp. 135\u2013153","DOI":"10.1007\/978-3-319-46478-7_9"},{"key":"2306_CR25","unstructured":"Nie Y., Bansal M. Shortcut-stacked sentence encoders for multi-domain inference, arXiv:1708.02312"},{"key":"2306_CR26","unstructured":"Shen T., Zhou T., Long G., Jiang J., Wang S., Zhang C. Reinforced self-attention network: A hybrid of hard and soft attention for sequence modeling, arXiv:1801.10296"},{"key":"2306_CR27","doi-asserted-by":"crossref","unstructured":"Huang Q., Bu J., Xie W., Yang S., Wu W., Liu L. (2019) Multi-task sentence encoding model for semantic retrieval in question answering systems. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp. 1\u20138","DOI":"10.1109\/IJCNN.2019.8852327"},{"key":"2306_CR28","unstructured":"Wang S., Jiang J. A compare-aggregate model for matching text sequences, arXiv:1611.01747"},{"key":"2306_CR29","doi-asserted-by":"crossref","unstructured":"Kim S., Kang I., Kwak N. (2019) Semantic sentence matching with densely-connected recurrent and co-attentive information. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 33, pp. 6586\u20136593","DOI":"10.1609\/aaai.v33i01.33016586"},{"key":"2306_CR30","doi-asserted-by":"crossref","unstructured":"Zheng G., Mukherjee S., Dong X.L., Li F. (2018) Opentag: Open attribute value extraction from product profiles. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1049\u20131058","DOI":"10.1145\/3219819.3219839"},{"key":"2306_CR31","doi-asserted-by":"crossref","unstructured":"Zhang K., Lv G., Wang L., Wu L., Chen E., Wu F., Xie X. (2019) DRr-Net: Dynamic re-read network for sentence semantic matching. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol 33, pp. 7442\u20137449","DOI":"10.1609\/aaai.v33i01.33017442"},{"key":"2306_CR32","unstructured":"Mikolov T., Chen K., Corrado G., Dean J. Efficient estimation of word representations in vector space, arXiv:1301.3781"},{"key":"2306_CR33","doi-asserted-by":"crossref","unstructured":"Pennington J., Socher R., Manning C.D. (2014) Glove: Global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532\u20131543","DOI":"10.3115\/v1\/D14-1162"},{"key":"2306_CR34","unstructured":"Li S., Zhao Z., Hu R., Li W., Liu T., Du X. Analogical reasoning on Chinese morphological and semantic relations, arXiv:1805.06504"},{"key":"2306_CR35","unstructured":"Ioffe S., Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift, arXiv:1502.03167"},{"key":"2306_CR36","unstructured":"Wang Z., Mi H., Ittycheriah A. Semi-supervised clustering for short text via deep representation learning, arXiv:1602.06797"},{"key":"2306_CR37","unstructured":"Tomar G.S., Duque T., T\u00e4ckstr\u00f6m O, Uszkoreit J., Das D. Neural paraphrase identification of questions with noisy pretraining, arXiv:1704.04565"},{"key":"2306_CR38","unstructured":"Tay Y., Tuan L.A., Hui S.C. A compare-propagate architecture with alignment factorization for natural language inference. arXiv:1801.00102"},{"key":"2306_CR39","unstructured":"Choi J., Yoo K.M., Lee S-g Learning to compose task-specific tree structures, arXiv:1707.02786"},{"key":"2306_CR40","unstructured":"Im J., Cho S. Distance-based self-attention network for natural language inference, arXiv:1712.02047"}],"container-title":["Applied Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-021-02306-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-021-02306-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-021-02306-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,29]],"date-time":"2022-12-29T05:31:26Z","timestamp":1672291886000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-021-02306-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,31]]},"references-count":40,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,1]]}},"alternative-id":["2306"],"URL":"https:\/\/doi.org\/10.1007\/s10489-021-02306-5","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"value":"0924-669X","type":"print"},{"value":"1573-7497","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,31]]},"assertion":[{"value":"2 March 2021","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 May 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}