{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T01:17:15Z","timestamp":1768353435378,"version":"3.49.0"},"reference-count":61,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2023,6,8]],"date-time":"2023-06-08T00:00:00Z","timestamp":1686182400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["52275249"],"award-info":[{"award-number":["52275249"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["FJ2021B128"],"award-info":[{"award-number":["FJ2021B128"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Social Science Foundation of Fujian Province","award":["52275249"],"award-info":[{"award-number":["52275249"]}]},{"name":"Social Science Foundation of Fujian Province","award":["FJ2021B128"],"award-info":[{"award-number":["FJ2021B128"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems"],"abstract":"<jats:p>Patent application is one of the important ways to protect innovation achievements that have great commercial value for enterprises; it is the initial step for enterprises to set the business development track, as well as a powerful means to protect their core competitiveness. The emergence of a large amount of patent data makes the effective detection of patent data difficult, and patent infringement cases occur frequently. Manual measurement in patent detection is slow, costly, and subjective, and can only play an auxiliary role in measuring the validity of patents. Protecting the inventive achievements of patent holders and realizing more accurate and effective patent detection were the issues explored by academics. There are five main methods to measure patent similarity: clustering-based method, vector space model (VSM)-based method, subject\u2013action\u2013object (SAO) structure-based method, deep learning-based method, and patent structure-based method. To solve this problem, this paper proposes a calculation method to fuse the similarity of patent text and image. Firstly, the SAO structure extraction technique is used for the patent text to obtain the effective content of the text, and the SAO structure is compared for similarity; secondly, the patent image information is extracted and compared; finally, the patent similarity is obtained by fusing the two aspects of information. The feasibility and effectiveness of the scheme are proven by studying a large number of patent similarity cases in the field of mechanical structures.<\/jats:p>","DOI":"10.3390\/systems11060294","type":"journal-article","created":{"date-parts":[[2023,6,9]],"date-time":"2023-06-09T01:32:34Z","timestamp":1686274354000},"page":"294","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Measuring Patent Similarity Based on Text Mining and Image Recognition"],"prefix":"10.3390","volume":"11","author":[{"given":"Wenguang","family":"Lin","sequence":"first","affiliation":[{"name":"School of Mechanical and Automotive Engineering, Xiamen University of Technology, Xiamen 361024, China"}]},{"given":"Wenqiang","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Mechanical and Automotive Engineering, Xiamen University of Technology, Xiamen 361024, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0951-2734","authenticated-orcid":false,"given":"Renbin","family":"Xiao","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.wpi.2016.11.004","article-title":"\u201c80% of technical information found only in patents\u201d\u2014Is there proof of this?","volume":"48","author":"Asche","year":"2017","journal-title":"World Pat. Inf."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Zhai, C.Y., Du, D.B., and Shi, W.T. (2023). Spatiotemporal Evolution and Determinants of the Geography of Chinese Patents Abroad: A Case Study of Strategic Emerging Industries. Systems, 11.","DOI":"10.3390\/systems11010033"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ma, H.K. (2023). The Dynamics of China\u2019s Collaborative Innovation Network in Agricultural Biotechnology: A Spatial-Topological Perspective. Systems, 11.","DOI":"10.3390\/systems11020073"},{"key":"ref_4","unstructured":"(2023, May 14). International Patent Applications Defy 2022 Challenges, Continue Upward Trend. Available online: https:\/\/www.wipo.int\/pressroom\/en\/articles\/2023\/article_0002.html."},{"key":"ref_5","unstructured":"(2023, March 07). Global Innovation Index 2022. Available online: https:\/\/www.wipo.int\/global_innovation_index\/en\/2022\/."},{"key":"ref_6","unstructured":"(2023, March 20). WIPO: China\u2019s Global Ranking in Innovation Steadily Improves. Available online: https:\/\/baijiahao.baidu.com\/s?id=1745312150286743555&wfr=spider&for=pc."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1002\/smj.2699","article-title":"Text matching to measure patent similarity","volume":"39","author":"Arts","year":"2018","journal-title":"Strat. Manag. J."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3605","DOI":"10.1016\/j.eswa.2013.11.045","article-title":"Creating patents on the new technology using analogy-based patent mining","volume":"41","author":"Jeong","year":"2014","journal-title":"Expert Syst. Appl."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1108\/LHT-12-2018-0201","article-title":"Bibliometric analysis of patent infringement retrieval model based on self-organizing map neural network algorithm","volume":"38","author":"Zhu","year":"2020","journal-title":"Libr. Hi Tech"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1016\/j.technovation.2008.10.006","article-title":"An approach to discovering new technology opportunities: Keyword-based patent map approach","volume":"29","author":"Lee","year":"2009","journal-title":"Technovation"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1111\/1467-9310.00261","article-title":"On the development and application of a self-organizing feature map-based patent map","volume":"32","author":"Yoon","year":"2002","journal-title":"R&D Manag."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1016\/j.ipm.2003.11.004","article-title":"Using the patent co-citation approach to establish a new patent classification system","volume":"41","year":"2005","journal-title":"Inform. Process. Manag."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1007\/s11192-009-0046-6","article-title":"Exploring the feasibility and accuracy of Latent Semantic Analysis based text mining techniques to detect similarity between patent documents and scientific publications","volume":"82","author":"Magerman","year":"2010","journal-title":"Scientometrics"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"5385","DOI":"10.1007\/s11192-021-04187-4","article-title":"Doc2vec-based link prediction approach using SAO structures: Application to patent network","volume":"127","author":"Yoon","year":"2022","journal-title":"Scientometrics"},{"key":"ref_15","unstructured":"Le, Q., and Mikolov, T. (2014, January 21). Distributed representations of sentences and documents. Proceedings of the 31st International Conference on International Conference on Machine Learning (PMLR), Beijing, China."},{"key":"ref_16","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013, January 2\u20134). Efficient Estimation of Word Representations in Vector Space. Proceedings of the International Conference on Learning Representations (ICLR), Scottsdale, AZ, USA."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"103379","DOI":"10.1016\/j.compind.2020.103379","article-title":"Patent infringement analysis using a text mining technique based on SAO structure","volume":"125","author":"Kim","year":"2021","journal-title":"Comput. Ind."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Jang, H.J., Park, S.J., and Yoon, B. (2022). Exploring Technology Opportunities Based on User Needs: Application of Opinion Mining and SAO Analysis. Eng. Manag. J., 1\u201314.","DOI":"10.1080\/10429247.2022.2050130"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1007\/s11192-011-0522-7","article-title":"Identifying patent infringement using SAO based semantic technological similarities","volume":"90","author":"Park","year":"2012","journal-title":"Scientometrics"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"66538","DOI":"10.1109\/ACCESS.2020.2984024","article-title":"Generic SAO Similarity Measure via Extended Sorensen-Dice Index","volume":"8","author":"Li","year":"2020","journal-title":"IEEE Access"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"297","DOI":"10.2307\/1932409","article-title":"Measures of the Amount of Ecologic Association Between Species","volume":"26","author":"Dice","year":"1945","journal-title":"Ecology"},{"key":"ref_22","first-page":"1","article-title":"A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons","volume":"5","year":"1948","journal-title":"Biol. Skr."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1007\/s11192-012-0830-6","article-title":"Identifying technological competition trends for R&D planning using dynamic patent maps: SAO-based content analysis","volume":"94","author":"Yoon","year":"2013","journal-title":"Scientometrics"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s11192-011-0383-0","article-title":"Identifying rapidly evolving technological trends for R&D planning using SAO-based semantic patent networks","volume":"88","author":"Yoon","year":"2011","journal-title":"Scientometrics"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"855","DOI":"10.1080\/09537325.2014.909926","article-title":"A semantic analysis approach for identifying patent infringement based on a product\u2013patent map","volume":"26","author":"Park","year":"2014","journal-title":"Technol. Anal. Strat. Manag."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1007\/s11192-020-03385-w","article-title":"Research on classification and similarity of patent citation based on deep learning","volume":"123","author":"Lu","year":"2020","journal-title":"Scientometrics"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Ma, C., Zhao, T., and Li, H. (2018, January 7\u20138). A Method for Calculating Patent Similarity Using Patent Model Tree Based on Neural Network. Proceedings of the 9th International Conference on Brain Inspired Cognitive System (BICS), Xi\u2019an, China.","DOI":"10.1007\/978-3-030-00563-4_62"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Mueller, J., and Thyagarajan, A. (2016, January 12\u201317). Siamese Recurrent Architectures for Learning Sentence Similarity. Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA.","DOI":"10.1609\/aaai.v30i1.10350"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Neculoiu, P., Versteegh, M., and Rotaru, M. (2016, January 7\u201312). Learning text similarity with siamese recurrent networks. Proceedings of the 1st Workshop on Representation Learning for NLP (RepL4NLP), Berlin, Germany.","DOI":"10.18653\/v1\/W16-1617"},{"key":"ref_30","first-page":"1108","article-title":"A hybrid similarity measure method for patent portfolio analysis","volume":"10","author":"Zhang","year":"2016","journal-title":"J. Inf."},{"key":"ref_31","unstructured":"Fujii, A., and Ishikawa, T. (2005, January 6\u20139). Document Structure Analysis for the NTCIR-5 Patent Retrieval Task. Proceedings of the NTCIR-5 Workshop Meeting (NTCIR), Tokyo, Japan."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Robertson, S.E., and Walker, S. (1994, January 3\u20136). Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. Proceedings of the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Dublin, Ireland.","DOI":"10.1007\/978-1-4471-2099-5_24"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Kim, B.T., and Hyun, E. (2023). Mapping the Landscape of Blockchain Technology Knowledge: A Patent Co-Citation and Semantic Similarity Approach. Systems, 11.","DOI":"10.3390\/systems11030111"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.techfore.2011.06.009","article-title":"A stochastic patent citation analysis approach to assessing future technological impacts","volume":"79","author":"Lee","year":"2012","journal-title":"Technol. Forecast. Soc. Chang."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1007\/s11192-015-1531-8","article-title":"New multi-stage similarity measure for calculation of pairwise patent similarity in a patent citation network","volume":"103","author":"Rodriguez","year":"2015","journal-title":"Scientometrics"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"984","DOI":"10.1002\/asi.23734","article-title":"Which Type of Citation Analysis Generates the Most Accurate Taxonomy of Scientific and Technical Knowledge?","volume":"68","author":"Klavans","year":"2017","journal-title":"J. Am. Soc. Inf. Sci Technol."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Wu, H., Chen, H., Lee, K., and Liu, Y. (2010, January 7\u201310). A method for assessing patent similarity using direct and indirect citation links. Proceedings of the 2010 IEEE International Conference on Industrial Engineering and Engineering Management, Macao, China.","DOI":"10.1109\/IEEM.2010.5674439"},{"key":"ref_38","first-page":"158","article-title":"The Patent-Classification Technology\/Function Matrix\u2014A Systematic Method for Design around","volume":"18","author":"Cheng","year":"2013","journal-title":"JIPR"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"2531","DOI":"10.1002\/asi.21414","article-title":"Adapting semantic natural language processing technology to address information overload in influenza epidemic management","volume":"61","author":"Keselman","year":"2010","journal-title":"J. Am. Soc. Inf. Sci. Technol."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s11192-019-03191-z","article-title":"Measuring patent similarity with SAO semantic analysis","volume":"121","author":"Wang","year":"2019","journal-title":"Scientometrics"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1007\/s11192-018-2962-9","article-title":"Investigating technology opportunities: The use of SAOx analysis","volume":"118","author":"Kim","year":"2019","journal-title":"Scientometrics"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1109\/TEM.2020.2970972","article-title":"Integrating Technology-Relationship-Technology Semantic Analysis and Technology Roadmapping Method: A Case of Elderly Smart Wear Technology","volume":"69","author":"Miao","year":"2022","journal-title":"IEEE Trans. Eng. Manag."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1016\/j.techsoc.2019.02.002","article-title":"Demand identification model of potential technology based on SAO structure semantic analysis: The case of new energy and energy saving fields","volume":"58","author":"He","year":"2019","journal-title":"Technol. Soc."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1109\/TEM.2020.3047370","article-title":"Improved Technology Similarity Measurement in the Medical Field based on Subject-Action-Object Semantic Structure: A Case Study of Alzheimer\u2019s Disease","volume":"70","author":"Li","year":"2023","journal-title":"IEEE Trans. Eng. Manag."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Lin, W., Liu, X., and Xiao, R. (2022). Research on Product Core Component Acquisition Based on Patent Semantic Network. Entropy, 24.","DOI":"10.3390\/e24040549"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"656","DOI":"10.1007\/s11633-018-1117-z","article-title":"An Overview of Contour Detection Approaches","volume":"15","author":"Gong","year":"2018","journal-title":"Int. J. Autom. Comput."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1109\/TPAMI.2007.1027","article-title":"Laplacian operator-based edge detectors","volume":"29","author":"Wang","year":"2007","journal-title":"IEEE Trans Pattern Anal. Mach. Intell."},{"key":"ref_48","unstructured":"Nixon, M.S., and Aguado, A.S. (2012). Feature Extraction & Image Processing for Computer Vision, Elsevier. [3rd ed.]."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1109\/TPAMI.1986.4767851","article-title":"A computational approach to edge detection","volume":"8","author":"Canny","year":"1986","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1109\/TPAMI.2004.1273918","article-title":"Learning to detect natural image boundaries using local brightness, color, and texture cues","volume":"26","author":"Martin","year":"2004","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1007\/BF01420590","article-title":"A Bayesian multiple-hypothesis approach to edge grouping and contour segmentation","volume":"11","author":"Cox","year":"1993","journal-title":"Int. J. Comput. Vis."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1109\/34.659934","article-title":"A generic grouping algorithm and its quantitative analysis","volume":"20","author":"Amir","year":"1998","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Elder, J.H., and Zucker, S.W. (1996, January 15\u201318). Computing Contour Closure. Proceedings of the 4th European Conference on Computer Vision (ECCV), Cambridge, UK.","DOI":"10.1007\/BFb0015553"},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1109\/TPAMI.2003.1190570","article-title":"Segmentation of multiple salient closed contours from real images","volume":"25","author":"Mahamud","year":"2003","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_55","unstructured":"Arbelaez, P. (2006, January 17\u201322). Boundary Extraction in Natural Images Using Ultrametric Contour Maps. Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), New York, NY, USA."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Arbelaez, P., Maire, M., Fowlkes, C., and Malik, J. (2009, January 20\u201325). From contours to regions: An empirical evaluation. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206707"},{"key":"ref_57","unstructured":"Bengio, Y., Ducharme, R., and Vincent, P. (2000, January 1). A neural probabilistic language model. Proceedings of the Advances in Neural Information Processing Systems (NIPS), Denver, CO, USA."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Mnih, A., and Hinton, G. (2007, January 20\u201324). Three new graphical models for statistical language modelling. Proceedings of the 24th International Conference on Machine Learning (ICML), Corvalis, OR, USA.","DOI":"10.1145\/1273496.1273577"},{"key":"ref_59","first-page":"1","article-title":"Collective intelligence: Conception, research progress and application analysis","volume":"41","author":"Xiao","year":"2022","journal-title":"J. Nanchang Inst. Technol."},{"key":"ref_60","first-page":"1","article-title":"From swarm intelligence optimization to swarm intelligence evolution","volume":"42","author":"Xiao","year":"2023","journal-title":"J. Nanchang Inst. Technol."},{"key":"ref_61","first-page":"1","article-title":"From design for mass customization to design for mass personalization","volume":"40","author":"Xiao","year":"2021","journal-title":"J. Nanchang Inst. Technol."}],"container-title":["Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-8954\/11\/6\/294\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:50:47Z","timestamp":1760125847000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-8954\/11\/6\/294"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,8]]},"references-count":61,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2023,6]]}},"alternative-id":["systems11060294"],"URL":"https:\/\/doi.org\/10.3390\/systems11060294","relation":{},"ISSN":["2079-8954"],"issn-type":[{"value":"2079-8954","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,8]]}}}