{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,8]],"date-time":"2026-06-08T20:05:12Z","timestamp":1780949112954,"version":"3.54.1"},"reference-count":35,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2017,4,17]],"date-time":"2017-04-17T00:00:00Z","timestamp":1492387200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Most of the previous works on relation extraction between named entities are often limited to extracting the pre-defined types, which are inefficient for massive unlabeled text data. Recently, with the appearance of various distributional word representations, unsupervised methods for many natural language processing (NLP) tasks have been widely researched. In this paper, we focus on a new finding of unsupervised relation extraction, which is called distributional relation representation. Without requiring the pre-defined types, distributional relation representation aims to automatically learn entity vectors and further estimate semantic similarity between these entities. We choose global vectors (GloVe) as our original model to train entity vectors because of its excellent balance between local context and global statistics in the whole corpus. In order to train model more efficiently, we improve the traditional GloVe model by using cosine similarity between entity vectors to approximate the entity occurrences instead of dot product. Because cosine similarity can convert vector to unit vector, it is intuitively more reasonable and more easily converge to a local optimum. We call the improved model RGloVe. Experimental results on a massive corpus of Sina News show that our proposed model outperforms the traditional global vectors. 
Finally, a graph database of Neo4j is introduced to store these relationships between named entities. The most competitive advantage of Neo4j is that it provides a highly accessible way to query the direct and indirect relationships between entities.<\/jats:p>","DOI":"10.3390\/a10020042","type":"journal-article","created":{"date-parts":[[2017,4,18]],"date-time":"2017-04-18T11:22:04Z","timestamp":1492514524000},"page":"42","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["RGloVe: An Improved Approach of Global Vectors for Distributional Entity Relation Representation"],"prefix":"10.3390","volume":"10","author":[{"given":"Ziyan","family":"Chen","sequence":"first","affiliation":[{"name":"Department of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yu","family":"Huang","sequence":"additional","affiliation":[{"name":"The Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuexian","family":"Liang","sequence":"additional","affiliation":[{"name":"Department of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100190, China"},{"name":"The Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yang","family":"Wang","sequence":"additional","affiliation":[{"name":"The Key Laboratory 
of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xingyu","family":"Fu","sequence":"additional","affiliation":[{"name":"The Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kun","family":"Fu","sequence":"additional","affiliation":[{"name":"The Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2017,4,17]]},"reference":[{"key":"ref_1","first-page":"837","article-title":"The automatic content extraction (ACE) program-tasks, data, and evaluation","volume":"2","author":"Doddington","year":"2004","journal-title":"LREC"},{"key":"ref_2","unstructured":"Banko, M., Etzioni, O., and Center, T. (2008, January 15\u201320). The Tradeoffs between Open and Traditional Relation Extraction. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics, Columbus, OH, USA."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1145\/1409360.1409378","article-title":"Open information extraction from the web","volume":"51","author":"Etzioni","year":"2008","journal-title":"Commun. ACM"},{"key":"ref_4","unstructured":"Etzioni, O., Fader, A., Christensen, J., Soderland, S., and Mausam, M.I. (2011, January 16\u201322). Open Information Extraction: The Second Generation. 
Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Spain."},{"key":"ref_5","unstructured":"Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., and Etzioni, O. (2007, January 6\u201312). Open Information Extraction for the Web. Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India."},{"key":"ref_6","unstructured":"Kalyanpur, A., and Murdock, J.W. (2015, January 28\u201331). Unsupervised Entity-Relation Analysis in IBM Watson. Proceedings of the Third Annual Conference on Advances in Cognitive Systems ACS, Atlanta, GA, USA."},{"key":"ref_7","unstructured":"Fader, A., Soderland, S., and Etzioni, O. (2011, January 27\u201331). Identifying Relations for Open Information Extraction. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Edinburgh, UK."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Tseng, Y.-H., Lee, L.-H., Lin, S.-Y., Liao, B.-S., Liu, M.-J., Chen, H.-H., Etzioni, O., and Fader, A. (2014, January 26\u201330). Chinese Open Relation Extraction for Knowledge Acquisition. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden.","DOI":"10.3115\/v1\/E14-4003"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C.D. (2014, January 25\u201329). Glove: Global Vectors for Word Representation. Proceedings of the Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"969","DOI":"10.1016\/j.ipm.2006.09.012","article-title":"Extracting relation information from text documents by exploring various types of knowledge","volume":"43","author":"Zhou","year":"2007","journal-title":"Inf. Process. 
Manag."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Khayyamian, M., Mirroshandel, S.A., and Abolhassani, H. (June, January 31). Syntactic Tree-Based Relation Extraction Using a Generalization of Collins and Duffy Convolution Tree Kernel. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Boulder, CO, USA.","DOI":"10.3115\/1620932.1620944"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1016\/j.ipm.2012.04.002","article-title":"Social relation extraction from texts using a support-vector-machine-based dependency trigram kernel","volume":"49","author":"Choi","year":"2013","journal-title":"Inf. Process. Manag."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1007\/s11042-013-1380-5","article-title":"An intensive case study on kernel-based relation extraction","volume":"71","author":"Choi","year":"2014","journal-title":"Multimed. Tools Appl."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Zhang, C., Xu, W., Gao, S., and Guo, J. (2014, January 12\u201314). A Bottom-Up Kernel of Pattern Learning for Relation Extraction. Proceedings of the Chinese Spoken Language Processing (ISCSLP), Singapore.","DOI":"10.1109\/ISCSLP.2014.6936605"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Nguyen, T.H., Plank, B., and Grishman, R. (2015, January 27\u201331). Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-Based Method for Relation Extraction. 
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Beijing, China.","DOI":"10.3115\/v1\/P15-1062"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1016\/j.csl.2009.03.001","article-title":"Label propagation via bootstrapped support vectors for semantic relation extraction between named entities","volume":"23","author":"Zhou","year":"2009","journal-title":"Comput. Speech Lang."},{"key":"ref_17","unstructured":"Sun, A., Grishman, R., and Sekine, S. (2011, January 19\u201324). Semi-Supervised Relation Extraction with Large-Scale Word Clustering. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, USA."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Fukui, K.-I., Ono, S., Megano, T., and Numao, M. (2013, January 4\u20136). Evolutionary Distance Metric Learning Approach to Semi-Supervised Clustering with Neighbor Relations. Proceedings of the 2013 IEEE 25th International Conference on Tools with Artificial Intelligence (ICTAI), Herndon, VA, USA.","DOI":"10.1109\/ICTAI.2013.66"},{"key":"ref_19","unstructured":"Maziero, E., Hirst, G., and Pardo, T. (2015, January 5\u201311). Semi-Supervised Never-Ending Learning in Rhetorical Relation Identification. Proceeding of the Recent Advances in Natural Language Processing, Hissar, Bulgaria."},{"key":"ref_20","unstructured":"Min, B., Shi, S., Grishman, R., and Lin, C.-Y. (2012, January 12\u201314). Ensemble Semantics for Large-Scale Unsupervised Relation Extraction. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, J., Jing, Y., Teng, Y., and Li, Q. 
(2012, January 22\u201324). A Novel Clustering Algorithm for Unsupervised Relation Extraction. Proceedings of the Seventh International Conference Digital Information Management (ICDIM), Macau, Macao.","DOI":"10.1109\/ICDIM.2012.6360156"},{"key":"ref_22","unstructured":"De Lacalle, O.L., and Lapata, M. (2013, January 18\u201321). Unsupervised Relation Extraction with General Domain Knowledge. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA."},{"key":"ref_23","unstructured":"Takase, S., Okazaki, N., and Inui, K. (November, January 30). Fast and large-scale unsupervised relation extraction. Proceedings of 29th Pacific Asia Conference on Language, Information and Computation, Shanghai, China."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Remus, S. (2014, January 26\u201330). Unsupervised Relation Extraction of In-Domain Data From Focused Crawls. Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden.","DOI":"10.3115\/v1\/E14-3002"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1016\/j.compbiomed.2016.01.014","article-title":"Unsupervised entity and relation extraction from clinical records in Italian","volume":"72","author":"Alicante","year":"2016","journal-title":"Comput. Biol. Med."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1037\/0033-295X.104.2.211","article-title":"A solution to plato\u2019s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge","volume":"104","author":"Landauer","year":"1997","journal-title":"Psychol. Rev."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1162\/coli.2006.32.3.379","article-title":"Similarity of semantic relations","volume":"32","author":"Turney","year":"2006","journal-title":"Comput. 
Linguist."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1162\/coli.2007.33.2.161","article-title":"Dependency-based construction of semantic space models","volume":"33","author":"Sebastian","year":"2007","journal-title":"Comput. Linguist."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1007\/s10579-010-9129-5","article-title":"Is singular value decomposition useful for word similarity extraction?","volume":"45","author":"Gamallo","year":"2011","journal-title":"Lang. Resour. Eval."},{"key":"ref_30","first-page":"1137","article-title":"A neural probabilistic language model","volume":"3","author":"Bengio","year":"2003","journal-title":"Mach. Learn. Res."},{"key":"ref_31","first-page":"2493","article-title":"Natural language processing (almost) from scratch","volume":"12","author":"Collobert","year":"2011","journal-title":"Mach. Learn. Res."},{"key":"ref_32","unstructured":"Mikolov, T., Chen, K., Corrado, G., and Dean, J. (arXiv, 2013). Efficient estimation of word representations in vector space, arXiv."},{"key":"ref_33","first-page":"2121","article-title":"Adaptive subgradient methods for online learning and stochastic optimization","volume":"12","author":"Duchi","year":"2011","journal-title":"Mach. Learn. Res."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhang, H.-P., Liu, Q., Cheng, X.-Q., Zhang, H., and Yu, H.-K. (2003, January 11\u201312). Chinese Lexical Analysis Using Hierarchical Hidden Markov Model. Proceedings of the second SIGHAN workshop on Chinese language processing, Sapporo, Japan.","DOI":"10.3115\/1119250.1119259"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1037\/h0031619","article-title":"Measuring nominal scale agreement among many raters","volume":"76","author":"Fleiss","year":"1971","journal-title":"Psychol. 
Bull."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/10\/2\/42\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T18:32:51Z","timestamp":1760207571000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/10\/2\/42"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,4,17]]},"references-count":35,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2017,6]]}},"alternative-id":["a10020042"],"URL":"https:\/\/doi.org\/10.3390\/a10020042","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,4,17]]}}}