{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,13]],"date-time":"2023-09-13T21:05:11Z","timestamp":1694639111290},"reference-count":82,"publisher":"Cambridge University Press (CUP)","issue":"3","license":[{"start":{"date-parts":[[2010,6,15]],"date-time":"2010-06-15T00:00:00Z","timestamp":1276560000000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2010,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We show how a quantitative context may be established for what is essentially qualitative in nature by topologically embedding a lexicon (here, WordNet) in a complete metric space. This novel transformation establishes a natural connection between the order relation in the lexicon (e.g., hyponymy) and the notion of distance in the metric space, giving rise to effective word-level and document-level lexical semantic distance measures. We provide a formal account of the topological transformation and demonstrate the value of our metrics on several experiments involving information retrieval and document clustering tasks.<\/jats:p>","DOI":"10.1017\/s1351324910000045","type":"journal-article","created":{"date-parts":[[2010,6,15]],"date-time":"2010-06-15T13:17:38Z","timestamp":1276607858000},"page":"245-275","source":"Crossref","is-referenced-by-count":0,"title":["A topological embedding of the lexicon for semantic distance computation"],"prefix":"10.1017","volume":"16","author":[{"given":"N.","family":"DAVIS","sequence":"first","affiliation":[]},{"given":"C.","family":"GIRAUD-CARRIER","sequence":"additional","affiliation":[]},{"given":"D.","family":"JENSEN","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2010,6,15]]},"reference":[{"key":"S1351324910000045_ref5","first-page":"993","article-title":"Latent Dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"S1351324910000045_ref81","first-page":"1617","volume-title":"Proceedings of the Eighteenth Annual Conference on Neural Information Processing Systems","author":"Zhang","year":"2004"},{"key":"S1351324910000045_ref79","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009982220290"},{"key":"S1351324910000045_ref78","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90027-1"},{"key":"S1351324910000045_ref75","first-page":"54","volume-title":"Proceedings of the Thirty-Ninth Hawaii International Conference on System Sciences","author":"Wang","year":"2006"},{"key":"S1351324910000045_ref64","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog0901_5"},{"key":"S1351324910000045_ref63","doi-asserted-by":"publisher","DOI":"10.1145\/365628.365657"},{"key":"S1351324910000045_ref62","first-page":"405","volume-title":"Proceedings of the Fourteenth ACM International Conference on Information and Knowledge Management","author":"Roy","year":"2005"},{"key":"S1351324910000045_ref19","volume-title":"Matrix Computations","author":"Golub","year":"1996"},{"key":"S1351324910000045_ref67","doi-asserted-by":"publisher","DOI":"10.1109\/SKG.2007.154"},{"key":"S1351324910000045_ref61","first-page":"299","volume-title":"Proceedings of the Second International Conference of the Global WordNet Association","author":"Rosso","year":"2004"},{"key":"S1351324910000045_ref60","first-page":"374","volume-title":"Proceedings of the Thirteenth International World Wide Web Conference","author":"Rocha","year":"2004"},{"key":"S1351324910000045_ref11","volume-title":"Functional Analysis: Theory and Applications","author":"Edwards","year":"1965"},{"key":"S1351324910000045_ref77","first-page":"1065","volume-title":"Proceedings of the Twenty-First International Conference on Computational Linguistics and Forty-Fourth Annual Meeting of the ACL","author":"Wiebe","year":"2006"},{"key":"S1351324910000045_ref59","unstructured":"Richardson R. , and Smeaton A. F. 1995. Using WordNet in a knowledge-based approach to information retrieval. Technical Report CA-0395, Dublin City University, School of Computer Applications."},{"key":"S1351324910000045_ref73","first-page":"61","volume-title":"Proceedings of the Seventeenth ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Voorhees","year":"1994"},{"key":"S1351324910000045_ref35","doi-asserted-by":"publisher","DOI":"10.1080\/01638539809545028"},{"key":"S1351324910000045_ref58","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1613\/jair.514","article-title":"Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language","volume":"11","author":"Resnik","year":"1999","journal-title":"Journal of Artificial Intelligence Research"},{"key":"S1351324910000045_ref12","doi-asserted-by":"publisher","DOI":"10.1111\/j.1749-6632.1993.tb52513.x"},{"key":"S1351324910000045_ref55","first-page":"179","volume-title":"WordNet: An Electronic Lexical Database","author":"Priss","year":"1998"},{"key":"S1351324910000045_ref36","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.104.2.211"},{"key":"S1351324910000045_ref32","first-page":"258","volume-title":"Proceedings of the Twenty-Seventh ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Kim","year":"2004"},{"key":"S1351324910000045_ref54","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1109\/COGINF.2003.1225968","volume-title":"Proceedings of the Second IEEE International Conference on Cognitive Informatics","author":"Prince","year":"2003"},{"key":"S1351324910000045_ref52","first-page":"1024","volume-title":"Proceedings of the Nineteenth National Conference on Artificial Intelligence","author":"Pedersen","year":"2004"},{"key":"S1351324910000045_ref82","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1145\/1060745.1060812","volume-title":"Proceedings of the Fourteenth International World Wide Web Conference","author":"Zhang","year":"2005"},{"key":"S1351324910000045_ref49","doi-asserted-by":"publisher","DOI":"10.1145\/1459352.1459355"},{"key":"S1351324910000045_ref16","first-page":"59","volume-title":"Proceedings of the SIAM International Conference on Data Mining","author":"Fung","year":"2003"},{"key":"S1351324910000045_ref39","first-page":"477","volume-title":"Handbook of Natural Language Processing","author":"Lebart","year":"2000"},{"key":"S1351324910000045_ref9","first-page":"318","volume-title":"Proceedings of the Fifteenth ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Cutting","year":"1992"},{"key":"S1351324910000045_ref31","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316801"},{"key":"S1351324910000045_ref43","first-page":"1","volume-title":"Proceedings of the COLING Workshop on Building and Using Semantic Networks (SemaNet'02)","author":"Mann","year":"2002"},{"key":"S1351324910000045_ref15","doi-asserted-by":"publisher","DOI":"10.1007\/BF00114265"},{"key":"S1351324910000045_ref6","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2006.32.1.13"},{"key":"S1351324910000045_ref26","first-page":"2","article-title":"Introduction to the Special Issue on Word Sense Disambiguation","volume":"24","author":"Ide","year":"1998","journal-title":"Computational Linguistics"},{"key":"S1351324910000045_ref20","doi-asserted-by":"publisher","DOI":"10.1145\/775152.775250"},{"key":"S1351324910000045_ref57","unstructured":"Resnik P. 1993. Selection and Information: A Class-based Approach to Lexical Relationships. PhD thesis, University of Pennsylvania, Philadelphia, PA."},{"key":"S1351324910000045_ref50","volume-title":"Invited Talk at the Sixth Annual Workshop on Technology for Family History and Genealogical Research","author":"Norvig","year":"2006"},{"key":"S1351324910000045_ref29","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2007.11.001"},{"key":"S1351324910000045_ref21","first-page":"384","volume-title":"Proceedings of the Twenty-Fourth ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Henstock","year":"2001"},{"key":"S1351324910000045_ref72","first-page":"467","volume-title":"Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing","author":"Tsuruoka","year":"2005"},{"key":"S1351324910000045_ref8","unstructured":"Curran J. R. 2004. From Distributional to Semantic Similarity. PhD thesis, University of Edinburgh, Edinburgh, UK."},{"key":"S1351324910000045_ref42","first-page":"281","volume-title":"Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability","author":"MacQueen","year":"1967"},{"key":"S1351324910000045_ref10","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"S1351324910000045_ref2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4020-4809-8"},{"key":"S1351324910000045_ref22","volume-title":"Topology","author":"Hocking","year":"1961"},{"key":"S1351324910000045_ref48","first-page":"35","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Mohammad","year":"2006"},{"key":"S1351324910000045_ref7","volume-title":"Proceedings of the Eleventh International World Wide Web Conference","author":"Choudhary","year":"2002"},{"key":"S1351324910000045_ref30","doi-asserted-by":"publisher","DOI":"10.1007\/BF02289588"},{"key":"S1351324910000045_ref80","first-page":"46","volume-title":"Proceedings of the Twenty-First ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Zamir","year":"1998"},{"key":"S1351324910000045_ref1","first-page":"19","volume-title":"Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics \u2013 Human Language Technologies","author":"Agirre","year":"2009"},{"key":"S1351324910000045_ref68","first-page":"140","volume-title":"Proceedings of the Twentieth ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Stairmand","year":"1997"},{"key":"S1351324910000045_ref4","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775110"},{"key":"S1351324910000045_ref25","doi-asserted-by":"publisher","DOI":"10.1007\/BF01908075"},{"key":"S1351324910000045_ref33","first-page":"63","volume-title":"Proceedings of the ICDM Workshop on Clustering Large Data Sets","author":"Kogan","year":"2003"},{"key":"S1351324910000045_ref23","first-page":"50","volume-title":"Proceedings of the Twenty-Second ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Hofmann","year":"1999"},{"key":"S1351324910000045_ref45","first-page":"280","volume-title":"Proceedings of the Thirteenth IEEE International Conference on Tools with Artificial Intelligence","author":"Mihalcea","year":"2001"},{"key":"S1351324910000045_ref17","first-page":"1606","volume-title":"Proceedings of the Twentieth International Joint Conference on Artificial Intelligence","author":"Gabrilovich","year":"2007"},{"key":"S1351324910000045_ref41","unstructured":"Lucene 2007. An open source information retrieval library. http:\/\/lucene.apache.org\/java\/docs\/index.html"},{"key":"S1351324910000045_ref24","first-page":"30","volume-title":"Proceedings of the IJCAI Workshop on Text Learning: Beyond Supervision","author":"Hotho","year":"2001"},{"key":"S1351324910000045_ref27","volume-title":"Algorithms for Clustering Data","author":"Jain","year":"1988"},{"key":"S1351324910000045_ref34","doi-asserted-by":"publisher","DOI":"10.1007\/BF00337288"},{"key":"S1351324910000045_ref71","volume-title":"Vector Spaces and Matrices","author":"Thrall","year":"1970"},{"key":"S1351324910000045_ref40","unstructured":"Leouski A. , and Croft W. 1996. An evaluation of techniques for clustering search results. Technical Report IR-76, Department of Computer Science, University of Massachusetts, Amherst, MA."},{"key":"S1351324910000045_ref74","doi-asserted-by":"publisher","DOI":"10.1145\/1067268.1067272"},{"key":"S1351324910000045_ref56","doi-asserted-by":"publisher","DOI":"10.1109\/21.24528"},{"key":"S1351324910000045_ref47","doi-asserted-by":"publisher","DOI":"10.1093\/ijl\/3.4.235"},{"key":"S1351324910000045_ref3","first-page":"407","volume-title":"Proceedings of the Sixth ACM SIGKDD International Conference","author":"Beeferman","year":"2000"},{"key":"S1351324910000045_ref37","first-page":"331","volume-title":"Proceedings of the Twelfth International Conference of Machine Learning","author":"Lang","year":"1995"},{"key":"S1351324910000045_ref38","doi-asserted-by":"publisher","DOI":"10.1145\/312129.312186"},{"key":"S1351324910000045_ref44","first-page":"775","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Marton","year":"2009"},{"key":"S1351324910000045_ref46","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219748"},{"key":"S1351324910000045_ref76","first-page":"1015","volume-title":"Proceedings of the Twentieth International Conference of Computational Linguistics","author":"Weeds","year":"2004"},{"key":"S1351324910000045_ref66","first-page":"208","volume-title":"Proceedings of the Twenty-Third ACM SIGIR International Conference on Research and Development in Information Retrieval","author":"Slonim","year":"2000"},{"key":"S1351324910000045_ref70","first-page":"49","volume-title":"Proceedings of the IJCAI Workshop on Ontology Learning","author":"Termier","year":"2001"},{"key":"S1351324910000045_ref65","doi-asserted-by":"publisher","DOI":"10.1145\/361219.361220"},{"key":"S1351324910000045_ref13","volume-title":"Cluster Analysis","author":"Everitt","year":"1993"},{"key":"S1351324910000045_ref69","first-page":"109","volume-title":"Proceedings of the KDD Workshop on Text Mining","author":"Steinbach","year":"2000"},{"key":"S1351324910000045_ref53","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1613\/jair.2308","article-title":"Knowledge derived from Wikipedia for computing semantic relatedness","volume":"30","author":"Ponzetto","year":"2007","journal-title":"Journal of Artificial Intelligence Research"},{"key":"S1351324910000045_ref18","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(89)90046-5"},{"key":"S1351324910000045_ref51","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2006.06.004"},{"key":"S1351324910000045_ref14","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/7287.001.0001","volume-title":"WordNet: An Electronic Lexical Database","author":"Fellbaum","year":"1998"},{"key":"S1351324910000045_ref28","first-page":"259","volume-title":"Proceedings of the Seventh International Workshop on Computational Semantics","author":"Jensen","year":"2007"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324910000045","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,10,29]],"date-time":"2021-10-29T13:52:12Z","timestamp":1635515532000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324910000045\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,6,15]]},"references-count":82,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2010,7]]}},"alternative-id":["S1351324910000045"],"URL":"https:\/\/doi.org\/10.1017\/s1351324910000045","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,6,15]]}}}