{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:22:03Z","timestamp":1750306923817,"version":"3.41.0"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2013,3,1]],"date-time":"2013-03-01T00:00:00Z","timestamp":1362096000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Comput. Cult. Herit."],"published-print":{"date-parts":[[2013,3]]},"abstract":"<jats:p>This article describes methods for semiautomatic thesaurus construction, for a cross generation, cross genre, and cross cultural corpus. Semiautomatic thesaurus construction is a complex task, and applying it on a cross generation corpus brings its own challenges. We used a Jewish juristic corpus containing documents and genres that were written across 2000 years, and contain a mix of different languages, dialects, geographies, and writing styles. We evaluated different first and second order methods, and introduced a special annotation scheme for this problem, which showed that first order methods performed surprisingly well. We found that in our case, improving the coverage is the more difficult task, for this we introduce a new algorithm to increase recall (coverage)\u2014which is applicable to many other problems as well, and demonstrates significant improvement in our corpus.<\/jats:p>","DOI":"10.1145\/2442080.2442084","type":"journal-article","created":{"date-parts":[[2013,4,9]],"date-time":"2013-04-09T12:17:58Z","timestamp":1365509878000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Automatic thesaurus construction for cross generation corpus"],"prefix":"10.1145","volume":"6","author":[{"given":"Hadas","family":"Zohar","sequence":"first","affiliation":[{"name":"Bar-Ilan University, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chaya","family":"Liebeskind","sequence":"additional","affiliation":[{"name":"Bar-Ilan University, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathan","family":"Schler","sequence":"additional","affiliation":[{"name":"Bar-Ilan University, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ido","family":"Dagan","sequence":"additional","affiliation":[{"name":"Bar-Ilan University, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,4,11]]},"reference":[{"volume-title":"Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). 19--27","author":"Agirre E.","key":"e_1_2_1_1_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_2_1","DOI":"10.1145\/1242572.1242591"},{"doi-asserted-by":"publisher","key":"e_1_2_1_3_1","DOI":"10.1016\/j.ipm.2006.09.003"},{"volume-title":"Proceedings of the Biennial GSCL Conference. 31--40","year":"2009","author":"Bouma G.","key":"e_1_2_1_4_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_5_1","DOI":"10.3115\/1034678.1034697"},{"volume-title":"Proceedings of the Forum of Governmental Webmasters, Givat Ram Campus (lecture notes in Hebrew).","year":"1997","author":"Choueka Y.","key":"e_1_2_1_6_1"},{"key":"e_1_2_1_7_1","first-page":"4","article-title":"Word association norms, mutual information and lexicography","volume":"16","author":"Church K. W.","year":"1990","journal-title":"ACL Comput. Ling."},{"doi-asserted-by":"crossref","unstructured":"Cover T. M. and Thomas J. A. 1991. Elements of Information Theory. Wiley.   Cover T. M. and Thomas J. A. 1991. Elements of Information Theory. Wiley.","key":"e_1_2_1_8_1","DOI":"10.1002\/0471200611"},{"unstructured":"Curran J. R. 2003. From Distributional to Semantic Similarity. Ph.D thesis Institute for Communicating and Collaborative Systems School of Informatics University of Edinburgh.  Curran J. R. 2003. From Distributional to Semantic Similarity. Ph.D thesis Institute for Communicating and Collaborative Systems School of Informatics University of Edinburgh.","key":"e_1_2_1_9_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_10_1","DOI":"10.3115\/1073083.1073123"},{"volume-title":"ACL: HLT (Short Papers Companion Volume). 265--268.","year":"2008","author":"Elsayed T.","key":"e_1_2_1_11_1"},{"doi-asserted-by":"crossref","unstructured":"Fellbaum C. 1998. WordNet: An Electronic Lexical Database. The MIT Press Cambridge MA.  Fellbaum C. 1998. WordNet: An Electronic Lexical Database. The MIT Press Cambridge MA.","key":"e_1_2_1_12_1","DOI":"10.7551\/mitpress\/7287.001.0001"},{"volume-title":"Proceedings of the FLAIRS Conference. AAAI Press.","author":"Girju R.","key":"e_1_2_1_13_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_14_1","DOI":"10.1007\/978-3-642-13489-0_9"},{"doi-asserted-by":"crossref","unstructured":"Grefenstette G. 1994. Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers Boston.   Grefenstette G. 1994. Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers Boston.","key":"e_1_2_1_15_1","DOI":"10.1007\/978-1-4615-2710-7"},{"doi-asserted-by":"publisher","key":"e_1_2_1_16_1","DOI":"10.1007\/978-3-642-25631-8_42"},{"volume-title":"Proceedings of the 4th International Conference on Advances in Natural Language (LNAI).","author":"HaCohen-Kerner Y.","key":"e_1_2_1_17_1"},{"volume-title":"Proceedings of ACL (Companion Volume).","author":"Hacohen-Kerner Y.","key":"e_1_2_1_18_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_19_1","DOI":"10.3115\/992133.992154"},{"volume-title":"Proceedings of the AMIA Symposium. 344--348","author":"Hersh W.","key":"e_1_2_1_20_1"},{"volume-title":"Proceedings of the Conference on Natural Language Processing and knowledge Processing. 335--345","author":"Iwanska L.","key":"e_1_2_1_21_1"},{"unstructured":"Koppel M. 2008 The Responsa Project: Some promising future directions. Department of Computer Science Bar-Ilan University Ramat-Gan.  Koppel M. 2008 The Responsa Project: Some promising future directions. Department of Computer Science Bar-Ilan University Ramat-Gan.","key":"e_1_2_1_22_1"},{"volume-title":"Proceedings of the ACL-IJCNLP Conference. 69--72","author":"Kotlerman L.","key":"e_1_2_1_23_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_24_1","DOI":"10.1037\/0033-295X.104.2.211"},{"volume-title":"Proceedings of the 15th International Conference On Machine Learning. 296--304","year":"1998","author":"Lin D.","key":"e_1_2_1_25_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_26_1","DOI":"10.3115\/980691.980696"},{"doi-asserted-by":"publisher","key":"e_1_2_1_27_1","DOI":"10.1002\/asi.4630200106"},{"doi-asserted-by":"publisher","key":"e_1_2_1_28_1","DOI":"10.1016\/S0306-4573(99)00068-0"},{"doi-asserted-by":"crossref","unstructured":"Manning C. D. Raghavan P. and Sch\u00fctze H. 2008. Introduction to Information Retrieval. Cambridge University Press.   Manning C. D. Raghavan P. and Sch\u00fctze H. 2008. Introduction to Information Retrieval. Cambridge University Press.","key":"e_1_2_1_29_1","DOI":"10.1017\/CBO9780511809071"},{"doi-asserted-by":"publisher","key":"e_1_2_1_30_1","DOI":"10.1016\/S0306-4573(97)00009-5"},{"doi-asserted-by":"publisher","key":"e_1_2_1_31_1","DOI":"10.1162\/coli.2007.33.2.161"},{"volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 938--947","author":"Pantel P.","key":"e_1_2_1_32_1"},{"volume-title":"Proceedings of the CoSMO Workshop held in Conjunction with CONTEXT-07","author":"Peirsman Y.","key":"e_1_2_1_33_1"},{"volume-title":"Proceedings of the Actes des 9i'emes Journ'ees internationales d'Analyse statistique des Donn'ees Textuelles (JADT). 907--916","author":"Peirsman Y.","key":"e_1_2_1_34_1"},{"volume-title":"Proceedings of the Advances in Natural Language Processing and Applications.","author":"Perez-Aguera J. R.","key":"e_1_2_1_35_1"},{"volume-title":"InProceedings of the 1st European Cognitive Science Conference.","author":"Prior A.","key":"e_1_2_1_36_1"},{"volume-title":"Proceedings of the International Symposium Information Technology (ITSim). 1404--1409","author":"Rahman N. A.","key":"e_1_2_1_37_1"},{"unstructured":"Rocchio J. J. 1971. Relevance feedback in information retrieval. In Salton G. (Ed.) The SMART Retrieval System\u2014Experiments in Automatic Document Processing Prentice-Hall Englewood Cliffs NJ 313--323.  Rocchio J. J. 1971. Relevance feedback in information retrieval. In Salton G. (Ed.) The SMART Retrieval System\u2014Experiments in Automatic Document Processing Prentice-Hall Englewood Cliffs NJ 313--323.","key":"e_1_2_1_38_1"},{"volume-title":"Proceedings of ACL-07","author":"Rychly P.","key":"e_1_2_1_39_1"},{"key":"e_1_2_1_40_1","first-page":"115","article-title":"Experiments in automatic thesaurus construction for information retrieval","volume":"71","author":"Salton G.","year":"1971","journal-title":"Inform. Process."},{"unstructured":"Salton G. 1971a. The SMART Retrieval System\u2014Experiments in Automatic Document Processing. Prentice-Hall Englewood Cliffs NJ.   Salton G. 1971a. The SMART Retrieval System\u2014Experiments in Automatic Document Processing. Prentice-Hall Englewood Cliffs NJ.","key":"e_1_2_1_41_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_42_1","DOI":"10.1016\/S0306-4573(96)00068-4"},{"volume-title":"Proceedings of the RIAO Conference. 266--274","author":"Sch\u00fctze H.","key":"e_1_2_1_43_1"},{"key":"e_1_2_1_44_1","first-page":"1","article-title":"Retrieving collocations from text","volume":"19","author":"Smadja F.","year":"1993","journal-title":"Xtract. Comput. Ling."},{"doi-asserted-by":"publisher","key":"e_1_2_1_45_1","DOI":"10.1108\/00220410610673873"},{"unstructured":"Van Rijsbergen C. J. 1979. Information Retrieval. London: Butterworths.   Van Rijsbergen C. J. 1979. Information Retrieval. London: Butterworths.","key":"e_1_2_1_46_1"},{"doi-asserted-by":"publisher","key":"e_1_2_1_47_1","DOI":"10.3115\/1220355.1220501"},{"doi-asserted-by":"publisher","key":"e_1_2_1_48_1","DOI":"10.1023\/B:AIRE.0000020865.73561.bc"},{"doi-asserted-by":"publisher","key":"e_1_2_1_49_1","DOI":"10.1016\/j.eswa.2009.02.059"},{"doi-asserted-by":"publisher","key":"e_1_2_1_50_1","DOI":"10.1145\/243199.243202"},{"volume-title":"Proceedings of the 31st Australasian Conference on Computer Science (ACSC). 147--156","author":"Yang D.","key":"e_1_2_1_51_1"}],"container-title":["Journal on Computing and Cultural Heritage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2442080.2442084","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2442080.2442084","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:35:25Z","timestamp":1750235725000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2442080.2442084"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,3]]},"references-count":51,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,3]]}},"alternative-id":["10.1145\/2442080.2442084"],"URL":"https:\/\/doi.org\/10.1145\/2442080.2442084","relation":{},"ISSN":["1556-4673","1556-4711"],"issn-type":[{"type":"print","value":"1556-4673"},{"type":"electronic","value":"1556-4711"}],"subject":[],"published":{"date-parts":[[2013,3]]},"assertion":[{"value":"2012-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-04-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}