{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T06:26:38Z","timestamp":1770272798314,"version":"3.49.0"},"reference-count":22,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2019,3,23]],"date-time":"2019-03-23T00:00:00Z","timestamp":1553299200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,5,14]]},"abstract":"<jats:p>One of the most challenging research problems in natural language processing (NLP) is that of word sense induction (WSI). It involves discovering senses of a word given its contexts of usage without the use of a sense inventory which differentiates it from traditional word sense disambiguation (WSD). This paper reports a work on sense induction in Bengali, a less-resourced language, based on distributional semantics and translation based context vectors learned from parallel corpora to improve the task performance. The performance of the proposed method of sense induction was compared with the k-means algorithm, which was considered as the baseline in our work. A dataset for sense induction was created for 15 Bengali words, encompassing a total of 111 contexts. The proposed model, in both mono and cross-lingual settings, outperformed k-means in precision (P), recall (R) and F-scores. K-means based sense induction produced average P, R and F-scores of 0.71, 0.73 and 0.66 respectively. The average P, R and F-scores produced by the mono-and cross-lingual settings of the proposed algorithm are 0.77, 0.73, 0.68 and 0.81, 0.77 and 0.72 respectively.<\/jats:p>","DOI":"10.3233\/jifs-179030","type":"journal-article","created":{"date-parts":[[2019,3,26]],"date-time":"2019-03-26T18:55:53Z","timestamp":1553626553000},"page":"4821-4832","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":2,"title":["Word sense induction in bengali using parallel corpora and distributional semantics"],"prefix":"10.1177","volume":"36","author":[{"given":"Saptarshi","family":"Sengupta","sequence":"first","affiliation":[{"name":"University of Minnesota Duluth, Duluth, MN, USA"}]},{"given":"Rajat","family":"Pandit","sequence":"additional","affiliation":[{"name":"West Bengal State University, Kokata, West Bengal, India"}]},{"given":"Parag","family":"Mitra","sequence":"additional","affiliation":[{"name":"Jadavpur University, Kolkata, West Bengal, India"}]},{"given":"Sudip Kumar","family":"Naskar","sequence":"additional","affiliation":[{"name":"Jadavpur University, Kolkata, West Bengal, India"}]},{"given":"Mohini Mohan","family":"Sardar","sequence":"additional","affiliation":[{"name":"West Bengal State University, Kokata, West Bengal, India"}]}],"member":"179","published-online":{"date-parts":[[2019,3,23]]},"reference":[{"key":"e_1_3_3_2_2","first-page":"41","article-title":"Word sense discrimination by clustering contexts in vector and similarity spaces","author":"Purandare A.","year":"2004","unstructured":"A.Purandare and T.Pedersen, Word sense discrimination by clustering contexts in vector and similarity spaces. In Proceedings of CoNLL, Boston, MA, USA, 2004, pp. 41\u201348.","journal-title":"In Proceedings of CoNLL"},{"key":"e_1_3_3_3_2","first-page":"368","article-title":"Max Max: A Graph-Based Soft Clustering Algorithm Applied to Word Sense Induction","author":"Hope D.","year":"2013","unstructured":"D.Hope and B.Keller, Max Max: A Graph-Based Soft Clustering Algorithm Applied to Word Sense Induction, Springer Berlin Heidelberg, Berlin, Heidelberg, 2013, pp. 368\u2013381.","journal-title":"Springer Berlin Heidelberg"},{"key":"e_1_3_3_4_2","first-page":"768","article-title":"Automatic retrieval and clustering of similar words","volume":"2","author":"Lin D.","year":"1998","unstructured":"D.Lin, Automatic retrieval and clustering of similar words, In Proceedings of the 17th International Conference on Computational Linguistics - Volume 2 (COLING \u201998), Vol. 2. Association for Computational Linguistics, Stroudsburg, PA, USA, 1998, pp. 768\u2013774.","journal-title":"In Proceedings of the 17th International Conference on Computational Linguistics - Volume 2 (COLING \u201998),"},{"key":"e_1_3_3_5_2","first-page":"1093","article-title":"A Graph Model for Unsuper-vised Lexical Acquisition","author":"Widdows D.","year":"2002","unstructured":"D.Widdows and B.Dorow, A Graph Model for Unsuper-vised Lexical Acquisition, 19th International Conference on Computational Linguistics, Taipei, 2002, pp. 1093\u20131099.","journal-title":"19th International Conference on Computational Linguistics"},{"key":"e_1_3_3_6_2","first-page":"585","article-title":"Two graph-based algorithms for state-of-the-art WSD","author":"Agirre E.","year":"2006","unstructured":"E.Agirre, D.Martinez, O.L.de Lacalle and A.Soroa, Two graph-based algorithms for state-of-the-art WSD. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP \u201906) Association for Computational Linguistics, Stroudsburg, PA, USA, 2006, pp. 585\u2013593.","journal-title":"In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP \u201906) Association for Computational Linguistics"},{"issue":"1","key":"e_1_3_3_7_2","first-page":"97","article-title":"Automatic word sense discrimination","volume":"24","author":"Sch\u00fctze H.","year":"1998","unstructured":"H.Sch\u00fctze, Automatic word sense discrimination, Comput Linguist24(1) (1998), 97\u2013123. ISSN 0891-2017.","journal-title":"Comput Linguist"},{"key":"e_1_3_3_8_2","first-page":"298","article-title":"Word sense induction using graphs of collocations","author":"Klapaftis I.P.","year":"2008","unstructured":"I.P.Klapaftis and S.Manandhar, Word sense induction using graphs of collocations, In Proceedings of the 2008 Conference on ECAI2008: 18th European Conference on Artificial Intelligence, Amsterdam, The Netherlands, 2008, pp. 298\u2013302.","journal-title":"In Proceedings of the 2008 Conference on ECAI2008: 18th European Conference on Artificial Intelligence"},{"key":"e_1_3_3_9_2","first-page":"1","article-title":"A synopsis of linguistic theory 1930-1955","author":"Firth J.R.","year":"1957","unstructured":"J.R.Firth, A synopsis of linguistic theory 1930-1955. In Studies in Linguistic Analysis, 1957, pp. 1\u201332. Oxford: Philological Society. Reprinted in Palmer,F.R.(ed.), Selected Papers of J.R. Firth 1952-1959, London: Longman, 1968.","journal-title":"In Studies in Linguistic Analysis"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2004.05.002"},{"key":"e_1_3_3_11_2","first-page":"343","article-title":"Bergamaschi, Word sense induction with multilingual features representation","volume":"02","author":"Albano L.","year":"2014","unstructured":"L.Albano, D.BeneventanoandS. Bergamaschi, Word sense induction with multilingual features representation, In Proceedings of the 2014 IEEE\/WIC\/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02, WI-IAT \u201914, Washington, DC, USA, IEEE Computer Society, 2014, pp. 343\u2013349.","journal-title":"In Proceedings of the 2014 IEEE\/WIC\/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_3_3_13_2","article-title":"Translation-oriented word sense induction based on parallel corpora","author":"Apidianaki M.","year":"2008","unstructured":"M.Apidianaki, Translation-oriented word sense induction based on parallel corpora, In Proceedings of Language Resources and Evaluation Conference (LREC), 2008.","journal-title":"In Proceedings of Language Resources and Evaluation Conference (LREC)"},{"key":"e_1_3_3_14_2","doi-asserted-by":"crossref","unstructured":"C.Manning P.Raghavan and H.Sch\u00fctze Introduction to Information Retrieval Cambridge: Cambridge University Press 2008.","DOI":"10.1017\/CBO9780511809071"},{"key":"e_1_3_3_15_2","article-title":"Unsupervised Word Sense Disambiguation with Sense Embeddings","author":"Pelevina M.","year":"2016","unstructured":"M.Pelevina, Unsupervised Word Sense Disambiguation with Sense Embeddings, Master-Thesis, Technische Uni-versitat Darmstadt, 2016.","journal-title":"Master-Thesis, Technische Uni-versitat Darmstadt"},{"key":"e_1_3_3_16_2","article-title":"The Word-Space Model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces","author":"Sahlgren M.","year":"2006","unstructured":"M.Sahlgren, The Word-Space Model: Using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces. PhD thesis, Department of Linguistics, Stockholm University, 2006.","journal-title":"PhD thesis, Department of Linguistics"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/775047.775138"},{"key":"e_1_3_3_18_2","article-title":"Word sense induction: Triplet-based clustering and automatic evaluation","author":"Bordag S.","year":"2006","unstructured":"S.Bordag, Word sense induction: Triplet-based clustering and automatic evaluation, In 11th Conference of the European Chapter of the Association for Computational Linguistics, 2006.","journal-title":"In 11th Conference of the European Chapter of the Association for Computational Linguistics"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.104.2.211"},{"key":"e_1_3_3_20_2","first-page":"1476","article-title":"Latent semantic word sense induction and disambiguation","volume":"1","author":"Van de Cruys T.","year":"2011","unstructured":"T.Van de Cruys and M.Apidianaki, Latent semantic word sense induction and disambiguation, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, Association for Computational Linguistics, 2011, pp. 1476\u20131485.","journal-title":"In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies"},{"key":"e_1_3_3_21_2","first-page":"1137","article-title":"A neural probabilistic language model","volume":"3","author":"Bengio Y.","year":"2003","unstructured":"Y.Bengio, R.Ducharme, P.Vincent and C.Janvin, A neural probabilistic language model, J Mach Learn Res3 (2003), 1137\u20131155.","journal-title":"J Mach Learn Res"},{"key":"e_1_3_3_22_2","first-page":"410","article-title":"Chinese word sense induction with basic clustering algorithms","author":"Jia Y.","year":"2010","unstructured":"Y.Jia, S.Yu and Z.Chen, Chinese word sense induction with basic clustering algorithms, In CIP-SSIGHAN Joint Conference on Chinese Language Processing, 2010, pp. 410\u2013414.","journal-title":"In CIP-SSIGHAN Joint Conference on Chinese Language Processing"},{"key":"e_1_3_3_23_2","doi-asserted-by":"crossref","unstructured":"N.S.Dash P.Bhattacharyya and J.D.Pawar The Wordnet in Indian Languages (1st ed.). Springer Publishing Company Incorporated 2016.","DOI":"10.1007\/978-981-10-1909-8"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179030","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179030","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179030","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T17:21:14Z","timestamp":1770225674000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179030"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3,23]]},"references-count":22,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,5,14]]}},"alternative-id":["10.3233\/JIFS-179030"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179030","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,3,23]]}}}