{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:23:51Z","timestamp":1750220631679,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,9,23]],"date-time":"2020-09-23T00:00:00Z","timestamp":1600819200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,9,23]]},"DOI":"10.1145\/3419604.3419779","type":"proceedings-article","created":{"date-parts":[[2020,11,8]],"date-time":"2020-11-08T15:27:41Z","timestamp":1604849261000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["K-means, HAC and FCM Which Clustering Approach for Arabic Text?"],"prefix":"10.1145","author":[{"given":"Lahbib","family":"Ajallouda","sequence":"first","affiliation":[{"name":"ENSIAS, Mohammed V University in Rabat, Rabat, Morocco"}]},{"given":"Fatima Zahra","family":"Fagroud","sequence":"additional","affiliation":[{"name":"LTIM - FSBM, Hassan II university, Casablanca, Morocco"}]},{"given":"Ahmed","family":"Zellou","sequence":"additional","affiliation":[{"name":"ENSIAS, Mohammed V University in Rabat, Rabat, Morocco"}]},{"given":"El Habib","family":"Benlahmar","sequence":"additional","affiliation":[{"name":"LTIM - FSBM, Hassan II university, Casablanca, Morocco"}]}],"member":"320","published-online":{"date-parts":[[2020,11,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/2588240"},{"key":"e_1_3_2_1_2_1","first-page":"4","article-title":"Speaker-independent recognition of isolated words using clustering techniques","volume":"27","author":"Abril Patricia S.","year":"1979","unstructured":"Patricia S. Abril and Robert Plant . 1979 . Speaker-independent recognition of isolated words using clustering techniques . IEEE Transactions on Acoustics, Speech, and Signal Processing 27 , 4 (Aug. 1979), 336--349. Patricia S. Abril and Robert Plant. 1979. Speaker-independent recognition of isolated words using clustering techniques. IEEE Transactions on Acoustics, Speech, and Signal Processing 27, 4 (Aug. 1979), 336--349.","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/53.3-4.311"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.3102\/10769986001001039"},{"volume-title":"Data mining for scientifc and engineering applications","author":"Dhillon Inderjit S","key":"e_1_3_2_1_5_1","unstructured":"Inderjit S Dhillon , James Fan , and Yuqiang Guan . 2001. Efcient clustering of very large document collections . In Data mining for scientifc and engineering applications . Springer , 357--381. Inderjit S Dhillon, James Fan, and Yuqiang Guan. 2001. Efcient clustering of very large document collections. In Data mining for scientifc and engineering applications. Springer, 357--381."},{"key":"e_1_3_2_1_6_1","volume-title":"TextMining Workshop at KDD2000 (May","author":"George Karypis Michael Steinbach","year":"2000","unstructured":"Michael Steinbach George Karypis , Vipin Kumar , and Michael Steinbach . 2000 . A comparison of document clustering techniques . In TextMining Workshop at KDD2000 (May 2000). Michael Steinbach George Karypis, Vipin Kumar, and Michael Steinbach. 2000. A comparison of document clustering techniques. In TextMining Workshop at KDD2000 (May 2000)."},{"key":"e_1_3_2_1_7_1","first-page":"451","article-title":"Interpreting and assessing the results of cluster analyses","volume":"47","author":"Gnanadesikan Ram","year":"1977","unstructured":"Ram Gnanadesikan , Jon R Kettenring , and James M Landwehr . 1977 . Interpreting and assessing the results of cluster analyses . Bulletin of the International Statistical Institute 47 , 2 (1977), 451 -- 463 . Ram Gnanadesikan, Jon R Kettenring, and James M Landwehr. 1977. Interpreting and assessing the results of cluster analyses. Bulletin of the International Statistical Institute 47, 2 (1977), 451--463.","journal-title":"Bulletin of the International Statistical Institute"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CATA.2018.8398666"},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the sixth new zealand computer science research student conference (NZCSRSC2008)","author":"Huang Anna","year":"2008","unstructured":"Anna Huang . 2008 . Similarity measures for text document clustering . In Proceedings of the sixth new zealand computer science research student conference (NZCSRSC2008) , Christchurch, New Zealand (Christchurch, New Zealand). 49--56. Anna Huang. 2008. Similarity measures for text document clustering. In Proceedings of the sixth new zealand computer science research student conference (NZCSRSC2008), Christchurch, New Zealand (Christchurch, New Zealand). 49--56."},{"volume-title":"2009 First Asian Himalayas International Conference on Internet (Kathmandu, Nepal). IEEE, 1--6.","author":"Raihana","key":"e_1_3_2_1_10_1","unstructured":"Raihana Ferdous et al. 2009. An efcient k-means algorithm integrated with Jaccard distance measure for document clustering . In 2009 First Asian Himalayas International Conference on Internet (Kathmandu, Nepal). IEEE, 1--6. Raihana Ferdous et al. 2009. An efcient k-means algorithm integrated with Jaccard distance measure for document clustering. In 2009 First Asian Himalayas International Conference on Internet (Kathmandu, Nepal). IEEE, 1--6."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/553876"},{"key":"e_1_3_2_1_12_1","volume-title":"Fuzzy C-means (FCM) clustering algorithm: a decade review from 2000 to","author":"Nayak Janmenjoy","year":"2014","unstructured":"Janmenjoy Nayak , Bighnaraj Naik , and H Sr Behera . 2015. Fuzzy C-means (FCM) clustering algorithm: a decade review from 2000 to 2014 . In Computational intelligence in data mining-volume 2. Springer , 133--149. Janmenjoy Nayak, Bighnaraj Naik, and HSr Behera. 2015. Fuzzy C-means (FCM) clustering algorithm: a decade review from 2000 to 2014. In Computational intelligence in data mining-volume 2. Springer, 133--149."},{"key":"e_1_3_2_1_13_1","first-page":"133","article-title":"Comparisons Between Data Clustering Algorithms","volume":"5","author":"Abbas Osama Abu","year":"2008","unstructured":"Osama Abu Abbas . 2008 . Comparisons Between Data Clustering Algorithms . International Arab Journal of Information Technology (IAJIT) 5 , 3 (2008), 133 -- 149 . Osama Abu Abbas. 2008. Comparisons Between Data Clustering Algorithms. International Arab Journal of Information Technology (IAJIT) 5, 3 (2008), 133--149.","journal-title":"International Arab Journal of Information Technology (IAJIT)"},{"key":"e_1_3_2_1_15_1","first-page":"241","article-title":"Evaluation and comparison of clustering algorithms in analyzing ES cell gene expression data","volume":"12","author":"Chen Gengxin","year":"2002","unstructured":"Gengxin Chen , Saied A Jaradat , Nila Banerjee , Tetsuya S Tanaka , Minoru SH Ko , and Michael Q Zhang . 2002 . Evaluation and comparison of clustering algorithms in analyzing ES cell gene expression data . Statistica Sinica 12 , 1 (2002), 241 -- 262 . Gengxin Chen, Saied A Jaradat, Nila Banerjee, Tetsuya S Tanaka, Minoru SH Ko, and Michael Q Zhang. 2002. Evaluation and comparison of clustering algorithms in analyzing ES cell gene expression data. Statistica Sinica 12, 1 (2002), 241--262.","journal-title":"Statistica Sinica"},{"key":"e_1_3_2_1_16_1","first-page":"108","article-title":"Review and comparison between clustering algorithms with duplicate entities detection purpose","volume":"3","author":"Bakhshi Maryam","year":"2012","unstructured":"Maryam Bakhshi , Mohammad-Reza Feizi-Derakhshi , and E Zafarani . 2012 . Review and comparison between clustering algorithms with duplicate entities detection purpose . International Journal of Computer Science & Emerging Technologies 3 , 3 (2012), 108 -- 114 . Maryam Bakhshi, Mohammad-Reza Feizi-Derakhshi, and E Zafarani. 2012. Review and comparison between clustering algorithms with duplicate entities detection purpose. International Journal of Computer Science & Emerging Technologies 3, 3 (2012), 108--114.","journal-title":"International Journal of Computer Science & Emerging Technologies"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/SOCPAR.2014.7008028"},{"key":"e_1_3_2_1_18_1","volume-title":"A survey of clustering algorithms for an industrial context. Procedia computer science 148","author":"Benabdellah Abla Chouni","year":"2019","unstructured":"Abla Chouni Benabdellah , Asmaa Benghabrit , and Imane Bouhaddou . 2019. A survey of clustering algorithms for an industrial context. Procedia computer science 148 ( 2019 ), 291--302. Abla Chouni Benabdellah, Asmaa Benghabrit, and Imane Bouhaddou. 2019. A survey of clustering algorithms for an industrial context. Procedia computer science 148 (2019), 291--302."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-98539-8_11"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2017.10.117"},{"key":"e_1_3_2_1_21_1","first-page":"2279","article-title":"A survey on text mining process and techniques","volume":"3","author":"Kumar Sathees","year":"2014","unstructured":"Sathees Kumar and R Karthika . 2014 . A survey on text mining process and techniques . International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) 3 , 7 (2014), 2279 -- 2284 . Sathees Kumar and R Karthika. 2014. A survey on text mining process and techniques. International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) 3, 7 (2014), 2279--2284.","journal-title":"International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)"},{"key":"e_1_3_2_1_22_1","volume-title":"Efcient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov , Kai Chen , Greg Corrado , and Jeffrey Dean . 2013. Efcient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 ( 2013 ). Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efcient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2016.071153"},{"key":"e_1_3_2_1_24_1","first-page":"137","article-title":"Methods of arabic language baseline detection-The state of art","volume":"8","author":"Atallah Shatnawi","year":"2008","unstructured":"AL- Shatnawi Atallah and Khairuddin Omar . 2008 . Methods of arabic language baseline detection-The state of art . IJCSNS 8 , 10 (2008), 137 . AL-Shatnawi Atallah and Khairuddin Omar. 2008. Methods of arabic language baseline detection-The state of art. IJCSNS 8, 10 (2008), 137.","journal-title":"IJCSNS"},{"key":"e_1_3_2_1_25_1","volume-title":"Proceedings of TALN 2013 (Volume 1: Long Papers). 435--449","author":"Keskes Iskandar","year":"2013","unstructured":"Iskandar Keskes , Farah Benamara , and Lamia Hadrich Belguith . 2013 . Segmenting Arabic Texts into Elementary Discourse Units (Segmentation de textes arabes en unit\u00e9s discursives minimales)[in French] . In Proceedings of TALN 2013 (Volume 1: Long Papers). 435--449 . Iskandar Keskes, Farah Benamara, and Lamia Hadrich Belguith. 2013. Segmenting Arabic Texts into Elementary Discourse Units (Segmentation de textes arabes en unit\u00e9s discursives minimales)[in French]. In Proceedings of TALN 2013 (Volume 1: Long Papers). 435--449."},{"key":"e_1_3_2_1_26_1","volume-title":"Stemming arabic text","author":"Khoja Shereen","year":"1999","unstructured":"Shereen Khoja and Roger Garside . 1999. Stemming arabic text . Lancaster, UK , Computing Department, Lancaster University ( 1999 ). Shereen Khoja and Roger Garside. 1999. Stemming arabic text. Lancaster, UK, Computing Department, Lancaster University (1999)."},{"volume-title":"Stemming versus light stemming as feature selection techniques for Arabic text categorization. In 2007 Innovations in Information Technologies (IIT)","author":"Duwairi Rehab","key":"e_1_3_2_1_27_1","unstructured":"Rehab Duwairi , Mohammad Al-Refai , and Natheer Khasawneh . 2007. Stemming versus light stemming as feature selection techniques for Arabic text categorization. In 2007 Innovations in Information Technologies (IIT) . IEEE , 446--450. Rehab Duwairi, Mohammad Al-Refai, and Natheer Khasawneh. 2007. Stemming versus light stemming as feature selection techniques for Arabic text categorization. In 2007 Innovations in Information Technologies (IIT). IEEE, 446--450."},{"key":"e_1_3_2_1_28_1","volume-title":"The use of an association measure based on character structure to identify semantically related pairs of words and document titles. Information storage and retrieval 10, 7-8","author":"Adamson George W","year":"1974","unstructured":"George W Adamson and Jillian Boreham . 1974. The use of an association measure based on character structure to identify semantically related pairs of words and document titles. Information storage and retrieval 10, 7-8 ( 1974 ), 253--260. George W Adamson and Jillian Boreham. 1974. The use of an association measure based on character structure to identify semantically related pairs of words and document titles. Information storage and retrieval 10, 7-8 (1974), 253--260."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.4018\/IJIRR.2011070104"},{"key":"e_1_3_2_1_30_1","volume-title":"A statistical interpretation of term specifcity and its application in retrieval. Journal of documentation","author":"Jones Karen Sparck","year":"1972","unstructured":"Karen Sparck Jones . 1972. A statistical interpretation of term specifcity and its application in retrieval. Journal of documentation ( 1972 ). Karen Sparck Jones. 1972. A statistical interpretation of term specifcity and its application in retrieval. Journal of documentation (1972)."},{"key":"e_1_3_2_1_31_1","first-page":"1","article-title":"NbClust: an R package for determining the relevant number of clusters in a data Set","volume":"61","author":"Malika Charrad","year":"2014","unstructured":"Charrad Malika , N Ghazzali , V Boiteau , and A Niknafs . 2014 . NbClust: an R package for determining the relevant number of clusters in a data Set . J. Stat. Softw 61 (2014), 1 -- 36 . Charrad Malika, N Ghazzali, V Boiteau, and A Niknafs. 2014. NbClust: an R package for determining the relevant number of clusters in a data Set. J. Stat. Softw 61 (2014), 1--36.","journal-title":"J. Stat. Softw"},{"key":"e_1_3_2_1_32_1","volume-title":"Determining the optimal number of clusters: 3 must know methods. Available onli ne: https:\/\/www.datanovia.com\/en\/lessons\/determiningthe-optimal-number-of-clusters-3-must-know-methods\/.(accessed on","author":"Kassambara Alboukadel","year":"2018","unstructured":"Alboukadel Kassambara . 2017. Determining the optimal number of clusters: 3 must know methods. Available onli ne: https:\/\/www.datanovia.com\/en\/lessons\/determiningthe-optimal-number-of-clusters-3-must-know-methods\/.(accessed on 31 April 2018 ) (2017). Alboukadel Kassambara. 2017. Determining the optimal number of clusters: 3 must know methods. Available onli ne: https:\/\/www.datanovia.com\/en\/lessons\/determiningthe-optimal-number-of-clusters-3-must-know-methods\/.(accessed on 31 April 2018) (2017)."}],"event":{"name":"SITA'20: Theories and Applications","acronym":"SITA'20","location":"Rabat Morocco"},"container-title":["Proceedings of the 13th International Conference on Intelligent Systems: Theories and Applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3419604.3419779","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3419604.3419779","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:01:42Z","timestamp":1750197702000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3419604.3419779"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,23]]},"references-count":31,"alternative-id":["10.1145\/3419604.3419779","10.1145\/3419604"],"URL":"https:\/\/doi.org\/10.1145\/3419604.3419779","relation":{},"subject":[],"published":{"date-parts":[[2020,9,23]]},"assertion":[{"value":"2020-11-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}