{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:20:34Z","timestamp":1750306834371,"version":"3.41.0"},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2014,6,1]],"date-time":"2014-06-01T00:00:00Z","timestamp":1401580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2014,6]]},"abstract":"<jats:p>The Malay language has two types of writing script, known as Rumi and Jawi. Most previous stemmer results have reported on Malay Rumi characters and only a few have tested Jawi characters. In this article, a new Jawi stemmer has been proposed and tested for document retrieval. A total of 36 queries and datasets from the transliterated Jawi Quran were used. The experiment shows that the mean average precision for a \u201cstemmed Jawi\u201d document is 8.43%. At the same time, the mean average precision for a \u201cnonstemmed Jawi\u201d document is 5.14%. The result from a paired sample t-test showed that the use of a \u201cstemmed Jawi\u201d document increased the precision in document retrieval. Further experiments were performed to examine the precision of the relevant documents that were retrieved at various cutoff points for all 36 queries. The results for the \u201cstemmed Jawi\u201d document showed a significantly different start, at a cutoff of 40, compared with the \u201cnonstemmed Jawi\u201d documents. This result shows the usefulness of a Jawi stemmer for retrieving relevant documents in the Jawi script.<\/jats:p>","DOI":"10.1145\/2540988","type":"journal-article","created":{"date-parts":[[2014,6,24]],"date-time":"2014-06-24T12:16:16Z","timestamp":1403612176000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["The Effectiveness of a Jawi Stemmer for Retrieving Relevant Malay Documents in Jawi Characters"],"prefix":"10.1145","volume":"13","author":[{"given":"Suliana","family":"Sulaiman","sequence":"first","affiliation":[{"name":"Sultan Idris Education University, Malaysia"}]},{"given":"Khairuddin","family":"Omar","sequence":"additional","affiliation":[{"name":"Universiti Kebangsaan, Malaysia"}]},{"given":"Nazlia","family":"Omar","sequence":"additional","affiliation":[{"name":"Universiti Kebangsaan, Malaysia"}]},{"given":"Mohd Zamri","family":"Murah","sequence":"additional","affiliation":[{"name":"Universiti Kebangsaan, Malaysia"}]},{"given":"Hamdan Abdul","family":"Rahman","sequence":"additional","affiliation":[{"name":"Universiti Kebangsaan, Malaysia"}]}],"member":"320","published-online":{"date-parts":[[2014,6]]},"reference":[{"key":"e_1_2_1_2_1","first-page":"433","article-title":"Rules frequency order stemmer for Malay language","volume":"9","author":"Abdullah M. T.","year":"2009","unstructured":"Abdullah , M. T. , Ahmad , F. , Sembok T. M. T. 2009 . Rules frequency order stemmer for Malay language . Int. J. Comput. Sci. Netw. Secur. 9 , 2, 433 -- 438 . Abdullah, M. T., Ahmad, F., Sembok T. M. T. 2009. Rules frequency order stemmer for Malay language. Int. J. Comput. Sci. Netw. Secur. 9, 2, 433--438.","journal-title":"Int. J. Comput. Sci. Netw. Secur."},{"key":"e_1_2_1_3_1","unstructured":"Abu Bakar Z. 1999. Evaluation of retrieval effectiveness of n-gram string similarity matching on Malay documents. Tech. rep. Universiti Kebangsaan Malaysia. Bangi.  Abu Bakar Z. 1999. Evaluation of retrieval effectiveness of n-gram string similarity matching on Malay documents. Tech. rep. Universiti Kebangsaan Malaysia. Bangi."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1316457.1316459"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199612)47:12%3C909::AID-ASI4%3E3.3.CO;2-D"},{"volume-title":"Proceedings of the 13th Nordic Computational Linguistics Conference (NODALIDA\u201901)","author":"Carlberger J.","key":"e_1_2_1_7_1","unstructured":"Carlberger , J. , Dalianis , H. , Hassel , M. , and Knutsson , O . 2001. Improving precision in information retrieval for Swedish using stemming . In Proceedings of the 13th Nordic Computational Linguistics Conference (NODALIDA\u201901) 1--5. Carlberger, J., Dalianis, H., Hassel, M., and Knutsson, O. 2001. Improving precision in information retrieval for Swedish using stemming. In Proceedings of the 13th Nordic Computational Linguistics Conference (NODALIDA\u201901) 1--5."},{"key":"e_1_2_1_8_1","unstructured":"Cleverdon C. W. Mills J. and Keen M. 1966. Factors determining the performance of indexing systems. Tech. rep. College of Aeronautics University of Michigan MI.  Cleverdon C. W. Mills J. and Keen M. 1966. Factors determining the performance of indexing systems. Tech. rep. College of Aeronautics University of Michigan MI."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12320-7_2"},{"volume-title":"Information retrieval: Data Structures and Algorithms","author":"Frakes W.","key":"e_1_2_1_10_1","unstructured":"Frakes , W. and Baeza-Yates , R. 1992. Information retrieval: Data Structures and Algorithms . Prentice-Hall . Frakes, W. and Baeza-Yates, R. 1992. Information retrieval: Data Structures and Algorithms. Prentice-Hall."},{"volume-title":"Proceedings of the International Conference on Electrical Engineering and Informatics (ICEEI\u201909)","author":"Ghani R. A.","key":"e_1_2_1_11_1","unstructured":"Ghani , R. A. , Zakaria , M. S. , and Omar , K . 2009. Jawi-Malay transliteration . In Proceedings of the International Conference on Electrical Engineering and Informatics (ICEEI\u201909) . 154--157. Ghani, R. A., Zakaria, M. S., and Omar, K. 2009. Jawi-Malay transliteration. In Proceedings of the International Conference on Electrical Engineering and Informatics (ICEEI\u201909). 154--157."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199101)42:1<7::AID-ASI2>3.0.CO;2-P"},{"key":"e_1_2_1_13_1","unstructured":"Hassan A. 1974. The Morphology of Malay. Dewan Bahasa Dan Pustaka Kementerian Pelajaran Malaysia.  Hassan A. 1974. The Morphology of Malay . Dewan Bahasa Dan Pustaka Kementerian Pelajaran Malaysia."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199601)47:1%3C70::AID-ASI7%3E3.3.CO;2-Q"},{"volume-title":"Proceedings of 1st International Conference on Digital Communications and Computer Applications (DCCA\u201907)","author":"Islam M. Z.","key":"e_1_2_1_15_1","unstructured":"Islam , M. Z. , Uddin , M. N. , and Khan , M . 2007. A light weight stemmer for Bengali and its use in spelling checker . In Proceedings of 1st International Conference on Digital Communications and Computer Applications (DCCA\u201907) . Islam, M. Z., Uddin, M. N., and Khan, M. 2007. A light weight stemmer for Bengali and its use in spelling checker. In Proceedings of 1st International Conference on Digital Communications and Computer Applications (DCCA\u201907)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/243199.243209"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the 8th International Symposium on Malay\/Indonesia Linguistic (ISMIL\u201904)","author":"Malacon B. R","year":"2004","unstructured":"Malacon , B. R , 2004 . Computational analysis of affixed words in Malay language . In Proceedings of the 8th International Symposium on Malay\/Indonesia Linguistic (ISMIL\u201904) . Malacon, B. R, 2004. Computational analysis of affixed words in Malay language. In Proceedings of the 8th International Symposium on Malay\/Indonesia Linguistic (ISMIL\u201904)."},{"key":"e_1_2_1_18_1","unstructured":"Manning C. D. Raghavan P. and Schutze H. 2008. Introduction to Iinformation Retrieval. Cambridge University Press. UK.   Manning C. D. Raghavan P. and Schutze H. 2008. Introduction to Iinformation Retrieval . Cambridge University Press. UK."},{"key":"e_1_2_1_19_1","volume-title":"Proceeding of the 32nd International Congress for Asian and North African Studies.","author":"Ming D. C.","year":"1986","unstructured":"Ming , D. C. 1986 . Access to Malay manuscripts . In Proceeding of the 32nd International Congress for Asian and North African Studies. Ming, D. C. 1986. Access to Malay manuscripts. In Proceeding of the 32nd International Congress for Asian and North African Studies."},{"key":"e_1_2_1_20_1","first-page":"11","article-title":"Sejarah Tulisan Jawi","volume":"35","author":"Moain A. J.","year":"1992","unstructured":"Moain , A. J. 1992 . Sejarah Tulisan Jawi . Jurnal Bahasa 35 , 11 . 101--1012. Moain, A. J. 1992. Sejarah Tulisan Jawi. Jurnal Bahasa 35, 11. 101--1012.","journal-title":"Jurnal Bahasa"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGIV.2008.36"},{"key":"e_1_2_1_22_1","unstructured":"Othman A. 1993. Pengakar perkataan Melayu dan sistem capaian dokumen. Tech. rep. Universiti Kebangsaan.  Othman A. 1993. Pengakar perkataan Melayu dan sistem capaian dokumen. Tech. rep. Universiti Kebangsaan."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/188490.188499"},{"key":"e_1_2_1_24_1","first-page":"3","article-title":"An algorithm for suffix stripping","volume":"14","author":"Porter M. F.","year":"1980","unstructured":"Porter , M. F. 1980 . An algorithm for suffix stripping . Program. Electron. Libr. Inf. Syst. 14 , 3 . 130--137. Porter, M. F. 1980. An algorithm for suffix stripping. Program. Electron. Libr. Inf. Syst. 14, 3. 130--137.","journal-title":"Program. Electron. Libr. Inf. Syst."},{"key":"e_1_2_1_25_1","unstructured":"Rahman H. A. 1999. Panduan Menulis dan Mengeja Jawi. Dewan Bahasa dan Pustaka. Kuala Lumpur.  Rahman H. A. 1999. Panduan Menulis dan Mengeja Jawi . Dewan Bahasa dan Pustaka. Kuala Lumpur."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the International Conference on Electrical Engineering and Informatics. 154--157","author":"Roslan G.","year":"2009","unstructured":"Roslan , G. 2009 . Jawi-Malay transliteration . In Proceedings of the International Conference on Electrical Engineering and Informatics. 154--157 . Roslan, G. 2009. Jawi-Malay transliteration. In Proceedings of the International Conference on Electrical Engineering and Informatics. 154--157."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(1999)50:10%3C944::AID-ASI9%3E3.3.CO;2-H"},{"key":"e_1_2_1_28_1","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/978-3-540-24594-0_17","article-title":"Istilah sains: A Malay-English terminology retrieval system experiment using stemming and n-grams approach in malay words. In Proceeding of the 6th International Conference on Asian Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access (ICADL\u201903)","volume":"2911","author":"Sembok T. M. T","year":"2003","unstructured":"Sembok , T. M. T , Palasundram , K. , Ali , N. M , Yahya , A. , and Wook , T. S. M. T. 2003 . Istilah sains: A Malay-English terminology retrieval system experiment using stemming and n-grams approach in malay words. In Proceeding of the 6th International Conference on Asian Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access (ICADL\u201903) . Lecture Notes in Computer Science , Vol. 2911 , Spring er, 173 -- 177 . Sembok, T. M. T, Palasundram, K., Ali, N. M, Yahya, A., and Wook, T. S. M. T. 2003. Istilah sains: A Malay-English terminology retrieval system experiment using stemming and n-grams approach in malay words. In Proceeding of the 6th International Conference on Asian Digital Libraries: Technology and Management of Indigenous Knowledge for Global Access (ICADL\u201903). Lecture Notes in Computer Science, Vol. 2911, Springer, 173--177.","journal-title":"Lecture Notes in Computer Science"},{"key":"e_1_2_1_29_1","first-page":"95","article-title":"Word stemming algorithm and retrieval effectiveness in Malay and Arabic documents retrieval systems","volume":"2911","author":"Sembok T. M. T.","year":"2005","unstructured":"Sembok , T. M. T. 2005 . Word stemming algorithm and retrieval effectiveness in Malay and Arabic documents retrieval systems . World Acad. Sci. Engin. Techno. 2911 , 95 -- 97 , 173--177. Sembok, T. M. T. 2005. Word stemming algorithm and retrieval effectiveness in Malay and Arabic documents retrieval systems. World Acad. Sci. Engin. Techno. 2911, 95--97, 173--177.","journal-title":"World Acad. Sci. Engin. Techno."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321528"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1571941.1572050"},{"volume-title":"Proceeding of the International Conference on Intelligence Analysis.","author":"Strohman T.","key":"e_1_2_1_32_1","unstructured":"Strohman , T. , Metzler , D. , Turtle , H. , and Croft , W. B . 2005. Indri. A language-model based search engine for complex queries . In Proceeding of the International Conference on Intelligence Analysis. Strohman, T., Metzler, D., Turtle, H., and Croft, W. B. 2005. Indri. A language-model based search engine for complex queries. In Proceeding of the International Conference on Intelligence Analysis."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25832-9_68"},{"key":"e_1_2_1_34_1","unstructured":"Teruja 2010. Transliteration engine for Rumi to Jawi http:\/\/www.jawi.ukm.my\/.  Teruja 2010. Transliteration engine for Rumi to Jawi http:\/\/www.jawi.ukm.my\/."},{"key":"e_1_2_1_35_1","unstructured":"Yatim O. M. 1990. Epigrafi islam terawal di Nusantara. Dewan Bahasa dan Pustaka.  Yatim O. M. 1990. Epigrafi islam terawal di Nusantara . Dewan Bahasa dan Pustaka."},{"key":"e_1_2_1_36_1","unstructured":"Yonhendri. 2009. Transliterasi rumi ke jawi berasaskan petua. Master Thesis Universiti Kebangsaan Malaysia.  Yonhendri. 2009. Transliterasi rumi ke jawi berasaskan petua. Master Thesis Universiti Kebangsaan Malaysia."}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2540988","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2540988","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:09:54Z","timestamp":1750234194000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2540988"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,6]]},"references-count":34,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2014,6]]}},"alternative-id":["10.1145\/2540988"],"URL":"https:\/\/doi.org\/10.1145\/2540988","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"type":"print","value":"1530-0226"},{"type":"electronic","value":"1558-3430"}],"subject":[],"published":{"date-parts":[[2014,6]]},"assertion":[{"value":"2013-03-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-06-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}