{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,4]],"date-time":"2026-03-04T17:27:53Z","timestamp":1772645273951,"version":"3.50.1"},"reference-count":48,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2019,11,26]],"date-time":"2019-11-26T00:00:00Z","timestamp":1574726400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Development of Text-to-Speech synthesis for Indian Languages Phase II","award":["11(7)\/2011HCC(TDIL)"],"award-info":[{"award-number":["11(7)\/2011HCC(TDIL)"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2020,3,31]]},"abstract":"<jats:p>\n            The primary objective of this work is to classify Hindi and Telugu stories into three genres:\n            <jats:italic>fable, folk-tale,<\/jats:italic>\n            and\n            <jats:italic>legend<\/jats:italic>\n            . In this work, we are proposing a framework for story classification (SC) using keyword and part-of-speech (POS) features. For improving the performance of SC system, feature reduction techniques and combinations of various POS tags are explored. Further, we investigated the performance of SC by dividing the story into parts depending on its semantic structure. In this work, stories are (i) manually divided into parts based on their semantics as\n            <jats:italic>introduction, main,<\/jats:italic>\n            and\n            <jats:italic>climax<\/jats:italic>\n            ; and (ii) automatically divided into equal parts based on number of sentences in a story as\n            <jats:italic>initial, middle,<\/jats:italic>\n            and\n            <jats:italic>end<\/jats:italic>\n            . We have also examined\n            <jats:italic>sentence increment model,<\/jats:italic>\n            which aims at determining an optimum number of sentences required to identify story genre by incremental selection of sentences in a story. Experiments are conducted on Hindi and Telugu story corpora consisting of 300 and 150 short stories, respectively. The performance of SC system is evaluated using different combinations of keyword and POS-based features, with three well-established machine learning classifiers: (i) Naive Bayes (NB), (ii) k-Nearest Neighbour (KNN), and (iii) Support Vector Machine (SVM). Performance of the classifier is evaluated using 10-fold cross-validation and effectiveness of classifier is measured using precision, recall, and F-measure. From the classification results, it is observed that adding linguistic information boosts the performance of story classification. In view of the structure of the story, main, and initial parts of the story have shown comparatively better performance. The results from the sentence incremental model have indicated that the first nine and seven sentences in Hindi and Telugu stories, respectively, are sufficient for better classification of stories. In most of the studies, SVM models outperformed the other models in classification accuracy.\n          <\/jats:p>","DOI":"10.1145\/3342356","type":"journal-article","created":{"date-parts":[[2019,11,26]],"date-time":"2019-11-26T13:08:55Z","timestamp":1574773735000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Children\u2019s Story Classification in Indian Languages Using Linguistic and Keyword-based Features"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9077-184X","authenticated-orcid":false,"given":"D. M.","family":"Harikrishna","sequence":"first","affiliation":[{"name":"Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India"}]},{"given":"K. Sreenivasa","family":"Rao","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India"}]}],"member":"320","published-online":{"date-parts":[[2019,11,26]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1039621.1039623"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the IJCAI and the Workshop on Shallow Parsing for South Asian Languages (SPSAL\u201907)","author":"Bharati Akshar"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASONAM.2012.97"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199"},{"key":"e_1_2_1_5_1","volume-title":"Recent Advances in Intelligent Informatics","author":"Deepamala N."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976698300017197"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/288627.288651"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0031619"},{"key":"e_1_2_1_10_1","volume-title":"An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3, (Mar","author":"Forman George","year":"2003"},{"key":"e_1_2_1_11_1","volume-title":"Introduction to Statistical Pattern Recognition","author":"Fukunaga Keinosuke"},{"key":"e_1_2_1_12_1","volume-title":"Computational Linguistics and Intelligent Text Processing","author":"Guerini Marco"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the International Conference on Advances in Computing Communications and Informatics.","author":"Harikrishna D. M."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the International Conference on Computer Communication and Control.","author":"Harikrishna D. M."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/IC3.2015.7346682"},{"key":"e_1_2_1_17_1","first-page":"9","article-title":"Introduction to the special Issue on Indian language information retrieval","volume":"9","author":"Harman Donna","year":"2010","journal-title":"ACM Trans. Asian Lang. Inform. Proc."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-00969-8_86"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-0906"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the International Conference of Soft Computing and Pattern Recognition (SoCPaR\u201911)","author":"Jayashree R."},{"key":"e_1_2_1_21_1","volume-title":"Text Categorization with Support Vector Machines: Learning with Many Relevant Features","author":"Joachims Thorsten"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/1046920.1046922"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072228.1072331"},{"key":"e_1_2_1_24_1","volume-title":"Mojtaba Heidarysafa, Sanjana Mendu, Laura E. Barnes, and Donald E. Brown.","author":"Kowsari Kamran","year":"2019"},{"key":"e_1_2_1_25_1","volume-title":"Computational Linguistics and Intelligent Text Processing","author":"Kr\u00e1l Pavel"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02759469"},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the 4th IEEE International Conference on Data Mining (ICDM\u201904)","author":"Liu Tao","year":"2004"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the Language Resources and Evaluation Conference (LREC\u201910)","author":"Lobo Paula Vaz","year":"2010"},{"key":"e_1_2_1_29_1","first-page":"551","article-title":"Latent semantic indexing for patent documents","volume":"15","author":"Moldovan Andreea","year":"2005","journal-title":"Int. J. Appl. Math. Comput. Sci."},{"key":"e_1_2_1_30_1","volume-title":"International Symposium on Linguistics, Quantification and Computation. 119--127","author":"Murthy Kavi Narayana","year":"2003"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (COLING\u201912)","author":"Vishal Gupta Nidhi","year":"2012"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the Oriental COCOSDA Held Jointly with the Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA\/CASLRE\u201913)","author":"Patil Hemant A."},{"key":"e_1_2_1_33_1","volume-title":"Comparison of Marathi text classifiers. Assoc. Comput. Electron. Elect. Eng. 4","author":"Patil Meera","year":"2014"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.7763\/IJMLC.2012.V2.158"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2006.876123"},{"key":"e_1_2_1_36_1","volume-title":"R: A Language and Environment for Statistical Computing","author":"Team R Development Core","year":"2008"},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of the Indian International Conference on Artificial Intelligence.","author":"Raghuveer K.","year":"2007"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2009.02.010"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2010-622"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.5555\/1273073.1273173"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the 3rd Global Wordnet Conference (GWC\u201906)","author":"Sinha Manish","year":"2006"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1099554.1099687"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the IEEE ICDM Workshop on Text Mining. Citeseer, 800--806","author":"Torkkola Kari","year":"2001"},{"key":"e_1_2_1_44_1","volume-title":"Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing.","author":"Tummalapalli Madhuri","year":"2018"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2008-560"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the 40th Meeting on Association for Computational Linguistics. ACL, 417--424","author":"Turney Peter D.","year":"2002"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CSSE.2008.571"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312647"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3342356","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3342356","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:26:02Z","timestamp":1750206362000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3342356"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,26]]},"references-count":48,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,3,31]]}},"alternative-id":["10.1145\/3342356"],"URL":"https:\/\/doi.org\/10.1145\/3342356","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,26]]},"assertion":[{"value":"2015-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-11-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}