{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T17:25:35Z","timestamp":1769707535647,"version":"3.49.0"},"reference-count":45,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IFS"],"published-print":{"date-parts":[[2023,8,24]]},"abstract":"<jats:p>The method for document image classification presented in this paper mainly focuses on six different Malayalam palm leaf manuscript categories. The proposed approach consists of three phases: dataset analysis, building a bag of words repository followed by recognition and classification using a voting approach. The palm leaf manuscripts are initially subject to pre-processing and subjective analysis techniques to create a bag of words repository during the dataset analysis phase. Next, the textual components from the manuscripts are extracted for recognition using Tesseract 4 OCR with default and self-adapted training sets and a deep-learning algorithm. The Bag of Words approach is used in the third phase to categorize the palm leaf manuscripts based on textual components recognized by OCR using a voting process. Experimental analysis was done to analyze the proposed approach with and without the voting techniques, varying the size of the Bag of Words with default\/self-adapted training datasets using Tesseract OCR and a deep learning model. Experimental analysis proves that the proposed approach works equally well with\/without voting with a bag of words technique using Tesseract OCR. It is noticed that, for document classification, an overall accuracy of 83% without voting and 84.5% with voting is achieved with an F-score of 0.90 in both cases using Tesseract OCR. 
Overall, the proposed approach proves to be highly generalizable based on trial-wise experiments with Bag of Words, offering a reliable way for classifying deteriorated Malayalam handwritten palm manuscripts.<\/jats:p>","DOI":"10.3233\/jifs-223713","type":"journal-article","created":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T11:17:01Z","timestamp":1687259821000},"page":"4031-4049","source":"Crossref","is-referenced-by-count":7,"title":["Deteriorated image classification model for malayalam palm leaf manuscripts"],"prefix":"10.1177","volume":"45","author":[{"given":"B.J.","family":"Bipin Nair","sequence":"first","affiliation":[{"name":"Department of Computer Science, School of Computing, Mysuru Campus, Amrita Vishwa Vidyapeetham, India"}]},{"given":"N.","family":"Shobha Rani","sequence":"additional","affiliation":[{"name":"Department of Computer Science, School of Computing, Mysuru Campus, Amrita Vishwa Vidyapeetham, India"}]},{"given":"Mustaqeem","family":"Khan","sequence":"additional","affiliation":[{"name":"Department of Computer Vision, Mohamed Bin Zayed University of Artificial Intelligence, Abu Dhabi, UAE"}]}],"member":"179","reference":[{"issue":"21","key":"10.3233\/JIFS-223713_ref1","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1007\/s10032-004-0138-z","article-title":"Camera-based analysis of text and documents: a survey","volume":"7","author":"Liang","year":"2005","journal-title":"International Journal of Document Analysis and Recognition (IJDAR)"},{"issue":"12","key":"10.3233\/JIFS-223713_ref2","doi-asserted-by":"crossref","first-page":"1921","DOI":"10.1016\/S0031-3203(98)00079-X","article-title":"On image classification: City images vs. landscapes","volume":"31","author":"Vailaya","year":"1998","journal-title":"Pattern Recognition"},{"key":"10.3233\/JIFS-223713_ref3","unstructured":"Wilson E. Bridger and Rice J.M. 
, Palm Leaf Manuscripts in South Asia, (2019)."},{"key":"10.3233\/JIFS-223713_ref4","doi-asserted-by":"crossref","unstructured":"Kang L. , Kumar J. , Ye P. , Li Y. and Doermann D. , Convolutional neural networks for document image classification. In 2014 22nd International Conference on Pattern Recognition, (2014) (pp. 3168\u20133172). IEEE.","DOI":"10.1109\/ICPR.2014.546"},{"key":"10.3233\/JIFS-223713_ref5","doi-asserted-by":"crossref","unstructured":"Harley A.W. , Ufkes A. and Derpanis K.G. , Evaluation of deep convolutional nets for document image classification and retrieval. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (2015) (pp. 991\u2013995). IEEE.","DOI":"10.1109\/ICDAR.2015.7333910"},{"key":"10.3233\/JIFS-223713_ref6","doi-asserted-by":"crossref","unstructured":"Jain R. and Wigington C. , Multimodal document image classification. In 2019 International Conference on Document Analysis and Recognition (ICDAR) (2019) (pp. 71\u201377). IEEE.","DOI":"10.1109\/ICDAR.2019.00021"},{"key":"10.3233\/JIFS-223713_ref7","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1016\/j.neucom.2021.04.114","article-title":"Document image classification: Progress over two decades","volume":"453","author":"Liu","year":"2021","journal-title":"Neurocomputing"},{"issue":"4","key":"10.3233\/JIFS-223713_ref8","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1109\/TPAMI.2003.1190578","article-title":"Hidden tree Markov models for document image classification","volume":"25","author":"Diligenti","year":"2003","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10.3233\/JIFS-223713_ref9","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.patrec.2013.10.030","article-title":"Structural similarity for document image classification and retrieval","volume":"43","author":"Kumar","year":"2014","journal-title":"Pattern Recognition 
Letters"},{"key":"10.3233\/JIFS-223713_ref10","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1016\/j.neucom.2021.04.114","article-title":"Document image classification: Progress over two decades","volume":"453","author":"Liu","year":"2021","journal-title":"Neurocomputing"},{"key":"10.3233\/JIFS-223713_ref11","doi-asserted-by":"crossref","unstructured":"Bakkali S. , Ming Z. , Coustaty M. and Rusinol M. , Visual and textual deep feature fusion for document image classification. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition workshops (2020), (pp. 562\u2013563).","DOI":"10.1109\/CVPRW50498.2020.00289"},{"key":"10.3233\/JIFS-223713_ref12","doi-asserted-by":"crossref","unstructured":"Raghunandan K.S. , Shivakumara P. , Navya B.J. , Pooja G. , Prakash N. , Kumar G.H. and Lu T. , Fourier coefficients for fraud handwritten document classification through age analysis. In 2016 15th International Conference on Frontiers in handwriting recognition (ICFHR) (2016) (pp. 25\u201330). IEEE.","DOI":"10.1109\/ICFHR.2016.0018"},{"issue":"4","key":"10.3233\/JIFS-223713_ref13","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1007\/PL00013566","article-title":"Classification of document pages using structure-based features","volume":"3","author":"Shin","year":"2001","journal-title":"International Journal on Document Analysis and Recognition"},{"key":"10.3233\/JIFS-223713_ref14","doi-asserted-by":"crossref","unstructured":"Kumar J. and Doermann D. , Unsupervised classification of structurally similar document images. In 2013 12th International Conference on Document Analysis and Recognition, (2013) (pp. 1225\u20131229) IEEE.","DOI":"10.1109\/ICDAR.2013.248"},{"key":"10.3233\/JIFS-223713_ref15","doi-asserted-by":"crossref","unstructured":"Reddy K.U. and Govindaraju V. , Form classification. Document Recognition and Retrieval XV (2008) (Vol. 6815, pp. 302\u2013307). 
SPIE.","DOI":"10.1117\/12.766737"},{"key":"10.3233\/JIFS-223713_ref16","unstructured":"Le D.X. and Thoma G.R. , Page layout classification technique for biomedical documents. In Proc. World Multiconference on Systems, Cybernetics and Informatics (SCI 2000) (2000) (pp. 348\u201352)."},{"key":"10.3233\/JIFS-223713_ref17","doi-asserted-by":"crossref","unstructured":"Antonacopoulos A. and Ritchings R.T. , Segmentation and classification of document images. IEE Colloquium on Document Image Processing and Multimedia Environments (1995) (pp. 16\u20131). IET.","DOI":"10.1049\/ic:19951197"},{"key":"10.3233\/JIFS-223713_ref18","doi-asserted-by":"crossref","unstructured":"Hu J. , Kashi R. and Wilfong G. , Document image layout comparison and classification. In Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR\u201999 (Cat. No. PR8) (1999) (pp. 285\u2013288). IEEE.","DOI":"10.1109\/ICDAR.1999.791780"},{"key":"10.3233\/JIFS-223713_ref19","doi-asserted-by":"crossref","unstructured":"Zhalehpour S. , Piper A. , Wellmon C. and Cheriet M. , Footnote-based document image classification. In International Conference Image Analysis and Recognition (2017) (pp. 634\u2013642). 
Springer, Cham.","DOI":"10.1007\/978-3-319-59876-5_70"},{"issue":"9","key":"10.3233\/JIFS-223713_ref20","doi-asserted-by":"crossref","first-page":"1182","DOI":"10.1016\/j.patrec.2008.01.012","article-title":"Wavelet-based co-occurrence histogram features for texture classification with an application to script identification in a document image","volume":"29","author":"Hiremath","year":"2008","journal-title":"Pattern Recognition Letters"},{"key":"10.3233\/JIFS-223713_ref21","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1109\/DIA.1997.627086","article-title":"Script and language identification from document images","author":"Peake","year":"1997","journal-title":"Proceedings Workshop on Document Image Analysis (DIA\u201997)"},{"issue":"11","key":"10.3233\/JIFS-223713_ref22","doi-asserted-by":"crossref","first-page":"2483","DOI":"10.1016\/S0031-3203(03)00128-6","article-title":"Hierarchical content classification and script determination for automatic document image processing","volume":"36","author":"Chi","year":"2003","journal-title":"Pattern Recognition"},{"key":"10.3233\/JIFS-223713_ref23","doi-asserted-by":"crossref","unstructured":"Joshi G.D. , Garg S. and Sivaswamy J. , Script identification from Indian documents. In International Workshop on Document Analysis Systems, (2006) (pp. 255\u2013267). 
Springer, Berlin, Heidelberg.","DOI":"10.1007\/11669487_23"},{"key":"10.3233\/JIFS-223713_ref24","first-page":"2014","article-title":"Script identification from printed Indian document images and performance evaluation using different classifiers,","author":"Obaidullah","journal-title":"Applied Computational Intelligence and Soft Computing"},{"key":"10.3233\/JIFS-223713_ref25","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1016\/j.procs.2015.06.067","article-title":"Numeral script identification from handwritten document images","volume":"54","author":"Obaidullah","year":"2015","journal-title":"Procedia Computer Science"},{"key":"10.3233\/JIFS-223713_ref26","first-page":"968","article-title":"Gabor Filter-Based Multi-class Classifier for Scanned Document Images","volume":"3","author":"Ma","year":"2003","journal-title":"ICDAR"},{"key":"10.3233\/JIFS-223713_ref27","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1007\/978-3-030-30436-2_9","volume-title":"Advances in Biometrics","author":"Dixit","year":"2019"},{"issue":"1","key":"10.3233\/JIFS-223713_ref28","first-page":"012032","article-title":"An Approach to Pattern Recognition for Identification of Devnagari Script Based on Fingertips and Palm","volume":"2327","author":"Tantarpale","year":"2022","journal-title":"Journal of Physics: Conference Series"},{"key":"10.3233\/JIFS-223713_ref29","doi-asserted-by":"crossref","first-page":"104916","DOI":"10.1016\/j.engappai.2022.104916","article-title":"ConvPatchTrans: A script identification network with global and local semantics deeply integrated,","volume":"113","author":"Yang","year":"2022","journal-title":"Engineering Applications of Artificial Intelligence"},{"issue":"1","key":"10.3233\/JIFS-223713_ref30","first-page":"2088","article-title":"Robust recognition technique for handwritten Kannada character recognition using capsule networks","volume":"12","author":"Rani","year":"2022","journal-title":"International Journal of Electrical & Computer 
Engineering"},{"key":"10.3233\/JIFS-223713_ref31","unstructured":"Preethi P. and Mamatha H.R. , Region-based CNN for Segmenting Text in Epigraphical Images, Artificial Intelligence and Applications, 2022."},{"issue":"03","key":"10.3233\/JIFS-223713_ref32","doi-asserted-by":"crossref","first-page":"2140011","DOI":"10.1142\/S0219467821400118","article-title":"Script identification for printed and handwritten Indian documents: An empirical study of different feature classifier combinations","volume":"22","author":"Rani","year":"2022","journal-title":"International Journal of Image and Graphics"},{"key":"10.3233\/JIFS-223713_ref33","doi-asserted-by":"crossref","unstructured":"Biswas K. , Shivakumara P. , Sivanthi S. , Pal U. , Lu Y. , Liu C.L. and Ayub M.N.B. , A New Deep Fuzzy Based MSER Model for Multiple Document Images Classification. International Conference on Pattern Recognition and Artificial Intelligence (2022) (pp. 358\u2013370). Springer, Cham.","DOI":"10.1007\/978-3-031-09037-0_30"},{"key":"10.3233\/JIFS-223713_ref34","unstructured":"Najla A.Q. , Khayyat M. and Suen C.Y. , Novel Features to Detect Gender from Handwritten Documents, Pattern Recognition Letters (2022)."},{"issue":"16","key":"10.3233\/JIFS-223713_ref35","doi-asserted-by":"crossref","first-page":"712","DOI":"10.17485\/IJST\/v15i16.88","article-title":"Classification of North and South Indian Handwritten Scripts using Gabor Wavelet Features","volume":"15","author":"Shreesha","year":"2022","journal-title":"Indian Journal of Science and Technology"},{"key":"10.3233\/JIFS-223713_ref36","doi-asserted-by":"crossref","unstructured":"Kamble P.M. , Ruikar D.D. , Houde K.V. and Hegadi R.S. , Adaptive Threshold-Based Database Preparation Method for Handwritten Image Classification. In International Conference on Recent Trends in Image Processing and Pattern Recognition, (2022) (pp. 280\u2013288). 
Springer, Cham.","DOI":"10.1007\/978-3-031-07005-1_24"},{"key":"10.3233\/JIFS-223713_ref37","first-page":"1","article-title":"Using statistical and motif texture analysis, pen ink discrimination in handwritten documents: A classification-based approach","author":"Dansena","year":"2022","journal-title":"Multimedia Tools and Applications"},{"issue":"2","key":"10.3233\/JIFS-223713_ref38","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1504\/IJISDC.2018.096333","article-title":"Identification and classification of historical Kannada handwritten document images using LBP features","volume":"2","author":"Bannigidad","year":"2018","journal-title":"International Journal of Intelligent Systems Design and Computing"},{"key":"10.3233\/JIFS-223713_ref39","doi-asserted-by":"crossref","unstructured":"Hassanpour M. and Malek H. , Document image classification using squeeze net convolutional neural network. In 2019 5th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS) (2019) (pp. 1\u20134). 
IEEE.","DOI":"10.1109\/ICSPIS48872.2019.9066032"},{"key":"10.3233\/JIFS-223713_ref40","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1109\/ICFHR.2016.0060","article-title":"Phocnet: A deep convolutional neural network for word spotting in handwritten documents","author":"Sudholt","year":"2016","journal-title":"2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)"},{"key":"10.3233\/JIFS-223713_ref41","first-page":"103","article-title":"Handwritten and machine-printed text separation in document images using the Bag of visual words paradigm","author":"Zagoris","year":"2012","journal-title":"International Conference on Frontiers in Handwriting Recognition"},{"issue":"3","key":"10.3233\/JIFS-223713_ref42","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1109\/TPAMI.2004.1262324","article-title":"Machine-printed text and handwriting identification in noisy document images","volume":"26","author":"Zheng","year":"2004","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10.3233\/JIFS-223713_ref43","doi-asserted-by":"crossref","first-page":"107832","DOI":"10.1016\/j.patcog.2021.107832","article-title":"Multi-task learning for simultaneous script identification and keyword spotting in document images,","volume":"113","author":"Cheikhrouhou","year":"2021","journal-title":"Pattern Recognition"},{"key":"10.3233\/JIFS-223713_ref45","first-page":"117","article-title":"Extracting text lines in handwritten documents by perceptual grouping","author":"Likforman-Sulem","year":"1994","journal-title":"Advances in Handwriting and Drawing: A Multidisciplinary Approach"},{"key":"10.3233\/JIFS-223713_ref48","unstructured":"Wallach H. , Evaluation metrics for hard classifiers. 
Cambridge: Cavendish Laboratory, University of Cambridge, (2006)."}],"container-title":["Journal of Intelligent & Fuzzy Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JIFS-223713","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T05:50:21Z","timestamp":1769665821000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JIFS-223713"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,24]]},"references-count":45,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/jifs-223713","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,24]]}}}