{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:29:54Z","timestamp":1750220994163,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,9,20]],"date-time":"2019-09-20T00:00:00Z","timestamp":1568937600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,9,20]]},"DOI":"10.1145\/3352631.3352639","type":"proceedings-article","created":{"date-parts":[[2019,10,18]],"date-time":"2019-10-18T12:57:15Z","timestamp":1571403435000},"page":"31-36","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Using Balanced Training to Minimize Biased Classification"],"prefix":"10.1145","author":[{"given":"Redy","family":"Andriyansah","sequence":"first","affiliation":[{"name":"Ministry of Finance Indonesia, Jakarta, Indonesia"}]},{"given":"Syed Saqib","family":"Bukhari","sequence":"additional","affiliation":[{"name":"German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany"}]},{"given":"Martin","family":"Jenckel","sequence":"additional","affiliation":[{"name":"German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany"}]},{"given":"Andreas","family":"Dengel","sequence":"additional","affiliation":[{"name":"Department of Computer Science TU Kaiserslautern, German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany"}]}],"member":"320","published-online":{"date-parts":[[2019,9,20]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"139","volume-title":"International Workshop on Camera-Based Document Analysis and Recognition","author":"Afzal Muhammad Zeshan","year":"2013","unstructured":"Muhammad Zeshan Afzal , Martin Kr\u00e4mer , Syed Saqib Bukhari , Mohammad Reza Yousefi , Faisal Shafait , and Thomas M Breuel . Robust binarization of stereo and monocular document images using percentile filter . In International Workshop on Camera-Based Document Analysis and Recognition , pages 139 -- 149 . Springer , 2013 . Muhammad Zeshan Afzal, Martin Kr\u00e4mer, Syed Saqib Bukhari, Mohammad Reza Yousefi, Faisal Shafait, and Thomas M Breuel. Robust binarization of stereo and monocular document images using percentile filter. In International Workshop on Camera-Based Document Analysis and Recognition, pages 139--149. Springer, 2013."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.149"},{"key":"e_1_3_2_1_3_1","volume-title":"Augmentor: an image augmentation library for machine learning. arXiv preprint arXiv:1708.04680","author":"Bloice Marcus D","year":"2017","unstructured":"Marcus D Bloice , Christof Stocker , and Andreas Holzinger . Augmentor: an image augmentation library for machine learning. arXiv preprint arXiv:1708.04680 , 2017 . Marcus D Bloice, Christof Stocker, and Andreas Holzinger. Augmentor: an image augmentation library for machine learning. arXiv preprint arXiv:1708.04680, 2017."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/335603.335611"},{"key":"e_1_3_2_1_5_1","volume-title":"Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321-357","author":"Chawla Nitesh V","year":"2002","unstructured":"Nitesh V Chawla , Kevin W Bowyer , Lawrence O Hall , and W Philip Kegelmeyer . Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321-357 , 2002 . Nitesh V Chawla, Kevin W Bowyer, Lawrence O Hall, and W Philip Kegelmeyer. Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321-357, 2002."},{"key":"e_1_3_2_1_6_1","volume-title":"SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping","author":"Collins-Thompson Kevyn","year":"2002","unstructured":"Kevyn Collins-Thompson and Radoslav Nickolov . A clustering-based algorithm for automatic document separation . In SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping , Meaning, Tampere, Finland , 2002 . Kevyn Collins-Thompson and Radoslav Nickolov. A clustering-based algorithm for automatic document separation. In SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping, Meaning, Tampere, Finland, 2002."},{"key":"e_1_3_2_1_7_1","volume-title":"Document image classification with intra-domain transfer learning and stacked generalization of deep convolutional neural networks. arXiv preprint arXiv:1801.09321","author":"Das Arindam","year":"2018","unstructured":"Arindam Das , Saikat Roy , and Ujjwal Bhattacharya . Document image classification with intra-domain transfer learning and stacked generalization of deep convolutional neural networks. arXiv preprint arXiv:1801.09321 , 2018 . Arindam Das, Saikat Roy, and Ujjwal Bhattacharya. Document image classification with intra-domain transfer learning and stacked generalization of deep convolutional neural networks. arXiv preprint arXiv:1801.09321, 2018."},{"key":"e_1_3_2_1_8_1","first-page":"31","volume-title":"Information Theory, 2004. ISIT 2004. Proceedings. International Symposium on","author":"Fuglede Bent","unstructured":"Bent Fuglede and Flemming Topsoe . Jensen-shannon divergence and hilbert space embedding . In Information Theory, 2004. ISIT 2004. Proceedings. International Symposium on , page 31 . IEEE, 2004. Bent Fuglede and Flemming Topsoe. Jensen-shannon divergence and hilbert space embedding. In Information Theory, 2004. ISIT 2004. Proceedings. International Symposium on, page 31. IEEE, 2004."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333910"},{"key":"e_1_3_2_1_10_1","volume-title":"Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167","author":"Ioffe Sergey","year":"2015","unstructured":"Sergey Ioffe and Christian Szegedy . Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 , 2015 . Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167, 2015."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2014.546"},{"key":"e_1_3_2_1_12_1","volume-title":"in 2nd Int. Conf. on Computer Vision Theory and Applications. Citeseer","author":"Keysers Daniel","year":"2007","unstructured":"Daniel Keysers , Faisal Shafait , and Thomas M Breuel . Document image zone classification-a simple high-performance approach . In in 2nd Int. Conf. on Computer Vision Theory and Applications. Citeseer , 2007 . Daniel Keysers, Faisal Shafait, and Thomas M Breuel. Document image zone classification-a simple high-performance approach. In in 2nd Int. Conf. on Computer Vision Theory and Applications. Citeseer, 2007."},{"key":"e_1_3_2_1_13_1","first-page":"1097","volume-title":"Advances in neural information processing systems","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey E Hinton . Imagenet classification with deep convolutional neural networks . In Advances in neural information processing systems , pages 1097 -- 1105 , 2012 . Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097--1105, 2012."},{"key":"e_1_3_2_1_14_1","first-page":"950","volume-title":"Advances in neural information processing systems","author":"Krogh Anders","year":"1992","unstructured":"Anders Krogh and John A Hertz . A simple weight decay can improve generalization . In Advances in neural information processing systems , pages 950 -- 957 , 1992 . Anders Krogh and John A Hertz. A simple weight decay can improve generalization. In Advances in neural information processing systems, pages 950--957, 1992."},{"key":"e_1_3_2_1_15_1","first-page":"1558","volume-title":"Pattern Recognition (ICPR), 2012 21st International Conference on","author":"Kumar Jayant","year":"2012","unstructured":"Jayant Kumar , Peng Ye , and David Doermann . Learning document structure for retrieval and classification . In Pattern Recognition (ICPR), 2012 21st International Conference on , pages 1558 -- 1561 . IEEE, 2012 . Jayant Kumar, Peng Ye, and David Doermann. Learning document structure for retrieval and classification. In Pattern Recognition (ICPR), 2012 21st International Conference on, pages 1558--1561. IEEE, 2012."},{"key":"e_1_3_2_1_16_1","volume-title":"The impact of imbalanced training data for convolutional neural networks","author":"Masko David","year":"2015","unstructured":"David Masko and Paulina Hensman . The impact of imbalanced training data for convolutional neural networks , 2015 . David Masko and Paulina Hensman. The impact of imbalanced training data for convolutional neural networks, 2015."},{"key":"e_1_3_2_1_17_1","volume-title":"URL http:\/\/www.ocr-d.de. Last accessed","author":"\u00c3\u016drderinitiative","year":"2019","unstructured":"ocr d. Ocr-d:koordinierte f \u00c3\u016drderinitiative zur weiterentwicklung von ocr f\u00c3ijr historische dokumente, 2019. URL http:\/\/www.ocr-d.de. Last accessed 7 February 2019 . ocr d. Ocr-d:koordinierte f\u00c3\u016drderinitiative zur weiterentwicklung von ocr f\u00c3ijr historische dokumente, 2019. URL http:\/\/www.ocr-d.de. Last accessed 7 February 2019."},{"key":"e_1_3_2_1_18_1","volume-title":"University of washingtoniii english\/technical document image database","author":"Phillips Ihsin","year":"1995","unstructured":"Ihsin Phillips , Robert Haralick , and Bhabatosh Chanda . University of washingtoniii english\/technical document image database , 1995 . Ihsin Phillips, Robert Haralick, and Bhabatosh Chanda. University of washingtoniii english\/technical document image database, 1995."},{"key":"e_1_3_2_1_19_1","first-page":"606","volume-title":"IPCV","author":"Shin Christian K","year":"2006","unstructured":"Christian K Shin and David S Doermann . Document image retrieval based on layout structural similarity . In IPCV , pages 606 -- 612 , 2006 . Christian K Shin and David S Doermann. Document image retrieval based on layout structural similarity. In IPCV, pages 606--612, 2006."},{"key":"e_1_3_2_1_20_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 , 2014 . Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/2627435.2670313"},{"key":"e_1_3_2_1_22_1","volume-title":"Analysis of convolutional neural networks for document image classification. arXiv preprint arXiv:1708.03273","author":"Tensmeyer Chris","year":"2017","unstructured":"Chris Tensmeyer and Tony Martinez . Analysis of convolutional neural networks for document image classification. arXiv preprint arXiv:1708.03273 , 2017 . Chris Tensmeyer and Tony Martinez. Analysis of convolutional neural networks for document image classification. arXiv preprint arXiv:1708.03273, 2017."}],"event":{"name":"HIP '19: The 5th International Workshop on Historical Document Imaging and Processing","sponsor":["FamilySearch FamilySearch"],"location":"Sydney NSW Australia","acronym":"HIP '19"},"container-title":["Proceedings of the 5th International Workshop on Historical Document Imaging and Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3352631.3352639","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3352631.3352639","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:25:30Z","timestamp":1750206330000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3352631.3352639"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,20]]},"references-count":22,"alternative-id":["10.1145\/3352631.3352639","10.1145\/3352631"],"URL":"https:\/\/doi.org\/10.1145\/3352631.3352639","relation":{},"subject":[],"published":{"date-parts":[[2019,9,20]]},"assertion":[{"value":"2019-09-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}