{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T10:21:09Z","timestamp":1769941269510,"version":"3.49.0"},"reference-count":34,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2021,5,23]],"date-time":"2021-05-23T00:00:00Z","timestamp":1621728000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Breast cancer, like most forms of cancer, is a fatal disease that claims more than half a million lives every year. In 2020, breast cancer overtook lung cancer as the most commonly diagnosed form of cancer. Though extremely deadly, the survival rate and longevity increase substantially with early detection and diagnosis. The treatment protocol also varies with the stage of breast cancer. Diagnosis is typically done using histopathological slides from which it is possible to determine whether the tissue is in the Ductal Carcinoma In Situ (DCIS) stage, in which the cancerous cells have not spread into the encompassing breast tissue, or in the Invasive Ductal Carcinoma (IDC) stage, wherein the cells have penetrated into the neighboring tissues. IDC detection is extremely time-consuming and challenging for physicians. Hence, this can be modeled as an image classification task where pattern recognition and machine learning can be used to aid doctors and medical practitioners in making such crucial decisions. In the present paper, we use an IDC Breast Cancer dataset that contains 277,524 images (with 78,786 IDC positive images and 198,738 IDC negative images) to classify the images into IDC(+) and IDC(-). To that end, we use feature extractors, including textural features, such as SIFT, SURF and ORB, and statistical features, such as Haralick texture features. 
These features are then combined to yield a dataset of 782 features. These features are ensembled by stacking using various Machine Learning classifiers, such as Random Forest, Extra Trees, XGBoost, AdaBoost, CatBoost and Multi Layer Perceptron followed by feature selection using Pearson Correlation Coefficient to yield a dataset with four features that are then used for classification. From our experimental results, we found that CatBoost yielded the highest accuracy (92.55%), which is at par with other state-of-the-art results\u2014most of which employ Deep Learning architectures. The source code is available in the GitHub repository.<\/jats:p>","DOI":"10.3390\/s21113628","type":"journal-article","created":{"date-parts":[[2021,5,24]],"date-time":"2021-05-24T00:01:20Z","timestamp":1621814480000},"page":"3628","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":40,"title":["Computer Aided Breast Cancer Detection Using Ensembling of Texture and Statistical Image Features"],"prefix":"10.3390","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7931-0508","authenticated-orcid":false,"given":"Soumya Deep","family":"Roy","sequence":"first","affiliation":[{"name":"Department of Metallurgical and Material Engineering, Jadavpur University, Kolkata 700032, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8943-2672","authenticated-orcid":false,"given":"Soham","family":"Das","sequence":"additional","affiliation":[{"name":"Department of Metallurgical and Material Engineering, Jadavpur University, Kolkata 700032, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1404-1076","authenticated-orcid":false,"given":"Devroop","family":"Kar","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, 
India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5118-0812","authenticated-orcid":false,"given":"Friedhelm","family":"Schwenker","sequence":"additional","affiliation":[{"name":"Institute of Neural Information Processing, Ulm University, 89081 Ulm, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8813-4086","authenticated-orcid":false,"given":"Ram","family":"Sarkar","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India"}]}],"member":"1968","published-online":{"date-parts":[[2021,5,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1205","DOI":"10.1016\/S0033-8389(22)00653-4","article-title":"Digital mammography, computer-aided diagnosis, and telemammography","volume":"33","author":"Feig","year":"1995","journal-title":"Radiol. Clin. N. Am."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Sung, H., Ferlay, J., Siegel, R.L., Laversanne, M., Soerjomataram, I., Jemal, A., and Bray, F. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.","DOI":"10.3322\/caac.21660"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Bay, H., Tuytelaars, T., and Van Gool, L. (2006, January 7\u201313). SURF: Speeded up robust features. Proceedings of the 9th European Conference on Computer Vision, Graz, Austria.","DOI":"10.1007\/11744023_32"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Rublee, E., Rabaud, V., Konolige, K., and Bradski, G. (2011, January 6\u201313). ORB: An efficient alternative to SIFT or SURF. 
Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126544"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1109\/TSMC.1973.4309314","article-title":"Textural Features for Image Classification","volume":"SMC-3","author":"Haralick","year":"1973","journal-title":"IEEE Trans. Syst. Man Cybern."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random Forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_8","unstructured":"Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V., and Gulin, A. (2018, January 3\u20138). CatBoost: Unbiased Boosting with Categorical Features. Proceedings of the 32nd International Conference on Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_9","unstructured":"Freund, Y., and Schapire, R.E. (1999, January 18\u201322). A Short Introduction to Boosting. Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, Orlando, FL, USA."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Doyle, S., Agner, S., Madabhushi, A., Feldman, M., and Tomaszewski, J. (2008, January 14\u201317). Automated grading of breast cancer histopathology using spectral clustering with textural and architectural image features. Proceedings of the 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Paris, France.","DOI":"10.1109\/ISBI.2008.4541041"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1977","DOI":"10.1109\/TBME.2011.2110648","article-title":"Computerized classification of intraductal breast lesions using histopathological images","volume":"58","author":"Dundar","year":"2011","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Niwas, S.I., Palanisamy, P., Zhang, W., Isa, N.A.M., and Chibbar, R. 
(2011, January 17\u201318). Log-gabor wavelets based breast carcinoma classification using least square support vector machine. Proceedings of the 2011 IEEE International Conference on Imaging Systems and Techniques, Batu Ferringhi, Malaysia.","DOI":"10.1109\/IST.2011.5962184"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Kral, P., and Lenc, L. (2016, January 25\u201328). LBP features for breast cancer detection. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7532838"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yasiran, S.S., Salleh, S., and Mahmud, R. (2016). Haralick texture and invariant moments features for breast cancer classification. AIP Conf. Proc.","DOI":"10.1063\/1.4954535"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Narayanan, B.N., Krishnaraja, V., and Ali, R. (2019, January 15\u201319). Convolutional Neural Network for Classification of Histopathology Images for Breast Cancer Detection. Proceedings of the 2019 IEEE National Aerospace and Electronics Conference (NAECON), Dayton, OH, USA.","DOI":"10.1109\/NAECON46414.2019.9058279"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Debelee, T.G., Amirian, M., Ibenthal, A., Palm, G., and Schwenker, F. (2017, January 25\u201327). Classification of mammograms using convolutional neural network based feature extraction. Proceedings of the International Conference on Information and Communication Technology for Development for Africa, Bahir Dar, Ethiopia.","DOI":"10.1007\/978-3-319-95153-9_9"},{"key":"ref_17","first-page":"79","article-title":"Classification of mammograms using texture and cnn based extracted features","volume":"42","author":"Debelee","year":"2019","journal-title":"J. Biomim. Biomater. Biomed. Eng."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Rahman, M.J.U., Sultan, R.I., Mahmud, F., Ahsan, S.A., and Matin, A. 
(2018, January 28\u201331). Automatic System for Detecting Invasive Ductal Carcinoma Using Convolutional Neural Networks. Proceedings of the TENCON 2018\u20142018 IEEE Region 10 Conference, Jeju, Korea.","DOI":"10.1109\/TENCON.2018.8650376"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Romano, A.M., and Hernandez, A.A. (2019, January 25\u201328). Enhanced Deep Learning Approach for Predicting Invasive Ductal Carcinoma from Histopathology Images. Proceedings of the 2019 2nd International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China.","DOI":"10.1109\/ICAIBD.2019.8837044"},{"key":"ref_20","unstructured":"Gurcan, M.N., and Madabhushi, A. (2014). Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks. Medical Imaging 2014: Digital Pathology, SPIE."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, J.L., Ibrahim, A.K., Zhuang, H., Ali, A.M., Li, A.Y., and Wu, A. (2018, January 12\u201314). A Study on Automatic Detection of IDC Breast Cancer with Convolutional Neural Networks. Proceedings of the 2018 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA.","DOI":"10.1109\/CSCI46756.2018.00141"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Sanyal, R., Jethanandani, M., and Sarkar, R. (2020). DAN: Breast Cancer Classification from High-Resolution Histology Images Using Deep Attention Network. Advances in Intelligent Systems and Computing, Springer.","DOI":"10.1007\/978-981-15-6067-5_35"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sanyal, R., Kar, D., and Sarkar, R. (2021). Carcinoma type classification from high-resolution breast microscopy images using a hybrid ensemble of deep convolutional features and gradient boosting trees classifiers. IEEE\/ACM Trans. Comput. Biol. 
Bioinform.","DOI":"10.1109\/TCBB.2021.3071022"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Chapala, H., and Sujatha, B. (2020, January 2\u20134). ResNet: Detection of Invasive Ductal Carcinoma in Breast Histopathology Images Using Deep Learning. Proceedings of the 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India.","DOI":"10.1109\/ICESC48915.2020.9155805"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s12530-019-09297-2","article-title":"Survey of deep learning in breast cancer image analysis","volume":"11","author":"Debelee","year":"2020","journal-title":"Evol. Syst."},{"key":"ref_26","unstructured":"Schwenker, F., Roli, F., and Kittler, J. (July, January 29). Multiple Classifier Systems. Proceedings of the 12th International Workshop, G\u00fcnzburg, Germany. Lecture Notes in Computer Science."},{"key":"ref_27","first-page":"17","article-title":"Learning of decision fusion mappings for pattern recognition","volume":"6","author":"Schwenker","year":"2005","journal-title":"Int. J. Artif. Intell. Mach. Learn. (AIML)"},{"key":"ref_28","unstructured":"Pedrycz, W., and Chen, S.M. (2018). Multi-classifier-Systems: Architectures, Algorithms and Applications. Computational Intelligence for Pattern Recognition, Springer International Publishing."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"K\u00e4chele, M., Thiam, P., Palm, G., Schwenker, F., and Schels, M. (2015, January 26). Ensemble methods for continuous affect recognition: Multi-modality, temporality, and challenges. 
Proceedings of the 5th International Workshop on Audio\/Visual Emotion Challenge, Brisbane, Australia.","DOI":"10.1145\/2808196.2811637"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.cosrev.2018.01.003","article-title":"Cluster ensembles: A survey of approaches with recent extensions and applications","volume":"28","author":"Boongoen","year":"2018","journal-title":"Comput. Sci. Rev."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/s11063-013-9334-5","article-title":"Neural network ensembles in reinforcement learning","volume":"41","author":"Schwenker","year":"2015","journal-title":"Neural Process. Lett."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-006-6226-1","article-title":"Extremely randomized trees","volume":"63","author":"Geurts","year":"2006","journal-title":"Mach. Learn."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Alghodhaifi, H., Alghodhaifi, A., and Alghodhaifi, M. (2019, January 15\u201319). Predicting Invasive Ductal Carcinoma in breast histology images using Convolutional Neural Network. 
Proceedings of the 2019 IEEE National Aerospace and Electronics Conference (NAECON), Dayton, OH, USA.","DOI":"10.1109\/NAECON46414.2019.9057822"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/11\/3628\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:06:21Z","timestamp":1760162781000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/11\/3628"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,23]]},"references-count":34,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2021,6]]}},"alternative-id":["s21113628"],"URL":"https:\/\/doi.org\/10.3390\/s21113628","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,23]]}}}