{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T10:54:24Z","timestamp":1775040864798,"version":"3.50.1"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,8,17]],"date-time":"2021-08-17T00:00:00Z","timestamp":1629158400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,8,17]],"date-time":"2021-08-17T00:00:00Z","timestamp":1629158400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Data mining techniques were used to investigate the use of knowledge extraction in predicting customer churn in insurance companies. Data were included from a health insurance company for providing insight into churn behaviour based on a design and application of a prediction model. Additionally, three promising data mining techniques were identified for the prediction of modeling, including logistic regression, neural network, and K-means. The decision tree method was used in the modeling phase of CRISP-DM for identifying the attributes of churned customers. The predictive analysis task is undertaken through classification and regression techniques. K-means clustering variation is selected for exploring if the clustering algorithms categorize the customers in churning and non-churning groups with homogeneous profiles. The findings of the study show that data mining procedures can be very successful in extracting hidden information and get to know customer's information. The 50:50 training set distribution resulted in effective outcomes when the logistic regression technique was used throughout this study. A 70:30 distribution worked effectively for the neural network technique. In this regard, it is concluded that each technique works effectively with a different training set distribution. The predicted findings can have direct implications for the marketing department of the selected insurance company, whereas the models are anticipated to be readily applicable in other environments via this data mining approach. This study has shown that the prediction models can be utilized throughout a health insurance company's marketing strategy and in a general academic context with a combination of a research-based emphasis with a business problem-solving approach.<\/jats:p>","DOI":"10.1186\/s40537-021-00500-3","type":"journal-article","created":{"date-parts":[[2021,8,17]],"date-time":"2021-08-17T12:33:42Z","timestamp":1629203622000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["The use of knowledge extraction in predicting customer churn in B2B"],"prefix":"10.1186","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5062-8942","authenticated-orcid":false,"given":"Arwa A.","family":"Jamjoom","sequence":"first","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,8,17]]},"reference":[{"key":"500_CR1","doi-asserted-by":"publisher","first-page":"242","DOI":"10.1016\/j.neucom.2016.12.009","volume":"237","author":"A Amin","year":"2017","unstructured":"Amin A, Anwar S, Adnan A, Nawaz M, Alawfi K, Hussain A, Huang K. Customer churn prediction in the telecommunication sector using a rough set approach. Neurocomputing. 2017;237:242\u201354. https:\/\/doi.org\/10.1016\/j.neucom.2016.12.009.","journal-title":"Neurocomputing"},{"key":"500_CR2","first-page":"91","volume":"8","author":"H Amjad","year":"2015","unstructured":"Amjad H, Reham D, Osama H, Ruba O, Hossam F. Hybrid data mining models for predicting customer churn. Int J Commun Netw Syst Sci. 2015;8:91\u20136.","journal-title":"Int J Commun Netw Syst Sci"},{"issue":"11","key":"500_CR3","first-page":"57","volume":"9","author":"MO Arowolo","year":"2017","unstructured":"Arowolo MO, Abdulsalam SO, Isiaka RM, Gbolagade KA. A hybrid dimensionality reduction model for classification of microarray dataset. Int J Inf Technol Comput Sci. 2017;9(11):57\u201363.","journal-title":"Int J Inf Technol Comput Sci"},{"key":"500_CR4","doi-asserted-by":"publisher","first-page":"182422","DOI":"10.1109\/ACCESS.2020.3029234","volume":"8","author":"MO Arowolo","year":"2020","unstructured":"Arowolo MO, Adebiyi MO, Adebiyi AA, Okesola OJ. A hybrid heuristic dimensionality reduction method for classifying malaria vector gene expression data. IEEE Access. 2020;8:182422\u201330.","journal-title":"IEEE Access"},{"key":"500_CR5","doi-asserted-by":"publisher","DOI":"10.1109\/ICMCECS47690.2020.240881","author":"MO Arowolo","year":"2020","unstructured":"Arowolo MO, Adebiyi M, Adebiyi AA, Okesola JO. PCA model for RNA-Seq malaria vector data classification using KNN and decision tree algorithm. Int Conf Math Comput Eng Comput Sci. 2020. https:\/\/doi.org\/10.1109\/ICMCECS47690.2020.240881.","journal-title":"Int Conf Math Comput Eng Comput Sci"},{"key":"500_CR6","doi-asserted-by":"publisher","DOI":"10.4108\/eai.25-9-2017.153147","author":"MO Arowolo","year":"2017","unstructured":"Arowolo MO, Isiaka RM, Abdulsalam SO, Saheed YK, Gbolagade KA. A comparative analysis of feature extraction methods for classifying colon cancer microarray data. EAI Endorsed Trans Scalable Inf Syst. 2017. https:\/\/doi.org\/10.4108\/eai.25-9-2017.153147.","journal-title":"EAI Endorsed Trans Scalable Inf Syst"},{"key":"500_CR7","doi-asserted-by":"crossref","unstructured":"Dolatabadi SH, Keynia F. Designing of customer and employee churn prediction model based on data mining method and neural predictor. In: 2017 2nd International Conference on Computer and Communication Systems (ICCCS). New York: IEEE; 2017. p. 74\u20137.","DOI":"10.1109\/CCOMS.2017.8075270"},{"issue":"5","key":"500_CR8","doi-asserted-by":"publisher","first-page":"732","DOI":"10.1108\/K-07-2015-0172","volume":"45","author":"M Fathian","year":"2016","unstructured":"Fathian M, Hoseinpoor Y, Minaei-Bidgoli B. Offering a hybrid approach of data mining to predict the customer churn based on bagging and boosting methods. Kybernetes. 2016;45(5):732\u201343.","journal-title":"Kybernetes"},{"key":"500_CR9","doi-asserted-by":"publisher","first-page":"378","DOI":"10.1007\/978-3-030-33742-1_30","volume-title":"International Conference on Software Business","author":"I Figalist","year":"2019","unstructured":"Figalist I, Elsner C, Bosch J, Olsson HH. Customer Churn Prediction in B2B Contexts. In: Hyrynsalmi S, Suoranta M, Nguyen-Duc A, Tyrv\u00e4inen P, Abrahamsson P, editors. International Conference on Software Business. Cham: Springer; 2019. p. 378\u201386."},{"key":"500_CR10","doi-asserted-by":"publisher","unstructured":"Franciska I, Swaminathan B (2017) Churn prediction analysis using various clustering algorithms in KNIME analytics platform. In: 2017 Third International Conference on Sensing, Signal Processing and Security (ICSSS). New York: IEEE; 2017. p. 166\u201370. https:\/\/doi.org\/10.1109\/ssps.2017.8071585.","DOI":"10.1109\/ssps.2017.8071585"},{"key":"500_CR11","doi-asserted-by":"publisher","unstructured":"He Y, Xiong Y, Tsai Y. Machine learning based approaches to predict customer churn for an insurance company. In: 2020 Systems and Information Engineering Design Symposium (SIEDS). New York: IEEE; 2020. p. 1\u20136. https:\/\/doi.org\/10.1109\/sieds49339.2020.9106691.","DOI":"10.1109\/sieds49339.2020.9106691"},{"key":"500_CR12","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/s40854-016-0029-6","volume":"2","author":"A Keramati","year":"2016","unstructured":"Keramati A, Ghaneei H, Mirmohammadi SM. Developing a prediction model for customer churn from electronic banking services using data mining. Financial Innov. 2016;2:2\u201313. https:\/\/doi.org\/10.1186\/s40854-016-0029-6.","journal-title":"Financial Innov"},{"key":"500_CR13","first-page":"59","volume":"19","author":"RQ Liu","year":"2018","unstructured":"Liu RQ, Lee YC, Mu HL. Customer classification and market basket analysis using K-means clustering and association rules: evidence from distribution big data of korean retailing company. Knowl Manag Res. 2018;19:59\u201376.","journal-title":"Knowl Manag Res"},{"key":"500_CR14","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2962680","author":"F Mart\u00ednez-Plumed","year":"2019","unstructured":"Mart\u00ednez-Plumed F, Contreras-Ochando L, Ferri C, et al. CRISP-DM twenty years later: from data mining processes to data science trajectories. IEEE Trans Knowl Data Eng. 2019. https:\/\/doi.org\/10.1109\/TKDE.2019.2962680.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"500_CR15","doi-asserted-by":"crossref","unstructured":"Nadali A, Kakhky EN, Nosratabadi HE. Evaluating the success level of data mining projects based on CRISP-DM methodology by a Fuzzy expert system. In: 2011 3rd International Conference on Electronics Computer Technology. New York: IEEE; 2011. 6:161\u20135.","DOI":"10.1109\/ICECTECH.2011.5942073"},{"issue":"27","key":"500_CR16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.17485\/ijst\/2018\/v11i27\/121478","volume":"11","author":"AN Nadeem","year":"2018","unstructured":"Nadeem AN, Umar S, Shahzad MS. A review on customer churn prediction data mining modeling techniques. Indian J Sci Technol. 2018;11(27):1\u20137. https:\/\/doi.org\/10.17485\/ijst\/2018\/v11i27\/121478.","journal-title":"Indian J Sci Technol"},{"key":"500_CR17","unstructured":"Nagesh Singh Chauhan (Decision Tree Algorithm)\u00a0Decision Tree Algorithm. 2021. https:\/\/www.kdnuggets.com\/2020\/01\/decision-tree-algorithm-explained.html. Accessed April 2021."},{"key":"500_CR18","first-page":"1","volume":"29","author":"AM Olaolu","year":"2018","unstructured":"Olaolu AM, Abdulsalam SO, Mope IR, Kazeem GA. A comparative analysis of feature selection and feature extraction models for classifying microarray dataset. Comput Inf Syst J. 2018;29:1.","journal-title":"Comput Inf Syst J"},{"key":"500_CR19","first-page":"34","volume":"3","author":"S Renjith","year":"2017","unstructured":"Renjith S. B2C E-commerce customer churn management: churn detection using support vector machine and personalized retention using hybrid recommendations. Int J Future Revolut Comput Sci Commun Eng. 2017;3:34\u20139.","journal-title":"Int J Future Revolut Comput Sci Commun Eng"},{"key":"500_CR20","doi-asserted-by":"publisher","DOI":"10.5120\/7702-1059","author":"KV Rodpysh","year":"2012","unstructured":"Rodpysh KV. Model to predict the behavior of customers churn at the industry. Int J Comput Appl. 2012. https:\/\/doi.org\/10.5120\/7702-1059.","journal-title":"Int J Comput Appl"},{"issue":"3","key":"500_CR21","first-page":"7","volume":"5","author":"M Rohini","year":"2017","unstructured":"Rohini M, Devaki P. Analysis of customer churn by big data clustering. Int J Innov Res Comput Commun Eng. 2017;5(3):7.","journal-title":"Int J Innov Res Comput Commun Eng"},{"key":"500_CR22","first-page":"131","volume":"5","author":"A Santharam","year":"2018","unstructured":"Santharam A, Krishnan SB. Survey on customer churn prediction techniques. Int Res J Eng Technol. 2018;5:131\u20137.","journal-title":"Int Res J Eng Technol"},{"key":"500_CR23","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1007\/978-3-030-59065-9_21","volume-title":"Big data analytics and knowledge discovery","author":"M Scriney","year":"2020","unstructured":"Scriney M, Nie D, Roantree M. Predicting customer churn for insurance data. In: Song M, Song Y, Kotsis G, Tjoa M, Khalil I, editors. Big data analytics and knowledge discovery. Cham: Springer; 2020. p. 256\u201365."},{"key":"500_CR24","volume-title":"Consumer behaviour. Anmol","author":"SD Singh","year":"2012","unstructured":"Singh SD. Consumer behaviour. Anmol. Mumbai: Himalaya publishing house; 2012."},{"key":"500_CR25","doi-asserted-by":"crossref","unstructured":"Spiteri M. Customer churn prediction for an insurance company. Master\u2019s thesis, University of Malta. 2018.","DOI":"10.1109\/ICDIM.2018.8847066"},{"key":"500_CR26","unstructured":"Stucki O (2019) Predicting the Customer Churn with Machine Learning Methods: Case: Private Insurance Customer Data. Master\u2019s Thesis. LUT University. 2019."},{"key":"500_CR27","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1080\/0267257x.2017.1295094","volume":"33","author":"A Tamaddoni","year":"2017","unstructured":"Tamaddoni A, Stakhovych S, Ewing M. The impact of personalized incentives on the profitability of customer retention campaigns. J Mark Manag. 2017;33:327\u201347. https:\/\/doi.org\/10.1080\/0267257x.2017.1295094.","journal-title":"J Mark Manag"},{"issue":"3","key":"500_CR28","doi-asserted-by":"publisher","first-page":"1270","DOI":"10.1109\/TII.2016.2547584","volume":"12","author":"B Wenjie","year":"2016","unstructured":"Wenjie B, Meili C, Mengqi L, Guo L. A Big Data Clustering Algorithm for Mitigating the Risk of Customer Churn. IEEE Trans Industr Inf. 2016;12(3):1270\u20132181. https:\/\/doi.org\/10.1109\/TII.2016.2547584.","journal-title":"IEEE Trans Industr Inf"},{"key":"500_CR29","doi-asserted-by":"crossref","unstructured":"Zhang W, Mo T, Li W, Huang H, Tian X. The comparison of decision tree-based insurance churn prediction between spark ML and SPSS. In: 2016 9th International Conference on Service Science (ICSS). New York: IEEE; 2016. p. 134\u20139.","DOI":"10.1109\/ICSS.2016.25"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-021-00500-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-021-00500-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-021-00500-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,17]],"date-time":"2021-08-17T12:38:23Z","timestamp":1629203903000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-021-00500-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,17]]},"references-count":29,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["500"],"URL":"https:\/\/doi.org\/10.1186\/s40537-021-00500-3","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,17]]},"assertion":[{"value":"12 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 August 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 August 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not required.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The author declares no competing interest.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"110"}}