Modelling and predicting customer churn from an insurance company. (English) Zbl 1401.91144

Summary: Within a company’s customer relationship management strategy, finding the customers most likely to leave is a central aspect. We present a dynamic modelling approach for predicting individual customers’ risk of leaving an insurance company. A logistic longitudinal regression model that incorporates time-dynamic explanatory variables and interactions is fitted to the data. As an intermediate step in the modelling procedure, we apply generalised additive models to identify non-linear relationships between the logit and the explanatory variables. Both out-of-sample and out-of-time prediction indicate that the model performs well in terms of identifying customers likely to leave the company each month. Our approach is general and may be applied to other industries as well.


91B30 Risk theory, insurance (MSC2010)
62J05 Linear regression; mixed models
62P05 Applications of statistics to actuarial sciences and financial mathematics


Full Text: DOI


[1] Ahn, J.-H., Han, S.-P. and Lee, Y.-S.2006. Customer churn analysis: churn determinants and mediation effects of partial defection in the Korean mobile telecommunications service industry. Telecommunications Policy, 30(10-11): 552-568.
[2] Bolton, R. N.1998. A dynamic model of the duration of the customer’s relationship with a continuous service provider: the role of satisfaction. Marketing Science, 17(1): 45-65.
[3] Brockett, P. L., Golden, L. L., Guillen, M., Nielsen, J. P., Parner, J. and Perez-Marin, A. M.2008. Survival analysis of a household portfolio of insurance policies: how much time do you have to stop total customer defection?. The Journal of Risk and Insurance, 75(3): 713-737.
[4] Burez, J. and Van den Poel, D.2007. CRM at a pay-TV company: using analytical models to reduce customer attrition by targeted marketing for subscription services. Expert Systems with Applications, 32(2): 277-288.
[5] Coussement, C., Benoit, D. F. and Van den Poel, D.2010. Improved marketing decision making in a customer churn prediction context using generalized additive models. Expert Systems with Applications, 37: 2132-2143.
[6] Cox, D. R.1975. Partial likelihood. Biometrika, 62(2): 269-276. · Zbl 0312.62002
[7] Drew, J. H., Mani, D. R., Betz, A. L. and Datta, P.2001. Targeting customers with statistical and data-mining techniques. Journal of Service Research, 3(3): 205-219.
[8] Fawcett, T.2006. An introduction to ROC analysis. Pattern Recognition Letters, 19: 861-874.
[9] Hastie, T. J. and Tibshirani, R. J.1990. Generalized additive models, London: Chapman and Hall. · Zbl 0747.62061
[10] Hung, S.-Y., Yen, D. C. and Wang, H.-Y.2006. Applying data mining to telecom churn management. Expert Systems with Applications, 31(3): 514-524.
[11] Jamal, Z. and Bucklin, R. E.2006. Improving the diagnosis and prediction of customer churn: a heterogeneous hazard modeling approach. Journal of Interactive Marketing, 20(3-4): 16-29.
[12] Kim, H.-S. and Yoon, C.-H.2004. Determinants of subscriber churn and customer loyalty in the Korean mobile telephony market. Telecommunications Policy, 28(9-10): 751-765.
[13] Lemmens, A. and Croux, C.2006. Bagging and boosting classification trees to predict churn. Journal of Marketing Research, 43(2): 276-286.
[14] Mani , D. R. , Drew , J. , Betz , A. & Datta , P.1999 . Statistics and data mining techniques for lifetime value modeling . Proceedings of the fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA94 - 103 .
[15] McCullagh, P. and Nelder, J. A.1989. Generalized linear models, 2nd ed, London: Chapman and Hall/CRC. · Zbl 0744.62098
[16] Mozer, M. C., Wolniewicz, R., Grimes, D. B., Johnson, E. and Kaushansky, H.2000. Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry. IEEE Transactions on Neural Networks - Special Issue on Data Mining and Knowledge Representation, 11(3): 690-696.
[17] Neslin, S. A., Gupta, S., Kamakura, W., Lu, J. and Mason, C. H.2006. Defection detection: measuring and understanding the predictive accuracy of customer churn models. Journal of Marketing Research, 43(2): 204-211.
[18] R Development Core Team . 2008 . R: A language and environment for statistical computing . Vienna , , Austria : R Foundation for Statistical Computing .
[19] Schweidel, D. A., Fader, P. S. and Bradlow, E. T.2008. Modeling retention within and across cohorts. Journal of Marketing, 72(1): 82-94.
[20] Torkzadeh, G., Chang, J. C. J. and Hansen, G. W.2006. Identifying issues in customer relationship management at Merck-Medco. Decision Support Systems, 42(2): 1116-1130.
[21] Wei, C.-P. and Chiu, I.2002. Turning telecommunications call details to churn prediction: a data mining approach. Expert Systems with Applications, 23(2): 103-112.
[22] Zhang, Y., Qi, J., Shu, H. and Li, Y.2006. Case study on CRM: detecting likely churners with limited information of fixed-line subscriber. Proceedings of the International Conference on Service Systems and Service Management, 2: 1495-1500.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.