×

Tree-structured modelling of varying coefficients. (English) Zbl 1430.62164

Summary: The varying-coefficient model is a strong tool for the modelling of interactions in generalized regression. It is easy to apply if both the variables that are modified as well as the effect modifiers are known. However, in general one has a set of explanatory variables, and it is unknown which covariates are modified by which variables. A recursive partitioning strategy is proposed that is able to deal with this complex selection problem. The tree-structured modelling yields for each covariate, which is modified by other variables, a tree that visualizes the modified effects. The performance of the method is investigated in simulations, and two applications illustrate its usefulness.

MSC:

62J12 Generalized linear models (logistic models)
05C90 Applications of graph theory

Software:

TSVC; AER; R
PDFBibTeX XMLCite
Full Text: DOI arXiv

References:

[1] Berger, M.: TSVC: Tree-Structured Modelling of Varying Coefficients. R package version, vol. 1 (2018)
[2] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, J.C.: Classification and Regression Trees. Wadsworth, Monterey (1984) · Zbl 0541.62042
[3] Bürgin, R., Ritschard, G.: Tree-based varying coefficient regression for longitudinal ordinal responses. Comput. Stat. Data Anal. 86(C), 65-80 (2015) · Zbl 1468.62033 · doi:10.1016/j.csda.2015.01.003
[4] Bürgin, R., Ritschard, G.: Coefficient-wise tree-based varying coefficient regression with vcrpart. J. Stat. Softw. 80(6), 1-33 (2017) · doi:10.18637/jss.v080.i06
[5] Cameron, A.C., Trivedi, P.K.: Econometric models based on count data: comparisons and applications of some estimators and tests. J. Appl. Econom. 1(1), 29-53 (1986) · doi:10.1002/jae.3950010104
[6] Cameron, A.C., Trivedi, P.K.: Regression Analysis of Count Data. Econometric Society Monographs No. 30. Cambridge University Press, Cambridge (1998) · Zbl 0924.62004 · doi:10.1017/CBO9780511814365
[7] Fan, J., Zhang, W.: Statistical estimation in varying coefficient models. Ann. Stat. 27(5), 1491-1518 (1999) · Zbl 0977.62039 · doi:10.1214/aos/1017939139
[8] Fan, J., Zhang, W.: Statistical methods with varying coefficient models. Stat. Interface 1(1), 179-195 (2008) · Zbl 1230.62031 · doi:10.4310/SII.2008.v1.n1.a15
[9] Gerfin, M.: Parametric and semi-parametric estimation of the binary response model of labour market participation. J. Appl. Econom. 11(3), 321-339 (1996) · doi:10.1002/(SICI)1099-1255(199605)11:3<321::AID-JAE391>3.0.CO;2-K
[10] Gertheiss, J., Tutz, G.: Regularization and model selection with categorial effect modifiers. Stat. Sin. 22(3), 957-982 (2012) · Zbl 1257.62078
[11] Hastie, T., Tibshirani, R.: Varying-coefficient models. J. R. Stat. Soc. B 55, 757-796 (1993) · Zbl 0796.62060
[12] Hofner, B., Hothorn, T., Kneib, T.: Variable selection and model choice in structured survival models. Comput. Stat. 28(3), 1079-1101 (2013) · Zbl 1305.65043 · doi:10.1007/s00180-012-0337-x
[13] Hoover, D., Rice, J.A., Wu, C., Yang, L.: Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85(4), 809-822 (1998) · Zbl 0921.62045 · doi:10.1093/biomet/85.4.809
[14] Hothorn, T., Lausen, B.: On the exact distribution of maximally selected rank statistics. Comput. Stat. Data Anal. 43(2), 121-137 (2003) · Zbl 1429.62542 · doi:10.1016/S0167-9473(02)00225-6
[15] Hothorn, T., Hornik, K., Zeileis, A.: Unbiased recursive partitioning: a conditional inference framework. J. Comput. Graph. Stat. 15(3), 651-674 (2006) · doi:10.1198/106186006X133933
[16] Kauermann, G., Tutz, G.: Local likelihood estimation in varying coefficient models including additive bias correction. J. Nonparametric Stat. 12(3), 343-371 (2000) · Zbl 0945.62044 · doi:10.1080/10485250008832812
[17] Kleiber, C., Zeileis, A.: Applied Econometrics with R. New York. ISBN: 978-0-387-77316-2. http://CRAN.R-project.org/package=AER (2008) · Zbl 1155.91004
[18] Leng, C.: A simple approach for varying-coefficient model selection. J. Stat. Plan. Inference 139(7), 2138-2146 (2009) · Zbl 1160.62067 · doi:10.1016/j.jspi.2008.10.009
[19] Lu, Y., Zhang, R., Zhu, L.: Penalized spline estimation for varying-coefficient models. Commun. Stat. Theory Methods 37(14), 2249-2261 (2008) · Zbl 1143.62023 · doi:10.1080/03610920801931887
[20] Oelker, M.R., Gertheiss, J., Tutz, G.: Regularization and model selection with categorical predictors and effect modifiers in generalized linear models. Stat. Model. 14(2), 157-177 (2014) · Zbl 07257900 · doi:10.1177/1471082X13503452
[21] Ripley, B.D.: Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge (1996) · Zbl 0853.62046 · doi:10.1017/CBO9780511812651
[22] Shih, Y.S.: A note on split selection bias in classification trees. Comput. Stat. Data Anal. 45(3), 457-466 (2004) · Zbl 1429.62264 · doi:10.1016/S0167-9473(03)00064-1
[23] Shih, Y.S., Tsai, H.: Variable selection bias in regression trees with constant fits. Comput. Stat. Data Anal. 45(3), 595-607 (2004) · Zbl 1429.62725 · doi:10.1016/S0167-9473(03)00036-7
[24] Su, X., Meneses, K., McNees, P., Johnson, W.O.: Interaction trees: exploring the differential effects of an intervention programme for breast cancer survivors. J. R. Stat. Soc. C 60(3), 457-474 (2011) · doi:10.1111/j.1467-9876.2010.00754.x
[25] Wang, H., Xia, Y.: Shrinkage estimation of the varying coefficient model. J. Am. Stat. Assoc. 104(486), 747-757 (2009) · Zbl 1388.62213 · doi:10.1198/jasa.2009.0138
[26] Wang, J.C., Hastie, T.: Boosted varying-coefficient regression models for product demand prediction. J. Comput. Graph. Stat. 23(2), 361-382 (2014) · doi:10.1080/10618600.2013.778777
[27] Wang, L., Li, H., Haung, J.: Variable selection in nonparametric varying-coefficient models for analysis of repeated measurements. J. Am. Stat. Assoc. 103(484), 1556-1569 (2008) · Zbl 1286.62034 · doi:10.1198/016214508000000788
[28] Wedderburn, R.W.M.: Quasilikelihood functions, generalized linear models and the Gauss-Newton method. Biometrika 61(3), 439-447 (1974) · Zbl 0292.62050
[29] Wong, H., Guo, S., Chen, M., Wai-Cheung, I.P.: On locally weighted estimation and hypothesis testing of varying-coefficient models with missing covariates. J. Stat. Plan. Inference 139(9), 2933-2951 (2009) · Zbl 1170.62028 · doi:10.1016/j.jspi.2009.01.016
[30] Wu, C., Chiang, C., Hoover, D.: Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data. J. Am. Stat. Assoc. 93(444), 1388-1402 (1998) · Zbl 1064.62523 · doi:10.1080/01621459.1998.10473800
[31] Zhao, P., Xue, L.: Variable selection for semiparametric varying coefficient partially linear models. Stat. Probab. Lett. 79(20), 2148-2157 (2009) · Zbl 1171.62026 · doi:10.1016/j.spl.2009.07.004
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.