×

Variable selection with spatially autoregressive errors: a generalized moments Lasso estimator. (English) Zbl 1437.62273

Summary: We propose generalized moments Lasso estimator, combining Lasso with GMM, for penalized variable selection and estimation under the spatial error model with spatially autoregressive errors. We establish parameter consistency and selection sign consistency of the proposed estimator in the low dimensional setting when the parameter dimension \(p < \text{ sample size } n\) , as well as the high dimensional setting with \(p\) greater than and growing with \(n\). Finite sample performance of the method is examined by simulation, compared against the Lasso for IID data. The methods are applied to estimation of a spatial Durbin model for the Aveiro housing market (Portugal).

MSC:

62J07 Ridge regression; shrinkage estimators (Lasso)
62P20 Applications of statistics to economics
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)

Software:

glmnet
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Ahrens, A.; Bhattacharjee, A., Two-step lasso estimation of the spatial weights matrix, Econometrics (MDPI), 3, 1, 128-155 (2015)
[2] Ando, T.; Bai, J., Panel data models with grouped factor structure under unknown group membership, J. Appl. Econ., 31, 163-191 (2016)
[3] Anselin, L., Spatial Econometrics: Methods and Models (1988), Boston: Kluwer Academic, Boston
[4] Bai, Zd, Methodologies in spectral analysis of large dimensional random matrices: A review, Statistica Sinica, 9, 611-677 (1999) · Zbl 0949.60077
[5] Bailey, N.; Holly, S.; Pesaran, Mh, A two-stage approach to spatio-temporal analysis with strong and weak cross-sectional dependence, J. Appl. Econ., 31, 249-280 (2016)
[6] Belloni, A. and Chernozhukov, V. (2011). High dimensional sparse econometric models: An introduction. arXiv:1106.5242v2. · Zbl 1209.62064
[7] Belloni, A.; Chernozhukov, V., Least squares after model selection in high-dimensional sparse models, Bernoulli, 19, 521-547 (2013) · Zbl 1456.62066
[8] Belloni, A.; Chernozhukov, V.; Wang, L., Square-root LASSO: pivotal recovery of sparse signals via conic programming, Biometrika, 98, 791-806 (2011) · Zbl 1228.62083
[9] Belloni, A.; Chen, D.; Chernozhukov, V.; Hansen, C., Sparse models and methods for optimal instruments with an application to eminent domain, Econometrica, 80, 2369-2429 (2012) · Zbl 1274.62464
[10] Belloni, A., Chernozhukov, V. and Wei, Y. (2016). Post-selection inference for generalized linear models with many controls. Journal of Business and Economic Statistics, forthcoming.
[11] Bhattacharjee, A.; Castro, Ea; Marques, Jl, Understanding spatial diffusion with factor-based hedonic pricing models: the urban housing market of Aveiro, Portugal, Spat. Econ. Anal., 7, 1, 133-167 (2012)
[12] Bhattacharjee, A.; Castro, E.; Maiti, T.; Marques, J., Endogenous spatial regression and delineation of submarkets: A new framework with application to housing markets, J. Appl. Econ., 31, 32-57 (2016)
[13] Bickel, Pj; Ritov, Y.; Tsybakov, Ab, Simultaneous analysis of LASSO and Dantzig selector, Ann. Stat., 37, 1705-1732 (2009) · Zbl 1173.62022
[14] Brady, Rr, Measuring the diffusion of housing prices across space and over time, J. Appl. Econ., 26, 2, 213-231 (2011)
[15] Bühlmann, P. and van de Geer, S. (2011). Statistics for High-Dimensional Data. Springer. · Zbl 1273.62015
[16] Caner, M.; Zhang, Hh, Adaptive elastic net for generalized methods of moments, J. Bus. Econ. Stat., 32, 1, 30-47 (2014)
[17] Castle, Jl; Hendry, Df, Model selection in under-specified equations with breaks, J. Econ., 178, 286-293 (2014) · Zbl 1293.62179
[18] Castle, Jl; Doornik, Ja; Hendry, Df; Pretis, F., Detecting location shifts during model selection by step-indicator saturation, Econometrics (MDPI), 3, 2, 240-264 (2015)
[19] Chudik, A.; Pesaran, Mh, Infinite-dimensional VARs and factor models, J. Econ., 163, 1, 4-22 (2011) · Zbl 1441.62652
[20] Chudik, A., Grossman, V. and Pesaran, M.H. (2016). A multi-country approach to forecasting output growth using PMIs. Journal of Econometrics, forthcoming. · Zbl 1420.62481
[21] Cliff, A.D. and Ord, J.K. (1973). Spatial Autocorrelation. London: Pion.
[22] Cuaresma, C.; Feldkircher, M., Spatial filtering, model uncertainty and the speed of income convergence in Europe, J. Appl. Econ., 28, 4, 720-741 (2013)
[23] Feng, Wenning; Lim, Chae Young; Maiti, Tapabrata; Zhang, Zhen, Spatial regression and estimation of disease risks: A clustering-based approach, Statistical Analysis and Data Mining: The ASA Data Science Journal, 9, 6, 417-434 (2016) · Zbl 07260609
[24] Flores-Lagunes, A.; Schnier, Ke, Estimation of sample selection models with spatial dependence, J. Appl. Econ., 27, 2, 173-204 (2012)
[25] Friedman, J.; Hastie, T.; Tibshirani, R., Regularization paths for generalized linear models via coordinate descent, J. Stat. Softw., 33, 1-22 (2010)
[26] Fu, W.; Knight, K., Asymptotics for LASSO-type estimators, Ann. Stat., 28, 1356-1378 (2000) · Zbl 1105.62357
[27] Geyer, C.J. (1996). On the asymptotics of convex stochastic optimization. Unpublished manuscript.
[28] Hall, P.; Horowitz, Jl, Nonparametric methods for inference in the presence of instrumental variables, Ann. Stat., 33, 2904-2929 (2005) · Zbl 1084.62033
[29] Hendry, Df; Johansen, S.; Santos, C., Automatic selection of indicators in a fully saturated regression, Comput. Stat., 33, 317-335 (2008) · Zbl 1222.62091
[30] Ishwaran, H.; Rao, Js, Spike and slab variable selection: Frequentist and Bayesian strategies, Ann. Stat., 33, 2, 730-773 (2005) · Zbl 1068.62079
[31] Johansen, Søren; Nielsen, Bent, An Analysis of the Indicator Saturation Estimator as a Robust Regression Estimator*, The Methodology and Practice of Econometrics, 1-36 (2009) · Zbl 1384.62232
[32] Kapoor, M.; Kelejian, Hh; Prucha, Ir, Panel data models with spatially correlated error components, J. Econ., 140, 97-130 (2007) · Zbl 1418.62482
[33] Kelejian, Hh; Prucha, Ir, A generalized moments estimator for the autoregressive parameter in a spatial model, Int. Econ. Rev., 40, 509-533 (1999)
[34] Kelejian, Hh; Prucha, Ir, Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances, J. Econ., 157, 53-67 (2010) · Zbl 1431.62636
[35] Kock, Ab; Callot, L., Oracle inequalities for high dimensional vector autoregressions, J. Econ., 186, 2, 325-344 (2015) · Zbl 1331.62348
[36] Lam, C. and Souza, P.C. (2016). Regularization for spatial panel time series using the adaptive LASSO. Journal of Regional Science, forthcoming.
[37] Lee, L-F, Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models, Econometrica, 72, 6, 1899-1925 (2004) · Zbl 1142.62312
[38] Lee, L-F; Yu, J., Estimation of spatial autoregressive panel data models with fixed effects, J. Econ., 154, 165-185 (2010) · Zbl 1431.62643
[39] Lee, L-F; Yu, J., Identification of spatial Durbin panel models, J. Appl. Econ., 31, 1, 133-162 (2016)
[40] Lin, X.; Lee, L-F, GMM estimation of spatial autoregressive models with unknown heteroskedasticity, J. Econ., 157, 1, 34-52 (2010) · Zbl 1431.62399
[41] Lounici, K., Sup-norm convergence rate and sign concentration property of Lasso and Dantzig estimators, Electr. J. Stat., 2, 90-102 (2008) · Zbl 1306.62155
[42] Meinshausen, N.; Bühlmann, P., High dimensional graphs and variable selection with the LASSO, Ann. Stat., 34, 1436-1462 (2006) · Zbl 1113.62082
[43] Nandy, S., Lim, C. and Maiti, T. (2016). Additive model building for spatial regression. Journal of the Royal Statistical Society Series B, forthcoming. · Zbl 1411.62207
[44] Nowak, A. and Smith, P. (2017). Textual analysis in real estate. Journal of Applied Econometrics, forthcoming.
[45] Pesaran, Mh; Schuermann, T.; Weiner, Sm, Modelling regional interdependencies using a global error-correcting macroeconometric model, J. Business Econ. Stat., 22, 2, 129-162 (2004)
[46] Pollard, D., Asymptotic for least absolute deviation regression estimators, Econ. Theory, 7, 186-199 (1991)
[47] Stock, Jh; Watson, Mw, Forecasting using principle components from a large number of predictors, J. Am. Stat. Assoc., 97, 1167-1179 (2002) · Zbl 1041.62081
[48] Su, L.; Yang, Z., QML estimation of dynamic panel data models with spatial errors, J. Econ., 185, 1, 230-258 (2015) · Zbl 1331.62486
[49] Tibshirani, R., Regression shrinkage and selection via the LASSO, J. R. Stat. Soc. Series B, 58, 267-288 (1996) · Zbl 0850.62538
[50] Varian, Hr, Big data: new tricks for econometrics, J. Econ. Perspect., 28, 3-28 (2014)
[51] Whittle, P., On stationary processes in the plane, Biometrica, 41, 434-449 (1954) · Zbl 0058.35601
[52] Yu, Jihai; De Jong, Robert; Lee, Lung-Fei, Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large, Journal of Econometrics, 146, 1, 118-134 (2008) · Zbl 1418.62549
[53] Zhao, P.; Yu, B., On model selection consistency of LASSO, J. Mach. Learn. Res., 7, 2541-2563 (2006) · Zbl 1222.62008
[54] Zou, H., The adaptive lasso and its oracle properties, J. Amer. Stat. Assoc., 101, 1418-1429 (2006) · Zbl 1171.62326
[55] Zou, H.; Zhang, H., On the adaptive elastic-net with a diverging number of parameters, Ann. Stat., 37, 1733-1751 (2009) · Zbl 1168.62064
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.