×

Evaluation of missing data mechanisms in two and three dimensional incomplete tables. (English) Zbl 1416.62308

Summary: The analysis of incomplete contingency tables is a practical and an interesting problem. In this paper, we provide characterizations for the various missing mechanisms of a variable in terms of response and non-response odds for two and three dimensional incomplete tables. Log-linear parametrization and some distinctive properties of the missing data models for the above tables are discussed. All possible cases in which data on one, two or all variables may be missing are considered. We study the missingness of each variable in a model, which is more insightful for analyzing cross-classified data than the missingness of the outcome vector. For sensitivity analysis of the incomplete tables, we propose easily verifiable procedures to evaluate the missing at random (MAR), missing completely at random (MCAR) and not missing at random (NMAR) assumptions of the missing data models. These methods depend only on joint and marginal odds computed from fully and partially observed counts in the tables, respectively. Finally, some real-life datasets are analyzed to illustrate our results, which are confirmed based on simulation studies.

MSC:

62H17 Contingency tables
62H30 Classification and discrimination; cluster analysis (statistical aspects)
PDFBibTeX XMLCite
Full Text: DOI arXiv

References:

[1] Baker, S. G.; Laird, N. M., Regression analysis for categorical variables with outcome subject to nonignorable nonresponse, Journal of the American Statistical Association, 83, 62-69 (1988)
[2] Baker, S. G.; Rosenberger, W. F.; Dersimonian, R., Closed-form estimates for missing counts in two-way contingency tables, Statistics in Medicine, 11, 643-657 (1992)
[3] Clarke, P. S.; Smith, P. W.F., On maximum likelihood estimation for log-linear models with non-ignorable non-responses, Statistics & Probability Letters, 73, 441-448 (2005) · Zbl 1075.62018
[4] Ghosh, S., & Vellaisamy, P. (2016b). Closed form estimates for missing counts in multidimensional incomplete tables. arXiv:1602.00947; Ghosh, S., & Vellaisamy, P. (2016b). Closed form estimates for missing counts in multidimensional incomplete tables. arXiv:1602.00947
[5] Ghosh, S.; Vellaisamy, P., On the occurrence of boundary solutions in multidimensional incomplete tables, Statistics & Probability Letters, 119, 63-75 (2016) · Zbl 1398.62148
[6] Ghosh, S.; Vellaisamy, P., On the occurrence of boundary solutions in two-way incomplete tables, (REVSTAT (2017)), (forthcoming) · Zbl 1398.62148
[7] Kim, S.; Park, Y.; Kim, D., On missing-at-random mechanism in two-way incomplete contingency tables, Statistics & Probability Letters, 96, 196-203 (2015) · Zbl 1396.62114
[8] Little, J. A.; Rubin, D. B., Statistical analysis with missing data (2002), Wiley: Wiley New York · Zbl 1011.62004
[9] Molenberghs, G.; Beunckens, C.; Sotto, C.; Kenward, M. G., Every missing not at random model has a missingness at random counterpart with equal fit, Journal of the Royal Statistical Society: Series B, 70, 371-388 (2008) · Zbl 1148.62046
[10] Molenberghs, G.; Kenward, M. G.; Goetghebeur, E., Sensitivity analysis for incomplete contingency tables : the Slovenian plebiscite case, Journal of the Royal Statistical Society. Series C. Applied Statistics, 50, 15-29 (2001) · Zbl 1021.62045
[11] Park, Y.; Kim, D.; Kim, S., Identification of the occurrence of boundary solutions in a contingency table with nonignorable nonresponse, Statistics & Probability Letters, 93, 34-40 (2014) · Zbl 1400.62125
[12] Rochani, H. D.; Vogel, R. L.; Samawi, H. M.; Linder, D. F., Estimates for cell counts and common odds ratio in three-way contingency tables by homogeneous log-linear models with missing data, AStA Advances in Statistical Analysis, 101, 51-65 (2017) · Zbl 1443.62168
[13] Rubin, D. B.; Stern, H. S.; Vehovar, V., Handling Don’t know survey responses : the case of the Slovenian plebiscite, Journal of the American Statistical Association, 90, 822-828 (1995)
[14] Vansteelandt, S.; Goetghebeur, E.; Kenward, M. G.; Molenberghs, G., Ignorance and uncertainty regions as inferential tools in a sensitivity analysis, Statistica Sinica, 16, 953-979 (2006) · Zbl 1108.62005
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.