A new approach to distribution free tests in contingency tables.

*(English)*Zbl 1394.62074Summary: We suggest an extremely wide class of asymptotically distribution free goodness of fit tests for testing independence in two-way contingency tables, or equivalently, independence of two discrete random variables. The nature of these tests is that the test statistics can be viewed as definite functions of the transformation of \(\widehat{T}_n = (\widehat{T}_{ij})=\Big (\frac{\nu_{ij}- n\hat{a}_i\hat{b}_j}{\sqrt{n\hat{a}_i\hat{b}_j}}\Big )\) where \(\nu_{ij}\) are frequencies and \(\hat{a}_i\), \(\hat{b}_j\) are estimated marginal distributions. Our method is also applicable for testing independence of two discrete random vectors. We make some comparisons on statistical powers of the new tests with the conventional chi-square test and suggest some cases in which this class is significantly more powerful.

Full Text:
DOI

##### References:

[1] | Bedrick, EJ, Adjusted chi-squared tests for cross-classified tables of survey data, Biometrika, 70, 591-595, (1983) · Zbl 0543.62036 |

[2] | Gilula, Z; Haberman, SJ, Canonical analysis of contingency tables by maximum likelihood, J Am Stat Assoc, 81, 780-788, (1986) · Zbl 0623.62047 |

[3] | Gilula, Z; Krieger, AM; Ritov, Y, Ordinal association in contingency tables: some interpretive aspects, J Am Stat Assoc, 83, 540-545, (1988) · Zbl 0644.62065 |

[4] | Goodman, LA, Association models and canonical correlation in the analysis of cross-classifications having ordered categories, J Am Stat Assoc, 76, 320-334, (1981) |

[5] | Goodman, LA, The analysis of cross-classified data having ordered and/or unordered categories: association models, correlation models, and asymmetry models for contingency tables with or without missing entries, Ann Stat, 13, 10-69, (1985) · Zbl 0613.62070 |

[6] | Haberman, SJ, Tests for independence in two-way contingency tables based on canonical correlation and on linear-by-linear interaction, Ann Stat, 9, 1178-1186, (1981) · Zbl 0479.62043 |

[7] | Holt, D; Scott, AJ; Ewings, PD, Chi-squared tests with survey data, J R Stat Soc, 143, 302-320, (1980) · Zbl 0461.62014 |

[8] | Khmaladze, E, Goodness of fit problem and scanning innovation martingales, Ann Stat, 21, 798-829, (1993) · Zbl 0801.62043 |

[9] | Khmaladze, E, Note on distribution free testing for discrete distribution, Ann Stat, 41, 2979-2993, (2013) · Zbl 1294.62095 |

[10] | Khmaladze, E, Unitary transformations, empirical processes and distribution free testing, Bernoulli, 22, 563-588, (2016) · Zbl 1345.60094 |

[11] | Koch, GC; Freeman, DH; Freeman, J, Strategies in the multivariate analysis of data from complex surveys, Int Stat Rev, 43, 59-78, (1975) · Zbl 0303.62009 |

[12] | Nelsen RB (2006) An introduction to Copulas. Springer, New York · Zbl 1152.62030 |

[13] | Rao CR (1920) Linear statistical inference and its applications. Wiley, New York |

[14] | Rao, JNK; Scott, AJ, On chi-squared test for multiway contingency tables with cell proportions estimated from survey data, Ann Stat, 12, 46-60, (1984) · Zbl 0622.62059 |

[15] | van der Vaart AW (2000) Asymptotic statistics. Cambridge University Press, Cambridge · Zbl 0910.62001 |

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.