Selection adjusted confidence intervals with more power to determine the sign.

*(English)* Zbl 06158333

Summary: In many current large-scale problems, confidence intervals (CIs) are constructed only for the parameters that are large, as indicated by their estimators, ignoring the smaller parameters. Such selective inference poses a problem to the usual marginal CIs that no longer offer the right level of coverage, not even on the average over the selected parameters. We address this problem by developing three methods to construct short and valid CIs for the location parameter of a symmetric unimodal distribution, while conditioning on its estimator being larger than some constant threshold. In two of these methods, the CI is further required to offer early sign determination, that is, to avoid including parameters of both signs for relatively small values of the estimator. One of the two, the Conditional Quasi-Conventional CI, offers a good balance between length and sign determination while protecting from the effect of selection. The CI is not symmetric, extending more toward 0 than away from it, nor is it of constant shape. However, when the estimator is far away from the threshold, the proposed CI tends to the usual marginal one. In spite of its complexity, it is specified by closed form expressions, up to a small set of constants that are each the solution of a single variable equation.

When multiple testing procedures are used to control the false discovery rate or other error rates, the resulting selection threshold may be data dependent. We show that conditioning the above CIs on the data-dependent threshold still offers false coverage-statement rate (FCR) control for many widely used testing procedures. For these reasons, the conditional CIs for the parameters selected this way are an attractive alternative to the available general FCR adjusted intervals. We demonstrate the use of the method in the analysis of some 14,000 correlations between hormone change and brain activity change in response to the subjects being exposed to stressful movie clips. Supplementary materials for this article are available online.

##### MSC:

62 Statistics

*A. Weinstein* et al., J. Am. Stat. Assoc. 108, No. 501, 165–176 (2013; Zbl 06158333)
