Matched case-control data analysis with selection bias. (English) Zbl 1209.62306

Summary: Case-control studies offer a rapid and efficient way to evaluate hypotheses. On the other hand, proper selection of the controls is challenging, and the potential for selection bias is a major weakness. Valid inferences about parameters of interest cannot be drawn if selection bias exists. Furthermore, the selection bias is difficult to evaluate. Even in situations where selection bias can be estimated, few methods are available. In the matched case-control Northern Manhattan Stroke Study (NOMASS), stroke-free controls are sampled in two stages. First, a telephone survey ascertains demographic and exposure status from a large random sample. Then, in an in-person interview, detailed information is collected for the selected controls to be used in a matched case – control study. The telephone survey data provides information about the selection probability and the potential selection bias. In this article, we propose bias-corrected estimators in a case-control study using a joint estimating equation approach. The proposed bias-corrected estimate and its standard error can be easily obtained by standard statistical software.


62P10 Applications of statistics to biology and medical sciences; meta analysis
62N02 Estimation in survival analysis and censored data
Full Text: DOI


[1] Clayton, Analysis of longitudinal binary data from multiphase sampling, Journal of the Royal Statistical Society, Series B 60 pp 71– (1998) · Zbl 0909.62005
[2] Kleinbaum, Selection bias in epidemiologic studies, American Journal of Epidemiology 113 pp 452– (1981)
[3] Liang, Longitudinal data analysis using generalized linear models, Biometrika 73 pp 13– (1986) · Zbl 0595.62110
[4] Lipsitz, Jackknife estimators of variance for parameter estimates from estimating equations with applications to clustered survival data, Biometrics 50 pp 842– (1994) · Zbl 0825.62783
[5] Maclure, Analysis of selection bias in a case-control study of renal adenocarcinoma, Epidemiology 1 pp 441– (1990)
[6] Paik, Matched case-control data analyses with missing covariates, Applied Statistics 49 pp 145– (2000) · Zbl 0973.62104
[7] Prentice, Retrospective studies and failure time models, Biometrika 65 pp 153– (1978) · Zbl 0377.92014
[8] Prentice, Logistic disease incidence models and case-control studies, Biometrika 66 pp 403– (1979) · Zbl 0428.62078
[9] Rosner, Measurement error models for ordinal exposure variables measured with error, Statistics in Medicine 15 pp 293– (1996)
[10] Sacco, The protective effect of moderate alcohol consumption on ischemic stroke, Journal of the American Medical Association 281 pp 53– (1999)
[11] Weinberg, The design and analysis of case-control studies with biased sampling, Biometrics 46 pp 953– (1990)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.