A review of regression procedures for randomized response data, including univariate and multivariate logistic regression, the proportional odds model and item response model, and self-protective responses.

*(English)*Zbl 1365.62030
Chaudhuri, Arijit (ed.) et al., Data gathering, analysis and protection of privacy through randomized response techniques: qualitative and quantitative human traits. Amsterdam: Elsevier/North Holland (ISBN 978-0-444-63570-9/hbk; 978-0-444-63571-6/ebook). Handbook of Statistics 34, 287-315 (2016).

Summary: In survey research, it is often problematic to ask people sensitive questions because they may refuse to answer or they may provide a socially desirable answer that does not reveal their true status on the sensitive question. To solve this problem S. L. Warner [J. Am. Stat. Assoc. 60, No. 309, 63–69 (1965; Zbl 1298.62024)] proposed randomized response (RR). Here, a chance mechanism hides why respondents say yes or no to the question being asked. Thus far RR has been mainly used in research to estimate the prevalence of sensitive characteristics. It is not uncommon that researchers wrongly believe that the RR procedure has the drawback that it is not possible to relate the sensitive characteristics to explanatory variables. Here, we provide a review of the literature of regression procedures for dichotomous RR data. Univariate RR data can be analyzed with a version of logistic regression that is adapted so that it can handle data collected by RR. Subsequently the manuscript presents extensions towards repeated cross-sectional data that allowed for a change in the design with which the RR data are collected. We also review regression procedures for multivariate dichotomous RR data, such as the model by G. F. V. Glonek and P. McCullagh [J. R. Stat. Soc., Ser. B 57, No. 3, 533–546 (1995; Zbl 0827.62059)], a model for the sum of a set of dichotomous RR data, and a model from item response theory that assumes a latent variable that explains the answers on the RR variables. We end with a discussion of a recent development in the analysis of multivariate RR data, namely models that take into account that there may be respondents that do not follow the instructions of the RR design by answering no whatever the sensitive question asked. These are coined self-protective responses.

For the entire collection see [Zbl 1349.62001].

For the entire collection see [Zbl 1349.62001].

##### MSC:

62D05 | Sampling theory, sample surveys |

94A50 | Theory of questionnaires |

62J12 | Generalized linear models (logistic models) |

62P15 | Applications of statistics to psychology |