Test of significance when data are curves. (English) Zbl 1064.62525

Summary: With modern technology, massive data can easily be collected in a form of multiple sets of curves. New statistical challenges include testing whether there is any statistically significant difference among these sets of curves. In this article we propose some new tests for comparing two groups of curves based on the adaptive Neyman test and the wavelet thresholding techniques introduced earlier by Fan. We demonstrate that these tests inherit the properties outlined by Fan and that they are simple and powerful for detecting differences between two sets of curves. We then further generalize the idea to compare multiple sets of curves, resulting in an adaptive high-dimensional analysis of variance, called HANOVA. These newly developed techniques are illustrated by using a dataset on pizza commercials where observations are curves and an analysis of cornea topography in ophthalmology where images of individuals are observed. A simulation example is also presented to illustrate the power of the adaptive Neyman test.


62G10 Nonparametric hypothesis testing
65C60 Computational problems in statistics (MSC2010)
Full Text: DOI Link