Congratulations to Mirza Uzair Baig at the University of Hawai’i at Mānoa, who wrote an excellent solution to the problem.
Note that the statistic Tn may be represented as
Denote the empirical CDF of by and that of by . Then, this above representation yields
Use the fact that for given and are binomial random variables with success probabilities and . Now use the iterated expectation formula by conditioning on the minima and the maxima to get the mean, and similarly, but with a longer calculation, the variance.
It is useful to think of as approximately a sum of two geometrics. Suppose is a negative binomial with parameters . Then for not too small, would have a point mass at zero mixed with the negative binomial. That is, write down a Bernoulii variable with parameter ; then (in law) is approximately . This gives a quick explanation for why the mean and the variance under the null of should be about and . You can see a plot below of the
null distribution of below when ; it is distribution-free in its usual sense.
Under specified alternatives, the negative binomial would be replaced by a sum of two geometrics, approximately independent, but not i.i.d.
—
The next puzzle, number 21, is here. Can you solve it? Send us your answer by September 7.
Comments on “Solution to Student Puzzle 20”