Bulletin Editor Anirban DasGupta sets this problem. Student members of the IMS are invited to submit solutions (to bulletin@imstat.org with subject “Student Puzzle Corner”). The deadline is now September 8, 2016.


 

It is the turn of a problem on probability this time. We will consider a problem that looks like a problem on analysis. Many of you know that analysis and probability share a strong synergistic relationship; there are a number of classic texts on how analysis and probability feed into each other. The problem will be left slightly open ended to whet your imagination. Here is the exact problem of this issue:

(a) Let f be a given function on the unit interval [0,1]. Define now a sequence of functions fn by the rule fn(x)=The average value of f over the interval[k2n,k+12n], if x[k2n,k+12n].
What is the weakest sufficient condition you can provide under which fn(x)f(x) for almost all x? Give a proof of your claim.

(b) For extra credit only: Fix an ϵ>0. Is it true that on some set B with Bdx<ϵ,supxB|fn(x)f(x)|0? That is, is it true that in fact outside of a set of arbitrarily small measure, fn will converge uniformly to f?


 

Solution to Puzzle 14

We had Tom Berrett at Oxford University, Promit Ghosal at Columbia University, and Haozhe Zhang at the Iowa State University [below, left–right] send us correct answers to both parts of the previous puzzle; congratulations to all of them.

Tom Berrett Promit Ghosal Haozhe Zhang
Tom Berrett Promit Ghosal Haozhe Zhang

 

Let us recall the problem. Suppose given p,X1,X2, are iid Bernoulli with parameter p. Let p have a prior distribution with a Lebesgue density g. Suppose θn=Pg(Xn+1==X2n=1|X1==Xn=1) denote the Bayesian predictive probability that the next n trials will all be successes if the previous n have all been so. Then, find the limit of the sequence θn when g is a general Beta density with parameters α,β, and for a general g which is infinitely smooth
at p=1.

First consider the case of a Beta prior. In that case, the posterior distribution of p given that X1==Xn=1 is Beta with parameters n+α and β. Therefore,

θn=Ep|X1==Xn=1[P(Xn+1==X2n=1|X1==Xn=1,p)]
=Ep|X1==Xn=1(pn)
=Γ(α+β+n)Γ(α+n)Γ(β)01pnpn+α1(1p)β1dp
=Γ(α+2n)Γ(α+β+n)Γ(α+n)Γ(α+β+2n)

By Stirling’s approximation to Γ(x) as x, and using the notation to mean that the ratio of the two sequences converges to 1, we now get

θneα2n(1+α2n)2n(2n)2n+αeαβn(1+α+βn)nnn+α+βeαβ2n(1+α+β2n)2n(2n)2n+α+βeαn(1+αn)nnn+α
2β.

Notice that the limit of θn does not depend on α; this is because only the local behavior of g
near p=1 matters for the limiting behavior of θn. In particular, if p has a uniform prior, then the limit of θn is 12: you ought to be 5050 about the question {\it will our Sun last for the next 4.5 billion years}? Very interesting!

Consider now the case of a general prior density g with infinitely smooth local behavior at p=1. Let k be defined as the unique nonnegative integer such that g(1)==g(k1)(1)=0,g(k)(1)0. Then, as in the case of the special Beta prior,

θn=01p2ng(p)dp01png(p)dp.

Using {\it Watson’s Lemma}, and once again, Stirling’s approximation,

01png(p)dp(1)kg(k)(1)k!Γ(n+1)Γ(k+1)Γ(n+k+2)(1)kg(k)(1)nk+1.

Hence,

θn1(2n)k+11nk+1=2k1.