January 9, 2021
Robin of Foxley has entered the FiveThirtyEight archery tournament. Her aim is excellent (relatively speaking), as she is guaranteed to hit the circular target, which has no subdivisions β itβs just one big circle. However, her arrows are equally likely to hit each location within the target.
Her true love, Marian, has issued a challenge. Robin must fire as many arrows as she can, such that each arrow is closer to the center of the target than the previous arrow. For example, if Robin fires three arrows, each closer to the center than the previous, but the fourth arrow is farther than the third, then she is done with the challenge and her score is four.
On average, what score can Robin expect to achieve in this archery challenge?
Answer:
Explanation:
Solving this problem requires patiently going through steps. It's all math from here. There will be no computer simulations involved as I had pledged:
I will take a "No Simulations" challenge this time for the circle question just because it's too tempting. π
β David Ding (@DavidYiweiDing) January 8, 2021
So, let's begin!
Without loss of generality, let's assume the target is a unit circle (i.e with radius of 1). From the problem, it is given that Robin's arrow has an equal chance of hitting any points within the circle. In polar coordinates, let a point be
Clearly, there's more points "to be picked" as we increase the radius if all points are equally likely to be picked. Therefore, we would be more likely to pick points closer to the circle itself, i.e. points with larger radius values. We can even quantify this by using the formula for circle's circumference:
So we have:
By definition:
The function
The cdf of
From the original puzzle, Robin is to shoot arrows until she shoots one that is further away from the center than the previous one. Therefore, her penultimate shot will hit a point with the minimum radius. Here let's figure out the probability that her nth shot will be the one yielding the smallest radius.
Proposition: Let
Intuitively, this makes sense. All
The method we use here is induction. The base case is when we only have two radius values, i.e.
We have shown in the base case that
Then, let us take a look at the
Where the last line we made use of our induction hypothesis.
Nonetheless, the expression
So for now, let us denote
Okay, so that was a lot to unpack here, so let me describe everything line by line. First, by definition, the cdf of a continuous random variable is the probability that the random variable is less than or equal to some value, here denoted as
Computation courtesy of WolframAlpha.
Therefore, by the principles of mathematical induction,
We now revisit our puzzle setup and figure out the expected score for Robin. Here we make use of a stochastic chart:
Robin takes her first shot and yields
The probabilities written on each stochastic path comes from our proven proposition:
From the stochastic chart it is easy to see that for
Hence, the expected value is:
Here I used a change of dummy variables to let
Marian now uses a target with 10 concentric circles, whose radii are 1, 2, 3, etc., all the way up to 10 β the radius of the entire target. This time, Robin must fire as many arrows as she can, such that each arrow falls within a smaller concentric circle than the previous arrow. On average, what score can Robin expect to achieve in this version of the archery challenge?
Answer: Approximately 2.5585
Explanation:
Here we are taking a computer-science detour and taking dynamic programming for a spin. The strategy is to understand that once Robin has hit a ring, say ring 5, the next shot is either ring 5 or higher in which case she ends the journey, or she hits ring 1, 2, 3, or 4. Suppose we already know the expected score for those 4 rings, then the overall expectation would be either 2 shots and out or 1 shot + however many expected shots for ring 4 and lower. Dynamic programming is all about building upon previous solutions, and here we will make use of that.
Let
For the p's, since inside the target all points are equally likely to be hit, the probability of hitting a ring would simply be the ratio of the area of the ring divided by the area of the entire circle (ring 1 is technically the unit circle, I know). I will leave this to the readers to show that
As for the E's, we will leverage dynamic program and build our solution from the center out. Starting with
Since the probability of NOT hitting ring 1 to extend the score would be
And so on and so forth. When we are computing
The following is the algorithm for computing the expected score using MATLAB:
pInput = 1:10; p = getProb(pInput); %p = (2*pInput - 1) / 100 E = NaN(1, 10); E(1) = 2; for k = 2:10 E(k) = 2*(1 - sum(p(1:k-1))); for i = 1:k-1 E(k) = E(k) + p(i)*(E(i) + 1); end end EFinal = sum(p.*E);
The result displayed is:
>> EFinal EFinal = 2.5585
Here is the various values of
>> E E = 2.0000 2.0100 2.0403 2.0923 2.1688 2.2740 2.4141 2.5979 2.8376 3.1500
Please note: even though I used the aid of computers, it was not a simulation, but rather a computation.
It is interesting to note that the value of 2.5585 is very close to the value of