Edexcel S4 — Question 6 18 marks

Exam BoardEdexcel
ModuleS4 (Statistics 4)
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicCentral limit theorem
TypeEstimator properties and bias
DifficultyStandard +0.3 This is a structured multi-part question on unbiased estimators that follows a standard template for S4. Parts (a)-(c) involve routine application of E(aX)=aE(X) and variance formulas. Parts (d)-(f) require understanding of consistency and efficiency but with heavy scaffolding. While it's Further Maths content, the question guides students through each step with minimal novel insight required, making it slightly easier than average.
Spec5.02b Expectation and variance: discrete random variables5.02c Linear coding: effects on mean and variance5.05b Unbiased estimates: of population mean and variance

6. A statistics student is trying to estimate the probability, \(p\), of rolling a 6 with a particular die. The die is rolled 10 times and the random variable \(X _ { 1 }\) represents the number of sixes obtained. The random variable \(R _ { 1 } = \frac { X _ { 1 } } { 10 }\) is proposed as an estimator of \(p\).
  1. Show that \(R _ { 1 }\) is an unbiased estimator of \(p\). The student decided to roll the die again \(n\) times ( \(n > 10\) ) and the random variable \(X _ { 2 }\) represents the number of sixes in these \(n\) rolls. The random variable \(R _ { 2 } = \frac { X _ { 2 } } { n }\) and the random variable \(Y = \frac { 1 } { 2 } \left( R _ { 1 } + R _ { 2 } \right)\).
  2. Show that both \(R _ { 2 }\) and \(Y\) are unbiased estimators of \(p\).
  3. Find \(\operatorname { Var } \left( R _ { 2 } \right)\) and \(\operatorname { Var } ( Y )\).
  4. State giving a reason which of the 3 estimators \(R _ { 1 } , R _ { 2 }\) and \(Y\) are consistent estimators of \(p\).
  5. For the case \(n = 20\) state, giving a reason, which of the 3 estimators \(R _ { 1 } , R _ { 2 }\) and \(Y\) you would recommend. The student's teacher pointed out that a better estimator could be found based on the random variable \(X _ { 1 } + X _ { 2 }\).
  6. Find a suitable estimator and explain why it is better than \(R _ { 1 } , R _ { 2 }\) and \(Y\). END

Question 6:
Part (a)
AnswerMarks Guidance
\(X_1 \sim B(10, p)\) \(\therefore E(X_1) = 10p \Rightarrow E(R_1) = E\!\left(\dfrac{X_1}{10}\right) = \dfrac{10p}{10} = p\)B1 (1 mark)
Part (b)
AnswerMarks Guidance
\(X_2 \sim B(n, p)\) \(\therefore E(X_2) = np \Rightarrow E(R_2) = E\!\left(\dfrac{X_2}{n}\right) = \dfrac{np}{n} = p\)B1
\(E(Y) = E\!\left(\tfrac{1}{2}[R_1 + R_2]\right) = \tfrac{1}{2}[E(R_1) + E(R_2)] = \tfrac{1}{2}[p + p] = p\)B1 (2 marks)
Part (c)
AnswerMarks
\(\text{Var}(R_2) = \dfrac{1}{n^2}\text{Var}(X_2) = \dfrac{np(1-p)}{n^2} = \dfrac{p(1-p)}{n}\)B1
\(\text{Var}(R_1) = \dfrac{p(1-p)}{10}\) \(\therefore \text{Var}(Y) = \tfrac{1}{4}[\text{Var}(R_1) + \text{Var}(R_2)]\)
AnswerMarks Guidance
\(= \dfrac{1}{4}\!\left[\dfrac{p(1-p)}{10} + \dfrac{p(1-p)}{n}\right]\)M1, A1 (3 marks)
Part (d)
AnswerMarks Guidance
Since \(\text{Var}(R_2) = \dfrac{p(1-p)}{n} \to 0\) as \(n \to \infty\), \(\therefore R_2\) is consistentM1, A1 (2 marks)
Part (e)
\(\text{Var}(R_1) = \dfrac{p(1-p)}{10} > \dfrac{p(1-p)}{20} = \text{Var}(R_2)\)
AnswerMarks Guidance
\(\text{Var}(Y) = \dfrac{p(1-p)}{4}\!\left[\dfrac{1}{10} + \dfrac{1}{20}\right] = \dfrac{p(1-p)}{80} \times 3 < \text{Var}(R_2)\)M1
Since all 3 are unbiased, we select the one with minimum variance, i.e. \(Y\)A1 (2 marks)
Part (f)
AnswerMarks
\(X_1 + X_2 \sim B(n+10, p)\) so consider \(\dfrac{X_1 + X_2}{n+10}\)B1
\(E\!\left(\dfrac{X_1+X_2}{n+10}\right) = \dfrac{(n+10)p}{(n+10)} = p\) (show unbiased)M1
\(\text{Var}\!\left(\dfrac{X_1+X_2}{n+10}\right) = \dfrac{p(1-p)}{n+10}\) (find variance)M1
\(\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{10}\) \(\therefore\) always better than \(R_1\)
AnswerMarks Guidance
and \(\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{n}\) \(\therefore\) always better than \(R_2\)A1 both
\(\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{4}\!\left[\dfrac{n+10}{10n}\right]\)
\(\Leftrightarrow 40n < 100 + 20n + n^2\)
AnswerMarks Guidance
\(\Leftrightarrow 0 < (10-n)^2\) Show better than \(Y\); use of \(n=20\) acceptableM1
\(\therefore \dfrac{X_1+X_2}{n+10}\) is unbiased and always has smaller varianceA1 cso (6 marks) (16 marks total)
# Question 6:

## Part (a)
$X_1 \sim B(10, p)$ $\therefore E(X_1) = 10p \Rightarrow E(R_1) = E\!\left(\dfrac{X_1}{10}\right) = \dfrac{10p}{10} = p$ | B1 | (1 mark)

## Part (b)
$X_2 \sim B(n, p)$ $\therefore E(X_2) = np \Rightarrow E(R_2) = E\!\left(\dfrac{X_2}{n}\right) = \dfrac{np}{n} = p$ | B1 |

$E(Y) = E\!\left(\tfrac{1}{2}[R_1 + R_2]\right) = \tfrac{1}{2}[E(R_1) + E(R_2)] = \tfrac{1}{2}[p + p] = p$ | B1 | (2 marks)

## Part (c)
$\text{Var}(R_2) = \dfrac{1}{n^2}\text{Var}(X_2) = \dfrac{np(1-p)}{n^2} = \dfrac{p(1-p)}{n}$ | B1 |

$\text{Var}(R_1) = \dfrac{p(1-p)}{10}$ $\therefore \text{Var}(Y) = \tfrac{1}{4}[\text{Var}(R_1) + \text{Var}(R_2)]$

$= \dfrac{1}{4}\!\left[\dfrac{p(1-p)}{10} + \dfrac{p(1-p)}{n}\right]$ | M1, A1 | (3 marks)

## Part (d)
Since $\text{Var}(R_2) = \dfrac{p(1-p)}{n} \to 0$ as $n \to \infty$, $\therefore R_2$ is consistent | M1, A1 | (2 marks)

## Part (e)
$\text{Var}(R_1) = \dfrac{p(1-p)}{10} > \dfrac{p(1-p)}{20} = \text{Var}(R_2)$

$\text{Var}(Y) = \dfrac{p(1-p)}{4}\!\left[\dfrac{1}{10} + \dfrac{1}{20}\right] = \dfrac{p(1-p)}{80} \times 3 < \text{Var}(R_2)$ | M1 |

Since all 3 are unbiased, we select the one with minimum variance, i.e. $Y$ | A1 | (2 marks)

## Part (f)
$X_1 + X_2 \sim B(n+10, p)$ so consider $\dfrac{X_1 + X_2}{n+10}$ | B1 |

$E\!\left(\dfrac{X_1+X_2}{n+10}\right) = \dfrac{(n+10)p}{(n+10)} = p$ (show unbiased) | M1 |

$\text{Var}\!\left(\dfrac{X_1+X_2}{n+10}\right) = \dfrac{p(1-p)}{n+10}$ (find variance) | M1 |

$\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{10}$ $\therefore$ always better than $R_1$

and $\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{n}$ $\therefore$ always better than $R_2$ | A1 | both

$\dfrac{p(1-p)}{n+10} < \dfrac{p(1-p)}{4}\!\left[\dfrac{n+10}{10n}\right]$

$\Leftrightarrow 40n < 100 + 20n + n^2$

$\Leftrightarrow 0 < (10-n)^2$ Show better than $Y$; use of $n=20$ acceptable | M1 |

$\therefore \dfrac{X_1+X_2}{n+10}$ is unbiased and always has smaller variance | A1 cso | (6 marks) **(16 marks total)**
6. A statistics student is trying to estimate the probability, $p$, of rolling a 6 with a particular die. The die is rolled 10 times and the random variable $X _ { 1 }$ represents the number of sixes obtained. The random variable $R _ { 1 } = \frac { X _ { 1 } } { 10 }$ is proposed as an estimator of $p$.
\begin{enumerate}[label=(\alph*)]
\item Show that $R _ { 1 }$ is an unbiased estimator of $p$.

The student decided to roll the die again $n$ times ( $n > 10$ ) and the random variable $X _ { 2 }$ represents the number of sixes in these $n$ rolls. The random variable $R _ { 2 } = \frac { X _ { 2 } } { n }$ and the random variable $Y = \frac { 1 } { 2 } \left( R _ { 1 } + R _ { 2 } \right)$.
\item Show that both $R _ { 2 }$ and $Y$ are unbiased estimators of $p$.
\item Find $\operatorname { Var } \left( R _ { 2 } \right)$ and $\operatorname { Var } ( Y )$.
\item State giving a reason which of the 3 estimators $R _ { 1 } , R _ { 2 }$ and $Y$ are consistent estimators of $p$.
\item For the case $n = 20$ state, giving a reason, which of the 3 estimators $R _ { 1 } , R _ { 2 }$ and $Y$ you would recommend.

The student's teacher pointed out that a better estimator could be found based on the random variable $X _ { 1 } + X _ { 2 }$.
\item Find a suitable estimator and explain why it is better than $R _ { 1 } , R _ { 2 }$ and $Y$.

END
\end{enumerate}

\hfill \mbox{\textit{Edexcel S4  Q6 [18]}}