Edexcel S4 2004 June — Question 5 15 marks

Exam BoardEdexcel
ModuleS4 (Statistics 4)
Year2004
SessionJune
Marks15
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicDiscrete Random Variables
TypeCalculating bias of estimator
DifficultyStandard +0.3 This is a straightforward S4 question testing standard definitions and routine calculations of bias and variance for estimators. Students need to apply E(X_i) = np and Var(X_i) = np(1-p), then use linearity of expectation—all mechanical applications of known formulas with no novel insight required. Slightly easier than average due to its formulaic nature.
Spec2.04b Binomial distribution: as model B(n,p)5.05b Unbiased estimates: of population mean and variance

5. (a) Explain briefly what you understand by
  1. an unbiased estimator,
  2. a consistent estimator.
    of an unknown population parameter \(\theta\). From a binomial population, in which the proportion of successes is \(p , 3\) samples of size \(n\) are taken. The number of successes \(X _ { 1 } , X _ { 2 }\), and \(X _ { 3 }\) are recorded and used to estimate \(p\).
    (b) Determine the bias, if any, of each of the following estimators of \(p\). $$\begin{aligned} & \hat { p } _ { 1 } = \frac { X _ { 1 } + X _ { 2 } + X _ { 3 } } { 3 n } \\ & \hat { p } _ { 2 } = \frac { X _ { 1 } + 3 X _ { 2 } + X _ { 3 } } { 6 n } \\ & \hat { p } _ { 3 } = \frac { 2 X _ { 1 } + 3 X _ { 2 } + X _ { 3 } } { 6 n } \end{aligned}$$ (c) Find the variance of each of these estimators.
    (d) State, giving a reason, which of the three estimators for \(p\) is
  3. the best estimator,
  4. the worst estimator.

AnswerMarks
(a)(i) \(E(\hat{\theta}) = \theta\)B1
(ii) \(E(\hat{\theta}) = \theta\) or \(E(\hat{\theta}) \to \theta\) and \(\text{Var}(\hat{\theta}) \to 0\) as \(n \to \infty\) where \(n\) is the sample sizeB1; B1
(3)
(b) \(E(\hat{p}_1) = p, \therefore \text{Bias} = 0\)B1
\(E(\hat{p}_2) = \frac{5p}{6}, \therefore \text{Bias} = \frac{1}{6}p\)B1 B1
\(E(\hat{p}_3) = p, \therefore \text{Bias} = 0\)B1
(4)
(c) \(\text{Var}(\hat{p}_1) = \frac{1}{9n^2}\{npq + npq + npq\} = \frac{pq}{3n}\)M1; A1
\(\text{Var}(\hat{p}_2) = \frac{1}{36n^2}\{npq + 9npq + npq\} = \frac{11pq}{36n}\)A1
\(\text{Var}(\hat{p}_3) = \frac{1}{36n^2}\{4npq + 9npq + npq\} = \frac{7pq}{18n}\)A1
(4)
(d)(i) \(\hat{p}_1\); unbiased and smallest varianceB1 dep; B1
(ii) \(\hat{p}_2\); biasedB1 dep; B1
(4)
(15 marks)
| **(a)(i)** $E(\hat{\theta}) = \theta$ | B1 |
| **(ii)** $E(\hat{\theta}) = \theta$ or $E(\hat{\theta}) \to \theta$ and $\text{Var}(\hat{\theta}) \to 0$ as $n \to \infty$ where $n$ is the sample size | B1; B1 |
| | (3) |
| **(b)** $E(\hat{p}_1) = p, \therefore \text{Bias} = 0$ | B1 |
| $E(\hat{p}_2) = \frac{5p}{6}, \therefore \text{Bias} = \frac{1}{6}p$ | B1 B1 |
| $E(\hat{p}_3) = p, \therefore \text{Bias} = 0$ | B1 |
| | (4) |
| **(c)** $\text{Var}(\hat{p}_1) = \frac{1}{9n^2}\{npq + npq + npq\} = \frac{pq}{3n}$ | M1; A1 |
| $\text{Var}(\hat{p}_2) = \frac{1}{36n^2}\{npq + 9npq + npq\} = \frac{11pq}{36n}$ | A1 |
| $\text{Var}(\hat{p}_3) = \frac{1}{36n^2}\{4npq + 9npq + npq\} = \frac{7pq}{18n}$ | A1 |
| | (4) |
| **(d)(i)** $\hat{p}_1$; unbiased and smallest variance | B1 dep; B1 |
| **(ii)** $\hat{p}_2$; biased | B1 dep; B1 |
| | (4) |
| | (15 marks) |
5. (a) Explain briefly what you understand by
\begin{enumerate}[label=(\roman*)]
\item an unbiased estimator,
\item a consistent estimator.\\
of an unknown population parameter $\theta$.

From a binomial population, in which the proportion of successes is $p , 3$ samples of size $n$ are taken. The number of successes $X _ { 1 } , X _ { 2 }$, and $X _ { 3 }$ are recorded and used to estimate $p$.\\
(b) Determine the bias, if any, of each of the following estimators of $p$.

$$\begin{aligned}
& \hat { p } _ { 1 } = \frac { X _ { 1 } + X _ { 2 } + X _ { 3 } } { 3 n } \\
& \hat { p } _ { 2 } = \frac { X _ { 1 } + 3 X _ { 2 } + X _ { 3 } } { 6 n } \\
& \hat { p } _ { 3 } = \frac { 2 X _ { 1 } + 3 X _ { 2 } + X _ { 3 } } { 6 n }
\end{aligned}$$

(c) Find the variance of each of these estimators.\\
(d) State, giving a reason, which of the three estimators for $p$ is
\item the best estimator,
\item the worst estimator.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S4 2004 Q5 [15]}}