OCR S4 2015 June — Question 8 12 marks

Exam BoardOCR
ModuleS4 (Statistics 4)
Year2015
SessionJune
Marks12
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicDiscrete Random Variables
TypeVariance of estimators
DifficultyChallenging +1.2 This is a Further Maths S4 question on estimator properties requiring standard variance calculations for binomial distributions, algebraic manipulation to compare variances, and application of a given inequality hint. While it involves multiple steps and some algebraic work, the techniques are routine for this level and the inequality hint significantly guides part (ii).
Spec5.05b Unbiased estimates: of population mean and variance

8 The independent random variables \(X _ { 1 }\) and \(X _ { 2 }\) have the distributions \(\mathrm { B } \left( n _ { 1 } , \theta \right)\) and \(\mathrm { B } \left( n _ { 2 } , \theta \right)\) respectively. Two possible estimators for \(\theta\) are $$T _ { 1 } = \frac { 1 } { 2 } \left( \frac { X _ { 1 } } { n _ { 1 } } + \frac { X _ { 2 } } { n _ { 2 } } \right) \text { and } T _ { 2 } = \frac { X _ { 1 } + X _ { 2 } } { n _ { 1 } + n _ { 2 } } .$$
  1. Show that \(T _ { 1 }\) and \(T _ { 2 }\) are both unbiased estimators, and calculate their variances.
  2. Find \(\frac { \operatorname { Var } \left( T _ { 1 } \right) } { \operatorname { Var } \left( T _ { 2 } \right) }\). Given that \(n _ { 1 } \neq n _ { 2 }\), use the inequality \(\left( n _ { 1 } - n _ { 2 } \right) ^ { 2 } > 0\) to find which of \(T _ { 1 }\) and \(T _ { 2 }\) is the more efficient estimator.

(i) Show that \(T_1\) and \(T_2\) are unbiased, and calculate their variances
AnswerMarks Guidance
\(T_1\) and \(T_2\) both unbiased. \(\text{Var}(T_1) = \frac{\theta^2(1-\theta)}{n_1} + \frac{\theta^2(1-\theta)}{n_2}\), \(\text{Var}(T_2) = \frac{\theta(1-\theta)}{n_1 + n_2}\)M1 M1 M1 M1 A1 A1 For \(T_1 = \frac{1}{2}\left(\frac{X_1}{n_1} + \frac{X_2}{n_2}\right)\): \(E(T_1) = \frac{1}{2}(\theta + \theta) = \theta\). For \(T_2 = \frac{X_1 + X_2}{n_1 + n_2}\): \(E(T_2) = \frac{n_1\theta + n_2\theta}{n_1 + n_2} = \theta\). Both unbiased. \(\text{Var}(T_1) = \frac{1}{4}\left[\frac{\theta(1-\theta)}{n_1} + \frac{\theta(1-\theta)}{n_2}\right] = \frac{\theta(1-\theta)}{4}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)\). \(\text{Var}(T_2) = \frac{\theta(1-\theta)}{n_1 + n_2}\)
(ii) Find \(\frac{\text{Var}(T_1)}{\text{Var}(T_2)}\) and determine more efficient estimator
AnswerMarks Guidance
\(\frac{\text{Var}(T_1)}{\text{Var}(T_2)} = \frac{(n_1 + n_2)}{4(n_1 n_2)} \cdot \frac{1}{(n_1 + n_2)} \cdots\) leading to result. \(T_2\) is more efficient when \(n_1 \neq n_2\)M1 M1 M1 M1 A1 \(\frac{\text{Var}(T_1)}{\text{Var}(T_2)} = \frac{\frac{\theta(1-\theta)}{4}\left(\frac{n_1 + n_2}{n_1n_2}\right)}{\frac{\theta(1-\theta)}{n_1+n_2}} = \frac{(n_1+n_2)^2}{4n_1n_2}\). Since \((n_1 - n_2)^2 > 0\), we have \(n_1^2 + n_2^2 + 2n_1n_2 > 4n_1n_2\), so \((n_1+n_2)^2 > 4n_1n_2\), meaning \(\frac{\text{Var}(T_1)}{\text{Var}(T_2)} > 1\). Therefore \(T_2\) is the more efficient estimator.
**(i) Show that $T_1$ and $T_2$ are unbiased, and calculate their variances**

| $T_1$ and $T_2$ both unbiased. $\text{Var}(T_1) = \frac{\theta^2(1-\theta)}{n_1} + \frac{\theta^2(1-\theta)}{n_2}$, $\text{Var}(T_2) = \frac{\theta(1-\theta)}{n_1 + n_2}$ | M1 M1 M1 M1 A1 A1 | For $T_1 = \frac{1}{2}\left(\frac{X_1}{n_1} + \frac{X_2}{n_2}\right)$: $E(T_1) = \frac{1}{2}(\theta + \theta) = \theta$. For $T_2 = \frac{X_1 + X_2}{n_1 + n_2}$: $E(T_2) = \frac{n_1\theta + n_2\theta}{n_1 + n_2} = \theta$. Both unbiased. $\text{Var}(T_1) = \frac{1}{4}\left[\frac{\theta(1-\theta)}{n_1} + \frac{\theta(1-\theta)}{n_2}\right] = \frac{\theta(1-\theta)}{4}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)$. $\text{Var}(T_2) = \frac{\theta(1-\theta)}{n_1 + n_2}$ |

**(ii) Find $\frac{\text{Var}(T_1)}{\text{Var}(T_2)}$ and determine more efficient estimator**

| $\frac{\text{Var}(T_1)}{\text{Var}(T_2)} = \frac{(n_1 + n_2)}{4(n_1 n_2)} \cdot \frac{1}{(n_1 + n_2)} \cdots$ leading to result. $T_2$ is more efficient when $n_1 \neq n_2$ | M1 M1 M1 M1 A1 | $\frac{\text{Var}(T_1)}{\text{Var}(T_2)} = \frac{\frac{\theta(1-\theta)}{4}\left(\frac{n_1 + n_2}{n_1n_2}\right)}{\frac{\theta(1-\theta)}{n_1+n_2}} = \frac{(n_1+n_2)^2}{4n_1n_2}$. Since $(n_1 - n_2)^2 > 0$, we have $n_1^2 + n_2^2 + 2n_1n_2 > 4n_1n_2$, so $(n_1+n_2)^2 > 4n_1n_2$, meaning $\frac{\text{Var}(T_1)}{\text{Var}(T_2)} > 1$. Therefore $T_2$ is the more efficient estimator. |
8 The independent random variables $X _ { 1 }$ and $X _ { 2 }$ have the distributions $\mathrm { B } \left( n _ { 1 } , \theta \right)$ and $\mathrm { B } \left( n _ { 2 } , \theta \right)$ respectively. Two possible estimators for $\theta$ are

$$T _ { 1 } = \frac { 1 } { 2 } \left( \frac { X _ { 1 } } { n _ { 1 } } + \frac { X _ { 2 } } { n _ { 2 } } \right) \text { and } T _ { 2 } = \frac { X _ { 1 } + X _ { 2 } } { n _ { 1 } + n _ { 2 } } .$$

(i) Show that $T _ { 1 }$ and $T _ { 2 }$ are both unbiased estimators, and calculate their variances.\\
(ii) Find $\frac { \operatorname { Var } \left( T _ { 1 } \right) } { \operatorname { Var } \left( T _ { 2 } \right) }$. Given that $n _ { 1 } \neq n _ { 2 }$, use the inequality $\left( n _ { 1 } - n _ { 2 } \right) ^ { 2 } > 0$ to find which of $T _ { 1 }$ and $T _ { 2 }$ is the more efficient estimator.

\hfill \mbox{\textit{OCR S4 2015 Q8 [12]}}