Question 3 - A-Level Maths

OCR MEI S4 2012 June — Question 3 24 marks

Exam Board	OCR MEI
Module	S4 (Statistics 4)
Year	2012
Session	June
Marks	24
Paper	Download PDF ↗
Mark scheme	Download PDF ↗
Topic	Wilcoxon tests
Type	Wilcoxon rank-sum test (Mann-Whitney U test)
Difficulty	Standard +0.3 This is a straightforward application of standard Wilcoxon rank-sum and t-test procedures with small datasets. The question provides clear context, pre-sorted data, and asks for routine hypothesis testing with standard interpretation. While it requires careful ranking and calculation, it involves no novel insight or complex problem-solving beyond textbook methods for Further Maths Statistics.
Spec	5.05c Hypothesis test: normal distribution for population mean 5.07a Non-parametric tests: when to use 5.07d Paired vs two-sample: selection

3 At an agricultural research station, trials are being made of two fertilisers, A and B, to see whether they differ in their effects on the yield of a crop. Preliminary investigations have established that the underlying variances of the distributions of yields using the two fertilisers may be assumed equal. Scientific analysis of the fertilisers has suggested that fertiliser A may be inferior in that it leads, on the whole, to lower yield. A statistical analysis is being carried out to investigate this. The crop is grown in carefully controlled conditions in 14 experimental plots, 6 with fertiliser A and 8 with fertiliser B. The yields, in kg per plot, are as follows, arranged in ascending order for each fertiliser.

Fertiliser A	9.8	10.2	10.9	11.5	12.7	13.3
Fertiliser B	10.8	11.9	12.0	12.2	12.9	13.5	13.6	13.7

Carry out a Wilcoxon rank sum test at the \(5 \%\) significance level to examine appropriate hypotheses.
Carry out a \(t\) test at the \(5 \%\) significance level to examine appropriate hypotheses.
Goodness of fit tests based on more extensive data sets from other trials with these fertilisers have failed to reject hypotheses of underlying Normal distributions. Discuss the relative merits of the analyses in parts (i) and (ii).

Show mark scheme Show mark scheme source

Part (i)

Answer	Marks	Guidance
\(H_0\): population medians are equal \(H_1\): population median for A \(<\) population median for B Wilcoxon rank sum test (or Mann-Whitney form of test) Ranks are: A: 1 2 4 5 9 11; B: 3 6 7 8 10 12 13 14 \(W = 1 + 2 + 4 + 5 + 9 + 11 = 32\) [or \(0 + 0 + 1 + 1 + 4 + 5 = 11\) if M-W used] Refer to \(W_{6,8}\) [or \(MW_{6,8}\) tables] Lower 5% critical point is 31 [or 10 if M-W used] Result is not significant Seems median yields may be assumed equal	B1, B1, M1, A1, B1, M1, A1, A1	[Note: "population" must be explicit]. Explicit statement re shapes of distributions (eg that they are the same shape) is not required. More formal statements of hypotheses gain both marks [eg cdfs are \(F(x)\) and \(F(x - \Delta)\), \(H_0\) is \(\Delta = 0\) etc.]. Combined ranking. Correct [allow up to 2 errors; FT provided M1 earned]. No FT if wrong. No FT if wrong.

Part (ii)

Answer	Marks	Guidance
\(H_0\): population means are equal \(H_1\): population mean for A \(<\) population mean for B For A: \(\bar{x} = 11.4\), \(s_{n-1}^2 = 1.912\) [\(s_n \cdot = 1.38275\)] For B: \(\bar{y} = 12.575\), \(s_{n-1}^2 = 1.051\) [\(s_{n-1} = 1.025\)] Pooled \(s^2 = \frac{(5 \times 1.912) + (7 \times 1.051)}{12} = \frac{16.915}{12} = 1.4096\) Test statistic \(= \frac{11.4 - 12.575}{\sqrt{1.4096 \sqrt{\frac{1}{5} + \frac{1}{8}}}} = \frac{-1.175}{0.6412} = -1.83(25)\) Refer to \(t_{12}\) Lower single-tailed 5% critical point is \(-1.782\) Significant Seems mean yield for A is less than that for B	B1, B1, M1, A1, M1, A1, A1, A1	"population" must be explicit, either in words or by notation. For all. Use of \(s_n\) scores B0. For any reasonable attempt at pooling (but not if \(s_n^2\) used). If correct. Ft if incorrect. No FT if wrong. No FT if wrong. Must compare \(-1.83\) with \(-1.782\) unless it is clear and explicit that absolute values are being used.

Part (iii)

Answer	Marks	Guidance
\(t\) test is "more sensitive" if Normality is correct. Non-rejection of Normality supports \(t\). But Wilcoxon is more reliable if not Normal – and we do not have proof of Normality.	E1, E1, E1, E1	[4]

**Part (i)**

| $H_0$: population medians are equal $H_1$: population median for A $<$ population median for B Wilcoxon rank sum test (or Mann-Whitney form of test) Ranks are: A: 1 2 4 5 9 11; B: 3 6 7 8 10 12 13 14 $W = 1 + 2 + 4 + 5 + 9 + 11 = 32$ [or $0 + 0 + 1 + 1 + 4 + 5 = 11$ if M-W used] Refer to $W_{6,8}$ [or $MW_{6,8}$ tables] Lower 5% critical point is 31 [or 10 if M-W used] Result is not significant Seems median yields may be assumed equal | B1, B1, M1, A1, B1, M1, A1, A1 | [Note: "population" must be explicit]. Explicit statement re shapes of distributions (eg that they are the same shape) is not required. More formal statements of hypotheses gain both marks [eg cdfs are $F(x)$ and $F(x - \Delta)$, $H_0$ is $\Delta = 0$ etc.]. Combined ranking. Correct [allow up to 2 errors; FT provided M1 earned]. No FT if wrong. No FT if wrong. |

**Part (ii)**

| $H_0$: population means are equal $H_1$: population mean for A $<$ population mean for B For A: $\bar{x} = 11.4$, $s_{n-1}^2 = 1.912$ [$s_n \cdot = 1.38275$] For B: $\bar{y} = 12.575$, $s_{n-1}^2 = 1.051$ [$s_{n-1} = 1.025$] Pooled $s^2 = \frac{(5 \times 1.912) + (7 \times 1.051)}{12} = \frac{16.915}{12} = 1.4096$ Test statistic $= \frac{11.4 - 12.575}{\sqrt{1.4096 \sqrt{\frac{1}{5} + \frac{1}{8}}}} = \frac{-1.175}{0.6412} = -1.83(25)$ Refer to $t_{12}$ Lower single-tailed 5% critical point is $-1.782$ Significant Seems mean yield for A is less than that for B | B1, B1, M1, A1, M1, A1, A1, A1 | "population" must be explicit, either in words or by notation. For all. Use of $s_n$ scores B0. For any reasonable attempt at pooling (but not if $s_n^2$ used). If correct. Ft if incorrect. No FT if wrong. No FT if wrong. Must compare $-1.83$ with $-1.782$ unless it is clear and explicit that absolute values are being used. |

**Part (iii)**

| $t$ test is "more sensitive" if Normality is correct. Non-rejection of Normality supports $t$. But Wilcoxon is more reliable if not Normal – and we do not have proof of Normality. | E1, E1, E1, E1 | [4] |

---

Show LaTeX source

3 At an agricultural research station, trials are being made of two fertilisers, A and B, to see whether they differ in their effects on the yield of a crop. Preliminary investigations have established that the underlying variances of the distributions of yields using the two fertilisers may be assumed equal. Scientific analysis of the fertilisers has suggested that fertiliser A may be inferior in that it leads, on the whole, to lower yield. A statistical analysis is being carried out to investigate this.

The crop is grown in carefully controlled conditions in 14 experimental plots, 6 with fertiliser A and 8 with fertiliser B. The yields, in kg per plot, are as follows, arranged in ascending order for each fertiliser.

\begin{center}
\begin{tabular}{ l r r r r r r r r }
Fertiliser A & 9.8 & 10.2 & 10.9 & 11.5 & 12.7 & 13.3 &  &  \\
Fertiliser B & 10.8 & 11.9 & 12.0 & 12.2 & 12.9 & 13.5 & 13.6 & 13.7 \\
\end{tabular}
\end{center}

(i) Carry out a Wilcoxon rank sum test at the $5 \%$ significance level to examine appropriate hypotheses.\\
(ii) Carry out a $t$ test at the $5 \%$ significance level to examine appropriate hypotheses.\\
(iii) Goodness of fit tests based on more extensive data sets from other trials with these fertilisers have failed to reject hypotheses of underlying Normal distributions. Discuss the relative merits of the analyses in parts (i) and (ii).

\hfill \mbox{\textit{OCR MEI S4 2012 Q3 [24]}}

This paper (3 questions)

View full paper

Q1 24 Q2 24 Q3 24