OCR MEI S4 2010 June — Question 4 24 marks

Exam BoardOCR MEI
ModuleS4 (Statistics 4)
Year2010
SessionJune
Marks24
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared test of independence
TypeStandard 3×3 contingency table
DifficultyStandard +0.3 This is a straightforward application of one-way ANOVA with all calculations scaffolded by given summaries. Parts (i) and (ii) test standard recall of experimental design and model interpretation. Part (iii) requires routine computation of SS_between and SS_within using provided summaries, then a standard F-test lookup. No conceptual challenges or novel problem-solving required—slightly easier than average A-level due to the computational scaffolding.
Spec5.06c Fit other distributions: discrete and continuous5.06d Goodness of fit: chi-squared test

4 At an agricultural research station, a trial is made of four varieties (A, B, C, D) of a certain crop in an experimental field. The varieties are grown on plots in the field and their yields are measured in a standard unit.
  1. It is at first thought that there may be a consistent trend in the natural fertility of the soil in the field from the west side to the east, though no other trends are known. Name an experimental design that should be used in these circumstances and give an example of an experimental layout. Initial analysis suggests that any natural fertility trend may in fact be ignored, so the data from the trial are analysed by one-way analysis of variance.
  2. The usual model for one-way analysis of variance of the yields \(y _ { i j }\) may be written as $$y _ { i j } = \mu + \alpha _ { i } + e _ { i j }$$ where the \(e _ { i j }\) represent the experimental errors. Interpret the other terms in the model. State the usual distributional assumptions for the \(e _ { i j }\).
  3. The data for the yields are as follows, each variety having been used on 5 plots.
    Variety
    ABCD
    12.314.214.113.6
    11.913.113.212.8
    12.813.114.613.3
    12.212.513.714.3
    13.512.713.413.8
    $$\left[ \Sigma \Sigma y _ { i j } = 265.1 , \quad \Sigma \Sigma y _ { i j } ^ { 2 } = 3524.31 . \right]$$ Construct the usual one-way analysis of variance table and carry out the usual test, at the 5\% significance level. Report briefly on your conclusions. {www.ocr.org.uk} after the live examination series.
    If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity. For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the

Question 4:
Part (i)
AnswerMarks
Randomised blocksB1
Plots in strips (blocks)M1
Correctly aligned w.r.t. fertility trendE1
Each letter occurs at least once in each blockM1
In a random arrangementE1
Part (ii) - Model
AnswerMarks
\(\mu =\) population grand mean for whole experimentB1
\(\alpha_i =\) population mean amount by which the \(i\)th treatment differs from \(\mu\)B1
\(e_{ij} \sim\) ind \(N\) [accept "uncorrelated"] \((0, \sigma^2)\)B1, B1, B1
Part (ii) - ANOVA
AnswerMarks Guidance
Totals: \(62.7,\ 65.6,\ 69.0,\ 67.8\), all from samples of size 5
Grand total \(265.1\); CF \(= 265.1^2/20 = 3513.9005\)
Total SS \(= 3524.31 - \text{CF} = 10.4095\)M1 Attempt to form three sums of squares
Between varieties SS \(= \frac{62.7^2}{5} + \frac{65.6^2}{5} + \frac{69.0^2}{5} + \frac{67.8^2}{5} - \text{CF} = 4.5975\)M1, A1 Correct method; each calculated SS correct
Residual SS \(= 10.4095 - 4.5975 = 5.8120\)A1
Between varieties: SS \(= 4.5975\), df \(= 3\), MS \(= 1.5325\), MS ratio \(= 4.22\)B1, M1, A1 cao
Residual: SS \(= 5.8120\), df \(= 16\), MS \(= 0.36325\)B1
Refer MS ratio to \(F_{3,16}\)M1 No FT if wrong
Upper 5% point is \(3.24\); SignificantA1 No FT if wrong
Seems the mean yields of the varieties are not all the sameE1, E1
# Question 4:

## Part (i)
| Randomised blocks | B1 | |
|---|---|---|
| Plots in strips (blocks) | M1 | |
| Correctly aligned w.r.t. fertility trend | E1 | |
| Each letter occurs at least once in each block | M1 | |
| In a random arrangement | E1 | |

## Part (ii) - Model
| $\mu =$ population grand mean for whole experiment | B1 | |
|---|---|---|
| $\alpha_i =$ population mean amount by which the $i$th treatment differs from $\mu$ | B1 | |
| $e_{ij} \sim$ ind $N$ [accept "uncorrelated"] $(0, \sigma^2)$ | B1, B1, B1 | |

## Part (ii) - ANOVA
| Totals: $62.7,\ 65.6,\ 69.0,\ 67.8$, all from samples of size 5 | | |
|---|---|---|
| Grand total $265.1$; CF $= 265.1^2/20 = 3513.9005$ | | |
| Total SS $= 3524.31 - \text{CF} = 10.4095$ | M1 | Attempt to form three sums of squares |
| Between varieties SS $= \frac{62.7^2}{5} + \frac{65.6^2}{5} + \frac{69.0^2}{5} + \frac{67.8^2}{5} - \text{CF} = 4.5975$ | M1, A1 | Correct method; each calculated SS correct |
| Residual SS $= 10.4095 - 4.5975 = 5.8120$ | A1 | |
| Between varieties: SS $= 4.5975$, df $= 3$, MS $= 1.5325$, MS ratio $= 4.22$ | B1, M1, A1 cao | |
| Residual: SS $= 5.8120$, df $= 16$, MS $= 0.36325$ | B1 | |
| Refer MS ratio to $F_{3,16}$ | M1 | No FT if wrong |
| Upper 5% point is $3.24$; Significant | A1 | No FT if wrong |
| Seems the mean yields of the varieties are not all the same | E1, E1 | |
4 At an agricultural research station, a trial is made of four varieties (A, B, C, D) of a certain crop in an experimental field. The varieties are grown on plots in the field and their yields are measured in a standard unit.\\
(i) It is at first thought that there may be a consistent trend in the natural fertility of the soil in the field from the west side to the east, though no other trends are known. Name an experimental design that should be used in these circumstances and give an example of an experimental layout.

Initial analysis suggests that any natural fertility trend may in fact be ignored, so the data from the trial are analysed by one-way analysis of variance.\\
(ii) The usual model for one-way analysis of variance of the yields $y _ { i j }$ may be written as

$$y _ { i j } = \mu + \alpha _ { i } + e _ { i j }$$

where the $e _ { i j }$ represent the experimental errors. Interpret the other terms in the model. State the usual distributional assumptions for the $e _ { i j }$.\\
(iii) The data for the yields are as follows, each variety having been used on 5 plots.

\begin{center}
\begin{tabular}{ | c | c | c | c | }
\hline
\multicolumn{4}{|c|}{Variety} \\
\hline
A & B & C & D \\
\hline
12.3 & 14.2 & 14.1 & 13.6 \\
11.9 & 13.1 & 13.2 & 12.8 \\
12.8 & 13.1 & 14.6 & 13.3 \\
12.2 & 12.5 & 13.7 & 14.3 \\
13.5 & 12.7 & 13.4 & 13.8 \\
\hline
\end{tabular}
\end{center}

$$\left[ \Sigma \Sigma y _ { i j } = 265.1 , \quad \Sigma \Sigma y _ { i j } ^ { 2 } = 3524.31 . \right]$$

Construct the usual one-way analysis of variance table and carry out the usual test, at the 5\% significance level. Report briefly on your conclusions.

{www.ocr.org.uk} after the live examination series.\\
If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity. For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.\\
OCR is part of the

\hfill \mbox{\textit{OCR MEI S4 2010 Q4 [24]}}