OCR MEI S3 2011 January — Question 3 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2011
SessionJanuary
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeChi-squared goodness of fit: Normal
DifficultyStandard +0.3 This is a standard chi-squared goodness of fit test with Normal distribution, requiring routine calculations of mean/SD from grouped data, finding expected frequencies using Normal tables, computing the test statistic, and interpreting results. While it involves multiple steps and some statistical reasoning in part (iv), all techniques are standard S3 material with no novel problem-solving required.
Spec2.02g Calculate mean and standard deviation2.04e Normal distribution: as model N(mu, sigma^2)2.04f Find normal probabilities: Z transformation5.06b Fit prescribed distribution: chi-squared test

3 The masses, in kilograms, of a random sample of 100 chickens on sale in a large supermarket were recorded as follows.
Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
Frequency2830421152
  1. Assuming that the first and last classes are the same width as the other classes, calculate an estimate of the sample mean and show that the corresponding estimate of the sample standard deviation is 0.2227 kg . A Normal distribution using the mean and standard deviation found in part (i) is to be fitted to these data. The expected frequencies for the classes are as follows.
    Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
    Expected
    frequency
    2.1710.92\(f\)33.8519.225.130.68
  2. Use the Normal distribution to find \(f\).
  3. Carry out a goodness of fit test of this Normal model using a significance level of 5\%.
  4. Discuss the outcome of the test with reference to the contributions to the test statistic and to the possibility of other significance levels.

Question 3:
Part (i)
AnswerMarks Guidance
AnswerMark Guidance
Using mid-intervals 1.5, 1.7, etcM1
\(\bar{x} = \frac{205}{100} = 2.05\)A1 Mean
\(s = \sqrt{\frac{425.16 - 100 \times 2.05^2}{99}} = 0.2227(01\ldots)\)E1 s.d. Answer given; must show convincingly. Total: 3
Part (ii)
AnswerMarks Guidance
AnswerMark Guidance
\(f = 100 \times P(1.8 \leq M < 2.0)\)M1 Probability \(\times\) 100
\(= 100 \times P(-1.1226 \leq z < -0.2245)\)A1 Correct Normal probabilities. ft c's mean
\(= 100 \times ((1 - 0.5888) - (1 - 0.8691))\) Must show convincingly using Normal distribution
\(= 100 \times (0.4112 - 0.1309) = 28.03\)A1 ft c's mean. Total: 3
Part (iii)
AnswerMarks Guidance
AnswerMark Guidance
\(H_0\): The Normal model fits the data. \(H_1\): The Normal model does not fit the dataB1, B1 Ignore any reference to parameters
Merge first 2 and last 2 cellsM1
\(X^2 = 0.7294 + 0.1384 + 1.9623 + 3.5155 + 0.2437 = 6.589(3)\)M1, A1 Calculation of \(X^2\); c.a.o.
Refer to \(\chi^2_2\)M1 Allow correct df (= cells \(- 3\)) from wrongly grouped table and ft. Otherwise no ft if wrong. \(P(X^2 > 6.589) = 0.0371\)
Upper 5% point is 5.991A1 No ft from here if wrong
SignificantA1 ft only c's test statistic
Evidence suggests that the model does not fit the dataA1 ft only c's test statistic. Conclusion in context. Total: 9
Part (iv)
AnswerMarks Guidance
AnswerMark Guidance
The model overestimates in the \(2.2 - 2.4\) classE1
Underestimates in the \(2 - 2.2\) classE1
At lower significance levels the test would not have been significantE1 Total: 3
# Question 3:

## Part (i)
| Answer | Mark | Guidance |
|--------|------|----------|
| Using mid-intervals 1.5, 1.7, etc | M1 | |
| $\bar{x} = \frac{205}{100} = 2.05$ | A1 | Mean |
| $s = \sqrt{\frac{425.16 - 100 \times 2.05^2}{99}} = 0.2227(01\ldots)$ | E1 | s.d. Answer given; must show convincingly. Total: 3 |

## Part (ii)
| Answer | Mark | Guidance |
|--------|------|----------|
| $f = 100 \times P(1.8 \leq M < 2.0)$ | M1 | Probability $\times$ 100 |
| $= 100 \times P(-1.1226 \leq z < -0.2245)$ | A1 | Correct Normal probabilities. ft c's mean |
| $= 100 \times ((1 - 0.5888) - (1 - 0.8691))$ | | Must show convincingly using Normal distribution |
| $= 100 \times (0.4112 - 0.1309) = 28.03$ | A1 | ft c's mean. Total: 3 |

## Part (iii)
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0$: The Normal model fits the data. $H_1$: The Normal model does not fit the data | B1, B1 | Ignore any reference to parameters |
| Merge first 2 and last 2 cells | M1 | |
| $X^2 = 0.7294 + 0.1384 + 1.9623 + 3.5155 + 0.2437 = 6.589(3)$ | M1, A1 | Calculation of $X^2$; c.a.o. |
| Refer to $\chi^2_2$ | M1 | Allow correct df (= cells $- 3$) from wrongly grouped table and ft. Otherwise no ft if wrong. $P(X^2 > 6.589) = 0.0371$ |
| Upper 5% point is 5.991 | A1 | No ft from here if wrong |
| Significant | A1 | ft only c's test statistic |
| Evidence suggests that the model does not fit the data | A1 | ft only c's test statistic. Conclusion in context. Total: 9 |

## Part (iv)
| Answer | Mark | Guidance |
|--------|------|----------|
| The model overestimates in the $2.2 - 2.4$ class | E1 | |
| Underestimates in the $2 - 2.2$ class | E1 | |
| At lower significance levels the test would not have been significant | E1 | Total: 3 |

---
3 The masses, in kilograms, of a random sample of 100 chickens on sale in a large supermarket were recorded as follows.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | }
\hline
Mass $( m \mathrm {~kg} )$ & $m < 1.6$ & $1.6 \leqslant m < 1.8$ & $1.8 \leqslant m < 2.0$ & $2.0 \leqslant m < 2.2$ & $2.2 \leqslant m < 2.4$ & $2.4 \leqslant m < 2.6$ & $2.6 \leqslant m$ \\
\hline
Frequency & 2 & 8 & 30 & 42 & 11 & 5 & 2 \\
\hline
\end{tabular}
\end{center}

(i) Assuming that the first and last classes are the same width as the other classes, calculate an estimate of the sample mean and show that the corresponding estimate of the sample standard deviation is 0.2227 kg .

A Normal distribution using the mean and standard deviation found in part (i) is to be fitted to these data. The expected frequencies for the classes are as follows.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | }
\hline
Mass $( m \mathrm {~kg} )$ & $m < 1.6$ & $1.6 \leqslant m < 1.8$ & $1.8 \leqslant m < 2.0$ & $2.0 \leqslant m < 2.2$ & $2.2 \leqslant m < 2.4$ & $2.4 \leqslant m < 2.6$ & $2.6 \leqslant m$ \\
\hline
\begin{tabular}{ l }
Expected \\
frequency \\
\end{tabular} & 2.17 & 10.92 & $f$ & 33.85 & 19.22 & 5.13 & 0.68 \\
\hline
\end{tabular}
\end{center}

(ii) Use the Normal distribution to find $f$.\\
(iii) Carry out a goodness of fit test of this Normal model using a significance level of 5\%.\\
(iv) Discuss the outcome of the test with reference to the contributions to the test statistic and to the possibility of other significance levels.

\hfill \mbox{\textit{OCR MEI S3 2011 Q3 [18]}}