OCR MEI S3 2006 January — Question 4 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2006
SessionJanuary
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicWilcoxon tests
TypeCritical region or test statistic properties
DifficultyStandard +0.3 This is a standard S3 question testing routine application of chi-squared goodness of fit test and Wilcoxon signed-rank test. Part (i) requires mechanical calculation of chi-squared statistic with given expected frequencies (no parameter estimation complications). Part (ii)(A) is straightforward discussion of normality/sample size for t-tests. Part (ii)(B) involves standard Wilcoxon procedure with small sample. All techniques are textbook applications with no novel insight required, though the 18 marks indicate it's somewhat lengthy. Slightly easier than average due to the procedural nature.
Spec5.06b Fit prescribed distribution: chi-squared test

Quality control inspectors in a factory are investigating the lengths of glass tubes that will be used to make laboratory equipment.
  1. Data on the observed lengths of a random sample of 200 glass tubes from one batch are available in the form of a frequency distribution as follows.
    Length \(x\) (mm)Observed frequency
    \(x \leq 298\)1
    \(298 < x \leq 300\)30
    \(300 < x \leq 301\)62
    \(301 < x \leq 302\)70
    \(302 < x \leq 304\)34
    \(x > 304\)3
    The sample mean and standard deviation are 301.08 and 1.2655 respectively. The corresponding expected frequencies for the Normal distribution with parameters estimated by the sample statistics are
    Length \(x\) (mm)Expected frequency
    \(x \leq 298\)1.49
    \(298 < x \leq 300\)37.85
    \(300 < x \leq 301\)55.62
    \(301 < x \leq 302\)58.32
    \(302 < x \leq 304\)44.62
    \(x > 304\)2.10
    Examine the goodness of fit of a Normal distribution, using a 5\% significance level. [7]
  2. It is thought that the lengths of tubes in another batch have an underlying distribution similar to that for the batch in part (i) but possibly with different location and dispersion parameters. A random sample of 10 tubes from this batch gives the following lengths (in mm). 301.3 \quad 301.4 \quad 299.6 \quad 302.2 \quad 300.3 \quad 303.2 \quad 302.6 \quad 301.8 \quad 300.9 \quad 300.8
    1. Discuss briefly whether it would be appropriate to use a \(t\) test to examine a hypothesis about the population mean length for this batch. [2]
    2. Use a Wilcoxon test to examine at the 10\% significance level whether the population median length for this batch is 301 mm. [9]

(i)
AnswerMarks Guidance
\(\chi^2 = 1.7681 + 0.7318 + 2.3392 + 2.0222 = 6.86\)M, A1 for grouping. Allow the M1 for correct method from wrongly grouped or ungrouped table.
Refer to \(\chi_1^2\)M Allow correct df (= cells − 3) from wrongly grouped or ungrouped table, and FT. Otherwise, no FT if wrong.
Upper 5% point is 3.84. Significant. Suggests Normal model does not fitA1, E1, E1 No ft from here if wrong. ft only c's test statistic. ft only c's test statistic.
(ii)(A)
AnswerMarks Guidance
\(t\) test unwise ...E1
... because underlying population appears non-NormalE1 FT from result of candidate's work in (i)
(ii)(B)
AnswerMarks Guidance
DataMedian 301 Difference
301.3 0.3
301.4 0.4
299.6 −1.4
302.2 1.2
300.3 −0.7
303.2 2.2
302.6 1.6
301.8 0.8
300.9 −0.1
300.8 −0.2
Mfor differences. ZERO in this section if differences not used. 9 marks
\(T = 1 + 2 + 5 + 8 = 16\) (or \(3+4+6+7+9+10 = 39\))B1
Refer to tables of Wilcoxon single sample (/paired) statisticM
Lower (or upper if 39 used) 5% tail is neededM
Value for \(n = 10\) is 10 (or 45 if 39 used)A1
Result is not significant. No evidence against median being 301E1, E1
## (i)
$\chi^2 = 1.7681 + 0.7318 + 2.3392 + 2.0222 = 6.86$ | M, A1 | for grouping. Allow the M1 for correct method from wrongly grouped or ungrouped table. | 7 marks

Refer to $\chi_1^2$ | M | Allow correct df (= cells − 3) from wrongly grouped or ungrouped table, and FT. Otherwise, no FT if wrong. |

Upper 5% point is 3.84. Significant. Suggests Normal model does not fit | A1, E1, E1 | No ft from here if wrong. ft only c's test statistic. ft only c's test statistic. |

## (ii)(A)
$t$ test unwise ... | E1 | | 2 marks

... because underlying population appears non-Normal | E1 | FT from result of candidate's work in (i) |

## (ii)(B)
| Data | Median 301 | Difference | Rank of [diff] |
|------|-----------|-----------|-----------------|
| 301.3 | | 0.3 | 3 |
| 301.4 | | 0.4 | 4 |
| 299.6 | | −1.4 | 8 |
| 302.2 | | 1.2 | 7 |
| 300.3 | | −0.7 | 5 |
| 303.2 | | 2.2 | 10 |
| 302.6 | | 1.6 | 9 |
| 301.8 | | 0.8 | 6 |
| 300.9 | | −0.1 | 1 |
| 300.8 | | −0.2 | 2 |

| M | for differences. ZERO in this section if differences not used. | 9 marks

$T = 1 + 2 + 5 + 8 = 16$ (or $3+4+6+7+9+10 = 39$) | B1 | |

Refer to tables of Wilcoxon single sample (/paired) statistic | M | |

Lower (or upper if 39 used) 5% tail is needed | M | |

Value for $n = 10$ is 10 (or 45 if 39 used) | A1 | |

Result is not significant. No evidence against median being 301 | E1, E1 | |
Quality control inspectors in a factory are investigating the lengths of glass tubes that will be used to make laboratory equipment.

\begin{enumerate}[label=(\roman*)]
\item Data on the observed lengths of a random sample of 200 glass tubes from one batch are available in the form of a frequency distribution as follows.

\begin{tabular}{|c|c|}
\hline
Length $x$ (mm) & Observed frequency \\
\hline
$x \leq 298$ & 1 \\
$298 < x \leq 300$ & 30 \\
$300 < x \leq 301$ & 62 \\
$301 < x \leq 302$ & 70 \\
$302 < x \leq 304$ & 34 \\
$x > 304$ & 3 \\
\hline
\end{tabular}

The sample mean and standard deviation are 301.08 and 1.2655 respectively.

The corresponding expected frequencies for the Normal distribution with parameters estimated by the sample statistics are

\begin{tabular}{|c|c|}
\hline
Length $x$ (mm) & Expected frequency \\
\hline
$x \leq 298$ & 1.49 \\
$298 < x \leq 300$ & 37.85 \\
$300 < x \leq 301$ & 55.62 \\
$301 < x \leq 302$ & 58.32 \\
$302 < x \leq 304$ & 44.62 \\
$x > 304$ & 2.10 \\
\hline
\end{tabular}

Examine the goodness of fit of a Normal distribution, using a 5\% significance level. [7]

\item It is thought that the lengths of tubes in another batch have an underlying distribution similar to that for the batch in part (i) but possibly with different location and dispersion parameters. A random sample of 10 tubes from this batch gives the following lengths (in mm).

301.3 \quad 301.4 \quad 299.6 \quad 302.2 \quad 300.3 \quad 303.2 \quad 302.6 \quad 301.8 \quad 300.9 \quad 300.8

\begin{enumerate}[label=(\Alph*)]
\item Discuss briefly whether it would be appropriate to use a $t$ test to examine a hypothesis about the population mean length for this batch. [2]

\item Use a Wilcoxon test to examine at the 10\% significance level whether the population median length for this batch is 301 mm. [9]
\end{enumerate}
\end{enumerate}

\hfill \mbox{\textit{OCR MEI S3 2006 Q4 [18]}}