OCR S3 2009 January — Question 8 14 marks

Exam BoardOCR
ModuleS3 (Statistics 3)
Year2009
SessionJanuary
Marks14
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeChi-squared goodness of fit: Binomial
DifficultyStandard +0.3 This is a straightforward chi-squared goodness of fit test with standard steps: estimating a parameter from sample mean, calculating expected frequencies, computing chi-squared statistic, and comparing to critical values. Part (i) is routine parameter estimation, part (ii) follows standard test procedure (though requires combining cells), and part (iii) is simple interpretation given the test statistic. The binomial context and calculations are typical S3 material with no novel insights required.
Spec5.06b Fit prescribed distribution: chi-squared test5.06c Fit other distributions: discrete and continuous

8 A soft drinks factory produces lemonade which is sold in packs of 6 bottles. As part of the factory's quality control, random samples of 75 packs are examined at regular intervals. The number of underfilled bottles in a pack of 6 bottles is denoted by the random variable \(X\). The results of one quality control check are shown in the following table.
Number of underfilled bottles0123
Number of packs442083
A researcher assumes that \(X \sim \mathrm {~B} ( 3 , p )\).
  1. By finding the sample mean, show that an estimate of \(p\) is 0.2 .
  2. Show that, at the \(5 \%\) significance level, there is evidence that this binomial distribution does not fit the data.
  3. Another researcher suggests that the goodness of fit test should be for \(\mathrm { B } ( 6 , p )\). She finds that the corresponding value of \(\chi ^ { 2 }\) is 2.74 , correct to 3 significant figures. Given that the number of degrees of freedom is the same as in part (ii), state the conclusion of the test at the same significance level.

Question 8:
Part (i):
AnswerMarks Guidance
AnswerMark Guidance
Mean \(= (20+16+9)/75 = 0.6\)M1, A1
\(3p = 0.6\), \(p = 0.2\) AGA1 3
Part (ii):
AnswerMarks Guidance
AnswerMark Guidance
\(H_0\): \(B(3,p)\) fits the data; \(H_1\): \(B(3,p)\) does not fit the dataB1 Or: \(X \sim B(3,p)\) or \(B(3,0.2)\). Not 'Data fits model'
Expected values: 38.4, 28.8, 7.2, 0.6M1, A1, A1 Use \(B(3,0.2)\times 75\). At least 2 correct, All correct
Combine last two cellsB1
\(\chi^2 = 5.6^2/38.4 + 8.8^2/28.8 + 3.2^2/7.8 = 4.818\)M1, \(\text{A1}\sqrt{}\), A1 With one correct. At least 2 correct Ft values. Accept 4.82 cao
\(4.818 > 3.841\)\(\text{B1}\sqrt{}\) ft 4.818. SR1 if cells not combined: B1M1A1A1B0M1A1A0B1(5.991)M1
Reject \(H_0\) and conclude there is insufficient evidence that \(B(3,p)\) fits the dataM1 10 SR2: E-values rounded: B1M1A1A1A0B1M1A1A0(4.865)B1M1
Part (iii):
AnswerMarks Guidance
AnswerMark Guidance
\(2.74 < 3.841\), accept \(H_0\), conclude that \(B(6,p)\) fits the dataB1 1 Accept with no reason if evidence of method in (ii)
# Question 8:

## Part (i):
| Answer | Mark | Guidance |
|--------|------|----------|
| Mean $= (20+16+9)/75 = 0.6$ | M1, A1 | |
| $3p = 0.6$, $p = 0.2$ AG | A1 **3** | |

## Part (ii):
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0$: $B(3,p)$ fits the data; $H_1$: $B(3,p)$ does not fit the data | B1 | Or: $X \sim B(3,p)$ or $B(3,0.2)$. Not 'Data fits model' |
| Expected values: 38.4, 28.8, 7.2, 0.6 | M1, A1, A1 | Use $B(3,0.2)\times 75$. At least 2 correct, All correct |
| Combine last two cells | B1 | |
| $\chi^2 = 5.6^2/38.4 + 8.8^2/28.8 + 3.2^2/7.8 = 4.818$ | M1, $\text{A1}\sqrt{}$, A1 | With one correct. At least 2 correct Ft values. Accept 4.82 cao |
| $4.818 > 3.841$ | $\text{B1}\sqrt{}$ | ft 4.818. SR1 if cells not combined: B1M1A1A1B0M1A1A0B1(5.991)M1 |
| Reject $H_0$ and conclude there is insufficient evidence that $B(3,p)$ fits the data | M1 **10** | SR2: E-values rounded: B1M1A1A1A0B1M1A1A0(4.865)B1M1 |

## Part (iii):
| Answer | Mark | Guidance |
|--------|------|----------|
| $2.74 < 3.841$, accept $H_0$, conclude that $B(6,p)$ fits the data | B1 **1** | Accept with no reason if evidence of method in (ii) |
8 A soft drinks factory produces lemonade which is sold in packs of 6 bottles. As part of the factory's quality control, random samples of 75 packs are examined at regular intervals. The number of underfilled bottles in a pack of 6 bottles is denoted by the random variable $X$. The results of one quality control check are shown in the following table.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | }
\hline
Number of underfilled bottles & 0 & 1 & 2 & 3 \\
\hline
Number of packs & 44 & 20 & 8 & 3 \\
\hline
\end{tabular}
\end{center}

A researcher assumes that $X \sim \mathrm {~B} ( 3 , p )$.\\
(i) By finding the sample mean, show that an estimate of $p$ is 0.2 .\\
(ii) Show that, at the $5 \%$ significance level, there is evidence that this binomial distribution does not fit the data.\\
(iii) Another researcher suggests that the goodness of fit test should be for $\mathrm { B } ( 6 , p )$. She finds that the corresponding value of $\chi ^ { 2 }$ is 2.74 , correct to 3 significant figures. Given that the number of degrees of freedom is the same as in part (ii), state the conclusion of the test at the same significance level.

\hfill \mbox{\textit{OCR S3 2009 Q8 [14]}}