AQA S1 2011 January — Question 3 13 marks

Exam BoardAQA
ModuleS1 (Statistics 1)
Year2011
SessionJanuary
Marks13
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicNormal Distribution
TypeEstimate from grouped frequency data
DifficultyModerate -0.3 This is a standard S1 grouped data question requiring routine calculations (mean, standard deviation from grouped data) and straightforward confidence interval construction. Part (a) tests basic understanding of grouped data boundaries, (b) is mechanical calculation, (c) applies standard normal confidence interval formula, and (d) requires simple interpretation. All techniques are textbook exercises with no novel problem-solving required, making it slightly easier than average.
Spec2.02g Calculate mean and standard deviation2.04e Normal distribution: as model N(mu, sigma^2)2.04f Find normal probabilities: Z transformation5.05c Hypothesis test: normal distribution for population mean

3 The volume, \(X\) litres, of orange juice in a 1-litre carton may be modelled by a normal distribution with unknown mean \(\mu\). The volumes, \(x\) litres, recorded to the nearest 0.01 litre, in a random sample of 100 cartons are shown in the table.
Volume ( \(\boldsymbol { x }\) litres)Number of cartons (f)
0.95-0.972
0.98-1.007
1.01-1.0315
1.04-1.0632
1.07-1.0922
1.10-1.1214
1.13-1.157
1.16-1.181
Total100
  1. For the group ' \(0.98 - 1.00\) ':
    1. show that it has a mid-point of 0.99 litres;
    2. state the minimum and the maximum values of \(x\) that could be included in this group.
  2. Calculate, to three decimal places, estimates of the mean and the standard deviation of these 100 volumes.
    1. Construct an approximate \(99 \%\) confidence interval for \(\mu\).
    2. State why use of the Central Limit Theorem was not required when calculating this confidence interval.
    3. Give a reason why the confidence interval is approximate rather than exact.
  3. Give a reason in support of the claim that:
    1. \(\mu > 1\);
    2. \(\mathrm { P } ( 0.94 < X < 1.16 )\) is approximately 1 .
      \includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-10_2486_1714_221_153}
      \includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-11_2486_1714_221_153}

Question 3:
Part (a)(i)
AnswerMarks Guidance
Mid-point of 0.98–1.00 is \(\frac{0.98 + 1.00}{2} = 0.99\)B1 Show working clearly
Part (a)(ii)
AnswerMarks Guidance
Minimum: \(x = 0.975\), Maximum: \(x = 1.005\)B1B1 B1 each value; accept 0.9750... and 1.0049...
Part (b)
AnswerMarks Guidance
\(\bar{x} = \frac{\sum fx}{\sum f} = \frac{2(0.96)+7(0.99)+15(1.02)+32(1.05)+22(1.08)+14(1.11)+7(1.14)+1(1.17)}{100}\)M1 Using midpoints with frequencies
\(\bar{x} = \frac{105.39}{100} = 1.054\) (3 d.p.)A1
\(s^2 = \frac{\sum fx^2}{\sum f} - \bar{x}^2\), \(s = \sqrt{\frac{\sum fx^2}{100} - \bar{x}^2}\)M1 Correct formula for standard deviation
\(s = 0.049\) (3 d.p.)A1 Accept \(\sigma = 0.049\)
Part (c)(i)
AnswerMarks Guidance
\(\bar{x} \pm 2.576 \times \frac{s}{\sqrt{n}}\)M1 Correct structure of CI
\(1.054 \pm 2.576 \times \frac{0.049}{\sqrt{100}}\)A1 Correct values substituted
\(1.054 \pm 0.01262...\)A1
\((1.041, 1.067)\)A1 Accept \((1.0414, 1.0666)\)
Part (c)(ii)
AnswerMarks Guidance
\(X\) is already normally distributed (so \(\bar{X}\) is exactly normal for any \(n\))B1 Must reference normality of \(X\)
Part (c)(iii)
AnswerMarks Guidance
The standard deviation \(\sigma\) is unknown and estimated by \(s\)B1 Or: population variance unknown/estimated
Part (d)(i)
AnswerMarks Guidance
The confidence interval lies entirely above 1 (lower limit \(> 1\)), supporting \(\mu > 1\)B1 Or: sample mean \(= 1.054 > 1\)
Part (d)(ii)
AnswerMarks Guidance
The interval \((0.94, 1.16)\) covers approximately \(\mu \pm 2\sigma\) (since \(s \approx 0.049\), \(2s \approx 0.1\)), and for a normal distribution \(P(\mu - 2\sigma < X < \mu + 2\sigma) \approx 0.954 \approx 1\)B1 Accept: range is approximately \(\mu \pm 2\sigma\), nearly all data lies within this interval
# Question 3:

## Part (a)(i)
| Mid-point of 0.98–1.00 is $\frac{0.98 + 1.00}{2} = 0.99$ | B1 | Show working clearly |

## Part (a)(ii)
| Minimum: $x = 0.975$, Maximum: $x = 1.005$ | B1B1 | B1 each value; accept 0.9750... and 1.0049... |

## Part (b)
| $\bar{x} = \frac{\sum fx}{\sum f} = \frac{2(0.96)+7(0.99)+15(1.02)+32(1.05)+22(1.08)+14(1.11)+7(1.14)+1(1.17)}{100}$ | M1 | Using midpoints with frequencies |
| $\bar{x} = \frac{105.39}{100} = 1.054$ (3 d.p.) | A1 | |
| $s^2 = \frac{\sum fx^2}{\sum f} - \bar{x}^2$, $s = \sqrt{\frac{\sum fx^2}{100} - \bar{x}^2}$ | M1 | Correct formula for standard deviation |
| $s = 0.049$ (3 d.p.) | A1 | Accept $\sigma = 0.049$ |

## Part (c)(i)
| $\bar{x} \pm 2.576 \times \frac{s}{\sqrt{n}}$ | M1 | Correct structure of CI |
| $1.054 \pm 2.576 \times \frac{0.049}{\sqrt{100}}$ | A1 | Correct values substituted |
| $1.054 \pm 0.01262...$ | A1 | |
| $(1.041, 1.067)$ | A1 | Accept $(1.0414, 1.0666)$ |

## Part (c)(ii)
| $X$ is already normally distributed (so $\bar{X}$ is exactly normal for any $n$) | B1 | Must reference normality of $X$ |

## Part (c)(iii)
| The standard deviation $\sigma$ is unknown and estimated by $s$ | B1 | Or: population variance unknown/estimated |

## Part (d)(i)
| The confidence interval lies entirely above 1 (lower limit $> 1$), supporting $\mu > 1$ | B1 | Or: sample mean $= 1.054 > 1$ |

## Part (d)(ii)
| The interval $(0.94, 1.16)$ covers approximately $\mu \pm 2\sigma$ (since $s \approx 0.049$, $2s \approx 0.1$), and for a normal distribution $P(\mu - 2\sigma < X < \mu + 2\sigma) \approx 0.954 \approx 1$ | B1 | Accept: range is approximately $\mu \pm 2\sigma$, nearly all data lies within this interval |
3 The volume, $X$ litres, of orange juice in a 1-litre carton may be modelled by a normal distribution with unknown mean $\mu$.

The volumes, $x$ litres, recorded to the nearest 0.01 litre, in a random sample of 100 cartons are shown in the table.

\begin{center}
\begin{tabular}{|l|l|}
\hline
Volume ( $\boldsymbol { x }$ litres) & Number of cartons (f) \\
\hline
0.95-0.97 & 2 \\
\hline
0.98-1.00 & 7 \\
\hline
1.01-1.03 & 15 \\
\hline
1.04-1.06 & 32 \\
\hline
1.07-1.09 & 22 \\
\hline
1.10-1.12 & 14 \\
\hline
1.13-1.15 & 7 \\
\hline
1.16-1.18 & 1 \\
\hline
Total & 100 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item For the group ' $0.98 - 1.00$ ':
\begin{enumerate}[label=(\roman*)]
\item show that it has a mid-point of 0.99 litres;
\item state the minimum and the maximum values of $x$ that could be included in this group.
\end{enumerate}\item Calculate, to three decimal places, estimates of the mean and the standard deviation of these 100 volumes.
\item \begin{enumerate}[label=(\roman*)]
\item Construct an approximate $99 \%$ confidence interval for $\mu$.
\item State why use of the Central Limit Theorem was not required when calculating this confidence interval.
\item Give a reason why the confidence interval is approximate rather than exact.
\end{enumerate}\item Give a reason in support of the claim that:
\begin{enumerate}[label=(\roman*)]
\item $\mu > 1$;
\item $\mathrm { P } ( 0.94 < X < 1.16 )$ is approximately 1 .

\begin{center}
\includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-10_2486_1714_221_153}
\end{center}

\begin{center}
\includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-11_2486_1714_221_153}
\end{center}
\end{enumerate}\end{enumerate}

\hfill \mbox{\textit{AQA S1 2011 Q3 [13]}}