Edexcel S3 2023 January — Question 1 12 marks

Exam BoardEdexcel
ModuleS3 (Statistics 3)
Year2023
SessionJanuary
Marks12
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicConfidence intervals
TypeUnbiased estimates then CI
DifficultyStandard +0.3 This is a straightforward S3 hypothesis testing question with standard coded data manipulation. Part (a) involves routine mean/variance calculations from coded data using given formulas, (b)-(d) are textbook hypothesis test steps with critical values from tables, and (e) requires a simple justification about large sample size. All techniques are standard with no novel problem-solving required, making it slightly easier than average.
Spec2.04e Normal distribution: as model N(mu, sigma^2)2.04f Find normal probabilities: Z transformation5.05b Unbiased estimates: of population mean and variance5.05c Hypothesis test: normal distribution for population mean

1 A machine fills bottles with mineral water.
The machine is checked every day to ensure that it is working correctly. On a particular day a random sample of 100 bottles is taken. The volume of water, \(x\) millilitres, for each bottle is measured and each measurement is coded using $$y = x - 1000$$ The results are summarised below $$\sum y = 847 \quad \sum y ^ { 2 } = 13510.09$$
    1. Show that the value of the unbiased estimate of the mean of \(x\) is 1008.47
    2. Calculate the unbiased estimate of the variance of \(x\) The machine was initially set so that the volume of water in a bottle had a mean value of 1010 millilitres. Later, a test at the \(5 \%\) significance level is used to determine whether or not the mean volume of water in a bottle has changed. If it has changed then the machine is stopped and reset.
  1. Write down suitable null and alternative hypotheses for a 2-tailed test.
  2. Find the critical region for \(\bar { X }\) in the above test.
  3. Using your answer to part (a) and your critical region found in part (c), comment on whether or not the machine needs to be stopped and reset.
    Give a reason for your answer.
  4. Explain why the use of \(\sigma ^ { 2 } = s ^ { 2 }\) is reasonable in this situation.

Question 1:
Part (a)(i)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\(\bar{y} = \frac{847}{100} = 8.47\) or \(847 + 100 \times 1000 = 100847\)M1 For 8.47 or \(\frac{847}{100}\) or \(847 + 100 \times 1000\) or \(847 = \sum x - 100 \times 1000\) or 100847 seen
\(\bar{x} = 1000 + \frac{847}{100} = 1008.47\)A1* cso correct solution including \(\bar{x} = ..\) and \(...= 1008.47\); allow alt notation for \(\bar{x}\) but must refer to \(x\) not \(y\); must not be just \(x\) e.g. \(E(X)\), \(\mu_x\), mean of \(x\)
Part (a)(ii)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\(s_x^2 = s_y^2 = \frac{13510.09 - 100 \times \text{"8.47"}^2}{99}\) or \(s_x^2 = \frac{101707510.1 - \frac{\text{"100847"}^2}{100}}{99}\)M1 For a correct expression ft their 100847; allow answer of 1064
\(= 64\)A1 Cao do not ISW; allow 64.00...
Part (b)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\(H_0: \mu_x = 1010\), \(H_1: \mu_x \neq 1010\)B1 Both hypotheses correct; must be in terms of \(\mu\); allow \(H_0: \mu_y = 10\), \(H_1: \mu_y \neq 10\)
Part (c)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\(\frac{\bar{X} - 1010}{\text{"8"}/\sqrt{100}} = -1.96\) or \(\frac{\bar{X} - 1010}{\text{"8"}/\sqrt{100}} = 1.96\)M1 B1 For \(\pm\) standardisation with 1010 and their sd; allow equivalent e.g. \(1010 \pm n \times \text{"8"}/\sqrt{100}\); SC condone use of 1008.47 for 1010 or for \(\bar{X}\); for c.v. = \(\pm 1.96\) or better (Calculator gives 1.95996...); condone 1.6449 or better if they have a one tail hypothesis in (b)
\(\bar{X} = 1008.432\) and \(\bar{X} = 1011.568\), awrt 1008 and 1012 (or 1011)A1 For both limits 1008 or better and 1012 or better seen (condone 1011 from correct working)
\(\bar{X} \leq \text{"1008.432"}\) or \(\bar{X} \geq \text{"1011.568"}\)A1ft For selecting the correct region ft their figures (not \(z\) value); allow use of \(<\) and \(>\); allow other letters (condone \(\mu\)); allow other notation e.g. \([1012, \infty]\), \((\infty, 1008]\) etc.
Part (d)
AnswerMarks Guidance
Answer/WorkingMark Guidance
1008.47 is not in the critical regionM1 ft their CR if the final A mark in part (c) is awarded; must refer to 1008.47; allow writing in the form "1008.432" \(< 1008.47 <\) "1011.568" etc but if in middle it must have both ends; if no clear CR it is M0A0
The machine does not need to be stopped/resetA1ft dep on M1; correct conclusion consistent with comparing 1008.47 with their CR; if in the CR they must say it needs to be reset/stopped; if not in the CR it must say it does not need to be stopped/reset
SC: if CR is of the form "1008.432" \(< \bar{X} <\) "1011.568" then award M0A1 for concluding the machine does not need to be stopped/resetSC
Part (e)
AnswerMarks Guidance
Answer/WorkingMark Guidance
It is reasonable since the sample size is (reasonably) largeB1 Any suitable comment about the sample being large e.g. \(n\) is large
# Question 1:

## Part (a)(i)
| Answer/Working | Mark | Guidance |
|---|---|---|
| $\bar{y} = \frac{847}{100} = 8.47$ or $847 + 100 \times 1000 = 100847$ | M1 | For 8.47 or $\frac{847}{100}$ or $847 + 100 \times 1000$ or $847 = \sum x - 100 \times 1000$ or 100847 seen |
| $\bar{x} = 1000 + \frac{847}{100} = 1008.47$ | A1* | cso correct solution including $\bar{x} = ..$ and $...= 1008.47$; allow alt notation for $\bar{x}$ but must refer to $x$ not $y$; must not be just $x$ e.g. $E(X)$, $\mu_x$, mean of $x$ |

## Part (a)(ii)
| Answer/Working | Mark | Guidance |
|---|---|---|
| $s_x^2 = s_y^2 = \frac{13510.09 - 100 \times \text{"8.47"}^2}{99}$ or $s_x^2 = \frac{101707510.1 - \frac{\text{"100847"}^2}{100}}{99}$ | M1 | For a correct expression ft their 100847; allow answer of 1064 |
| $= 64$ | A1 | Cao do not ISW; allow 64.00... |

## Part (b)
| Answer/Working | Mark | Guidance |
|---|---|---|
| $H_0: \mu_x = 1010$, $H_1: \mu_x \neq 1010$ | B1 | Both hypotheses correct; must be in terms of $\mu$; allow $H_0: \mu_y = 10$, $H_1: \mu_y \neq 10$ |

## Part (c)
| Answer/Working | Mark | Guidance |
|---|---|---|
| $\frac{\bar{X} - 1010}{\text{"8"}/\sqrt{100}} = -1.96$ or $\frac{\bar{X} - 1010}{\text{"8"}/\sqrt{100}} = 1.96$ | M1 B1 | For $\pm$ standardisation with 1010 and their sd; allow equivalent e.g. $1010 \pm n \times \text{"8"}/\sqrt{100}$; SC condone use of 1008.47 for 1010 or for $\bar{X}$; for c.v. = $\pm 1.96$ or better (Calculator gives 1.95996...); condone 1.6449 or better if they have a one tail hypothesis in (b) |
| $\bar{X} = 1008.432$ and $\bar{X} = 1011.568$, awrt 1008 and 1012 (or 1011) | A1 | For **both** limits 1008 or better **and** 1012 or better seen (condone 1011 from correct working) |
| $\bar{X} \leq \text{"1008.432"}$ or $\bar{X} \geq \text{"1011.568"}$ | A1ft | For selecting the correct region ft their figures (not $z$ value); allow use of $<$ and $>$; allow other letters (condone $\mu$); allow other notation e.g. $[1012, \infty]$, $(\infty, 1008]$ etc. |

## Part (d)
| Answer/Working | Mark | Guidance |
|---|---|---|
| 1008.47 is not in the critical region | M1 | ft their CR if the final A mark in part (c) is awarded; must refer to 1008.47; allow writing in the form "1008.432" $< 1008.47 <$ "1011.568" etc but if in middle it must have both ends; if no clear CR it is M0A0 |
| The machine does not need to be stopped/reset | A1ft | dep on M1; correct conclusion consistent with comparing 1008.47 with their CR; if in the CR they must say it needs to be reset/stopped; if not in the CR it must say it does not need to be stopped/reset |
| SC: if CR is of the form "1008.432" $< \bar{X} <$ "1011.568" then award M0A1 for concluding the machine does not need to be stopped/reset | SC | |

## Part (e)
| Answer/Working | Mark | Guidance |
|---|---|---|
| It is reasonable since the sample size is (reasonably) large | B1 | Any suitable comment about the sample being large e.g. $n$ is large |

---
1 A machine fills bottles with mineral water.\\
The machine is checked every day to ensure that it is working correctly. On a particular day a random sample of 100 bottles is taken. The volume of water, $x$ millilitres, for each bottle is measured and each measurement is coded using

$$y = x - 1000$$

The results are summarised below

$$\sum y = 847 \quad \sum y ^ { 2 } = 13510.09$$
\begin{enumerate}[label=(\alph*)]
\item \begin{enumerate}[label=(\roman*)]
\item Show that the value of the unbiased estimate of the mean of $x$ is 1008.47
\item Calculate the unbiased estimate of the variance of $x$

The machine was initially set so that the volume of water in a bottle had a mean value of 1010 millilitres.

Later, a test at the $5 \%$ significance level is used to determine whether or not the mean volume of water in a bottle has changed. If it has changed then the machine is stopped and reset.
\end{enumerate}\item Write down suitable null and alternative hypotheses for a 2-tailed test.
\item Find the critical region for $\bar { X }$ in the above test.
\item Using your answer to part (a) and your critical region found in part (c), comment on whether or not the machine needs to be stopped and reset.\\
Give a reason for your answer.
\item Explain why the use of $\sigma ^ { 2 } = s ^ { 2 }$ is reasonable in this situation.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S3 2023 Q1 [12]}}