Edexcel S2 2024 June — Question 3 15 marks

Exam BoardEdexcel
ModuleS2 (Statistics 2)
Year2024
SessionJune
Marks15
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicHypothesis test of binomial distributions
TypeExplain sampling frames and units
DifficultyModerate -0.8 This is a straightforward S2 question testing standard bookwork on sampling terminology (parts a-c), routine binomial critical region calculation (part d), and normal approximation to binomial (part f). All parts follow textbook procedures with no novel problem-solving required; the calculations are mechanical and the context is simple.
Spec2.01c Sampling techniques: simple random, opportunity, etc2.01d Select/critique sampling: in context5.05a Sample mean distribution: central limit theorem5.05b Unbiased estimates: of population mean and variance5.05c Hypothesis test: normal distribution for population mean

3 Jian owns a large group of shops. She decides to visit a random sample of the shops to check if the stocktaking system is being used incorrectly.
  1. Suggest a suitable sampling frame for Jian to use.
  2. Identify the sampling units.
  3. Give one advantage and one disadvantage of taking a sample rather than a census. Jian believes that the stocktaking system is being used incorrectly in \(40 \%\) of the shops.
    To investigate her belief, a random sample of 30 of the shops is taken.
  4. Using a 5\% level of significance, find the critical region for a two-tailed test of Jian's belief.
    You should state the probability in each tail, which should each be as close as possible to 2.5\% The total number of shops, in the sample of 30, where the stocktaking system is being used incorrectly is 20
  5. Using the critical region from part (d), state what this suggests about Jian's belief. Give a reason for your answer. Jian introduces a new, simpler, stocktaking system to all the shops.
    She takes a random sample of 150 shops and finds that in 47 of these shops the new stocktaking system is being used incorrectly.
  6. Using a suitable approximation, test, at the \(5 \%\) level of significance, whether or not there is evidence that the proportion of shops where the stocktaking system is being used incorrectly is now less than 0.4 You should state your hypotheses and show your working clearly.

Question 3:
Part (a):
AnswerMarks Guidance
Working/AnswerMark Guidance
A list of all the shopsB1 For the idea of a list/database of all shops; list of all stocktaking systems is B0
Part (b):
AnswerMarks Guidance
Working/AnswerMark Guidance
The shopsB1 For allowing shop or store(s); the number of shops is B0; the stocktaking systems at each shop is B0
Part (c):
AnswerMarks Guidance
Working/AnswerMark Guidance
Advantage: A sample is quicker/cheaper/easier to processB1 For a correct advantage; e.g. allow census takes longer than a sample; 'a sample is more uncertain' on its own is B0
Disadvantage: less accurate/may be biased/may not be representativeB1 For a correct disadvantage e.g. a census is more accurate than a sample. If no reference to sample or census assume referring to sample. Ignore extraneous non-contradictory comments
Part (d):
AnswerMarks Guidance
Working/AnswerMark Guidance
\(P(X \leq 6) = 0.0172\) or \(P(X \leq 18) = 0.0212\) or \(P(X \geq 17) = 0.9788\) or \(X \leq 6\) or \(X \geq 18\)M1 For one of these probability statements correct or awrt 0.017 or awrt 0.021 or awrt 0.98 or one correct CR
\([P(X \leq 6)] = 0.0172\) and \([P(X \leq 18)] = 0.0212\)A1 For both probabilities awrt 0.0172 and awrt 0.0212
CR: \([0, 6] \cup X \geq 6\), \(6\), \(18\), \(X[\leq 30]\)A1 For both CR correct e.g. \(X < 7\), \(X > 17\); ignore symbol between two CR tails; allow any letter
Part (e):
AnswerMarks Guidance
Working/AnswerMark Guidance
20 is in the critical region therefore there is evidence that Jian's belief is incorrectB1ft For stating 20 is in the CR and give correct statement. Only ft if CR is in form \(X \leq C_1 \cup X \geq C_2\)
Part (f):
AnswerMarks Guidance
Working/AnswerMark Guidance
\(H_0: p = 0.4 \quad H_1: p < 0.4\)B1 For both hypotheses correct, using \(p\) or \(\pi\). Must be attached to \(H_0\) and \(H_1\)
\(J \sim B(150, 0.4) \Rightarrow \approx N(60, 36)\)M1A1 For writing or using \(N(60,...)\); for writing or using \(N(60, 36)\)
\(P\left(J \leq 47\right) \approx P\left(Z \leq \frac{47.5 - 60}{6}\right) = -2.08333...\)M1 For standardising (allow \(\pm\)) using "60" and "6" with either 46.5, 47 or 47.5
\(\frac{(n+0.5)-60}{6} = -1.6449\)M1 For CR method: \(n\), \(n+0.5\) or \(n-0.5\) and equate to \(-1.6449\) or better
\(= 0.0188\) (calc 0.018610...)A1 For awrt 0.019 or CR: \(J <\) awrt 49.6 or \(J + 0.5 <\) awrt 50.1
There is sufficient evidence that the proportion of shops using system incorrectly is less than 0.4/decreasedA1 Dep on previous A1 for correct conclusion in context using bold word. Do not allow 'number' for 'proportion'
# Question 3:

## Part (a):
| Working/Answer | Mark | Guidance |
|---|---|---|
| A list of all the **shops** | B1 | For the idea of a list/database of **all shops**; list of all stocktaking systems is B0 |

## Part (b):
| Working/Answer | Mark | Guidance |
|---|---|---|
| The shops | B1 | For allowing shop or store(s); the number of shops is B0; the stocktaking systems at each shop is B0 |

## Part (c):
| Working/Answer | Mark | Guidance |
|---|---|---|
| Advantage: A sample is quicker/cheaper/easier to process | B1 | For a correct advantage; e.g. allow census takes longer than a sample; 'a sample is more uncertain' on its own is B0 |
| Disadvantage: less accurate/may be biased/may not be representative | B1 | For a correct disadvantage e.g. a census is more accurate than a sample. If no reference to sample or census assume referring to sample. Ignore extraneous non-contradictory comments |

## Part (d):
| Working/Answer | Mark | Guidance |
|---|---|---|
| $P(X \leq 6) = 0.0172$ or $P(X \leq 18) = 0.0212$ or $P(X \geq 17) = 0.9788$ or $X \leq 6$ or $X \geq 18$ | M1 | For one of these probability statements correct or awrt 0.017 or awrt 0.021 or awrt 0.98 or one correct CR |
| $[P(X \leq 6)] = 0.0172$ **and** $[P(X \leq 18)] = 0.0212$ | A1 | For both probabilities awrt 0.0172 and awrt 0.0212 |
| CR: $[0, 6] \cup X \geq 6$, $6$, $18$, $X[\leq 30]$ | A1 | For both CR correct e.g. $X < 7$, $X > 17$; ignore symbol between two CR tails; allow any letter |

## Part (e):
| Working/Answer | Mark | Guidance |
|---|---|---|
| 20 is in the critical region therefore there is evidence that **Jian's belief** is incorrect | B1ft | For stating 20 is in the CR **and** give correct statement. Only ft if CR is in form $X \leq C_1 \cup X \geq C_2$ |

## Part (f):
| Working/Answer | Mark | Guidance |
|---|---|---|
| $H_0: p = 0.4 \quad H_1: p < 0.4$ | B1 | For both hypotheses correct, using $p$ or $\pi$. Must be attached to $H_0$ and $H_1$ |
| $J \sim B(150, 0.4) \Rightarrow \approx N(60, 36)$ | M1A1 | For writing or using $N(60,...)$; for writing or using $N(60, 36)$ |
| $P\left(J \leq 47\right) \approx P\left(Z \leq \frac{47.5 - 60}{6}\right) = -2.08333...$ | M1 | For standardising (allow $\pm$) using "60" and "6" with either 46.5, 47 or 47.5 |
| $\frac{(n+0.5)-60}{6} = -1.6449$ | M1 | For CR method: $n$, $n+0.5$ or $n-0.5$ and equate to $-1.6449$ or better |
| $= 0.0188$ (calc 0.018610...) | A1 | For awrt 0.019 or CR: $J <$ awrt 49.6 or $J + 0.5 <$ awrt 50.1 |
| There is sufficient evidence that the **proportion** of shops using system incorrectly **is less than 0.4/decreased** | A1 | Dep on previous A1 for correct conclusion in context using bold word. Do not allow 'number' for 'proportion' |

---
3 Jian owns a large group of shops. She decides to visit a random sample of the shops to check if the stocktaking system is being used incorrectly.
\begin{enumerate}[label=(\alph*)]
\item Suggest a suitable sampling frame for Jian to use.
\item Identify the sampling units.
\item Give one advantage and one disadvantage of taking a sample rather than a census.

Jian believes that the stocktaking system is being used incorrectly in $40 \%$ of the shops.\\
To investigate her belief, a random sample of 30 of the shops is taken.
\item Using a 5\% level of significance, find the critical region for a two-tailed test of Jian's belief.\\
You should state the probability in each tail, which should each be as close as possible to 2.5\%

The total number of shops, in the sample of 30, where the stocktaking system is being used incorrectly is 20
\item Using the critical region from part (d), state what this suggests about Jian's belief. Give a reason for your answer.

Jian introduces a new, simpler, stocktaking system to all the shops.\\
She takes a random sample of 150 shops and finds that in 47 of these shops the new stocktaking system is being used incorrectly.
\item Using a suitable approximation, test, at the $5 \%$ level of significance, whether or not there is evidence that the proportion of shops where the stocktaking system is being used incorrectly is now less than 0.4 You should state your hypotheses and show your working clearly.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S2 2024 Q3 [15]}}