| Exam Board | Edexcel |
|---|---|
| Module | S3 (Statistics 3) |
| Session | Specimen |
| Marks | 17 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Linear combinations of normal random variables |
| Type | Two-sample t-test (unknown variances) |
| Difficulty | Moderate -0.3 This is a standard S3 two-sample hypothesis test with straightforward calculations and textbook definitions. Parts (a)-(b) test basic sampling knowledge, part (c) is a routine z-test application with given summary statistics, and parts (d)-(g) require standard interpretations. The question involves multiple steps but no novel insight or complex reasoning—slightly easier than average due to its formulaic nature. |
| Spec | 2.01a Population and sample: terminology2.01c Sampling techniques: simple random, opportunity, etc2.01d Select/critique sampling: in context5.05c Hypothesis test: normal distribution for population mean |
| Mean score \(( \bar { x } )\) |
| |||
| Full-time staff | 52 | 21 | ||
| Part-time staff | 50 | 19 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Label full time staff 1-6000, part time staff 1-4000 | M1 | For attempt at labelling full-time and part-time staff. One set of correct numbers |
| Use random numbers to select | M1 | For mentioning use of random numbers |
| Simple random sample of 120 full time staff and 80 part time staff | A1 | For s.r.s. of 120 full-time and 80 part-time |
| (3) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Enables estimation of statistics/errors for each strata or "reduce variability" or "more representative" or "reflects population structure" NOT "more accurate" | B1 | |
| (1) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| \(H_0: \mu_f = \mu_p\), \(\quad H_1: \mu_f \neq \mu_p\) \(\quad\) (accept \(\mu_1, \mu_2\)) | B1 | |
| \(\text{s.e.} = \sqrt{\dfrac{21}{80} + \dfrac{19}{80}}\) | M1 | For attempt at s.e. - condone one number wrong. NB correct s.e. \(= \sqrt{\frac{1}{2}}\) |
| \(z = \dfrac{52-50}{\sqrt{\dfrac{21}{80}+\dfrac{19}{80}}} = (2\sqrt{2})\) | M1 | For using their s.e. in correct formula for test statistic. Must be \(\dfrac{\pm(52-50)}{\sqrt{\frac{p}{q}+\frac{r}{s}}}\) |
| \(= 2.828\ldots\) (awrt 2.83) | A1 | |
| Two tailed critical value \(z = 2.5758\) (or prob of awrt 0.002 (<0.005) or 0.004 (<0.01)) | B1 | |
| \([2.828 > 2.5758\) so] significant evidence to reject \(H_0\) | dM1 | Dependent on 2nd M1, for correct statement based on their normal cv and their test statistic |
| There is evidence of a difference in policy awareness between full time and part time staff | A1ft | For correct comment in context. Must mention "scores" or "policy awareness" and types of "staff". Award A0 for a one-tailed comment. Allow ft |
| (7) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Can use mean full time and mean part time | B1 | For mention of mean(s) or use of \(\bar{X}\), provided \(\bar{X}\) clearly refers to full-time or part-time |
| \(\sim\) Normal | B1 | For stating that distribution can be assumed normal. e.g. "mean score of the test is normally distributed" gets B1B1 |
| (2) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Have assumed \(s^2 = \sigma^2\) or variance of sample = variance of population | B1 | |
| (1) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| \(2.53 < 2.5758\), not significant or do not reject \(H_0\) | M1 | For correct statement (may be implied by correct contextualised comment) |
| So there is insufficient evidence of a difference in mean awareness | A1ft | For correct contextualised comment. Accept "no difference in mean scores". Allow ft |
| (2) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Training course has closed the gap between full time staff and part time staff's mean awareness of company policy. | B1 | For correct comment in context that implies training was effective. This must be supported by their (c) and (f). Condone one-tailed comment here. |
| (1) | ||
| Total: 17 |
# Question 7:
## Part (a)
| Answer | Mark | Guidance |
|--------|------|----------|
| Label full time staff 1-6000, part time staff 1-4000 | M1 | For attempt at labelling full-time and part-time staff. One set of correct numbers |
| Use random numbers to select | M1 | For mentioning use of random numbers |
| Simple random sample of 120 full time staff and 80 part time staff | A1 | For s.r.s. of 120 full-time and 80 part-time |
| | **(3)** | |
## Part (b)
| Answer | Mark | Guidance |
|--------|------|----------|
| Enables estimation of statistics/errors for each strata **or** "reduce variability" **or** "more representative" **or** "reflects population structure" **NOT** "more accurate" | B1 | |
| | **(1)** | |
## Part (c)
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0: \mu_f = \mu_p$, $\quad H_1: \mu_f \neq \mu_p$ $\quad$ (accept $\mu_1, \mu_2$) | B1 | |
| $\text{s.e.} = \sqrt{\dfrac{21}{80} + \dfrac{19}{80}}$ | M1 | For attempt at s.e. - condone one number wrong. NB correct s.e. $= \sqrt{\frac{1}{2}}$ |
| $z = \dfrac{52-50}{\sqrt{\dfrac{21}{80}+\dfrac{19}{80}}} = (2\sqrt{2})$ | M1 | For using their s.e. in correct formula for test statistic. Must be $\dfrac{\pm(52-50)}{\sqrt{\frac{p}{q}+\frac{r}{s}}}$ |
| $= 2.828\ldots$ (awrt **2.83**) | A1 | |
| Two tailed critical value $z = 2.5758$ (or prob of awrt 0.002 (<0.005) or 0.004 (<0.01)) | B1 | |
| $[2.828 > 2.5758$ so] significant evidence to reject $H_0$ | dM1 | Dependent on 2nd M1, for correct statement based on their normal cv and their test statistic |
| There is evidence of a difference in policy awareness between full time and part time staff | A1ft | For correct comment in context. Must mention "scores" or "policy awareness" and types of "staff". Award **A0** for a one-tailed comment. Allow ft |
| | **(7)** | |
## Part (d)
| Answer | Mark | Guidance |
|--------|------|----------|
| Can use mean full time and mean part time | B1 | For mention of mean(s) **or** use of $\bar{X}$, provided $\bar{X}$ clearly refers to full-time or part-time |
| $\sim$ Normal | B1 | For stating that distribution can be assumed normal. e.g. "mean score of the test is normally distributed" gets B1B1 |
| | **(2)** | |
## Part (e)
| Answer | Mark | Guidance |
|--------|------|----------|
| Have assumed $s^2 = \sigma^2$ or variance of sample = variance of population | B1 | |
| | **(1)** | |
## Part (f)
| Answer | Mark | Guidance |
|--------|------|----------|
| $2.53 < 2.5758$, not significant **or** do not reject $H_0$ | M1 | For correct statement (may be implied by correct contextualised comment) |
| So there is insufficient evidence of a difference in mean awareness | A1ft | For correct contextualised comment. Accept "no difference in mean scores". Allow ft |
| | **(2)** | |
## Part (g)
| Answer | Mark | Guidance |
|--------|------|----------|
| Training course has closed the gap between full time staff and part time staff's mean awareness of company policy. | B1 | For correct comment in context that implies training was effective. This must be supported by their (c) and (f). Condone one-tailed comment here. |
| | **(1)** | |
| | **Total: 17** | |
\begin{enumerate}
\item A large company surveyed its staff to investigate the awareness of company policy. The company employs 6000 full-time staff and 4000 part-time staff.\\
(a) Describe how a stratified sample of 200 staff could be taken.\\
(b) Explain an advantage of using a stratified sample rather than a simple random sample.
\end{enumerate}
A random sample of 80 full-time staff and an independent random sample of 80 part-time staff were given a test of policy awareness. The results are summarised in the table below.
\begin{center}
\begin{tabular}{ | c | c | c | }
\hline
& Mean score $( \bar { x } )$ & \begin{tabular}{ c }
Variance of \\
scores $\left( s ^ { 2 } \right)$ \\
\end{tabular} \\
\hline
Full-time staff & 52 & 21 \\
\hline
Part-time staff & 50 & 19 \\
\hline
\end{tabular}
\end{center}
(c) Stating your hypotheses clearly, test, at the $1 \%$ level of significance, whether or not the mean policy awareness scores for full-time and part-time staff are different.\\
(d) Explain the significance of the Central Limit Theorem to the test in part (c).\\
(e) State an assumption you have made in carrying out the test in part (c).
After all the staff had completed a training course the 80 full-time staff and the 80 part-time staff were given another test of policy awareness. The value of the test statistic $z$ was 2.53\\
(f) Comment on the awareness of company policy for the full-time and part-time staff in light of this result. Use a $1 \%$ level of significance.\\
(g) Interpret your answers to part (c) and part (f).
\hfill \mbox{\textit{Edexcel S3 Q7 [17]}}